Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Purchase individual online access for 1 year to this journal.
Price: EUR 315.00Impact Factor 2024: 1.7
The purpose of the Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology is to foster advancements of knowledge and help disseminate results concerning recent applications and case studies in the areas of fuzzy logic, intelligent systems, and web-based applications among working professionals and professionals in education and research, covering a broad cross-section of technical disciplines.
The journal will publish original articles on current and potential applications, case studies, and education in intelligent systems, fuzzy systems, and web-based systems for engineering and other technical fields in science and technology. The journal focuses on the disciplines of computer science, electrical engineering, manufacturing engineering, industrial engineering, chemical engineering, mechanical engineering, civil engineering, engineering management, bioengineering, and biomedical engineering. The scope of the journal also includes developing technologies in mathematics, operations research, technology management, the hard and soft sciences, and technical, social and environmental issues.
Authors: Zhao, Liang | Wang, Jiawei | Liu, Shipeng | Yang, Xiaoyan
Article Type: Research Article
Abstract: Tunnels water leakage detection in complex environments is difficult to detect the edge information due to the structural similarity between the region of water seepage and wet stains. In order to address the issue, this study proposes a model comprising a multilevel transformer encoder and an adaptive multitask decoder. The multilevel transformer encoder is a layered transformer to extract the multilevel characteristics of water leakage information, and the adaptive multitask decoder comprises the adaptive network branches. The adaptive network branches generate the ground truths of wet stains and water seepage through the threshold value and transmit them to the network …for training. The converged network, the U-net, fuses coarse images from the adaptive multitask decoder, and the fusion images are the final segmentation results of water leakage in tunnels. The experimental results indicate that the proposed model achieves 95.1% Dice and 90.4% MIOU, respectively. This proposed model demonstrates a superior level of precision and generalization when compared to other related models. Show more
Keywords: Water leakage, multilevel transformer encoder, adaptive multitask decoder, adaptive network branches, converged network
DOI: 10.3233/JIFS-224315
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2023
Authors: Luo, Binghui | Liu, Xin | Qin, Long | Jiao, Xiaolong | Li, Wengui
Article Type: Research Article
Abstract: The short text matching models can be roughly divided into representation-based and interaction-based approaches. However, current representation-based text matching models often lack the ability to handle sentence pairs and typically only perform feature interactions at the network’s top layer, which can lead to a loss of semantic focus. The interactive text matching model has significant shortcomings in extracting differential information between sentences and may ignore global information. To address these issues, this article proposes a model structure that combines a dual-tower architecture with an interactive component, which compensates for their respective weaknesses in extracting sentence semantic information. Simultaneously, a method …for integrating semantic information is proposed, enabling the model to capture both the interactive information between sentence pairs and the differential information between sentences, thereby addressing the issues with the aforementioned approaches. In the process of network training, a combination of cross-entropy and cosine similarity is used to calculate the model loss. The model is optimized to a stable state. Experiments on the commonly used datasets of QQP and MRPC validate the effectiveness of the proposed model, and its performance is stably improved. Show more
Keywords: Short text matching, representational structure, interactive structure, BERT, multi-angle information
DOI: 10.3233/JIFS-230268
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Diao, Xiu-Li | Zhang, Hao-Ran | Zeng, Qing-Tian | Song, Zheng-Guo | Zhao, Hua
Article Type: Research Article
Abstract: At present, the Chinese text field is facing challenges from low resource issues such as data scarcity and annotation difficulties. Moreover, in the domain of cigarette tasting, cigarette tasting texts tend to be colloquial, making it difficult to obtain valuable and high-quality tasting texts. Therefore, in this paper, we construct a cigarette tasting dataset (CT2023) and propose a novel Chinese text classification method based on ERNIE and Comparative Learning for Low-Resource scenarios (ECLLR). Firstly, to address the issues of limited vocabulary diversity and sparse features in cigarette tasting text, we utilize Term Frequency-Inverse Document Frequency (TF-IDF) to extract key terms, …supplementing the discriminative features of the original text. Secondly, ERNIE is employed to obtain sentence-level vector embedding of the text. Finally, contrastive learning model is used to further refine the text after fusing the keyword features, thereby enhancing the performance of the proposed text classification model. Experiments on the CT2023 dataset demonstrate an accuracy rate of 96.33% for the proposed method, surpassing the baseline model by at least 11 percentage points, and showing good text classification performance. It is thus clear that the proposed approach can effectively provide recommendations and decision support for cigarette production processes in tobacco companies. Show more
Keywords: Low-resource, Cigarette Tasting, Contrastive Learning, Text classification
DOI: 10.3233/JIFS-237816
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Ledesma Roque, Diana Anahí | Kolesnikova, Olga | Menchaca Méndez, Ricardo
Article Type: Research Article
Abstract: This study addresses the issue of semantic similarity in sentences using the BERT model through various aggregation techniques, such as max-pooling, mean-pooling, and an LSTM network applied to the output of the BERT model. Subsequently, the linguistic interpretability of the BERT-Base transformer model is analyzed through the unsupervised learning approach, specifically through dimensionality reduction using autoencoders and clustering algorithms, utilizing the representation of the classification token CLS. The results highlight that the CLS classification token achieves better abstractions than the proposed methods. In terms of interpretability, it is observed that sequence length is relevant in the early layers, with …a gradual decrease across the layers. Additionally, attention to semantic similarity is concentrated in the intermediate and upper layers, especially in layers 6, 8, 9, and 10. All these findings were obtained by addressing the semantic similarity task using the STS-Benchmark dataset. Show more
Keywords: Linguistic interpretability, aggregation methods, unsupervised learning, attention mechanisms, token CLS
DOI: 10.3233/JIFS-219359
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Cardoso-Moreno, Marco A. | Luján-García, Juan Eduardo | Yáñez-Márquez, Cornelio
Article Type: Research Article
Abstract: In this study, a thorough analysis of the proposed approach in the context of emotion classification using both single-modal (A-13sbj) and multi-modal (B-12sbj) sets from the YAAD dataset was conducted. This dataset encompassed 25 subjects exposed to audiovisual stimuli designed to induce seven distinct emotional states. Electrocardiogram (ECG) and galvanic skin response (GSR) biosignals were collected and classified using two deep learning models, BEC-1D and ELINA, along with two different preprocessing techniques, a classical fourier-based filtering and an Empirical Mode Decomposition (EMD) approach. For the single-modal set, this proposal achieved an accuracy of 84.43±30.03, precision of 85.16±28.91, and F1-score of …84.06±29.97. Moreover, in the extended configuration the model maintained strong performance, yielding scores of 80.95±22.55, 82.44±24.34, and 79.91±24.55, respectively. Notably, for the multi-modal set (B-12sbj), the best results were obtained with EMD preprocessing and the ELINA model. This proposal achieved an improved accuracy, precision, and F1-score scores of 98.02±3.78, 98.31±3.31, and 97.98±3.83, respectively, demonstrating the effectiveness of this approach in discerning emotional states from biosignals. Show more
Keywords: Emotion classification, signal preprocessing, convolutional neural network, ECG, GSR, EMD
DOI: 10.3233/JIFS-219334
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-9, 2024
Authors: Yigezu, Mesay Gemeda | Kolesnikova, Olga | Gelbukh, Alexander | Sidorov, Grigori
Article Type: Research Article
Abstract: The rise of social media and micro-blogging platforms has led to concerns about hate speech, its potential to incite violence, psychological trauma, extremist beliefs, and self-harm. We have proposed a novel model, Odio-BERT for detecting hate speech using a pretrained BERT language model. This specialized model is specifically designed for detecting hate speech in the Spanish language, and when compared to existing models, it consistently outperforms them. The study provides valuable insights into addressing hate speech in the Spanish language and explores the impact of domain tasks.
Keywords: BERT, hate speech, domain task, fine tune, Odio-BERT, Spanish
DOI: 10.3233/JIFS-219349
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Liang, Weijing | Xue, Ye | Xu, Jing
Article Type: Research Article
Abstract: With the increasing global disaster risks, constructing more inclusive, flexible, and resilient communities has become crucial for effectively carrying out disaster prevention, mitigation, and relief work. However, existing research on community resilience mostly focuses on the selection of key factors and the assessment of community resilience, lacking in-depth exploration of the interactions between factors and simulation studies of key paths. Therefore, this paper applies the Fuzzy Decision-Making Trial and Evaluation Laboratory (Fuzzy DEMATEL) method to select important factors of community resilience. Based on this, the maximum average difference entropy method is used to analyze the relationships and influence mechanisms among …different factors, thus identifying the key factors and key paths affecting community resilience. The Fuzzy Cognitive Map (FCM) is then used to simulate the paths. The study finds that factors of community resilience can be categorized as input, intermediary, and output types, and further analysis of their influence mechanisms reveals four key paths and four key factors. Through pathway simulation, different improvement states of community resilience are observed when triggering the input-type factors of the key paths. Therefore, under limited resources, a phased and systematic approach to enhancing community resilience should be adopted. The contribution of this study lies in providing a comprehensive analysis of factors and pathway selection methods, and through pathway simulation, it offers a scientific basis and decision support for improving and constructing community resilience in practice. Show more
Keywords: Fuzzy cognitive map, fuzzy DEMATEL, maximum average difference entropy method, community resilience, simulation analysis
DOI: 10.3233/JIFS-232234
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-23, 2024
Authors: Zhang, Shuguang | Xie, Chengyuan | Zhang, Heng | Gong, Wenzheng | Liu, Lingjie | Zhi, Xuntao
Article Type: Research Article
Abstract: Graph Convolutional Networks (GCN) are prevalent techniques in collaborative filtering recommendations. However, current GCN-based approaches for collaborative filtering recommendation have limitations in effectively embedding neighboring nodes during node and neighbor information aggregation. Furthermore, weight allocation for the user (or item) representations after convolution of each layer is too uniform. To resolve these limitations, we propose a new Graph Convolutional Collaborative Filtering recommendation method based on temporal information during the node aggregation process (TA-GCCF). The method aggregates and propagates information using Gated Recurrent Units, while dynamically updating features based on the timing and sequence of interactions between nodes and their neighbors. …Concurrently, we have developed a convolution attention coefficient to ascertain the significance of embedding at distinct layers. Experiments on three benchmark datasets show that our method significantly outperforms the comparison methods in the accuracy of prediction. Show more
Keywords: Graph convolutional neural network, collaborative filtering, recommendation, gated recurrent units, temporal information
DOI: 10.3233/JIFS-238307
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-10, 2024
Authors: Vela-Rincón, Virna V. | Mújica-Vargas, Dante | Luna-Álvarez, Antonio | Arenas Muñiz, Andrés Antonio | Cruz-Prospero, Luis A.
Article Type: Research Article
Abstract: Image segmentation is a very studied area, looking for the best clustering of pixels. However, it is sometimes a complicated task, especially when these pixels are at the edges of regions, where there is a gradient and it is difficult to decide to which region to assign it. Hesitating fuzzy sets (HFS) better describe these situations, allowing to have multiple possible values for each element, giving more flexibility. This type of sets has been mainly applied in decision-making problems, obtaining better results than other types of fuzzy sets. This research proposes a fast and automatic method based on fuzzy hesitant …clustering (FAHFC), which does not require parameters since it is capable of determining the number of clusters, using the Calinski-Harabasz index, in which the segmentation is performed, solving the initialization problem in clustering; it also proposes an alternative to construct the HFS through the use of fuzzy relations. The experiments show superiority in terms of clustering quality and convergence over some selected state-of-the-art algorithms. Show more
Keywords: Fuzzy clustering, hesitant fuzzy sets, image segmentation, Calinski-Harabasz index
DOI: 10.3233/JIFS-219370
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Milovanović, Vladimir | Aleksić, Aleksandar | Milenkov, Marjan | Sokolović, Vlada
Article Type: Research Article
Abstract: The paper aims to present a hybrid model for measuring the performance of business processes in complex organizations based on the subjective decision-making of expert teams. The subject of the research is finding ways to measure, analyze and improve the key performance indicators (KPIs) process. Obtaining the values of KPIs, which reflect the real state of the process, creates a basis for their ranking, i.e. insight into KPIs that are extremely important for the process as well as KPIs that are of lesser importance, but as such are not excluded from consideration because they are necessary for the beginning, realization …and completion of the process. The model was compiled through five phases and was tested through a case study in a real business organization, which deals with the maintenance of complex combat systems. The obtained results helped the management to take certain measures in order to improve the performance of the maintenance process. In the model, it is proposed to form two expert teams, which make assessments based on experience and express them in linguistic terms according to a predefined scale. Modeling of linguistic expressions is realized using intuitive fuzzy sets of a higher order, more precisely Fermatean fuzzy sets (FFS). Selecting KPIs, decomposing the process into sub-processes and assessing the relative importance of sub-processes is carried out by one team of experts, while another team carries out the assessment of KPIs at the level of each sub-process. Determining the relative importance of sub-processes is realized using the Delphi method extended to FFS while reaching a consensus. The measurement of process performance, i.e. the value of KPIs, is realized using Multi-Criteria Group Decision-Making (MCGDM), such as the ELECTRE method extended with FFS. The sensitivity analysis of the developed model is realized by uncertainty modeling with q-rung orthopair fuzzy sets. Show more
Keywords: Fermatean fuzzy set (FFSs), ELECTRE method, Delphi method, maintenance, performance
DOI: 10.3233/JIFS-238907
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
Authors: Xin, Ling
Article Type: Research Article
Abstract: In the era of digital economy, the optimization of enterprise supply chain networks has become a key challenge, while the problems of traditional supply chains, including information asymmetry and lack of trust, seriously hinder the development of enterprise supply chain networks. This paper will use the blockchain distributed technology and the digital economy background to explore how to use the blockchain distributed technology to optimize the existing problems. Firstly, study the supply chain information sharing to develop resources to reduce costs, then use the application of block chain technology and smart contract to establish information sharing mechanism to help the …supply chain information more transparent and improve trust; secondly, use the block chain technology decentralized storage model to realize the decentralized supply chain research, and finally use the consensus method to improve the privacy protection of information, to avoid information asymmetry among users. Through experiments, it could be found that the optimization method of enterprise supply chain network based on blockchain distributed technology had a traceability accuracy of over 92.35% for the extracted products, with an average traceability accuracy of 93.791% for 10 products. Research on the transparency of different supply chain information was above 89.73% . By utilizing blockchain distributed technology, information protection in enterprise supply chains could be effectively improved; trust mechanisms could be better established; risk control effectiveness could be improved; optimization of enterprise supply chain networks could be better assisted. Show more
Keywords: Enterprise supply chain network optimization, blockchain distributed technology, digital economy, smart contracts, deep learning
DOI: 10.3233/JIFS-234664
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Cruz, Eddy Sánchez-Dela | Fuentes-Ramos, Mirta | Loeza-Mejía, Cecilia-Irene | José-Guzmán, Irahan-Otoniel
Article Type: Research Article
Abstract: Purpose: Vaginal infections are prevalent causes of gynecological consultations. This study introduces and evaluates the efficacy of four Machine Learning algorithms in detecting vaginitis cases in southern Mexico. Methods: Utilizing Simple Perceptron, Naïve Bayes, CART, and AdaBoost, we conducted classification experiments to identify four vaginitis subtypes (gardnerella, candidiasis, trichomoniasis, and chlamydia) in 600 patient cases. Results: The outcomes are promising, with a majority achieving 100% accuracy in vaginitis identification. Conclusion: The successful implementation and high accuracy of these algorithms demonstrate their potential as valuable diagnostic tools for vaginal infections, particularly in southern Mexico. It …is crucial in a region where health technology adoption lags behind, and intelligent software support is limited in gynecological diagnoses. Show more
Keywords: Machine learning, gynecological pathologies, vaginitis, local dataset, correct identification
DOI: 10.3233/JIFS-219377
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Xie, Mengtong | Chai, Huaqi
Article Type: Research Article
Abstract: A human resources management plan is presently recognised as one of the most important components of a corporate technique. This is due to the fact that its major purpose is to interact with people, who are the most precious asset that an organisation has. It is impossible for an organisation to achieve its objectives without the participation of individuals. An organisation may effectively plan as well as manage individual processes to support the organization’s objectives and adapt nimbly to any change if it has well-prepared HR techniques and an action plan for its execution. This investigation puts up a fresh …way for the board of directors of a private firm to increase their assets and advance their growth by using cloud programming that is characterised by networks. The small company resource has been improved by strengthening human resource management techniques, and the cloud SDN network is used for job scheduling using Q-convolutional reinforcement recurrent learning. The proposed technique attained Quadratic normalized square error of 60%, existing SDN attained 55%, HRM attained 58% for Synthetic dataset; for Human resources dataset propsed technique attained Quadratic normalized square error of 62%, existing SDN attained 56%, HRM attained 59%; proposed technique attained Quadratic normalized square error of 64%, existing SDN attained 58%, HRM attained 59% for dataset. Show more
Keywords: Small business management, cloud software defined networks, human resource management, task scheduling, recurrent learning
DOI: 10.3233/JIFS-235379
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Cortés-Antonio, Prometeo | Valdez, Fevrier | Melin, Patricia | Castillo, Oscar
Article Type: Research Article
Abstract: The computing with words is an approach that has unique characteristics and advantages to model cognitive processes, this article explains the relationship and difference between type-1 and type-2 fuzzy sets in the definition of linguistic values. Here, we perform a compressive review and justify because type-2 sets are more appropriate in modeling linguistic values, and a heuristic procedure by examples is carried out to define linguistic values on a continuous variable. A visual comparison of a rule-based system, when linguistic values use crips, type-1, and type-2 fuzzy sets in modeling a cognitive system.
Keywords: Type-2 and type-1 fuzzy sets, linguistic values and variables, rule-based systems, cognitive computing
DOI: 10.3233/JIFS-219368
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Li, Jia | Xue, Shuaihao | Li, Minghui | Shi, Xiaoqiu
Article Type: Research Article
Abstract: Combining the harmony search algorithm (HS) with the local search algorithm (LS) can prevent the HS from falling into a local optimum. However, how LS affects the performance of HS has not yet been studied systematically. Therefore, in this paper, it is first proposed to combine four frequently used LS with HS to obtain several search algorithms (HSLSs). Then, by taking the flexible job-shop scheduling problem (FJSP) as an example and considering decoding times, study how the parameters of HSLSs affect their performance, where the performance is evaluated by the difference rate based on the decoding times. The simulation results …mainly show that (I) as the harmony memory size (HMS) gradually increases, the performance of HSLSs first increases rapidly and then tends to remain unchanged, and HMS is not the larger the better; (II) as harmony memory considering rate increases, the performance continues to improve, while the performance of pitch adjusting rate on HSLSs goes to the opposite; Finally, more benchmark instances are also used to verify the effectiveness of the proposed algorithms. The results of this paper have a certain guiding significance on how to choose LS and other parameters to improve HS for solving FJSP. Show more
Keywords: Algorithm analysis, local search, harmony search, flexible job-shop scheduling problem
DOI: 10.3233/JIFS-239142
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Yue, Lizhu | Lv, Yue
Article Type: Research Article
Abstract: The Vlsekriterijumska Optimizacija I Komprosmisno Resenie (VIKOR) method to some extent modifies the utility function to a value function that can consider different risk preferences. However, the weight and risk attitude parameters involved in the model are difficult to determine, which limits its application. To overcome this problem, a Poset-VIKOR model is proposed. A partial order set is a non-parametric decision-making method. Through the combination of partial order set and VIKOR model, the parameters can be “eliminated”, and a robust method that can run the model is obtained. This method uses the Hasse diagram to express the evaluation results, which …can not only directly display the hierarchical and clustering information, but also show the robustness characteristics of the alternative comparison. Show more
Keywords: VIKOR method, poset, weight, multiple attribute decision making
DOI: 10.3233/JIFS-230680
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
Authors: He, Xiaorong | Fang, Anran | Yu, Dejian
Article Type: Research Article
Abstract: Electronic commerce (EC) has become the most critical business activity in the world. China has become the world’s largest market for EC. Over the past three decades, numerous researches have examined the current status of the development of monolingual EC research in specific scenarios. However, the paradigm shift in EC development through the analysis of the dynamic evolution of semantic information has not yet been examined, and the distinctions and connections between multilingual EC studies have not yet been established. This study analyzed 16,207 English and 17,850 Chinese EC-related articles from the Web of Science database and CNKI by combining …the BERTopic topic model and SBERT sentence embedding-based similarity computations. The results reveal the distributions of global and local topics in the English and Chinese EC literature, analyze the semantic intricacies of topic convergence and evolution across continuous time, as well as the distinctions and connections between English and Chinese topics. Finally, the evolutionary patterns and life cycle of three crucial English and Chinese topics are explored respectively, including their emergence, development, maturity, and decline. Overall, this study provides a comprehensive overview of EC studies from a topic perspective. Show more
Keywords: Electronic commerce, BERTopic, topic modeling, topic evolution, sentence embedding
DOI: 10.3233/JIFS-232825
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-22, 2024
Article Type: Research Article
Abstract: Background: Breast cancer diagnosis relies on accurate lesion segmentation in medical images. Automated computer-aided diagnosis reduces clinician workload and improves efficiency, but existing image segmentation methods face challenges in model performance and generalization. Objective: This study aims to develop a generative framework using a denoising diffusion model for efficient and accurate breast cancer lesion segmentation in medical images. Methods: We design a novel generative framework, PalScDiff, that leverages a denoising diffusion probabilistic model to reconstruct the label distribution for medical images, thereby enabling the sampling of diverse, plausible segmentation outcomes. Specifically, with the …condition of the corresponding image, PalScDiff learns to estimate the masses region probability through denoising step by step. Furthermore, we design a Progressive Augmentation Learning strategy to incrementally handle segmentation challenges of irregular and blurred tumors. Moreover, multi-round sampling is employed to achieve robust breast mass segmentation. Results: Our experimental results show that PalScDiff outperforms established models such as U-Net and transformer-based alternatives, achieving an accuracy of 95.15%, precision of 79.74%, Dice coefficient of 77.61%, and Intersection over Union (IOU) of 81.51% . Conclusion: The proposed model demonstrates promising capabilities for accurate and efficient computer-aided segmentation of breast cancer. Show more
Keywords: Diffusion model, consistent regularization, breast cancer, medical image segmentation, data augmentation
DOI: 10.3233/JIFS-239703
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Ou, Qiqi | Zhang, Xiaohong | Wang, Jingqian
Article Type: Research Article
Abstract: Fuzzy rough sets (FRSs) play a significant role in the field of data analysis, and one of the common methods for constructing FRSs is the use of the fuzzy logic operators. To further extend FRSs theory to more diverse information backgrounds, this article proposes a covering variable precision fuzzy rough set model based on overlap functions and fuzzy β-neighbourhood operators (OCVPFRS). Some necessary properties of OCVPFRS have also been studied in this work. Furthermore, multi-label classification is a prevalent task in the realm of machine learning. Each object (sample or instance) in multi-label data is associated with various labels (classes), …and there are numerous features or attributes that need to be taken into account within the attribute space. To enhance various performance metrics in the multi-label classification task, attribute reduction is an essential pre-processing step. Therefore, according to overlap functions and fuzzy rough sets’ excellent work on applications: such as image processing and multi-criteria decision-making, we establish an attribute reduction method suitable for multi-label data based on OCVPFRS. Through a series of experiments and comparative analysis with existing multi-label attribute reduction methods, the effectiveness and superiority of the proposed method have been verified. Show more
Keywords: Fuzzy rough sets, overlap functions, fuzzy β-neighbourhood operators, attribute reduction, multi-label classification
DOI: 10.3233/JIFS-238245
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-19, 2024
Authors: Embriz-Islas, Cesar | Benavides-Alvarez, Cesar | Avilés-Cruz, Carlos | Zúñiga-López, Arturo | Ferreyra-Ramírez, Andrés | Rodríguez-Martínez, Eduardo
Article Type: Research Article
Abstract: Speech recognition with visual context is a technique that uses digital image processing to detect lip movements within the frames of a video to predict the words uttered by a speaker. Although models with excellent results already exist, most of them are focused on very controlled environments with few speaker interactions. In this work, a new implementation of a model based on Convolutional Neural Networks (CNN) is proposed, taking into account image frames and three models of audio usage throughout spectrograms. The results obtained are very encouraging in the field of automatic speech recognition.
Keywords: CNN, artificial intelligence, deep learning, speech recognition
DOI: 10.3233/JIFS-219346
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Zavala-Díaz, Jonathan | Olivares-Rojas, Juan C. | Gutiérrez-Gnecchi, José A. | Téllez-Anguiano, Adriana C. | Alcaraz-Chávez, J. Eduardo | Reyes-Archundia, Enrique
Article Type: Research Article
Abstract: Efficient medical information management is essential in today’s healthcare, significantly to automate diagnoses of chronic diseases. This study focuses on the automated identification of diabetic patients through a clinical note classification system. This innovative approach combines rules, information extraction, and machine learning algorithms to promise greater accuracy and adaptability. Initially, the four algorithms evaluated showed similar performance, with Gradient Boosting standing out with an accuracy of 0.999. They were tested on our clinical and oncology notes, where SVM excelled in correctly labeling non-oncology notes with a 0.99. Gradient Boosting had the best average with 0.966. The combination of rules, information …extraction, and Random Forest provided the best average performance, significantly improving the classification of clinical notes and reducing the margin of error in identifying diabetic patients. The principal contribution of this research lies in the pioneering integration of rule-based methods, information extraction techniques, and machine learning algorithms for enhanced accuracy in diabetic patient identification. For future work, we consider implementing these algorithms in natural clinical settings to evaluate their practical performance. Additionally, additional approaches will be explored to improve the accuracy and applicability of clinical note-grading systems in healthcare. Show more
Keywords: NLP, diabetes, machine learning, binary classification, word frequency analysis
DOI: 10.3233/JIFS-219375
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Martinez, German | Duta, Eduard-Andrei | Sanchez-Romero, Jose-Luis | Jimeno-Morenilla, Antonio | Mora-Mora, Higinio
Article Type: Research Article
Abstract: Within various industrial settings, such as shipping, aeronautics, woodworking, and footwear, there exists a significant challenge: optimizing the extraction of sections from material sheets, a process known as “nesting”, to minimize wasted surface area. This paper investigates efficient solutions to complex nesting problems, emphasizing rapid computation over ultimate precision. We introduce a dual-approach methodology that couples both a greedy technique and a genetic algorithm. The genetic algorithm is instrumental in determining the optimal sequence for placing sections, ensuring each is located in its current best position. A specialized representation system is devised for both the sections and the material sheet, …promoting streamlined computation and tangible results. By balancing speed and accuracy, this study offers robust solutions for real-world nesting challenges within a reduced computational timeframe. Show more
Keywords: Genetic algorithm, 2D nesting, irregular pattern, cutting, industrial automation
DOI: 10.3233/JIFS-219345
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Ling, Lina | Wen, Mi | Wang, Haizhou | Zhu, Zhou | Meng, Xiangjie
Article Type: Research Article
Abstract: The detection of out-of-distribution (OoD) samples in semantic segmentation is crucial for autonomous driving, as deep learning models are typically trained under the assumption of a closed environment, whereas the real world presents an open and diverse set of scenarios. Existing methods employ uncertainty estimation, image reconstruction, and other techniques for OoD sample detection. We have observed that different classes may exhibit connections and associations in varying contexts. For example, objects encountered by autonomous vehicles differ in rural road scenes compared to urban environments, and the likelihood of encountering novel objects varies. This aspect is missing in current anomaly detection …methods and is vital for OoD sample detection. Existing approaches solely consider the relative significance of each prediction class, overlooking the inter-object correlation. Although prediction scores (e.g., max logits) obtained from the segmentation network are applicable for OoD sample detection, the same problem persists, particularly for OoD objects. To address this issue, we propose the utilization of the Mahalanobis distance of max logits to evaluate the final predicted score. By calculating the Mahalanobis distance, the paper aims to uncover correlations between different classes, thus enhancing the effectiveness of OoD detection. To this end, we also extend the state-of-the-art segmentation model, DeepLabV3+, to enable OoD sample detection in this paper. Specifically, this paper proposes a novel backbone network, SOD-ResNet101, for extracting contextual and multi-scale semantic information, leveraging the class correlation feature of the Mahalanobis distance to enhance the detection performance of out-of-distribution objects. Notably, our approach eliminates the need for external datasets or separate network training, making it highly applicable to existing pretraining segmentation models. Show more
Keywords: Semantic segmentation, deep learning, anomaly segmentation, automatic driving, out-of-distribution detection
DOI: 10.3233/JIFS-237799
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Kumar Sahu, Vinay | Pandey, Dhirendra | Singh, Priyanka | Haque Ansari, Md Shamsul | Khan, Asif | Varish, Naushad | Khan, Mohd Waris
Article Type: Research Article
Abstract: The Internet of Things (IoT) strategy enables physical objects to easily produce, receive, and exchange data. IoT devices are getting more common in our daily lives, with diverse applications ranging from consumer sector to industrial and commercial systems. The rapid expansion and widespread use of IoT devices highlight the critical significance of solid and effective cybersecurity standards across the device development life cycle. Therefore, if vulnerability is exploited directly affects the IoT device and the applications. In this paper we investigated and assessed the various real-world critical IoT attacks/vulnerabilities that have affected IoT deployed in the commercial, industrial and consumer …sectors since 2010. Subsequently, we evoke the vulnerabilities or type of attack, exploitation techniques, compromised security factors, intensity of vulnerability and impacts of the expounded real-world attacks/vulnerabilities. We first categorise how each attack affects information security parameters, and then we provide a taxonomy based on the security factors that are affected. Next, we perform a risk assessment of the security parameters that are encountered, using two well-known multi-criteria decision-making (MCDM) techniques namely Fuzzy-Analytic Hierarchy Process (F-AHP) and Fuzzy-Analytic Network Process (F-ANP) to determine the severity of severely impacted information security measures. Show more
Keywords: IoT attacks, fuzzy-ANP, fuzzy-AHP, MCDM, IoT vulnerabilities
DOI: 10.3233/JIFS-233759
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Bochkarev, Vladimir V. | Savinkov, Andrey V. | Shevlyakova, Anna V. | Solovyev, Valery D.
Article Type: Research Article
Abstract: This work considers implementation of a diachronic predictor of valence, arousal and dominance ratings of English words. The estimation of affective ratings is based on data on word co-occurrence statistics in the large diachronic Google Books Ngram corpus. Affective ratings from the NRC VAD dictionary are used as target values for training. When tested on synchronic data, the obtained Pearson‘s correlation coefficients between human affective ratings and their machine ratings are 0.843, 0.779 and 0.792 for valence, aroused and dominance, respectively. We also provide a detailed analysis of the accuracy of the predictor on diachronic data. The main result of …the work is creation of a diachronic affective dictionary of English words. Several examples are considered that illustrate jumps in the time series of affective ratings when a word gains a new meaning. This indicates that changes in affective ratings can serve as markers of lexical-semantic changes. Show more
Keywords: Affective words, affective norms, sentiment dictionary, word valence ratings, lexical semantic change
DOI: 10.3233/JIFS-219358
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Zhang, Yingmin | Yi, Afa | Li, Shuo
Article Type: Research Article
Abstract: The constant development and application of new technologies, such as big data, artificial intelligence and the mobile Internet, have profoundly changed the personal and professional spheres. Despite these advances, finance professionals are still faced with a multitude of routine, repetitive and error-prone tasks. At the same time, they are challenged by the shift to management accounting, resulting in reduced productivity. This paper addresses these issues by introducing a financial statement filing robot developed using Robotic Process Automation (RPA) technology. The application of this robot has been shown to provide superior efficiency and accuracy, reduce the heavy burden of routine tasks, …and facilitate a smooth transition to management accounting practices. In addition, this research provides a valuable reference for the application and diffusion of RPA technology in the financial sector. Given the large amount of text data generated by financial processes, this paper proposes an automatic text categorization model. The effectiveness of the model is demonstrated as a response to address the challenges encountered in the consultation and archiving process. This contribution informs the development of text categorization robots tailored to the needs of finance professionals. Show more
Keywords: RPA technology, robot, financial statements, text classification, naive Bayes classifier model
DOI: 10.3233/JIFS-236716
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-10, 2024
Authors: Jun, Dai | Huijie, Shi | Yanqin, Li | Junwei, Zhao | Naohiko, Hanajima
Article Type: Research Article
Abstract: Cylinder liner is an internal part of the automobile engine, which plays an important role in the automobile internal combustion engine. Therefore, it is a top priority to accurately and quickly detect the cylinder liner surface defects. In order to effectively achieve the classification and localization of surface defects on the cylinder liner, this paper establishes a dataset for surface defects on cylinder liner and proposes a based on improved YOLOv5 algorithm for detecting surface defects on cylinder liner. Firstly, a machine vision system is established to acquire on-site images and perform manual annotation to build the dataset of surface …defects on cylinder liner. Secondly, the GSConv SlimNeck mechanism is introduced to reduce the model complexity; the Bi-directional Feature Pyramid Network (BiFPN) is used to fuse the feature information at different scales to enhance the detection accuracy of small surface defects on cylinder liner; and embedding the SimAM attention mechanism to focus on the object region of interest and improve the accuracy and robustness of the model. The final improved YOLOv5 model reduces the number of model parameters by 15.8% compared to the non-improved YOLOv5. And the experimental results on our self-built dataset for cylinder liner defects show that the mAP0.5 is improved by 0.4%. This means that the accuracy of model detection was not compromised. This method can be applied to actual production processes. Show more
Keywords: Cylinder liner defect detection, YOLOv5, GSConv SlimNeck, BiFPN, SimAM
DOI: 10.3233/JIFS-237793
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Chen, Xinying | Hu, Mingjie
Article Type: Research Article
Abstract: With the rapid proliferation of substantial textual data from sources such as social media, online comments, and news articles, sentiment analysis has become increasingly crucial. However, existing deep learning methods have overlooked the significance of part-of-speech (POS) and emotional words in understanding the emotion of text. Based on this, this paper proposes a sentiment analysis approach that combines multiple features with a dual-channel network. Firstly, the vector representation of the text is obtained through Robustly Optimized BERT Pretraining Approach (RoBERTa). Secondly, the POS features and word emotional features are separately updated using self-attention to calculate weights. Concatenating words, POS and …emotion, feature dimension reduction and fusion are achieved through a linear layer. Finally, the fused feature vector is input into a dual-channel network composed of Bidirectional Gated Recurrent Unit (BiGRU) and Deep Pyramid Convolutional Neural Network (DPCNN). Experimental results demonstrate that the proposed method achieves higher classification accuracy than the comparative methods on three sentiment analysis datasets. Moreover, the experimental results fully validate the effectiveness of the proposed approach. Show more
Keywords: Sentiment analysis, part-of-speech, RoBERTa, bidirectional gated recurrent unit, deep pyramid convolutional neural network
DOI: 10.3233/JIFS-237749
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Gowri, S. | Vennila, B. | Antony Crispin Sweety, C.
Article Type: Research Article
Abstract: The primary focus of this work is to develop the concept of bipolar N-neutrosophic supra topological spaces. Also, extended some concepts such as closure and interior operators of N-neutrosophic supra topological spaces to Bipolar N-neutrosophic supra topological spaces. The properties and relationship between weak forms of bipolar N-neutrosophic supra topological open sets are also established. Further, suggested several separations amongst bipolar N-neutrosophic supra sets. Some distance between bipolar N-neutrosophic sets is introduced and an efficient approachfor group multi-criteria decision making based on bipolar N-neutrosophic sets is proposed.
Keywords: Bipolar N-neutrosophic supra topology, bipolar N-neutrosophic supra α-open set, bipolar N-neutrosophic supra semi-open, bipolar N-neutrosophic supra β-open and bipolar N-neutrosophic supra pre-open, N-valued interval neutrosophic sets
DOI: 10.3233/JIFS-224450
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Vallejos, Sebastian | Armentano, Marcelo G. | Berdun, Luis | Schiaffino, Silvia | González Císaro, Sandra | Nigro, Oscar | Balduzzi, Leonardo | Cuesta, Ignacio
Article Type: Research Article
Abstract: Product classification is a critical task for the smooth running of the purchase process in e-commerce websites. When it comes to P2P marketplaces, users can act both as sellers and as buyers, and they need to assign predefined categories to the products they want to sell. Besides being tedious for users, this task can result in ambiguous or inaccurate assignments. This article presents a method for the automatic categorization of items offered in a local P2P marketplace using a multi-level classification approach. Our experiments demonstrated a significant improvement in the classification results of the proposed solution compared to a traditional …direct classification approach. Show more
Keywords: Classification, e-commerce, NLP, P2P marketplace
DOI: 10.3233/JIFS-219344
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Brännström, Andreas | Nieves, Juan Carlos
Article Type: Research Article
Abstract: This paper introduces an automated decision-making framework for providing controlled agent behavior in systems dealing with human behavior-change. Controlled behavior in such settings is important in order to reduce unexpected side-effects of a system’s actions. The general structure of the framework is based on a psychological theory, the Theory of Planned Behavior (TPB), capturing causes to human motivational states, which enables reasoning about dynamics of human motivation. The framework consists of two main components: 1) an ontological knowledge-base that models an individual’s behavioral challenges to infer motivation states and 2) a transition system that, in a given motivation state, decides …on motivational support, resulting in transitions between motivational states. The system generates plans (sequences of actions) for an agent to facilitate behavior change. A particular use-case is modeled regarding children with Autism Spectrum Conditions (ASC) who commonly experience difficulties in everyday social situations. An evaluation of a proof-of-concept prototype is performed that presents consistencies between ASC experts’ suggestions and plans generated by the system. Show more
Keywords: Interactive agents, strategic decision-making, behavior-change systems, theory of planned behavior, Autism
DOI: 10.3233/JIFS-219335
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: López-López, Aurelio | Garcıa-Gorrostieta, Jesús Miguel | González-López, Samuel
Article Type: Research Article
Abstract: Emotion detection in educational dialogues, particularly within student-teacher interactions, has become a crucial research area for improving the learning experience. In this paper, we employ two models, one generic Bidirectional Encoder Representations from Transformers (BERT) and the Emotion detection model Robustly Optimized BERT Approach (EmoRoBERTa), to automatically classify emotions in a corpus of student-teacher chat interactions. Then subsequently, we validate these classifications using a scheme based on oracles, employing two generative large language models (ChatGPT and Bard). Experiments on emotion detection in dialogues between students and teachers revealed that EmoRoBERTa exhibited a reasonable level of agreement with the oracles, while …ChatGPT demonstrated the highest consistency with EmoRoBERTa’s predictions. Furthermore, we identified the impact of specific words on emotion classification, offering insights into the decision-making process of these models. The results not only highlight the prominent presence of emotions like approval, gratitude, curiosity, disapproval, amusement, confusion, remorse, joy , and surprise but also provide substantial support for the utilization of the proposed emotion detection model to enhance the student learning environment. Exploring the emotional aspects of educational dialogues holds the potential to enhance instruction methods, provide timely assistance to students in need, and create an improved learning atmosphere. Show more
Keywords: Emotion detection, learning interaction, transfer learning, large language models, active learning
DOI: 10.3233/JIFS-219340
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-11, 2024
Authors: Shi, Xiaolong | Kosari, Saeed | Rangasamy, Parvathi | Nivedhaa, R.K. | Rashmanlou, Hossein
Article Type: Research Article
Abstract: Modern image processing techniques are improving beyond old methods, which include advanced approaches, for example deep learning. Convolutional Neural Networks (CNNs) are excellent at automatic feature extraction, whereas Generative Adversarial Networks (GANs) produce realistic images. Transfer learning uses pre-trained models, whereas semantic segmentation identifies pixels in images. Super-resolution, style transfer, and attention mechanisms can increase the quality of images and understanding. Adversarial defenses address purposeful manipulations, while 3D image processing handles three-dimensional data. These advancements make use of improved computational power and massive datasets to revolutionize image processing capabilities. Traditional image processing algorithms frequently fail to handle the complex and …multidimensional structure of color images, particularly when dealing with uncertainty and imprecision. In this study, the 3D-EIFIM frame work is extented and scaled aggregation operations 3D-EIFIM tailored for image data are proposed. By representing each pixel as an entry of 3D-EIFIM and applying aggregation techniques to enable more effective image analysis, manipulation, and enhancement. The practical implications of this research are significant, as it can lead to advancements in fields such as computer vision, medical imaging, and remote sensing. Show more
Keywords: IFP, conjunction, disjunction, IFIM, EIFIM, 3D-IFIM, 3D-EIFIM
DOI: 10.3233/JIFS-238252
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
Authors: Vimala, S. | Valarmathi, K.
Article Type: Research Article
Abstract: This study proposes a novel method using hybrid CNN-LSTM networks to measure and predict the effectiveness of speech and vision therapy. Traditional methods for evaluating therapy often rely on subjective assessments, lacking precision and efficiency. By combining CNN for visual data and MFCC for speech, alongside LSTM for temporal dependencies, the system captures dynamic changes in patients’ conditions. Pre-processing of audio and visual data enhances accuracy, and the model’s performance outperforms existing methods. This approach exhibits the potential of deep learning in monitoring patient progress effectively in speech and vision therapy, offering valuable insights for improving treatment outcomes. The proposed …system’s effectiveness is assessed by various performance metrics. The suggested system’s results are compared with those of other methods already in use. The study’s findings indicate that the suggested approach is more accurate than other existing models. In conclusion, this study offers important new information on how deep learning methods are being used to track patients’ progress in speech and vision therapy. Show more
Keywords: Monitor, speech and vision, deep learning, therapy patient, recording device, CNN-LSTM, categorization
DOI: 10.3233/JIFS-237363
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
Authors: Ravi, Vinayakumar
Article Type: Research Article
Abstract: Deep learning-based models are employed in computer-aided diagnosis (CAD) tools development for pediatric pneumonia (P-Pneumonia) detection. The accuracy of the model depends on the scaling of the deep learning model. A survey on deep learning shows that models with a greater number of layers achieve better performances for P-Pneumonia detection. However, the identification of the optimal models is considered to be important work for P-Pneumonia detection. This work presents a hybrid deep learning model for P-Pneumonia detection. The model leverages the EfficientNetV2 model that employs various advanced methodologies to maintain the balance between the model scaling and the performance of …the model in P-Pneumonia detection. The features of EfficientNetV2 models are passed into global weighted average pooling (GWAP) which acts like an attention layer. It helps to extract the important features that point to the infected regions of the radiography image and discard all the unimportant information. The features from GWAP are high in dimension and using kernel-based principal component analysis (K-PCA), the features were reduced. Next, the reduced features are combined together and passed into a stacked classifier. The stacked classifier is a two-stage approach in which the first stage employs a support vector machine (SVM) and random forest tree (RFT) for the prediction of P-Pneumonia using the fused features and logistic regression (LRegr) on values of prediction for classification. Detailed experiments were done for the proposed method in P-Pneumonia detection using publically available benchmark datasets. Various settings in the experimental analysis are done to identify the best model. The proposed model outperformed the other methods by improving the accuracy by 4% in P-Pneumonia detection. To show that the proposed model is robust, the model performances were shown on the completely unseen dataset of P-Pneumonia. The hybrid deep learning-based P-Pneumonia model showed good performance on completely unseen data samples of P-Pneumonia patients. The generalization of the proposed P-Pneumonia model is studied by evaluating the model on similar lung diseases such as COVID-19 (CV-19) and Tuberculosis (TBS). In all the experiments, the P-Pneumonia model has shown good performances on similar lung diseases. This indicates that the model is robust and generalizable on data samples of different patients with similar lung diseases. The P-Pneumonia models can be used in healthcare and clinical environments to assist doctors and healthcare professionals in improving the detection rate of P-Pneumonia. Show more
Keywords: Pediatric pneumonia, machine learning, deep learning, dimensionality reduction, feature fusion
DOI: 10.3233/JIFS-219397
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-18, 2024
Authors: Vaikunta Pai, T. | Nethravathi, P.S. | Birau, Ramona | Popescu, Virgil | Karthik Pai, B.H. | Naik, Pramod Vishnu
Article Type: Research Article
Abstract: Multimodal conversational AI systems have gained significant attention due to their potential to enhance user experience and enable more interactive and engaging interactions. This vital and complex research field seeks to integrate diverse modalities, including text, images, and speech, to develop conversational AI systems capable of comprehending, perceiving, and generating responses within a multimodal framework. By seamlessly incorporating various modalities, these systems can provide a more comprehensive and immersive conversational experience, enabling users to communicate in a more natural and intuitively. This research presents a novel multimodal architecture empowered by Deep Neural Networks (DNNs) for simultaneous integration and processing of …diverse modalities. Multimodal data encompasses various sources like text, images, audio, video, or sensor data. The objective is to merge and harness information from these modalities to amplify learning and enhance performance across a spectrum of tasks. This research explores the extension of ChatGPT, a state-of-the-art conversational AI model, to handle multimodal inputs, including text and images or text and speech. We present a comprehensive analysis of the benefits and challenges of integrating various options into ChatGPT, examining their impact on understanding, interaction, and overall system performance. Through extensive experimentation and evaluation, we demonstrate the potential of multimodal ChatGPT to provide richer, more context-aware conversations, while also highlighting the existing limitations and open research questions in this evolving field. Multimodal ChatGPT outperform the current GPT-3.5 by 16.51% and it is clear that multimodal ChatGPTis capable of better performance and offer a pathway for further progress in the field of language models. Show more
Keywords: Large language model, generative pre-trained transformer, deep learning, State-Of-The-Art (SOTA), artificial intelligence (AI), reinforcement training from human feedback, natural language processing (NLP), convolutional neural networks (CNN), recurrent neural networks (RNN)
DOI: 10.3233/JIFS-239465
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
Authors: Li, Ye | Zhou, Jingkang
Article Type: Research Article
Abstract: Semi-supervised learning (SSL) aims to reduce reliance on labeled data. Achieving high performance often requires more complex algorithms, therefore, generic SSL algorithms are less effective when it comes to image classification tasks. In this study, we propose ComMatch, a simpler and more effective algorithm that combines negative learning, dynamic thresholding, and predictive stability discriminations into the consistency regularization approach. The introduction of negative learning is to help facilitate training by selecting negative pseudo-labels during stages when the network has low confidence. And ComMatch filters positive and negative pseudo-labels more accurately as training progresses by dynamic thresholds. Since high confidence does …not always mean high accuracy due to network calibration issues, we also introduce network predictive stability, which filters out samples by comparing the standard deviation of the network output with a set threshold, thus largely reducing the influence of noise in the training process. ComMatch significantly outperforms existing algorithms over several datasets, especially when there is less labeled data available. For example, ComMatch achieves 1.82% and 3.6% error rate reduction over FlexMatch and FixMatch on CIFAR-10 with 40 labels respectively. And with 4000 labeled samples, ComMatch achieves 0.54% and 2.65% lower error rates than FixMatch and MixMatch, respectively. Show more
Keywords: Semi-supervised learning, negative learning, dynamic threshold, predictive stability
DOI: 10.3233/JIFS-233940
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Sun, Haobin | Chen, Bingsan | Zhang, Wenshui | Wei, Songma | Lian, Changwei
Article Type: Research Article
Abstract: In the process of production, the label on the product provides the basic product information. Due to the complex text contained on the product labels, the high accuracy recognition for online production labels has always been a challenging problem. To address this issue, a more effective method for complex text detection by improving the convolutional recurrent neural network has been proposed to enhance the recognition accuracy of complex text. Firstly, the SE-DenseNet feature extraction network has been introduced for feature extraction, aiming to improve the model’s depth and feature extraction capacity. Then, the Bi-GRU network is utilized to learn and …model the hidden states and spatial features extracted by SE-DenseNet, anticipate preliminary sequence results, reduce model parameters, and improve the model’s calculation performance. Finally, the CTC network is employed for transcription to convert each feature sequence prediction output by Bi-GRU into a label sequence, achieving complex text recognition. Experimental results on the SVT, IIIT-5K, ICDAR2013 public dataset, and a self-built dataset demonstrate that the proposed model achieves superior outcomes on both public and self-built datasets. Remarkably, the model exhibits the highest recognition accuracy of 93.2% on the ICDAR2013 public dataset, demonstrating its potential to support complex text recognition for online production labels. Show more
Keywords: Online production labels, complex text recognition, SE-DenseNet, Bi-GRU
DOI: 10.3233/JIFS-234748
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Lv, Zhangwei
Article Type: Research Article
Abstract: In the context of China’s cultural and tourism industry, cultural equipment plays a critical role in cultural dissemination, especially in remote areas with harsh road conditions and unique environmental factors. However, the efficiency and stability of manual analysis are significantly challenged by these conditions and the vast yet sparsely collected monitoring data. This study aims to develop a method for extracting valuable information from monitoring data to assess the health status of cultural equipment. We introduce a deep learning-based algorithm that leverages convolutional neural networks (CNNs) to extract local features from multidimensional monitoring indicators and long short-term memory (LSTM) networks …to capture time series features, facilitating the classification of cultural equipment’s health status. The algorithm’s effectiveness is demonstrated through simulation results, highlighting its practicality and applicability in real-world scenarios. This research not only provides a novel approach for cultural equipment health assessment but also contributes significantly to the field by addressing the challenges of data analysis in complex environments, underscoring the importance of technological advancements in preserving cultural heritage. Show more
Keywords: Environmental evaluation, convolutional neural network, long short term memory, health status
DOI: 10.3233/JIFS-241607
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Shamma, Aashitha L. | Vekkot, Susmitha | Gupta, Deepa | Zakariah, Mohammed | Alotaibi, Yousef Ajami
Article Type: Research Article
Abstract: This paper investigates the potential of COVID-19 detection using cough, breathing, and voice patterns. Speech-based features, such as MFCC, zero crossing rate, spectral centroid, spectral bandwidth, and chroma STFT are extracted from audio recordings and evaluated for their effectiveness in identifying COVID-19 cases from Coswara dataset. The explainable AI SHAP tool is employed which identified MFCC, zero crossing rate, and spectral bandwidth as the most influential features. Data augmentation techniques like random sampling, SMOTE, Tomek, and Edited Nearest Neighbours (ENN), are applied to improve the performance of various machine learning models used viz. Naive Bayes, K-nearest neighbours, support vector machines, …XGBoost, and Random Forest. Selecting the top 20 features achieves an accuracy of 73%, a precision of 74%, a recall of 94%, and an F1-score of 83% using the Random Forest model with the Tomek sampling technique. These findings demonstrate that a carefully selected subset of features can achieve comparable performance to the entire feature set while maintaining a high recall rate. The success of the Tomek undersampling technique highlights the ability of model to handle sparse clinical data and predict COVID-19 and associated diseases using speech-based features. Show more
Keywords: Covid-19, MFCC, spectral bandwidth, zero crossing rate, SHAP tool, Tomek
DOI: 10.3233/JIFS-219387
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Zou, Chao | Zhu, Jiwei | Cao, Jiawei | Wang, Xin | Mei, Zhenyu | Zhou, Kui
Article Type: Research Article
Abstract: Prefabricated buildings (PBs) are a new type of building construction, which are less time-consuming and cause low environmental pollution and resource consumption. They play an important role in industrialized construction and clean production and have gained worldwide attention. However, the high construction costs have become a major obstacle to their popularity and application. This study investigates the factors influencing construction costs of PBs in China using a systematic literature review (SLR), fuzzy interpretive structure modeling (fuzzy ISM), and the Matrice d’Impacts croises-multiplication appliqué an classment (MICMAC) technique. First, 32 influencing factors were identified from the SLR. Second, out of which …16 critical factors were selected and mapped in a hierarchical model through semi-structured interview screening, and the MICMAC technique was used to classify the cost-influencing factors of PBs into different categories. The results revealed that all identified factors played pivotal roles in various capacities and influenced the cost of PB construction. This study may assist administrators and policymakers in better understanding the factors that influence the costs of PBs construction to manage and reduce them. Show more
Keywords: Prefabricated buildings, construction costs, critical factors, fuzzy ISM, MICMAC technique
DOI: 10.3233/JIFS-240206
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-17, 2024
Authors: Ding, Zongchao
Article Type: Research Article
Abstract: The networks have achieved good results by using sparse connections, weight sharing, pooling, and establishing their own localized receptive fields. This work aims to improve the Space Invariant Artificial Neural Network approach and raise its recognition accuracy and convergence rate. Incorporating the continuous neural architecture into the Space Invariant Artificial Neural Network is the first step toward simultaneously learning the deep features of an image. Second, the skip convolution layer of ResNet serves as the foundation for developing a new residual module named QuickCut3-ResNet. A dual evaluation model is then developed to achieve the combined evaluation of the convolutional and …complete connection process. Ultimately, the best network parameters of the Space Invariant Artificial Neural Network are determined after simulation experiments are used to examine the impact of various network parameters on the network performance. Results from experiments demonstrate that the Space Invariant Artificial Neural Network technique described in this research can learn the image’s varied characteristics, which enhances the Space Invariant Artificial Neural Network’s capacity to recognize images and extract features accurately. Show more
Keywords: Artificial intelligence, big data, space invariant artificial neural network, image recognition, QuickCut3-ResNet
DOI: 10.3233/JIFS-239538
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-14, 2024
Authors: Wang, Zhimin | Li, Boquan
Article Type: Research Article
Abstract: This paper introduces an expert system to decision-making. The expert system is linguistic summarization combined with prioritized operators. In the practical decision-making problems, the information of attributes is linguistic type and needs to be converted into numerical type. The validity of the linguistic summarization is recorded as the attribute value. We discuss how to calculate the validity of the linguistic summarization, and present three prioritized operators. Then the three prioritized operators are used to aggregate the attribute values. Finally, a practical example is given. In addition, we conduct a comparative analysis between the expert system method and another multi-attribute decision-making …method by using a measure of specificity, and conclude that the expert system method is better. Show more
Keywords: Expert system, decision-making, linguistic summarization, prioritized operators, comparative analysis
DOI: 10.3233/JIFS-238556
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Yang, Fan | Zhou, Qing | Su, Renbin | Xiong, Weihong
Article Type: Research Article
Abstract: Molecular graph representation learning has been widely applied in various domains such as drug design. It leverages deep learning techniques to transform molecular graphs into numerical vectors. Graph Transformer architecture is commonly used for molecular graph representation learning. Nevertheless, existing methods based on the Graph Transformer fail to fully exploit the topological structural information of the molecular graphs, leading to information loss for molecular representation. To solve this problem, we propose a novel molecular graph representation learning method called MTS-Net (Molecular Topological Structure-Network), which combines both global and local topological structure of a molecule. In global topological representation, the molecule …graph is first transformed into a tree structure and then encoded by employing a hash algorithm for tree. In local topological representation, paths between atom pairs are transcoded and incorporated into the calculation of the Transformer attention coefficients. Moreover, MTS-Net has intuitive interpretability for identifying key structures within molecules. Experiments on eight molecular property prediction datasets show that MTS-Net achieves optimal results in three out of five classification tasks, the average accuracy is 0.85, and all three regression tasks. Show more
Keywords: Molecular representation, graph structure, graph transformer, property prediction
DOI: 10.3233/JIFS-236788
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Veeraiah, D. | Sai Kumar, S. | Ganiya, Rajendra Kumar | Rao, Katta Subba | Nageswara Rao, J. | Manjith, Ramaswamy | Rajaram, A.
Article Type: Research Article
Abstract: Medical image fusion plays a crucial role in accurate medical diagnostics by combining images from various modalities. To address this need, we propose an AI model for efficient medical image fusion using multiple modalities. Our approach utilizes a Siamese convolutional neural network to construct a weight map based on pixel movement information extracted from multimodality medical images. We leverage medical picture pyramids to incorporate multiscale techniques, enhancing reliability beyond human visual intuition. Additionally, we dynamically adjust the fusion mode based on local comparisons of deconstructed coefficients. Evaluation metrics including F1-score, recall, accuracy, and precision are computed to assess performance, yielding …impressive results: an F1-score of 0.8551 and a mutual information (MI) value of 2.8059. Experimental results demonstrate the superiority of our method, achieving a remarkable 99.61% accuracy in targeted experiments. Moreover, the Structural Similarity Index (SSIM) of our approach is 0.8551. Compared to state-of-the-art approaches, our model excels in medical picture classification, providing accurate diagnosis through high-quality fused images. This research advances medical image fusion techniques, offering a robust solution for precise medical diagnostics across various modalities. Show more
Keywords: Multimodal medical image fusion, image classification, siamese CNN, LSTM, genetic algorithm
DOI: 10.3233/JIFS-240018
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Huang, Rongbing | Hanif, Muhammad Farhan | Aleem, Aqsa | Siddiqui, Muhammad Kamran | Hanif, Muhammad Faisal | Hussain, Mazhar
Article Type: Research Article
Abstract: The triangular γ-graphyne structure is highlighted in particular, as it is a new configuration with possible applications in medicine. We shed light on this structure’s special qualities and potential uses in healthcare by computing several topological indices linked to it through computational research. Furthermore, we use Shannon’s entropy measure to express the information content of the connection-based topological indices in tandem. This method offers a thorough comprehension of the intricate features and structural properties of the triangular γ-graphyne structure. A logarithmic regression model is built to establish a quantifiable relationship between the computed indices and entropy. The SPSS program was …used in the development of this model, allowing for a thorough examination of the relationship between structural features and informational entropy. A regression model based on triangular graphyne topological indices is used as a predictive tool for entropy estimation. Show more
Keywords: Connection number (CN), triangular γ-graphyne, line graph, logarithmic regression model, Shannon entropy
DOI: 10.3233/JIFS-240356
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Wang, Ke | Gu, Tianrui | Du, Xiaoye
Article Type: Research Article
Abstract: With the rapid economic development and increasingly serious environmental problems, many regions have launched green credit policies. Green credit can reduce the loan interest rate of the environmental protection industry and lower the financing threshold. Traditional risk prediction methods cannot comprehensively evaluate the green credit risk of the enterprise based on the degree of green environmental protection and the industry environment in which the enterprise is located, resulting in the inconsistency between the credit financial risk prediction and the actual results, which increases the bank credit risk. In order to strengthen the management level of green credit and reduce the …probability of non-performing loans, a scientific risk assessment method was constructed by using a combination of automatic encoding network and bidirectional long short-term memory neural network model to predict the financial risks of green credit, driven by multi-modal data. Through the study of multimodal data, this paper took green credit financial risk as the research object, aggregated the information of various enterprises to improve the bank’s capital utilization rate, and also promoted enterprises to take the initiative to transform into the direction of green environmental protection. Finally, the experiment proved that multimodal data fusion model was more superior than random forest in risk prediction, reducing the bank’s non-performing loan rate by 3.1% and improving the bank’s risk control level. Show more
Keywords: Financial risk, green credit, risk prediction, multimodal data
DOI: 10.3233/JIFS-237691
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
Authors: Wang, Hengyou | Ke, Rongji | Jiang, Xiang
Article Type: Research Article
Abstract: Due to its remarkable performance, the convolutional neural network (CNN) has gained widespread usage in image inpainting challenges. However, most of these CNN-based methods reconstruct images only in the spatial domain, which produces satisfactory outcomes for small-region inpainting tasks, but blurs the details and generates incomplete structures for large-region inpainting tasks with complex backgrounds. In this paper, we address the issue of large-region inpainting tasks by our novel Adaptive Fourier Neural Network . Specifically, in our network, a Fourier-based global receptive field module is introduced to incorporate frequency information and expand the receptive field by transforming local convolutions into …global convolutions, enabling the proposed network to transmit global information to the missing region. Furthermore, to better fuse spatial and frequency features, an attention-based joint space-frequency module is proposed to combine spatial and frequency information. Finally, to validate the effectiveness and robustness of our proposed method, we conduct qualitative and quantitative experiments on two popular datasets Paris StreetView and Places. The experimental results demonstrate that our proposed method outperforms state-of-the-art methods by generating sharper, more coherent, and visually plausible inpainting results. Code will be released after this work published: https://github.com/langka9/AFNN.git . Show more
Keywords: Large-region image inpainting, Fourier-based global receptive field, frequency domain, Fourier Neural Network
DOI: 10.3233/JIFS-239513
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-15, 2024
Authors: Ruby Elizabeth, J. | Kesavaraja, D. | Ebenezer Juliet, S.
Article Type: Research Article
Abstract: The retinal illness that causes vision loss frequently on the globe is glaucoma. Hence, the earlier detection of Glaucoma is important. In this article, modified AlexNet deep leaning model is proposed to category the source retinal images into either healthy or Glaucoma through the detection and segmentations of optic disc (OD) and optic cup (OC) regions in retinal pictures. The retinal images are preprocessed and OD region is detected and segmented using circulatory filter. Further, OC regions are detected and segmented using K-means classification algorithm. Then, the segmented OD and OC region are classified and trained by the suggested AlexNet …deep leaning model. This model classifies the source retinal image into either healthy or Glaucoma. Finally, performance measures have been estimated in relation to ground truth pictures in regards to accuracy, specificity and sensitivity. These performance measures are contrasted with the other previous Glaucoma detection techniques on publicly accessible retinal image datasets HRF and RIGA. The suggested technique as described in this work achieves 91.6% GDR for mild case and also achieves 100% GDR for severe case on HRF dataset. The suggested method as described in this work achieves 97.7% GDR for mild case and also achieves 100% GDR for severe case on RIGA dataset. AIM: Segmenting the OD and OC areas and classifying the source retinal picture as either healthy or glaucoma-affected. METHODS: The retinal images are preprocessed and OD region is detected and segmented using circulatory filter. Further, OC region is detected and segmented using K-means classification algorithm. Then, the segmented OD and OC region classified are and trained by the suggested AlexNet deep leaning model. RESULTS: The suggested method as described in this work achieves 91.6% GDR for mild case and also achieves 100% GDR for severe case on HRF dataset. The suggested method as described in this work achieves 97.7% GDR for mild case and also achieves 100% GDR for severe case on RIGA dataset. CONCLUSION: This article proposes the modified AlexNet deep learning models for the detections of Glaucoma utilizing retinal images. The OD region is detected using circulatory filter and OC region is detected using k-means classification algorithm. The detected OD and OC regions are utilized to classify the retinal images into either healthy or Glaucoma using the suggested AlexNet model. The proposed method obtains 100% Sey, 93.7% Spy and 96.6% CA on HRF dataset retinal images. The proposed AlexNet method obtains 97.7% Sey, 98% Spy and 97.8% CA on RIGA dataset retinal images. The proposed method stated in this article achieves 91.6% GDR for mild case and also achieves 100% GDR for severe case on HRF dataset. The suggested method as described in this work achieves 97.7% GDR for mild case and also achieves 100% GDR for severe case on RIGA dataset. Show more
Keywords: Retina, deep learning, OD, OC, AlexNet
DOI: 10.3233/JIFS-234131
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-12, 2024
Authors: Wang, Lu
Article Type: Research Article
Abstract: In this technology world, education is also becoming one of the basic necessities of human life like food, shelter, and clothes. Even in day-to-day daily activities, the world is moving toward an automated process using technology developments. Some of the technology developments in day-to-day life activities are smartphone, internet activities, and home and office appliances. To cope with these advanced technologies, the persons must have basic educational qualification to understand and operate those appliances easily. Apart from this, the education helps the person to develop their personal growth in both knowledge and wealth. With the development of technologies, different Artificial …Intelligence techniques have been applied on the datasets to analyze these factors and enhance the teaching method. But the current techniques were applied to one or two data models that analyze either their educational performance or demographic variable. But these models were not sufficient for analyzing all the factors that affects the education. To overcome this, a single optimized machine-learning approach is proposed in this paper to analyze the factors that affect the education. This analysis helps the faculty to enhance their teaching methodology and understand the student’s mentality toward education. The proposed Hybrid Cuckoo search-particle swarm optimization was implemented on three datasets to determine the factors that affect the education. These optimal factors are determined by identifying their relations to the final results of an individual person. All these optimal factors are combined and grades are grouped to analyze the proposed optimization process performance using regression neural network. The proposed optimization-based neural network was tested on three data models and its performance analysis showed that the proposed model can achieve higher accuracy of 99% that affects the individual education. This shows that the proposed model can help the faculty to enhance their attention to the students individually. Show more
Keywords: Education, demographic factors, optimization, hybrid, cuckoo search optimization, particle swarm, regression neural network
DOI: 10.3233/JIFS-234021
Citation: Journal of Intelligent & Fuzzy Systems, vol. Pre-press, no. Pre-press, pp. 1-13, 2024
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]