Articles

Access the latest knowledge in applied science, electrical engineering, computer science and information technology, education, and health.

Filter Icon

Filters article

Years

FAQ Arrow
0
0

Source Title

FAQ Arrow

Authors

FAQ Arrow

29,922 Article Results

Combining convolutional operators in unsupervised networks for kidney abnormalities

10.11591/ijai.v14.i6.pp4541-4551
Aekkarat Suksukont , Anuruk Prommakhot , Jakkree Srinonchat
Deep learning plays a pivotal role in advancing the diagnosis of renal dysfunction, achieving performance levels comparable to those of medical experts. However, disease domain variations and model differences can impact learning quality. To address renal dysfunction, we propose dual stream convolutional (DSC) and dual-input convolutional (DIC) for unsupervised learning. The proposed network is designed to process multi scale data and employs parallel data aggregation to enhance learning capabilities, improving the reliability of the experimental results. DSC achieved training losses of 0.0069, 0.0056, 0.0042, and 0.0048 for normal, cyst, stone, and tumor datasets, respectively, while DIC achieved losses of 0.0066, 0.0063, 0.0044, and 0.0058 for the same categories. The experimental results demonstrate that our proposed models outperform state of-the-art approaches, making them well-suited for broad application in clinical research studies.
Volume: 14
Issue: 6
Page: 4541-4551
Publish at: 2025-12-01

Classification system of banana types and ripeness levels based on convolutional neural network

10.11591/ijai.v14.i6.pp4891-4901
Lucia Jambola , Arsyad Ramadhan Darlis , Windi Malaha , Dwi Aryanta
Recently, the availability of bananas in supermarkets has been relatively abundant. However, most buyers experience problems categorizing the type and level of ripeness of bananas, so the level of purchases of this fruit decreases. This study implements a system that can automatically classify bananas based on type and level of ripeness so that buyers can choose them based on their needs. In this study, the proposed system could classify the types and degrees of banana ripeness using a Convolutional Neural Network (CNN) where the system was implemented in real-time using the hardware of the Jetson Nano as a processing unit and a camera system as a sensor. The methodology adopted in this research involves implementing CNN architectures, i.e., ResNet-18 and ResNet-50, under various conditions. The training phase comprises 60 epochs, while testing considers illumination parameters from LED lights with power of 6 watts, 12 watts, and 22 watts under distances ranging from 10 to 100 cm. The results show that the system could classify the type and level of ripeness of bananas in real-time with an accuracy of 93% that is achieved using the 22-watt power for all type and ripeness levels. 
Volume: 14
Issue: 6
Page: 4891-4901
Publish at: 2025-12-01

Automated legal content management system for multi-country integration

10.11591/ijai.v14.i6.pp4511-4519
Hardik Pawar , Nidhi Prakash , Smriti Srivastava , Sneha M. , Shaik Mohideen Syedabdulkader , Pratiba D. , Sandhya S.
This paper presents an automated legal content management system (CMS) designed for multi-country integration, addressing the complex challenges of legal document migration across more than 180 countries while ensuring regulatory compliance and accessibility standards. The system implements a hierarchical four-level architecture, migrating more than 2,740 legal documents with zero data loss incidents through fault-tolerant processing pipelines. The automated portable document format (PDF) migration component demonstrates exceptional efficiency, processing documents 36 times faster compared to manual approaches, while article migration achieves 230 times faster processing speeds. The integrated artificial intelligence (AI)-powered accessibility enhancement system generates contextually appropriate alt text descriptions, allowing organizations to process 10,000 images annually with savings of $14,990. The complete country migration process, covering both PDF and article processing, executes in 30 seconds compared to 56 minutes for manual processing, representing a 112-fold improvement in performance. System scalability demonstrates linear performance characteristics up to more than 5,000 documents with consistent processing metrics while maintaining compliance across diverse regulatory frameworks. These quantitative improvements establish a new paradigm for automated legal content management, providing a scalable foundation for global enterprises managing multi jurisdictional legal documentation requirements.
Volume: 14
Issue: 6
Page: 4511-4519
Publish at: 2025-12-01

Graph based semantic email classification: a novel approach for academic institutions

10.11591/ijai.v14.i6.pp5218-5230
Aruna Kumara B. , Madan H. T. , Rashmi C. , Sarvamangala D. R.
Electronic mail classification in educational institutes becomes the fundamental task to manage information efficiently. Due to the globalization and the technological advancement, volume of email users increasing consistently, which in turn increases the volume of digital data exponentially. This necessitates the developing automated email classification systems for the better and organized work. This paper develops a novel graph-based similarity (GBS) approach based on semantic similarity to address these challenges. The method initially selects the most relevant features based on feature weights, later it builds a graph by using Jaccard co efficient method for each category with features as nodes and correlation between the nodes as edges. Later, these graphs are used as templates for each category and classifies each new incoming email into the specific class based on the similarity among the graph templates and a new email. The GBS method was compared with the well-known benchmarked email classifiers and the findings demonstrated that the GBS method outperformed with 98.91% accuracy after fine-tuning of graph parameters and the classifier hyper parameters. Additionally, receiver operating characteristic (ROC) curve analysis was conducted, achieving a highest area under curve (AUC) score 0.989, demonstrating robust classification proficiency across all categories.
Volume: 14
Issue: 6
Page: 5218-5230
Publish at: 2025-12-01

Large language models for pattern recognition in text data

10.11591/ijai.v14.i6.pp5311-5332
Aknur Kosayakova , Kurmashev Ildar , Luigi La Spada , Nida Zeeshan , Makhabbat Bakyt , Moldamurat Khuralay , Omirzak Abdirashev
Large language models (LLMs) are widely deployed in settings where both reliability and efficiency matter. We present a calibrated, seed‑robust empirical comparison of an encoder fine‑tuned model (bidirectional encoder representations from transformers (BERT)‑base) and a decoder in‑context model (generative pre-trained transformer (GPT)‑2 small) across Stanford question answering dataset v2.0 (SQuAD v2.0) and general language understanding evaluation (GLUE)-multi-genre natural language inference (MNLI), Stanford sentiment treebank 2 (SST‑2). Beyond accuracy, we assess reliability (expected calibration error with reliability diagrams and confidence–coverage analysis) and efficiency (latency, memory, throughput) under matched conditions and three fixed seeds. BERT‑base yields higher accuracy and lower calibration error, while GPT‑2 narrows gaps under few‑shot prompting but remains more sensitive to prompt design and context length. Efficiency benchmarks show that decoder‑only prompting incurs near‑linear latency/memory growth with k‑shot exemplars, whereas fine‑tuned encoders maintain stable per‑example cost. These findings offer practical guidance on when to prefer fine‑tuning versus prompting and demonstrate that reliability must be evaluated alongside accuracy for risk‑aware deployment.
Volume: 14
Issue: 6
Page: 5311-5332
Publish at: 2025-12-01

Classifying classical music’s therapeutic effects using deep learning: a review

10.11591/ijai.v14.i6.pp4933-4942
Angelin Angelin , Samuel Ady Sanjaya , Dinar Ajeng Kristiyanti
Mental health issues are the leading cause of global disability, increasing the need for treatment options. While there is much research on the emotional recognition of music in general, there is a gap in studies that directly connect musical features with their therapeutic effects using deep learning. This systematic literature review explores the use of deep learning in classifying the therapeutic effects of classical music for mental health. Following the preferred reporting items for systematic reviews and meta-analyses (PRISMA) framework, a total of 15 papers were reviewed. This review synthesized studies on the role of musical elements that affect mental states. Different feature extraction methods, including mel-frequency cepstral coefficients (MFCCs), spectral contrast, and chroma features, are discussed for their roles in classifying these therapeutic effects. This review also looks at deep learning algorithms like convolutional neural network (CNN), deep neural network (DNN), long short-term memory (LSTM) network, and combined models to assess their effectiveness. Common evaluation methods are also assessed to measure the performance of these models in audio classification. This review highlights the potential of combining deep learning with classical music for mental health support, and to future possibilities for applying these methods in the real world.
Volume: 14
Issue: 6
Page: 4933-4942
Publish at: 2025-12-01

Sentiment classification using gradient modulation and layered attention

10.11591/ijai.v14.i6.pp5193-5200
Bagiyalakshmi Natarajan , T. Veeramakali
Sentiment analysis is a technique for evaluating text to ascertain whether a statement is positive, negative, or neutral. Currently, transformer-based models capture the contextual relationships among words in a phrase and accomplish sentiment analysis in a nuanced manner via multi-head attention. This approach, with a fixed number of layers and heads, struggles to find the complex relationships between phrases and their semantic structures. To mitigate this issue, the suggested technique incorporates the graded multi head attention model (GMHA) at the base of the distilled bidirectional encoder representations from transformers (DistilBERT) model. It is employed to augment the layers and heads progressively, capturing the relationships between sentences in a sophisticated manner. By increasing the layers and heads the proposed model extracts long-term and hierarchical relationships from the sentence. Additionally, the attention sentient optimization technique is introduced, which improves model learning by giving more weight to important words in a sentence. During training, the process checks to see which words (“amazing" or "worst") get more attention and gives them more weight in the model update. This makes it easier for the model to understand important emotions. Our suggested model enhances performance in sentiment exploration, with an accuracy of 96.53%. This interpretation includes a comparison analysis with another contemporary framework.
Volume: 14
Issue: 6
Page: 5193-5200
Publish at: 2025-12-01

A comprehensive impression on identifying plant diseases using machine learning and deep learning methodologies

10.11591/ijai.v14.i6.pp4694-4702
Ravikanth Motupalli , John T Mesia Dhas , Swapna Neerumalla , Janjhyam Venkata Naga Ramesh , Butti Gouthami , Pavan Kumar Ande
Maintaining healthy plants is essential for long-term agricultural production because agriculture is the backbone of many economies. Agricultural productivity is greatly endangered by plant diseases, which result in huge economic losses. Identifying plant diseases using traditional approaches can be quite laborious, time-consuming, and knowledge-intensive. Automated, precise, and quick diagnosis of plant diseases has been made possible by recent developments in artificial intelligence, mainly in deep learning, and machine learning. This study gives a thorough analysis of how machine learning and deep learning are currently being used to detect plant diseases. Methodologies, datasets, evaluation measures, and the inherent difficulties of the area are all examined. In order to better understand these technologies in practical agricultural contexts, this review will try to shed light on their advantages and disadvantages.
Volume: 14
Issue: 6
Page: 4694-4702
Publish at: 2025-12-01

Deep learning approaches for Braille detection and classification: comparative analysis

10.11591/ijai.v14.i6.pp4652-4660
Surekha Janrao , Tavion Fernandes , Ojas Golatkar , Swaraj Dusane
This study proposes a hybrid approach to Braille translation leveraging the strengths of both YOLO for object detection and multitude of classification models such as ResNet, and ResNet for accurate Braille character classification from images. Upon comparing numerous models on various performance metrics, ResNet and DenseNet outperformed other models, exhibiting high accuracy (0.9487 and 0.9647 respectively) and F1-scores (0.9481 and 0.9666) due to their deep, densely connected architectures adept at capturing intricate Braille patterns. CNNs with pooling showed balanced results, while MobileNetV2's lightweight design limited complex classification. ResNeXt's multi-path learning achieved respectable performance but lagged behind ResNet and DenseNet. In the future the results from our study could be further explored on contracted Braille recognition, be adapted to various Braille codes, and optimized for mobile devices, for real time Braille detection and translation on smartphones.
Volume: 14
Issue: 6
Page: 4652-4660
Publish at: 2025-12-01

Hybrid AI framework for anomaly detection and root cause analysis in multi-agent systems

10.11591/ijai.v14.i6.pp5290-5302
Tahri Rachid , Ouammou Abdellah , Lasbahani Abdellatif , Abdessamad Jarrar , Balouki Youssef
Anomaly detection and root cause analysis (RCA) are critical for securing intelligent systems against evolving threats. Traditional models often suffer from high false alarms, weak adaptability to streaming contexts, and limited interpretability. This work proposes a hybrid artificial intelligence (AI) framework that integrates machine learning (ML) with prior knowledge, semantic rules, and bio-inspired modeling. The approach strengthens detection of diverse attacks, including DoS/DDoS, Probe, U2R, and R2L, while reducing human intervention. Experiments on the NSL-KDD dataset demonstrate that our method decreases spurious alerts by up to 90%, improves accuracy by 2–4%, and reduces false positives/negatives by about 4%. Beyond statistical gains, the framework ensures robustness in real-time environments, offering interpretable and scalable anomaly detection for heterogeneous systems. These results highlight the potential of hybrid symbolic–subsymbolic AI to enhance reliability in next-generation security infrastructures.
Volume: 14
Issue: 6
Page: 5290-5302
Publish at: 2025-12-01

Inexpensive human audiometric system using Raspberry Pi and artificial intelligence

10.11591/ijai.v14.i6.pp4502-4510
Abdulrafa Hussain Maray , Muataz Akram Hassan , Taha Hussein Marai Al-Hassan
The most common and widespread disease in Iraq is hearing impairment for children and newborns. Also, in cities, people are exposed to high levels of noise, loud sounds at work, like factories, and machinery noise. In this paper, a system was designed and implemented to measure the level of hearing in the human ear, in order to reduce the cost of these devices. This system uses Raspberry Pi 3 microcontrollers, which are considered cheap and have high capabilities in open-source programming. Their abundant availability will lead to the provision of these systems in homes, health centers, and hospitals. In this proposed algorithm, two sine waves are generated by the microcontroller with different frequencies. It is transmitted by the MP3 audio transmission cable through the analog-to-digital (ADC) port. These audio signals are generated at a frequency of (0.5 to 12 kHz), these frequencies are the ones that humans can hear, and they can be represented by pulse width modulator (PWM) technology (x=255 samples). Convolutional neural network (CNN) is trained on the dataset acquired through deep learning algorithms.
Volume: 14
Issue: 6
Page: 4502-4510
Publish at: 2025-12-01

EmoVibe: AI-driven multimodal emotion analysis for mental health via social media dashboards

10.11591/ijai.v14.i6.pp4565-4578
Deepali Vora , Aryan Sharma , Mudit Garg , Steve Fransis
Monitoring mental health via social media often utilizes unimodal approaches, such as sentiment analysis on text or single-staged image categorization, or executes early feature fusion. However, in real-world contexts where emotions are conveyed via text, emojis, and images, unimodal approach leads to obscured decision-making pathways and overall diminished performance. To overcome these limitations, we propose EmoVibe, a hybrid multimodal AI framework for emotive analysis. EmoVibe uses attention-based late fusion strategy, where text embeddings are generated from bidirectional encoder representations from transformers (BERT) and visual features are extracted by vision transformer. Subsequently, emoticon vectors linked to avatars are processed independently. Later, these independent data features are integrated at higher levels, enhancing interpretability and performance. In contrast to early fusion methods and integrated multimodal large language models (LLMs) like CLIP, Flamingo, GPT-4V, MentaLLaMA, and domain-adapted models like EmoBERTa, EmoVibe preserves modality-specific contexts without premature fusion. This architecture saves processing cost, allowing for clearer, unambiguous rationalization and explanations. EmoVibe outperforms unimodal baselines and early fusion models, obtaining 89.7% accuracy on GoEmotions, FER, and AffectNet, compared to BERT's 87.4% and ResNet-50's 84.2%, respectively. Furthermore, a customizable, real time, privacy-aware dashboard is created, supporting physicians and end users. This technology enables scalable and proactive intervention options and fosters user self-awareness of mental health.
Volume: 14
Issue: 6
Page: 4565-4578
Publish at: 2025-12-01

Arabic text classification using machine learning and deep learning algorithms

10.11591/ijai.v14.i6.pp5201-5217
Rawad Awad Alqahtani , Hoda A. Abdelhafez
The classification of Arabic textual content presents considerable challenges due to the language's rich morphological structure and the wide variation among its dialects. This study aims to enhance classification accuracy by leveraging ensemble learning techniques and a deep bidirectional transformer-based model, specifically the multilingual autoregressive BERT (MARBERT). To address linguistic variability, advanced preprocessing techniques were employed, including Farasa, Tashaphyne, and Assem stemming methods. The Al Khaleej dataset served as the basis for supervised learning, providing a representative sample of Arabic text. Furthermore, term frequency-inverse document frequency (TF-IDF) with bigram and trigram feature extraction was utilized to effectively capture contextual semantics. Experimental results indicate that the proposed approach, particularly with the integration of MARBERT, achieves a peak classification accuracy of 98.59%, outperforming existing models. This research underscores the efficacy of combining ensemble learning with deep transformer-based models for Arabic text classification and highlights the critical role of robust preprocessing techniques in managing linguistic complexity and improving model performance.
Volume: 14
Issue: 6
Page: 5201-5217
Publish at: 2025-12-01

Enhancing communication and interaction in the movie industry based SparkMLlib's recommendation system

10.11591/ijai.v14.i6.pp4661-4674
Said Chakouk , Abdelkerim Zitouni , Nazif Tchagafo , Ahiod Belaid
In the ever-evolving landscape of streaming platforms, recommendation systems contribute significantly to enhancing the user experience. This article examines the significance of these systems in suggesting movies, analyzing their impact on user satisfaction and platform performance. Utilizing SparkMLlib, a powerful tool for large-scale data processing, we explore various recommendation techniques, including collaborative filtering and content-based filtering. We highlight the dimension of digital communication to further enhance the accuracy of recommendations and foster greater user engagement. Our study also addresses the challenges and future opportunities related to recommendation systems, emphasizing the need for transparency and ethical algorithms. This research highlights the potential for recommendation systems to revolutionize the digital entertainment landscape and shape the future of the movie industry.
Volume: 14
Issue: 6
Page: 4661-4674
Publish at: 2025-12-01

Enhancing credit card fraud detection with synthetic minority over-sampling technique-integrated extreme learning machine

10.11591/ijai.v14.i6.pp4749-4762
Iman Kadhim Ajlan , Mohammed Ibrahim Mahdi , Hayder Murad , Fahad Taha AL-Dhief , Nurhizam Safie , Yasir Hussein Shakir , Ali Hashim Abbas
Many works in cybersecurity detection suffer from low accuracy rates, particularly in real-world applications, where imbalanced datasets and evolving fraud strategies pose significant hurdles. This study introduces an optimized extreme learning machine (ELM) algorithm to address these challenges by dynamically adjusting hidden nodes ranging from 10 to 100 with an increment step of 10 and integrating two activation functions. The proposed method utilizes the synthetic minority over-sampling technique (SMOTE) to handle class imbalance effectively and incorporates a comprehensive evaluation using descriptive statistics, visualization, and significance testing. The proposed ELM-SMOTE method achieves the highest results including an accuracy of 99.710%, recall of 85.811%, specificity of 99.743%, and G-mean of 92.068%. These outcomes reflect the robustness and adaptability of the proposed ELM algorithm in detecting fraudulent transactions. This study emphasizes the importance of a holistic performance analysis, addressing gaps in existing methods and providing a scalable framework for real-world fraud detection applications.
Volume: 14
Issue: 6
Page: 4749-4762
Publish at: 2025-12-01
Show 87 of 1995

Discover Our Library

Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.

Explore Now
Library 3D Ilustration