Articles

Access the latest knowledge in applied science, electrical engineering, computer science and information technology, education, and health.

Filter Icon

Filters article

Years

FAQ Arrow
0
0

Source Title

FAQ Arrow

Authors

FAQ Arrow

29,922 Article Results

Recognition system based on artificial vision using OpenCV for discarding and detecting ceramics with defects

10.11591/ijai.v15.i2.pp1166-1173
Fernando Alvarado , Ricardo Yauri
Early detection of defects through preventive maintenance is important in industry to avoid economic losses, as in the case of ceramic tile manufacturing, where manual inspection allows defective parts to advance in production, causing delays. The research review shows that computer vision enables the automation of object detection, classification, and elimination tasks in industrial processes, using solutions based on Python, OpenCV, and MATLAB. For this reason, the design of a computer vision recognition system with OpenCV is proposed, which allows automatic discarding of ceramics with defects using an algorithm for detecting ceramics with a camera and Arduino-based hardware, comparing the captured images with a standard image on a conveyor belt. The machine vision system was integrated with a camera connected to a computer running OpenCV, achieving effective automatic detection with a threshold of 25% difference from the standard part. This percentage was calculated by comparing the grayscale pixel values with a reference image. The system calculates the proportion of pixels that exceed the similarity threshold. The conclusion is that the developed system contributes to production, highlighting the possibility of future industrial integration.
Volume: 15
Issue: 2
Page: 1166-1173
Publish at: 2026-04-01

TMA-Net: a transformer-based multi-modal attention network for abnormal behavior detection

10.11591/ijai.v15.i2.pp1441-1450
Huong-Giang Doan , Ngoc-Trung Nguyen
Abnormal behavior detection in crowded environments remains challenging due to complex motion patterns, occlusions, and domain variability. This paper presents transformer-based multi-modal attention network (TMA-Net), a unified framework that integrates red, green, and blue (RGB), optical flow (OF), and heat map (HM) modalities through a dual-stage attention fusion mechanism. The system employs you only look once version 11 (YOLOv11) for human localization and vision transformer (ViT)-B/16 for feature encoding, followed by intra-modal self-attention and cross-modal fusion to capture fine-grained spatial–temporal and motion energy dependencies. Extensive experiments on six public benchmarks as UMN, Crowd-11, UBNormal, ShanghaiTech, CUHK Avenue, UCSD Ped2, and EPUAbN dataset, demonstrate that TMA-Net achieves up to 97.5% area under the curve (AUC) and 96–100% accuracy, outperforming previous other state-of-the-art approaches. These results highlight the framework’s strong generalization and robustness across both single- and cross-dataset evaluations, underscoring its potential for reliable deployment in real intelligent surveillance systems.
Volume: 15
Issue: 2
Page: 1441-1450
Publish at: 2026-04-01

Technical analysis model for stock prediction using a grammatical evolution algorithm

10.11591/ijai.v15.i2.pp1236-1246
Aditya Kusuma Setyanegara , Imas Sukaesih Sitanggang , Mushthofa Mushthofa
Stocks are a popular investment instrument but carry high risks, where investors may incur losses when stocks are bought at high prices and sold at lower prices. Technical analysis is used to study past stock price behavior to predict future prices. In this study, grammatical evolution (GE) is applied as an evolutionary computing technique to discover optimal functions or programs that represent historical stock price data. This study develops GE based prediction models by utilizing objective functions and search spaces defined through grammar. The model integrates technical indicators based on complex statistical models such as autoregressive integrated moving average (ARIMA), prophet, exponential smoothing, and Fibonacci retracements. Furthermore, this study employs GE to generate ensemble weights randomly, ensuring each model contributes equitably to the final prediction formula. Experiments were conducted using multiple stock datasets, including SMAR, S&P 500, the Johannesburg Stock Exchange (JSE), the New York Stock Exchange (NYSE), and Adani Enterprises (ADANIENT), to evaluate the model’s adaptability and generalization capability. The results demonstrate that the proposed GE model effectively captures complex market patterns and produces more reliable stock price predictions compared to deep learning-based approaches. Although GE requires greater computational time, the findings suggest that GE provides a flexible and effective framework for constructing hybrid stock price forecasting models in dynamic market environments.
Volume: 15
Issue: 2
Page: 1236-1246
Publish at: 2026-04-01

AI-induced fatigue among students in higher education: a latent profile analysis

10.11591/ijai.v15.i2.pp1963-1971
Dynah D. Soriano , Jordan L. Salenga , John Paul P. Miranda , Juvy C. Grume , Emerson Q. Fernando , Jr., Amado B. Martinez , Raymond A. Cabrera , Jaymark A. Yambao
The integration of artificial intelligence (AI) tools in education offers significant benefits but also introduces challenges, including AI-induced fatigue among students. This study aimed to classify students’ experiences with AI tools using latent profile analysis (LPA). A quantitative cross sectional design and referral approach were used to collect survey data from 388 college students who actively used AI tools for academic purposes from November to December 2024. The survey measured AI usage intensity, AI literacy, self-efficacy, perceived usefulness, cognitive load, technostress, sleep quality, general fatigue levels, and attitude toward AI. Descriptive results indicated moderate levels of AI usage intensity, AI literacy, perceived usefulness, cognitive load, sleep quality, and general fatigue, with technostress and attitude toward AI also at moderate levels. Model selection considered Akaike information criterion (AIC), Bayesian information criterion (BIC), entropy, and profile size adequacy, and expert review supported the retained six-profile structure. The LPA identified six interpretable user groups: competent but sleep-deprived users, overwhelmed and high-strain users, stable moderate users, strained moderate users, high intensity strained users, and low-strain selective users. The findings show differences in patterns of competence, strain, fatigue, and sleep outcomes associated with AI tool use, which supports the development of profile specific strategies to manage technostress, cognitive load, fatigue, and sleep disruption among higher education students.
Volume: 15
Issue: 2
Page: 1963-1971
Publish at: 2026-04-01

Exponential long short-term memory with Levy flight optimization for lung nodule classification

10.11591/ijai.v15.i2.pp1451-1463
Kaliba Gowthami , Kamalakannan Jayaseelan
Lung cancer, which commonly appears as lung nodules is a deadly type of cancer that develops in a lung. Early detection of lung cancer is critical and challenging task due to presence of overlapping structures, which make it challenging to differentiate the benign and malignant regions. This research proposes long short-term memory (LSTM) with exponential linear unit (ELU) method for the classification of different classes of lung nodules. The hyperparameters of the LSTM network are optimized using the developed dynamic Levy flight – Archimedes optimization algorithm (DLF-AOA), which effectively identifies the optimal parameters for classification. The ResNet-18 method is used for the extraction of high-level features to differentiate various classes of lung nodules. Furthermore, Bayesian active contour (BAC) is employed for the segmentation of images as containing cancerous and non-cancerous regions of lung nodules. The LSTM with ELU method achieves 98.56% accuracy, 97.54% sensitivity, 98.22% specificity, 96.93% precision, 96.33% F1-score, and 1.44 error rate in IQ-OTH/NCCD lung cancer dataset.
Volume: 15
Issue: 2
Page: 1451-1463
Publish at: 2026-04-01

Deep learning for mental health analysis: long short-term memory approach to text-based condition classification

10.11591/ijai.v15.i2.pp1762-1770
Zaqqi Yamani , Dinda Lestarini , Sarifah Putri Raflesia , Purwita Sari , Ghita Athalina
The increasing prevalence of mental health disorders highlights the need for scalable and automated approaches to early detection. This study proposes a deep learning–based text classification framework using a long short-term memory (LSTM) network to identify mental health conditions from user generated textual data. A corpus of 103,488 labeled texts representing anxiety, stress, bipolar disorder, depression, personality disorder, suicidal ideation, and normal states was preprocessed through tokenization, padding, and word embedding. The proposed LSTM model achieved overall accuracy of 87% on test set, with strong class-wise performance reflected by precision, recall, and F1-scores, particularly for anxiety, personality disorder, and normal classes. Comparative error analysis using a confusion matrix revealed challenges in distinguishing depression from suicidal ideation, indicating semantic overlap between these conditions. The results demonstrate that LSTM-based models can effectively capture sequential linguistic patterns relevant to mental health classification. This framework shows potential as a decision-support tool for early screening and digital mental health applications, complementing clinical assessment rather than replacing it.
Volume: 15
Issue: 2
Page: 1762-1770
Publish at: 2026-04-01

Hybrid classical–quantum ensemble learning for real-time flight delay prediction at Tribhuvan International Airport

10.12928/telkomnika.v24i2.27240
Pavan; Civil Aviation Authority of Nepal Khanal , Nanda Bikram; Tribhuvan University Adhikari
This study investigates ensemble learning using classical and quantum-inspired models to predict flight delays at Tribhuvan International Airport (TIA), Nepal. It combines traditional machine learning algorithms with quantum-based approaches, quantum boosting (QBoost) and the hybrid QBoostPlus, leveraging quantum properties for faster computation. The dataset includes flight records from 2020 to 2024 and Meteorological Aerodrome Reports (METAR), analyzed across four sea- sons to capture delay patterns in domestic and international flights. A combined seasonal dataset assesses model generalization. Six models; VotingClassifier, adaptive boosting (AdaBoost), xtreme gradient boosting (XGBoost), categorical boosting (CatBoost), QBoost, and QBoostPlus are evaluated based on accuracy, precision, recall, F1 score, area under the curve(AUC), and execution time. CatBoost achieved high accuracy (up to 0.97) but slower execution (up to 10,570.63 ms). QBoostPlus provides competitive AUC scores (0.83–0.95) with faster execution, improving speed by up to 99.94% and generating predictions in as little as 6.46 ms. Although quantum-inspired models have slightly lower accuracy, their computational efficiency and stability show strong potential for real-time flight delay prediction. This is the first study applying quantum-inspired ensemble learning to Nepalese aviation data, showing promise for regional airports with limited infrastructure.
Volume: 24
Issue: 2
Page: 527-535
Publish at: 2026-04-01

Gradient-based stochastic depth with convolutional neural network for coconut tree leaf disease classification

10.11591/ijai.v15.i2.pp1155-1165
Kavitha Magadi Gopalakrishna , Raviprakash Madenur Lingaraju , Ananda Babu Jayachandra
The coconut palm (Cocos nucifera) is vital plantation crop, valued for their different uses, ranging from their fruit to its trunk. In recent times, it has been observed that many coconut trees are affected by diseases that reduce production and weaken the strength of the coconut. The classification of coconut leaf diseases is challenging because of intra-class and inter-class variability. This research introduces the gradient-based stochastic depth (GSD) with convolutional neural network (CNN) technique to coconut leaf disease classification to overcome these challenges. The GSD technique is incorporated into every layer of the CNN, where it calculates the probability using gradient magnitudes and skips layers that contribute minimally to the classification. The images are segmented using the GrabCut segmentation algorithm, which isolates the leaf from the background using graph-based segmentation, helping to differentiate between various disease classes. The GSD with CNN algorithm obtains an accuracy of 96.42%, precision of 96.15%, recall of 95.87%, and F1-score of 95.93%, while comparing with existing algorithms.
Volume: 15
Issue: 2
Page: 1155-1165
Publish at: 2026-04-01

Transformer-based Hindi image description and storytelling using enhanced attention and FastText embeddings

10.11591/ijai.v15.i2.pp1771-1782
Anjali Sharma , Mayank Aggrwal , Jitin Khanna
This work presents a novel image description generation framework that combines a Transformer-based encoder-decoder architecture with a custom squeeze-and-excitation (SE) attention block integrated into an EfficientNet feature extractor. The decoder uses FastText embeddings specifically trained for Hindi and is evaluated on the Microsoft common objects in context (MS-COCO) dataset. To improve the captioning process, the model incorporates a generative pre-trained transformer (GPT) module to generate narrative descriptions based on the initial captions and applies multiple similarity metrics to assess output quality. The proposed system significantly outperforms existing methods, achieving high bilingual evaluation understudy (BLEU) scores (BLEU-1 to BLEU-4: 83.24, 73.17, 64.56, and 58.22), a consensus-based image description evaluation (CIDEr) score of 81.41, an F1 score of 90.29, and a metric for evaluation of translation with explicit ordering (METEOR) score of 81.18, indicating strong caption accuracy. Furthermore, the model achieves low error rates, with a word error rate (WER) of 15% and a character error rate (CER) of 11%. This work highlights the challenges of applying large-scale datasets like MS-COCO to resource-limited languages and demonstrates the effectiveness of integrating FastText embeddings with transformer-based models for Hindi image captioning.
Volume: 15
Issue: 2
Page: 1771-1782
Publish at: 2026-04-01

Zoneout regularization-gated recurrent unit algorithm on NIDS with class imbalance handling

10.11591/ijai.v15.i2.pp1505-1512
Mala Kariyappa , Manjunath Hanumanthappa Rangappa , Venugopal Dasappa , Gururaja Hebbur Satyanarayana , Girish Keshava Rao , Gousia Thahniyath
Network intrusion detection system (NIDS) is primarily utilized tool to identify malicious threats on the network. It plays an essential role in safeguarding against an increasing variety of attacks and ensures enhanced security for the network. The existing model struggled to handle the imbalance of class issues during the process of classification due to their biased nature, which reduced the performance of the algorithm. In this paper, the zoneout regularization–gated recurrent unit (ZR-GRU) algorithm is developed to detect and classify intrusions in the network. Incorporating the ZR into GRU reduces overfitting by preventing the model from becoming overly dependent on specific features. It provides good generalization by maintaining diversity in learned representation. Synthetic minority oversampling technique (SMOTE) and Near Miss methods are utilized to balance the samples in the dataset, which helps to increase the performance of a classifier in NIDS. The ZR-GRU technique attained 99.91% accuracy on UNSW-NB15, 99.92% accuracy on CIC-IDS2018, and 99.14% accuracy on CIC-DDoS2019 when comparing with a convolutional neural network bidirectional long short-term memory (CNN-BiLSTM).
Volume: 15
Issue: 2
Page: 1505-1512
Publish at: 2026-04-01

Hybrid recommender for computer aided design software

10.11591/ijai.v15.i2.pp1931-1946
Younes Zidani , Younes Zahrou , Salah Nissabouri , Moulay El Houssine Ech-Chhibat , Khalifa Mansouri
Choosing the right computer-aided design (CAD) software is a complex task due to the wide variety of available options. Using user opinions and reviews may not be sufficient, which highlighting the need for a decision support system. In this paper, we develop and evaluate a hybrid recommendation program (HRP) for CAD software written in the Python programming language, combining collaborative filtering (CF) and content-based filtering (CBF) using k-nearest neighbors (KNN). CF uses user ratings to identify similar users, while CBF compares software characteristics to find similar options. In our hybrid approach, we integrate both filtering techniques with KNN to generate personalized recommendations. It will improve the relevance of software options, help users make choices (students, educators, and professionals), and encourage the adoption of tools most appropriate for every profile. We used the analytic hierarchy process (AHP) method to choose the criteria for our recommendation program. We tested the HRP on a simulated CAD dataset and found that it made recommendations much more accurately than using CF and CBF separately. Evaluation metrics like precision (0.81), recall (0.95), and F1-score (0.87) show that this hybrid approach works, making it a more reliable tool for helping people choose CAD software.
Volume: 15
Issue: 2
Page: 1931-1946
Publish at: 2026-04-01

Real-time detection of rider fatigue: a comparative study of black-box and glass-box artificial intelligence approaches

10.11591/ijai.v15.i2.pp1409-1417
Cynthia Hayat , Iwan Aang Soenandi , Budi Harsono
Rider fatigue poses a critical safety challenge in two-wheeled vehicle operation due to limited physical protection, increased balance demands, and prolonged exposure to environmental stressors. Effective real-time fatigue detection is essential to mitigate accident risks, particularly in high-traffic regions such as Indonesia. This study presents a comparative analysis of black-box and glass-box artificial intelligence (AI) models for real-time detection of rider fatigue, evaluated through a human factor’s lens emphasizing interpretability, intrusiveness, and cognitive compatibility. Multimodal data comprising physiological signals, behavioral indicators, and environmental context were collected using wearable sensors and rider telemetry to train and assess the models. Experimental results reveal that black-box models, including convolutional neural network (CNN) + long short-term memory (LSTM), random forest (RF), and support vector machine (SVM), achieve superior predictive accuracy (94.3%, 91.5%, and 88.2%, respectively) but lack inherent transparency. Conversely, glass-box models such as decision tree (DT) and logistic regression (LR) offer greater interpretability, a critical factor in safety-sensitive applications, though with reduced accuracy (approximately 83–85%). These findings underscore the trade-off between predictive performance and explainability, highlighting the need to tailor model choice to specific operational requirements. This research advances the design of intelligent, human-centered rider support systems that balance accuracy, transparency, and user trust, fostering safer two-wheeled transportation.
Volume: 15
Issue: 2
Page: 1409-1417
Publish at: 2026-04-01

Analysis of tuberculosis detection using deep learning technique and explainable artificial intelligence

10.11591/ijai.v15.i2.pp1623-1631
Shashikiran Srinivas , Kavita Avinash Patil , Kushalatha Monappa Rama , Sudha Venkateshlu , Jayanthi Muthuswamy , Srinivas Babu Narayanappa
Tuberculosis (TB) affects the health of many individuals and is still a prime worldwide health concern despite having so many advanced treatments, as it still lacks technical advancement in its treatment and diagnosis. Accuracy in identification and early detection is essential to reduce the spread and improve treatment outcomes. Traditional methods of diagnosis, such as sputum microscopy and culture, are labor-dependent and subject to human mistakes as it is done by lab technicians. Recent improvements in deep learning have demonstrated significant potential for enhancing and automating diagnostic accuracy. Our research proposes a deep learning based technique that detects TB from chest X-rays after image processing techniques like augmentation. After training on big data, our model pulls off an astonishing accuracy of 97.42% and a loss of 7.17%, outperforming traditional methods. The model uses convolutional neural network (CNN) as a base and transfer learning method, like DenseNet-121, and explainable artificial intelligence (XAI) technique, like Grad-CAM, to recognize TB related patterns effectively and with low false positives. This approach has the ability to revolutionize the diagnosis of TB and offer more dependable, scalable, and timely solutions to healthcare systems worldwide.
Volume: 15
Issue: 2
Page: 1623-1631
Publish at: 2026-04-01

Unimodal and multimodal techniques for depression diagnosis: a comprehensive survey

10.11591/ijai.v15.i2.pp1947-1954
Swathy Jayasree , Yashawini Sridhar
Depression is a common and major mental health condition that affects individuals across all age groups and any backgrounds, severely reducing their physical, emotional, and cognitive functioning. It goes beyond typical mood swings and requires a timely and accurate diagnosis to prevent severe consequences such as suicidal tendencies, self-harm, and long-term mental decline. The improving performance of deep learning and machine learning techniques has significantly enhanced the speed and accuracy of depression diagnosis using both unimodal and multimodal features. This comprehensive study gives a complete overview of the unimodal and multimodal methods used to diagnose depression in its early stages. Additionally, this survey summarizes the dataset, methods, and limitations of previous work presented in the domain of depression diagnosis and serves as a suitable reference for future analysis.
Volume: 15
Issue: 2
Page: 1947-1954
Publish at: 2026-04-01

Simultaneous faults diagnosis and prognostic in induction motor drives under nonstationary conditions

10.12928/telkomnika.v24i2.27624
Ameur Fethi; University Tahar Moulay of Saida Aimer , Ahmed Hamida; University of Sciences and Technology of Oran Boudinar , Mohamed El-Amine; University of Sciences and Technology of Oran Khodja , Azeddine; University of Sciences and Technology of Oran Bendiabdellah
In this paper, an auto regressive (AR) model-based approach is applied in the stator current analysis under non-stationary conditions (case of frequency variation due to variable speed operation). Under these conditions, the identification of fault signatures is almost impossible due the variation of the fundamental frequency using conventional analysis methods. Moreover, this approach is used in the diagnosis of multiple faults occurring simultaneously in induction motor drives. In this aim, the stator current signal is decomposed into short segments then the AR modeling approach is applied on each segment. This approach called short-time ROOT-AR is then applied to solve the problem of the non-stationarity of the stator current signal under variable speed operation. The efficiency of the short-time ROOT-AR approach is evaluated through experimental tests in the diagnosis of multiple faults occurring simultaneously in induction motor drive. Finally, the superiority of the proposed approach is highlighted in comparison with conventional techniques in terms of accuracy, computational time and robustness against the noise.
Volume: 24
Issue: 2
Page: 717-726
Publish at: 2026-04-01
Show 13 of 1995

Discover Our Library

Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.

Explore Now
Library 3D Ilustration