- 1Department of Information Technology, Mukesh Patel School of Technology Management & Engineering, SVKM’s NMIMS, Mumbai, Maharashtra, India
- 2Department of CSE, Shri L R Tiwari College of Engineering, Mumbai, Maharashtra, India
- 3Institute of Electrical and Electronics Engineers (IEEE), Dallas, TX, United States
- 4Institute of Electrical and Electronics Engineers (IEEE) - Engineering in Medicine and Biology Society, New York, NY, United States
- 5Department of EEE, SR University, Warangal, Telangana, India
1 Introduction
Attention Deficit Hyperactivity Disorder (ADHD), is characterized by abnormalities in brain structure and function, particularly in the prefrontal cortex, which is associated with attention, executive functions, and impulse control. Neurochemical imbalances, especially involving dopamine and norepinephrine, play a crucial role. Genetic factors contribute significantly, with ADHD often running in families. ADHD affects both children and adults, with symptoms typically manifesting in childhood (1). Those with ADHD may experience a range of abnormalities, including difficulties in maintaining attention, hyperactivity, and impulsivity (2). These symptoms can lead to academic challenges, difficulties in social interactions, and problems with task completion and organization. In adults, ADHD can result in issues with time management, occupational performance, and maintaining relationships. Conventional diagnostic methods often depend on subjective evaluations and standardized surveys, which may result in inconsistencies and incorrect diagnoses (3).
Given these challenges, integrating DL into the diagnosis of ADHD represents a significant advancement, facilitating objective, data-driven approaches that enhance diagnostic precision and clinical decision-making (4). Existing diagnostic strategies encounter difficulties in adequately capturing the broad spectrum of ADHD symptoms, underscoring the necessity for more accurate and tailored methodologies (Sathiya et al., 2024).
Convolutional Neural Networks (CNNs)-based specialized models excel at image recognition tasks. In the case of ADHD, researchers can employ CNN to analyze brain-imaging data, such as functional MRI scans (5). By learning hierarchical features from raw pixel values, CNN automatically detect relevant brain structures and abnormalities associated with ADHD (6). The use of Deep Learning (DL) technology marks the beginning of a new era in neuropsychiatric evaluations. DL models can scrutinize extensive datasets, detecting complex patterns that elude human perception. By utilizing CNN, healthcare providers and researchers may overcome enduring obstacles in ADHD care. These sophisticated analytical instruments enhance diagnostic precision, potentially forecasting the efficacy of therapeutic interventions. As technology and healthcare intersect, we can envision a future where personalized ADHD assessments better cater to those impacted by this condition (7).
The updated 2D block diagram illustrates the central role of Convolutional Neural Networks (CNN) in the detection and analysis of ADHD (Attention Deficit Hyperactivity Disorder) by integrating various data sources and influences. The central circle represents the CNN, surrounded by peripheral circles labeled Voice Signals, Brain Images, Behavioral Data, EEG Signals, fMRI Data, Genetic Data, External Influences, and ADHD Init. Each peripheral circle is connected to the CNN, indicating the flow of data from these sources into the neural network for comprehensive analysis. Voice Signals provide insights into behavioral and neurological patterns, while Brain Images and fMRI Data reveal structural and functional brain abnormalities. Behavioral Data and EEG Signals offer information on symptom severity and brain activity, respectively. Genetic Data highlights hereditary aspects, and External Influences encompass prenatal factors, early childhood environment, family dynamics, educational and social settings, and socioeconomic status. The ADHD Init represents the initial diagnosis, serving as the starting point for further analysis. This integration allows the CNN to provide a detailed and accurate detection of ADHD by analyzing multiple dimensions of data, emphasizing a multi-faceted approach to improving diagnostic accuracy and understanding the disorder.
2 Role of CNN in ADHD diagnosis
CNN, a form of deep learning, have demonstrated impressive capabilities in image processing. Through convolution operations, CNN detect and extract local features from images, combining these to form higher-level features. This ability makes CNN particularly effective for classifying images by extracting pixel values and their feature vectors, enhancing the network’s comprehension and resulting in precise classification. Consequently, researchers increasingly utilize CNN to investigate brain, mental, and neurological disorders (8).
In studying ADHD, CNN are favored for their exceptional ability to handle spatial hierarchies and extract relevant features from neuroimaging data like MRI or fMRI scans (9). CNN can effectively capture spatial relationships within data, crucial for brain scans where the positioning and interaction of different brain regions are significant (10). The convolutional layers in CNN automatically learn to detect important features such as edges, textures, and patterns, enabling the identification of abnormalities or patterns associated with ADHD (11). Moreover, CNN are computationally efficient, using parameter sharing and pooling layers to reduce the number of parameters and the dimensionality of the data. This reduction in complexity allows CNN to handle high-dimensional data, such as 3D brain scans, more efficiently than fully connected networks (12).
The robustness of CNN to variations, such as shifts and distortions in input data, adds to their suitability for neuroimaging studies. This robustness is crucial when dealing with biological data with inherent variability. CNN have a proven track record of superior performance in a wide range of image-based tasks, from object detection to medical image analysis, which translates well to neuroimaging tasks (13).
Advanced CNN techniques, such as 3D CNN, are particularly effective for volumetric data like MRI, as they can capture the 3D spatial structure of the brain more accurately than traditional 2D CNN or other deep learning models. Transfer learning further enhances CNN performance by leveraging pre-trained models on large image datasets, which can be fine-tuned for specific ADHD-related tasks, requiring less training data while achieving high accuracy (14). Additionally, CNN offer interpretable visualizations through techniques like Grad-CAM (Gradient-weighted Class Activation Mapping), allowing researchers to see which parts of the brain contribute most to the network’s decisions. This interpretability aids in understanding and trusting the results, making CNN a preferred choice in ADHD research over other deep learning algorithms (15).
The study (16) utilizes EEG signals from children with ADHD and healthy peers, recorded during a task. After pre-processing, the data is segmented, and frequency features are extracted and fed into a CNN. The Layer-wise Relevance Propagation (LRP) algorithm is used to identify and select the most relevant channels for classification. The proposed method achieved an accuracy of 94.52% for validation data. The study demonstrates that the method can effectively diagnose ADHD and provides insights into the importance of specific brain regions and frequency bands, particularly the gamma II band in the frontal and central regions, which are significant for higher-order neurocognitive processes.
The paper introduces the Frequency-Integrated Visual-Language Network (FIVLNet), a DL framework designed to enhance diagnostic accuracy for ADHD using MRI scans. Traditional DL methods often fail to capture the sequential dependencies and complex structural details of MRI images, leading to lower classification accuracy. FIVLNet addresses this by combining high and low-frequency data from MRI images through a CNN and cross-attention mechanism, resulting in more comprehensive image representations. Additionally, it incorporates textual embeddings from Contrastive Language-Image Pre-training (CLIP) to enrich the model’s learning capacity. Despite these enhancements, FIVLNet maintains a lightweight architecture with fewer learnable parameters compared to existing models. FIVLNet achieves an accuracy of 93.89% on fMRI data (17).
A study by Dubreuil-Vall, Ruffini, and Camprodon demonstrated that DL CNN can effectively differentiate between adults with ADHD and healthy individuals using event-related spectral EEG data. The CNN model, trained on spectrograms from the Flanker Task, achieved an 88% classification accuracy, outperforming traditional neural networks. The key findings indicated decreased alpha band power and increased delta-theta band power in ADHD patients, highlighting potential biomarkers for ADHD diagnosis. This research underscores the promise of DL in developing clinically useful diagnostic tools (18).
The article (19) discusses the implementation of a DL model for detecting ADHD using ECG signals. The proposed model utilizes a one-dimensional CNN comprising convolutional, pooling, and fully connected layers. It employs techniques like dropout, ReLU activation, and L2 kernel regularization to mitigate overfitting. The model was trained with the Adam optimizer and utilized weighted loss to address data imbalance. The model achieved significant feature reduction and accurate classification, highlighting the potential of ECG-based DL methods in ADHD diagnosis.
The research article (20) explores the application of CNN to classify ADHD in children using functional Magnetic Resonance Imaging (fMRI) data. The study tests three models—Nadam, SGDM, and a proposed CNN—on fMRI datasets, finding that the proposed CNN achieves the highest accuracy at 98.77%. This superior performance underscores the potential of CNN in providing accurate ADHD diagnosis, suggesting that DL techniques can be effectively utilized for early and precise detection of ADHD, thus aiding in timely intervention and treatment.
The researchers zou et al. in (21) developed a 3D CNN based DL model for classifying ADHD using MRI scans. They extracted 3D low-level features from fMRI and sMRI scans and designed a multimodality CNN architecture to combine them. This approach yielded an accuracy of 69.15%.
The work proposed in (22) utilized the ADHD-200 dataset to diagnose ADHD. They trained a deep multimodal 3D CNN from features obtained from gray matter and fALFF from fMRI. Then output scores were classified with KNN, SVM and LDA algorithms. LDA showed better results among the three classifiers, with a classification accuracy of 74.93%.
The paper (23) presents a novel ADHD classification method combining Convolutional Denoising Autoencoders (CDAE) and Random Forest (RF) algorithms, demonstrating superior performance on the ADHD-200 dataset. The proposed approach extracts deep spatio-temporal features from fMRI data, achieving higher accuracy (75.64%), sensitivity (76.922%), and specificity (73.08%) compared to traditional methods like MKL, MDA-SVM, and 3D-CNN. The research highlights the effectiveness of ensemble learning and grid search optimization for hyperparameter tuning. Future work aims to expand datasets, explore additional feature extraction techniques, and enhance model interpretability to further improve ADHD classification and facilitate clinical applications.
This study (24) introduces the RBP-CNN model, a convolutional neural network designed for precise brain tumor classification in medical imaging. It incorporates regional binary patterns (RBP) and Gray Standard Normalization (GSN) preprocessing to address challenges in extracting image noise and texture features. The model achieves a classification accuracy of 96% with a 7% false classification rate on a dataset of 3000 samples. RBP-CNN’s novel approach and superior performance make it a potential state-of-the-art tool for medical image analysis, demonstrating robustness and scalability on the FigShare dataset. This research provides a new methodology for future exploration in hyperspectral image applications.
A comprehensive search was conducted for English articles on sMRI or/and fMRI-based machine learning techniques for diagnosing ADHD until March 2024. Diagnostic value was assessed by calculating pooled sensitivity, specificity, positive and negative likelihood ratios, and area under the curve (AUC). Heterogeneity was examined using the I2 test and meta-regression analysis, while publication bias was assessed with the Deeks funnel plot asymmetry test. The systematic review included 43 studies, with 27 included in the meta-analysis. The pooled sensitivity and specificity of sMRI or/and fMRI-based ML techniques were 0.74 and 0.75, respectively. The AUC was 0.81, indicating relatively good diagnostic value for ADHD. However, the meta-analysis focused solely on sMRI or/and fMRI-based ML techniques, excluding EEG-based methods, suggesting the need for further analyses on multimodal medical data. In conclusion, sMRI or/and fMRI-based ML techniques show promise as objective diagnostic methods for ADHD (25). The insights, methods used, results, and limitations of recent studies are summarized in the following Table 1.
DL approaches have brought significant advancements in the diagnosis and management of ADHD, marking a substantial improvement over traditional methods. This study focuses on advanced CNN to provide a detailed analysis of their application in ADHD detection. CNN analyze various data sources, such as behavioral patterns, identifying complex features associated with ADHD. For instance, methods utilizing EEG signals have achieved 94.52% accuracy, while those employing fMRI data have reported up to 98.77% accuracy, showcasing the effectiveness of CNN in handling diverse data types (16) (20). Integrating these approaches can enhance diagnostic precision and develop customized treatments, improving efficacy. This study highlights the revolutionary potential of CNN in ADHD diagnosis and care, opening the path for more effective and personalized treatment.
3 Future directions
Future research directions encompass the development of AI models that are explainable, the integration of real-time monitoring tools, and the expansion of collaborative networks for data exchange and validation. Moreover, it is imperative to establish ethical frameworks and regulatory requirements to ensure the responsible utilization of DL technology in clinical settings. Cutting-edge methodologies, like meta-learning and model distillation, present promise in improving model interpretability and transparency, thereby promoting trust and acceptance among healthcare providers and patients.
To address these challenges, future research should focus on the following areas:
1. Combining Neuroimaging with Behavioral Data: Investigate the integration of MRI/fMRI scans with behavioral data to enhance diagnostic accuracy. This could involve developing models that can process and learn from both image and text data simultaneously.
2. Incorporation of EEG and ECG Data: Expand the research to include other physiological data such as EEG and ECG, which can provide complementary information about brain activity and cardiac function, respectively.
3. Exploration of 3D CNN: Further explore the use of 3D CNN for volumetric neuroimaging data to capture the 3D spatial structure of the brain more accurately.
4. Hybrid Models: Develop hybrid models that combine CNN with other DL architectures like LSTM or transformers to capture temporal dynamics and sequential dependencies in neuroimaging data.
5. Prediction of Treatment Outcomes: Use CNN to predict the efficacy of various therapeutic interventions based on individual neuroimaging and behavioral profiles. This can help in creating personalized treatment plans for ADHD patients.
6. Longitudinal Studies: Conduct longitudinal studies to track changes in brain patterns over time with different treatments, helping to refine and personalize therapeutic approaches.
7. Focus on Explainability: Develop methods to enhance the interpretability of CNN models, such as using techniques like Grad-CAM to visualize which brain regions contribute most to the diagnosis. This can help clinicians trust and understand the model’s decisions.
8. Deployment in Clinical Settings: Work on translating these advanced CNN models into practical tools that can be used in clinical settings. This involves addressing challenges related to scalability, real-time processing, and integration with existing healthcare systems.
These future directions aim to enhance the precision, reliability, and applicability of CNN-based approaches in diagnosing and treating ADHD, ultimately improving patient outcomes and advancing the field of neuropsychiatric research.
4 Conclusion
This study demonstrates the transformative potential of CNNs in ADHD diagnosis and treatment, offering significant improvements over traditional methods. CNNs provide an objective, data-driven approach that enhances diagnostic precision and clinical decision-making by analyzing complex neurobiological features from various data sources. Future research should focus on integrating multimodal data and developing personalized treatment plans while ensuring ethical considerations and explainable AI models. In conclusion, CNNs represent a paradigm shift in ADHD care, paving the way for more precise, personalized, and effective treatments, with continuous research promising significant improvements in patient quality of life.
Author contributions
VK: Writing – original draft, Writing – review & editing. BN: Conceptualization, Writing – original draft. SP: Data curation, Writing – review & editing. KP: Methodology, Writing – review & editing. SV: Investigation, Writing – original draft.
Funding
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Wender PH. ADHD: Attention-deficit hyperactivity disorder in children, adolescents, and adults. New York, NY, USA: Oxford University Press (2001).
3. Peterson BS, Trampush J, Brown M, Maglione M, Bolshakova M, Rozelle M, et al. Tools for the diagnosis of ADHD in children and adolescents: a systematic review. Pediatrics. (2024) 153:e2024065854. doi: 10.1542/peds.2024-065854
4. Sathiya E, Rao TD, Kumar TS. Gabor filter-based statistical features for ADHD detection. Front Hum Neurosci. (2024) 18:1369862. doi: 10.3389/fnhum.2024.1369862
5. Pereira-Sanchez V, Castellanos FX. Neuroimaging in attention-deficit/hyperactivity disorder. Curr Opin Psychiatry. (2021) 34:105–11. doi: 10.1097/YCO.0000000000000669
6. Agarwal S, Raj A, Chowdhury A, Aich G, Chatterjee R, Ghosh K. Investigating the impact of standard brain atlases and connectivity measures on the accuracy of ADHD detection from fMRI data using DL. Multimed Tools Appl. (2024) 83, 67023–67057. doi: 10.1007/s11042-023-17962-7
7. Uyulan C, Erguzel TT, Turk O, Farhad S, Metin B, Tarhan N. A class activation map-based interpretable transfer learning model for automated detection of ADHD from fMRI data. Clin EEG Neurosci. (2023) 54:151–9. doi: 10.1177/15500594221122699
8. Kim B, Park J, Kim T, Kwon Y. Finding essential parts of the brain in rs-fMRI can improve ADHD diagnosis using DL. IEEE Access. (2023) 11, 116065–116075. doi: 10.1109/ACCESS.2023.3324670
9. Shokair M, El-Samie A. Detection attention deficit hyperactivity disorder by using convolution neural network. Int J Telecommun. (2023) 3:1–11. doi: 10.21608/ijt.2023.315782
10. Wang D, Hong D, Wu Q. Attention deficit hyperactivity disorder classification based on DL. IEEE/ACM Trans Comput Biol Bioinf. (2022) 20:1581–6. doi: 10.1109/TCBB.2022.3170527
11. Yin W, Li L, Wu FX. DL for brain disorder diagnosis based on fMRI images. Neurocomputing. (2022) 469:332–45. doi: 10.1016/j.neucom.2020.05.113
12. He Y, Wang C, Wang X, Zhu M, Chen S, Li G. "Brain network connectivity analysis of different ADHD groups based on CNN-LSTM classification model". In: Liu H., et al. Intelligent robotics and applications. ICIRA 2022. Lecture Notes in Computer Science. Switzerland, Yumda: Springer, Cham (2022) 13456, p. 626–35. doi: 10.1007/978-3-031-13822-5_56
13. Ahmadi A, Kashefi M, Shahrokhi H, Nazari MA. Computer aided diagnosis system using deep convolutional neural networks for ADHD subtypes. Biomed Signal Process Control. (2021) 63:102227. doi: 10.1016/j.bspc.2020.102227
14. De Silva S, Dayarathna SU, Ariyarathne G, Meedeniya D, Jayarathna S. fMRI feature extraction model for ADHD classification using convolutional neural network. Int J E-Health Med Commun (IJEHMC). (2021) 12:81–105. doi: 10.4018/IJEHMC
15. Luo S, Meng X, Niu X, Kong H. (2024). Revolutionising ADHD diagnosis: deep learning in 3D medical imaging In Proceedings of the Fourth International Conference on Signal Processing and Machine Learning (CONF-SPML 2024). (SPIE). 13077, 130770E. doi: 10.1117/12.3027125
16. SPIE, Nouri A, Tabanfar Z. Detection of ADHD disorder in children using layer-wise relevance propagation and convolutional neural network: an EEG analysis. Front Biomed Technol. (2024) 11:14–21. doi: 10.18502/fbt.v11i1.14507
17. Hu R, Zhu K, Hou Z, Wang R, Liu F. Enhanced ADHD detection: Frequency information embedded in a visual-language framework. Displays. (2024) 83:102712. doi: 10.1016/j.displa.2024.102712
18. Dubreuil-Vall L, Ruffini G, Camprodon JA. DL convolutional neural networks discriminate adult ADHD from healthy individuals on the basis of event-related spectral EEG. Front Neurosci. (2020) 14:251. doi: 10.3389/fnins.2020.00251
19. Li R, Cao J, Li N. Stabilization control of quaternion-valued fractional-order discrete-time memristive neural networks. Neurocomputing. (2023) 542:126255. doi: 10.1016/j.neucom.2023.126255
20. Salah E, Shokair M, Abd El-Samie FE, Shalaby WA. (2023). “Utilization of Deep Learning to Overcome the Effect of ADHD on Children,” 2023 3rd International Conference on Electronic Engineering (ICEEM). (Egypt: Menouf). 2023, 1–5. doi: 10.1109/ICEEM58740.2023.10319577
21. Zou L, Zheng J, Miao C, Mckeown MJ, Wang ZJ. 3D CNN based automatic diagnosis of attention deficit hyperactivity disorder using functional and structural MRI. IEEE Access. (2017) 5:23626–36. doi: 10.1109/ACCESS.2017.2762703
22. Abdolmaleki S, Abadeh MS. “Brain MR Image Classification for ADHD Diagnosis Using Deep Neural Networks,” 2020 International Conference on Machine Vision and Image Processing (MVIP). (Iran). (2020) 2020:1–5. doi: 10.1109/MVIP49855.2020.9116877
23. Liu S, Zhao L, Wang X, Xin Q, Zhao J, Guttery DS, et al. Deep spatio-temporal representation and ensemble classification for attention deficit/hyperactivity disorder. IEEE Trans Neural Syst Rehabil Eng. (2020) 29:1–10. doi: 10.1109/TNSRE.7333
24. Ramalakshmi K, Rajagopal S, Kulkarni MB, Poddar H. A hyperdimensional framework: Unveiling the interplay of RBP and GSN within CNN for ultra-precise brain tumor classification. Biomed Signal Process Control. (2024) 96:106565. doi: 10.1016/j.bspc.2024.106565
25. Tian L, Zheng H, Zhang K, Qiu J, Song X, Li S, et al. Structural or/and functional MRI-based machine learning techniques for attention-deficit/hyperactivity disorder diagnosis: A systematic review and meta-analysis. J Affect Disord. (2024) 355:459–69. doi: 10.1016/j.jad.2024.03.111
26. Rohini BR, Shoaib K, Yogish HK. A review on machine learning approaches in diagnosis of ADHD based on big data. Big Data Comput. (2024), 281–97.
Keywords: deep learning, attention deficit hyperactivity disorder, convolutional neural networks, behavioral patterns, complex neurobiological features, electroencephalogram (EEG), magnetic resonance imaging (MRI)
Citation: Kulkarni V, Nemade B, Patel S, Patel K and Velpula S (2024) A short report on ADHD detection using convolutional neural networks. Front. Psychiatry 15:1426155. doi: 10.3389/fpsyt.2024.1426155
Received: 02 May 2024; Accepted: 08 August 2024;
Published: 05 September 2024.
Edited by:
Kandala N. V. P. S. Rajesh, VIT-AP University, IndiaReviewed by:
Shishir Maheshwari, Thapar Institute of Engineering & Technology, IndiaAbirami S P, Coimbatore Institute of Technology, India
Copyright © 2024 Kulkarni, Nemade, Patel, Patel and Velpula. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Vikram Kulkarni, vikram.kulkarni@nmims.edu