Submitted:
26 May 2026
Posted:
27 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Dataset and Preprocessing
2.2. EEGNet-Based Feature Embedding
2.3. Transformer-Based Contextual Encoding
2.4. Attention Pooling and Classification Head
2.5. Training Strategy and Evaluation Protocol
3. Results and Discussion
3.1. Model Setup and Hyperparameter Tuning
3.2. Model Interpretability and Ablation Analysis
3.3. Performance and Significance Analysis
4. Concluding Remarks and Future Work
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Pino, A.; Papatheodorou, N.; Kouroupetroglou, G.; Giannopoulos, P.A.; Makris, G.; Papageorgiou, C. Hand Dexterity Evaluation Grounded on Cursor Trajectory Investigation in Children with ADHD Using a Mouse and a Joystick. Technologies 2025, 13. [Google Scholar] [CrossRef]
- Salari, N.; Ghasemi, H.; Abdoli, N.; Rahmani, A.; Shiri, M.H.; Hashemian, A.H.; Akbari, H.; Mohammadi, M. The Global Prevalence of ADHD in Children and Adolescents: A Systematic Review and Meta-Analysis. Ital. J. Pediatr. 2023, 49, 48. [Google Scholar] [CrossRef] [PubMed]
- Faraone, S.V.; Banaschewski, T.; Coghill, D.; Zheng, Y.; Biederman, J.; Bellgrove, M.A.; Newcorn, J.H.; Gignac, M.; Al Saud, N.M.; Manor, I.; et al. The World Federation of ADHD International Consensus Statement: 208 Evidence-Based Conclusions About the Disorder. Neurosci. Biobehav. Rev. 2021, 128, 789–818. [Google Scholar] [CrossRef]
- Dhiabi, R.; Walha, R.; Drira, F. A Neural Approach to ADHD Detection in Children: Enhanced EEG Analysis with Wavelet-Transformer Synergy. In Proceedings of the Proceedings of the 18th International Conference on Agents and Artificial Intelligence (ICAART 2026) - Volume 3. SCITEPRESS, 2026; pp. 2899–2906. [Google Scholar] [CrossRef]
- Peterson, B.S.; Trampush, J.; Brown, M.; Maglione, M.; Bolshakova, M.; Rozelle, M.; Miles, J.; Pakdaman, S.; Yagyu, S.; Motala, A.; et al. Tools for the Diagnosis of ADHD in Children and Adolescents: A Systematic Review. Pediatrics 2024, 153, e2024065854. [Google Scholar] [CrossRef]
- Knyazhansky, M.; Shrot, T. ADHD Diagnostic Tools Across Ages: Traditional and Digital Approaches. Front. Psychiatry 2025, 16, 1668070. [Google Scholar] [CrossRef] [PubMed]
- Chan, H.K.; Rowe, R.; Carroll, D. Factors Associated with Parent-Teacher Hyperactivity/Inattention Screening Discrepancy: Findings from a UK National Sample. PLoS ONE 2024, 19, e0299980. [Google Scholar] [CrossRef]
- Gondek, T. Diagnosing ADHD in adults: Diagnostic tools and differential diagnosis. Eur. Psychiatry 2021, 64, S72–S73. [Google Scholar] [CrossRef]
- Fallahpour, B.; Dastjerdi, G.; Akbarian, E.; Emarati, A.; Sadr, Z.; Dastgheib, S.A.; Shahbazi, A.; Bahrami, R.; Golshan-Tafti, M.; Shiri, A.; et al. Artificial Intelligence in ADHD Diagnosis: A Comprehensive Review of Machine Learning Applications, Clinical Validation Challenges, and Implementation Barriers in Precision Medicine. Egypt. Pediatr. Assoc. Gaz. 2026, 74, 36. [Google Scholar] [CrossRef]
- Bian, J.; Liu, X.; Wang, C. Executive Function and Brain Region Development in ADHD: Mechanisms and Interventions in the Prefrontal Cortex and Related Circuits. Adv. Precis. Med. 2025, 10, 15–21. [Google Scholar] [CrossRef]
- Wang, C.; Wang, S.; Sun, L.; Sui, J. Abnormal MRI Features in Children with ADHD: A Narrative Review of Large-Scale Studies. Brain Sci. 2026, 16, 104. [Google Scholar] [CrossRef]
- Tian, L.; Zheng, H.; Zhang, K.; Qiu, J.; Song, X.; Li, S.; Zeng, Z.; Ran, B.; Deng, X.; Cai, J. Structural or/and Functional MRI-Based Machine Learning Techniques for Attention-Deficit/Hyperactivity Disorder Diagnosis: A Systematic Review and Meta-Analysis. J. Affect. Disord. 2024, 355, 459–469. [Google Scholar] [CrossRef]
- Quintero López, C.; Gil Vera, V.D.; Ruiz Quintero, M.J. Diagnosis of ADHD in Children with EEG and Machine Learning: Systematic Review and Meta-Analysis. Clínica Salud 2025, 36, 109–121. [Google Scholar] [CrossRef]
- Mao, Y.; Qi, X.; He, L.; Wang, S.; Wang, Z.; Wang, F. Advanced Machine Learning Techniques Reveal Multidimensional EEG Abnormalities in Children with ADHD: A Framework for Automatic Diagnosis. Front. Psychiatry 2025, 16, 1475936. [Google Scholar] [CrossRef] [PubMed]
- López, C.Q.; Vera, V.D.G.; Quintero, M.J.R. Diagnosis of ADHD in children with EEG and machine learning: Systematic review and meta-analysis. Clin. Health 2025, 36, 109–121. [Google Scholar] [CrossRef]
- Kim, J.W.; Kim, B.N.; Kim, J.I.; Yang, C.M.; Kwon, J. Electroencephalogram (EEG) Based Prediction of Attention Deficit Hyperactivity Disorder (ADHD) Using Machine Learning. Neuropsychiatr. Dis. Treat. 2025, 21, 271–279. [Google Scholar] [CrossRef]
- Maya-Piedrahita, M.C.; Herrera-Gomez, P.M.; Berrio-Mesa, L.; Cardenas-Pena, D.A.; Orozco-Gutierrez, A.A. Supported Diagnosis of Attention Deficit and Hyperactivity Disorder from EEG Based on Interpretable Kernels for Hidden Markov Models. Int. J. Neural Syst. 2022, 32, 2250008. [Google Scholar] [CrossRef]
- Belhadi, A.; Yazidi, A.; Lind, P.G.; Djenouri, Y. EEG Data Classification: Review and Taxonomy. ACM Trans. Comput. Healthc. 2025, 6, 45. [Google Scholar] [CrossRef]
- Lyu, R. Deep Learning Approaches for EEG-Based Healthcare Applications: A Comprehensive Review. Front. Hum. Neurosci. 2026, 19, 1689073. [Google Scholar] [CrossRef]
- Schirrmeister, R.T.; Springenberg, J.T.; Fiederer, L.D.J.; Glasstetter, M.; Eggensperger, K.; Tangermann, M.; Hutter, F.; Burgard, W.; Ball, T. Deep Learning with Convolutional Neural Networks for EEG Decoding and Visualization. Hum. Brain Mapp. 2017, 38, 5391–5420. [Google Scholar] [CrossRef]
- Lawhern, V.J.; Solon, A.J.; Waytowich, N.R.; Gordon, S.M.; Hung, C.P.; Lance, B.J. EEGNet: A Compact Convolutional Neural Network for EEG-Based Brain–Computer Interfaces. J. Neural Eng. 2018, 15, 056013. [Google Scholar] [CrossRef] [PubMed]
- Song, Y.; Zheng, Q.; Liu, B.; Gao, X. EEG Conformer: Convolutional Transformer for EEG Decoding and Visualization. IEEE Trans. Neural Syst. Rehabil. Eng. 2023, 31, 710–719. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention Is All You Need. Proc. Adv. Neural Inf. Process. Syst. 2017, Vol. 30. [Google Scholar] [CrossRef]
- Zhao, W.; Jiang, X.; Zhang, B.; Xiao, S.; Weng, S. CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification. Sci. Rep. 2024, 14, 20237. [Google Scholar] [CrossRef] [PubMed]
- Nasrabadi, A.M.; Allahverdy, A.; Samavati, M.; Mohammadi, M.R. EEG data for ADHD/control children. IEEE DataPort 2020. [Google Scholar] [CrossRef]
- Ekhlasi, A.; Motie Nasrabadi, A.; Mohammadi, M. Analysis of Effective Connectivity Strength in Children with Attention Deficit Hyperactivity Disorder Using Phase Transfer Entropy. Iran. J. Psychiatry 2021, 16, 374–382. [Google Scholar] [CrossRef] [PubMed]
- Li, L.; Guo, X.; Yang, Z.; Zhao, Y.; Liu, X.; Yang, J.; Chen, Y.; Peng, X.; Han, L. ADHD Detection from EEG Signals Using GCN Based on Multi-Domain Features. Front. Neurosci. 2025, 19, 1561994. [Google Scholar] [CrossRef] [PubMed]
- Attallah, O. ADHD-AID: Aiding Tool for Detecting Children’s Attention Deficit Hyperactivity Disorder via EEG-Based Multi-Resolution Analysis and Feature Selection. Biomimetics 2024, 9, 188. [Google Scholar] [CrossRef]
- Jovanović, V.; Petrušić, I.; Ković, V.; Savić, A.M. The Practical Implications of Re-Referencing in ERP Studies: The Case of N400 in the Picture–Word Verification Task. Diagnostics 2025, 15, 156. [Google Scholar] [CrossRef]
- Kaur, G.; Aggarwal, H.; Goel, N. Artificial Intelligence Driven Neuropsychiatry: A Systematic Review of Electroencephalography-Based Computational Techniques for Major Depressive Disorder Prediction. Neuroscience 2025, 581, 179–207. [Google Scholar] [CrossRef]
- Ilse, M.; Tomczak, J.; Welling, M. Attention-based Deep Multiple Instance Learning. In Proceedings of the Proceedings of the 35th International Conference on Machine Learning; Dy, J., Krause, A., Eds.; PMLR; 10–15 Jul 2018; Vol. 80, Proceedings of Machine Learning Research; pp. 2127–2136. [Google Scholar]
- Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed.; Springer: New York, 2009. [Google Scholar]
- Liu, K.; Yang, M.; Yu, Z.; Wang, G.; Wu, W. FBMSNet: A Filter-Bank Multi-Scale Convolutional Neural Network for EEG-Based Motor Imagery Decoding. IEEE Trans. Biomed. Eng. 2023, 70, 436–445. [Google Scholar] [CrossRef]
- Salami, A.; Andreu-Perez, J.; Gillmeister, H. EEG-ITNet: An Explainable Inception Temporal Convolutional Network for Motor Imagery Classification. IEEE Access 2022, 10, 36672–36685. [Google Scholar] [CrossRef]
- Zhang, S.; Yu, S.; Cui, X.; Liang, L.; Li, X. Neural Oscillation Features of ADHD Symptoms in Children: EEG Evidence From Resting State and Oddball Task. J. Atten. Disord. 2026, 30, 552–565. [Google Scholar] [CrossRef]
- Jain, S.; Wallace, B.C. Attention is not Explanation. In Proceedings of the Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, Minnesota, 2019; Vol. 1, pp. 3543–3556. [Google Scholar] [CrossRef]
- Wiegreffe, S.; Pinter, Y. Attention is not not Explanation. In Proceedings of the Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Hong Kong, China, 2019; pp. 11–20. [Google Scholar] [CrossRef]
- van der Maaten, L.; Hinton, G. Visualizing Data using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]
- Kapoor, S.; Narayanan, A. Leakage and the Reproducibility Crisis in Machine-Learning-Based Science. Patterns 2023, 4, 100804. [Google Scholar] [CrossRef]
- Wang, C.; Wang, X.; Jing, X.; Yokoi, H.; Huang, W.; Zhu, M.; Chen, S.; Li, G. Towards High-Accuracy Classifying Attention-Deficit/Hyperactivity Disorders Using CNN-LSTM Model. J. Neural Eng. 2022, 19, 046015. [Google Scholar] [CrossRef] [PubMed]
- Alim, A.; Imtiaz, M.H. Automatic Identification of Children with ADHD from EEG Brain Waves. Signals 2023, 4, 193–205. [Google Scholar] [CrossRef]
- Alsharif, N.; Al-Adhaileh, M.H.; Al-Yaari, M. Diagnosis of attention deficit hyperactivity disorder: A deep learning approach. AIMS Math. 2024, 9, 10580–10608. [Google Scholar] [CrossRef]
- Salazar-Dubois, D.V.; Álvarez-Meza, A.M.; Castellanos-Dominguez, G. T-GARNet: A Transformer and Multi-Scale Gaussian Kernel Connectivity Network with Alpha-Rényi Regularization for EEG-Based ADHD Detection. Mathematics 2025, 13, 4026. [Google Scholar] [CrossRef]
- Rainio, O.; Teuho, J.; Klén, R. Evaluation Metrics and Statistical Tests for Machine Learning. Sci. Rep. 2024, 14, 6086. [Google Scholar] [CrossRef]
- Pugnana, A.; Perini, L.; Davis, J.; Ruggieri, S. Deep Neural Network Benchmarks for Selective Classification. J. Data-Centric Mach. Learn. Res. 2024, 1, 1–58. [Google Scholar]
- Tay, Y.; Dehghani, M.; Bahri, D.; Metzler, D. Efficient Transformers: A Survey. ACM Comput. Surv. 2022, 55, 1–28. [Google Scholar] [CrossRef]
- Kuruppu, G.; Wagh, N.; Kremen, V.; Varatharajah, Y. EEG Foundation Models: A Critical Review of Current Progress and Future Directions. J. Neural Eng. 2026, 23, 021001. [Google Scholar] [CrossRef]
- Gawlikowski, J.; Tassi, C.R.N.; Ali, M.; Lee, J.; Humt, M.; Feng, J.; Kruspe, A.; Triebel, R.; Jung, P.; Roscher, R.; et al. A Survey of Uncertainty in Deep Neural Networks. Artif. Intell. Rev. 2023, 56, 1513–1589. [Google Scholar] [CrossRef]
left frontal,
right frontal,
midline (Fz, Cz, Pz),
left central-temporal,
right central-temporal,
left posterior, and
right posterior.
left frontal,
right frontal,
midline (Fz, Cz, Pz),
left central-temporal,
right central-temporal,
left posterior, and
right posterior.





| Layer | Variable | Output Shape | Hyperparameter | Setting |
|---|---|---|---|---|
| Feature Extractor | ||||
| Input epoch | C, T | 19, 256 | ||
| Reshape | – | – | ||
| Temporal Conv | , kernel | 8, | ||
| Batch Norm | – | – | ||
| Depthwise Conv | D, kernel | 3, | ||
| Batch Norm | Activation | ELU | ||
| Average Pooling | 4 | |||
| Spatial Dropout | Dropout | |||
| Separable Conv | , kernel | 48, | ||
| Batch Norm | Activation | ELU | ||
| Average Pooling | 8 | |||
| Spatial Dropout | Dropout | |||
| Transformer Encoder | ||||
| Reshape | , d | 8, 48 | ||
| Transformer Encoder | L, M, , Dropout | 1, 2, 2, | ||
| Classification Head | ||||
| Attention Pooling | d | – | – | |
| Dropout | d | Dropout | ||
| Dense classifier | 1 | Activation | Sigmoid | |
| Architecture | Performance (%) | ||||
|---|---|---|---|---|---|
| Configuration | Transformer | Pooling | Accuracy | Recall | Precision |
| EEGNet-like Baseline | ✗ | Flatten | |||
| Baseline + Attention Pooling | ✗ | Attention | |||
| Baseline + Transformer | ✔ | Global Average | |||
| EEG-TACT (Proposal) | ✔ | Attention | |||
| Model | EEGNet | ShallowConvNet | CNN-LSTM | Multi-Stream | IM-CBGT | T-GARNet | EEG-TACT |
|---|---|---|---|---|---|---|---|
| Year | 2017 | 2017 | 2022 | 2023 | 2024 | 2025 | 2026 |
| Parameters | 1,746 | 36,522 | 9,052 | 574,082 | 1,195,266 | 9,071 | 7,322 |
| Varying seed (average over 50 folds) | |||||||
| Accuracy (%) | |||||||
| Recall (%) | |||||||
| Precision (%) | |||||||
| Average Rank | |||||||
| Best seed (average over 5 folds) | |||||||
| Accuracy (%) | |||||||
| Recall (%) | |||||||
| Precision (%) | |||||||
| Odds ratio | – | ||||||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.