Submitted:
13 June 2026
Posted:
16 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Propose an end-to-end EEG-language model alignment framework that achieves mapping from underlying neuropsychological signals to high-level semantic logical representations.This contribution focuses on constrained evidence-grounded alignment rather than pure classification optimization.
- Design a collaborative XAI mechanism that addresses the limitations of traditional post-hoc explanation by embedding interpretable feature information as prior knowledge directly into the question-answering generation process of the LLM.This mechanism aims to improve explanation transparency by linking model outputs with EEG channels, frequency-band evidence, heatmap patterns, and surrogate decision rules.
- Construct a structured QA dataset that provides a solid data foundation for instruction fine-tuning and training of subsequent LLMs in the continuous physiological signal domain. The QA formulation enables the model to generate mental-state predictions together with structured reasoning and interpretable evidence, while classification performance is treated as one evaluation aspect of the overall framework.
2. Related Work
3. Background and Preliminaries
3.1. EEG Signals and Mental States
3.2. LLM Tuning
3.3. Explainable AI
4. Proposed Method
4.1. Proposed Overall Framework
4.2. EEG Data Collection and Processing
4.3. QA Formulation Process
4.4. QA-Tuning Process
4.5. Explanation Process
5. Experimental Design and Results
5.1. Experimental Setup
5.2. Heatmap Explanation
5.3. Decision Tree Explanation
5.4. Comparisons with Baseline Classification and Explanation Methods
5.5. Analysis and Discussion
6. Conclusions
References
- Kuriyakose, D.; et al. Explainable AI uncovers novel EEG microstate candidate neurophysiological markers for autism spectrum disorder. Front. Comput. Neurosci. 2026, 20, 1763727. [Google Scholar] [CrossRef] [PubMed]
- Torres, J.M.M.; Medina-DeVilliers, S.; Clarkson, T.; Lerner, M.D.; Riccardi, G. Evaluation of interpretability for deep learning algorithms in EEG emotion recognition: A case study in autism. Artif. Intell. Med. 2023, 143, 102545. [Google Scholar] [CrossRef] [PubMed]
- Rehman, A.; Mun, S. Explainable AI-Enhanced Ensemble Protocol Using Gradient-Boosted Models for Zero-False-Alarm Seizure Detection from EEG. Sensors 2026, 26, 863. [Google Scholar] [CrossRef] [PubMed]
- Zhai, L.; Zhao, M.; Zhang, J.; Jamil, M.; Naz, R.; Li, C. A systematic review of EEG-based biomarkers for depression, anxiety, and bipolar disorder: trends in explainable artificial intelligence (XAI). BMC Psychiatry 2025. [Google Scholar] [CrossRef] [PubMed]
- Zanola, A.; Fabrice Tshimanga, L.; Del Pup, F.; Baiesi, M.; Atzori, M. xEEGNet: towards explainable AI in EEG dementia classification. J. Neural Eng. 2025, 22, 046042. [Google Scholar] [CrossRef] [PubMed]
- Ahmad, I.; Zhu, M.; Li, G.; Javeed, D.; Kumar, P.; Chen, S. A secure and interpretable AI for smart healthcare system: A case study on epilepsy diagnosis using EEG signals. IEEE J. Biomed. Health Inform. 2024, 28, 3236–3247. [Google Scholar] [CrossRef] [PubMed]
- Islam, M.S.; Hussain, I.; Rahman, M.M.; Park, S.J.; Hossain, M.A. Explainable artificial intelligence model for stroke prediction using EEG signal. Sensors 2022, 22, 9859. [Google Scholar] [CrossRef] [PubMed]
- Jayaram, V.; Alamgir, M.; Altun, Y.; Scholkopf, B.; Grosse-Wentrup, M. Transfer learning in brain-computer interfaces. IEEE Comput. Intell. Mag. 2016, 11, 20–31. [Google Scholar] [CrossRef]
- Lotte, F.; Bougrain, L.; Cichocki, A.; Clerc, M.; Congedo, M.; Rakotomamonjy, A.; Yger, F. A review of classification algorithms for EEG-based brain–computer interfaces: a 10 year update. J. Neural Eng. 2018, 15, 031005. [Google Scholar] [CrossRef] [PubMed]
- Caruana, R.; Lou, Y.; Gehrke, J.; Koch, P.; Sturm, M.; Elhadad, N. Intelligible models for healthcare: Predicting pneumonia risk and hospital 30-day readmission. In Proceedings of the Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, 2015; pp. 1721–1730. [Google Scholar] [CrossRef]
- Zhang, Z.; Damiani, E.; Hamadi, H.; Yeun, C.; Taher, F. A late multi-modal fusion model for detecting hybrid spam e-mail. Int. J. Comput. Theory Eng. 2023, 15, 76–81. [Google Scholar] [CrossRef]
- Moor, M.; Banerjee, O.; Abad, Z.S.H.; Krumholz, H.M.; Leskovec, J.; Topol, E.J.; Rajpurkar, P. Foundation models for generalist medical artificial intelligence. Nature 2023, 616, 259–265. [Google Scholar] [CrossRef] [PubMed]
- Lu, W.; Song, C.; Wu, J.; Zhu, P.; Zhou, Y.; Mai, W.; Zheng, Q.; Ouyang, W. Unimind: Unleashing the power of llms for unified multi-task brain decoding. arXiv 2025, arXiv:2506.18962. [Google Scholar] [CrossRef]
- Sánchez-Hernández, S.E.; Torres-Ramos, S.; Román-Godínez, I.; Salido-Ruiz, R.A. Evaluation of the relation between ictal EEG features and XAI explanations. Brain Sci. 2024, 14, 306. [Google Scholar] [CrossRef] [PubMed]
- Hussain, I.; Jany, R.; Boyer, R.; Azad, A.; Alyami, S.A.; Park, S.J.; Hasan, M.M.; Hossain, M.A. An explainable EEG-based human activity recognition model using machine-learning approach and LIME. Sensors 2023, 23, 7452. [Google Scholar] [CrossRef] [PubMed]
- Schirrmeister, R.T.; Springenberg, J.T.; Fiederer, L.D.J.; Glasstetter, M.; Eggensperger, K.; Tangermann, M.; Hutter, F.; Burgard, W.; Ball, T. Deep learning with convolutional neural networks for EEG decoding and visualization. Hum. Brain Mapp. 2017, 38, 5391–5420. [Google Scholar] [CrossRef] [PubMed]
- Ribeiro, M.T.; Singh, S.; Guestrin, C. Why should i trust you?" Explaining the predictions of any classifier. In Proceedings of the Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, 2016; pp. 1135–1144. [Google Scholar] [CrossRef]
- AlSaad, R.; Abd-Alrazaq, A.; Boughorbel, S.; Ahmed, A.; Renault, M.A.; Damseh, R.; Sheikh, J. Multimodal large language models in health care: applications, challenges, and future outlook. J. Med. Internet Res. 2024, 26, e59505. [Google Scholar] [CrossRef] [PubMed]
- Carmona-Martos, L.; Martín-Palomeque, P.; Escudero-Arnanz, Ó.; Soguero-Ruiz, C. Interpretable large language models for early prediction of antimicrobial multidrug resistance. Health Inf. Sci. Syst. 2025, 14, 11. [Google Scholar] [CrossRef] [PubMed]
- Feli, M.; Azimi, I.; Liljeberg, P.; Rahmani, A.M. An llm-powered agent for physiological data analysis: A case study on ppg-based heart rate estimation. In Proceedings of the 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); IEEE, 2025; pp. 1–7. [Google Scholar] [CrossRef] [PubMed]
- Guo, F.; Zhang, Z.; Mo, H.; Li, C. A method for battery soh estimation based on k-means and lightgbm algorithm. In Proceedings of the 2024 6th International Conference on System Reliability and Safety Engineering (SRSE); IEEE, 2024; pp. 1–7. [Google Scholar] [CrossRef]
- Zou, H.; Tian, Y.; Wang, B.; Bariah, L.; Lasaulce, S.; Huang, C.; Debbah, M. RF-GPT: Teaching AI to See the Wireless World. arXiv 2026, arXiv:2602.14833. [Google Scholar] [CrossRef]
- Craik, A.; He, Y.; Contreras-Vidal, J.L. Deep learning for electroencephalogram (EEG) classification tasks: a review. J. Neural Eng. 2019, 16, 031001. [Google Scholar] [CrossRef] [PubMed]
- Xia, M.; Zhang, Y.; Wu, Y.; Wang, X. An end-to-end deep learning model for EEG-based major depressive disorder classification. IEEE Access 2023, 11, 41337–41347. [Google Scholar] [CrossRef]
- Subhani, A.R.; Mumtaz, W.; Saad, M.N.B.M.; Kamel, N.; Malik, A.S. Machine learning framework for the detection of mental stress at multiple levels. IEEE Access 2017, 5, 13545–13556. [Google Scholar] [CrossRef]
- Wan, Z.; Yang, R.; Huang, M.; Zeng, N.; Liu, X. A review on transfer learning in EEG signal analysis. Neurocomputing 2021, 421, 1–14. [Google Scholar] [CrossRef]
- Xue, B.; Lv, Z.; Xue, J. Feature transfer learning in EEG-based emotion recognition. In Proceedings of the 2020 Chinese Automation Congress (CAC); IEEE, 2020; pp. 3608–3611. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 2017, 30. [Google Scholar]
- Shoeibi, A.; Sadeghi, D.; Moridian, P.; Ghassemi, N.; Heras, J.; Alizadehsani, R.; Khadem, A.; Kong, Y.; Nahavandi, S.; Zhang, Y.D.; et al. Automatic diagnosis of schizophrenia in EEG signals using CNN-LSTM models. Front. Neuroinformatics 2021, 15, 777977. [Google Scholar] [CrossRef] [PubMed]
- Borra, D.; Fantozzi, S.; Magosso, E. Interpretable and lightweight convolutional neural network for EEG decoding: Application to movement execution and imagination. Neural Netw. 2020, 129, 55–74. [Google Scholar] [CrossRef] [PubMed]
- Tonekaboni, S.; Joshi, S.; McCradden, M.D.; Goldenberg, A. What clinicians want: contextualizing explainable machine learning for clinical end use. In Proceedings of the Machine learning for healthcare conference. PMLR, 2019; pp. 359–380. [Google Scholar] [CrossRef]
- Holzinger, A.; Langs, G.; Denk, H.; Zatloukal, K.; Müller, H. Causability and explainability of artificial intelligence in medicine. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2019, 9, e1312. [Google Scholar] [CrossRef] [PubMed]
- Theissler, A.; Spinnato, F.; Schlegel, U.; Guidotti, R. Explainable AI for time series classification: a review, taxonomy and research directions. Ieee Access 2022, 10, 100700–100724. [Google Scholar] [CrossRef]
- Singhal, K.; Azizi, S.; Tu, T.; Mahdavi, S.S.; Wei, J.; Chung, H.W.; Scales, N.; Tanwani, A.; Cole-Lewis, H.; Pfohl, S.; et al. Large language models encode clinical knowledge. Nature 2023, 620, 172–180. [Google Scholar] [CrossRef] [PubMed]
- Thirunavukarasu, A.J.; Ting, D.S.J.; Elangovan, K.; Gutierrez, L.; Tan, T.F.; Ting, D.S.W. Large language models in medicine. Nat. Med. 2023, 29, 1930–1940. [Google Scholar] [CrossRef] [PubMed]
- Ouyang, L.; Wu, J.; Jiang, X.; Almeida, D.; Wainwright, C.; Mishkin, P.; Zhang, C.; Agarwal, S.; Slama, K.; Ray, A.; et al. Training language models to follow instructions with human feedback. Adv. Neural Inf. Process. Syst. 2022, 35, 27730–27744. [Google Scholar] [CrossRef]
- Peng, Q.; Li, J.; Huang, S.; Jiang, Y.; Gong, K.; Ding, R.; Ye, S.; Zheng, C.; Wei, X.Y.; Li, Q. Aligning clinical needs and AI capabilities: a survey on LLMs for medical reasoning. Authorea Prepr. 2025. [Google Scholar] [CrossRef] [PubMed]
- Babu, N.; Mathew, J.; Vinod, A. Large language models for eeg: A comprehensive survey and taxonomy. arXiv 2025, arXiv:2506.06353. [Google Scholar] [CrossRef]
- Babu, N.; Mathew, J.; Satija, U.; Vinod, A. Modality reprogramming: Adapting frozen LLMs for multi-channel EEG classification. Neurocomputing 2025, 132407. [Google Scholar] [CrossRef]
- Al Hammadi, A.Y.; Yeun, C.Y.; Damiani, E.; Yoo, P.D.; Hu, J.; Yeun, H.K.; Yim, M.S. Explainable artificial intelligence to evaluate industrial internal security using EEG signals in IoT framework. Ad. Hoc Netw. 2021, 123, 102641. [Google Scholar] [CrossRef]
- Al Hammadi, A.Y.; Lee, D.; Yeun, C.Y.; Damiani, E.; Kim, S.K.; Yoo, P.D.; Choi, H.J. Novel EEG Sensor-Based Risk Framework for the Detection of Insider Threats in Safety Critical Industrial Infrastructure. IEEE Access 2020, 8, 206222–206234. [Google Scholar] [CrossRef]
- Joshi, V.M.; Ghongade, R.B. IDEA: Intellect database for emotion analysis using EEG signal. J. King Saud. Univ.-Comput. Inf. Sci. 2022, 34, 4433–4447. [Google Scholar] [CrossRef]
- Kim, J.; Park, Y.; Chung, W. Transform based feature construction utilizing magnitude and phase for convolutional neural network in EEG signal classification. In Proceedings of the 2020 8th International Winter Conference on Brain-Computer Interface (BCI), 2020; pp. 1–4. [Google Scholar] [CrossRef]
- Agarwal, T.; Raturi, S.; Vybhav, T.; Singh, M. Classification of EEG signal using lstms under audiovisual stimuli. In Proceedings of the 2020 international conference on communication and signal processing (iccsp); IEEE, 2020; pp. 1229–1232. [Google Scholar] [CrossRef]
- Chao, H.; Dong, L. Emotion Recognition Using Three-Dimensional Feature and Convolutional Neural Network from Multichannel EEG Signals. IEEE Sens. J. 2021, 21, 2024–2034. [Google Scholar] [CrossRef]
- Chattopadhyay, S.; Zary, L.; Quek, C.; Prasad, D.K. Motivation detection using EEG signal analysis by residual-in-residual convolutional neural network. Expert Syst. With Appl. 2021, 184, 115548. [Google Scholar] [CrossRef]
- Zhang, Z.; Umar, S.; Hammadi, A.Y.A.; Yoon, S.; Damiani, E.; Ardagna, C.A.; Bena, N.; Yeun, C.Y. Explainable Data Poison Attacks on Human Emotion Evaluation Systems Based on EEG Signals. IEEE Access 2023, 11, 18134–18147. [Google Scholar] [CrossRef]
- Wu, X.K.; Chen, M.; Li, W.; Wang, R.; Lu, L.; Liu, J.; Hwang, K.; Hao, Y.; Pan, Y.; Meng, Q.; et al. Llm fine-tuning: Concepts, opportunities, and challenges. Big Data Cogn. Comput. 2025, 9, 87. [Google Scholar] [CrossRef]
- Zhang, B.; Wang, J.; Du, Q.; Zhang, J.; Tu, Z.; Chu, D. A survey on data selection for llm instruction tuning. J. Artif. Intell. Res. 2025, 83. [Google Scholar] [CrossRef]
- Che, C.; Wang, Z.; Yang, P.; Wang, C.; Ma, H.; Shi, Z. LoRA in LoRA: Towards parameter-efficient architecture expansion for continual visual instruction tuning. Proc. Proc. AAAI Conf. Artif. Intell. 2026, Vol. 40, 19978–19986. [Google Scholar] [CrossRef]
- Zhang, Z.; Hamadi, H.A.; Damiani, E.; Yeun, C.Y.; Taher, F. Explainable Artificial Intelligence Applications in Cyber Security: State-of-the-Art in Research. IEEE Access 2022, 10, 93104–93139. [Google Scholar] [CrossRef]
- Huang, X.; Zhang, Z.; Guo, F.; Wang, X.; Chi, K.; Wu, K. Research on older adults’ interaction with e-health interface based on explainable artificial intelligence. In Proceedings of the International Conference on Human-Computer Interaction; Springer, 2024; pp. 38–52. [Google Scholar] [CrossRef]
- Ribeiro, M.T.; Singh, S.; Guestrin, C. Anchors: High-precision model-agnostic explanations. In Proceedings of the Proceedings of the AAAI conference on artificial intelligence, 2018; Vol. 32. [Google Scholar] [CrossRef]
- Li, H.; Kam-Kwai, W.; Luo, Y.; Chen, J.; Liu, C.; Zhang, Y.; Lau, A.K.H.; Qu, H.; Liu, D. Save It for the “Hot” Day: An LLM-Empowered Visual Analytics System for Heat Risk Management. IEEE Trans. Vis. Comput. Graph. 2025, 31, 8928–8943. [Google Scholar] [CrossRef] [PubMed]
- Ku, J.; Kim, S.; Lee, E.; Zaman, U.; Kim, K. Enhancing Autonomous Ship Communication: A Cost-Effective and High-Accuracy LLM Framework Using Decision Trees and RAG. Proceedings of the 2025 International Conference on Artificial Intelligence in Information and Communication (ICAIIC) 2025, 0420–0426. [Google Scholar] [CrossRef]
- Bradley, M.M.; Lang, P.J. International Affective Picture System. In Encyclopedia of Personality and Individual Differences; Zeigler-Hill, V., Shackelford, T.K., Eds.; Springer International Publishing: Cham, 2017; pp. 1–4. [Google Scholar] [CrossRef]
- Alhammadi, A.; Yeob Yeun, C.; Damiani, E.; D. Yoo, P.; Hu, J.; Ku Yeun, H.; Yim, M.-S. EEG Brainwave Dataset. [CrossRef]








| Item | Setting |
|---|---|
| Evaluated LLMs | Gemma-3-4B, LLaMA-3-4B, Qwen-3-4B |
| Tuned base model | Qwen/Qwen3-4B |
| Fine-tuning method | LoRA instruction tuning |
| Training data format | Structured EEG QA pairs |
| Maximum sequence length | 2048 tokens |
| LoRA rank | 16 |
| LoRA alpha | 32 |
| LoRA dropout | 0.05 |
| Batch size | 2 per device |
| Gradient accumulation | 8 steps |
| Epochs | 1 |
| Maximum training steps | 100 |
| Learning rate | |
| Optimizer | paged_adamw_8bit |
| Quantization | 4-bit NF4 during tuning; Q4_K_M for GGUF export |
| Implementation | Google Colab, PyTorch, HuggingFace, PEFT |
| Method | Accuracy | Macro-Precision | Macro-Recall | Macro-F1 |
|---|---|---|---|---|
| EEGNet | 0.713 | 0.729 | 0.714 | 0.715 |
| DeepConvNet | 0.919 | 0.926 | 0.919 | 0.920 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).