Submitted:
09 June 2025
Posted:
10 June 2025
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
1.1. NLP Transformers
1.2. The Neuro-BOT Transformer Architecture
1.3. Input Embeddings
1.4. Positional Encodings
1.5. Self-Attention Matrices
1.6. Mono-Head vs Multi-Head Attention
1.7. Statistical Learning
1.8. Task-Specific Application: Parkinson’s Disease
2. Methods
2.1. Participants
2.2. MRI Acquisition
2.3. Image Pre-Processing
2.4. NEUROBOT Transformer Implementation
2.5. Statistics
2.5.1. Parametric Outlier Detection (Grubbs’ Test)
2.5.2. Non-Parametric Outlier Detection (MAD)
3. Results
4. Discussion
Acknowledgments
Conflicts of Interest
Author Contributions
Data Availability Statement
References
- Beliveau, V., M. Ganz, L. Feng, B. Ozenne, L. Højgaard, P. M. Fisher, C. Svarer, D. N. Greve, and G. M. Knudsen. 2017. ’A High-Resolution In Vivo Atlas of the Human Brain’s Serotonin System’, J Neurosci, 37: 120-28.
- Bethlehem, R. A. I., J. Seidlitz, S. R. White, J. W. Vogel, K. M. Anderson, C. Adamson, S. Adler, G. S. Alexopoulos, E. Anagnostou, A. Areces-Gonzalez, D. E. Astle, B. Auyeung, M. Ayub, J. Bae, G. Ball, S. Baron-Cohen, R. Beare, S. A. Bedford, V. Benegal, F. Beyer, J. Blangero, M. Blesa Cábez, J. P. Boardman, M. Borzage, J. F. Bosch-Bayard, N. Bourke, V. D. Calhoun, M. M. Chakravarty, C. Chen, C. Chertavian, G. Chetelat, Y. S. Chong, J. H. Cole, A. Corvin, M. Costantino, E. Courchesne, F. Crivello, V. L. Cropley, J. Crosbie, N. Crossley, M. Delarue, R. Delorme, S. Desrivieres, G. A. Devenyi, M. A. Di Biase, R. Dolan, K. A. Donald, G. Donohoe, K. Dunlop, A. D. Edwards, J. T. Elison, C. T. Ellis, J. A. Elman, L. Eyler, D. A. Fair, E. Feczko, P. C. Fletcher, P. Fonagy, C. E. Franz, L. Galan-Garcia, A. Gholipour, J. Giedd, J. H. Gilmore, D. C. Glahn, I. M. Goodyer, P. E. Grant, N. A. Groenewold, F. M. Gunning, R. E. Gur, R. C. Gur, C. F. Hammill, O. Hansson, T. Hedden, A. Heinz, R. N. Henson, K. Heuer, J. Hoare, B. Holla, A. J. Holmes, R. Holt, H. Huang, K. Im, J. Ipser, C. R. Jack, Jr., A. P. Jackowski, T. Jia, K. A. Johnson, P. B. Jones, D. T. Jones, R. S. Kahn, H. Karlsson, L. Karlsson, R. Kawashima, E. A. Kelley, S. Kern, K. W. Kim, M. G. Kitzbichler, W. S. Kremen, F. Lalonde, B. Landeau, S. Lee, J. Lerch, J. D. Lewis, J. Li, W. Liao, C. Liston, M. V. Lombardo, J. Lv, C. Lynch, T. T. Mallard, M. Marcelis, R. D. Markello, S. R. Mathias, B. Mazoyer, P. McGuire, M. J. Meaney, A. Mechelli, N. Medic, B. Misic, S. E. Morgan, D. Mothersill, J. Nigg, M. Q. W. Ong, C. Ortinau, R. Ossenkoppele, M. Ouyang, L. Palaniyappan, L. Paly, P. M. Pan, C. Pantelis, M. M. Park, T. Paus, Z. Pausova, D. Paz-Linares, A. Pichet Binette, K. Pierce, X. Qian, J. Qiu, A. Qiu, A. Raznahan, T. Rittman, A. Rodrigue, C. K. Rollins, R. Romero-Garcia, L. Ronan, M. D. Rosenberg, D. H. Rowitch, G. A. Salum, T. D. Satterthwaite, H. L. Schaare, R. J. Schachar, A. P. Schultz, G. Schumann, M. Schöll, D. Sharp, R. T. Shinohara, I. Skoog, C. D. Smyser, R. A. Sperling, D. J. Stein, A. Stolicyn, J. Suckling, G. Sullivan, Y. Taki, B. Thyreau, R. Toro, N. Traut, K. A. Tsvetanov, N. B. Turk-Browne, J. J. Tuulari, C. Tzourio, É Vachon-Presseau, M. J. Valdes-Sosa, P. A. Valdes-Sosa, S. L. Valk, T. van Amelsvoort, S. N. Vandekar, L. Vasung, L. W. Victoria, S. Villeneuve, A. Villringer, P. E. Vértes, K. Wagstyl, Y. S. Wang, S. K. Warfield, V. Warrier, E. Westman, M. L. Westwater, H. C. Whalley, A. V. Witte, N. Yang, B. Yeo, H. Yun, A. Zalesky, H. J. Zar, A. Zettergren, J. H. Zhou, H. Ziauddeen, A. Zugman, X. N. Zuo, E. T. Bullmore, and A. F. Alexander-Bloch. 2022. ’Brain charts for the human lifespan’, Nature, 604: 525-33.
- Betzel, R. F., A. Griffa, P. Hagmann, and B. Mišić. 2019. ’Distance-dependent consensus thresholds for generating group-representative structural brain networks’, Netw Neurosci, 3: 475-96.
- Braak, H., K. Del Tredici, U. Rüb, R. A. de Vos, E. N. Jansen Steur, and E. Braak. 2003. ’Staging of brain pathology related to sporadic Parkinson’s disease’, Neurobiol Aging, 24: 197-211.
- Calhoun, V. D., T. Adali, G. D. Pearlson, and J. J. Pekar. 2001. ’A method for making group inferences from functional MRI data using independent component analysis’, Hum Brain Mapp, 14: 140-51.
- Dafflon, J., W. H. L. Pinaya, F. Turkheimer, J. H. Cole, R. Leech, M. A. Harris, S. R. Cox, H. C. Whalley, A. M. McIntosh, and P. J. Hellyer. 2020. ’An automated machine learning approach to predict brain age from cortical anatomical measures’, Hum Brain Mapp, 41: 3555-66.
- Delaville, C., P. D. Deurwaerdère, and A. Benazzouz. 2011. ’Noradrenaline and Parkinson’s disease’, Front Syst Neurosci, 5: 31.
- Devlin, Jacob, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding." In, 4171-86. Minneapolis, Minnesota: Association for Computational Linguistics.
- Doppler, C. E. J., J. A. M. Smit, M. Hommelsen, A. Seger, J. Horsager, M. B. Kinnerup, A. K. Hansen, T. D. Fedorova, K. Knudsen, M. Otto, A. Nahimi, P. Borghammer, and M. Sommerauer. 2021. ’Microsleep disturbances are associated with noradrenergic dysfunction in Parkinson’s disease’, Sleep, 44.
- Dosovitskiy, Alexey, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2020. ’An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale’, ArXiv, abs/2010.11929.
- Elman, Jeffrey L. 1990. ’Finding Structure in Time’, Cognitive Science, 14: 179-211.
- Esteban, O., C. J. Markiewicz, R. W. Blair, C. A. Moodie, A. I. Isik, A. Erramuzpe, J. D. Kent, M. Goncalves, E. DuPre, M. Snyder, H. Oya, S. S. Ghosh, J. Wright, J. Durnez, R. A. Poldrack, and K. J. Gorgolewski. 2019. ’fMRIPrep: a robust preprocessing pipeline for functional MRI’, Nat Methods, 16: 111-16.
- Fagerholm, E. D., R. Leech, F. E. Turkheimer, G. Scott, and M. Brázdil. 2024. ’Estimating the energy of dissipative neural systems’, Cogn Neurodyn, 18: 3839-46.
- Fischl, B. 2012. ’FreeSurfer’, Neuroimage, 62: 774-81.
- Fu, Linling, and Lei Liu. 2024. ’What are the differences? A comparative study of generative artificial intelligence translation and human translation of scientific texts’, Humanities and Social Sciences Communications, 11: 1236.
- Glasser, M. F., and D. C. Van Essen. 2011. ’Mapping human cortical areas in vivo based on myelin content as revealed by T1- and T2-weighted MRI’, J Neurosci, 31: 11597-616.
- Grubbs, F. E. 1950. ’Sample Criteria for Testing Outlying Observations’, Ann. Math. Statist. , 21: 27-58.
- Guyon, I. 2003. ’An Introduction to Variable and Feature Selection’, Journal of Machine Learning Research, 3: 1157-82.
- Hansen, J. Y., G. Shafiei, R. D. Markello, K. Smart, S. M. L. Cox, M. Nørgaard, V. Beliveau, Y. Wu, J. D. Gallezot, É Aumont, S. Servaes, S. G. Scala, J. M. DuBois, G. Wainstein, G. Bezgin, T. Funck, T. W. Schmitz, R. N. Spreng, M. Galovic, M. J. Koepp, J. S. Duncan, J. P. Coles, T. D. Fryer, F. I. Aigbirhio, C. J. McGinnity, A. Hammers, J. P. Soucy, S. Baillet, S. Guimond, J. Hietala, M. A. Bedard, M. Leyton, E. Kobayashi, P. Rosa-Neto, M. Ganz, G. M. Knudsen, N. Palomero-Gallagher, J. M. Shine, R. E. Carson, L. Tuominen, A. Dagher, and B. Misic. 2022. ’Mapping neurotransmitter systems to the structural and functional organization of the human neocortex’, Nat Neurosci, 25: 1569-81.
- Hauglund, Natalie L., Mie Andersen, Klaudia Tokarska, Tessa Radovanovic, Celia Kjaerby, Frederikke L. Sørensen, Zuzanna Bojarowska, Verena Untiet, Sheyla B. Ballestero, Mie G. Kolmos, Pia Weikop, Hajime Hirase, and Maiken Nedergaard. 2025. ’Norepinephrine-mediated slow vasomotion drives glymphatic clearance during sleep’, Cell, 188: 606-22.e17.
- Hawrylycz, M. J., E. S. Lein, A. L. Guillozet-Bongaarts, E. H. Shen, L. Ng, J. A. Miller, L. N. van de Lagemaat, K. A. Smith, A. Ebbert, Z. L. Riley, C. Abajian, C. F. Beckmann, A. Bernard, D. Bertagnolli, A. F. Boe, P. M. Cartagena, M. M. Chakravarty, M. Chapin, J. Chong, R. A. Dalley, B. David Daly, C. Dang, S. Datta, N. Dee, T. A. Dolbeare, V. Faber, D. Feng, D. R. Fowler, J. Goldy, B. W. Gregor, Z. Haradon, D. R. Haynor, J. G. Hohmann, S. Horvath, R. E. Howard, A. Jeromin, J. M. Jochim, M. Kinnunen, C. Lau, E. T. Lazarz, C. Lee, T. A. Lemon, L. Li, Y. Li, J. A. Morris, C. C. Overly, P. D. Parker, S. E. Parry, M. Reding, J. J. Royall, J. Schulkin, P. A. Sequeira, C. R. Slaughterbeck, S. C. Smith, A. J. Sodt, S. M. Sunkin, B. E. Swanson, M. P. Vawter, D. Williams, P. Wohnoutka, H. R. Zielke, D. H. Geschwind, P. R. Hof, S. M. Smith, C. Koch, S. G. N. Grant, and A. R. Jones. 2012. ’An anatomically comprehensive atlas of the adult human brain transcriptome’, Nature, 489: 391-99.
- Jolliffe, I.T. 2002. Principal Component Analysis (Springer-Verlag New York, Inc: New York).
- Kopeć, K., S. Szleszkowski, D. Koziorowski, and S. Szlufik. 2023. ’Glymphatic System and Mitochondrial Dysfunction as Two Crucial Players in Pathophysiology of Neurodegenerative Disorders’, Int J Mol Sci, 24.
- Lawn, T., A. Giacomel, D. Martins, M. Veronese, M. Howard, F. E. Turkheimer, and O. Dipasquale. 2024. ’Normative modelling of molecular-based functional circuits captures clinical heterogeneity transdiagnostically in psychiatric patients’, Commun Biol, 7: 689.
- Lawn, T., M. A. Howard, F. Turkheimer, B. Misic, G. Deco, D. Martins, and O. Dipasquale. 2023. ’From neurotransmitters to networks: Transcending organisational hierarchies with molecular-informed functional imaging’, Neurosci Biobehav Rev, 150: 105193.
- McIntosh, A. R., F. L. Bookstein, J. V. Haxby, and C. L. Grady. 1996. ’Spatial pattern analysis of functional brain images using partial least squares’, Neuroimage, 3: 143-57.
- Mehta, Kahini, Taylor Salo, Thomas J. Madison, Azeez Adebimpe, Danielle S. Bassett, Max Bertolero, Matthew Cieslak, Sydney Covitz, Audrey Houghton, Arielle S. Keller, Jacob T. Lundquist, Audrey Luo, Oscar Miranda-Dominguez, Steve M. Nelson, Golia Shafiei, Sheila Shanmugan, Russell T. Shinohara, Christopher D. Smyser, Valerie J. Sydnor, Kimberly B. Weldon, Eric Feczko, Damien A. Fair, and Theodore D. Satterthwaite. 2024. ’XCP-D: A robust pipeline for the post-processing of fMRI data’, Imaging Neuroscience, 2: 1-26.
- Mendes, Sergio Leonardo, Walter Hugo Lopez Pinaya, Pedro Mario Pan, Ary Gadelha, Sintia Belangero, Andrea Parolin Jackowski, Luis Augusto Rohde, Euripedes Constantino Miguel, and João Ricardo Sato. 2024. ’GPT-based normative models of brain sMRI correlate with dimensional psychopathology’, Imaging Neuroscience, 2: 1-15.
- Mosharov, E. V., A. M. Rosenberg, A. S. Monzel, C. A. Osto, L. Stiles, G. B. Rosoklija, A. J. Dwork, S. Bindra, Y. Zhang, M. Fujita, M. B. Mariani, M. Bakalian, D. Sulzer, P. L. De Jager, V. Menon, O. S. Shirihai, J. J. Mann, M. Underwood, M. Boldrini, M. T. de Schotten, and M. Picard. 2024. ’A Human Brain Map of Mitochondrial Respiratory Capacity and Diversity’, bioRxiv.
- Paluszek, Michael, Stephanie Thomas, and Eric Ham. 2022. ’MATLAB Machine Learning Toolboxes.’ in Michael Paluszek, Stephanie Thomas and Eric Ham (eds.), Practical MATLAB Deep Learning: A Projects-Based Approach (Apress: Berkeley, CA).
- Petri, G., P. Expert, F. Turkheimer, R. Carhart-Harris, D. Nutt, P. J. Hellyer, and F. Vaccarino. 2014. ’Homological scaffolds of brain functional networks’, J R Soc Interface, 11: 20140873.
- Pudjihartono, N., T. Fadason, A. W. Kempa-Liehr, and J. M. O’Sullivan. 2022. ’A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction’, Front Bioinform, 2: 927312.
- Radford, Alec, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. 2018. ’Improving language understanding by generative pre-training’.
- Radford, Alec, Jeff Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2019. "Language Models are Unsupervised Multitask Learners." In.
- Razavi, Ali, Aaron Van den Oord, and Oriol Vinyals. 2019. ’Generating diverse high-fidelity images with vq-vae-2’, Advances in neural information processing systems, 32.
- Schaefer, A., R. Kong, E. M. Gordon, T. O. Laumann, X. N. Zuo, A. J. Holmes, S. B. Eickhoff, and B. T. T. Yeo. 2018. ’Local-Global Parcellation of the Human Cerebral Cortex from Intrinsic Functional Connectivity MRI’, Cereb Cortex, 28: 3095-114.
- Scholtens, L. H., M. A. de Reus, S. C. de Lange, R. Schmidt, and M. P. van den Heuvel. 2018. ’An MRI Von Economo - Koskinas atlas’, Neuroimage, 170: 249-56.
- Sporns, O. 2013. ’Structure and function of complex brain networks’, Dialogues Clin Neurosci, 15: 247-62.
- Surmeier, D. J., J. A. Obeso, and G. M. Halliday. 2017. ’Selective neuronal vulnerability in Parkinson disease’, Nat Rev Neurosci, 18: 101-13.
- van den Oord, Aäron, Oriol Vinyals, and Koray Kavukcuoglu. 2017. ’Neural discrete representation learning. CoRR abs/1711.00937 (2017)’, arXiv preprint arXiv:1711.00937.
- Van Essen, D. C., S. M. Smith, D. M. Barch, T. E. Behrens, E. Yacoub, and K. Ugurbil. 2013. ’The WU-Minn Human Connectome Project: an overview’, Neuroimage, 80: 62-79.
- Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. ’Attention Is All You Need’.
- Wagstyl, K., S. Larocque, G. Cucurull, C. Lepage, J. P. Cohen, S. Bludau, N. Palomero-Gallagher, L. B. Lewis, T. Funck, H. Spitzer, T. Dickscheid, P. C. Fletcher, A. Romero, K. Zilles, K. Amunts, Y. Bengio, and A. C. Evans. 2020. ’BigBrain 3D atlas of cortical layers: Cortical and laminar thickness gradients diverge in sensory and motor cortices’, PLoS Biol, 18: e3000678.
- Weinshenker, D. 2018. ’Long Road to Ruin: Noradrenergic Dysfunction in Neurodegenerative Disease’, Trends Neurosci, 41: 211-23.
- Yarkoni, T., R. A. Poldrack, T. E. Nichols, D. C. Van Essen, and T. D. Wager. 2011. ’Large-scale automated synthesis of human functional neuroimaging data’, Nat Methods, 8: 665-70.
- Zilles, K., and N. Palomero-Gallagher. 2017. ’Multiple Transmitter Receptors in Regions and Layers of the Human Cerebral Cortex’, Front Neuroanat, 11: 78.





| TYPE OF LAYER | LAYER | ACCURACY (%) |
| NONE | 71.3 | |
|
POSITIONAL ENCODERS |
DOPAMINE | 73.0 |
| SEROTONIN | 74.7 | |
| NORADRENALINE | 89.7* | |
| ACETYLCHOLINE | 71.3 | |
| MIT COMPLEX II | 73.0 | |
| MIT COMPLEX IV | 71.8 | |
| MIT DENSITY | 71.3 | |
| TIS. RESPIRATORY CAPACITY | 75.3 | |
| MIT. RESPIRATORY CAPACITY | 73.0 | |
|
SELF-ATTENTION LAYERS |
PRINCIPAL COMPONENT 1 | 71.3 |
| PRINCIPAL COMPONENT 2 | 73.0 | |
| PRINCIPAL COMPONENT 3 | 73.0 | |
| PRINCIPAL COMPONENT 4 | 71.8 | |
| PRINCIPAL COMPONENT 5 | 71.8 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).