Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

A Deep Learning Approach to Analyze NMR Spectra of SH-SY5Y Cells for Alzheimer’s Disease Diagnosis

Version 1 : Received: 12 May 2023 / Approved: 15 May 2023 / Online: 15 May 2023 (05:42:08 CEST)

A peer-reviewed article of this Preprint also exists.

Costanti, F.; Kola, A.; Scarselli, F.; Valensin, D.; Bianchini, M. A Deep Learning Approach to Analyze NMR Spectra of SH-SY5Y Cells for Alzheimer’s Disease Diagnosis. Mathematics 2023, 11, 2664. Costanti, F.; Kola, A.; Scarselli, F.; Valensin, D.; Bianchini, M. A Deep Learning Approach to Analyze NMR Spectra of SH-SY5Y Cells for Alzheimer’s Disease Diagnosis. Mathematics 2023, 11, 2664.

Abstract

Alzheimer’s Disease (AD) affects the quality of life of millions of people worldwide and represents one of the biggest challenges for the whole society. The SH-SY5Y neuroblastoma cell line is often used as an in vitro model of neuronal function and is widely applied to study the molecular events leading to AD. In the last few years, basic research on SH-SY5Y cells has provided interesting insights for the discovery of new drugs and biomarkers for improved AD treatment and diagnosis. At the same time, untargeted NMR metabolomics is widely applied on biological fluids for (i) metabolic profile analysis, (ii) screening for differential metabolites, (iii) analysis of metabolic pathways, and (iv) the discovery of new biomarkers. Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) have proved to be powerful methods for processing NMR data, being useful in signal quantization, even if more sophisticated --- typically non--linear --- techniques are needed to obtain compact yet information--rich embeddings for complex spectra. In this paper, a compression technique based on convolutional autoencoders is proposed, which can perform a high dimensionality reduction of the spectral signal (up to more than 300 times), maintaining informative features (guaranteed by a reconstruction error always smaller than 5%). Moreover, before compression, an ad hoc preprocessing method was devised to remedy the scarcity of available data. The compressed spectral data were then used to train some SVM classifiers to distinguish diseased from healthy cells, achieving an accuracy close to 78%, a significantly better performance with respect to using PCA--compressed data.

Keywords

Alzheimer’s Disease; SH-SY5Y cells; Nuclear Magnetic Resonance (NMR); Convolutional autoencoders; Embedding of NMR spectra; Data augmentation.

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.