Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Evaluation and Exploration of Machine Learning and CNN Classifiers in Detection of Lung Cancer from Microarray Gene - A Paradigm Shift

Version 1 : Received: 6 July 2023 / Approved: 7 July 2023 / Online: 7 July 2023 (16:30:59 CEST)

A peer-reviewed article of this Preprint also exists.

S, K.M.; Rajaguru, H.; Nair, A.R. Evaluation and Exploration of Machine Learning and Convolutional Neural Network Classifiers in Detection of Lung Cancer from Microarray Gene—A Paradigm Shift. Bioengineering 2023, 10, 933. S, K.M.; Rajaguru, H.; Nair, A.R. Evaluation and Exploration of Machine Learning and Convolutional Neural Network Classifiers in Detection of Lung Cancer from Microarray Gene—A Paradigm Shift. Bioengineering 2023, 10, 933.

Abstract

Microarray gene expression-based detection and classification of medical conditions have been prominent in research studies over the past few decades. However, extracting relevant data from the high-volume microarray gene expression with inherent nonlinearity and inseparable noise components raises significant challenges during data classification and disease detection. So, this paper proposes a two-level strategy involving feature extraction and selection methods before the classification step. The feature extraction step utilizes Short Term Fourier Transform (STFT), and the feature selection step employs Particle Swarm Optimization (PSO) and Harmonic Search (HS) metaheuristic methods. The classifiers employed are Non-Linear Regression, Gaussian Mixture Model, Softmax Discriminant, Naive Bayes, SVM (Linear), SVM (Polynomial), and SVM (RBF). The two-level extracted relevant features are compared with raw data classification results, including Convolutional Neural Network (CNN) Methodology. Among the methods, STFT with PSO feature selection and SVM (RBF) classifier produced the highest accuracy of 94.47%.

Keywords

Lung cancer classification; dimensionality reduction; feature selection techniques; STFT; Particle Swarm Optimization; Harmonic Search; Non-Linear Regression; Mixture Model; Convolutional Neural Network (CNN) for Lung Cancer; Microarray gene expression dataset

Subject

Engineering, Bioengineering

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.