Submitted:
30 August 2024
Posted:
02 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Approach to Gamma Knife Radiosurgery Implementation
2.2. Dataset
2.2.1. Data Preprocessing
2.2.2. Data Balancing
2.3. Models Description
2.3.1. Traditional ML Models
2.3.2. Autoencoders for Feature Extraction
- An input layer is defined.
- The first encoder layer has twice the number of input features, followed by batch normalization and LeakyReLU activation.
- The second encoder layer has the same number of input features, followed by batch normalization and LeakyReLU activation.
- LatentSpace(Bottleneck):
- The latentspace has the same number of neurons as the input features (no compression).
- Decoder:
- The first decoder layer has the same number of input features, followed by batch normalization and LeakyReLU activation.
- The second decoder layer has twice the number of input features, followed by batch normalization and LeakyReLU activation.
- Output layer:
- The output layer has the same number of input features with a linear activation function.
- B.) With compression in the bottleneck layer
- Encoder:
- An input layer is defined.
- The first encoder layer has twice the number of input features, followed by batch normalization and LeakyReLU activation.
- The second encoder layer has the same number of input features, followed by batch normalization and LeakyReLU activation.
- LatentSpace (Bottleneck):
- The latentspace has half the number of neurons as the input features (compressed size).
- Decoder:
- The first decoder layer has the same number of input features, followed by batch normalization and LeakyReLU activation.
- The second decoder layer has twice the number of input features, followed by batch normalization and LeakyReLU activation.
- Output layer:
- The output layer has the same number of input features with a linear activation function.
2.4. Training and Validation
2.4.1. Traditional ML Models
2.4.2. Autoencoders
3. Results and Discussion
3.1. Performance Metrics for Traditional ML
3.2. Performance Metrics in Autoencoders for Feature Extraction



3.3. Comparative Analysis
3.4. Discussion
4. Conclusion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
References
- Alan D Lopez, Colin D Mathers, MajidEzzati, Dean T Jamison, Christopher JL Murray,Global and regional burden of disease and risk factors, 2001: systematic analysis of population health data,TheLancet,Volume 367, Issue 9524,2006,Pages 1747-1757,ISSN 0140-6736. [CrossRef]
- NayanLamba, Patrick Y Wen, Ayal A Aizer, Epidemiology of brain metastases and leptomeningeal disease, Neuro-Oncology, Volume 23, Issue 9, September 2021, Pages 1447–1456. [CrossRef]
- Soffietti R, Rudā R, Mutani R. Management of brain metastases. J Neurol. 2002 Oct;249(10):1357-69. [CrossRef]
- Gavrilovic IT, Posner JB: Brain metastases: epidemiology and pathophysiology. J Neurooncol 2005; 75(1):5–14. [CrossRef]
- Posner JB, Chernik NL. Intracranial metastases from systemic cancer. Adv Neurol. 1978;19:579-92. [PubMed]
- Schouten JP, McElgunn CJ, Waaijer R, Zwijnenburg D, Diepvens F, Pals G. Relative quantification of 40 nucleic acid sequences by multiplex ligation-dependent probe amplification. Nucleic Acids Res. 2002 Jun 15;30(12):e57. [CrossRef]
- Patricia Sacks, Maryam Rahman,Epidemiology of Brain Metastases,Neurosurgery Clinics of North America,Volume 31, Issue 4,2020,Pages 481-488. [CrossRef]
- Thomas Noh, Tobias Walbert,Chapter 6 - Brain metastasis: clinical manifestations, symptom management, and palliative care,Editor(s): David Schiff, Martin J. van den Bent,Handbook of Clinical Neurology,Elsevier,Volume 149,2018,Pages 75-88,ISSN 0072-9752,ISBN 9780128111611.
- Dasgupta A, Maitre M, Pungavkar S, Gupta T. Magnetic Resonance Imaging in the Contemporary Management of Medulloblastoma: Current and Emerging Applications. Methods Mol Biol. 2022;2423:187-214.
- Suh, J.H., Kotecha, R., Chao, S.T. et al. Current approaches to the management of brain metastases. Nat Rev ClinOncol 17, 279–299 (2020). [CrossRef]
- Jaime Gállego Pérez-Larraya, Jerzy Hildebrand,Chapter 77 - Brain metastases††This chapter is dedicated to the memory of Professor Jerzy Hildebrand,Editor(s): José Biller, José M. Ferro,Handbook of Clinical Neurology,Elsevier,Volume 121,2014,Pages 1143-1157,ISSN 0072-9752,ISBN 9780702040887.
- Gerosa, M., Nicolato, A., Foroni, R., Zanotti, B., Tomazzoli, L., Miscusi, M., Alessandrini, F., &Bricolo, A. (2002). Gamma knife radiosurgery for brain metastases: a primary therapeutic option. Journal of Neurosurgery, 97, 515-524. [CrossRef]
- S. U. Hirway and S. H. Weinberg, A review of computational modeling, machine learning and image analysis in cancer metastasis dynamics, ComputSystOncol. 3 (2022), e1044. [CrossRef]
- Uzunova, H., Schultz, S., Handels, H. et al. Unsupervised pathology detection in medical images using conditional variational autoencoders. Int J CARS 14, 451–461 (2019). [CrossRef]
- Segato, A.; Marzullo, A.; Calimeri, F.; De Momi, E. Artificial Intelligence for Brain Diseases: A Systematic Review. APL Bioeng. 2020, 4, 041503. [CrossRef]
- Basu, K.; Sinha, R.; Ong, A.; Basu, T. Artificial Intelligence: How Is It Changing Medical Sciences and Its Future? Indian J. Dermatol. 2020, 65, 365–370. [CrossRef]
- Senders, J.T.; Staples, P.C.; Karhade, A.V.; Zaki, M.M.; Gormley, W.B.; Broekman, M.L.D.; Smith, T.R.; Arnaout, O. Machine Learning and Neurosurgical Outcome Prediction: A Systematic Review. World Neurosurg. 2018, 109, 476–486.e1. [CrossRef]
- Yang, Y.J.; Bang, C.S. Application of Artificial Intelligence in Gastroenterology. World J. Gastroenterol. 2019, 25, 1666–1683. [CrossRef]
- Panesar, S.S.; D’Souza, R.N.; Yeh, F.-C.; Fernandez-Miranda, J.C. Machine Learning Versus Logistic Regression Methods for 2-Year Mortality Prognostication in a Small, Heterogeneous Glioma Database. World Neurosurg. X 2019, 2, 100012. [CrossRef]
- Marcus, J.L.; Sewell, W.C.; Balzer, L.B.; Krakower, D.S. Artificial Intelligence and Machine Learning for HIV Prevention: Emerging Approaches to Ending the Epidemic. Curr. HIV/AIDS Rep. 2020, 17, 171–179. [CrossRef]
- Vega-Huerta, H.; Villanueva-Alarcón, R.; Mauricio, D.; Moreno, J.G.; Vilca, H.D.C.; Rodriguez, D.; Rodriguez, C. Convolutional Neural Networks on Assembling Classification Models to Detect Melanoma Skin Cancer. Int. J. Online Biomed. Eng. IJOE 2022, 18, 59–76. [CrossRef]
- Uspenskaya-Cadoz, O.; Alamuri, C.; Wang, L.; Yang, M.; Khinda, S.; Nigmatullina, Y.; Cao, T.; Kayal, N.; O’Keefe, M.; Rubel, C. Machine Learning Algorithm Helps Identify Non-Diagnosed Prodromal Alzheimer’s Disease Patients in the General Population. J. Prev. Alzheimers Dis. 2019, 6, 185–191. [CrossRef]
- Ghafouri-Fard, S.; Taheri, M.; Omrani, M.D.; Daaee, A.; Mohammad-Rahimi, H. Application of Artificial Neural Network for Prediction of Risk of Multiple Sclerosis Based on Single Nucleotide Polymorphism Genotypes. J. Mol. Neurosci. MN 2020, 70, 1081–1087. [CrossRef]
- LeCun, Y. (1987). Mod` elesconnexionistes de l’apprentissage. Ph.D.thesis, Universite de Paris VI.
- Bourlard H, Kamp Y. Auto-association by multilayer perceptrons and singular value decomposition. BiolCybern. 1988;59(4):291–294. [CrossRef]
- Kramer, M.A. (1991), Nonlinear principal component analysis using autoassociative neural networks. AIChE J., 37: 233-243. [CrossRef]
- Hinton, G. E. and Zemel, R. S. (1994). Autoencoders, minimumdescription length, and helmholtz free energy. In NIPS’1993.
- Courville, A. (2012). Unsupervised feature learning and deep learning: A review and new perspectives. ArXiv Preprint ArXiv:1206.5538. [CrossRef]
- https://en.wikipedia.org/wiki/Autoencoder (accessed on 10 June 2024).
- Brenner, D.J. the linear-quadratic model is an appropriate methodology for determining isoeffective doses at large doses per fraction. Semin. Radiat. Oncol. 2008, 18, 234–239. [CrossRef]
- Fowler, J.F. The linear-quadratic formula and progress in fractionated radiotherapy. Br. J. Radiol. 1989, 62, 679–694. [CrossRef]
- Higuchi, Y.; Serizawa, T.; Nagano, O.; Matsuda, S.; Ono, J.; Sato, M.; Iwadate, Y.; Saeki, N. Three-staged stereotactic radiotherapy without whole brain irradiation for large metastatic brain tumors. Int. J. Radiat. Oncol. Biol. Phys. 2009, 74, 1543–1548. [CrossRef]
- Rodríguez, P.; Bautista, M.A.; Gonzàlez, J.; Escalera, S. Beyond one-hot encoding: Lower dimensional target embedding. Image Vis. Comput. 2018, 75, 21–31. [CrossRef]
- Ahsan, M.M.; Mahmud, M.A.P.; Saha, P.K.; Gupta, K.D.; Siddique, Z. Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance. Technologies 2021, 9, 52. [CrossRef]
- Rao, A.; Monteiro, J.M.; Mourao-Miranda, J.; Alzheimer’s Disease Initiative. Predictive Modelling Using Neuroimaging Data in the Presence of Confounds. NeuroImage2017, 150, 23–49. [CrossRef]
- Cramer, H. Mathematical Methods of Statistics; Princeton Mathematical Series; Princeton University Press: Princeton, NJ, USA, 1946.
- Mukaka, M. A Guide to Appropriate Use of Correlation Coefficient in Medical Research. Malawi Med. J. J. Med. Assoc. Malawi 2012, 24, 69–71.
- Zychlinski, S. Dython: A Set of Data Tools in Python. Available online: http://shakedzy.xyz/dython (accessed on 20 June 2024).
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer,W.P. SMOTE: Synthetic Minority Over-Sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [CrossRef]
- Zhu, T.; Lin, Y.; Liu, Y. Synthetic Minority Oversampling Technique for Multiclass Imbalance Problems. Pattern Recognit. 2017, 72, 327–340. [CrossRef]
- Raschka, S.; Patterson, J.; Nolet, C. machine learning in python: main developments and technology trends in data science, machine learning, and artificial intelligence. Information 2020, 11, 193. [CrossRef]
- Bzdok, D.; Altman, N.; Krzywinski, M. Statistics versus Machine Learning. Nat. Methods 2018, 15, 233–234. [CrossRef]
- Zhang, Y.; Yang, Y. Cross-Validation for Selecting a Model Selection Procedure. J. Econom. 2015, 187, 95–112. [CrossRef]
- Franklin, J. The Elements of Statistical Learning: Data Mining, Inference and Prediction. Math. Intell. 2005, 27, 83–85. [CrossRef]
- https://www.researchgate.net/publication/380366289_Hybrid_Learning_Systems_Integrating_Traditional_Machine_Learning_with_Deep_learning_Techniques (accessed 15 June 2024).
- Baldi, P. (2012). Autoencoders, unsupervised learning, and deep architectures. Proceedings of ICML Workshop on Unsupervised and Transfer Learning, 37-49.
- Litjens, G., et al. (2017). A survey on deep learning in medical image analysis. Medical Image Analysis, 42, 60-88. [CrossRef]
- Esteva, A., et al. (2019). A guide to deep learning in healthcare. Nature Medicine, 25(1), 24-29. [CrossRef]
- Chen, T., &Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785-794. [CrossRef]
- Lundberg, S. M., et al. (2018). Explainable machine learning predictions for the prevention of hypoxaemia during surgery. Nature Biomedical Engineering, 2(10), 749-760. [CrossRef]
- Sahiner B, Pezeshk A, Hadjiiski LM, Wang X, Drukker K, Cha KH, Summers RM, Giger ML. Deep learning in medical imaging and radiation therapy. Med Phys. 2019 Jan;46(1):e1-e36. [CrossRef]
- Sanchez-Martinez S., Camara O. , Piella G., Maja C., Gonzales-Ballester M.Á., Miron M., Alfredo V., Gómez E., Fraser A.G., Bijnens B., Machine Learning for Clinical Decision-Making: Challenges and Opportunities in Cardiovascular Imaging, Frontiers in Cardiovascular Medicine,8, 2022. [CrossRef]
- Dietterich, T.G. (2000). Ensemble Methods in Machine Learning. In: Multiple Classifier Systems. MCS 2000. Lecture Notes in Computer Science, vol 1857. Springer, Berlin, Heidelberg. [CrossRef]
- Rudin, C. (2019). Stop explaining black box machine learning models for high-stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1(5), 206-215. [CrossRef]
- Rajkomar, A., et al. (2019). Machine learning in medicine. New England Journal of Medicine, 380, 1347-1358. [CrossRef]











| Category | Characteristics | Value |
| (S) | Age(yr) Median (range) |
64(39-85) |
| (S) | Sex Male #(%) Female #(%) |
45(58.44%) 32(41.56%) |
| C1yr – control over one year (cm3) Median(range) |
17* 0.9(0-30) |
|
| Patience with extra cranial MTS |
6* 54 |
|
| (T) | Receiving pre-treatment, systemic treatment | 68 |
| Deceased before 1 year | 1* 25 |
|
| (C) | KPS score 100 90 80 70 |
9* 26(33.77%) 8(10.39%) 22(28.57%) 12(15.58%) |
| The number of lesions Median(range) 1-3 4-6 7-10 >10 |
2(1-30) 52 12 8 5 |
|
| (R) | Beam on time on V1 (min/cm3) Median(range) |
0.82(0.47-2.33) |
| (R) | Beam on time on V2 (min/cm3) Median(range) |
1* 0.83(0.60-3.00) |
| (R) | Beam on time on V3 (min/cm3) Median(range) |
2* 0.83(0.46-4.00) |
| Total tumor volume (# of patients with) : < 5 cm3 <= 10 cm3 > 10 cm3 |
34 13 30 |
|
| (R) | Tumor dynamics (# of patients with) : - Progression - Regression |
6 71 |
| Model | Accuracy | Balanced Accuracy | ROC AUC | F1 Score | Time Taken |
| SVC | 1.00 | 1.00 | 1.00 | 1.00 | 0.02 |
| ExtraTreesClassifier | 1.00 | 1.00 | 1.00 | 1.00 | 0.15 |
| KNeighborsClassifier | 0.97 | 0.97 | 0.97 | 0.97 | 0.04 |
| LogisticRegression | 0.94 | 0.95 | 0.95 | 0.94 | 0.03 |
| XGBClassifier | 0.94 | 0.95 | 0.95 | 0.94 | 0.05 |
| RandomForestClassifier | 0.92 | 0.92 | 0.92 | 0.92 | 0.19 |
| Model | Parameters | Grid Search Space |
|
SVC |
C | 0.1, 1, 10, 100 |
| gamma | scale, auto | |
| kernel | linear, rbf | |
| KNeighborsClassifier | n_neighbors | 3, 5, 7, 10 |
| weights | uniform | |
| p | 1, 2 | |
| ExtraTreesClassifier | n_estimators | 100, 200 |
| min_samples_split | 2 | |
| max_depth | None, 10 | |
| min_samples_leaf | 1 | |
| max_features | sqrt | |
| LogisticRegression (LR) | penalty | l1,l2 |
| solver | liblinear | |
| C | 0.001, 0.01, 0.1, 1, 10, 100 | |
| max_iter | 1000, 5000 | |
| XGBClassifier | n_estimators | 100, 200, 300 |
| learning_rate | 0.01, 0.05, 0.1 | |
| max_depth | 3, 5 | |
| min_child_weight | 1 | |
| subsample | 0.8 | |
| colsample_bytree | 0.8 | |
| gamma | 0 | |
| RandomForestClassifier | n_estimators | 10, 50, 100, 200 |
| max_depth | None, 10, 20, 30 | |
| min_samples_split | 2, 5 | |
| min_samples_leaf | 1, 2 |
| Classification Report for Logistic Regression: | ||||
| precision | recall | F1-score | Support | |
| 0 | 1 | 0.96 | 0.98 | 24 |
| 1 | 0.96 | 1 | 0.98 | 23 |
| accuracy | 0.98 | 47 | ||
| macro avg | 0.98 | 0.98 | 0.98 | 47 |
| weighted avg | 0.98 | 0.98 | 0.98 | 47 |
| Classification Report for SVM: | ||||
| precision | recall | F1-score | Support | |
| 0 | 0.96 | 0.92 | 0.94 | 24 |
| 1 | 0.92 | 0.96 | 0.94 | 23 |
| accuracy | 0.94 | 47 | ||
| macro avg | 0.94 | 0.94 | 0.94 | 47 |
| weighted avg | 0.94 | 0.94 | 0.94 | 47 |
| Classification Report for KNN: | ||||
| precision | recall | F1-score | Support | |
| 0 | 0.95 | 0.88 | 0.91 | 24 |
| 1 | 0.88 | 0.96 | 0.92 | 23 |
| accuracy | 0.91 | 47 | ||
| macro avg | 0.92 | 0.92 | 0.91 | 47 |
| weighted avg | 0.92 | 0.91 | 0.91 | 47 |
| Classification Report for Extra Trees: | ||||
| precision | recall | F1-score | Support | |
| 0 | 1 | 1 | 1 | 24 |
| 1 | 1 | 1 | 1 | 23 |
| accuracy | 1 | 47 | ||
| macro avg | 1 | 1 | 1 | 47 |
| weighted avg | 1 | 1 | 1 | 47 |
| Classification Report for Random Forest: | ||||
| precision | recall | F1-score | Support | |
| 0 | 1 | 0.92 | 0.96 | 24 |
| 1 | 0.92 | 1 | 0.96 | 23 |
| accuracy | 0.96 | 47 | ||
| macro avg | 0.96 | 0.96 | 0.96 | 47 |
| weighted avg | 0.96 | 0.96 | 0.96 | 47 |
| Classification Report for XGBoost: | ||||
| precision | recall | F1-score | Support | |
| 0 | 1 | 0.96 | 0.98 | 24 |
| 1 | 0.96 | 1 | 0.98 | 23 |
| accuracy | 0.98 | 47 | ||
| macro avg | 0.98 | 0.98 | 0.98 | 47 |
| weighted avg | 0.98 | 0.98 | 0.98 | 47 |
| Model | Accuracy | AUC |
| Logistic Regression | 0.979 | 0.98 |
| SVM | 0.936 | 0.94 |
| KNN | 0.915 | 0.92 |
| Extra Trees | 0.979 | 0.98 |
| Random Forest | 1.000 | 1.00 |
| XGBoost | 0.979 | 0.98 |
| Classification Report for Logistic Regression: | |||||||
| precision | recall | F1-score | Support | ||||
| 0 | 1(1) | 0.88(0.83) | 0.93(0.91) | 24 | |||
| 1 | 0.88(0.85) | 1(1) | 0.94(0.92) | 23 | |||
| accuracy | 0.94(0.91) | 47 | |||||
| macro avg | 0.94(0.93) | 0.94(0.92) | 0.94(0.91) | 47 | |||
| weighted avg | 0.94(0.93) | 0.94(0.91) | 0.94(0.91) | 47 | |||
| Classification Report for SVM: | |||||||
| precision | recall | F1-score | Support | ||||
| 0 | 0.95(1) | 0.75(0.92) | 0.84(0.96) | 24 | |||
| 1 | 0.79(0.92) | 0.96(1) | 0.86(0.96) | 23 | |||
| accuracy | 0.85(0.96) | 47 | |||||
| macro avg | 0.87(0.96) | 0.85(0.96) | 0.85(0.96) | 47 | |||
| weighted avg | 0.87(0.96) | 0.85(0.96) | 0.85(0.96) | 47 | |||
| Classification Report for KNN: | |||||||
| precision | recall | F1-score | Support | ||||
| 0 | 0.95(1) | 0.79(0.83) | 0.86(0.91) | 24 | |||
| 1 | 0.81(0.85) | 0.96(1) | 0.88(0.92) | 23 | |||
| accuracy | 0.87(0.91) | 47 | |||||
| macro avg | 0.88(0.93) | 0.87(0.92) | 0.87(0.91) | 47 | |||
| weighted avg | 0.88(0.93) | 0.87(0.91) | 0.87(0.91) | 47 | |||
| Classification Report for Extra Trees: | |||||||
| precision | recall | F1-score | Support | ||||
| 0 | 0.96(0.96) | 0.96(1) | 0.96(0.98) | 24 | |||
| 1 | 0.96(1) | 0.96(0.96) | 0.96(0.98) | 23 | |||
| accuracy | 0.96(0.98) | 47 | |||||
| macro avg | 0.96(0.98) | 0.96(0.98) | 0.96(0.98) | 47 | |||
| weighted avg | 0.96(0.98) | 0.96(0.98) | 0.96(0.98) | 47 | |||
| Classification Report for Random Forest: | |||||||
| precision | recall | F1-score | Support | ||||
| 0 | 0.96(1) | 0.92(0.96) | 0.94(0.98) | 24 | |||
| 1 | 0.92(0.96) | 0.96(1) | 0.94(0.98) | 23 | |||
| accuracy | 0.94(0.98) | 47 | |||||
| macro avg | 0.94(0.98) | 0.94(0.98) | 0.94(0.98) | 47 | |||
| weighted avg | 0.94(0.98) | 0.94(0.98) | 0.94(0.98) | 47 | |||
| Classification Report for XGBoost: | |||||||
| precision | recall | F1-score | Support | ||||
| 0 | 0.96(0.96) | 0.92(0.92) | 0.94(0.94) | 24 | |||
| 1 | 0.92(0.92) | 0.96(0.96) | 0.94(0.94) | 23 | |||
| accuracy | 0.94(0.94) | 47 | |||||
| macro avg | 0.94(0.94) | 0.94(0.94) | 0.94(0.94) | 47 | |||
| weighted avg | 0.94(0.94) | 0.94(0.94) | 0.94(0.94) | 47 | |||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).