PreprintArticleVersion 1Preserved in Portico This version is not peer-reviewed
Use of 3-Way Voting of Machine Learning Algorithms Improves Prediction Performance of the Efficacy of Antisense-Mediated Exon Skipping and Reduces the Computational Burden
Version 1
: Received: 8 March 2023 / Approved: 9 March 2023 / Online: 9 March 2023 (04:43:55 CET)
How to cite:
Zhu, A.; Chiba, S.; Shimizu, Y.; Kunitake, K.; Okuno, Y.; Aoki, Y.; Yokota, T. Use of 3-Way Voting of Machine Learning Algorithms Improves Prediction Performance of the Efficacy of Antisense-Mediated Exon Skipping and Reduces the Computational Burden. Preprints2023, 2023030167. https://doi.org/10.20944/preprints202303.0167.v1.
Zhu, A.; Chiba, S.; Shimizu, Y.; Kunitake, K.; Okuno, Y.; Aoki, Y.; Yokota, T. Use of 3-Way Voting of Machine Learning Algorithms Improves Prediction Performance of the Efficacy of Antisense-Mediated Exon Skipping and Reduces the Computational Burden. Preprints 2023, 2023030167. https://doi.org/10.20944/preprints202303.0167.v1.
Cite as:
Zhu, A.; Chiba, S.; Shimizu, Y.; Kunitake, K.; Okuno, Y.; Aoki, Y.; Yokota, T. Use of 3-Way Voting of Machine Learning Algorithms Improves Prediction Performance of the Efficacy of Antisense-Mediated Exon Skipping and Reduces the Computational Burden. Preprints2023, 2023030167. https://doi.org/10.20944/preprints202303.0167.v1.
Zhu, A.; Chiba, S.; Shimizu, Y.; Kunitake, K.; Okuno, Y.; Aoki, Y.; Yokota, T. Use of 3-Way Voting of Machine Learning Algorithms Improves Prediction Performance of the Efficacy of Antisense-Mediated Exon Skipping and Reduces the Computational Burden. Preprints 2023, 2023030167. https://doi.org/10.20944/preprints202303.0167.v1.
Abstract
Antisense oligonucleotide (ASO)-mediated exon skipping has emerged as a powerful tool for examining the function of genes and exons in basic research, as well as gene therapy. Computational methods, such as eSkip-Finder, have been developed to predict the efficacy of ASOs via exon skipping using machine learning. However, these methods can be computationally demanding and the prediction accuracy of the tool is not yet optimal. In this study, we propose an approach to reduce computational burden and improve prediction performance by utilizing feature selection within machine learning algorithms and employing ensemble learning techniques. The method was evaluated using a dataset of genes with experimentally validated exon skipping events. The dataset was divided into training and testing sets to assess the accuracy of the algorithm. Our results demonstrate that using a 3-way voting approach with random forest, gradient boosting, and XGBoost can significantly reduce computation time to under ten seconds while improving prediction performance, as measured by R2 for both 2’-O-methyl nucleotides (2OMe) and phosphorodiamidate morpholino oligomers (PMOs). Additionally, the feature importance ranking derived from our approach is in good agreement with previously published results. These findings suggest that this approach has the potential to enhance the efficiency and accuracy of predicting ASO efficacy via exon skipping, facilitating the development of novel therapeutic strategies.
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.