Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Use of 3-Way Voting of Machine Learning Algorithms Improves Prediction Performance of the Efficacy of Antisense-Mediated Exon Skipping and Reduces the Computational Burden

Version 1 : Received: 8 March 2023 / Approved: 9 March 2023 / Online: 9 March 2023 (04:43:55 CET)

How to cite: Zhu, A.; Chiba, S.; Shimizu, Y.; Kunitake, K.; Okuno, Y.; Aoki, Y.; Yokota, T. Use of 3-Way Voting of Machine Learning Algorithms Improves Prediction Performance of the Efficacy of Antisense-Mediated Exon Skipping and Reduces the Computational Burden. Preprints 2023, 2023030167. https://doi.org/10.20944/preprints202303.0167.v1 Zhu, A.; Chiba, S.; Shimizu, Y.; Kunitake, K.; Okuno, Y.; Aoki, Y.; Yokota, T. Use of 3-Way Voting of Machine Learning Algorithms Improves Prediction Performance of the Efficacy of Antisense-Mediated Exon Skipping and Reduces the Computational Burden. Preprints 2023, 2023030167. https://doi.org/10.20944/preprints202303.0167.v1

Abstract

Antisense oligonucleotide (ASO)-mediated exon skipping has emerged as a powerful tool for examining the function of genes and exons in basic research, as well as gene therapy. Computational methods, such as eSkip-Finder, have been developed to predict the efficacy of ASOs via exon skipping using machine learning. However, these methods can be computationally demanding and the prediction accuracy of the tool is not yet optimal. In this study, we propose an approach to reduce computational burden and improve prediction performance by utilizing feature selection within machine learning algorithms and employing ensemble learning techniques. The method was evaluated using a dataset of genes with experimentally validated exon skipping events. The dataset was divided into training and testing sets to assess the accuracy of the algorithm. Our results demonstrate that using a 3-way voting approach with random forest, gradient boosting, and XGBoost can significantly reduce computation time to under ten seconds while improving prediction performance, as measured by R2 for both 2’-O-methyl nucleotides (2OMe) and phosphorodiamidate morpholino oligomers (PMOs). Additionally, the feature importance ranking derived from our approach is in good agreement with previously published results. These findings suggest that this approach has the potential to enhance the efficiency and accuracy of predicting ASO efficacy via exon skipping, facilitating the development of novel therapeutic strategies.

Keywords

antisense oligonucleotides; exon skipping; machine learning; ensemble learning; personalized medicine; n-of-1 therapy; splice switching; genetic disease; splicing; RNA

Subject

Biology and Life Sciences, Biology and Biotechnology

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.