Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Improved Prediction of Antisense Oligonucleotide Efficacy for Exon Skipping Using Ensemble Learning and Feature Selection

Version 1 : Received: 21 March 2023 / Approved: 22 March 2023 / Online: 22 March 2023 (03:24:48 CET)

A peer-reviewed article of this Preprint also exists.

Zhu, A.; Chiba, S.; Shimizu, Y.; Kunitake, K.; Okuno, Y.; Aoki, Y.; Yokota, T. Ensemble-Learning and Feature Selection Techniques for Enhanced Antisense Oligonucleotide Efficacy Prediction in Exon Skipping. Pharmaceutics 2023, 15, 1808. Zhu, A.; Chiba, S.; Shimizu, Y.; Kunitake, K.; Okuno, Y.; Aoki, Y.; Yokota, T. Ensemble-Learning and Feature Selection Techniques for Enhanced Antisense Oligonucleotide Efficacy Prediction in Exon Skipping. Pharmaceutics 2023, 15, 1808.

Abstract

Antisense oligonucleotide (ASO)-mediated exon skipping has become a valuable tool for investigating gene function and developing gene therapy. Machine learning-based computational methods such as eSkip-Finder have been developed to predict the efficacy of ASOs via exon skipping. However, these methods are computationally demanding, and the accuracy of predictions remains suboptimal. In this study, we propose a new approach to reduce computational burden and improve prediction performance by using feature selection within machine learning algorithms and ensemble learning techniques. We evaluated our approach using a dataset of experimentally validated exon skipping events, dividing it into training and testing sets. Our results demonstrate that using a 3-way voting approach with random forest, gradient boosting, and XGBoost can significantly reduce computation time to under ten seconds while improving prediction performance, as measured by R2 for both 2’-O-methyl nucleotides (2OMe) and phosphorodiamidate morpholino oligomers (PMOs). Additionally, the feature importance ranking derived from our approach is in good agreement with previously published results. Our findings suggest that our approach has the potential to enhance the accuracy and efficiency of predicting ASO efficacy via exon skipping. It could also facilitate the development of novel therapeutic strategies. This study could contribute to the ongoing efforts to improve ASO design and optimize gene therapy approaches.

Keywords

antisense oligonucleotides; exon skipping; machine learning; ensemble learning; personalized medicine; n-of-1 therapy, splice switching; genetic disease; splicing; RNA

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.