Submitted:
14 March 2024
Posted:
15 March 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Data Collection and Preprocessing
2.2. DL-DVE Architecture and Working for Sequence Generation
2.2.1. Comparison of Generative Models2.2.2. ConV1d
2.2.3. GRU
2.2.4. Simple RNN
2.3. DL-DVE Architecture and Working for Sequence Classification
3. Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Bartoszewicz, J. M.; Seidel, A.; Renard, B. Y. Interpretable Detection of Novel Human Viruses from Genome Sequencing Data. NAR Genomics Bioinforma. 2021, 3 (1), lqab004. [CrossRef]
- Kuno, G. A Re-Examination of the History of Etiologic Confusion between Dengue and Chikungunya. PLoS Negl. Trop. Dis. 2015, 9 (11), e0004101. [CrossRef]
- Mello-Román, J. D.; Mello-Román, J. C.; Gómez-Guerrero, S.; García-Torres, M. Predictive Models for the Medical Diagnosis of Dengue: A Case Study in Paraguay. Comput. Math. Methods Med. 2019, 2019, 1–7. [CrossRef]
- Pineda-Peña, A.-C.; Faria, N. R.; Imbrechts, S.; Libin, P.; Abecasis, A. B.; Deforche, K.; Gómez-López, A.; Camacho, R. J.; De Oliveira, T.; Vandamme, A.-M. Automated Subtyping of HIV-1 Genetic Sequences for Clinical and Surveillance Purposes: Performance Evaluation of the New REGA Version 3 and Seven Other Tools. Infect. Genet. Evol. 2013, 19, 337–348. [CrossRef]
- Rachata, N.; Charoenkwan, P.; Yooyativong, T.; Chamnongthal, K.; Lursinsap, C.; Higuchi, K. Automatic Prediction System of Dengue Haemorrhagic-Fever Outbreak Risk by Using Entropy and Artificial Neural Network. In 2008 International Symposium on Communications and Information Technologies; IEEE: Vientiane, Laos, 2008; pp 210–214. [CrossRef]
- Kosakovsky Pond, S. L.; Posada, D.; Stawiski, E.; Chappey, C.; Poon, A. F. Y.; Hughes, G.; Fearnhill, E.; Gravenor, M. B.; Leigh Brown, A. J.; Frost, S. D. W. An Evolutionary Model-Based Algorithm for Accurate Phylogenetic Breakpoint Mapping and Subtype Prediction in HIV-1. PLoS Comput. Biol. 2009, 5 (11), e1000581. [CrossRef]
- Altschul, S. F.; Gish, W.; Miller, W.; Myers, E. W.; Lipman, D. J. Basic Local Alignment Search Tool. J. Mol. Biol. 1990, 215 (3), 403–410. [CrossRef]
- Olson, R. D.; Assaf, R.; Brettin, T.; Conrad, N.; Cucinell, C.; Davis, J. J.; Dempsey, D. M.; Dickerman, A.; Dietrich, E. M.; Kenyon, R. W.; Kuscuoglu, M.; Lefkowitz, E. J.; Lu, J.; Machi, D.; Macken, C.; Mao, C.; Niewiadomska, A.; Nguyen, M.; Olsen, G. J.; Overbeek, J. C.; Parrello, B.; Parrello, V.; Porter, J. S.; Pusch, G. D.; Shukla, M.; Singh, I.; Stewart, L.; Tan, G.; Thomas, C.; VanOeffelen, M.; Vonstein, V.; Wallace, Z. S.; Warren, A. S.; Wattam, A. R.; Xia, F.; Yoo, H.; Zhang, Y.; Zmasek, C. M.; Scheuermann, R. H.; Stevens, R. L. Introducing the Bacterial and Viral Bioinformatics Resource Center (BV-BRC): A Resource Combining PATRIC, IRD and ViPR. Nucleic Acids Res. 2023, 51 (D1), D678–D689. [CrossRef]
- Cui, F.; Zhang, Z.; Zou, Q. Sequence Representation Approaches for Sequence-Based Protein Prediction Tasks That Use Deep Learning. Brief. Funct. Genomics 2021, 20 (1), 61–73. [CrossRef]
- Dasari, C. M.; Bhukya, R. Explainable Deep Neural Networks for Novel Viral Genome Prediction. Appl. Intell. 2022, 52 (3), 3002–3017. [CrossRef]
- Bailey, T. L.; Boden, M.; Buske, F. A.; Frith, M.; Grant, C. E.; Clementi, L.; Ren, J.; Li, W. W.; Noble, W. S. MEME SUITE: Tools for Motif Discovery and Searching. Nucleic Acids Res. 2009, 37 (Web Server), W202–W208. [CrossRef]
- Choong, A. C. H.; Lee, N. K. Evaluation of Convolutionary Neural Networks Modeling of DNA Sequences Using Ordinal versus One-Hot Encoding Method. In 2017 International Conference on Computer and Drone Applications (IConDA); IEEE: Kuching, 2017; pp 60–65. [CrossRef]
- Langton, J.; Srihasam, K.; Jiang, J. Comparison of Machine Learning Methods for Multi-Label Classification of Nursing Education and Licensure Exam Questions. In Proceedings of the 3rd Clinical Natural Language Processing Workshop; Association for Computational Linguistics: Online, 2020; pp 85–93. [CrossRef]
- Shim, H. Futuristic Methods in Virus Genome Evolution Using the Third-Generation DNA Sequencing and Artificial Neural Networks. In Global Virology III: Virology in the 21st Century; Shapshak, P., Balaji, S., Kangueane, P., Chiappelli, F., Somboonwit, C., Menezes, L. J., Sinnott, J. T., Eds.; Springer International Publishing: Cham, 2019; pp 485–513. [CrossRef]
- Ali, S.; Sahoo, B.; Zelikovsky, A.; Chen, P.-Y.; Patterson, M. Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence Classification. Sci. Rep. 2023, 13 (1), 4154. [CrossRef]
- Basu, S.; Campbell, R. H. Classifying COVID-19 Variants Based on Genetic Sequences Using Deep Learning Models. In System Dependability and Analytics; Wang, L., Pattabiraman, K., Di Martino, C., Athreya, A., Bagchi, S., Eds.; Springer Series in Reliability Engineering; Springer International Publishing: Cham, 2023; pp 347–360. [CrossRef]
- De Souza, L. C.; Azevedo, K. S.; De Souza, J. G.; Barbosa, R. D. M.; Fernandes, M. A. C. New Proposal of Viral Genome Representation Applied in the Classification of SARS-CoV-2 with Deep Learning. BMC Bioinformatics 2023, 24 (1), 92. [CrossRef]
- Manoharan, S. N.; Kumar, K. M. V. M.; Vadivelan, N. A Novel CNN-TLSTM Approach for Dengue Disease Identification and Prevention Using IoT-Fog Cloud Architecture. Neural Process. Lett. 2023, 55 (2), 1951–1973. [CrossRef]
- Majeed, M. A.; Shafri, H. Z. M.; Zulkafli, Z.; Wayayok, A. A Deep Learning Approach for Dengue Fever Prediction in Malaysia Using LSTM with Spatial Attention. Int. J. Environ. Res. Public. Health 2023, 20 (5), 4130. [CrossRef]
- Nguyen, V.-H.; Tuyet-Hanh, T. T.; Mulhall, J.; Minh, H. V.; Duong, T. Q.; Chien, N. V.; Nhung, N. T. T.; Lan, V. H.; Minh, H. B.; Cuong, D.; Bich, N. N.; Quyen, N. H.; Linh, T. N. Q.; Tho, N. T.; Nghia, N. D.; Anh, L. V. Q.; Phan, D. T. M.; Hung, N. Q. V.; Son, M. T. Deep Learning Models for Forecasting Dengue Fever Based on Climate Data in Vietnam. PLoS Negl. Trop. Dis. 2022, 16 (6), e0010509. [CrossRef]
- Nadda, W.; Boonchieng, W.; Boonchieng, E. Influenza, Dengue and Common Cold Detection Using LSTM with Fully Connected Neural Network and Keywords Selection. BioData Min. 2022, 15 (1), 5. [CrossRef]
- Doni, A.; Sasipraba, T. LSTM-RNN Based Approach for Prediction of Dengue Cases in India. Ingénierie Systèmes Inf. 2020, 25 (3), 327–335. [CrossRef]
- Zhao, X.; Li, K.; Ang, C. K. E.; Cheong, K. H. A Deep Learning Based Hybrid Architecture for Weekly Dengue Incidences Forecasting. Chaos Solitons Fractals 2023, 168, 113170. [CrossRef]
- Gunasekaran, H.; Ramalakshmi, K.; Rex Macedo Arokiaraj, A.; Deepa Kanmani, S.; Venkatesan, C.; Suresh Gnana Dhas, C. Analysis of DNA Sequence Classification Using CNN and Hybrid Models. Comput. Math. Methods Med. 2021, 2021, 1–12. [CrossRef]
- Helaly, M. A.; Rady, S.; Aref, M. M. Convolutional Neural Networks for Biological Sequence Taxonomic Classification: A Comparative Study. In Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2019; Hassanien, A. E., Shaalan, K., Tolba, M. F., Eds.; Advances in Intelligent Systems and Computing; Springer International Publishing: Cham, 2020; Vol. 1058, pp 523–533. [CrossRef]
- Ao, C.; Jiao, S.; Wang, Y.; Yu, L.; Zou, Q. Biological Sequence Classification: A Review on Data and General Methods. Research 2022, 2022, 0011. [CrossRef]
- Pérez-Enciso; Zingaretti. A Guide for Using Deep Learning for Complex Trait Genomic Prediction. Genes 2019, 10 (7), 553. [CrossRef]
- Tsai, S.-T.; Kuo, E.-J.; Tiwary, P. Learning Molecular Dynamics with Simple Language Model Built upon Long Short-Term Memory Neural Network. Nat. Commun. 2020, 11 (1), 5115. [CrossRef]
- Haga, H.; Sato, H.; Koseki, A.; Saito, T.; Okumoto, K.; Hoshikawa, K.; Katsumi, T.; Mizuno, K.; Nishina, T.; Ueno, Y. A Machine Learning-Based Treatment Prediction Model Using Whole Genome Variants of Hepatitis C Virus. PLOS ONE 2020, 15 (11), e0242028. [CrossRef]
- Lv, Z.; Ao, C.; Zou, Q. Protein Function Prediction: From Traditional Classifier to Deep Learning. PROTEOMICS 2019, 19 (14), 1900119. [CrossRef]
- Ahmad, I.; Iqbal, M. J.; Basheri, M. Biological Data Classification and Analysis Using Convolutional Neural Network. J. Med. Imaging Health Inform. 2020, 10 (10), 2459–2465. [CrossRef]
- Murad, T.; Ali, S.; Khan, I.; Patterson, M. Spike2CGR: An Efficient Method for Spike Sequence Classification Using Chaos Game Representation. Mach. Learn. 2023, 112 (10), 3633–3658. [CrossRef]
- Dieng, I.; Cunha, M. D. P.; Diagne, M. M.; Sembène, P. M.; Zanotto, P. M. D. A.; Faye, O.; Faye, O.; Sall, A. A. Origin and Spread of the Dengue Virus Type 1, Genotype V in Senegal, 2015–2019. Viruses 2021, 13 (1), 57. [CrossRef]
- Sánchez-González, G.; Belak, Z. R.; Lozano, L.; Condé, R. Probability of Consolidation Constrains Novel Serotype Emergence in Dengue Fever Virus. PLOS ONE 2021, 16 (4), e0248765. [CrossRef]
- Katzelnick, L. C.; Fonville, J. M.; Gromowski, G. D.; Arriaga, J. B.; Green, A.; James, S. L.; Lau, L.; Montoya, M.; Wang, C.; VanBlargan, L. A.; Russell, C. A.; Thu, H. M.; Pierson, T. C.; Buchy, P.; Aaskov, J. G.; Muñoz-Jordán, J. L.; Vasilakis, N.; Gibbons, R. V.; Tesh, R. B.; Osterhaus, A. D. M. E.; Fouchier, R. A. M.; Durbin, A.; Simmons, C. P.; Holmes, E. C.; Harris, E.; Whitehead, S. S.; Smith, D. J. Dengue Viruses Cluster Antigenically but Not as Discrete Serotypes. Science 2015, 349 (6254), 1338–1343. [CrossRef]
- Srionrod, N.; Nooroong, P.; Poolsawat, N.; Minsakorn, S.; Watthanadirek, A.; Junsiri, W.; Sangchuai, S.; Chawengkirttikul, R.; Anuracpreeda, P. Molecular Characterization and Genetic Diversity of Babesia Bovis and Babesia Bigemina of Cattle in Thailand. Front. Cell. Infect. Microbiol. 2022, 12, 1065963. [CrossRef]







| Sr. no. | Model | Accuracy(%) | Epoch | Sequence generated (length) |
|---|---|---|---|---|
| 1 | ConV1d | 29 | 10 | <10kb |
| 2 | GRU | 35 | 10 | <10kb |
| 3 | Simple RNN | 30 | 10 | <10kb |
| 4 | Our proposed LSTM model | 37 | 10 | >10kb |
| 93 | 15 | >10kb |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).