Preprint Review Version 1 Preserved in Portico This version is not peer-reviewed

Data Science in Unveiling COVID-19 Pathogenesis and Diagnosis: Evolutionary Origin to Drug Repurposing

Version 1 : Received: 12 August 2020 / Approved: 14 August 2020 / Online: 14 August 2020 (11:01:56 CEST)

A peer-reviewed article of this Preprint also exists.

Jayanta Kumar Das, Giuseppe Tradigo, Pierangelo Veltri, Pietro H Guzzi, Swarup Roy, Data science in unveiling COVID-19 pathogenesis and diagnosis: evolutionary origin to drug repurposing, Briefings in Bioinformatics, Volume 22, Issue 2, March 2021, Pages 855–872, https://doi.org/10.1093/bib/bbaa420 Jayanta Kumar Das, Giuseppe Tradigo, Pierangelo Veltri, Pietro H Guzzi, Swarup Roy, Data science in unveiling COVID-19 pathogenesis and diagnosis: evolutionary origin to drug repurposing, Briefings in Bioinformatics, Volume 22, Issue 2, March 2021, Pages 855–872, https://doi.org/10.1093/bib/bbaa420

Abstract

The outbreak of novel Coronavirus (SARS-COV-2 ) disease (COVID-19) in Wuhan has attracted worldwide attention. SARS-COV-2 known to share a similar clinical manifestation that includes various symptoms such as pneumonia, fever, breathing difficulty, and in particular, SARS-COV-2 also causes a severe in ammation state that leads to death. Consequently, massive and rapid research growth has been observed across the globe to elucidate the mechanisms of infections and disease progression in genotype and phenotype scale. Data Science is playing a pivotal role in in-silico analysis to draw hidden and novel insights about the SARS-COV-2 origin, pathogenesis, COVID-19 outbreak forecasting, medical diagnosis, and drug discovery. With the availability of multi-omics, radiological, biomolecular, and medical data urges to develop novel exploratory and predictive models or customise exiting learning models to t the current problem domain. The presence of many approaches generates the need for the systematic surveys to guide both data scientists and medical practitioners. We perform an elaborate study on the state-of-the-art data science method ologies in action to tackle the current pandemic scenario. We consider various active COVID-19 data analytics domains such as phylogeny analysis, SARS-COV-2 genome identi cation, protein structure prediction, host-viral protein interactomics, clinical imaging, epidemiological analysis, and most importantly (existing) drug discovery. We highlight types of data, their generation pipeline, and the data science models in use. We believe that the current study will give a detailed sketch of the road map towards handling COVID-19 like situation by leveraging data science in the future. We summarise our review focusing on prime challenges and possible future research directions .

Keywords

Data Science; Machine Learning; Deep Learning; Genomics; COVID-19; Drug Discovery; Image Analysis; Interactomics; Epidemiology

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.