Computational Approaches to Functionally Annotate Long Noncoding RNA (lncRNA)

Yashpal Ramakrishnaiah; Levin Kuhlmann; Sonika Tyagi

doi:10.20944/preprints202006.0116.v1

Submitted:

08 June 2020

Posted:

09 June 2020

You are already at the latest version

Abstract

Long noncoding RNA (lncRNA) are implicated in various genetic diseases and cancer, attributed to their critical role in gene regulation. RNA sequencing is used to capture their transcripts from certain cell types or conditions. For some studies, lncRNA interactions with other biomolecules have also been captured, which can give clues to their mechanisms of action. Complementary \textit{in silico} methods have been proposed to predict non-coding nature of transcripts and to analyze available RNA interaction data. Here we provide a critical review of such methods and identify associated challenges. Broadly, these can be categorized as reference-based and reference-free or \textit{ab initio}, with the former category of methods requiring a comprehensive annotated reference. The \textit{ab initio} methods can make use of machine learning classifiers that are trained on features extracted from sequences, making them suitable to predict novel transcripts, especially in non-model species. Machine learning approaches such as Logistic Regression, Support Vector Machines, Random Forest, and Deep Learning are commonly used. Initial approaches relied on basic sequential features to train the model, whereas the use of secondary structural features appears to be a promising approach for functional annotation. However, adding secondary features will result in model complexities, thus demanding an algorithm that can handle it and furthermore, considerably increasing the utilization of computation resources. Computational strategies combining identification and functional annotation which can be easily customized are currently lacking. These can be of immense value to accelerate research in this class of RNAs.

Keywords:

noncoding RNA

;

lncRNA

;

epigenomics

;

gene regulation

;

machine learning

;

bioinformatics

Subject:

Biology and Life Sciences - Biochemistry and Molecular Biology

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Computational Approaches to Functionally Annotate Long Noncoding RNA (lncRNA)

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe