Preprint Concept Paper Version 1 Preserved in Portico This version is not peer-reviewed

Genesis of Non-coding RNA Genes- A Sequence Connection with Protein Genes Separated by Evolutionary Time

Version 1 : Received: 19 July 2020 / Approved: 20 July 2020 / Online: 20 July 2020 (04:39:41 CEST)

A peer-reviewed article of this Preprint also exists.

Journal reference: Non-coding RNA 2020
DOI: 10.3390/ncrna6030036


A small phylogenetically conserved sequence of 11,231 bp termed FAM247 is repeated in human chromosome 22 by segmental duplications. This sequence forms part of diverse genes that span evolutionary time, the protein genes being the earliest as they are present in zebrafish and/or mice genomes, the long non-coding RNA genes and pseudogenes the most recent as they appear to be present only in the human genome. We propose that the conserved sequence provides a nucleation site for new gene development at evolutionary conserved chromosomal loci where the FAM247 sequences reside. The FAM247 sequence also carries information in its open reading frames that provides protein exon amino acid sequences; one exon plays an integral role in immune system regulation, specifically, the function of ubiquitin specific protease (USP18) in the regulation of interferon. An analysis of this multifaceted sequence and the genesis of genes that contain it are presented.


gene evolution; gene formation; long non-coding RNA genes; pseudogenes; USP18; GGT5



Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our diversity statement.

Leave a public comment
Send a private comment to the author(s)
Views 0
Downloads 0
Comments 0
Metrics 0

Notify me about updates to this article or when a peer-reviewed version is published.

We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.