Submitted:
13 August 2023
Posted:
15 August 2023
You are already at the latest version
Abstract
Keywords:
Introduction
Results
PacBio sequencing and data analysis
Identification of lncRNAs from alfalfa long and short read RNA-seq data
Sequence conservation of lncRNAs between species
Identification and characterizition of plastid lncRNAs

lncRNAs associated miRNAs

Small ORFs analysis of alfalfa lncRNAs


Discussion
Material and methods
Plant materials and sampling
Total RNA isolation and PACBIO library construction
Analysis of PacBio sequencing data
The pipeline to identify lncRNA from transcriptome data
Sequence conservation analysis of lncRNAs
Prediction of microRNA target mimics
Small ORFs analysis of alfalfa lncRNAs
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflflicts of Interest
References
- Wang, C.; Ma, B.L.; Yan, X.; Han, J.; Guo, Y.; Wang, Y.; Li, P. Yields of alfalfa varieties with different fall-dormancy levels in a temperate environment. Agronomy Journal 2009, 101, 1146–1152. [Google Scholar] [CrossRef]
- Li, Y.; Wan, L.; Bi, S.; Wan, X.; Li, Z.; Cao, J.; Tong, Z.; Xu, H.; He, F.; Li, X. Identification of drought-responsive microRNAs from roots and leaves of alfalfa by high-throughput sequencing, Genes 2017, 8.
- O’Rourke, J.A.; Fu, F.; Bucciarelli, B.; Yang, S.S.; Samac, D.A.; Lamb, J.F.S.; Monteros, M.J.; Gronwald, J.W.; Krom, N.; Li, J.; et al. The medicago sativa gene index 1. 2: a web-accessible gene expression atlas for investigating expression differences between medicago sativa subspecies. BMC Genomics 2015, 16, 1–17. [Google Scholar]
- Long, R.; Zhang, F.; Zhang, Z.; Li, M.; Chen, L.; Wang, X.; Liu, W.; Zhang, T.; Yu, L.X.; He, F.; et al. Genome assembly of alfalfa cultivar zhongmu-4 and identification of SNPs associated with agronomic traits. Genomics, Proteomics & Bioinformatics 2022. [CrossRef]
- Chen, H.; Zeng, Y.; Yang, Y.; Huang, L.; Tang, B.; Zhang, H.; Hao, F.; Liu, W.; Li, Y.; Liu, Y.; et al. Allele-aware chromosome-level genome assembly and efficient transgene-free genome editing for the autotetraploid cultivated alfalfa. Nat. Commun. 2020, 11, 2494. [Google Scholar] [CrossRef] [PubMed]
- Shen, C.; Du, H.; Chen, Z.; Lu, H.; Zhu, F.; Chen, H.; Meng, X.; Liu, Q.; Liu, P.; Zheng, L.; et al. The chromosome-level genome sequence of the autotetraploid alfalfa and resequencing of core germplasms provide genomic resources for alfalfa research. Molecular Plant 2020, 13, 1250–1261. [Google Scholar] [CrossRef] [PubMed]
- Chao, Y.; Yuan, J.; Guo, T.; Xu, L.; Mu, Z.; Han, L. Analysis of transcripts and splice isoforms in medicago sativa l. by single-molecule long-read sequencing. Plant Molecular Biology 2019, 99, 219–235. [Google Scholar]
- Wan, L.; Li, Y.; Li, S.; Li, X. Transcriptomic profling revealed genes involved in response to drought stress in alfalfa, Journal of plant growth regulation 2022, 41: 92-112.
- Ng, S.Y.; Lin, L.; Soh, B.S.; Stanton, L.W. Long noncoding RNAs in development and disease of the central nervous system. Trends Genet. 2013, 29, 461–468. [Google Scholar] [CrossRef]
- Song, X.; Sun, L.; Luo, H.; Ma, Q.; Zhao, Y.; Pei, D. Genome-Wide Identification and Characterization of Long Non-Coding RNAs from Mulberry (Morus notabilis) RNA-seq Data. Genes (Basel) 2016, 7, 11. [Google Scholar] [CrossRef]
- Grote, P.; Wittler, L.; Hendrix, D.; Koch, F.; Wahrisch, S.; Beisaw, A.; Macura, K.; Blass, G.; Kellis, M.; Werber, M.; et al. The tissue-specific lncRNA Fendrr is an essential regulator of heart and body wall development in the mouse. Dev. Cell 2013, 24, 206–214. [Google Scholar] [CrossRef]
- Mercer, T.R.; Dinger, M.E.; Sunkin, S.M.; Mehler, M.F.; Mattick, J.S. Specific expression of long noncoding RNAs in the mouse brain. Proc. Natl. Acad. Sci. 2008, 105, 716–721. [Google Scholar] [CrossRef]
- Derrien, T.; Johnson, R.; Bussotti, G.; Tanzer, A.; Djebali, S.; Tilgner, H.; Guernec, G.; Martin, D.; Merkel, A.; Knowles, D.G.; et al. The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression. Genome Res. 2012 22, 1775–1789. [CrossRef]
- Swiezewski, S.; Liu, F.; Magusin, A.; Dean, C. Cold-induced silencing by long antisense transcripts of an arabidopsis polycomb target. Nature 2009, 462, 799–802. [Google Scholar] [CrossRef] [PubMed]
- Bi, X. Functions of chromatin remodeling factors in heterochromatin formation and maintenance. Sci. China Life Sci. 2012, 55, 89−96. [Google Scholar] [CrossRef] [PubMed]
- Zhou, H.; Liu, Q.J.; Li, J.; Jiang, D.G.; Zhou, L.Y.; Wu, P.; Lu, S.; Li, F.; Zhu, L.Y.; Liu, Z.L.; et al. Photoperiod- and thermo-sensitive genic male sterility in rice are caused by a point mutation in a novel noncoding RNA that produces a small RNA. Cell Res. 2012, 22, 649–60. [Google Scholar] [CrossRef]
- Camblong, J. ; Beyrouthy, N,; Guffanti, E. ; Schlaepfer, G.; Steinmetz, L.M.; Stutz, F. Trans-acting antisense RNAs mediate transcriptional gene cosuppression in S. cerevisiae. Genes Dev. 2009, 23, 1534–1545. [Google Scholar]
- Shin, H.; Shin, H.S.; Chen, R.; Harrison, M.J. Loss of At4 function impacts phosphate distribution between the roots and the shoots during phosphate starvation. Plant J. 2006, 45, 712−726. [Google Scholar] [CrossRef]
- Pauli, A.; Norris, M.L.; Valen, E.; Chew, G.L.; Gagnon, J.A.; Zimmerman, S.; Mitchell, A.; Ma, J.; Dubrulle, J.; Reyon, D.; et al. Toddler: an embryonic signal that promotes cell movement via Apelin receptors. Science 2014, 343, 1248636. [Google Scholar] [CrossRef] [PubMed]
- Anderson, D.M.; Anderson, K.M.; Chang, C.L.; Makarewich, C.A.; Nelson, B.R.; McAnally, J.R.; Kasaragod, P.; Shelton, J.M.; Liou, J.; Bassel-Duby, R.; et al. A micropeptide encoded by a putative long noncoding RNA regulates muscle performance. Cell 2015, 160, 595–606. [Google Scholar] [CrossRef]
- Nelson, B.R.; Makarewich, C.A.; Anderson, D.M.; Winders, B.R.; Troupes, C.D.; Wu, F.F.; Reese, A.L.; McAnally, J.R.; Chen, X.W.; Kavalali, E.T.; et al. A peptide encoded by a transcript annotated as long noncoding RNA enhances SERCA activity in muscle. Science, 2016, 351, 271–275. [Google Scholar] [CrossRef]
- Crespi, M.D.; Jurkevitch, E.; Poiret, M.; d’Aubenton-Carafa, Y.; Petrovics, G.; Kondorosi, E.; Kondorosi, A. enod40, a gene expressed during nodule organogenesis, codes for a non-translatable RNA involved in plant growth. EMBO J. 1994, 13, 5099−5112. [Google Scholar] [CrossRef]
- Wang, T.Z.; Liu, M.; Zhao, M.G.; Chen, R.; Zhang, W.H. Identification and characterization of long non-coding RNAs involved in osmotic and salt stress in Medicago truncatula using genome-wide high-throughput sequencing. BMC Plant Biol. 2015, 15, 131. [Google Scholar] [CrossRef]
- Liu, C.; Bai, B.; Geir, S.; Lun, C.; Deng, W.; Zhang, Y.; Bu, D.; Zhao, Y.; Chen, R. Noncode: an integrated knowledge database of non-coding RNAs. Nucleic Acids Research 2005, 33, D112–D115. [Google Scholar] [CrossRef] [PubMed]
- Gao, S.; Tian, X.; Chang, H.; Sun, Y.; Wu, Z.; Cheng, Z.; Dong, P.; Zhao, Q.; Ruan, J.; Bu, W. Two novel lncrnas discovered in human mitochondrial dna using pacbio full-length transcriptome data. Mitochondrion 2018, 38, 41–47. [Google Scholar] [CrossRef] [PubMed]
- Mccarthy, A. Third generation dna sequencing: pacific biosciences’ single molecule real time technology. Chemistry & Biology 2010, 17, 675–676. [Google Scholar]
- Li, W.; Godzik, A. CD-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences”, Weizhong Li & Adam Godzik Bioinformatics 2006, 22, 1658-1659.
- Kang, Y.J.; Yang, D.C.; Kong, L.; Hou, M.; Meng, Y.Q.; Wei, L.; Gao, G. CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features. Nucleic acids research 2017, 45, W12–W16. [Google Scholar] [CrossRef] [PubMed]
- Li, A.; Zhang, J.; Zhou, Z. PLEK: a tool for predicting long non-coding rnas and messenger rnas based on an improved k-mer scheme. BMC Bioinformatics 2014, 15, 311. [Google Scholar] [CrossRef]
- Chen, C.; Chen, H.; Zhang, Y.; Thomas, H. R.; Frank, M. H.; He, Y.; Xia, R. Tbtools: an integrative toolkit developed for interactive analyses of big biological data. Molecular Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
- Lavorgna, G.; Guffanti, A.; Borsani, G.; Ballabio, A.; Boncinelli, E. Targetfinder: searching annotated sequence databases for target genes of transcription factors. Bioinformatics 1999, 15, 172–173. [Google Scholar] [CrossRef]
- Rombel, I.T.; Sykes, K.F.; Rayner, S.; Johnston, S. A. Orf-finder: a vector for high-throughput gene identification. Gene 2002, 282, 33–41. [Google Scholar] [CrossRef]
- Zhu, M.; Gribskov, M. MiPepid: MicroPeptide identification tool using machine learning. BMC Bioinformatics 2019, 20, 559. [Google Scholar] [CrossRef]




| Terms | Number |
|---|---|
| Reads of insert | 1,089,299 |
| 5′ prime reads | 533,904 |
| 3′ prime reads | 569,127 |
| Poly-A reads | 549,977 |
| Filtered short reads | 665 |
| Non-full-length reads | 687,477 |
| Full-length reads | 401,157 |
| Full-length non-chimeric reads | 391,677 |
| Average length of full-length non-chimeric reads | 2300.8 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).