Preprint Article Version 1 This version is not peer-reviewed

Constructing a Reference Genome in a Single Lab: The Possibility to Use Oxford Nanopore Technology

Version 1 : Received: 12 June 2019 / Approved: 13 June 2019 / Online: 13 June 2019 (09:26:20 CEST)

How to cite: Lee, Y.G.; Choi, S.C.; Kang, Y.; Kim, K.M.; Kang, C.; Kim, C. Constructing a Reference Genome in a Single Lab: The Possibility to Use Oxford Nanopore Technology. Preprints 2019, 2019060117 (doi: 10.20944/preprints201906.0117.v1). Lee, Y.G.; Choi, S.C.; Kang, Y.; Kim, K.M.; Kang, C.; Kim, C. Constructing a Reference Genome in a Single Lab: The Possibility to Use Oxford Nanopore Technology. Preprints 2019, 2019060117 (doi: 10.20944/preprints201906.0117.v1).

Abstract

The whole genome sequencing (WGS) has become a crucial tool to understand genome structure and genetic variation. The MinION sequencing of Oxford Nanopore Technologies (ONT) is an excellent approach for performing WGS and has advantages in comparison with other Next-Generation Sequencing (NGS): It is relatively inexpensive, portable, has simple library preparation, can be monitored in real-time, and has no theoretical limits on read length. Sorghum bicolor (L.) Moench is diploid (2n = 2x = 20) with a genome size of about 730 Mb, and its genome sequence information is released in the Phytozome database. Therefore, sorghum can be be used as a good reference. However, plant species have complex and large genomes compared to animals or microorganisms. As a result, complete genome sequencing is difficult for plant species. MinION sequencing that produces long-reads can be an excellent tool to overcome the weak assembly of short-reads generated from NGS by minimizing the generation of gaps or covering the repetitive sequence that appears on the plant genome. Here, we conducted the genome sequencing for S. bicolor cv. BTx623 using the MinION platform and obtained 895,678 reads and 17.9 gigabytes(Gb) (ca. 25X coverage of reference) from long-read sequence data. Through a de novo assembly using two different tools and mapped assembled contigs against the sorghum reference genome, a total of 6,124 contigs (covering 45.9%) were generated from Canu, and a total of 2,661 contigs (covering 50%) were generated from Minimap and Miniasm with a Racon pipeline. Our results provide a pipeline of long-read sequencing analysis for plant species using the MinION platform and a clue to determine the total sequencing scale for optimal coverage based on various genome sizes.

Subject Areas

sorghum; Canu; Miniasm; MinION; long-read sequencing

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our diversity statement.

Leave a public comment
Send a private comment to the author(s)
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.