Submitted:
13 June 2024
Posted:
18 June 2024
You are already at the latest version
Abstract
Keywords:
1. Summary
2. Data Description
2.1. Genome Size and K-mer Analysis
2.2. Assembly De Novo, Reference-Assisted Scaffolding and Validation
| Statistic | Contigs | Scaffolds |
|---|---|---|
| N50 | 8,350 | 108,009,562 |
| N90 | 2,324 | 52,732,619 |
| L50 | 86,076 | 11 |
| L90 | 307,212 | 26 |
| Largest contig | 108,709 | 163,170,176 |
| Total length | 2,582,951,014 | 2,735,224,841 |
| GC (%) | 41.9 | 41.87 |
| # contigs (≥1000 bp) | 419,259 | 14,156 |
| # contigs (≥5000 bp) | 172,406 | 1,754 |
| # contigs (≥10,000 bp) | 63,792 | 623 |
| # contigs (≥25,000 bp) | 7,999 | 161 |
| # contigs (≥50,000 bp) | 432 | 68 |
| # N’s per 100 kbp | 0 | 4820.63 |
| Terms | Contigs | Scaffold |
|---|---|---|
| Complete BUSCOs | 4,025 (43.6%) | 8,831 (95.7%) |
| Complete and single-copy BUSCOs | 3,921 (42.5%) | 8,688 (94.2%) |
| Complete and duplicated BUSCOs | 104 (1.1%) | 143 (1.5%) |
| Fragmented BUSCOs | 2,058 (22.3%) | 194 (2.1%) |
| Missing BUSCOs | 3,143 (34.1%) | 201 (2.2%) |
2.3. SSR Data Mining
3. Methods
3.1. Sample Collection and DNA Extraction
3.2. DNA Sequencing and Estimation of Genome Size
3.3. De Novo Assembly and Validation
3.4. SSR-Mining
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Bickhart, D.M.; The Bovine Pan-Genome Consortium. Nick Bickhart’s. Available online: https://njdbickhart.github.io/ (accessed on 5 June 2024).
- Rosen, B.D.; Bickhart, D.M.; Schnabel, R.D.; Koren, S.; Elsik, C.G.; Tseng, E.; Rowan, T.N.; Low, W.Y.; Zimin, A.; Couldrey, C.; et al. De novo assembly of the cattle reference genome with single-molecule sequencing. GigaScience 2020, 9, 1–9. [Google Scholar] [CrossRef]
- Heaton, M.P.; Smith, T.P.L.; Bickhart, D.M.; Vander Ley, B.L.; Kuehn, L.A.; Oppenheimer, J.; Shafer, W.R.; Schuetze, F.T.; Stroud, B.; McClure, J.C.; et al. A Reference Genome Assembly of Simmental Cattle, Bos taurus taurus. J. Hered. 2021, 112, 184–191. [Google Scholar] [CrossRef] [PubMed]
- Koç, A. A review on Simmental Raising: 1. Simmental raising in the World and in Turkey, Adü Ziraat Derg., 2016, 2, 97–102. [CrossRef]
- AgroPerú. Avanza la crianza de vacunos Fleckvieh Simmental. Available online: https://www.agroperu.pe/avanza-la-crianza-de-vacunos-fleckvieh-simmental/ (accessed on 5 June 2024).
- 2020. Available online: https://www.inia.gob.pe/2020-nota-089/ (accessed on 5 June 2024).
- Instituto Nacional de Estadística e Informática IV Censo Nacional Agropecuario 2012. Available online: http://censos.inei.gob.pe/Cenagro/redatam/# (accessed on 5 June 2024).
- Toledo-Alvarado, H.; Cecchinato, A.; Bittante, G. Fertility traits of Holstein, Brown Swiss, Simmental, and Alpine Grey cows are differently affected by herd productivity and milk yield of individual cows. J. Dairy Sci. 2017, 100, 8220–8231. [Google Scholar] [CrossRef] [PubMed]
- Windig, J.J.; Calus, M.P.L.; Veerkamp, R.F. Influence of herd environment on health and fertility and their relationship with milk production. J. Dairy Sci. 2005, 88, 335–347. [Google Scholar] [CrossRef] [PubMed]
- Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinform. Oxf. Engl. 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed]
- Martin, M. Cutadapt Removes Adapter Sequences from High-Throughput Sequencing Reads. EMBnet. J. 2011, 17, 10–12. [Google Scholar] [CrossRef]
- Marçais, G.; Kingsford, C. A Fast, Lock-Free Approach for Efficient Parallel Counting of Occurrences of k-Mers. Bioinformatics 2011, 27, 764–770. [Google Scholar] [CrossRef] [PubMed]
- Vurture, G.W.; Sedlazeck, F.J.; Nattestad, M.; Underwood, C.J.; Fang, H.; Gurtowski, J.; Schatz, M.C. GenomeScope: Fast Reference-Free Genome Profiling from Short Reads. Bioinformatics 2017, 33, 2202–2204. [Google Scholar] [CrossRef] [PubMed]
- Luo, R.; Liu, B.; Xie, Y.; Li, Z.; Huang, W.; Yuan, J.; He, G.; Chen, Y.; Pan, Q.; Liu, Y.; et al. SOAPdenovo2: An Empirically Improved Memory-Efficient Short-Read de Novo Assembler. Gigascience 2012, 1, 2047–217X-1-18. [Google Scholar] [CrossRef] [PubMed]
- Zimin, A.V.; Marçais, G.; Puiu, D.; Roberts, M.; Salzberg, S.L.; Yorke, J.A. The MaSuRCA Genome Assembler. Bioinformatics 2013, 29, 2669–2677. [Google Scholar] [CrossRef] [PubMed]
- Gurevich, A.; Saveliev, V.; Vyahhi, N.; Tesler, G. QUAST: Quality Assessment Tool for Genome Assemblies. Bioinformatics 2013, 29, 1072–1075. [Google Scholar] [CrossRef] [PubMed]
- Zimin, A.V.; Salzberg, S.L. The SAMBA Tool Uses Long Reads to Improve the Contiguity of Genome Assemblies. PLoS Comput. Biol. 2022, 18, e1009860. [Google Scholar] [CrossRef] [PubMed]
- Langmead, B.; Salzberg, S.L. Fast Gapped-Read Alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef] [PubMed]
- Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. The Sequence Alignment/Map Format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed]
- Simão, F.A.; Waterhouse, R.M.; Ioannidis, P.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO: Assessing Genome Assembly and Annotation Completeness with Single-Copy Orthologs. Bioinformatics 2015, 31, 3210–3212. [Google Scholar] [CrossRef] [PubMed]
- Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic Local Alignment Search Tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef] [PubMed]


| Property | Min. | Max. |
|---|---|---|
| Heterozygosity | 0.56% | 0.57% |
| Genome haploid length | 2,056,357,349 bp | 2,060,144,712 bp |
| Genome repeat length | 237,799,733 bp | 238,237,708 bp |
| Genome unique length | 1,818,557,616 bp | 1,821,907,004 bp |
| Model fit | 96.34% | 97.89% |
| Read error rate | 0.32% | 0.32% |
| Type | Pumpo |
|---|---|
| Total number of identified SSRs | 973,925 |
| Frequency (SSR/Kb) | 2,808 |
| Number of SSRs present in compound formation | 85,453 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).