Submitted:
31 August 2023
Posted:
04 September 2023
You are already at the latest version
Abstract
Keywords:
Summary
Introduction
Course content
Teaching strategy
Lessons learned and recommendations
Conclusion
Supplementary Materials
Author Contributions
Acknowledgements
Conflict of Interest
References
- EMBL-EBI. European Nucleotide Archive. 2023. Available online: https://www.ebi.ac.uk/ena/browser/home (accessed on 23 July 2023).
- NCBI. GenBank. 2023. Available online: https://www.ncbi.nlm.nih.gov/genbank/ (accessed on 23 July 2023).
- Coudert E, Gehant S, de Castro E, Pozzato M, Baratin D, Neto T, et al. Annotation of biologically relevant ligands in UniProtKB using ChEBI. Bioinformatics. 2023, 39, btac793. [Google Scholar]
- Sielemann K, Hafner A, Pucker B. The Reuse of Public Datasets in the Life Sciences: Potential Risks and Rewards. 2020.
- Zhang H, Mittal N, Leamy LJ, Barazani O, Song B-H. Back into the wild—Apply untapped genetic diversity of wild relatives for crop improvement. Evol Appl. 2017, 10, 5–24. [Google Scholar] [CrossRef] [PubMed]
- Capistrano-Gossmann GG, Ries D, Holtgräwe D, Minoche A, Kraft T, Frerichmann SLM, et al. Crop wild relative populations of Beta vulgaris allow direct mapping of agronomically important genes. Nat Commun. 2017, 8, 15708. [Google Scholar] [CrossRef] [PubMed]
- Price WN, Cohen IG. Privacy in the age of medical big data. Nat Med. 2019, 25, 37–43. [Google Scholar] [CrossRef] [PubMed]
- Işık EB, Brazas MD, Schwartz R, Gaeta B, Palagi PM, van Gelder CWG, et al. Grand challenges in bioinformatics education and training. Nat Biotechnol. 2023, 41, 1171–1174. [Google Scholar] [CrossRef]
- Pucker B, Schilbert HM, Schumacher SF. Integrating Molecular Biology and Bioinformatics Education. J Integr Bioinforma. 2019, 16. [Google Scholar]
- Dorn M, Ligabue-Braun R, Verli H. Transdisciplinary Approach for Bioinformatics Education in Southern Brazil. Front Educ. 2021, 6. [Google Scholar]
- Johnston IG, Slater M, Cazier J-B. Interdisciplinary and Transferable Concepts in Bioinformatics Education: Observations and Approaches From a UK MSc Course. Front Educ. 2022, 7. [Google Scholar]
- Garzón A, Rubio A, Pérez-Pulido AJ. E-learning strategies from a bioinformatics postgraduate programme to improve student engagement and completion rate. Bioinforma Adv. 2022, 2, vbac031. [Google Scholar]
- Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40, D1178–86. [Google Scholar] [CrossRef]
- Van Bel M, Silvestri F, Weitz EM, Kreft L, Botzki A, Coppens F, et al. PLAZA 5.0: extending the scope and power of comparative and functional genomics in plants. Nucleic Acids Res. 2022, 50, D1468–74. [Google Scholar] [CrossRef] [PubMed]
- Droc G, Martin G, Guignon V, Summo M, Sempéré G, Durant E, et al. The banana genome hub: a community database for genomics in the Musaceae. Hortic Res. 2022, 9, uhac221. [Google Scholar] [CrossRef] [PubMed]
- Fernandez-Pozo N, Menda N, Edwards JD, Saha S, Tecle IY, Strickler SR, et al. The Sol Genomics Network (SGN)--from genotype to phenotype to breeding. Nucleic Acids Res. 2015, 43, D1036–D1041. [Google Scholar] [CrossRef]
- Rice Genome Hub. Rice Genome Hub. 2023. Available online: https://rice-genome-hub.southgreen.fr (accessed on 23 July 2023).
- Schilbert HM, Rempel A, Pucker B. Comparison of Read Mapping and Variant Calling Tools for the Analysis of Plant NGS Data. Plants. 2020, 9, 439. [Google Scholar] [CrossRef] [PubMed]
- Marks RA, Hotaling S, Frandsen PB, VanBuren R. Representation and participation across 20 years of plant genome sequencing. Nat Plants. 2021, 7, 1571–1578. [Google Scholar] [CrossRef]
- Sun Y, Shang L, Zhu Q-H, Fan L, Guo L. Twenty years of plant genome sequencing: achievements and challenges. Trends Plant Sci. 2022, 27, 391–401. [Google Scholar] [CrossRef]
- Kress WJ, Soltis DE, Kersey PJ, Wegrzyn JL, Leebens-Mack JH, Gostel MR, et al. Green plant genomes: What we know in an era of rapidly expanding opportunities. Proc Natl Acad Sci. 2022, 119, e2115640118. [Google Scholar] [CrossRef]
- Cheng S, Melkonian M, Smith SA, Brockington S, Archibald JM, Delaux P-M, et al. 10KP: A phylodiverse genome sequencing plan. GigaScience. 2018, 7, giy013. [Google Scholar]
- Pucker B, Irisarri I, Vries J de, Xu B. Plant genome sequence assembly in the era of long reads: Progress, challenges and future directions. Quant Plant Biol. 2022, 3, e5. [Google Scholar]
- The Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408, 796–815. [Google Scholar] [CrossRef]
- Pucker, B.; Data Literacy In Genome Research. GitHub. 2023. Available online: https://github.com/bpucker/teaching/tree/master/FRX_DataLiteracyInGenomeResearch (accessed on 23 July 2023).
- Meckoni SN, Nass B, Pucker B. Phylogenetic placement of Ceratophyllum submersum based on a complete plastome sequence derived from nanopore long read sequencing data. BMC Res Notes. 2023, 16, 187. [Google Scholar]
- Siadjeu C, Pucker B, Viehöver P, Albach DC, Weisshaar B. High Contiguity de novo Genome Sequence Assembly of Trifoliate Yam (Dioscorea dumetorum) Using Long Read Sequencing. Genes. 2020, 11, 274. [Google Scholar] [CrossRef] [PubMed]
- Fukasawa Y, Ermini L, Wang H, Carty K, Cheung M-S. LongQC: A Quality Control Tool for Third Generation Sequencing Long Read Data. G3 GenesGenomesGenetics. 2020, 10, 1193–1196. [Google Scholar] [CrossRef]
- Wick, R. Filtlong. 2023.
- Shafin K, Pesout T, Lorig-Roach R, Haukness M, Olsen HE, Bosworth C, et al. Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes. Nat Biotechnol. 2020, 38, 1044–1053. [Google Scholar] [CrossRef]
- Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017, 27, 722–736. [Google Scholar] [CrossRef] [PubMed]
- GrandOmics. NextDenovo. 2023.
- Kolmogorov M, Yuan J, Lin Y, Pevzner PA. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol. 2019, 37, 540–546. [Google Scholar] [CrossRef]
- Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinforma Oxf Engl. 2013, 29, 1072–1075. [Google Scholar]
- Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015, 31, 3210–3212. [Google Scholar] [CrossRef]
- Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Mol Biol Evol. 2021, 38, 4647–4654. [Google Scholar] [CrossRef]
- Huang N, Li H. miniBUSCO: a faster and more accurate reimplementation of BUSCO. 2023, 2023.06.03.543588.
- Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018, 34, 3094–3100. [Google Scholar] [CrossRef] [PubMed]
- Robinson JT, Thorvaldsdottir H, Turner D, Mesirov JP. igv.js: an embeddable JavaScript implementation of the Integrative Genomics Viewer (IGV). Bioinformatics. 2023, 39, btac830. [Google Scholar]
- Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative Genomics Viewer. Nat Biotechnol. 2011, 29, 24–26. [Google Scholar] [CrossRef]
- Stanke M, Keller O, Gunduz I, Hayes A, Waack S, Morgenstern B. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 2006, 34, W435–W439. [Google Scholar] [CrossRef] [PubMed]
- Stanke M, Diekhans M, Baertsch R, Haussler D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008, 24, 637–644. [Google Scholar] [CrossRef] [PubMed]
- Brůna T, Hoff KJ, Lomsadze A, Stanke M, Borodovsky M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genomics Bioinforma. 2021, 3, lqaa108. [Google Scholar]
- Gabriel L, Brůna T, Hoff KJ, Ebel M, Lomsadze A, Borodovsky M, et al. BRAKER3: Fully Automated Genome Annotation Using RNA-Seq and Protein Evidence with GeneMark-ETP, AUGUSTUS and TSEBRA. 2023, 2023.06.10.544449.
- Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, et al. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol. 2019, 20, 275. [Google Scholar]
- Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013, 29, 15–21. [Google Scholar] [CrossRef]
- Dobin A, Gingeras TR. Mapping RNA-seq Reads with STAR. Curr Protoc Bioinforma. 2015, 51, 11–14. [Google Scholar]
- Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019, 37, 907–915. [Google Scholar] [CrossRef]
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef] [PubMed]
- Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25, 3389–3402. [Google Scholar] [CrossRef] [PubMed]
- Pucker B, Holtgräwe D, Stadermann KB, Frey K, Huettel B, Reinhardt R, et al. A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set. PLOS ONE. 2019, 14, e0216233. [Google Scholar]
- Jones P, Binns D, Chang H-Y, Fraser M, Li W, McAnulla C, et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014, 30, 1236–1240. [Google Scholar] [CrossRef] [PubMed]
- Schwacke R, Ponce-Soto GY, Krause K, Bolger AM, Arsova B, Hallab A, et al. MapMan4: A Refined Protein Classification and Annotation Framework Applicable to Multi-Omics Data Analysis. Mol Plant. 2019, 12, 879–892. [Google Scholar] [CrossRef]
- Bolger M, Schwacke R, Usadel B. MapMan Visualization of RNA-Seq Data Using Mercator4 Functional Annotations. In: Dobnik D, Gruden K, Ramšak Ž, Coll A, editors. Solanum tuberosum: Methods and Protocols. New York, NY: Springer US; 2021. p. 195–212.
- Rempel A, Choudhary N, Pucker B. KIPEs3: Automatic annotation of biosynthesis pathways. 2023, :2022.06.30.498365.
- Pucker, B. Automatic identification and annotation of MYB gene family members in plants. BMC Genomics. 2022, 23, 220. [Google Scholar] [CrossRef]
- Thoben C, Pucker B. Automatic annotation of the bHLH gene family in plants. 2023, 2023.05.02.539087.
- Price MN, Dehal PS, Arkin AP. FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments. PLOS ONE. 2010, 5, e9490. [Google Scholar]
- Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
- Katoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol Biol Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef]
- Edgar, RC. Muscle5: High-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny. Nat Commun. 2022, 13, 6968. [Google Scholar] [CrossRef]
- Pucker B, Iorizzo M. Apiaceae FNS I originated from F3H through tandem gene duplication. PLOS ONE. 2023, 18, e0280155. [Google Scholar]
- Brown JW, Walker JF, Smith SA. Phyx: phylogenetic tools for unix. Bioinformatics. 2017, 33, 1886–1888. [Google Scholar] [CrossRef]
- Letunic I, Bork P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021, 49, W293–6. [Google Scholar] [CrossRef] [PubMed]
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed]
- Heller D, Vingron M. SVIM: structural variant identification using mapped long reads. Bioinformatics. 2019, 35, 2907–2915. [Google Scholar] [CrossRef]
- Smolka M, Paulin LF, Grochowski CM, Mahmoud M, Behera S, Gandhi M, et al. Comprehensive Structural Variant Detection: From Mosaic to Population-Level. 2022, 2022.04.04.487055.
- Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly (Austin). 2012, 6, 80–92. [Google Scholar] [CrossRef]
- Friedrich A, Pucker B. Peer-review as a teaching method. working Paper. 2018.




Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).