Submitted:
02 March 2024
Posted:
04 March 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Literature Review
2.1. Historic Discovery and Structure of the DNA Double Helix
2.2. Unraveling the Double Helix: Historic Discovery and Structural Basis of DNA
2.2.1. Early Theories on the Nature of Genetic Material
2.2.2. Contributions of Rosalind Franklin, Maurice Wilkins, James Watson, and Francis Crick
2.3. DNA Structure and the Double Helix
2.3.1. Chemical Composition of DNA
2.3.2. The Double Helix: Two Strands in Antiparallel Orientation
2.4. Impact on DNA Replication, Transcription, and Translation
2.4.1. DNA Replication
2.4.2. Transcription: DNA to RNA
2.4.3. Translation: RNA to Proteins
2.5. DNA Replication, Transcription, and Translation: Fundamentals of Genetic Information Flow
2.6. DNA and Beyond: Exploring the Complex World of Gene Regulation and Epigenetics
2.6.1. DNA Methylation and Histone Modifications
2.6.2. Chromatin Remodeling and Non-Coding RNAs
2.6.3. Epigenetics and Developmental Biology
2.6.3.1. Gene Regulation and Epigenetic Modifications in Embryonic Development
2.6.3.2. Epigenetic Regulation of Hox Genes and Cell Fate Determination
2.6.3.3. DNA Methylation in Selecting and Maintaining Cell Identity
2.6.3.4. Polycomb-Group Genes in Embryonic Regulation
2.6.3.5. Epigenetic Memory and Developmental Processes
2.6.3.6. Epigenetics in Cancer Stem Cells
2.6.4. Epigenetic Contributions to Disease Mechanisms
2.6.4.1. Epigenetics in Neurodegeneration
2.6.4.2. Crosstalk between Epigenetics and mTOR in Alzheimer’s Disease
2.6.4.3. Epigenetics and Alzheimer’s Disease
2.6.4.4. Genomic Alterations in Non-Cancer Diseases
2.6.4.5. Epigenetics in Cancer
2.6.5. Epigenetics and Personalized Medicine
2.6.6. Gene Regulation and Epigenetics in Drug Discovery
2.6.7. Emerging Technologies in Epigenetic Research
2.6.8. Ethical Considerations in Epigenetic Research
2.7. Revolutionizing Genomics: Breakthroughs in DNA Sequencing Technologies
2.8. Computational Genomics and Bioinformatics: Managing the Data Deluge in DNA Research
2.9. Decoding Human Diseases: DNA-Based Insights into Cancer, Rare Disorders, Infections, and Pharmacogenomics
2.10. Towards Precision Medicine: Utilizing DNA Knowledge for Personalized Healthcare
2.10.1. DNA and Disease Understanding
2.10.1.1. Cancer Biology
2.10.1.2. Rare Genetic Disorders
2.10.1.3. Infectious Diseases
2.10.1.4. Pharmacogenomics
2.10.2. Precision Medicine in Therapeutics
2.10.2.1. Targeted Therapies
2.10.2.2. Gene Therapy
2.10.3. Disease Prevention and Health Management
2.10.3.1. Genetic Risk Assessment
2.10.3.2. Lifestyle Interventions
2.10.4. Ethical Considerations
2.10.4.1. Privacy Concerns
2.10.4.2. Genome Editing Controversies
- (1)
- Privacy concerns
- (2)
- Genome editing controversies
2.11. Beyond the Genome: DNA Nanotechnology and Synthetic Biology on the Horizon
2.12.1. Privacy Concerns in DNA Research
2.12.2. Genome Editing and CRISPR-Cas9
2.12.3. Societal Implications of DNA Research
2.13. Future Perspectives: Emerging Technologies and the Exciting Frontier of DNA Research
2.13.1. CRISPR-Cas9: Precision Gene Editing
2.13.2. DNA Nanotechnology: Building at the Nanoscale
2.13.3. Synthetic Biology: Engineering Life
2.13.4. DNA Sequencing Advancements: Unlocking the Genomic Code
2.13.5. Personalized Medicine and Beyond
3. The Transformative Impact of DNA Sequencing on Genomics and Personalized Medicine
4. Potential limitations and Future Research Directions
4.1. Potential Limitations
4.1.1. Scope of Current Data
4.1.2. Interdisciplinary Challenges
4.1.3. Representation of Diverse Populations in Genomic Data
4.1.4. Technological Limitations
4.2. Directions for Future Research
4.2.1. Enhancing Diversity in Genomic Studies
4.2.2. Integrating Multi-Omics Data
4.2.3. Advanced Computational Models
4.2.4. Longitudinal Studies
4.2.5. Ethical Framework Development
4.2.6. Environmental and Lifestyle Factors
4.2.7. Clinical Trials and Implementation Studies
5. Challenges
- (1)
- Data management in DNA sequencing
- (2)
- Accuracy and reliability of sequencing technologies
- (3)
- Ethical concerns
- (4)
- Integration of genomic data in clinical practice
- (5)
- Handling of genetic variability in disease diagnosis and treatment
- (6)
- Public perception and acceptance
- (7)
- Regulatory and policy challenges
- (8)
- Technological limitations in emerging areas
6. Discussion
6.1. Advancements in Gene Therapy and Genome Editing
6.1.1. CRISPR-Cas9 Developments
6.1.2. Gene Replacement Therapies
6.2. Progress in Pharmacogenomics
6.2.1. Personalized Medicine
6.2.2. Pharmacogenomic Testing
6.3. Advances in Diagnostic Techniques
6.3.1. Early Detection of Genetic Disorders
6.3.2. Enhancements in Prenatal Screening
6.4. Preventive Strategies in Genetic Disorders
6.4.1. Genetic Counseling
6.4.2. Public Health Initiatives
6.5. Ethical, Legal, and Social Considerations
6.5.1. Ethical Debates
6.5.2. Policy and Legal Frameworks
6.6. Future Outlook
6.6.1. Emerging Technologies
6.6.2. Challenges and Opportunities
7. Conclusions
Use of AI tools declaration
Acknowledgments
Conflict of Interest
References
- Watson, J., Crick, F. (1953) Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid. Nature 171, 737–738. [CrossRef]
- Meselson M, Stahl FW. (1958) The Replication of DNA In Escherichia Coli. Proc Natl Acad Sci U S A. 15;44(7):671-82. [CrossRef]
- Mardis ER. (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. 9:387-402. PMID: 18576944. [CrossRef]
- Lander ES. (2011) Initial impact of the sequencing of the human genome. Nature. 10;470(7333):187-97. https://doi.org/10.1038/nature09792. Doudna JA, Charpentier E. Genome editing. The new frontier of genome engineering with CRISPR-Cas9. Science. 2014 Nov 28;346(6213):1258096. [CrossRef]
- Franklin R, Gosling R. (1953) Molecular Configuration in Sodium Thymonucleate. Nature 171, 740–741. [CrossRef]
- Chargaff E. (1950) Chemical specificity of nucleic acids and mechanism of their enzymatic degradation. Experientia. 15;6(6):201-9. [CrossRef]
- Bird A. (2002) DNA methylation patterns and epigenetic memory. Genes Dev. 16(1):6-21. [CrossRef]
- Sanger F, Nicklen S, Coulson AR. (1977) DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci USA. 74(12):5463-7. [CrossRef]
- Hall K, Sankaran N. (2021) DNA translated: Friedrich Miescher’s discovery of nuclein in its original context. Br J Hist Sci. 54(1):99-107. [CrossRef]
- Frixione E, Ruiz-Zamarripa L. (2019) The “scientific catastrophe” in nucleic acids research that boosted molecular biology. J Biol Chem. 15;294(7):2249-2255. [CrossRef]
- Avery OT, MacLeod CM, McCarty M. (1944) Studies on the chemical nature of the substance inducing transformation of pneumococcal types: Induction of transformation by a desoxyribonucleic acid fraction isolated from pneumococcus type III. J Exp Med. 79(2):137-58. [CrossRef]
- Dahm R. (2005) Friedrich Miescher and the discovery of DNA. Develop Biol. 278(2):274-288. 1016.
- Rheinberger, HJ. (2020). Mendel, Gregor Johann: Experiments on Plant Hybrids. In: Arnold, H.L. (eds) Kindlers Literatur Lexikon (KLL). J.B. Metzler, Stuttgart. [CrossRef]
- Franklin RE. (1953) Evidence for 2-chain helix in crystalline structure of sodium deoxyribonucleate. Nature. 172(4369):156-7. [CrossRef]
- DeMaria AN. (2003) A structure for deoxyribose nucleic acid. J Am Coll Cardiol. 16;42(2):373-4. [CrossRef]
- Jacob F, Monod J. (1961) Genetic regulatory mechanisms in the synthesis of proteins. J Mol Biol. 3:318-56. [CrossRef]
- Nirenberg MW, Matthaei JH. (1961) The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides. Proc Natl Acad Sci USA. 47(10):1588-1602. [CrossRef]
- Watson JD, Crick FH. (1953) Genetical implications of the structure of deoxyribonucleic acid. Nature. 171(4361):964-967. [CrossRef]
- Okazaki R, Okazaki T, Sakabe K, Sugimoto K, & Sugino A. (1968) Mechanism of DNA chain growth. I. Possible discontinuity and unusual secondary structure of newly synthesized chains. Proc Natl Acad Sci USA. 59(2):598-605. [CrossRef]
- Kunkel TA, Erie DA. (2005) DNA mismatch repair. Annu Rev Biochem. 74:681-710. [CrossRef]
- Lee TI, Young RA. (2000) Transcription of eukaryotic protein-coding genes. Annu Rev Genet. 34:77-137. [CrossRef]
- Kozak M. (1999) Initiation of translation in prokaryotes and eukaryotes. Gene. 234(2):187-208. [CrossRef]
- Ray S. (2014) The Cell: A Molecular Approach. Yale J Biol Med. 12;87(4):603–4. PMCID: PMC4257047.
- High KA, Roncarolo MG. (2019) Gene therapy. N Engl J Med. 381(5):455-464. [CrossRef]
- Clapier CR, Cairns BR. (2009) The biology of chromatin remodeling complexes. Annu Rev Biochem. 78:273-304. [CrossRef]
- Jaenisch R, Bird A. (2003) Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet. 33 Suppl:245-54. [CrossRef]
- Reik W, Dean W, Walter J. (2001) Epigenetic reprogramming in mammalian development. Science. 293(5532):1089-93. [CrossRef]
- Soshnikova N, Duboule D. (2009) Epigenetic regulation of vertebrate Hox genes: a dynamic equilibrium. Epigenetics. 16;4(8):537-40. Epub 2009 Nov 21. PMID: 19923920. [CrossRef] [PubMed]
- Smith ZD, Meissner A. (2013) DNA methylation: roles in mammalian development. Nat Rev Genet. 14(3):204-20. [CrossRef]
- Schuettengruber B, Cavalli G. (2009) Recruitment of polycomb group complexes and their role in the dynamic regulation of cell fate choice. Development. 136(21):3531-42. [CrossRef]
- Probst AV, Dunleavy E, Almouzni G. (2009) Epigenetic inheritance during the cell cycle. Nat Rev Mol Cell Biol. 10(3):192-206. [CrossRef]
- Baylin SB, Jones PA. (2011) A decade of exploring the cancer epigenome - biological and translational implications. Nat Rev Cancer. 11(10):726-34. [CrossRef]
- Esteller M. (2008) Epigenetics in cancer. N Engl J Med. 358(11):1148-59. 1148. [CrossRef]
- Coppieters N, Dieriks BV, Lill C, Faull RL, Curtis MA, Dragunow M. (2014) Global changes in DNA methylation and hydroxymethylation in Alzheimer’s disease human brain. Neurobiol Aging. 35(6):1334-44. [CrossRef]
- Caccamo A, Majumder S, Richardson A, Strong R, Oddo S. (2010) Molecular interplay between mammalian target of rapamycin (mTOR), amyloid-beta, and Tau: effects on cognitive impairments. J Biol Chem. 23;285(17):13107-20. Epub 2010 Feb 23. [CrossRef]
- Qureshi IA, Mehler MF. (2010) Epigenetic mechanisms underlying human epileptic disorders and the process of epileptogenesis. Neurobiol Dis. 39(1):53-60. [CrossRef]
- Feinberg AP, Tycko B. (2004) The history of cancer epigenetics. Nat Rev Cancer. 4(2):143-53. [CrossRef]
- Jones PA, Baylin SB. (2002) The fundamental role of epigenetic events in cancer. Nat Rev Genet. 3(6):415-28. [CrossRef]
- Hood L, Friend SH. (2011) Predictive, personalized, preventive, participatory (P4) cancer medicine. Nat Rev Clin Oncol. 8(3):184-7. [CrossRef]
- Copeland RA, Solomon ME, Richon VM. (2009) Protein methyltransferases as a target class for drug discovery. Nat Rev Drug Discov. 8(9):724-32. [CrossRef]
- Dekker J, Rippe K, Dekker M, Kleckner N. (2002) Capturing chromosome conformation. Science. 15;295(5558):1306-11. [CrossRef] [PubMed]
- Lander ES, Linton LM, Birren B, et al. (2001) Initial sequencing and analysis of the human genome. Nature. 409(6822):860-921. [CrossRef]
- Metzker, M. (2010) Sequencing technologies — the next generation. Nat Rev Genet 11, 31–46. [CrossRef]
- Collins FS, Varmus H. (2015) A new initiative on precision medicine. N Engl J Med. 26;372(9):793-5. [CrossRef]
- Schadt EE, Turner S, Kasarskis A. (2010) A window into third-generation sequencing. Hum Mol Genet. 15;19(R2):R227-40. [CrossRef]
- Eid J, Fehr A, Gray J, et al. (2009) Real-time DNA sequencing from single polymerase molecules. Science. 323(5910):133-8. [CrossRef]
- Reuter JA, Spacek DV, Snyder MP. (2015) High-throughput sequencing technologies. Mol Cell. 58(4):586-97. [CrossRef]
- Kellis M, Wold B, Snyder MP, et al. (2014) Defining functional DNA elements in the human genome. Proc Natl Acad Sci USA. 111(17):6131-8. [CrossRef]
- Stein LD. (2010) The case for cloud computing in genome informatics. Genome Biol. 11(5):207. [CrossRef]
- Schatz MC, Langmead B, Salzberg SL. (2010) Cloud computing and the DNA data race. Nat Biotechnol. 28(7):691-3. [CrossRef]
- Li H, Durbin R. (2009) Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics. 25(14):1754-60. [CrossRef]
- McKenna A, Hanna M, Banks E, et al. (2010) The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20(9):1297-303. [CrossRef]
- Zerbino DR, Birney E. (2008) Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18(5):821-9. [CrossRef]
- Besemer J, Lomsadze A, Borodovsky M. (2001) GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 29(12):2607-18. [CrossRef]
- Lister R, O’Malley RC, Tonti-Filippini J, et al. (2008) Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell. 133(3):523-36. [CrossRef]
- Kumar P, Henikoff S, Ng PC. (2009) Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 4(7):1073-81. [CrossRef]
- Joyce AR, Palsson BØ. (2006) The model organism as a system: integrating ‘omics’ data sets. Nat Rev Mol Cell Biol. 7(3):198-210. [CrossRef]
- Hamburg MA, Collins FS. (2010) The path to personalized medicine. N Engl J Med. 363(4):301-4. [CrossRef]
- Bick D, Fraser PC, Guttmacher AE, et al. (2017) Successful application of whole genome sequencing in a medical genetics clinic. J Pediatr Genet. 6(2):61-76. [CrossRef]
- Sboner A, Mu XJ, Greenbaum D, et al. (2011) The real cost of sequencing: higher than you think! Genome Biol. 12(8):125. [CrossRef]
- Mardis ER. (2011) A decade’s perspective on DNA sequencing technology. Nature. 470(7333):198-203. [CrossRef]
- Vogelstein B, Papadopoulos N, Velculescu VE, Zhou S, Diaz LA Jr, Kinzler KW. (2013) Cancer genome landscapes. Science. 339(6127):1546-58. [CrossRef]
- Druker BJ, Talpaz M, Resta DJ, et al. (2001) Efficacy and safety of a specific inhibitor of the BCR-ABL tyrosine kinase in chronic myeloid leukemia. N Engl J Med. 344(14):1031-7. [CrossRef]
- Stratton MR, Campbell PJ, Futreal PA. (2009) The cancer genome. Nature. 458(7239):719-24. [CrossRef]
- Bettegowda C, Sausen M, Leary RJ, et al. (2014) Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci Transl Med. 6(224):224ra24. [CrossRef]
- Bamshad MJ, Ng SB, Bigham AW, et al. (2011) Exome sequencing as a tool for Mendelian disease gene discovery. Nat Rev Genet. 12(11):745-55. [CrossRef]
- Yang Y, Muzny DM, Reid JG, et al. (2013) Clinical whole-exome sequencing for the diagnosis of mendelian disorders. N Engl J Med. 369(16):1502-11. [CrossRef]
- Cox DBT, Platt RJ, Zhang F. (2015) Therapeutic genome editing: prospects and challenges. Nat Med. 21(2):121-31. [CrossRef]
- Mendell JR, Al-Zaidy S, Shell R, et al. (2017) Single-dose gene-replacement therapy for spinal muscular atrophy. N Engl J Med. 377(18):1713-22. [CrossRef]
- Rappuoli R, Pizza M, Del Giudice G, De Gregorio E. (2014) Vaccines, new opportunities for a new society. Proc Natl Acad Sci USA. 111(34):12288-93. [CrossRef]
- Weinshilboum R, Wang L. (2017) Pharmacogenomics: precision medicine and drug response. Mayo Clin Proc. 2017;92(11):1711-22. [CrossRef]
- Roden DM, McLeod HL, Relling MV, et al. (2019) Pharmacogenomics. Lancet. 394(10197):521-32. [CrossRef]
- Johnson JA. (2013) Pharmacogenetics in clinical practice: how far have we come and where are we going? Pharmacogenomics. 14(7):835-43. [CrossRef]
- Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E. (2012) A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 337(6096):816-21. [CrossRef]
- Keasling JD. (2012) Synthetic biology and the development of tools for metabolic engineering. Metab Eng. 14(3):189-95. [CrossRef]
- Seeman NC. (2010) Nanomaterials based on DNA. Annu Rev Biochem. 79:65-87. [CrossRef]
- Lanphier E, Urnov F, Haecker SE, Werner M, Smolenski J. (2015) Don’t edit the human germ line. Nature. 519(7544):410-1. [CrossRef]
- Slamon DJ, Leyland-Jones B, Shak S, et al. (2001) Use of chemotherapy plus a monoclonal antibody against HER2 for metastatic breast cancer that overexpresses HER2. N Engl J Med. 344(11):783-92. [CrossRef]
- Katsanis SH, Katsanis N. (2013) Molecular genetic testing and the future of clinical genomics. Nat Rev Genet. 14(6):415-26. [CrossRef]
- Sanger F, Coulson AR, Hong GF, Hill DF, Petersen GB. (1977) Nucleotide sequence of bacteriophage φX174 DNA. Nature. 265(5596):687-95. [CrossRef]
- Weinshilboum RM. (2003) Inheritance and drug response. N Engl J Med. 348(6):529-37. [CrossRef]
- Yaffe MB, Leparc GG, Lai L, Obata T, Volinia S, Cantley LC. (2001) A motif-based profile scanning approach for genome-wide prediction of signaling pathways. Nat Biotechnol. 19(4):348-53. [CrossRef]
- Biesecker LG, Green RC. (2014) Diagnostic clinical genome and exome sequencing. N Engl J Med. 370(25):2418-25. [CrossRef]
- Gymrek M, McGuire AL, Golan D, Halperin E, Erlich Y. (2013) Identifying personal genomes by surname inference. Science. 339(6117):321-4. [CrossRef]
- Baltimore D, Berg P, Botchan M, et al. (2015) A prudent path forward for genomic engineering and germline gene modification. Science. 348(6230):36-38. [CrossRef]
- Seeman NC. (2003) DNA in a material world. Nature. 421(6921):427-31. [CrossRef]
- Rothemund PWK. (2006) Folding DNA to create nanoscale shapes and patterns. Nature. 440(7082):297-302. [CrossRef]
- Douglas SM, Bachelet I, Church GM. (2012) A logic-gated nanorobot for targeted transport of molecular payloads. Science. 335(6070):831-4. [CrossRef]
- Cutler JI, Auyeung E, Mirkin CA. (2012) Spherical nucleic acids. J Am Chem Soc. 134(3):1376-91. [CrossRef]
- Li J, Fan C, Pei H, Shi J, Huang Q. (2013) Smart drug delivery nanocarriers with self-assembled DNA nanostructures. Adv Mater. 25(31):4386-92. [CrossRef]
- Modi S, Swetha MG, Goswami D, Gupta GD, Mayor S, Krishnan Y. (2009) A DNA nanomachine that maps spatial and temporal pH changes inside living cells. Nat Nanotechnol. 4(5):325-30. [CrossRef]
- Zhang DY, Seelig G. (2011) Dynamic DNA nanotechnology using strand-displacement reactions. Nat Chem. 3(2):103-13. [CrossRef]
- Qian L, Winfree E. (2011) Scaling up digital circuit computation with DNA strand displacement cascades. Science. 332(6034):1196-201. [CrossRef]
- Ke Y, Ong LL, Shih WM, Yin P. (2012) Three-dimensional structures self-assembled from DNA bricks. Science. 338(6111):1177-83. [CrossRef]
- Khalil AS, Collins JJ. (2010) Synthetic biology: applications come of age. Nat Rev Genet. 11(5):367-79. [CrossRef]
- Oye KA, Wellhausen R, Irwin RS. (2014) Regulating gene drives. Science. 345(6197):626-8. [CrossRef]
- Church GM, Gao Y, Kosuri S. (2012) Next-generation digital information storage in DNA. Science. 337(6102):1628. [CrossRef]
- Endy D. (2005) Foundations for engineering biology. Nature. 438(7067):449-53. [CrossRef]
- Rothstein MA. (2010) The ethical implications of personal genome sequencing. J Law Med Ethics. 38(1):64-73. [CrossRef]
- [101] Kaye J, Heeney C, Hawkins N, de Vries J, Boddington P. (2009) Data sharing in genomics — re-shaping scientific practice. Nat Rev Genet. 10(5):331-5. [CrossRef]
- Fullerton SM, Knerr S, Burke W. (2012) Finding a place for genomics in health disparities research. Public Health Genomics. 15(3-4):156-63. [CrossRef]
- Nelkin D, Lindee MS. (2004) The DNA mystique: The gene as a cultural icon. 2nd ed. Ann Arbor: University of Michigan Press. [CrossRef]
- Caulfield T, McGuire AL. (2012) Direct-to-consumer genetic testing: Perceptions, problems, and policy responses. Annu Rev Med. 63:23-33. [CrossRef]
- Keasling JD. (2008) Synthetic biology for synthetic chemistry. ACS Chem Biol. 3(1):64-76. [CrossRef]
- Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol. 2008;26(10):1135-45. [CrossRef]
- Collins FS, Morgan M, Patrinos A. (2003) The Human Genome Project: Lessons from large-scale biology. Science. 300(5617):286-90. [CrossRef]
- International Human Genome Sequencing Consortium. (2004) Finishing the euchromatic sequence of the human genome. Nature. 431(7011):931-45. [CrossRef]
- Wetterstrand KA. DNA Sequencing Costs: Data from the NHGRI Genome Sequencing Program (GSP). National Human Genome Research Institute. https://www.genome.gov/about-genomics/fact-sheets/DNA-Sequencing-Costs-Data.
- Druker BJ, Guilhot F, O’Brien SG, et al. (2006) Five-year follow-up of patients receiving imatinib for chronic myeloid leukemia. N Engl J Med. 355(23):2408-17. [CrossRef]
- Wright CF, Fitzpatrick DR, Firth HV. (2018) Paediatric genomics: diagnosing rare disease in children. Nat Rev Genet. 19(4):253-268. [CrossRef]
- Turro E, Astle WJ, Megy K, et al. (2020) Whole-genome sequencing of patients with rare diseases in a national health system. Nature. 583(7814):96-102. [CrossRef]
- Relling MV, Evans WE. (2015) Pharmacogenomics in the clinic. Nature. 2015;526(7573):343-50. [CrossRef]
| Landmark | Description |
|---|---|
| Friedrich Miescher (1869) | The journey began with Friedrich Miescher, a Swiss biochemist who, in 1869, isolated a novel phosphorus-containing substance from white blood cells. This substance, which he named “nuclein”, is now known as nucleic acid and was the first identification of what we now call DNA (Deoxyribonucleic Acid) [10]. |
| Phoebus Levene (1910s) | In the 1910s, American biochemist Phoebus Levene made significant advances in understanding nucleic acids. He identified the sugar components (ribose in RNA and deoxyribose in DNA) and incorrectly proposed the “tetranucleotide hypothesis”, suggesting that DNA was composed of equal amounts of four nucleotides in a fixed sequence, which limited DNA’s role in genetic variability. [11]. |
| Oswald Avery, Colin MacLeod, and Maclyn McCarty (1944) | A major turning point came with the transformation experiment by Avery, MacLeod, and McCarty in 1944, demonstrating that DNA is the substance that causes bacterial transformation, a landmark discovery suggesting DNA’s role in heredity [12]. |
| Erwin Chargaff (1950) | In the early 1950s, Erwin Chargaff formulated what came to be known as Chargaff’s rules, showing that in DNA, the amount of adenine (A) equals thymine (T) and the amount of cytosine (C) equals guanine (G), hinting at the pairing mechanism within the DNA structure [7]. |
| Rosalind Franklin and Maurice Wilkins (1950s) | The contributions of Rosalind Franklin, a British X-ray crystallographer, and Maurice Wilkins were crucial. They produced high-quality X-ray diffraction images of DNA, notably “Photo 51”, which were key in identifying the helical structure of DNA [6]. |
| James Watson and Francis Crick (1953) | The culmination of these efforts was the proposal of the double helix model of DNA by James Watson and Francis Crick in 1953. They integrated previous findings, especially using Franklin’s X-ray data, to propose that DNA is a double helix with base pairing (A with T, C with G) [1]. |
| Later developments | Following the discovery of the DNA structure, further research delved into the mechanisms of DNA replication, transcription, and repair. This discovery has had a profound impact on the field of molecular biology, leading to developments such as the Human Genome Project and CRISPR gene editing technologies. |
| Process | Description |
|---|---|
| DNA replication | DNA replication is the process by which a cell duplicates its DNA before cell division. This ensures that each daughter cell receives a complete and identical copy of the genetic material. The process involves the unwinding of the DNA double helix, formation of replication forks, and synthesis of new DNA strands by complementary base pairing with the template strand [20]. Key enzymes, such as DNA polymerases, helicases, and topoisomerases, participate in orchestrating this complex process. Errors during DNA replication can lead to mutations and, in turn, contribute to the development of genetic disorders and cancer [21]. |
| Transcription | Transcription is the process by which genetic information encoded in DNA is converted into RNA. The enzyme RNA polymerase synthesizes a complementary RNA strand based on the template DNA strand. The transcribed RNA, known as mRNA, carries the genetic information from the nucleus to the cytoplasm, where it serves as a template for protein synthesis. Additional players, including transcription factors and enhancers, regulate the transcription process, enabling fine-tuned gene expression [22]. |
| Translation: | Translation is the final step in the flow of genetic information, where mRNA is used as a template to synthesize proteins. Ribosomes, the molecular machines responsible for translation, read the mRNA codons and match them with specific amino acids to build a polypeptide chain. The genetic code, characterized by the codon-amino acid correspondence, is universal and shared across all living organisms. Post-translational modifications of proteins further regulate their function, stability, and localization [23]. |
| Implications in genetic diseases | Errors or mutations in the processes of DNA replication, transcription, and translation can lead to a wide range of genetic disorders. Mutations in DNA replication-associated genes have been linked to conditions like Bloom syndrome and Werner syndrome, while aberrant transcription and translation regulation are involved in diseases such as thalassemia and muscular dystrophy. Understanding these processes’ nuances is vital in diagnosing and developing targeted therapies for genetic diseases [24]. |
| Biotechnological and therapeutic application | The understanding of DNA replication, transcription, and translation has revolutionized biotechnology and medicine. Techniques such as polymerase chain reaction (PCR), DNA cloning, and gene expression profiling rely on these processes. Additionally, advances in personalized medicine, gene therapies, and RNA-based therapeutics have emerged due to our in-depth knowledge of these fundamental processes [25]. |
| Advancement | Description |
|---|---|
| Sanger sequencing | The pioneering technique Sanger sequencing, also known as the chain-termination method, was the first DNA sequencing technique developed by Frederick Sanger in the 1970s [9]. This method relies on the incorporation of chain-terminating dideoxynucleotides during DNA synthesis, generating DNA fragments of varying lengths that can be separated by gel electrophoresis. Despite being a labor-intensive process, Sanger sequencing was pivotal in numerous groundbreaking studies, including the Human Genome Project. |
| Next-generation sequencing (NGS) | The introduction of NGS technologies in the mid-2000s brought a paradigm shift in DNA sequencing. NGS techniques, such as Illumina sequencing, ion torrent sequencing, and Oxford nanopore sequencing, enabled massively parallel sequencing of DNA fragments [44]. This dramatic increase in throughput significantly reduced the cost and time required for whole-genome sequencing, exome sequencing, and targeted sequencing. |
| Applications in precision genomics and personalized medicine | The application of DNA sequencing in precision genomics has revolutionized disease diagnosis, prognosis, and treatment. NGS has enabled the identification of disease-causing mutations, pharmacogenetic variations, and genetic risk factors in complex disorders [45]. Personalized medicine, driven by DNA sequencing data, has facilitated the development of targeted therapies and tailored treatment plans, improving patient outcomes. |
| Challenges and limitations | Despite the tremendous progress in DNA sequencing technologies, certain challenges persist. Generating and managing vast amounts of DNA data poses computational and bioinformatics challenges [46]. Additionally, the accuracy and reliability of certain sequencing technologies need further refinement. Addressing ethical concerns, such as data privacy, consent, and responsible use of genetic information, is crucial for the ethical practice of genomics. |
| Emerging single-molecule sequencing techniques | The emergence of single-molecule sequencing techniques, such as PacBio SMRT (single-molecule real-time) sequencing [47], has provided an alternative approach to DNA sequencing. By directly reading the DNA sequence in real time, these techniques offer longer read lengths, enabling the assembly of complex genomic regions and resolving repetitive sequences. |
| Future prospects | The future of DNA sequencing technologies is promising. As sequencing costs continue to decline and technologies improve, whole-genome sequencing is becoming more accessible in clinical settings. Furthermore, advances in long-read sequencing and third-generation sequencing technologies are expected to enhance the accuracy and resolution of genomic analyses [48]. Integrating DNA sequencing with other ‘omics’ data, such as transcriptomics and epigenomics, will yield a comprehensive understanding of biological systems [49]. |
| Technology | Description |
|---|---|
| Challenges in managing DNA data | The data deluge in DNA research presents significant challenges in data storage, processing, and analysis. Traditional computing infrastructures struggle to handle the immense volume of data, leading to increased processing time and resource consumption. Moreover, as DNA sequencing technologies continue to evolve, the diversity of data formats and file sizes further complicates data management [51]. The need for efficient and scalable solutions has catalyzed the development of novel computational strategies. |
| Computational techniques in DNA data analysis | A plethora of computational techniques has been devised to process and analyze DNA data. Alignment algorithms, such as Burrows-Wheeler aligner (BWA), enable the comparison of sequencing reads to reference genomes, facilitating the identification of genetic variations [52]. Variant calling algorithms, including GATK (Genome Analysis Toolkit), are employed to detect single nucleotide polymorphisms (SNPs), insertions, deletions, and structural variations [53]. |
| Genome assembly and annotation | The reconstruction of complete genomes from fragmented sequencing reads, known as genome assembly, is a critical task in computational genomics. Numerous assembly algorithms, such as Velvet and SPAdes [54], have been developed to address this challenge. Additionally, computational tools like AUGUSTUS [39] and GeneMark [55] are used for gene prediction and functional annotation of DNA sequences. |
| Transcriptomics and epigenomics | Computational genomics plays a crucial role in transcriptomics and epigenomics studies. RNA-seq data analysis involves quantification of gene expression levels, differential gene expression analysis, and alternative splicing detection. Similarly, epigenomic data analysis, including DNA methylation and histone modification profiles, relies heavily on computational methods [56]. |
| Bioinformatics tools for genomic variant interpretation | Interpreting genomic variants to understand their functional impact is a critical aspect of DNA research. Various bioinformatics tools, such as SIFT, PolyPhen-2, and PROVEAN [57], are widely used for predicting the potential effects of genetic variants on protein function and structure. |
| Integrative genomics: | Integrative genomics involves the integration of diverse biological datasets to gain a comprehensive understanding of complex biological processes [58]. Computational techniques, such as pathway analysis and network inference, aid in deciphering the relationships between genes, proteins, and regulatory elements. |
| Personalized medicine and pharmacogenomics | The integration of genomics data with clinical information has opened avenues for personalized medicine and pharmacogenomics. Computational methods are employed to identify genomic biomarkers, predict drug responses, and stratify patient populations based on genetic profiles [59]. |
| DNA-related diseases | Description |
|---|---|
| DNA and Cancer: | The field of cancer biology has witnessed significant advancements due to DNA-based research. Genetic mutations in oncogenes and tumor suppressor genes play critical roles in carcinogenesis [63]. Understanding these mutations has led to targeted therapies, such as tyrosine kinase inhibitors in the treatment of specific types of leukemia [64]. Moreover, whole-genome sequencing studies have identified driver mutations in various cancer types, providing insights into potential therapeutic targets [65]. Additionally, liquid biopsy techniques using circulating tumor DNA have shown promise in cancer diagnosis and monitoring treatment response [66]. |
| DNA and Rare Genetic Disorders: | DNA sequencing technologies have revolutionized the diagnosis of rare genetic disorders [67]. Exome sequencing and whole-genome sequencing have facilitated the identification of disease-causing variants in patients with previously undiagnosed conditions [68]. Furthermore, CRISPR-Cas9-based gene editing has shown potential for treating genetic disorders by correcting pathogenic mutations [69]. Case studies of successful gene therapies, such as in spinal muscular atrophy, demonstrate the potential of DNA-based approaches in treating rare diseases [70]. |
| DNA and Infectious Diseases: | DNA-based research has significantly advanced our understanding of infectious diseases. Whole-genome sequencing of pathogens has helped in tracking outbreaks, identifying drug-resistant strains, and developing targeted therapies. The use of DNA-based techniques, such as polymerase chain reaction (PCR) and next-generation sequencing (NGS), has greatly improved the speed and accuracy of diagnosing infectious agents. Furthermore, DNA vaccines have emerged as a promising avenue for immunization against various infectious diseases [71]. |
| DNA and Pharmacogenomics: | Pharmacogenomics aims to personalize drug treatments based on an individual’s genetic makeup. Genetic variations in drug-metabolizing enzymes and drug targets can significantly influence drug response and toxicity [72]. DNA-based testing has been instrumental in identifying individuals at risk of adverse drug reactions and guiding drug selection and dosing [86]. The implementation of pharmacogenomics has the potential to enhance treatment outcomes and reduce adverse effects, ultimately leading to improved patient care [74]. |
| Emerging Technologies and Future Perspectives: | The advent of CRISPR-Cas9 gene editing has sparked excitement for its potential applications in treating various genetic diseases, including cancer [75]. The field of synthetic biology offers opportunities for designing and engineering novel DNA-based therapeutic agents [76]. DNA nanotechnology, with its unique ability to create nanostructures and devices, holds promise for targeted drug delivery and diagnostic applications [77]. However, along with these advancements, ethical considerations and controversies surrounding genome editing and privacy concerns need to be carefully addressed [78]. |
| DNA Nanotechnology: | DNA nanotechnology utilizes the inherent ability of DNA to self-assemble into various structures. Among the notable achievements is the development of DNA origami, a method pioneered by Paul Rothemund in 2006 [88]. DNA origami involves folding a long single-stranded DNA scaffold with shorter staple strands, resulting in diverse nanostructures with precise control over shape and size. This technique has shown great promise in creating nanoscale devices for drug delivery [89], bioimaging [90], and even nanorobots capable of targeted therapeutic interventions [91]. |
| DNA-Based Nanodevices: | DNA-based nanodevices are engineered systems with functional components made from DNA. These nanodevices can be designed to respond to specific stimuli, enabling a wide range of applications. For instance, researchers have developed DNA nanoswitches capable of detecting disease-related biomarkers and releasing therapeutic agents upon detection [92]. Additionally, DNA-based nanodevices have been employed as biosensors for detecting pathogens and environmental pollutants [93]. |
| Programmable Self-Assembly: | DNA’s programmable base-pairing properties enable the precise and programmable self-assembly of complex nanostructures. The directed assembly of DNA nanostructures has shown great potential in creating nanoscale circuits for computation and data storage [94]. Moreover, programmable self-assembly techniques hold promise for developing new materials with unique properties and functions [95]. |
| Synthetic Biology: | Synthetic biology involves the engineering of biological systems using synthetic DNA constructs. Advances in gene synthesis and editing technologies have paved the way for the construction of artificial genetic circuits and organisms with novel functionalities [96]. Synthetic biology has applications in various fields, including biofuel production, pharmaceuticals, and bioremediation. |
| Ethical Considerations: | With the vast potential of DNA nanotechnology and synthetic biology, ethical considerations are paramount. As these technologies progress, concerns about biosecurity, dual-use applications, and unintended consequences must be addressed [97]. Additionally, discussions around responsible use and regulation are crucial to ensure the ethical application of these powerful tools. |
| Future Prospects: | Looking ahead, DNA nanotechnology and synthetic biology hold immense promise in shaping the future of medicine, biotechnology, and nanoscience. Research in these areas continues to advance, with the potential to revolutionize precision medicine, drug delivery, and bio-computing [98]. As we gain a deeper understanding of DNA’s properties and engineering capabilities, we can expect transformative breakthroughs and new applications that were once thought impossible [99]. |
| Year | Milestone in DNA research |
| 1953 | Watson and Crick reveal the DNA double helix structure |
| 1958 | Meselson and Stahl demonstrate semi-conservative replication |
| 1961 | Genetic code deciphered, mRNA’s role in protein synthesis |
| 1970s | Discovery of DNA polymerases and DNA sequencing methods |
| 1980s | Invention of polymerase chain reaction (PCR) |
| 1990-2003 | Human Genome Project maps the entire human genome |
| Late 20th century | Advances in gene expression regulation and epigenetics |
| 2005 | First next-generation sequencing (NGS) platforms |
| 2012 | CRISPR-Cas9 gene editing system introduced |
| 2010s | Rapid growth of precision medicine and personalized genomics |
| Present | Exploration of DNA nanotechnology and synthetic biology |
| Future | Potential applications of emerging DNA technologies |
| Impact | Ref. |
|---|---|
| Impact on genomics and disease understanding through the Human Genome Project: Completed in 2003, the Human Genome Project (HGP) was a monumental effort that resulted in the sequencing of the entire human genome, comprising approximately 3,000,000,000 base pairs. This project, which cost about $2,700,000,000 billion, facilitated the identification of over 1,800 disease genes and revolutionized our understanding of human biology. | [107] |
| Reduction in sequencing costs: There has been a dramatic decrease in the cost of sequencing a human genome, from $100,000,000 in 2001 to around $600 in recent years, making whole-genome sequencing more accessible for research and clinical settings. | [108] |
| Advancements in precision medicine - targeted cancer therapies: DNA sequencing in oncology has led to targeted therapies. The identification of the BCR-ABL gene fusion in chronic myeloid leukemia (CML) patients, for instance, resulted in the development of Imatinib. This drug, which specifically targets the BCR-ABL fusion protein, has increased the five-year survival rate for CML patients from less than 30% in the early 2000s to about 90% currently. | [109] |
| Diagnosis of rare genetic disorders: DNA sequencing has been crucial in diagnosing rare genetic disorders. Approximately 25% of previously undiagnosed patients with rare diseases have been precisely diagnosed through genomic sequencing, as reported in a study published in The New England Journal of Medicine. | [110] |
| Impact on infectious diseases - pathogen identification: Rapid sequencing technologies have been essential in identifying and tracking infectious disease outbreaks. During the COVID-19 pandemic, for example, genomic sequencing was instrumental in identifying the SARS-CoV-2 virus and its variants, thereby guiding public health responses and vaccine development | [111] |
| Pharmacogenomics - personalized drug prescriptions: DNA sequencing has enabled personalized drug prescriptions based on individual genetic makeup. This is exemplified by the identification of genetic variations in the CYP450 family of enzymes responsible for drug metabolism, leading to personalized dosing recommendations for drugs like warfarin, significantly reducing adverse drug reactions, | [112] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).