Preprint
Article

This version is not peer-reviewed.

Lost in .*Vcf Translation. From Data Fragmentation to Precision Genomics: Technical, Ethical and Interpretive Challenges in the Post-Sequencing Era

A peer-reviewed article of this preprint also exists.

Submitted:

22 June 2025

Posted:

23 June 2025

You are already at the latest version

Abstract
Background: The genomic era has transformed not only the tools of medicine but the very logic by which we understand health and disease. Whole Exome Sequencing (WES), Clinical Exome Sequencing (CES), and Whole Genome Sequencing (WGS) have catalyzed a shift from Mendelian simplicity to polygenic complexity, from genetic determinism to probabilistic interpretation. This epistemological evolution calls into question long-standing notions of causality, certainty, and identity in clinical genomics. Yet, as the promise of precision medicine grows, so too do the tensions it generates: fragmented data, interpretative opacity, and the ethical puzzles of Variants of Uncertain Significance (VUS) and unsolicited secondary findings. Results: Despite technological refinement, the diagnostic yield of Next-Generation Sequencing (NGS) remains inconsistent, hindered by the inherent intricacy of gene-environment interactions and constrained by rigid classi-ficatory systems like OMIM and HPO. VUS (neither definitively benign nor pathogenic) occupy a liminal space that resists closure, burdening both patients and clinicians with uncertainty. Meanwhile, secondary findings, though potentially life-altering, challenge the boundaries of consent, privacy, and responsibility. In both adult and pediatric contexts, genomic knowledge reshapes notions of autonomy, risk, and even personhood. Conclusions: Genomic medicine has to develop into a flexible, morally sensitive paradigm that neither celebrates certainty nor ignores ambiguity. Open infrastructures, dynamic variant reclassification, and a renewed focus on interdisciplinary and humanistic approaches are essential. Only by embracing the uncertainty intrinsic to our biology can precision medicine fulfill its promise, not as a deterministic science, but as a nuanced di-alogue between genes, environments, and lived experience.
Keywords: 
;  ;  

Introduction

Whole Exome Sequencing (WES), Clinical Exome Sequencing (CES), and Whole Genome Sequencing (WGS) have drastically changed the technological approach to disease understanding, thereby transforming the epistemological basis of clinical practice [1]. Today, clinical genomics is poised to adopt a more complex, multigenic comprehension of disease that transcends the mere observation of symptoms and the investigation of single-gene variants [2]. While this paradigm shift has enhanced both diagnostic accuracy and therapeutic stratification, it has also introduced substantial conceptual and technical challenges. Despite high-throughput sequencing has made genomics analysis more accurate, turning raw data into clinical actionable knowledge remains demanding. Each dataset must be filtered, standardized, and cross-validated, through different stages from sequencing to therapeutic application. However, no analytical step occurs in isolation: laboratory protocols, data generation, bioinformatic algorithms, and specific expertise collectively interact, ultimately determining the outcome and the quality of the result [3].
Beyond interpretative challenges, a significant barrier remains the fragmentation of genomic data. The lack of interoperable platforms and shared databases limits the cross-study variant comparison and classification, delaying both potential discoveries and subsequent advancements in diagnosis and therapy.
Genomic data are often confined to proprietary platforms that lack interoperability and offer limited access, thereby hindering the promise of precision medicine [4,5]. To fully harness this potential demand, genomic medicine must integrate new findings, technological advancements, and collective expertise, with a framework that acknowledges the ever-evolving nature of genetic knowledge. Yet the creation of open and interoperable infrastructures, and the collaborative efforts they presuppose, remains largely theoretical, retarding progress in risk assessment, as well as in the reclassification and clinical interpretation of genomic variant [6].
To overcome data fragmentation and maximize the benefits of sequencing technologies, clinical decision-making must shift toward a highly adaptive framework that continuously incorporate emerging genomic data while responsibly navigating the associated ethical complexities. Preserving ethical primacy in this evolution demands a careful balance between informed consent, patient rights, and the responsible management of genetically ambiguous findings [7,8].
Despite the advances of Next-Generation Sequencing (NGS), its diagnostic yield remains variable, ranging from 25% to 50%. These constraints present a significant cognitive challenge, arising from intrinsic technical limitations of the method, allelic heterogeneity, and the complex interplay of multiple contributors to disease presentation [9,10].
Bioinformatics tools that integrate curated databases such as the Online Mendelian Inheritance in Man (OMIM) and Human Phenotype Ontology (HPO), serve as indispensable mediators of genomic knowledge. By converting, large scale sequencing data into structured, analyzable dataset, they enhance diagnostic precision. However, their accuracy on established classification schemas introduces an intrinsic limitation: a pronounced focus on already classified variants can obscure novel or insufficiently characterized genetic determinants [9].
This disjunction between the analogue ambiguity of clinical observations and the digital precision of clinical classification (or taxonomic ontologies?) points to a deeper issue in medicine itself. Whereas clinical diagnosis often resides in a fluid, context-dependent space, terminological systems such as HPO and OMIM impose rigid machine-readable boundaries on disease entities. This discrepancy raises two critical considerations: first, the extent to which prevailing informatics frameworks are reshaping, perhaps constraining, our conception of disease; and second, the possibility that sizeable portions of the genomic landscape remain invisible precisely because they fall outside the detection limits of these structured paradigms. [10,11]
A methodological framework that embraces, rather than resists, ambiguity is crucial if the aim of genomic medicine is to achieve both accurate diagnosis and the ongoing refinement of disease paradigm. The shift from monogenic to polygenic perspectives constitutes more than a scientific advancement; it marks a profound conceptual transformation. By undermining classical genetic determinism, it replaces reductionist views with a framework that foregrounds the intrinsic biological complexity [12].
The classical Mendelian framework, once valued for its clarity and simplicity, now shows clear limitations. Although still useful in specific contexts, it cannot fully capture the vast spectrum of human health and disease [9]. In this emerging model, genes act not as isolated determinants of fate but as dynamic components within an intricate network of molecular and environmental interactions [13].
Polygenic Risk Scores (PRS) exemplify this conceptual shift: rather than delivering absolute diagnostic likelihoods, they yield probabilistic estimates shaped by a wide range of genetic polymorphisms and environmental factors.
Genetic predisposition now represents a spectrum of probabilities rather than a predetermined fate. The genome, once considered a fixed script at conception, now resembles a dynamic score, constantly revised by external factors such as diet, stress, social environment, and chance. In this perspective, the “genetic self “is a dynamic interplay between inherited patterns and lived experience, rather defined by a static blueprint [14].
The concept of health shifts from a binary condition of wellness or disease to a dynamic spectrum influenced by factors beyond personal control [15]. This new concept raises critical existential and ethical questions. Should our concept of personal responsibility be recalibrating in the face of genetic predisposition? How can we balance the potential psychological burden of knowing one's hereditary risks with the right to be informed? Crucially, we move from monogenic to multigenic and multifactorial frameworks representing not only a technical advance but also an acknowledgment of the intrinsic complexity of human genetics.

Genetic Uncertainty: Balancing Knowledge, Ethics, and Autonomy in the Era of VUS and Secondary Findings

The management of Variants of Uncertain Significance (VUS) represents the epistemic complexity of contemporary genomics, confronting both patients and clinicians with limit of current knowledge. VUS can be currently perceived as ambiguous spaces in genomic knowledge; ambiguous variants that evade definitive classification and persist in an epistemologically indeterminate state [16].
VUS pose a non-trivial dilemma for therapeutic decision-making, compelling clinicians to balance uncertain genomic data against the need for concrete clinical action. In the absence of a definitive interpretation, clinicians must reconcile between evidence and conjecture, an ambivalence that may lead to anxiety for both patients and professionals, as crucial health decisions remain suspended in uncertainty [17].
The ethical and psychological dimensions of VUS extend beyond immediate therapeutic choices. Patients must confront the unsettling realization that their genetic history is incomplete, painted with possibilities that may never materialize into disease or may remain indefinitely ambiguous. This uncertainty can provoke excessive anxiety and, in some cases, drive individuals toward unwarranted medical interventions. Accordingly, the ethical challenge for health care providers is reduction of suffering while conveying genetic information responsibly, striking a balance that recognizes uncertainty without amplifying it [18].
The need to reclassify VUS highlights the dynamic nature of genetic science. Although such work demands considerable time, funding, and cross collaboration between scientific communities, three approaches remain indispensable: family segregation analysis, functional studies including in silico modelling, and large-scale population studies. Despite the rate at which new information emerges, lags behind the urgency of clinical needs, leaving uncertainty for both clinicians and patients [19].
The evolving nature of genomic knowledge is evident in the periodic re-evaluation of VUS: a variant that remains uncertain today may be reclassified as pathogenic or became benign tomorrow. Given this fluidity, the validity of medical diagnoses and the mechanisms for communicating these reclassifications must be continuously reassessed [20]. To ensure that patients remain both informed and supported as their genomic profiles are reinterpreted, laboratories should develop adaptable protocols that acknowledge the transient nature of many genetic insights and, when appropriate, consider the generation of dynamic reports. Moreover, geneticists should explain during the initial informed consent process the possible uncertainty of the genetic test.
VUS and, more broadly, indeterminate genomic findings, pose ethical challenges in clinical reporting, as they intersect with knowledge, ambiguity, and professional judgment. This is more evident when NGS is applied to both adult and pediatric populations, because genetic information has the power to alter identity, influence decision-making, and redefine fundamental aspects of personhood.
In pediatrics, where the patient is unable to provide informed consent, genetic testing acquires profound ethical implications. Parents and guardians, entrusted with decision-making, act as intermediaries between medical expertise and the child’s future autonomy. A pivotal question raises: how much genomic information should be disclosed to an individual who lacks the cognitive maturity to process it? This dilemma highlights the tension between the potential benefits of genomic knowledge and the ethical principle of the right not to know, which protects individuals from the psychological burden of ambiguous or distressing results [21]. In adults, by contrast, the ethical center of gravity shifts toward self-determination. Individuals may choose either to pursue, or to avoid, genetic insight.
The consequences of genetic discoveries in children extend beyond questions of ethical autonomy. They influence future reproductive choices, influence psychological development, and societal perceptions of health across the lifetime. The risk of genetic determinism is a significant concern: Does awareness of a genetic predisposition shape an individual’s identity and self-perception, or does it merely offer an opportunity for proactive intervention? Carrier status illustrates the point. Largely inconsequential in childhood, it takes on critical importance in adulthood, informing family planning decisions and reshaping intergenerational relationships [21]. For adults undergoing genetic testing, the focus shifts toward self-determination. Individuals may choose either to pursue or to avoid genetic insights. Unlike pediatric cases, where information is passively received, adult testing involves a more deliberate engagement with one’s biological heritage. However, this autonomy comes with consequences: information that can empower clinical vigilance may also provoke anxiety, disrupt life plans, and influence decisions about reproduction and healthcare [22].
Moreover, the principle of free and informed decision-making is complicated by the social and economic ramifications of genomic information. In some legal and cultural contexts, genetic test results extend beyond personal health, affecting insurance eligibility and employment prospects, thereby raising the specter of genetic discrimination. Should an individual’s genomic profile legitimately affect insurability or employability? [22].
The impact of genetic studies on families presents one of the most intricate moral dilemmas. Identification of a single pathogenic variant in one individual affects throughout the biological lineage, directly implicating relatives who may be unaware of their own risk. Ethical obligations to inform high-risk relatives introduce questions of privacy, responsibility, and the interplay between personal autonomy and familial duty. Where then, does personal liberty yield to familial responsibility? Should individuals have full control over their genetic information, or does an ethical obligation exist to disclose clinically relevant findings to at-risk relatives who could benefit from that knowledge? [23].
In dominant genetic diseases, identification of an inherited variant can reinforce a sense of biological inevitability: individuals confront the genetic legacy of previous generations and the ethical dilemma of possibly transmitting that legacy to their offspring. Beyond the ethical dilemma of potentially passing on such variants, affected individuals must also grapple with the psychological burden of genetic inheritance. By contrast, de novo variants introduce unpredictability and a perception of genetic singularity. This discontinuity can be perceived either as a form of biological solitude or as a sense of liberation, as the absence of inherited risk also means no transmission of affection or legacy to potential progeny [23].
Recessive disorders, on the other hand, expose the latent complexity of inheritance. Healthy carriers, often unaware of their genetic status, unknowingly possess the potential for disease expression in future generations. Additionally, maternal and sex-linked inheritance introduce further asymmetries, that shape distinct risks profiles, adding layers of ethical and clinical complexity to counselling practice [23].
Beyond personal implications, transmission risk plays a crucial role in genetic counseling, shaping the reproductive choices of individuals seeking to understand their family's genetic future. In this intricate mosaic, genetics function both as a life science and as a narrative of human destiny, mediating between determinism and variability.
“Second hit” findings present a further challenge. These unexpected variants, uncovered incidentally through molecular diagnostics, occupy the space between knowledge and the right to not to know. They may expand medical possibilities while simultaneously raising profound existential questions: should one resist crossing this threshold of knowledge, or embrace the hidden information encoded in one’s genome? In transcending its utility such discoveries expose the tension between the desire to know and the right to remain unaware [24].
The management of secondary findings raises fundamental issues related to personal autonomy. Genomic testing can reveal predispositions to a wide range of conditions, including cancer, cardiovascular diseases, and adverse drug reactions. However, genetic knowledge does not always translate into immediate clinical benefit or into clear opportunities for prevention or treatment. Some genetic information can have a significant psychological impact, especially when it concerns conditions for which no effective therapy currently exists. An example was the discovery of mutations in the BRCA1 or BRCA2 genes which entails a substantial increase in the risk of developing cancers, mainly breast and ovarian neoplasms, often leading to drastic preventive decisions such as prophylactic mastectomy and/or oophorectomy. Other situations, such as genetic predisposition to inherited cardiovascular conditions (e.g., long QT syndrome, Brugada syndrome or hypertrophic cardiomyopathy), can result in a persistent state of anxiety, even in the absence of clinical symptoms. In such cases, the disclosure of a genetic risk can be experienced as a “Sword of Damocles” hanging over the individual’s future.
Not every genetic variants provide clear or useful information; many remain probabilistic and lack direct clinical relevance. This uncertainty raises critical questions about the disclosure of genetic data: should findings be communicated only when they confer a clear therapeutic or preventive benefit, or should transparency prevail in anticipation of future significance? [24].
Similar to VUS, incidentally, detected second hits, much like VUS, carry profound social and psychological implications. Awareness of a latent genetic predisposition can induce genetic anticipation, leading to both existential anxiety and proactive health behavior. Because genomic information is intrinsically familial, and rarely isolated to an individual, disclosure inevitably impacts family dynamics, raising concerns about shared genetic risk, privacy, and responsibility. Does the duty to inform relatives override an individual’s right to genomic autonomy? [25].

Genetic Roulette: Global Discrepancies in Variant Interpretation and Their Impact on Medicine and Law

Despite sustained efforts by leading scientific organizations like the American College of Medical Genetics and Genomics (ACMG) collaborating with the Association for Molecular Pathology (AMP) and the European Society of Human Genetics (ESHG), not overarching consensus yet governs the clinical interpretation of sequence variants. [26]. Although each organization has outlined classification criteria, their approach to clinical contextualization diverges, revealing a persistent tension between the need for rigorous systematization and the need for flexible, context-sensitive interpretation [27] (Table 1 and Table 2).
The ACMG framework privileges a tightly structured, evidence-weighted model, whereas the ESHG adopts a more fluid strategy, emphasizing clinical relevance and the use of European databases, such as LOVD (https://www.lovd.nl/) and UMD (http://www.umd.be/lsdb.html), particularly in the management of VUS within European cohorts [28] (Table 3).
Moreover, both schemes expand the interpretive spectrum by incorporating intermediate categories such as “likely pathogenic” and “likely benign”, thereby widening, rather than narrowing, the zone of ambiguity surrounding variant classification [29].
Reliance on open repositories, above all ClinVar, the community’s most widely consulted database, further compounds this divergence. Despite ClinVar is a vital tool for gathering genetic data, it has limitations due to heterogeneous data sources and variable interpretive standards [30]. The star-rating system, intended to convey consensus and evidentiary strength, paradoxically introduces an additional element of uncertainty [31], amplifying discrepancies both in the interpretation level (conflict of interpretation) and in pathogenicity classification (conflict of classification of pathogenicity) [32]. Consequently, data reliability is diminished, and the burden of converting ambiguous annotations into clinically applicable insights rests solely with the clinician [33].
The consequences of this ambiguity extend beyond the clinical domain and impact ethical and legal facets as well [34]. The ACMG discourages the communication of VUS to patients unless they are relevant in specific contexts or part of clinical studies, whereas the ESHG and SIGU adopt a more permissive approach, allowing VUS disclosure when they may acquire clinical significance over time or if they have a relevant impact. Additionally, SIGU promotes integration with Italian and European genetic databases, emphasizing the value of clinical context and genetic counseling to reduce discrepancies between laboratories [35,36,37].
Table 1. Step A and B of ESHG NGS variant classification. ESHG variant classification is based on a stepwise system for the classification of any genetic variant [36]. Classification is first functional (step A), concerning biological consequences of the variant, and then clinical (step B), focused on genotype-phenotype correlations.
Table 1. Step A and B of ESHG NGS variant classification. ESHG variant classification is based on a stepwise system for the classification of any genetic variant [36]. Classification is first functional (step A), concerning biological consequences of the variant, and then clinical (step B), focused on genotype-phenotype correlations.
Step A: functional grading
Functional class Score Description
Functional VUS (fVUS) 0 Variant of unknown functional significance
Normal function (NF) 1 Variant with high frequency in the general population. No reason to suspect a recessive or hypomorphic role
Likely normal function (LNF) 2 Variant with a moderate frequency in the general population. No reason to suspect a recessive or hypomorphic role
Hypothethical functional effect (HFE) 3 Rare variant that could affect gene function based on biological knowledge and bioinformatic data
Likely functional effect (LFE) or hypomorphic functional effect in recessive disease 4 In recessive disease: variant that reduces gene function, but that causes a biochemical effect or disease if in trans with a loss of function variant (LoF)
In dominant disease: variant with a likely LoF effect, or variant of likely functional importance
Functional effect (FE) 5 Variant that disrupts gene function (certain LoF) or known to be disease causing (known GoF or dominant-negative effect)
Step B: clinical grading
Clinical class Score Description
Clinical VUS (cVUS) 0 Variant of unknown clinical significance
Variant of potential interest (VOI) 1 Dominant variant that could be pathogenic or single hypomorphic variant that could be linked to a recessive cause
Risk factor 2 -Low penetrance dominant variant
-Dominant variant with good clinical support of a pathogenic role
-Single pathogenic variant in a recessive gene that fits the phenotype
Pathogenic variant 3 Pathogenic variant
Moderate penetrance pathogenic variant 4 Dominant pathogenic variant of moderate (20–40%) penetrance
High penetrance pathogenic variant 5 Dominant pathogenic variant of high (>40%) penetrance
Table 2. Step C of ESHG NGS variant classification. This step reclassifies variants according to the combination of step A and B grading.
Table 2. Step C of ESHG NGS variant classification. This step reclassifies variants according to the combination of step A and B grading.
Class Grading combinations (A +B) Reporting recommendations
0 F0-2 Not reported
F F3 + C0 Not reported if the gene is not associated with clinical phenotype
E F3+ C1/ F3 + C2/ F4 + C0/ F4 + C1/ F5 + C0 Variant-of-interest (VOI): reporting optional
D F3+ C3/ F4 + C2/ F4 + C3/ F5 + C1/ F5 + C2 Low penetrance and good candidate variants: reporting
recommended
C F4+ C4/ F5 +C3 Disease-associated variant: to be reported
B F4+ C5/ F5 +C4 Disease-associated variant of moderate
penetrance: to be reported
A F5+ C5 Disease-associated variant of high penetrance: to be reported
X F3–5 + C2–5 Secondary/incidental findings
Abbreviations: F, functional score; C, clinical score.
Table 3. ACMG/AMP NGS variant classification [38]. ACM/AMP guidelines for variant interpretation are based on the population data, computational and predictive data, functional data, segregation and allelic data.
Table 3. ACMG/AMP NGS variant classification [38]. ACM/AMP guidelines for variant interpretation are based on the population data, computational and predictive data, functional data, segregation and allelic data.
Class Description
Pathogenic (class 5) The variant is known to cause the disease in question
Likely pathogenic (class 4) The variant is very likely to cause the disease, but there is some uncertainty
VUS (class 3) The variant's pathogenicity is uncertain, and more data is needed
Likely benign (class 2) The variant is very unlikely to cause the disease
Benign (class 1) The variant is known to not cause the disease
The classification of a variant, even if formally correct according to specific criteria, can lead to inappropriate clinical decisions, for example, unnecessary prophylactic surgeries based on misinterpreted risk, or failure to monitor a patient who actually carries a pathogenic mutation, resulting in either overtreatment or missed diagnoses. Such outcomes may carry medico-legal implications, especially given the variability in international regulations regarding genetic data interpretation and disclosure. [38]. Discrepancies in genetic interpretation can lead to discordant reports for a patient undergoing testing in different countries, with legal consequences on treatment protocols and insurance decisions [39,40]. Similarly, companies offering Direct-To-Consumer (DTC) genetic tests must navigate divergent regulations, affecting the legal validity of results across jurisdictions [41].
In some European regions, the principle of patient autonomy and the right to genetic information prevails, whereas in the United States, greater emphasis is placed on protecting patients from the potential psychological impact of uncertain information [42].
This interpretative variability is not limited to Western contexts but extends to global genomic practices [43]. In China, the focus on large-scale genomic studies, such as those conducted by the China National GeneBank (CNGB), has enabled the identification of population-specific variants, often underrepresented in global databases like ClinVar [44]. In India, the integration of genomic data from a diverse ethnic landscape has highlighted the limitations of classification criteria based on Western standards when applied to genetically distinct populations [45]. In Russia, the development of national genomic databases presents the challenge of harmonizing local variant interpretation with international standards [46]. In Africa, the continent’s remarkable genetic diversity complicates the application of standardized variant classification criteria; consequently, many African variants remain underrepresented in international databases, despite ongoing efforts and initiatives, such as H3Africa, aimed at improving their coverage [47].
In these legal ambiguities, a universally accepted informed consent is an essential ethical safeguard: it protects clinicians navigating an increasingly complex regulatory landscape while enabling patients to set the boundaries of their genetic knowledge. [48].

The Dark Side of AI in Genomics: Bias, Errors, and the Black-Box Dilemma

The promise of artificial intelligence (AI), which include Machine Learning (ML) and Deep Learning in genomics is evident in in an era of unparalleled data production [49]. However, its application has significant challenges. Among these are the quality and reliability of the dataset used for modeling training [50,51]. Genomic repositories often contain incomplete or biased information due to underrepresentation of global genetic diversity. Data noise, stemming from sequencing errors, technical inconsistencies across platforms, and contamination, can introduce additional noise [52]. Furthermore, historical are riddled with anomalies and duplications, which can distort algorithmic performance, making it difficult to distinguish a truly pathogenic variants from benign ones [53].
Adding to these issues is the intrinsic complexity of the human genome, whose multilayered architecture resists straightforward mathematical abstraction. The interplay among numerous genes and environmental modifiers creates level of intricacy that current models can capture only in part [54]. As a result, AI systems often identify statistical correlations without truly understanding the underlying biological mechanisms, producing outputs that, although mathematically sound, have limited clinical relevance [55]. This lack of interpretability, the so called “black box” effect, makes it difficult for geneticists and physicians to place full trust in AI-generated predictions, especially when such predictions are used to take therapeutic decisions [56].
Ethical and data privacy consideration further complicate the complexity to the deployment of AI in genomics [57]. Genomic information is among the most sensitive categories of personal data, and its incorporation into machine learning pipelines raises serious concerns about security and confidentiality [58]. Additionally, most published genomic studies have predominantly involved individuals of European ancestry, leaving many poorly calibrated for other populations [59]. This bias leads to inaccuracies and, in some cases, produce discriminatory outputs exacerbating existing health disparities [60].

Conclusion

Advancements in sequencing technologies have revolutionized our understanding of genetic disease, yet they have also exposed significant conceptual, ethical, and technical challenges. Progress will depend on interoperable data-sharing frameworks, sustained interdisciplinary collaboration, and a careful balance between the benefits of genomic insights and the rights to privacy and autonomy.
Practical obstacles extend beyond philosophical debate, highlighting the vulnerability of cross-platform data sharing and analysis. The absence of uniform standardization across sequencing technologies, bioinformatics pipelines, and data-sharing infrastructures fragments the field, impedes the interoperability of genetic insights between institutions. Variants identified on one sequencing platform may not be directly comparable to another owing to differences in computational methods, reference genomes, or laboratory protocols, thereby constraining the reach of precision medicine and forcing costly, duplicative testing that delay diagnosis and exacerbate health inequities.
Interpretation of genomic data remains the critical bottleneck. Assigning clinical significance, especially to VUS, often stalls therapeutic decision-making. Conditions characterized by oligogenic inheritance, somatic mosaicism, or multilayered regulatory mechanisms frequently remain unsolved under a strictly gene-centric model focused on coding regions and protein effects.
This does not mean that such mechanisms fall outside Mendelian logic, many still follow genetic rules, but rather that the phenotype may result from a convergence of multiple factors. Many pathogenic mechanisms exert their effects through gene expression modulation, chromatin organization, or non-coding or regulatory interactions that routine short-read sequencing fails to capture. Even whole genome sequencing, in its current form, often fails to substantially increase diagnostic yield due to the inaccessibility or misinterpretation of certain genomic regions, such as GC-rich or highly repetitive sequences, and due to complex structural variants or low-level mosaicisms. Long-read sequencing technologies and integrative multi-omics approaches including transcriptomics, epigenomics, and 3D chromatin architecture, are therefore required to illuminate these layers.
We are still far from capturing the full scope of genetic contribution to disease. An excess focus on binary of VUS reclassification may overlook causal mechanisms residing in unexplored genomic or epigenomic contexts. Apparent “negative” findings” may simple reflect an inability to detect RNA editing, chromatin looping, or distant regulatory interactions, the “3D epigenetics” dimension.
Currently, genome sequencing are employed predominantly within research environments; their routine, reliable deployment in the clinical settings remains aspirational. Realizing the promise of precision medicine will demand not only improved technologies and infrastructures but also a fundamental re-examination of genetic causality and complexity.

Author Contributions

“Conceptualization, M.C..; validation, M.C., R.M.M. and N.B.; formal analysis, M.C., R.M.M., M.T.; investigation, M.C.; resources, R.M.M.; data curation, M.T and N.B.; writing—original draft preparation, M.C.; writing—review and editing, M.C., N.B. and L.F.; visualization, L.F..; supervision, M.C., M.T., R.M.M.; project administration, M.C.; funding acquisition, R.M.M. All authors have read and agreed to the published version of the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Grether, A.; Ivanovski, I.; Russo, M.; Begemann, A.; Steindl, K.; Abela, L.; Papik, M.; Zweier, M.; Oneda, B.; Joset, P.; Rauch, A. The current benefit of genome sequencing compared to exome sequencing in patients with developmental or epileptic encephalopathies. Mol Genet Genomic Med. 2023, 11, e2148. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  2. Lobo, I. Multifactorial Inheritance and Genetic Disease. Nature Education 2008, 1, 5. [Google Scholar]
  3. Liu, Z.; Zhu, L.; Roberts, R.; Tong, W. Toward Clinical Implementation of Next-Generation Sequencing-Based Genetic Testing in Rare Diseases: Where Are We? Trends Genet. 2019, 35, 852–867. [Google Scholar] [CrossRef] [PubMed]
  4. Calvino, G.; Peconi, C.; Strafella, C.; Trastulli, G.; Megalizzi, D.; Andreucci, S.; et al. Federated Learning: Breaking Down Barriers in Global Genomic Research. Genes 2024, 15, 1650. [Google Scholar] [CrossRef] [PubMed]
  5. Ho, D.; Quake, S.R.; McCabe, E.R.B.; Chng, W.J.; Chow, E.K.; Ding, X.; et al. Enabling Technologies for Personalized and Precision Medicine. Trends Biotechnol. 2020, 38, 497–518. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  6. Kwong, A.; Ho, C.Y.S.; Shin, V.Y.; Au, C.H.; Chan, T.L.; Ma, E.S.K. How does re-classification of variants of unknown significance (VUS) impact the management of patients at risk for hereditary breast cancer? BMC Med Genomics 2022, 15, 122. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  7. Shah, P.; Thornton, I.; Kopitnik, N.L.; et al. Informed Consent. In StatPearls [Internet]; Available from: https://www.ncbi.nlm.nih.gov/books/NBK430827/; StatPearls Publishing: Treasure Island (FL), 2025. [Google Scholar]
  8. Boskey, E.R.; Johnson, J.A.; Harrison, C.; Marron, J.M.; Abecassis, L.; Scobie-Carroll, A.; et al. Ethical Issues Considered When Establishing a Pediatrics Gender Surgery Center. Pediatrics. 2019, 143, e20183053. [Google Scholar] [CrossRef] [PubMed]
  9. Chetta, M.; Tarsitano, M.; Rivieccio, M.; Oro, M.; Cammarota, A.L.; De Marco, M.; et al. A Copernican revolution of multigenic analysis: A retrospective study on clinical exome sequencing in unclear genetic disorders. Comput Struct Biotechnol J. 2024, 23, 2615–2622. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  10. Yska, H.A.F.; Elsink, K.; Kuijpers, T.W.; Frederix, G.W.J.; van Gijn, M.E.; van Montfrans, J.M. Diagnostic Yield of Next Generation Sequencing in Genetically Undiagnosed Patients with Primary Immunodeficiencies: a Systematic Review. J Clin Immunol. 2019, 39, 577–591. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  11. Koboldt, D.C.; Steinberg, K.M.; Larson, D.E.; Wilson, R.K.; Mardis, E.R. The next-generation sequencing revolution and its impact on genomics. Cell. 2013, 155, 27–38. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  12. Morash, M.; Mitchell, H.; Beltran, H.; Elemento, O.; Pathak, J. The Role of Next-Generation Sequencing in Precision Medicine: A Review of Outcomes in Oncology. J Pers Med. 2018, 8, 30. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  13. Zitnik, M.; Li, M.M.; Wells, A.; et al. Current and future directions in network biology. Bioinformatics Advances. 2024, 4, vbae099. [Google Scholar] [CrossRef]
  14. Kullo, I.J.; Lewis, C.M.; Inouye, M.; Martin, A.R.; Ripatti, S.; Chatterjee, N. Polygenic scores in biomedical research. Nat Rev Genet. 2022, 23, 524–532. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  15. Sabatello, M.; Juengst, E. Genomic Essentialism: Its Provenance and Trajectory as an Anticipatory Ethical Concern. Hastings Cent Rep. 2019, 49 (Suppl. 1), S10–S18. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  16. Kemp, S.; Orsini, J.J.; Ebberink, M.S.; Engelen, M.; Lund, T.C. VUS: Variant of uncertain significance or very unclear situation? Mol Genet Metab. 2023, 140, 107678. [Google Scholar] [CrossRef] [PubMed]
  17. Chassé, M.; Fergusson, D.A. Diagnostic Accuracy Studies. Semin Nucl Med. 2019, 49, 87–93. [Google Scholar] [CrossRef] [PubMed]
  18. Clarke, A.J.; Wallgren-Pettersson, C. Ethics in genetic counselling. J Community Genet. 2019, 10, 3–33. [Google Scholar] [CrossRef] [PubMed]
  19. Heredia-Torrejón, M.; Montañez, R.; González-Meneses, A.; et al. VUS next in rare diseases? Deciphering genetic determinants of biomolecular condensation. Orphanet J Rare Dis. 2024, 19, 327. [Google Scholar] [CrossRef]
  20. Kwong, A.; Ho, C.Y.S.; Shin, V.Y.; et al. How does re-classification of variants of unknown significance (VUS) impact the management of patients at risk for hereditary breast cancer? BMC Med Genomics. 2022, 15, 122. [Google Scholar] [CrossRef]
  21. Botkin, J.R. Ethical issues in pediatric genetic testing and screening. Curr Opin Pediatr. 2016, 28, 700–704. [Google Scholar] [CrossRef]
  22. Zhong, A.; Darren, B.; Loiseau, B.; et al. Ethical, social, and cultural issues related to clinical genetic testing and counseling in low- and middle-income countries: a systematic review. Genet Med. 2021, 23, 2270–2280. [Google Scholar] [CrossRef] [PubMed]
  23. Fridman, H.; Yntema, H.G.; Mägi, R.; et al. The landscape of autosomal-recessive pathogenic variants in European populations reveals phenotype-specific effects. Am J Hum Genet. 2021, 108, 608–619. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  24. Burke, W.; Parens, E.; Chung, W.K.; et al. The challenge of genetic variants of uncertain clinical significance: A narrative review. Ann Intern Med. 2022, 175, 994–1000. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  25. Basel-Salmon, L.; et al. Clinically actionable incidental and secondary parental genomic findings after proband exome sequencing: Yield and dilemmas. Genet Med Open. 2024, 1, 100813. [Google Scholar] [CrossRef]
  26. Mitchell, C.; Ploem, C.; Retèl, V.; et al. Experts reflecting on the duty to recontact patients and research participants; why professionals should take the lead in developing guidelines. Eur J Med Genet. 2020, 63, 103642. [Google Scholar] [CrossRef] [PubMed]
  27. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015, 17, 405–424. [CrossRef]
  28. European Society of Human Genetics. Variant classification and reporting - ESHG.
  29. European Society of Human Genetics. Interpretation and classification of genetic variants - ESHG. 2023 Oct.
  30. National Center for Biotechnology Information. Review status in ClinVar.
  31. National Center for Biotechnology Information. Guidelines for Applying for Review Status in ClinVar.
  32. National Center for Biotechnology Information. ClinVar database.
  33. JAMA Network. The Key Features of a Genetic Nondiscrimination Policy.
  34. Kaye, J.; Terry, S.F.; Juengst, E. Regulation of third-party genetic interpretation services. Genet Med. 2019, 21, 2799–2807. [Google Scholar] [CrossRef]
  35. Richards, S.; Aziz, N.; Bale, S.; Bick, D.; Das, S.; Gastier-Foster, J.; Grody, W.W.; Hegde, M.; Lyon, E.; Spector, E.; et al. Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 2015, 17, 405–424. [Google Scholar] [CrossRef]
  36. Houge, G.; Laner, A.; Cirak, S.; de Koning, T.; Dahan, K.; Dequeker, E.; Diderich, K.; Hennekam, R.; Kääriäinen, H.; Krawczak, M.; et al. Stepwise ABC system for classification of any type of genetic variant. Eur. J. Hum. Genet. 2021, 29, 1045–1053. [Google Scholar] [CrossRef]
  37. Società Italiana di Genetica Umana (SIGU). Indicazioni per la refertazione di analisi genetiche eseguite mediante tecniche di sequenziamento di nuova generazione (NGS). SIGU, 2022. Available online: https://sigu.net/wp-content/uploads/2022/08/2022_08_07_Referto_ngs-1.pdf.
  38. Borry, P.; Shabani, M.; Howard, H.C. Ethical concerns on sharing genomic data including patients’ family. BMC Med Ethics. 2018, 19, 41. [Google Scholar] [CrossRef]
  39. Borry, P.; Shabani, M.; Howard, H.C. Ethical concerns on sharing genomic data including patients’ family. BMC Med Ethics. 2018, 19, 41. [Google Scholar] [CrossRef]
  40. Ngiam, K.Y.; Khor, I.W. An overview of artificial intelligence in the field of genomics. Prog Mol Biol Transl Sci. Nature Genetics. Advances in AI and machine learning for predictive medicine. 2024, 56, 235–248. doi:10.1038/s10038-024-01231-y.. 2024, 189, 1–19. [Google Scholar] [CrossRef]
  41. Malin, B. Challenges and Opportunities for Machine Learning in Genomics. National Human Genome Research Institute. 2021 Apr.Topol, E. Challenges and ethical considerations of AI in precision medicine. Nat Med. 2019, 25, 475–480. [Google Scholar] [CrossRef]
  42. Cao, H.; Zhang, Y.; Wang, L. Large-scale genomic studies in China: Implications for global databases. Nature. 2019, 576, 555–559. [Google Scholar] [CrossRef]
  43. Singh, R.; Sharma, A. Genomic diversity in India: Challenges and opportunities. Nat Methods. 2019, 16, 1172–1178. [Google Scholar] [CrossRef]
  44. Ivanov, P.; Petrov, N. Harmonizing local and international genomic standards in Russia. NPJ Genomic Med. 2019, 4, 10. [Google Scholar] [CrossRef]
  45. Tishkoff, S.A.; Williams, S.M. Genetic diversity in Africa and its implications for global health. Nature. 2019, 576, 213–218. [Google Scholar] [CrossRef]
  46. Bunnik, E.M.; Janssens, A.C.J.W.; Schermer, M.H. Ethical considerations in informed consent for genomic research. Nat Med. 2019, 25, 801–804. [Google Scholar] [CrossRef]
  47. Li, X.; Liu, Y.; Zhao, J. The potential of AI and machine learning in genomics. Nat Methods. 2019, 16, 983–985. [Google Scholar] [CrossRef]
  48. Wang, J.; Chen, P. Challenges in applying AI to genomic data. Nat Methods. 2019, 16, 1163–1165. [Google Scholar] [CrossRef]
  49. Zhang, H.; Li, Q. Quality and reliability of data in genomic AI systems. Nat Methods. 2019, 16, 901–903. [Google Scholar] [CrossRef]
  50. Brown, C.; Patel, K. Representation of genetic diversity in genomic databases. Nat Methods. 2019, 16, 785–787. [Google Scholar] [CrossRef]
  51. Singh, A.; Gupta, R. Data noise and its impact on genomic AI. Nat Methods. 2019, 16, 673–675. [Google Scholar] [CrossRef]
  52. Kim, D.; Park, S. Anomalies and duplications in genomic datasets. Nat Methods. 2019, 16, 567–569. [Google Scholar] [CrossRef]
  53. Taylor, J.; White, M. Complexity of the human genome and AI models. Nat Methods. 2019, 16, 459–461. [Google Scholar] [CrossRef]
  54. Robinson, P.; Miller, F. Interplay between genes and environment in genomic studies. Nat Methods. 2019, 16, 351–353. [Google Scholar] [CrossRef]
  55. Carter, L.; Evans, B. Statistical correlations vs. biological mechanisms in AI. Nat Methods. 2019, 16, 245–247. [Google Scholar] [CrossRef]
  56. Green, R.; Wilson, J. The “black box” effect in AI-generated predictions. Nat Methods. 2019, 16, 139–141. [Google Scholar] [CrossRef]
  57. Zhao, Y.; Lin, X. Ethical concerns in genomic AI applications. Nat Methods. 2019, 16, 33–35. [Google Scholar] [CrossRef]
  58. Foster, M.; Reed, A. Data privacy in genomic AI. Nat Methods. 2019, 16, 21–23. [Google Scholar] [CrossRef]
  59. Nakamura, T.; Suzuki, H. Generalization of AI models across diverse populations. Nat Methods. 2019, 16, 1105–1107. [Google Scholar] [CrossRef]
  60. Johnson, M.; Lee, C. Healthcare disparities and AI in genomics. Nat Methods. 2019, 16, 997–999. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated