Preprint
Review

This version is not peer-reviewed.

Beyond Mutation Detection: Cell‐Free DNA for Functional Inference and Adaptive Oncology

Submitted:

30 March 2026

Posted:

01 April 2026

You are already at the latest version

Abstract
Liquid biopsy has evolved beyond its original role as a minimally invasive approach for mutation detection and is now being developed as a broader analytical framework for cancer detection, stratification, and longitudinal monitoring. Improvements in next-generation sequencing, assay chemistry, and computational analysis have increased analytical sensitivity, including in settings with low tumor fraction and very low variant allele abundance. These advances have expanded the utility of cfDNA analysis in measurable residual disease assessment and in the detection of low-abundance tumor-derived signals across multiple clinical contexts. At the same time, the field has shifted toward interpreting cfDNA as a carrier of higher-order biological information rather than solely a substrate for mutation calling. Fragmentation profiles, nucleosome positioning, and chromatin accessibility patterns derived from plasma DNA have been used to infer transcriptional and regulatory states, raising the possibility that cfDNA may capture functional tumor states not readily accessible through genotype-focused assays alone. These developments have prompted growing interest in chromatin-informed cfDNA analysis as a means of identifying pathway activity, enhancer usage, transcription factor occupancy, and potentially actionable biological dependencies. However, the translational relevance of many such inferences remains incompletely established. In this review, we examine the analytical advances underlying these approaches, assess the current evidence supporting their biological and clinical utility, and consider the extent to which cfDNA-derived regulatory inference may contribute to adaptive oncology and therapeutic decision-making.
Keywords: 
;  ;  ;  ;  ;  ;  ;  

1. Introduction

Liquid biopsy has substantially expanded the analytical toolkit available for cancer monitoring by enabling minimally invasive access to tumor-derived material in blood and other biofluids. Tumors release multiple analytes into circulation, including cell-free DNA (cfDNA), circulating tumor DNA (ctDNA), circulating tumor cells, and extracellular vesicles. Collectively, these analytes can provide a dynamic view of disease that partly mitigates the sampling bias and temporal limitations inherent to tissue biopsy [1,2,3,4]. Among them, cfDNA has emerged as a central component of precision oncology because it can be repeatedly sampled and profiled for genomic, epigenomic, and structural features associated with tumor evolution and treatment response [2,3]. In this review, cfDNA refers to extracellular DNA recovered from plasma using standard isolation workflows, which predominantly capture nucleosome-associated fragments; extracellular vesicle–associated DNA is not specifically resolved in most current sequencing protocols and is therefore not considered separately.
The field was initially driven by the demonstration that circulating DNA could reveal tumor-derived genomic alterations, enabling minimal residual disease (MRD) assessment. With the introduction of ultra-deep sequencing, molecular barcoding, and duplex error-correction approaches, ctDNA assays achieved sensitivity sufficient for clinically relevant detection at very low allele fractions. These technical advances enabled practical applications in advanced disease, recurrence monitoring, and increasingly, early detection [5,6,7,8]. Nevertheless, mutation-centered approaches provide only a partial view of tumor biology and are less informative in tumors shaped by epigenetic dysregulation, transcriptional plasticity, or adaptive resistance programs that arise without recurrent genomic alterations [9,10].
This limitation has motivated a broader interpretation of cfDNA. Plasma DNA retains structural and epigenetic features related to its chromatin origin, including information linked to nucleosome positioning, transcription factor occupancy, chromatin accessibility, and fragmentation topology [11,12,13,14,15]. cfDNA should therefore not be regarded solely as sequence material. Under appropriate analytical conditions, it may also function as a readout of chromatin organization and regulatory state.
Accordingly, fragmentomics has expanded from descriptive analysis of fragment size into a broader framework encompassing fragment-length distributions, fragment-end motifs, nucleosome phasing, regional fragmentation patterns, and methylation-associated cleavage signatures [13,14,21,22]. These features may provide access to functional aspects of tumor biology that are not readily captured by mutation profiling alone [9,10,11,12,14]. Artificial intelligence and machine learning have further accelerated this area by enabling multimodal integration of fragmentation, methylation, copy-number, and sequence-derived features for cancer detection, tissue-of-origin prediction, and biological interpretation [15,16,17,18,24]. Some recent models aim to move beyond classification by inferring regulatory activity, pathway dysregulation, and aspects of transcriptional state from cfDNA data [12,14,27,28,29].
A related and more speculative extension is the use of cfDNA-derived chromatin features to nominate candidate therapeutic vulnerabilities, particularly when integrated with CRISPR dependency maps, regulatory atlases, and orthogonal tumor profiling data [12,14,27,28]. At present, this does not justify treating cfDNA as a stand-alone target discovery platform. It does, however, support the view that cfDNA may serve as a useful discovery layer for generating and prioritizing mechanistically grounded therapeutic hypotheses.
In this review, we evaluate how cfDNA is being positioned for this broader role in oncology. We first consider the biological basis of cfDNA as an informational substrate, then examine multimodal and AI-enabled analytical strategies, and finally assess how genomic and epigenomic cfDNA features may be applied to resistance monitoring, patient stratification, target nomination, and treatment optimization. We also address the methodological and translational limitations that currently constrain the field.

2. Biological Foundations and Translational Utility of cfDNA in Adaptive Oncology

2.1. Biological Foundations of cfDNA as an Informational Substrate

2.1.1. Origins and Release Mechanisms

Most circulating cfDNA is thought to arise from apoptotic cells, in which endonuclease-mediated chromatin cleavage generates nucleosome-protected double-stranded DNA fragments associated with histones and other chromatin proteins [11,21,23]. As a result, a substantial fraction of plasma cfDNA exists as cell-free nucleosomes. This is a central observation because it provides the biological basis for interpreting cfDNA as a structured analyte rather than merely a degradation product.
Additional release mechanisms contribute further heterogeneity. Necrosis typically generates longer and more heterogeneous fragments because chromatin degradation is less orderly [2,30]. NETosis contributes extracellular chromatin derived from neutrophils and links cfDNA biology to inflammation, immune signaling, and thrombosis [33,34]. Active secretion through extracellular vesicles may also contribute DNA cargo, including nucleosomal and supercoiled forms, particularly in settings of inflammation or treatment-induced stress [2,23]. The plasma cfDNA pool should therefore be understood as a composite output of multiple cell death and secretion processes rather than a uniform analyte.

2.1.2. Fragmentation Biology, Nucleosome Footprints, and Regulatory Information

cfDNA fragmentation is non-random and is shaped by chromatin architecture, nuclease activity, local DNA accessibility, and methylation state [11,12,13,21,22]. Genome-wide analyses have shown that plasma DNA preserves an in vivo nucleosome footprint: nucleosome-protected regions are relatively shielded from cleavage, whereas open chromatin regions are more accessible and generate shorter fragments with characteristic endpoint distributions [11,12,13]. Nuclease activity contributes an additional layer of specificity. DNASE1L3, for example, has been associated with C-rich fragment ends, while altered nuclease activity in cancer influences end-motif composition and motif diversity [21,22]. Preferential cleavage at methylated CpG sites further links fragmentation patterns to local epigenetic states.
At actively transcribed loci, nucleosome depletion near transcription start sites and phased fragmentation patterns in adjacent regions have been used to infer transcriptional activity directly from plasma DNA [12,14]. Collectively, these observations underpin fragmentomics and nucleosomics as analytical strategies that use fragment length, endpoints, end motifs, and nucleosome phasing to reconstruct regulatory landscapes from cfDNA [11,12,13,14,21,22,23]. Although the biological rationale is strong, the degree to which such inferences are robust across platforms, cohorts, and clinical settings remains an active question.

2.1.3. Distinctive Features of ctDNA and the Relevance of Epigenetic Signals

Tumor-derived cfDNA differs from background plasma DNA across several feature classes, including fragment size, end-motif composition, methylation state, chromatin accessibility signatures, and genomic content [13,14,15,16,21,22,23]. These differences may offer a richer representation of tumor biology than mutation profiling alone, particularly in cancers driven by epigenetic instability, lineage plasticity, or adaptive resistance mechanisms that do not depend on recurrent coding mutations [9,10,14,15,16,17,18].
However, biological richness should not be conflated with clinical utility. Only a subset of circulating molecules is tumor derived, and many high-dimensional cfDNA features are strongly influenced by tumor fraction, tissue admixture, preanalytical handling, library preparation, sequencing depth, and computational modeling choices [2,7,20,24,25]. Consequently, the field has produced increasingly detailed characterizations of fragment length, end motifs, nucleosome spacing, chromatin accessibility, and vesicle-associated DNA without yet achieving comparable gains in routine treatment decision-making. At present, the strongest clinical use cases remain mutation detection, methylation-based classification, and residual disease assessment. Many higher-order structural and epigenomic features are better regarded as biologically informative research characteristics than as validated clinical endpoints [15,16,17,18,19,20].
The critical issue is therefore not whether cfDNA can generate additional descriptors, but whether these descriptors can be converted into reproducible functional inferences—such as pathway activation, lineage transition, resistance state, or therapeutic vulnerability—that improve patient stratification or treatment selection. Until such relationships are prospectively demonstrated and analytically standardized, much of the apparent richness of cfDNA remains descriptive rather than decisional.

2.2. Multimodal cfDNA Analysis and AI/ML-Enabled Translation

2.2.1. Rationale for Multimodal Integration

Multimodal cfDNA analysis arises from a fundamental constraint: no single feature class adequately captures the complexity of tumor-derived signals in plasma, particularly in low-tumor-fraction settings. Current frameworks therefore integrate partially independent signals—including DNA methylation, fragment size, fragment-end motifs, copy-number alterations, and somatic variants—within unified computational models [15,16,17,18]. In principle, this approach can improve analytical sensitivity when individual features are too weak or noisy to support stable inference on their own.
However, multimodal integration should not be assumed to confer greater biological insight or clinical utility. Combining analytes introduces additional technical variability, increases model complexity, and complicates attribution of model outputs to specific biological features. In practice, reported gains are often modest and highly dependent on preprocessing, cohort composition, and model training, raising the possibility that performance improvements reflect technical rather than biological integration [24].
The central translational question is therefore not whether additional cfDNA features can be combined, but whether multimodal models yield outputs that are reproducible, interpretable, and clinically actionable. At present, such advantages have not been consistently demonstrated in prospective or interventional settings [16,17,18,19,20,24].

2.2.2. Representative Multimodal Research Platforms

Several recent studies illustrate the analytical potential of multimodal cfDNA modeling. THEMIS demonstrated that enzyme-mediated methylome sequencing can preserve fragmentation information while simultaneously capturing whole-genome methylation, fragment size, fragment-end motifs, and copy-number alterations from a single cfDNA sample [15]. MESA extended this approach by incorporating chromatin-derived features, including nucleosome occupancy and window protection scores, supporting the view that plasma DNA retains structurally informative signatures of tumor chromatin organization [16]. Additional platforms, such as AlphaLiquid and SPOT-MAS, further suggest that integrated models combining methylation, copy number, and fragmentation may outperform single-feature approaches for cancer detection and tissue-of-origin prediction [17,18].
A conceptual extension of this work is that certain fragmentation-derived features may support functional interpretation rather than classification alone. For example, promoter fragmentation entropy and nucleosome-depleted region signals have been linked to inferred gene-expression states, raising the possibility that cfDNA may encode aspects of regulatory architecture in addition to tumor presence [14]. These observations are biologically informative, but most mechanistic applications remain investigational and require validation across independent datasets and clinical contexts.

2.2.3. Clinical Translation and Commercial Assays

Clinical translation of AI- and ML-based cfDNA analysis has advanced most clearly in cancer detection and tissue-of-origin classification. Its role in treatment selection or target nomination remains substantially less developed. Galleri, developed through the CCGA and PATHFINDER programs, is a prominent example of a methylation-based multi-cancer early detection assay that relies on machine-learning classification [19,20]. In colorectal cancer screening, Shield and the PREEMPT CRC validation study further demonstrate that blood-based classification frameworks can be deployed in large prospective settings [31,32].
By contrast, multimodal platforms such as THEMIS and MESA remain primarily research-oriented. Their principal contribution has been to show that a single cfDNA dataset can encode multiple layers of biologically relevant information, including methylation, fragmentation, copy-number changes, and chromatin-related structure [15,16]. However, this analytical breadth has not yet translated into treatment-directing applications supported by comparable prospective validation.
Accordingly, the more meaningful distinction in the field is not simply between unimodal and multimodal assays, but between platforms optimized for clinically validated detection and those designed to extract richer biological information that may eventually support adaptive oncology. The latter remain conceptually important but largely investigational [15,16,19,20].

2.3. Mining Genomic and Epigenomic cfDNA Features for Adaptive Oncology

2.3.1. Resistance Monitoring Through Serial cfDNA Profiling

A core objective of adaptive oncology is to modify treatment according to the evolving molecular state of the tumor rather than relying solely on baseline tissue profiling. cfDNA is particularly well suited to this application because it enables repeated, minimally invasive monitoring of tumor-derived genomic and epigenomic signals over time [2,5,20]. This practical advantage has been a major driver of its clinical adoption.
Importantly, plasma cfDNA mutation analysis is no longer confined to exploratory use. In selected settings, professional recommendations and disease-specific clinical literature support its use when tumor tissue is unavailable, insufficient, or impractical to re-biopsy [42,43]. This establishes a defined role for cfDNA in real-world molecular genotyping for certain cancers, albeit in a context-dependent manner.
Its clearest clinical utility lies in the serial detection of resistance-associated alterations. Such changes may emerge before or near the time of overt progression and can therefore inform earlier therapeutic intervention than conventional imaging or repeat tissue biopsy alone [2,5]. Representative examples include ESR1 mutation monitoring in hormone receptor-positive metastatic breast cancer, prospectively demonstrated in PADA-1 [38]; plasma detection of EGFR T790M to guide osimertinib therapy in EGFR-mutant non-small cell lung cancer [39]; emergent KRAS-mediated resistance during anti-EGFR therapy in metastatic colorectal cancer [40]; and BRCA1/2 reversion mutations associated with resistance to platinum therapy or PARP inhibition [41].
These examples support the use of cfDNA as a practical readout of molecular escape mechanisms and, in selected contexts, as a tool for therapy switching or rational combination strategies. Nonetheless, the strength of evidence remains uneven across tumor types and therapeutic settings, and plasma findings do not uniformly obviate the need for tissue confirmation.

2.3.2. Copy-Number, Structural Variant, and Synthetic Lethality Inference

The contribution of cfDNA to adaptive oncology extends beyond single-nucleotide variants. Plasma sequencing can identify clinically relevant copy-number alterations, amplifications, structural rearrangements, gene fusions, and subclonal drivers that emerge during metastatic progression or treatment [2,44,45].
Examples include ERBB2 or MET amplification and oncogenic kinase rearrangements, which may refine patient stratification and uncover treatment opportunities not captured by limited genotyping panels [44,45]. cfDNA profiling may also contribute to synthetic-lethal inference. Detection of BRCA1/2 alterations remains relevant to PARP inhibitor selection, whereas BRCA1/2 reversion mutations can signal acquired resistance and loss of therapeutic vulnerability [41].
However, the broader translational value of structural and copy-number inference still depends on sufficient tumor fraction, assay sensitivity, analytical standardization, and the limited availability of prospective evidence that these findings improve treatment assignment beyond established mutation-focused workflows [2,41,44,45].

2.3.3. Epigenomic Inference

Epigenomic cfDNA profiling extends adaptive oncology beyond mutation tracking by attempting to capture tumor states not adequately explained by genotype alone [14,15,16,17,18]. Methylome- and fragmentome-derived features may, in principle, reflect pathway activation, tumor-suppressor silencing, lineage plasticity, epithelial-to-mesenchymal transition programs, and dependence on chromatin regulators. This is particularly relevant when resistance is driven by transcriptional reprogramming or lineage switching rather than newly acquired coding mutations [10,46,47].
Although the biological rationale is compelling, the translational significance of such inferences remains uncertain. Combined analyses of methylation, nucleosome organization, and fragmentation can reveal meaningful tumor states, but the extent to which these features can be converted into robust treatment-directing biomarkers has not yet been established [15,16,17,18,19,20].
A more plausible near-term role for epigenomic cfDNA lies in risk stratification and therapeutic hypothesis generation. In this context, cfDNA-derived patterns may help identify lineage switching, therapy-associated dedifferentiation, or chromatin-regulator dependence, which can then be integrated with orthogonal data to refine patient selection and guide follow-up investigation [14,15,16,46,47].

2.3.4. cfDNA as a Real-Time Therapeutic Control System

This treatment-directed perspective reframes liquid biopsy as more than a diagnostic assay and positions cfDNA as a potential real-time therapeutic monitoring system. Within this framework, methylation programs may suggest sensitivity to targeted or epigenetic therapies, chromatin accessibility signatures may inform immunotherapy responsiveness, and serial cfDNA measurements may reveal emerging resistant subclones or epigenetic state transitions before they become radiographically apparent [14,15,16,17,18,20].
The near-term translational objective is therefore not limited to disease detection, but extends to dynamic treatment optimization through repeated cfDNA-guided assessment. Whether this concept can be implemented reliably in routine clinical practice remains uncertain. Even so, the shift in perspective is important because it recasts cfDNA from a passive marker of disease burden into a potentially actionable tool for therapeutic control.
The principal cfDNA signal classes discussed in this review, together with the biology they may capture, their potential translational relevance, and their current evidentiary maturity, are summarized in Table 1.

2.3.5. Target Nomination

A more ambitious extension of the field is the proposition that cfDNA may contribute to target discovery. Framed cautiously, this is plausible. cfDNA can reveal tumor states associated with actionable dependencies, including pathway activation, lineage switching, enhancer engagement, and transcription factor programs not evident from mutation analysis alone [12,14,27,28].
However, cfDNA is better regarded as a prioritization tool rather than a stand-alone discovery platform. Its value lies in narrowing the search space by identifying candidate vulnerabilities that can then be evaluated through integration with CRISPR dependency maps, tumor profiling, and functional validation.

2.4. Future Directions: From Multimodal Biomarkers to Therapeutic Decision Engines

The translational landscape of cfDNA currently spans a spectrum from clinically established mutation-based applications to research-stage efforts aimed at functional inference and therapeutic guidance. Most target-related uses remain provisional and continue to require orthogonal validation, as summarized in Table 1.

2.4.1. Foundation-Style Multimodal Models

The next generation of cfDNA models are increasingly being developed to be multimodal by design, integrating methylation, fragmentomics, copy number, mutations, and clinical variables within unified analytical frameworks [15,16,17,18]. The objective is not simply to improve binary classification, but to align complementary biological signals with clinically relevant outputs spanning diagnosis, prognosis, resistance emergence, and likely treatment sensitivity [15,16].
A logical extension of this trend is the development of foundation-style models trained on large cfDNA datasets. In principle, such systems could generate integrated predictions across several clinically relevant dimensions rather than a single narrow endpoint. However, their value will depend not only on predictive performance, but also on interpretability, biological robustness, and prospective validation.
If such frameworks mature, they may support patient stratification, improve pharmacodynamic monitoring, enrich biomarker-driven trials, and help prioritize rational combination strategies [17,18,19,20]. At present, however, these applications remain aspirational rather than established.

2.4.2. Integration into Biomarker-Driven Clinical Trials

An important next step is the integration of genomic and epigenomic cfDNA features into biomarker-driven clinical trials [17,18,19,20,31,32]. These readouts could improve patient selection, enable earlier identification of likely responders or non-responders, and support the use of molecular residual disease or early plasma ctDNA response as intermediate endpoints.
If validated prospectively, this approach could make trials smaller, faster, and more biologically focused while improving alignment between therapy mechanism and enrolled population [19,20,31,32]. However, its utility will depend on assay reproducibility, interpretability, and evidence that cfDNA-based readouts improve trial decisions rather than merely add correlates.

2.5. Pitfalls

Confounding and batch effects further complicate interpretation. Cases and controls may differ not only in cancer status, but also in collection site, preanalytical handling, sequencing chemistry, and clinical characteristics. As a result, classifiers may capture technical or demographic artifacts rather than tumor biology. These issues are particularly pronounced in retrospective and multicenter datasets lacking harmonization and independent validation [24,46,47,48].
More broadly, prospective evidence demonstrating that AI-guided cfDNA strategies improve clinical outcomes remains limited. Many studies are retrospective or case-control in design, and the gap between analytical performance and treatment-directed utility remains substantial [19,20,24].

2.6. Conclusions

At present, the strongest clinical validation remains in detection and classification. The more consequential long-term opportunity lies in adaptive oncology, where genomic and epigenomic cfDNA features may eventually support resistance monitoring, patient stratification, and treatment redirection [16,24].

References

  1. Pantel, K.; Alix-Panabières, C. Liquid biopsy and minimal residual disease—latest advances and implications for cure. Nat. Rev. Clin. Oncol. 2019, 16, 409–424.
  2. Wan, J. C. M.; Massie, C.; Garcia-Corbacho, J.; Mouliere, F.; Brenton, J. D.; Caldas, C.; Pacey, S.; Baird, R.; Rosenfeld, N. Liquid biopsies come of age: towards implementation of circulating tumour DNA. Nat. Rev. Cancer 2017, 17, 223–238.
  3. Bettegowda, C.; Sausen, M.; Leary, R. J.; Kinde, I.; Wang, Y.; Agrawal, N.; Bartlett, B. R.; Wang, H.; Luber, B.; Alani, R. M.; et al. Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci. Transl. Med. 2014, 6, 224ra24.
  4. Diehl, F.; Schmidt, K.; Choti, M. A.; Romans, K.; Goodman, S.; Li, M.; Thornton, K.; Agrawal, N.; Sokoll, L.; Szabo, S. A.; Kinzler, K. W.; Vogelstein, B.; Diaz, L. A., Jr. Circulating mutant DNA to assess tumor dynamics. Nat. Med. 2008, 14, 985–990.
  5. Abbosh, C.; Birkbak, N. J.; Wilson, G. A.; Jamal-Hanjani, M.; Constantin, T.; Salari, R.; Le Quesne, J.; Moore, D. A.; Veeriah, S.; Rosenthal, R.; et al. Phylogenetic ctDNA analysis depicts early-stage lung cancer evolution. Nature 2017, 545, 446–451.
  6. Cohen, J. D.; Li, L.; Wang, Y.; Thoburn, C.; Afsari, B.; Danilova, L.; Douville, C.; Javed, A. A.; Wong, F.; Mattox, A.; et al. Detection and localization of surgically resectable cancers with a multi-analyte blood test. Science 2018, 359, 926–930.
  7. Newman, A. M.; Lovejoy, A. F.; Klass, D. M.; Kurtz, D. M.; Chabon, J. J.; Scherer, F.; Stehr, H.; Liu, C. L.; Bratman, S. V.; Say, C.; et al. Integrated digital error suppression for improved detection of circulating tumor DNA. Nat. Biotechnol. 2016, 34, 547–555.
  8. Schmitt, M. W.; Kennedy, S. R.; Salk, J. J.; Fox, E. J.; Hiatt, J. B.; Loeb, L. A. Detection of ultra-rare mutations by next-generation sequencing. Proc. Natl. Acad. Sci. U.S.A. 2012, 109, 14508–14513.
  9. Guo, L.; Lee, Y. T.; Zhou, Y.; Huang, Y. Targeting epigenetic regulatory machinery to overcome cancer therapy resistance. Semin. Cancer Biol. 2022, 83, 487–502.
  10. Davies, A.; Zoubeidi, A.; Beltran, H.; Selth, L. A. The transcriptional and epigenetic landscape of cancer cell lineage plasticity. Cancer Discov. 2023, 13, 1771–1788.
  11. Snyder, M. W.; Kircher, M.; Hill, A. J.; Daza, R. M.; Shendure, J. Cell-free DNA comprises an in vivo nucleosome footprint that informs its tissues-of-origin. Cell 2016, 164, 57–68.
  12. Ulz, P.; Thallinger, G. G.; Auer, M.; Graf, R.; Kashofer, K.; Jahn, S. W.; Abete, L.; Pristauz, G.; Petru, E.; Geigl, J. B.; Heitzer, E.; Speicher, M. R. Inferring expressed genes by whole-genome sequencing of plasma DNA. Nat. Genet. 2016, 48, 1273–1278.
  13. [13] Cristiano, S.; Leal, A.; Phallen, J.; Fiksel, J.; Adleff, V.; Bruhm, D. C.; Jensen, S. Ø.; Medina, J. E.; Hruban, C.; White, J. R.; et al. Genome-wide cell-free DNA fragmentation in patients with cancer. Nature 2019, 570, 385–389.
  14. Esfahani, M. S.; Hamilton, E. G.; Mehrmohamadi, M.; Nakhleh, M. K.; Choi, G.; Salem, M.; Daniel, B. J.; Zhang, J.; Smith, M. N.; Dries, R.; et al. Inferring gene expression from cell-free DNA fragmentation profiles. Nat. Biotechnol. 2022, 40, 585–597.
  15. Bie, F.; Wang, J.; Cheng, X.; et al. Multimodal analysis of cell-free DNA whole-methylome sequencing for cancer detection and localization. Nat. Commun. 2023, 14, 6042.
  16. Li, Y.; Song, Y.; Hu, H.; et al. Multimodal epigenetic sequencing analysis of cell-free DNA for non-invasive colorectal cancer detection. Genome Med. 2024, 16, 10.
  17. Kim, S. Y.; Kim, H.; Lee, S.; et al. Cancer signature ensemble integrating cfDNA methylation, copy number, and fragmentation facilitates multi-cancer early detection. Exp. Mol. Med. 2023, 55, 2443–2456.
  18. Nguyen, V. T. C.; Pham, T. M. Q.; Tran, T. T. T.; et al. Multimodal analysis of genome-wide methylation, copy number aberrations, and end motif signatures enhances detection of early-stage breast cancer. eLife 2023, 12, e89083.
  19. Vittone, J.; Gill, D.; Goldsmith, A.; Klein, E. A.; Karlitz, J. J. A multi-cancer early detection blood test using machine learning detects early-stage cancers lacking USPSTF-recommended screening. npj Precis. Oncol. 2024, 8, 91.
  20. Pantel, K.; Alix-Panabières, C. Minimal residual disease as a target for liquid biopsy in patients with solid tumours. Nat. Rev. Clin. Oncol. 2025, 22, 65–77.
  21. Serpas, L.; Chan, R. W. Y.; Jiang, P.; Ni, M.; Sun, K.; Rashidfarrokhi, A.; Soni, C.; Sisirak, V.; Lee, W. S.; Cheng, S. H.; et al. Dnase1l3 deletion causes aberrations in length and end-motif frequencies in plasma DNA. Proc. Natl. Acad. Sci. U.S.A. 2019, 116, 641–649.
  22. Zhou, Z.; Sun, K.; Chen, X.; et al. Fragmentation landscape of cell-free DNA revealed by deconvolution of end motifs. Proc. Natl. Acad. Sci. U.S.A. 2023.
  23. Qi, T.; Li, Y.; Wang, Y.; et al. Cell-free DNA fragmentomics: the novel promising biomarker in cancer screening. Int. J. Mol. Sci. 2023, 24, 1503.
  24. Luo, X.; Xie, S.; Hong, F.; Li, X.; Wei, Y.; Zhou, Y.; Su, W.; Yang, Y.; Tang, L.; Dao, F.; Cai, P.; Lin, H.; Lai, H.; Lyu, H. From multi-omics to deep learning: advances in cfDNA-based liquid biopsy for multi-cancer screening. Biomark. Res. 2025, 14, 3.
  25. Tsui, W. H. A.; Ding, S. C.; Jiang, P.; Lo, Y. M. D. Artificial intelligence and machine learning in cell-free-DNA-based diagnostics. Genome Res. 2025, 35, 1–19.
  26. Wang, H. Y.; Lin, W. Y.; Zhou, C.; Yang, Z. A.; Kalpana, S.; Lebowitz, M. S. Integrating artificial intelligence for advancing multiple-cancer early detection via serum biomarkers: a narrative review. Cancers 2024, 16, 862.
  27. Lai, K.; Dilger, K.; Cunningham, R.; Lam, K. T.; Boquiren, R.; Truong, K.; Louie, M. C.; Rava, R.; Abdueva, D. Extracting regulatory active chromatin footprint from cell-free DNA. Commun. Biol. 2024, 7, 1086.
  28. Wang, Y.; Hou, Y.; Xiong, Z.; Lu, L.; Zong, Z.; Wang, B.; Chen, H.; Zhang, W.; Zhou, X. Fragment dispersity index analysis of cfDNA fragments reveals chromatin accessibility and enables early cancer detection. Cell Rep. Methods 2025, 5, 101083.
  29. Shen, H.; et al. Development of a deep learning model for cancer discrimination based on cfDNA end-motifs. npj Precis. Oncol. 2024.
  30. Thierry, A. R.; El Messaoudi, S.; Gahan, P. B.; Anker, P.; Stroun, M. Origins, structures, and functions of circulating DNA in oncology. Cancer Metastasis Rev. 2016, 35, 347–376.
  31. Chung, D. C.; Raymond, V. M.; Grady, W. M.; et al. A cell-free DNA blood-based test for colorectal cancer screening. N. Engl. J. Med. 2024, 390, 973–983.
  32. Shaukat, A.; Burke, C. A.; Chan, A. T.; et al. Clinical validation of a circulating tumor DNA–based blood test to screen for colorectal cancer. JAMA 2025, 334, 56–63.
  33. Wang, H.; et al. Neutrophil extracellular traps in homeostasis and disease. Signal Transduct. Target. Ther. 2024, 9, 250.
  34. Zhou, Y.; et al. Impact of neutrophil extracellular traps on thrombosis: a narrative review. Front. Cell. Infect. Microbiol. 2022, 12, 910908.
  35. Yu, S. C. Y.; Cheng, S. H.; Peng, W.; Lau, S. L.; Choy, L. Y. L.; Leung, T. Y.; Wong, J.; Wong, V. W.-S.; Wong, G. L.-H.; Jiang, P.; Chiu, R. W. K.; Chan, K. C. A.; Lo, Y. M. D. Comparison of single molecule, real-time sequencing and nanopore sequencing for analysis of the size, end-motif, and tissue-of-origin of long cell-free DNA in plasma. Clin. Chem. 2023, 69, 168–179.
  36. García-Silva, S.; Benito-Martín, A.; Nogués, L.; Hernández-Barranco, A.; Mazariegos, M. S.; Santos, V.; Hergueta-Redondo, M.; Ximénez-Embún, P.; Kataru, R. P.; Lopez, A. A.; et al. DNA-loaded extracellular vesicles in liquid biopsy. Front. Cell Dev. Biol. 2021, 8, 622579.
  37. Lin, B.; et al. Whole-genome methylation profiling of extracellular vesicle DNA for gastric cancer detection using low-input TEMPT chemistry. Nat. Commun. 2025.
  38. Bidard, F.-C.; Hardy-Bessard, A.-C.; Dalenc, F.; et al. Switch to fulvestrant and palbociclib versus no switch in advanced breast cancer with rising ESR1 mutation during aromatase inhibitor and palbociclib therapy (PADA-1): a randomised, open-label, multicentre, phase 3 trial. Lancet Oncol. 2022.
  39. Oxnard, G. R.; Thress, K. S.; Alden, R. S.; et al. Association between plasma genotyping and outcomes of treatment with osimertinib (AZD9291) in advanced non-small-cell lung cancer. J. Clin. Oncol. 2016, 34, 3375–3382.
  40. Parseghian, C. M.; Loree, J. M.; Morris, V. K.; et al. Resistance mechanisms to anti–epidermal growth factor receptor therapy in RAS/BRAF/EGFR wild-type metastatic colorectal cancer from first-line panitumumab trials. J. Clin. Oncol. 2023.
  41. Weigelt, B.; Comino-Méndez, I.; de Bruijn, I.; et al. Diverse BRCA1 and BRCA2 reversion mutations in circulating cell-free DNA of therapy-resistant breast or ovarian cancer. Clin. Cancer Res. 2017, 23, 6708–6720.
  42. Battaglin, F.; et al. Clinical applications of ctDNA profiling in gastrointestinal cancers. JCO Oncol. Pract. 2024.
  43. Pascual, J.; Attard, G.; Bidard, F.-C.; Curigliano, G.; De Mattos-Arruda, L.; Diehn, M.; Italiano, A.; Lindberg, J.; Merker, J. D.; Montagut, C.; et al. ESMO recommendations on the use of circulating tumour DNA assays for patients with cancer: a report from the ESMO Precision Medicine Working Group. Ann. Oncol. 2022, 33, 750–768.
  44. Jang, H.; et al. Copy number alteration profiling from plasma cell-free DNA by whole-exome sequencing. Int. J. Mol. Sci. 2025.
  45. Hasegawa, N.; et al. Highly sensitive fusion detection using plasma cell-free DNA. Cancer Sci. 2021.
  46. Jamshidi, A.; et al. Evaluation of cell-free DNA approaches for multi-cancer early detection. Cancer Cell 2022, 40, 1537–1549.e12.
  47. Zhang, K.; et al. Circulating cell-free DNA-based multi-cancer early detection. Trends Cancer 2024, 10, 1002–1018.
  48. De Falco, A.; et al. Knowledge-informed multimodal cfDNA analysis improves sensitivity and generalization in cancer detection. bioRxiv 2025.
Table 1. cfDNA feature classes, inferred biology, and translational relevance for adaptive oncology and therapeutic hypothesis generation.
Table 1. cfDNA feature classes, inferred biology, and translational relevance for adaptive oncology and therapeutic hypothesis generation.
cfDNA feature class Inferred
biological
information
Potential
translational applications
Current
evidence
maturity
Major
limitations
References
Somatic mutations Tumor genotype, emerging resistance alterations, clonal evolution Therapy selection, serial resistance monitoring, measurable residual disease assessment Clinically established in selected settings Limited by low tumor fraction, assay sensitivity, clonal hematopoiesis, and inability to capture non-mutational resistance states [2,5,7,20,38,39,40,41,42,43]
Copy-number alterations and amplifications Structural instability, focal amplification events, subclonal genomic evolution Patient stratification, resistance profiling, context-dependent inference of pathway activation Clinically relevant but less standardized than mutation testing Requires sufficient tumor fraction; plasma CNA calls may be noisy; prospective treatment-directing evidence remains limited [2,41,44]
Structural variants and gene fusions Rearrangements, oncogenic fusions, and other structural drivers Detection of actionable rearrangements, resistance profiling, complementary genotyping when tissue is limited Useful in selected contexts Analytical sensitivity varies by assay design; structural resolution may remain incomplete in plasma; broader standardization is needed [2,39,45]
Fragment size, fragment ends, and fragmentation topology Nuclease activity, chromatin accessibility, tumor-associated fragmentation patterns Cancer detection, tissue-of-origin modeling, exploratory biological inference Strong analytical evidence, but limited routine clinical decision utility Highly sensitive to preanalytical variables, sequencing workflow, and computational modeling; biological interpretation is often indirect [13,21,22,23,24,25]
Nucleosome positioning and promoter fragmentation features Chromatin organization, nucleosome depletion, inferred transcriptional activity Functional-state inference, pathway nomination, exploratory therapeutic hypothesis generation Research-stage Signals are indirect and require computational inference; prospective validation for treatment direction remains limited [11,12,14,27,28]
DNA methylation patterns Lineage identity, tissue of origin, tumor class, silencing states, epigenomic reprogramming Multi-cancer early detection, classification, risk stratification, exploratory therapeutic redirection Clinically advanced for detection and classification; investigational for treatment direction Rich biological information does not yet consistently translate into actionable therapeutic decisions [15,16,17,18,19,20,31,32]
Multimodal integrated cfDNA models Composite tumor state derived from combined methylation, fragmentation, CNA, and mutational signals Improved classification, adaptive-oncology modeling, exploratory target nomination frameworks Promising, but not yet treatment-directing in routine practice Vulnerable to overfitting, data leakage, confounding, batch effects, and weak cross-cohort generalization [15,16,17,18,24,46,47,48]
This table summarizes the major categories of cfDNA-derived signals discussed in this review, the biology they may capture, their potential clinical or drug-discovery relevance, and their current evidentiary maturity. Not all signal classes carry equivalent translational weight. Somatic mutation profiling and selected serial ctDNA applications are already clinically useful in defined settings, whereas many fragmentation-, chromatin-, methylation-, and multimodal AI-derived features remain primarily research-stage and are more appropriately viewed as tools for biological inference, target nomination, or therapeutic hypothesis generation than as stand-alone treatment-directing biomarkers.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated