Submitted:
02 June 2026
Posted:
03 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Related Works
| Study | Study Focus | Scope | Relevance | Gap |
|---|---|---|---|---|
| Rush et al. [6] | MDD treatment outcomes | STAR*D sequential treatment outcomes and remission across treatment steps | Supports the argument that MDD treatment often requires multiple steps and that remission is difficult to achieve | Does not address digital phenotyping or synthetic data generation |
| Trivedi et al. [9] | Measurement-based care | Clinical symptom monitoring, adherence, and side-effect assessment in STAR*D | Supports structured symptom assessment and treatment monitoring during MDD care | Relies on scheduled clinical assessments without passive smartphone/wearable monitoring |
| Kroenke et al. [18] | PHQ-9 validation | Validation of PHQ-9 as a depression severity measure | Justifies the use of PHQ-9 as a clinical anchor for symptom severity and treatment-response monitoring | Does not address passive behavioral data or synthetic longitudinal data generation |
| Zierer et al. [11] | Digital biomarkers | Review of passive digital biomarkers associated with depression | Supports the selection of mobility, activity, sleep, communication, and physiological variables | Reviews digital biomarkers but does not generate a synthetic treatment-response dataset |
| Leaning et al. [19] | Smartphone phenotyping | Smartphone-derived data for clinically relevant predictions in MDD | Supports the use of smartphone data for depression-related prediction and monitoring | Does not integrate synthetic data generation with PHQ-9-anchored treatment trajectories |
| Vignapiano et al. [20] | Digital biomarkers in mood disorders | Smartphone and wearable indicators for mood disorder management | Supports the role of digital biomarkers in tracking depression severity and treatment response | Does not provide a rule-based synthetic dataset for MDD treatment monitoring |
| Jung et al. [21] | Wearable/sensor monitoring | Sensor features for predicting depression and anxiety | Supports the relevance of wearable-derived features for mental health monitoring | Focuses on feature identification, not synthetic multimodal longitudinal data generation |
| Martinez-Martin et al. [22] | Ethics and privacy | Ethical guidance for mental health digital phenotyping | Supports privacy, consent, data protection, transparency, and accountability concerns | Does not propose a synthetic data framework for MDD treatment-response experimentation |
| Oudin et al. [23] | Digital psychiatry ethics | Privacy, confidentiality, consent, and data ownership in digital psychiatry | Supports the claim that psychiatric digital phenotyping raises important ethical and governance concerns | Does not develop a privacy-preserving synthetic dataset for MDD monitoring |
| Giuffrè and Shung [24] | Synthetic data in healthcare | Benefits, applications, and limitations of synthetic healthcare data | Supports synthetic data as a way to improve privacy, data sharing, and predictive analytics | Broad healthcare focus; not specific to MDD, PHQ-9, or digital phenotyping |
| Pezoulas et al. [25] | Healthcare synthetic data | Review of synthetic data generation methods in healthcare | Supports synthetic data generation as a response to data scarcity and privacy concerns | Does not focus on clinically guided MDD treatment-response monitoring |
| Mendes et al. [26] | Privacy-preserving synthetic data | Synthetic data for bridging data gaps and enabling simulation | Supports synthetic data as a privacy-preserving method for methodological experimentation | Not specific to MDD, smartphone/wearable data, or treatment-response trajectories |
| Qian et al. [27] | Privacy-preserving clinical modelling | Synthetic data for clinical risk prediction under privacy constraints | Demonstrates how synthetic data can support clinical modelling pipelines where real data access is limited | Focuses on clinical risk prediction, not MDD digital phenotyping or treatment-response monitoring |
| Loni et al. [28] | Synthetic health records | Review of synthetic medical text, time-series, and longitudinal health records | Relevant to longitudinal synthetic health data generation | Broad health-record focus; not specific to MDD treatment-response trajectories |
2. Materials and Methods
2.1. Methodological Framework
2.2. Identifying Digital Phenotyping Variables
2.3. Development of an Expert Interview Instrument
2.4. Clinical Calibration Through Expert Input
2.5. PHQ-9 as the Clinical Anchor for Symptom Severity and Treatment Response
2.6. Definition of Measurement Frequencies and Assessment Time Points
| Time Scale | Timepoints |
|---|---|
| Baseline only | Week 0 |
| Daily observations | Everyday from week 0 to week 12 |
| Weekly summary | End of weeks 1-12 |
| Biweekly clinical assessment | Weeks 0, 2, 4, 6, 8, 10, 12 |
| Clinical assessment points | Weeks 0, 2, 4, 6, 8, 10, 12 |
| Treatment review points | Week 4, 8, 12 |
2.7. Construction of the Synthetic Data Schema
2.8. Rule-Based Synthetic Data Generation
2.8.1. Participant-Level Initialization
2.8.2. Assignment of Treatment-Response Trajectories
2.8.3. Generation of Clinical Assessment Scores
2.8.4. Generation of Behavioral and Physiological Features
2.8.5. Introduction of Variability, Noise, and Missingness
2.9. Dataset Validation and Quality Assessment
2.10. Ethical and Data Governance Considerations
3. Results
3.1. Plausibility Assessment of the Generated Rule-Based Synthetic Dataset
3.2. Dataset Structure and Composition
3.3. Distribution Plausibility
3.4. Temporal Plausibility
3.5. Dependency Plausibility
3.6. Responder-Group Plausibility
3.7. Missingness and Data Quality Patterns
4. Discussion
| Strengths | Limitations |
|---|---|
|
Clinical interpretability: Through the PHQ-9 score, the dataset provides a clear clinical anchor for symptom severity and treatment-response monitoring. |
Synthetic and rule-based: The results demonstrate internal plausibility, but they do not validate the dataset against actual real-world data. |
| Multimodal structure: The dataset goes beyond symptom scores to include behavioral and physiological indicators. |
Potentially too coherent: In real-world mental health data, behavioral and physiological variables do not always align neatly with symptom improvement or worsening. |
| Longitudinal design: Observations are repeated throughout a 12-week treatment period. | Limited representation of real-world heterogeneity: The dataset does not fully represent the heterogeneity inherent in real-world populations. |
|
Internal consistency: Distribution checks, correlation analysis, trajectory plots, and responder-group comparisons indicate that the generated data follow the intended clinical and behavioral logic. |
Less complex than real-world digital phenotyping datasets: Smartphone and wearable data often contain irregular gaps, dropout, sensor-specific failure, and participant disengagement. |
| Reproducibility and controllability: Since the dataset is rule-based, the generation logic is transparent and can be adjusted, audited, or extended. | Limited external validity: Plausible ranges and trajectories support its use for simulation and modelling, but future validation against real-world or clinically reviewed datasets is necessary before drawing conclusions about clinical deployment. |
5. Conclusion
Supplementary Materials
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| MDD | Major Depressive Disorder |
| EMA | Ecological Momentary Assessment |
| PHQ-9 | Patient Health Questionnaire 9 |
| MBC | Measurement-Based Care (MBC). |
| TMAP | Texas Medication Algorithm Project (TMAP) |
| HRV | Heart Rate Variability (HRV) |
References
- Erritzoe, D.; et al. , ‘A short-acting psychedelic intervention for major depressive disorder: a phase IIa randomized placebo-controlled trial’. Nat. Med. 2026, vol. 32(no. 2), 591–598. [Google Scholar] [CrossRef]
- Malhi, G. S.; Mann, J. J. ‘Depression’. The Lancet 2018, vol. 392(no. 10161), 2299–2312. [Google Scholar] [CrossRef]
- Marx, W.; et al. , ‘Major depressive disorder’. Nat. Rev. Dis. Primer 2023, vol. 9(no. 1), 44. [Google Scholar] [CrossRef]
- Santomauro, D. F.; Vos, T.; Whiteford, H. A.; Chisholm, D.; Saxena, S.; Ferrari, A. J. ‘Service coverage for major depressive disorder: estimated rates of minimally adequate treatment for 204 countries and territories in 2021’. Lancet Psychiatry 2024, vol. 11(no. 12), 1012–1021. [Google Scholar] [CrossRef]
- Kim, H.-Y.; et al. , ‘Predictors of Remission in Acute and Continuation Treatment of Depressive Disorders’. Clin. Psychopharmacol. Neurosci. 2021, vol. 19(no. 3), 490–497. [Google Scholar] [CrossRef] [PubMed]
- Rush, J.; et al. , ‘Acute and Longer-Term Outcomes in Depressed Outpatients Requiring One or Several Treatment Steps: A STAR*D Report’. Am. J. Psychiatry 2006, vol. 163(no. 11), 1905–1917. [Google Scholar] [CrossRef] [PubMed]
- Aboraya, et al. , ‘Measurement-based Care in Psychiatry-Past, Present, and Future’. Innov. Clin. Neurosci. 2018, vol. 15(no. 11–12), 13–26. [Google Scholar]
- Trivedi, M. H.; Daly, E. J. ‘Treatment strategies to improve and sustain remission in major depressive disorder’. Dialogues Clin. Neurosci. 2008, vol. 10(no. 4), 377–384. [Google Scholar] [CrossRef]
- Trivedi, M. H.; et al. , ‘Evaluation of Outcomes With Citalopram for Depression Using Measurement-Based Care in STAR*D: Implications for Clinical Practice’. Am. J. Psychiatry 2006, vol. 163(no. 1), 28–40. [Google Scholar] [CrossRef]
- Taliaz, D.; Souery, D. ‘A New Characterization of Mental Health Disorders Using Digital Behavioral Data: Evidence from Major Depressive Disorder’. J. Clin. Med. 2021, vol. 10(no. 14), 3109. [Google Scholar] [CrossRef]
- Zierer; Behrendt, C.; Lepach-Engelhardt, A. C. ‘Digital biomarkers in depression: A systematic review and call for standardization and harmonization of feature engineering’. J. Affect. Disord. 2024, vol. 356, 438–449. [Google Scholar] [CrossRef] [PubMed]
- Aledavood, T.; et al. , ‘Multimodal Digital Phenotyping Study in Patients With Major Depressive Episodes and Healthy Controls (Mobile Monitoring of Mood): Observational Longitudinal Study’. JMIR Ment. Health 2025, vol. 12, e63622. [Google Scholar] [CrossRef]
- Busshart, L.; Petrovic, M.; Amin, R.; Hegerl, U. ‘Distinguishing Common Digital Phenotyping and Self-Report Parameters for Monitoring and Predicting Depression: Scoping Review’. JMIR MHealth UHealth 2026, vol. 14, e70840–e70840. [Google Scholar] [CrossRef] [PubMed]
- Pezoulas, V. C.; et al. ‘Synthetic data generation methods in healthcare: A review on open-source tools and methods’. Comput. Struct. Biotechnol. J. 2024, vol. 23, 2892–2910. [Google Scholar] [CrossRef]
- Adams, T.; et al. , ‘On the fidelity versus privacy and utility trade-off of synthetic patient data’. iScience 2025, vol. 28(no. 5), 112382. [Google Scholar] [CrossRef]
- Pezoulas, V. C.; et al. ‘Synthetic data generation methods in healthcare: A review on open-source tools and methods’. Comput. Struct. Biotechnol. J. 2024, vol. 23, 2892–2910. [Google Scholar] [CrossRef]
- Giuffrè, M.; Shung, D. L. ‘Harnessing the power of synthetic data in healthcare: innovation, application, and privacy’. npj Digit. Med. 2023, vol. 6(no. 1), 186. [Google Scholar] [CrossRef]
- Kroenke, K.; Spitzer, R. L.; Williams, J. B. ‘The PHQ-9: validity of a brief depression severity measure’. J. Gen. Intern. Med. 2001, vol. 16(no. 9), 606–613. [Google Scholar] [CrossRef] [PubMed]
- Leaning, E.; et al. , ‘From smartphone data to clinically relevant predictions: A systematic review of digital phenotyping methods in depression’. Neurosci. Biobehav. Rev. 2024, vol. 158, 105541. [Google Scholar] [CrossRef]
- Vignapiano, et al. , ‘A narrative review of digital biomarkers in the management of major depressive disorder and treatment-resistant forms’. Front. Psychiatry 2023, vol. 14, 1321345. [Google Scholar] [CrossRef]
- Jung, H. W.; et al. , ‘Key Features of Digital Phenotyping for Monitoring Mental Disorders: Systematic Review’. J. Med. Internet Res. 2025, vol. 27, e77331–e77331. [Google Scholar] [CrossRef]
- Martinez-Martin, N.; Greely, H. T.; Cho, M. K. ‘Ethical Development of Digital Phenotyping Tools for Mental Health Applications: Delphi Study’. JMIR MHealth UHealth 2021, vol. 9(no. 7), e27343. [Google Scholar] [CrossRef]
- Oudin, et al. , ‘Digital Phenotyping: Data-Driven Psychiatry to Redefine Mental Health’. J. Med. Internet Res. 2023, vol. 25, e44502. [Google Scholar] [CrossRef]
- Giuffrè, M.; Shung, D. L. ‘Harnessing the power of synthetic data in healthcare: innovation, application, and privacy’. npj Digit. Med. 2023, vol. 6(no. 1), 186. [Google Scholar] [CrossRef]
- Pezoulas, V. C.; et al. ‘Synthetic data generation methods in healthcare: A review on open-source tools and methods’. Comput. Struct. Biotechnol. J. 2024, vol. 23, 2892–2910. [Google Scholar] [CrossRef]
- Mendes, M.; Barbar, A.; Refaie, M. ‘Synthetic data generation: a privacy-preserving approach to accelerate rare disease research’. Front. Digit. Health 2025, vol. 7, 1563991. [Google Scholar] [CrossRef] [PubMed]
- Qian, Z.; Callender, T.; Cebere, B.; Janes, S. M.; Navani, N.; Van Der Schaar, M. ‘Synthetic data for privacy-preserving clinical risk prediction’. Sci. Rep. 2024, vol. 14(no. 1), 25676. [Google Scholar] [CrossRef]
- Loni, M.; Poursalim, F.; Asadi, M.; Gharehbaghi, A. ‘A review on generative AI models for synthetic medical text, time series, and longitudinal data’. npj Digit. Med. 2025, vol. 8(no. 1), 281. [Google Scholar] [CrossRef] [PubMed]
- Zhan, Y.; Liu, H.; Wang, Y. ‘Digital phenotyping of depression: A multi-modal passive sensing approach to identifying novel behavioral and physiological markers of treatment response’. J. Psychiatr. Res. 2026, vol. 194, 40–50. [Google Scholar] [CrossRef] [PubMed]
- Bufano, P.; Laurino, M.; Said, S.; Tognetti, A.; Menicucci, D. ‘Digital Phenotyping for Monitoring Mental Disorders: Systematic Review’. J. Med. Internet Res. 2023, vol. 25, e46778. [Google Scholar] [CrossRef]
- Shen, S.; et al. , ‘Passive Sensing for Mental Health Monitoring Using Machine Learning With Wearables and Smartphones: Scoping Review’. J. Med. Internet Res. 2025, vol. 27, e77066. [Google Scholar] [CrossRef]
- Jung, H. W.; et al. , ‘Key Features of Digital Phenotyping for Monitoring Mental Disorders: Systematic Review’. J. Med. Internet Res. 2025, vol. 27, e77331–e77331. [Google Scholar] [CrossRef]
- Taliaz; Souery, D. ‘A New Characterization of Mental Health Disorders Using Digital Behavioral Data: Evidence from Major Depressive Disorder’. J. Clin. Med. 2021, vol. 10(no. 14), 3109. [Google Scholar] [CrossRef]


Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).