Submitted:
16 July 2025
Posted:
17 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
3. Artificial Intelligence in the Diagnosis of Pediatric Rare Diseases
3.1. The Role of AI in Genomic Data Interpretation
3.1.1. AI Tools for Variant Interpretation
3.1.2. Role of Large Language Models (LLMs)
3.2. Phenotype-Genotype Integration Through Automated Tools
3.3. Real-World Data: Opportunities and Challenges for AI-Assisted Rare Disease Diagnosis
3.4. Comparative Diagnostic Performance of AI and Human Experts
3.5. Challenges in Clinical Implementation
3.6. Ethical Considerations in Pediatric Settings
4. Conclusion and Future Direction
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| LLM /LLMs | Large Language Model(s) |
| NGS | Next-Generation Sequencing |
| WES | Whole-Exome Sequencing |
| WGS | Whole-Genome Sequencing |
| HPO | Human Phenotype Ontology |
| RF | Reverse Phenotyping |
| VUS | Variant of Uncertain Significance |
| NLP | Natural Language Processing |
| EMR / EMRs | Electronic Medical Record(s) |
| EHR / EHRs | Electronic Health Record(s) |
| RWD | Real-World Data |
| RCT / RCTs | Randomized Controlled Trial(s) |
| OMIM | Online Mendelian Inheritance in Man |
References
- Bauskis, A.; Strange, C.; Molster, C.; Fisher, C. The diagnostic odyssey: insights from parents of children living with an undiagnosed condition. Orphanet J. Rare Dis. 2022, 17, 1–13. [Google Scholar] [CrossRef] [PubMed]
- The Voice of 12,000 Patients [Internet]. EURORDIS-Rare Diseases Europe. [cited 2025 Jul 13]. Available from: https://www.eurordis.org/publications/the-voice-of-12000-patients/.
- Aldharman, S.S.; Al-Jabr, K.H.; Alharbi, Y.S.; Alnajar, N.K.; Alkhanani, J.J.; Alghamdi, A.; A Abdellatif, R.; Allouzi, A.; Almallah, A.M.; Jamil, S.F. Implications of Early Diagnosis and Intervention in the Management of Neurodevelopmental Delay (NDD) in Children: A Systematic Review and Meta-Analysis. Cureus 2023, 15, e38745. [Google Scholar] [CrossRef] [PubMed]
- Bowling, K.M.; Thompson, M.L.; Amaral, M.D.; Finnila, C.R.; Hiatt, S.M.; Engel, K.L.; Cochran, J.N.; Brothers, K.B.; East, K.M.; Gray, D.E.; et al. Genomic diagnosis for children with intellectual disability and/or developmental delay. Genome Med. 2017, 9, 1–11. [Google Scholar] [CrossRef] [PubMed]
- Ao, G.; Chen, M.; Li, J.; Nie, H.; Zhang, L.; Chen, Z. Comparative analysis of large language models on rare disease identification. Orphanet J. Rare Dis. 2025, 20, 150. [Google Scholar] [CrossRef] [PubMed]
- Sánchez Fernández I, Loddenkemper T, Gaínza-Lein M, Sheidley BR, Poduri A. Diagnostic yield of genetic tests in epilepsy: A meta-analysis and cost-effectiveness study. Neurology. 2019 Jan 28;92(5):e418–28.
- Sun, Y.; Peng, J.; Liang, D.; Ye, X.; Xu, N.; Chen, L.; Yan, D.; Zhang, H.; Xiao, B.; Qiu, W.; et al. Genome sequencing demonstrates high diagnostic yield in children with undiagnosed global developmental delay/intellectual disability: A prospective study. Hum. Mutat. 2022, 43, 568–581. [Google Scholar] [CrossRef] [PubMed]
- Best, S.; Yu, J.; Lord, J.; Roche, M.; Watson, C.M.; Bevers, R.P.J.; Stuckey, A.; Madhusudhan, S.; Jewell, R.; Sisodiya, S.M.; et al. Uncovering the burden of hidden ciliopathies in the 100 000 Genomes Project: a reverse phenotyping approach. J. Med Genet. 2022, 59, 1151–1164. [Google Scholar] [CrossRef] [PubMed]
- Burdick, K.J.; Cogan, J.D.; Rives, L.C.; Robertson, A.K.; Koziura, M.E.; Brokamp, E.; Duncan, L.; Hannig, V.; Pfotenhauer, J.; Vanzo, R.; et al. Limitations of exome sequencing in detecting rare and undiagnosed diseases. Am. J. Med Genet. Part A 2020, 182, 1400–1406. [Google Scholar] [CrossRef] [PubMed]
- Abbasi, A.; Alexandrov, L.B. Significance and limitations of the use of next-generation sequencing technologies for detecting mutational signatures. DNA Repair 2021, 107, 103200. [Google Scholar] [CrossRef] [PubMed]
- Austin-Tse, C.A.; Jobanputra, V.; Perry, D.L.; Bick, D.; Taft, R.J.; Venner, E.; Gibbs, R.A.; Young, T.; Barnett, S.; Belmont, J.W.; et al. Best practices for the interpretation and reporting of clinical whole genome sequencing. npj Genom. Med. 2022, 7, 27. [Google Scholar] [CrossRef] [PubMed]
- Richards, S.; Aziz, N.; Bale, S.; Bick, D.; Das, S.; Gastier-Foster, J.; Grody, W.W.; Hegde, M.; Lyon, E.; Spector, E.; et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genetics in Medicine 2015, 17, 405–424. [Google Scholar] [CrossRef] [PubMed]
- Alvarez-Costes, S. Deciphering Genomic Complexity: The Role of Explainable AI in Evolutionary Genomics. Methods Mol Biol. 2025;2927:221–34.
- Aster, A.; Laupichler, M.C.; Rockwell-Kollmann, T.; Masala, G.; Bala, E.; Raupach, T. ChatGPT and Other Large Language Models in Medical Education — Scoping Literature Review. Med Sci. Educ. 2024, 35, 555–567. [Google Scholar] [CrossRef] [PubMed]
- Ji, Z.; Lee, N.; Frieske, R.; Yu, T.; Su, D.; Xu, Y.; Ishii, E.; Bang, Y.J.; Madotto, A.; Fung, P. Survey of Hallucination in Natural Language Generation. ACM Comput. Surv. 2023, 55, 1–38. [Google Scholar] [CrossRef]
- Sharma, J.; Goel, P. The Use of AI for Phenotype-Genotype Mapping. Methods Mol Biol. 2025;2952:369–410.
- Obermeyer, Z.; Powers, B.; Vogeli, C.; Mullainathan, S. Dissecting racial bias in an algorithm used to manage the health of populations. Science 2019, 366, 447–453. [Google Scholar] [CrossRef] [PubMed]
- Deng, J.; Zubair, A.; Park, Y.-J. Limitations of large language models in medical applications. Postgrad. Med J. 2023, 99, 1298–1299. [Google Scholar] [CrossRef] [PubMed]
- Carbonari V, Veltri P, Guzzi PH. Decoding Rarity: Large Language Models in the Diagnosis of Rare Diseases [Internet]. arXiv; 2025 [cited 2025 Jun 7]. Available from: http://arxiv.org/abs/2505.17065.
- Iqbal, U.; Tanweer, A.; Rahmanti, A.R.; Greenfield, D.; Lee, L.T.-J.; Li, Y.-C.J. Impact of large language model (ChatGPT) in healthcare: an umbrella review and evidence synthesis. J. Biomed. Sci. 2025, 32, 45. [Google Scholar] [CrossRef] [PubMed]
- Wilczewski, C.M.; Obasohan, J.; Paschall, J.E.; Zhang, S.; Singh, S.; Maxwell, G.L.; Similuk, M.; Wolfsberg, T.G.; Turner, C.; Biesecker, L.G.; et al. Genotype first: Clinical genomics research through a reverse phenotyping approach. Am. J. Hum. Genet. 2023, 110, 3–12. [Google Scholar] [CrossRef] [PubMed]
- Smail, C.; Ge, B.; Keever-Keigher, M.R.; Schwendinger-Schreck, C.; Cheung, W.A.; Johnston, J.J.; Barrett, C.; Genomic Answers for Kids Consortium; Feldman, K. ; Cohen, A.S.A.; et al. Complex trait associations in rare diseases and impacts on Mendelian variant interpretation. Nat. Commun. 2024, 15, 1–11. [Google Scholar] [CrossRef] [PubMed]
- A Gargano, M.; Matentzoglu, N.; Coleman, B.; Addo-Lartey, E.B.; Anagnostopoulos, A.V.; Anderton, J.; Avillach, P.; Bagley, A.M.; Bakštein, E.; Balhoff, J.P.; et al. The Human Phenotype Ontology in 2024: phenotypes around the world. Nucleic Acids Res. 2023, 52, D1333–D1346. [Google Scholar] [CrossRef] [PubMed]
- Bajwa, J.; Munir, U.; Nori, A.; Williams, B. Artificial intelligence in healthcare: transforming the practice of medicine. Futur. Heal. J. 2021, 8, e188–e194. [Google Scholar] [CrossRef] [PubMed]
- Garcelon, N.; Neuraz, A.; Salomon, R.; Bahi-Buisson, N.; Amiel, J.; Picard, C.; Mahlaoui, N.; Benoit, V.; Burgun, A.; Rance, B. Next generation phenotyping using narrative reports in a rare disease clinical data warehouse. Orphanet J. Rare Dis. 2018, 13, 1–11. [Google Scholar] [CrossRef] [PubMed]
- Mishima, H.; Suzuki, H.; Doi, M.; Miyazaki, M.; Watanabe, S.; Matsumoto, T.; Morifuji, K.; Moriuchi, H.; Yoshiura, K.-I.; Kondoh, T.; et al. Evaluation of Face2Gene using facial images of patients with congenital dysmorphic syndromes recruited in Japan. J. Hum. Genet. 2019, 64, 789–794. [Google Scholar] [CrossRef] [PubMed]
- Birgmeier, J.; Haeussler, M.; Deisseroth, C.A.; Steinberg, E.H.; Jagadeesh, K.A.; Ratner, A.J.; Guturu, H.; Wenger, A.M.; Diekhans, M.E.; Stenson, P.D.; et al. AMELIE speeds Mendelian diagnosis by matching patient phenotype and genotype to primary literature. Sci. Transl. Med. 2020, 12. [Google Scholar] [CrossRef] [PubMed]
- Rao, A.; Joseph, T.; Saipradeep, V.G.; Kotte, S.; Sivadasan, N.; Srinivasan, R.; Mamidi, S. PRIORI-T: A tool for rare disease gene prioritization using MEDLINE. PLOS ONE 2020, 15, e0231728. [Google Scholar] [CrossRef] [PubMed]
- Kafkas, Ş.; Abdelhakim, M.; Althagafi, A.; Toonsi, S.; Alghamdi, M.; Schofield, P.N.; Hoehndorf, R. The application of Large Language Models to the phenotype-based prioritization of causative genes in rare disease patients. Sci. Rep. 2025, 15, 1–11. [Google Scholar] [CrossRef] [PubMed]
- Smith, C.T.; Williamson, P.R.; Beresford, M.W. Methodology of clinical trials for rare diseases. Best Pr. Res. Clin. Rheumatol. 2014, 28, 247–262. [Google Scholar] [CrossRef] [PubMed]
- Liu J, Barrett JS, Leonardi ET, Lee L, Roychoudhury S, Chen Y, et al. Natural History and Real-World Data in Rare Diseases: Applications, Limitations, and Future Perspectives. J Clin Pharmacol. 2022 Dec;62(Suppl 2):S38–55.
- Sherman RE, Anderson SA, Dal Pan GJ, Gray GW, Gross T, Hunter NL, et al. Real-World Evidence — What Is It and What Can It Tell Us? N Engl J Med. 2016 Dec 8;375(23):2293–7.
- Hampson, G.; Towse, A.; Dreitlein, W.B.; Henshall, C.; Pearson, S.D. Real-world evidence for coverage decisions: opportunities and challenges. J. Comp. Eff. Res. 2018, 7, 1133–1143. [Google Scholar] [CrossRef] [PubMed]
- Orsini LS, Berger M, Crown W, Daniel G, Eichler HG, Goettsch W, et al. Improving Transparency to Build Trust in Real-World Secondary Data Studies for Hypothesis Testing—Why, What, and How: Recommendations and a Road Map from the Real-World Evidence Transparency Initiative. Value in Health. 2020 Sep 1;23(9):1128–36.
- Weiss A, Michels C, Burgmer P, Mussweiler T, Ockenfels A, Hofmann W. Trust in everyday life. Journal of Personality and Social Psychology. 2021;121(1):95–114.
- hanjianwei. Everything About DeepSeek: Key Features, Usage, and Technical Advantages [Internet]. PopAi. 2025 [cited 2025 Jun 6]. Available from: https://www.popai.pro/resources/everything-about-deepseek/.
- Germain, D.P.; Gruson, D.; Malcles, M.; Garcelon, N. Applying artificial intelligence to rare diseases: a literature review highlighting lessons from Fabry disease. Orphanet J. Rare Dis. 2025, 20, 1–16. [Google Scholar] [CrossRef] [PubMed]
- Wojtara, M.; Rana, E.; Rahman, T.; Khanna, P.; Singh, H. Artificial intelligence in rare disease diagnosis and treatment. Clin. Transl. Sci. 2023, 16, 2106–2111. [Google Scholar] [CrossRef] [PubMed]
- Wojtara, M.; Rana, E.; Rahman, T.; Khanna, P.; Singh, H. Artificial intelligence in rare disease diagnosis and treatment. Clin. Transl. Sci. 2023, 16, 2106–2111. [Google Scholar] [CrossRef] [PubMed]
- Ilić, N.; Marić, N.; Cvetković, D.; Bogosavljević, M.; Bukara-Radujković, G.; Krstić, J.; Paunović, Z.; Begović, N.; Zarić, S.P.; Todorović, S.; et al. The Artificial Intelligence-Assisted Diagnosis of Skeletal Dysplasias in Pediatric Patients: A Comparative Benchmark Study of Large Language Models and a Clinical Expert Group. Genes 2025, 16, 762. [Google Scholar] [CrossRef]
- Jandoubi, B.; Akhloufi, M.A. Multimodal Artificial Intelligence in Medical Diagnostics. Information 2025, 16, 591. [Google Scholar] [CrossRef]
- Harada, T.; Shimizu, T.; Kaji, Y.; Suyama, Y.; Matsumoto, T.; Kosaka, C.; Shimizu, H.; Nei, T.; Watanuki, S. A Perspective from a Case Conference on Comparing the Diagnostic Process: Human Diagnostic Thinking vs. Artificial Intelligence (AI) Decision Support Tools. Int. J. Environ. Res. Public Heal. 2020, 17, 6110. [Google Scholar] [CrossRef] [PubMed]
- Ilić, N.; Marić, N.; Cvetković, D.; Bogosavljević, M.; Bukara-Radujković, G.; Krstić, J.; Paunović, Z.; Begović, N.; Zarić, S.P.; Todorović, S.; et al. The Artificial Intelligence-Assisted Diagnosis of Skeletal Dysplasias in Pediatric Patients: A Comparative Benchmark Study of Large Language Models and a Clinical Expert Group. Genes 2025, 16, 762. [Google Scholar] [CrossRef]
- Brasil S, Pascoal C, Francisco R, Dos Reis Ferreira V, Videira PA, Valadão AG. Artificial Intelligence (AI) in Rare Diseases: Is the Future Brighter? Genes (Basel). 2019 Nov 27;10(12):978.
- Rubeis, G.; Dubbala, K.; Metzler, I. “Democratizing” artificial intelligence in medicine and healthcare: Mapping the uses of an elusive term. Front. Genet. 2022, 13, 902542. [Google Scholar] [CrossRef] [PubMed]
- Holmes JH, Beinlich J, Boland MR, Bowles KH, Chen Y, Cook TS, et al. Why Is the Electronic Health Record So Challenging for Research and Clinical Care? Methods Inf Med. 2021 May;60(1–02):32–48.
- Kersloot, M.G.; van Putten, F.J.P.; Abu-Hanna, A.; Cornet, R.; Arts, D.L. Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies. J. Biomed. Semant. 2020, 11, 1–21. [Google Scholar] [CrossRef] [PubMed]
- Holzinger A, Zatloukal K, Müller H. Is human oversight to AI systems still possible? N Biotechnol. 2025 Mar 25;85:59–62.
- London, A.J. Artificial Intelligence and Black-Box Medical Decisions: Accuracy versus Explainability. Häst. Cent. Rep. 2019, 49, 15–21. [Google Scholar] [CrossRef] [PubMed]
- Murdoch, B. Privacy and artificial intelligence: challenges for protecting health information in a new era. BMC Med Ethic- 2021, 22, 1–5. [Google Scholar] [CrossRef] [PubMed]
- Rigby, M.J. Ethical Dimensions of Using Artificial Intelligence in Health Care. AMA J. Ethic- 2019, 21, E121–124. [Google Scholar] [CrossRef]
- Saint James Aquino, Y. Making decisions: Bias in artificial intelligence and data-driven diagnostic tools. Aust J Gen Pract. 2023 Jul;52(7):439–42.
- Kolbinger, F.R.; Veldhuizen, G.P.; Zhu, J.; Truhn, D.; Kather, J.N. Reporting guidelines in medical artificial intelligence: a systematic review and meta-analysis. Commun. Med. 2024, 4, 1–10. [Google Scholar] [CrossRef] [PubMed]
- Mennella, C.; Maniscalco, U.; De Pietro, G.; Esposito, M. Ethical and regulatory challenges of AI technologies in healthcare: A narrative review. Heliyon 2024, 10, e26297. [Google Scholar] [CrossRef] [PubMed]
| Application Area | Description | Example Tools/Platforms |
| Variant Prioritization | Automated ranking of genetic variants based on pathogenicity predictions, allele frequencies, and gene-disease associations |
MOON (Diploid), Fabric Genomics, Emedgene, GEM |
| Phenotype-Genotype Matching | Linking patient phenotypic features (HPO terms) to known gene-disease relationships | Phenomizer, GEM |
| Reverse Phenotyping | AI-driven re-evaluation of clinical features based on unexpected or novel genetic findings | LLM-assisted reverse phenotyping workflows |
| NLP1 | Extracting structured phenotypic information from unstructured clinical notes | NLP modules integrated within genomic AI pipelines |
| Clinical Summarization & Decision Support | Generating diagnostic hypotheses and literature-informed interpretations |
ChatGPT (OpenAI), DeepSeek Medical AI |
| Tool | Function | Integration | Validation Status |
| MOON | Variant prioritization based on phenotype-genotype correlation | Standalone; requires manual HPO input | Used in clinical diagnostics; validated in internal benchmarking |
| GEM | AI-based interpretation and scoring of variants | Integrated with Fabric Genomics platform | Deployed in hospital settings; comparative benchmarking with human panels |
| Phenomizer | Suggests differential diagnoses from HPO terms | Standalone; research use | Open-access tool; used in academic projects |
| Face2Gene | Image-based facial phenotype recognition | Mobile/web platform | High accuracy in syndromic conditions; not validated for nonsyndromic cases |
| Emedgene | AI-supported variant analysis with automated reporting | Commercial clinical platform | Regulatory-cleared in some jurisdictions; limited open-access data |
| DeepPhen | Phenotype-driven gene ranking using ML | Research-use; experimental | Experimental validation in selected cohorts |
| Feature | Phenotype-Driven Algorithms | Large Language Models (LLMs) |
| Primary Input Type | Structured data (HPO terms) | Natural language, unstructured text |
| Strengths | Precise gene-disease matching, standardized outputs |
Flexible interpretation, literature summarization, clinical reasoning |
| Limitations | Dependence on structured inputs, limited in ambiguous cases |
Potential hallucinations, interpretability concerns |
| Examples | MOON, GEM, Phenomizer | ChatGPT, DeepSeek Medical AI |
| Ideal Use Case | Variant prioritization with detailed phenotypic data |
Complex differential diagnosis, summarizing patient histories |
| Challenge | Category | Impact on Diagnosis | Addressable by: |
| Unstructured EMR data | Data issue | Limits phenotypic precision; weakens AI inputs | NLP tools; structured phenotyping |
| Lack of interoperability | Data/workflow | Prevents integration with AI tools and databases | Cross-platform EMR integration |
| Clinician skepticism and unfamiliarity |
Workflow/human factor | Delays adoption; mistrust of AI recommendations | Targeted training, demonstration studies |
| Hallucination risk in LLMs | Algorithmic/technical | Produces plausible but false diagnoses |
Validation, hybrid expert oversight |
| Regulatory ambiguity | Legal/ethical | Unclear liability; discourages clinical use |
Guidelines, legal frameworks |
| Bias in training data | Ethical/data quality | Overlooks underrepresented populations |
Diverse datasets, fairness auditing |
| Challenge | Description | Potential Solutions |
| Data Interoperability | Lack of standardized EMR1 and genomic data integration |
Harmonized data standards |
| Workflow Integration | AI tools functioning as standalone systems | Seamless integration into hospital information systems |
| Clinician Training and Trust | Limited familiarity with AI methodologies | Targeted educational programs- demonstration projects |
| Validation and Regulation |
Lack of universal validation standards | Development of regulatory frameworks specific to AI diagnostics |
| Resource Constraints | Infrastructure and cost barriers in low-resource settings |
Cloud-based AI platforms, tiered implementation models |
| Ethical Domain | Key Issues | Proposed Mitigations |
| Transparency and Explainability | “Black box” decision-making processes | Develop interpretable AI models- provide output rationales |
| Informed Consent | Complexity of explaining AI involvement to parents | Tailored consent forms detailing AI role, benefits, and limitations |
| Equity and Bias | Underrepresentation of certain ethnic groups in training datasets | Diversify training data- continuous model revalidation |
| Privacy and Data Security |
Handling identifiable genomic and phenotypic data |
Robust encryption- compliance with pediatric data protection laws |
| Psychosocial Impact |
Emotional burden of AI-generated diagnoses |
Ensure clinician-led communication with empathy and support |
| Aspect | Advantages of AI Approach | Limitations of AI Approach |
| Speed | Rapid analysis of large-scale genomic and phenotypic datasets | Limited validation for ultra-rare and atypical cases |
| Accuracy | High precision for syndromically well-defined conditions (e.g. achondroplasia) | Reduced accuracy in genetically heterogeneous or phenotypically ambiguous disorders |
| Accessibility | Expands diagnostic capacity in settings lacking subspecialty expertise | Dependent on data quality and input standardization |
| Result Interpretability | Transparent algorithms in some platforms allow reasoning review | “Black box” models hinder interpretability and trust |
| Cost-effectiveness | Long-term reduction in diagnostic odyssey costs | Initial investment required for infrastructure and training |
| Ethical Considerations | Enables faster diagnosis and personalized therapies | Risks of bias propagation and unequal diagnostic accuracy across populations |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).