Submitted:
11 July 2025
Posted:
11 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Results
2.1. Transcriptomic Datasets
2.2. Correlated Gene Modules
2.3. Candidate Causal Genes in IPF
2.4. Candidate Genes Enriched in Pro-Fibrotic Niches
2.5. Mediator Genes Associated with IPF Severity
2.6. Biomarker Candidates in IPF
2.7. Small-Molecule Compounds Targeting the Causal Genes
3. Discussion
4. Materials and Methods
4.1. WGCNA and Candidate Correlated Modules
4.2. Mediation Analysis and Candidate Causal Genes


4.3. Machine Learning Models for Biomarker Analysis
4.4. DeepCE Model for Small-Molecule Screening
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Sun, D.; et al. Why 90% of clinical drug development fails and how to improve it? Acta Pharmaceutica Sinica B 2022, 12, 3049–3062. [Google Scholar] [CrossRef] [PubMed]
- Pham, T.H.; et al. A deep learning framework for high-throughput mechanism-driven phenotype compound screening and its application to COVID-19 drug repurposing. Nat Mach Intell 2021, 3, 247–257. [Google Scholar] [CrossRef] [PubMed]
- Xie, L.; et al. Towards Structural Systems Pharmacology to Study Complex Diseases and Personalized Medicine. PLOS Computational Biology 2014, 10, e1003554. [Google Scholar] [CrossRef] [PubMed]
- Zhavoronkov, A.; et al. Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nature Biotechnology 2019, 37, 1038–1040. [Google Scholar] [CrossRef] [PubMed]
- Lounkine, E.; et al. Large-scale prediction and testing of drug activity on side-effect targets. Nature 2012, 486, 361–367. [Google Scholar] [CrossRef] [PubMed]
- Ghandikota, S.K.; Jegga, A.G. Chapter Seven - Application of artificial intelligence and machine learning in drug repurposing, in Progress in Molecular Biology and Translational Science; Singh, V., Ed.; Academic Press, 2024; pp. 171–211. [Google Scholar]
- Langfelder, P.; Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 2008, 9, 559. [Google Scholar] [CrossRef] [PubMed]
- Zhang, B.; Horvath, S. A General Framework for Weighted Gene Co-Expression Network Analysis. 2005, 4. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y. CWGCNA: an R package to perform causal inference from the WGCNA framework. NAR Genom Bioinform 2024, 6, lqae042. [Google Scholar] [CrossRef] [PubMed]
- Kaur, A.; Mathai, S.K.; Schwartz, D.A. Genetics in Idiopathic Pulmonary Fibrosis Pathogenesis, Prognosis, and Treatment. Frontiers in Medicine 2017, 4. [Google Scholar] [CrossRef] [PubMed]
- Maher, T.M. Interstitial Lung Disease: A Review. JAMA 2024, 331, 1655–1665. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.; et al. Pan-transcriptome-based candidate therapeutic discovery for idiopathic pulmonary fibrosis. Therapeutic Advances in Respiratory Disease 2020, 14, 1753466620971143. [Google Scholar] [CrossRef] [PubMed]
- Kolb, M.; Bonella, F.; Wollin, L. Therapeutic targets in idiopathic pulmonary fibrosis. Respiratory Medicine 2017, 131, 49–57. [Google Scholar] [CrossRef] [PubMed]
- Furusawa, H.; et al. Chronic Hypersensitivity Pneumonitis, an Interstitial Lung Disease with Distinct Molecular Signatures. Am J Respir Crit Care Med 2020, 202, 1430–1444. [Google Scholar] [CrossRef] [PubMed]
- McDonough, J.E.; et al. Transcriptional regulatory model of fibrosis progression in the human lung. JCI Insight 2019, 4. [Google Scholar] [CrossRef] [PubMed]
- Jaffar, J.; et al. Matrix metalloproteinase-7 is increased in lung bases but not apices in idiopathic pulmonary fibrosis. ERJ Open Res 2022, 8. [Google Scholar] [CrossRef] [PubMed]
- Barrett, T.; et al. NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res 2013, 41, D991–D995. [Google Scholar] [CrossRef] [PubMed]
- Subramanian, A.; et al. A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles. Cell 2017, 171, 1437–1452. [Google Scholar] [CrossRef] [PubMed]
- Law, C.W.; et al. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biology 2014, 15, R29. [Google Scholar] [CrossRef] [PubMed]
- Wishart, D.S.; et al. DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res 2006, 34, D668–D672. [Google Scholar] [CrossRef] [PubMed]
- Cannon, M.; et al. DGIdb 5.0: rebuilding the drug-gene interaction database for precision medicine and drug discovery platforms. Nucleic Acids Res 2024, 52, D1227–D1235. [Google Scholar] [CrossRef] [PubMed]
- Corsello, S.M.; et al. The Drug Repurposing Hub: a next-generation drug library and information resource. Nat Med 2017, 23, 405–408. [Google Scholar] [CrossRef] [PubMed]
- Mayr, C.H.; et al. Spatial transcriptomic characterization of pathologic niches in IPF. Science Advances 10, eadl5473. [CrossRef] [PubMed]
- Ghandikota, S.; et al. Consensus Gene Co-Expression Network Analysis Identifies Novel Genes Associated with Severity of Fibrotic Lung Disease. International Journal of Molecular Sciences 2022, 23. [Google Scholar] [CrossRef] [PubMed]
- Friedman, J.H.; Hastie, T.; Tibshirani, R. Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of Statistical Software 2010, 33, 1–22. [Google Scholar] [CrossRef] [PubMed]
- Tay, J.K.; Narasimhan, B.; Hastie, T. Elastic Net Regularization Paths for All Generalized Linear Models. Journal of Statistical Software 2023, 106, 1–31. [Google Scholar] [CrossRef] [PubMed]
- Zhang, T.; et al. Single Cell RNA-Seq Identifies Cell Subpopulations Contributing to Idiopathic Pulmonary Fibrosis in Humans. J Cell Mol Med 2025, 29, e70402. [Google Scholar] [CrossRef] [PubMed]
- Prele, C.M.; et al. Plasma cell but not CD20-mediated B-cell depletion protects from bleomycin-induced lung fibrosis. Eur Respir J 2022, 60. [Google Scholar] [CrossRef] [PubMed]
- Cui, H.; et al. Inhibition of Glutaminase 1 Attenuates Experimental Pulmonary Fibrosis. Am J Respir Cell Mol Biol 2019, 61, 492–500. [Google Scholar] [CrossRef] [PubMed]
- Ge, J.; et al. Glutaminolysis Promotes Collagen Translation and Stability via alpha-Ketoglutarate-mediated mTOR Activation and Proline Hydroxylation. Am J Respir Cell Mol Biol 2018, 58, 378–390. [Google Scholar] [CrossRef] [PubMed]
- He, A.R.; et al. First-in-Human Phase I Study of Merestinib, an Oral Multikinase Inhibitor, in Patients with Advanced Cancer. Oncologist 2019, 24, e930–e942. [Google Scholar] [CrossRef] [PubMed]
- Crestani, B.; et al. Hepatocyte growth factor and lung fibrosis. Proc Am Thorac Soc 2012, 9, 158–163. [Google Scholar] [CrossRef] [PubMed]
- Stella, G.M.; et al. Ockham's razor for the MET-driven invasive growth linking idiopathic pulmonary fibrosis and cancer. J Transl Med 2016, 14, 256. [Google Scholar] [CrossRef] [PubMed]
- Stella, G.M.; et al. The oncogenic landscape of the idiopathic pulmonary fibrosis: a narrative review. Translational Lung Cancer Research 2022, 11, 472–496. [Google Scholar] [CrossRef] [PubMed]
- El Awdan, S.A.; et al. Regression of fibrosis by cilostazol in a rat model of thioacetamide-induced liver fibrosis: Up regulation of hepatic cAMP, and modulation of inflammatory, oxidative stress and apoptotic biomarkers. PLoS One 2019, 14, e0216301. [Google Scholar] [CrossRef] [PubMed]
- Hada, Y.; et al. Cilostazol Attenuates AngII-Induced Cardiac Fibrosis in apoE Deficient Mice. Int J Mol Sci 2022, 23. [Google Scholar] [CrossRef] [PubMed]
- Zhou, Y.; et al. Inhibition of mechanosensitive signaling in myofibroblasts ameliorates experimental pulmonary fibrosis. The Journal of Clinical Investigation 2013, 123, 1096–1108. [Google Scholar] [CrossRef] [PubMed]
- Langfelder, P.; Zhang, B.; Horvath, S. Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R. Bioinformatics 2008, 24, 719–720. [Google Scholar] [CrossRef] [PubMed]
- VanderWeele, T.J. Mediation Analysis: A Practitioner's Guide. Annu Rev Public Health 2016, 37, 17–32. [Google Scholar] [CrossRef] [PubMed]
- Weininger, D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. Journal of Chemical Information and Computer Sciences 1988, 28, 31–36. [Google Scholar] [CrossRef]
- Grover, A.; Leskovec, J. node2vec: Scalable feature learning for networks. in Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining. 2016.
- Szklarczyk, D.; et al. The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res 2023, 51, D638–D646. [Google Scholar] [CrossRef] [PubMed]


| GEO Accession ID | # IPF | # Controls | Reference |
|---|---|---|---|
| GSE150910 | 103 | 103 | [14] |
| GSE124685 | 49 (19=mild (IPF1); 16=moderate (IPF2); 14=advanced (IPF3)) | 35 | [15] |
| GSE213001 | 61 (22=Advanced, 27=Severe) | 40 | [16] |
| Gene | AUPRC | AUROC | |
|---|---|---|---|
| IPF vs Controls | ITM2C | 0.993 0.006 | 0.987 0.01 |
| PRTFDC1 | 0.989 0.007 | 0.983 0.011 | |
| CRABP2 | 0.984 0.01 | 0.971 0.018 | |
| CPNE7 | 0.969 0.018 | 0.955 0.024 | |
| FAM83D | 0.965 0.018 | 0.943 0.029 | |
| NMNAT2 | 0.964 0.006 | 0.945 0.026 | |
| P4HA3 | 0.95 0.03 | 0.933 0.034 | |
| PDGFD | 0.945 0.031 | 0.932 0.036 | |
| PAPPA2 | 0.943 0.023 | 0.902 0.04 | |
| Severe IPF vs Controls | ITM2C | 0.996 0.004 | 0.997 0.003 |
| CPNE7 | 0.97 0.036 | 0.98 0.026 | |
| PRTFDC1 | 0.969 0.024 | 0.975 0.02 | |
| CRABP2 | 0.96 0.022 | 0.958 0.025 | |
| NMNAT2 | 0.943 0.018 | 0.956 0.014 | |
| LAX1 | 0.943 0.039 | 0.957 0.029 | |
| PAPPA2 | 0.92 0.042 | 0.927 0.045 | |
| Advanced IPF vs Controls | ITM2C | 0.996 0.006 | 0.998 0.004 |
| CRABP2 | 0.991 0.013 | 0.994 0.008 | |
| PRTFDC1 | 0.983 0.017 | 0.99 0.01 | |
| FAM83D | 0.976 0.02 | 0.986 0.011 | |
| NMNAT2 | 0.947 0.033 | 0.968 0.021 | |
| MYOF | 0.936 0.033 | 0.955 0.026 | |
| P4HA3 | 0.927 0.058 | 0.965 0.022 | |
| CDH3 | 0.923 0.047 | 0.949 0.029 | |
| CPNE7 | 0.903 0.058 | 0.936 0.038 |
| Drug | A375 | HA1E | HELA | HT29 | MCF7 | PC3 | YAPC |
|---|---|---|---|---|---|---|---|
| 1,4-Bis((3,4-dimethoxyphenyl)sulfonyl)-1,4-diazepane | -0.25 (0.051) | -0.27 (0.036) | -0.34 (0.006) | -0.28 (0.027) | -0.33 (0.009) | -0.25 (0.053) | -0.28 (0.03) |
| CB-839 (Telaglenastat) | -0.29 (0.023) | 0.04 (0.76) | -0.29 (0.02) | -0.26 (0.044) | -0.25 (0.049) | -0.28 (0.03) | -0.18 (0.16) |
| [2-(4-Amino-1,2,5-oxadiazol-3-yl)-1-ethylimidazo[4,5-c]pyridin-7-yl]-[(3S)-3-aminopyrrolidin-1-yl]methanone | -0.31 (0.014) | -0.18 (0.17) | -0.26 (0.041) | -0.32 (0.014) | -0.31 (0.015) | -0.29 (0.02) | -0.19 (0.13) |
| Aminofurazanyl-azabenzimidazole 6n | -0.35 (0.005) | -0.27 (0.03) | -0.3 (0.02) | -0.23 (0.07) | -0.29 (0.023) | -0.25 (0.053) | -0.19 (0.15) |
| [2-(4-Amino-furazan-3-yl)-1-ethyl-1H-imidazo[4,5-c]pyridin-7-ylmethyl]-piperidin-4-yl-amine | -0.28 (0.025) | -0.1 (0.42) | -0.25 (0.052) | -0.3 (0.02) | -0.28 (0.026) | -0.3 (0.02) | -0.09 (0.51) |
| Pentamidine | -0.31 (0.015) | -0.16 (0.24) | -0.21 (0.098) | -0.21 (0.11) | -0.26 (0.046) | -0.26 (0.044) | -0.27 (0.034) |
| RHC-80267 | -0.22 (0.089) | 0.08 (0.52) | -0.18 (0.16) | -0.35 (0.006) | -0.29 (0.023) | -0.23 (0.075) | -0.26 (0.039) |
| RK-682 | -0.27 (0.033) | 0.12 (0.35) | 0.019 (0.88) | 0.22 (0.097) | -0.18 (0.17) | -0.22 (0.091) | -0.29 (0.025) |
| Merestinib | -0.08 (0.54) | -0.09 (0.48) | -0.33 (0.008) | -0.06 (0.64) | -0.28 (0.03) | -0.07 (0.57) | -0.05 (0.73) |
| LY-255283 | -0.25 (0.053) | 0.17 (0.2) | -0.18 (0.16) | -0.26 (0.047) | -0.2 (0.12) | -0.28 (0.028) | -0.07 (0.58) |
| Cilostazol | -0.31 (0.015) | 0.018 (0.89) | -0.15 (0.25) | -0.16 (0.22) | -0.11 (0.41) | -0.06 (0.67) | -0.28 (0.033) |
| M2-PK-activator | -0.19 (0.15) | -0.26 (0.04) | -0.21 (0.11) | 0.25 (0.05) | -0.22 (0.089) | -0.23 (0.07) | -0.29 (0.02) |
| NNC-711 | -0.11 (0.4) | -0.03 (0.79) | -0.2 (0.11) | -0.3 (0.018) | -0.27 (0.038) | 0.1 (0.44) | 0.009 (0.95) |
| CID 11973736 | -0.23 (0.077) | -0.2 (0.12) | -0.33 (0.009) | -0.24 (0.06) | -0.32 (0.014) | -0.18 (0.16) | -0.07 (0.6) |
| Azeloyl diethyl salicylate | -0.15 (0.26) | 0.1 (0.45) | -0.34 (0.008) | -0.32 (0.011) | -0.16 (0.22) | -0.19 (0.13) | 0.02 (0.88) |
| Cetrimonium | -0.31 (0.015) | -0.04 (0.78) | -0.15 (0.26) | 0.23 (0.07) | -0.2 (0.13) | -0.28 (0.027) | -0.24 (0.06) |
| Decamethonium | -0.26 (0.043) | -0.04 (0.75) | -0.06 (0.65) | -0.26 (0.047) | -0.23 (0.072) | -0.23 (0.07) | -0.24 (0.056) |
| Gemcadiol | -0.29 (0.021) | -0.004 (0.98) | -0.06 (0.63) | -0.25 (0.049) | -0.14 (0.29) | -0.25 (0.056) | 0.24 (0.066) |
| Clofilium | -0.28 (0.026) | 0.23 (0.079) | -0.19 (0.14) | -0.25 (0.051) | -0.14 (0.29) | -0.27 (0.03) | -0.06 (0.64) |
| 1-[[4,5-Bis(4-methoxyphenyl)-2-thiazolyl]carbonyl]-4-methylpiperazine | -0.13 (0.33) | -0.15 (0.24) | -0.29 (0.025) | -0.17 (0.19) | -0.28 (0.029) | -0.17 (0.19) | 0.04 (0.78) |
| Zindotrine | -0.03 (0.82) | 0.17 (0.18) | 0.05 (0.68) | -0.28 (0.03) | -0.19 (0.15) | 0.04 (0.74) | -0.31 (0.015) |
| Eliglustat | -0.04 (0.78) | -0.01 (0.94) | -0.26 (0.04) | -0.27 (0.036) | -0.1 (0.44) | -0.16 (0.22) | -0.01 (0.93) |
| 3-(Azepan-1-ylsulfonyl)-N-(3-bromophenyl)benzamide | -0.02 (0.87) | -0.25 (0.054) | -0.26 (0.04) | -0.08 (0.54) | -0.11 (0.43) | -0.08 (0.53) | -0.25 (0.047) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).