Submitted:
11 June 2026
Posted:
12 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Cohort and Data Sources
2.2. Feature Engineering
2.3. Statistical Analysis Overview
2.4. Logistic Regression and Incremental Discrimination
2.5. Sensitivity Analyses
2.6. Survival Analysis
2.7. Differential Expression and Pathway Analysis
2.8. Pathway-Level Concordance
2.9. Software
3. Results
3.1. Cohort Characteristics
3.2. Association Between SPAG1 and the PFI Captured by the Clinical Stage
3.3. SPAG1 Expression Independently Predicted Lymph Node Metastasis
3.4. SPAG1 Transcriptional Programmes Were Concordant with the Molecular Signature of Nodal Metastasis
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Briganti, A.; Blute, M.L.; Eastham, J.H.; Graefen, M.; Heidenreich, A.; Karnes, J.R.; Montorsi, F.; Studer, U.E. Pelvic lymph node dissection in prostate cancer. Eur. Urol. 2009, 55, 1251–1265. [Google Scholar] [CrossRef] [PubMed]
- Cornford, P.; Tilki, D.; van den Bergh, R.C.N.; Eberli, D.; Fonteyne, V.; Gandaglia, G.; Gillessen, S.; Henry, A.M.; van Leenders, G.J.L.H.; Oldenburg, J.; et al. EAU Guidelines on Prostate Cancer. Available online: http://uroweb.org/guidelines/compilations-of-all-guidelines/.
- Bustos, M.A.; Chong, K.K.; Koh, Y.; Kim, S.; Ziarnik, E.; Ramos, R.I.; Jimenez, G.; Krasne, D.L.; Allen, W.M.; Wilson, T.G.; et al. Transcriptomic miRNA and mRNA signatures in primary prostate cancer that are associated with lymph-node invasion. Clin. Transl. Med. 2025, 15, e70288. [Google Scholar] [CrossRef] [PubMed]
- Siliņa, K.; Zayakin, P.; Kalniņa, Z.; Ivanova, L.; Meistere, I.; Endzeliņš, E.; Abols, A.; Stengrēvics, A.; Leja, M.; Ducena, K.; et al. Sperm-associated antigens as targets for cancer immunotherapy: expression pattern and humoral immune response in cancer patients. J. Immunother. 2011, 34, 28–44. [Google Scholar] [CrossRef] [PubMed]
- Neesse, A.; Gangeswaran, R.; Luettges, J.; Feakins, R.; Weeks, M.E.; Lemoine, N.R.; Crnogorac-Jurcevic, T. Sperm-associated antigen 1 is expressed early in pancreatic tumorigenesis and promotes motility of cancer cells. Oncogene 2007, 26, 1533–1545. [Google Scholar] [CrossRef] [PubMed]
- Lin, S.; Lv, Y.; Zheng, L.; Mao, G.; Peng, F. Expression and Prognosis of Sperm-Associated Antigen 1 in Human Breast Cancer. Onco Targets Ther. 2021, 14, 2689–2698. [Google Scholar] [CrossRef] [PubMed]
- Gu, Y.; Chu, M.Q.; Xu, Z.J.; Yuan, Q.; Zhang, T.J.; Lin, J.; Zhou, J.D. Comprehensive analysis of SPAG1 expression as a prognostic and predictive biomarker in acute myeloid leukemia by integrative bioinformatics and clinical validation. BMC Med. Genom. 2022, 15, 38. [Google Scholar] [CrossRef] [PubMed]
- Shamsara, E.; Shamsara, J. Bioinformatics analysis of the genes involved in the extension of prostate cancer to adjacent lymph nodes by supervised and unsupervised machine learning methods: The role of SPAG1 and PLEKHF2. Genomics 2020, 112, 3871–3882. [Google Scholar] [CrossRef] [PubMed]
- Goldman, M.J.; Craft, B.; Hastie, M.; Repečka, K.; McDade, F.; Kamath, A.; Banerjee, A.; Luo, Y.; Rogers, D.; Brooks, A.N.; et al. Visualizing and interpreting cancer genomics data via the Xena platform. Nat. Biotechnol. 2020, 38, 675–678. [Google Scholar] [CrossRef] [PubMed]
- Network, C.G.A.R. The Molecular Taxonomy of Primary Prostate Cancer. Cell 2015, 163, 1011–1025. [Google Scholar] [CrossRef] [PubMed]
- Liu, J.; Lichtenberg, T.; Hoadley, K.A.; Poisson, L.M.; Lazar, A.J.; Cherniack, A.D.; Kovatich, A.J.; Benz, C.C.; Levine, D.A.; Lee, A.V.; et al. An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell 2018, 173, 400–416.e411. [Google Scholar] [CrossRef] [PubMed]
- Yoshihara, K.; Shahmoradgoli, M.; Martínez, E.; Vegesna, R.; Kim, H.; Torres-Garcia, W.; Treviño, V.; Shen, H.; Laird, P.W.; Levine, D.A.; et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 2013, 4, 2612. [Google Scholar] [CrossRef] [PubMed]
- Li, T.; Fu, J.; Zeng, Z.; Cohen, D.; Li, J.; Chen, Q.; Li, B.; Liu, X.S. TIMER2.0 for analysis of tumor-infiltrating immune cells. Nucleic Acids Res. 2020, 48, W509–w514. [Google Scholar] [CrossRef] [PubMed]
- Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef] [PubMed]
- Korotkevich, G.; Sukhov, V.; Budin, N.; Shpak, B.; Artyomov, M.N.; Sergushichev, A. Fast gene set enrichment analysis. bioRxiv 2021, 060012. [Google Scholar] [CrossRef]



| Characteristic | N0 (n = 345) | N1 (n = 79) | Missing N (n = 73) | p-valueᵃ |
| Age at diagnosis (years), median (IQR) | 62 (57–66) | 63 (57–68) | 59 (54–66) | 0.715 |
| PSA at sample collection (ng/mL), median (IQR) | 0.10 (0.03–0.11) | 0.10 (0.03–0.60) | 0.04 (0.03–0.10) | 0.187 |
| Pathological T stage, n (%) | 7.0 × 10⁻¹⁰ | |||
| T2 | 140 (40.6%) | 3 (3.8%) | 44 (60.3%) | |
| T3/T4 | 201 (58.3%) | 76 (96.2%) | 26 (35.6%) | |
| Unknown | 4 (1.2%) | 0 (0.0%) | 3 (4.1%) | |
| Gleason grade group, n (%) | 6.1 × 10⁻¹³ | |||
| Gleason ≤ 7 (Low/Intermediate) | 218 (63.2%) | 14 (17.7%) | 60 (82.2%) | |
| Gleason ≥ 8 (High) | 127 (36.8%) | 65 (82.3%) | 13 (17.8%) | |
| Surgical margin, n (%) | 9.0 × 10⁻⁹ | |||
| Negative | 240 (69.6%) | 27 (34.2%) | 48 (65.8%) | |
| Positive | 89 (25.8%) | 46 (58.2%) | 17 (23.3%) | |
| Biochemical recurrence, n (%) | 0.085 | |||
| Yes | 39 (11.3%) | 15 (19.0%) | 4 (5.5%) | |
| No | 266 (77.1%) | 54 (68.4%) | 51 (69.9%) | |
| Vital status, n (%) | 0.378 | |||
| Alive | 339 (98.3%) | 76 (96.2%) | 72 (98.6%) | |
| Deceased | 6 (1.7%) | 3 (3.8%) | 1 (1.4%) | |
| Progression-free interval (months), median (IQR) | 26.9 (15.6–44.9) | 23.9 (11.5–41.0) | 27.3 (12.5–41.6) | 0.145 |
| PFI events, n (%) | 62 (18.0%) | 22 (27.8%) | 9 (12.3%) | |
| SPAG1 expression (log₂ RSEM), median (IQR) | 7.22 (6.74–7.67) | 7.85 (7.39–8.38) | 6.93 (6.57–7.39) | 6.3 × 10⁻¹¹ |
| SPAG1 quartile | n | N0, n | N1, n | N1 rate (%) |
| Q1 (lowest) | 105 | 97 | 8 | 7.6 |
| Q2 | 105 | 96 | 9 | 8.6 |
| Q3 | 105 | 84 | 21 | 20.0 |
| Q4 (highest) | 105 | 64 | 41 | 39.0 |
| Variable | Reference | Adjusted OR | 95% CI | p-value |
| T stage T3/T4 | T2 | 8.59 | 2.98–36.40 | 4.9 × 10⁻⁴ |
| Gleason grade ≥ 8 (High) | Gleason ≤ 7 | 4.10 | 2.18–8.15 | 2.5 × 10⁻⁵ |
| SPAG1 expression (per log₂ RSEM unit) | — | 2.14 | 1.50–3.13 | 4.8 × 10⁻⁵ |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.