Submitted:
03 June 2026
Posted:
03 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Novel ensemble architecture: A weighted late-fusion of GRU and CoxPH that uniquely combines deep temporal learning with statistical survival analysis—unlike prior work that uses either deep OR statistical models, not both synergistically.
- Interpretability in nonlinear hazard modeling: The GCE captures time-varying risk patterns while providing SHAP-based feature importance and CoxPH-anchored transparency—addressing the black-box criticism of DL in clinical settings.
- Robust rare-disease validation: Rigorous feature engineering (LASSO, RSF, PCA) and 10-fold CV with strict leakage prevention on a small SEER cohort (n=727), achieving strong performance (C-index 0.9830, IBS 0.03958) with clinical actionability.
2. Literature Review
2.1. Cancer Survival Prediction Using AI
2.2. Challenges in SEER-Based Prognostic Modeling
3. Materials and Methods
3.1. Data Collection and Patient Selection
3.2. Feature Selection
3.3. Proposed GCE Framework
| Algorithm 1:GCE Framework for Survival Risk Prediction |
|
3.4. Baseline and Core Models
3.5. Experimental Setup and Evaluation
4. Results
4.1. Performance of GCE Framework
4.2. Impact of Feature Selection and Reduction Strategies
4.3. Performance of Classical and Baseline Models
4.4. Deep Learning Models and Ensemble Behavior
4.5. Interpretability and Feature Importance Analysis
4.6. Comparison with Existing Studies
5. Discussion
Limitations
6. Conclusions
Funding
Author Contributions: Muhammad Shoaib Kareem
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Qiu, Y. L.; Zheng, H.; Devos, A.; Selby, H.; Gevaert, O. A meta-learning approach for genomic survival analysis. Nat. Commun. 2020, 11, 6350. [Google Scholar] [CrossRef]
- Yin, K.; et al. Primary cardiac lymphoma. J. Thorac. Cardiovasc. Surg. 2022, 164, 573–580. [Google Scholar] [CrossRef]
- Bussani, R.; et al. Cardiac tumors: diagnosis, prognosis, and treatment. Curr. Cardiol. Rep. 2020, 22, 169. [Google Scholar] [CrossRef] [PubMed]
- Bangolo, A.; et al. Ten-Year Trends in Hepatocellular Carcinoma Mortality. Diseases 2025, 13, 256. [Google Scholar] [CrossRef] [PubMed]
- Huang, Y.; et al. Deep learning prediction model for patient survival outcomes. Cancers CrossRtef. 2023, 15, 2232. [Google Scholar] [CrossRef] [PubMed]
- Bishnoi, R.; et al. Real-world experience of carfilzomib-associated cardiovascular adverse events. Cancer Med. 2021, 10, 70–78. [Google Scholar] [CrossRef]
- Liao, J.; Zhou, Z. Long-term cardiovascular mortality risk in patients with bladder cancer. Front. Cardiovasc. Med. 2023, 10, 1142417. [Google Scholar] [CrossRef]
- Rahouma, M.; et al. Geographic variation in malignant cardiac tumors and their outcomes. Front. Oncol. 2023, 13, 1071770. [Google Scholar] [CrossRef]
- Cheng, P.; Xie, X.; Knoedler, S.; Mi, B.; Liu, G. Predicting overall survival in chordoma patients using machine learning models. J. Orthop. Surg. Res. 2023, 18, 652. [Google Scholar] [CrossRef]
- Zhang, S.; et al. Personalized prediction for multiple chronic diseases by multi-task Cox learning model. PLoS Comput. Biol. 2023, 19, e1011396. [Google Scholar] [CrossRef]
- Peng, C.; et al. Predicting overall survival in chordoma patients using machine learning models. J. Orthop. Surg. Res. 2023, 18. [Google Scholar] [CrossRef]
- Yan, L.; et al. Deep learning models for predicting survival of chondrosarcoma. Front. Oncol. 2022, 12. [Google Scholar] [CrossRef]
- Zeng, J.J.; Li, K.; Cao, F.; Zheng, Y. Deep learning prognosis prediction of gastrointestinal stromal tumor. Sci. Rep. 2024, 14. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y.; Xie, L.; Wang, D.; Xia, K. Deep learning algorithm for cancer-specific survival in osteosarcoma. PLoS ONE 2023, 18. [Google Scholar] [CrossRef]
- Kiessling, J.; et al. AI outperforms Kaplan–Meier survival estimation. Eur. J. Vasc. Endovasc. Surg. 2023, 65, 600–607. [Google Scholar] [CrossRef] [PubMed]
- Yin, Q.; Chen, W.; Zhang, C.; Wei, Z. CNN model for survival prediction. Lab. Investig. 2022, 102, 1064–1074. [Google Scholar] [CrossRef]
- Tran, K.; et al. Deep learning in cancer diagnosis and prognosis. Genome Med. 2021, 13. [Google Scholar] [CrossRef]
- Vale-Silva, L.A.; Rohr, K. MultiSurv multimodal survival prediction. medRxiv 2020. [Google Scholar] [CrossRef]
- Yao, Z.; et al. Multimodal deep learning with imaging and clinical data. arXiv 2024. [Google Scholar] [CrossRef]
- Maigari, A.; et al. Multimodal deep learning breast cancer prognosis. J. Med. Artif. Intell. 2023. [Google Scholar] [CrossRef]
- Liao, J.; Zhou, Z. Long-term cardiovascular mortality risk in bladder cancer patients. Front. Cardiovasc. Med. 2023, 10. [Google Scholar] [CrossRef] [PubMed]
- National Cancer Institute (NCI). Surveillance, Epidemiology, and End Results (SEER) Program. SEER Database. Available online: https://seer.cancer.gov/.
- Vo, J.B.; et al. Heart disease mortality among breast cancer survivors. Breast Cancer Res. Treat. 2022, 192, 611–622. [Google Scholar] [CrossRef] [PubMed]
- Zheng, Y.; et al. Machine learning in cardio-oncology. Rev. Cardiovasc. Med. 2023, 24. [Google Scholar] [CrossRef] [PubMed]
- Felix, A.S.; et al. Cardiovascular mortality after endometrial cancer. Int. J. Cancer 2016, 140, 555–564. [Google Scholar] [CrossRef]
- Mo, X.; et al. Competing risk analysis in kidney cancer. BMC Cancer 2021, 21. [Google Scholar] [CrossRef]
- Zhang, X.; et al. Predictors of five-year survival in hepatocellular carcinoma. Cancer Causes Control 2021, 32, 317–325. [Google Scholar] [CrossRef]
- Wang, S.C.; et al. Deep learning survival prediction in hepatocellular carcinoma. Sci. Rep. 2024, 14. [Google Scholar] [CrossRef]
- Ezaz, G.; et al. Risk prediction for heart failure after trastuzumab therapy. J. Am. Heart Assoc. 2014, 3. [Google Scholar] [CrossRef]
- Luo, Y.; et al. Heart-specific death in breast cancer patients. Sci. Rep. 2025, 15. [Google Scholar] [CrossRef]
- Xing, H.; et al. Cardiovascular mortality in lung carcinoid tumors. Medicine 2023, 102. [Google Scholar] [CrossRef]
- Caruso, C.M.; et al. Deep learning survival prediction in lung cancer. Comput. Methods Programs Biomed. 2024, 254, 108308. [Google Scholar] [CrossRef] [PubMed]
- Xiao, M.; et al. Survival outcomes of primary cardiac lymphoma. Hematol. Oncol. 2020, 38, 334–343. [Google Scholar] [CrossRef] [PubMed]
- Bishnoi, R.; et al. Carfilzomib cardiovascular events. Cancer Med. 2020, 10, 70–78. [Google Scholar] [CrossRef]
- Hammami, M.B.; et al. Survival outcomes of primary cardiac sarcoma. Anatol. J. Cardiol. 2021, 25, 104. [Google Scholar] [CrossRef] [PubMed]
- Babaei Rikan, S.; et al. Survival prediction of glioblastoma. Sci. Rep. 2024, 14, 2371. [Google Scholar] [CrossRef]
- Zhang, J.; et al. Deep-learning survival prediction of melanoma. Discov. Oncol. 2023, 14. [Google Scholar] [CrossRef]
- Xu, Y.; et al. Deep learning survival prediction in breast cancer. Sci. Rep. 2025, 15. [Google Scholar]
- Faghiri, F.; Kohansal, A. Cox model with Bayesian neural network. Sci. Rep. 2025, 15. [Google Scholar] [CrossRef]
- Al-Badawi, I.A.; et al. Cardiovascular mortality in ovarian cancer. Medicina 2023, 59, 1476. [Google Scholar] [CrossRef]
- Huang, J.; et al. Cardiovascular mortality in Merkel cell carcinoma. BMC Geriatr. 2024, 24. [Google Scholar] [CrossRef]
- Vale-Silva, L.A.; Rohr, K. Multimodal survival prediction. Sci. Rep. 2021, 11. [Google Scholar] [CrossRef]
- Zeng, J.; et al. Survival prediction in gastric adenocarcinoma. Front. Oncol. 2023, 13, 1131859. [Google Scholar] [CrossRef]
- Cao, G.; et al. Postoperative survival prediction for hepatocellular carcinoma. Res. Sq. 2023. [Google Scholar] [CrossRef]
- Sedighi-Maman, Z.; Heath, J.J. Interpretable lung cancer survivability model. Sensors 2022, 22, 6783. [Google Scholar] [CrossRef] [PubMed]
- Pickett, K.; et al. Random survival forests for dynamic predictions. BMC Med. Res. Methodol. 2021, 21. [Google Scholar] [CrossRef]
- Utkin, L.V.; et al. Weighted random survival forest. Knowl.-Based Syst. 2019, 177, 136–144. [Google Scholar] [CrossRef]
- Miandoab, P.; et al. CNN-GRU model for liver tumor tracking. Med. Phys. 2025, 52. [Google Scholar] [CrossRef]
- Asghar, N.; et al. Hybrid CoxPH and DeepHit survival prediction. BMC Med. Inform. Decis. Mak. 2024, 24, 120. [Google Scholar] [CrossRef]
- Tian, D.; et al. ML prognostic model after lung transplantation. JAMA Netw. Open 2023, 6. [Google Scholar] [CrossRef]
- Cai, M.; et al. Random survival forests for spinal chordomas. J. Clin. Neurosci. 2025, 142, 111697. [Google Scholar] [CrossRef]
- Lin, W.; et al. DeepSurv for tongue cancer survival prediction. J. Craniomaxillofac. Surg. 2025, 53, 1334–1343. [Google Scholar] [CrossRef]
- Obite, C.P.; et al. Factor-enhanced DeepSurv model. Comput. Biol. Med. 2025, 189, 109963. [Google Scholar] [CrossRef]
- Aslan, M.F.; et al. CNN-based survival prediction for heart failure. Biomed. Signal Process. Control 2021, 68, 102716. [Google Scholar] [CrossRef]
- Mustafa, E.; et al. Ensemble framework for breast cancer survivability. Diagnostics 2023, 13, 1688. [Google Scholar] [CrossRef]
- Absar, N.; et al. LSTM model for disease outbreak prediction. Infect. Dis. Model. 2022, 7, 170–183. [Google Scholar] [CrossRef]
- Saha, A.; et al. GRU model for COVID-19 patient representation. ACM BCB 2023. [Google Scholar] [CrossRef]
- Pradeepa, M.; et al. EfficientNet-GRU model for breast cancer detection. Sci. Rep. 2025. [Google Scholar] [CrossRef]
- Zhang, S.; et al. Multi-task Cox model for chronic diseases. PLoS Comput. Biol. 2023, 19. [Google Scholar] [CrossRef]
- Pruitt, S.L.; et al. Survival of pancreatic cancer patients. Cancer Med. 2023, 12, 200–212. [Google Scholar] [CrossRef]
- Zhao, Y.; et al. Wavelet deep learning for cancer prognosis. BMC Bioinform. 2020, 21. [Google Scholar] [CrossRef]
- Katzman, J.; et al. DeepSurv recommendation system. BMC Med. Res. Methodol. 2018, 18. [Google Scholar] [CrossRef]
- Tuersun, A.; et al. Interpretable ML for esophageal cancer survival. Front. Physiol. 2025, 16. [Google Scholar] [CrossRef]
- Yang, X.; Qiu, H.; Wang, L.; Wang, X. Predicting colorectal cancer survival using ML. J. Med. Internet Res. 2023, 25. [Google Scholar] [CrossRef]
- Rasool, A.; Tao, R.; et al. Statistic solution for machine learning to analyze heart disease data. In Proceedings of the 12th International Conference on Machine Learning and Computing (ICMLC), 2020; pp. 134–139. [Google Scholar] [CrossRef]
- Adam, N.; Wieder, R. Predictive Modeling of Long-Term Survivors with Stage IV Breast Cancer Using the SEER-Medicare Dataset. Cancers 2024, 16(23), 4033. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]






| Author (Year) | Disease / Dataset | Features | Model | Evaluation Parameter / Findings | Challenges |
|---|---|---|---|---|---|
| [43] (2023) | Gastric adenocarcinoma / SEER | Clinical and demographic variables | DeepSurv, CoxPH, RSF | C-index 0.825–0.871, IBS reported; Improved nonlinear handling over CoxPH | Modest accuracy (below 0.90); Limited interpretability of DL predictions; No ensemble fusion; Overlooks temporal dependencies in rare cohorts |
| [28] (2024) | Hepatocellular carcinoma / SEER | Tumor characteristics, demographics, treatments | N-MTLR, CoxPH, DeepSurv, RSF | C-index 0.824, IBS; Better than baselines in high-dimensional data | Accuracy below 90%; No hybrid statistical-DL integration; Ignores competing risks like cardiovascular mortality; Lacks SHAP-based feature insights |
| [38] (2025) | Breast cancer / SEER | Clinical, imaging, demographics | N-MTLR, CoxPH, RSF | C-index 0.771–0.821, IBS; CoxPH strong but DL adds nonlinearity | Low overall accuracy; No temporal modeling (GRU/LSTM); Limited to non-rare tumors; Absence of feature engineering (PCA/LASSO) for small datasets |
| [8] (2023) | Cardiac angiosarcoma / NCDB | Histology, stage, treatments | Cox regression | 95% CI statistical analysis; Identified geographic variations | Descriptive only; No predictive modeling; Low focus on rare cardiac sarcoma; No DL for nonlinear patterns; Generalizability issues |
| [35] (2021) | Primary cardiac sarcoma / SEER | Demographics, tumor size, survival | Univariate/ multivariate regression | Regression analysis; Prognostic factors identified | Analysis-focused, not prediction; No DL/ML for complexity; Missing interpretability; Fails to address SEER’s data sparsity |
| [36] (2024) | Glioblastoma / SEER | Clinical features | XGBoost, AdaBoost, DT, KNN, RF, DNN | MSE, RMSE (%) 90.25; ML vs DL comparison | Uses non-survival metrics; No hybrids for temporal data; Overfitting in small cohorts; Lacks transparency for clinical use |
| [2] (2020) | Primary cardiac lymphoma / SEER | Age, histology, survival | Kaplan-Meier, statistical analysis | IQR, survival curves; Descriptive trends | Descriptive only; No predictive models; Ignores nonlinear interactions; Limited to lymphoma, not sarcoma; No validation for robustness |
| [32] (2024) | Lung cancer / CLARO | Clinical features | AI model | C-index = 80.72 | Low accuracy; No ensemble or hybrid; Limited interpretability; Dataset-specific, not generalizable |
| [5] (2023) | Actigraphy Data & Clinical Information | Clinical Information | DL models: LSTM, BiLSTM, GRU, RNN | KPS, Palliative Performance Index (PPI) = 0.89 | Few researchers used this; No SEER integration; Lacks fusion with statistical models; No SHAP for feature importance |
| Configuration Group | Component / Parameter | Value / Setting |
|---|---|---|
| Data & Input | Dataset | SEER survival dataset |
| Survival Time Variable | Survival months | |
| Event Indicator | Vital status recode (0 = censored, 5 = event) | |
| Value Handling | Median imputation (numeric) | |
| Random Seed | 42 | |
| Validation Strategy | Cross-Validation | 10-Fold Cross-Validation (shuffle = True) |
| Train–Validation Split | 80% Train, 20% Validation (within training fold) | |
| Stratification | Based on event status | |
| Cox Proportional Hazards | Model Type | Baseline/Lifelines CoxPH |
| Penalization | L2 penalizer = 0.1 | |
| Output | Partial hazard scores (Risk) | |
| Survival Model | Architecture–GRU | 3-layer GRU |
| Hidden Units | 64 Units | |
| Dropout | 0.2 (between layers) | |
| Temporal Modeling | Learns nonlinear feature interactions & latent survival dynamics | |
| Input Format | Feature vector reshaped to sequence | |
| Optimizer | Adam | |
| Epochs | 50 | |
| Batch Size | 64 (Model training) | |
| Output | Fully Connected linear layer | |
| Ensemble Strategy | Fusion Method | Weighted averaging |
| Ensemble Weights | 0.2 × CoxPH + 0.8 × GRU | |
| Evaluation Setup | Evaluation Time Grid | 100 time points between min–max survival |
| Baseline Survival | Kaplan–Meier estimator | |
| Performance Metrics | Discrimination Metric | Concordance Index (C-index) |
| Calibration Metric | Integrated Brier Score (IBS) | |
| Reporting | Per-fold and mean performance across 10-fold |
| Evaluation Category | Model | C-index | IBS |
|---|---|---|---|
| Performance of Deep learning models | CNN [54] | 0.884399 | 0.00585 |
| LSTM [56] | 0.893473 | 0.00604 | |
| GRU [58] | 0.901388 | 0.00516 | |
| Proposed Framework | 0.936179 | 0.00417 | |
| Mean Score across 10-fold CV | CoxPH [49] | 0.8842 | 0.03280 |
| LSTM [56] | 0.9289 | 0.04985 | |
| GRU [58] | 0.9345 | 0.05105 | |
| Proposed Framework | 0.9830 | 0.03958 |
| Principal Component | Explained Variance Ratio | Principal Component | Explained Variance Ratio |
|---|---|---|---|
| PC1 | 0.155156 | PC9 | 0.039989 |
| PC2 | 0.126531 | PC10 | 0.039268 |
| PC3 | 0.093936 | PC11 | 0.036696 |
| PC4 | 0.077580 | PC12 | 0.036647 |
| PC5 | 0.068787 | PC13 | 0.034826 |
| PC6 | 0.051185 | PC14 | 0.030896 |
| PC7 | 0.048991 | PC15 | 0.027285 |
| PC8 | 0.042818 | PC16 | 0.079200 |
| Evaluation Category | Model | Test C-index | IBS |
|---|---|---|---|
| PCA-transformed features (20) | CoxPH [49] | 0.8719 | 0.03541 |
| RSF [51] | 0.7858 | 0.05762 | |
| DeepSurv [53] | 0.8649 | 0.01463 | |
| MTLR [15] | 0.8381 | 0.02344 | |
| Proposed Framework | 0.9830 | 0.03958 | |
| Selected features (27) | CoxPH [49] | 0.8842 | 0.03280 |
| RSF [51] | 0.8322 | 0.04830 | |
| DeepSurv [53] | 0.9061 | 0.01990 | |
| MTLR [15] | 0.8411 | 0.03290 | |
| Proposed Framework | 0.9830 | 0.03958 |
| Feature Set | Model | Train C-index | Test C-index | IBS |
|---|---|---|---|---|
| LASSO | CoxPH | 0.311968 | 0.307459 | 0.082238 |
| LASSO | RSF | 0.702466 | 0.678802 | 0.084416 |
| LASSO | DeepSurv | 0.696201 | 0.669805 | 0.074979 |
| LASSO | MTLR | 0.676988 | 0.615818 | 0.080173 |
| RSF | CoxPH | 0.278054 | 0.251886 | 0.076725 |
| RSF | RSF | 0.737036 | 0.727185 | 0.077034 |
| RSF | DeepSurv | 0.846882 | 0.828495 | 0.043584 |
| RSF | MTLR | 0.763219 | 0.786194 | 0.044982 |
| Union | CoxPH | 0.238360 | 0.215671 | 0.066030 |
| Union | RSF | 0.766275 | 0.744114 | 0.065525 |
| Union | DeepSurv | 0.896908 | 0.856182 | 0.047134 |
| Union | MTLR | 0.764552 | 0.705477 | 0.051882 |
| Author (Year) | Disease or Dataset | Key Models | C-index (mean/test) | IBS | Our Framework Advantage |
|---|---|---|---|---|---|
| [43] (2023) | Gastric adenocarcinoma / SEER | DeepSurv, CoxPH, RSF | 0.825–0.871 | 0.1421 | Higher C-index (0.9830); ensemble + SHAP interpretability |
| [28] (2024) | Hepatocellular carcinoma / SEER | N-MTLR, DeepSurv, RSF | 0.824 | 0.1598 | Superior discrimination/calibration; temporal GRU focus |
| [38] (2025) | Breast cancer / SEER | N-MTLR, CoxPH, RSF | 0.771–0.821 | 0.110 | Far higher C-index; addresses small-data engineering gaps |
| [36] (2024) | Glioblastoma / SEER | XGBoost, DNN, RF | (MSE/RMSE ≈90%) | N/A | Survival-specific metrics (C-index/IBS); fusion robustness |
| [8] (2023) | Cardiac angiosarcoma / NCDB | Cox regression | (descriptive) | N/A | DL-based prediction; high C-index in related rare tumor |
| [5] (2023) | Actigraphy Data & Clinical Information | DL models: LSTM, BiLSTM, GRU, RNN | Palliative Performance Index=0.89 | N/A | Strong temporal modeling baseline |
| [32] (2024) | Lung cancer | AI model | C-index = 80.72 | N/A | Lower accuracy compared to proposed ensemble |
| [63] (2025) | Esophageal cancer SEER | CoxPH, RSF, GLMboost, DeepSurv | AUC > 0.81 | 0.175 | Lower calibration in related rare tumor |
| GCE | Primary cardiac sarcoma / SEER | GCE framework | 0.9830 (mean) | 0.03958 | — |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.