Research Article | | Peer-Reviewed

Accurate Prediction of Survival Based on Kaplan–Meier Analytics

Received: 27 November 2025     Accepted: 10 December 2025     Published: 29 December 2025
Views:       Downloads:
Abstract

This study integrates Kaplan–Meier survival analysis with the Stochastic and Augmented Interpretable Health Analytics (SAIHA) framework to model long-term survival in pancreatic cancer, a malignancy characterized by late diagnosis, rapid progression, and poor prognosis. The Kaplan–Meier estimator was first employed to nonparametrically characterize empirical survival probabilities across the observed follow-up period, capturing censoring patterns and short-term mortality dynamics without imposing distributional assumptions. This step provided a transparent baseline representation of survival up to approximately four years post-diagnosis, where empirical data density remains sufficient for reliable estimation. To address the limitations of traditional Kaplan–Meier analysis in extrapolating beyond observed follow-up, the SAIHA framework was then applied using a Weibull survival model to propagate uncertainty, incorporate population heterogeneity, and generate probabilistic survival projections into the long-term horizon. The Weibull distribution was selected for its flexibility in modeling monotonic hazard functions commonly observed in aggressive cancers and for its interpretability within clinical contexts. Parameter uncertainty was explicitly modeled to reflect variability in disease progression and treatment response across patients. The combined model predicts a pronounced decline in survival beyond year four, with the most likely five-year survival probability estimated near 3% and a median six-year survival approaching 1.5%. These projections align with known epidemiological patterns of pancreatic cancer and underscore the persistent lethality of the disease despite advances in therapy. Importantly, the SAIHA framework provides full survival distributions rather than point estimates, enabling clinicians and researchers to assess uncertainty bounds and tail risks associated with long-term outcomes. Overall, the integrated Kaplan–Meier–SAIHA approach extends classical survival analysis by combining empirical rigor with stochastic, distribution-aware forecasting. This methodology offers a robust and interpretable framework for high-risk clinical prediction, supporting more informed decision-making in oncology research, population health modeling, and precision medicine applications.

Published in Cancer Research Journal (Volume 13, Issue 4)
DOI 10.11648/j.crj.20251304.14
Page(s) 173-185
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2025. Published by Science Publishing Group

Keywords

Prediction Analytics, Kaplan-Meier Method, Weibull Distribution, SAIHA Framework

References
[1] Kaplan, E. L. (1958). Nonparametric Estimation from Incomplete Observations. Journal of the American Statistical Association:
[2] Schober P, V. T. (2018). Survival Analysis and Interpretation of Time-to-Event Data. National Library of Medicine:
[3] Cox, D. R. (1972). Regression models and life‐tables. Journal of the Royal Statistical Society: Series B (Methodological), 34(2), 187–220.
[4] Schober P, Vetter TR. Survival Analysis and Interpretation of Time-to-Event Data: The Tortoise and the Hare. Anesth Analg. 2018 Sep; 127(3): 792-798.
[5] Pang M, Platt RW, Schuster T, Abrahamowicz M. Spline-based accelerated failure time model. Stat Med. 2021 Jan 30; 40(2): 481-497.
[6] Fleming, S. T. (2020). Managerial Epidemiology: Cases and Concepts (4th ed.). Health Administration Press. Harrell, F. E. (2015). Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis (2nd ed.). Springer.
[7] Arcuri LJ, Souza Santos FP, Perini GF, Hamerschlak N. (2020). Fine and Gray or Cox model? Blood Adv. 2024 Mar 26; 8(6): 1420-1421.
[8] She Y, Jin Z, Wu J, Deng J, Zhang L, Su H, Jiang G, Liu H, Xie D, Cao N, Ren Y, Chen C. (2020). Development and Validation of a Deep Learning Model for Non-Small Cell Lung Cancer Survival. JAMA Netw Open. 2020 Jun 1; 3(6): e205842.
[9] Steingrimsson JA, Morrison S. (2020). Deep learning for survival outcomes. Stat Med. Author manuscript. 2020 Apr 13; 39(17): 2339-2349.
[10] Lee C, Yoon J, Schaar MV. Dynamic-DeepHit: A Deep Learning Approach for Dynamic Survival Analysis With Competing Risks Based on Longitudinal Data. IEEE Trans Biomed Eng. 2020 Jan; 67(1): 122-133.
[11] De Melo, P (2025). Prediction Modeling: Basic Metabolic Panel. Advances in Bioscience and Biotechnology, 16, 360-378.
[12] National Cancer Institute. (2023). Pancreatic cancer treatment (PDQ®)–Patient version.
[13] American Cancer Society. (2025). Survival rates for pancreatic cancer.
[14] Siegel, R. L., Miller, K. D., Wagle, N. S., & Jemal, A. (2024). Cancer statistics, 2024. CA: A Cancer Journal for Clinicians, 74(1), 12–43.
[15] Bakasa W, Viriri S. Pancreatic Cancer Survival Prediction: A Survey of the State-of-the-Art. Comput Math Methods Med. 2021 Sep 30; 2021:1188414.
[16] Osareh A., Shadgar B. Machine learning techniques to diagnose breast cancer. 2010 5th international symposium on health informatics and bioinformatics; 2010; Ankara, Turkey. pp. 114–120.
[17] Safiyari A., Javidan R. Predicting lung cancer survivability using ensemble learning methods. in 2017 Intelligent Systems Conference (IntelliSys); 2017; London, UK. pp. 684–688.
[18] Osman M. H. Pancreatic cancer survival prediction using machine learning and comparing its performance with TNM staging system and prognostic nomograms. AACR Annual Meeting 2019; 2019; Atlanta, GA.
[19] Song W., Miao D.-L., Chen L. Nomogram for predicting survival in patients with pancreatic cancer. Oncotargets and Therapy. 2018; 11:539–545.
[20] Chaves DO, Bastos AC, Almeida AM, Guerra MR, Teixeira MTB, Melo APS, Passos VMA. The increasing burden of pancreatic cancer in Brazil from 2000 to 2019: estimates from the Global Burden of Disease Study 2019. Rev Soc Bras Med Trop. 2022 Jan 28; 55(suppl 1): e0271.
[21] De Melo, P., (2024) Public Health Informatics and Technology, Library of Congress. Washington DC.
[22] Lanjewar, MG., Panchbhai, KG., Patle, LB., (2024), Fusion of transfer learning models with LSTM for detection of breast cancer using ultrasound images, Computers in biology and medicine, 2024-02, Vol. 169, p. 107914-107914, Article 107914.
[23] de Melo, P. and St. Rose, M. (2025) Accurate Classification of Diabetes via PM Generative AI. Advances in Bioscience and Biotechnology, 16, 379-409.
[24] Ying GS, Heitjan DF. Weibull prediction of event times in clinical trials. Pharm Stat. 2008 Apr-Jun; 7(2): 107-20.
Cite This Article
  • APA Style

    Melo, P. D., DiLella, M., Holman, T., McElveen, S. (2025). Accurate Prediction of Survival Based on Kaplan–Meier Analytics. Cancer Research Journal, 13(4), 173-185. https://doi.org/10.11648/j.crj.20251304.14

    Copy | Download

    ACS Style

    Melo, P. D.; DiLella, M.; Holman, T.; McElveen, S. Accurate Prediction of Survival Based on Kaplan–Meier Analytics. Cancer Res. J. 2025, 13(4), 173-185. doi: 10.11648/j.crj.20251304.14

    Copy | Download

    AMA Style

    Melo PD, DiLella M, Holman T, McElveen S. Accurate Prediction of Survival Based on Kaplan–Meier Analytics. Cancer Res J. 2025;13(4):173-185. doi: 10.11648/j.crj.20251304.14

    Copy | Download

  • @article{10.11648/j.crj.20251304.14,
      author = {Philip de Melo and Michele DiLella and Tameka Holman and Shakira McElveen},
      title = {Accurate Prediction of Survival Based on Kaplan–Meier Analytics},
      journal = {Cancer Research Journal},
      volume = {13},
      number = {4},
      pages = {173-185},
      doi = {10.11648/j.crj.20251304.14},
      url = {https://doi.org/10.11648/j.crj.20251304.14},
      eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.crj.20251304.14},
      abstract = {This study integrates Kaplan–Meier survival analysis with the Stochastic and Augmented Interpretable Health Analytics (SAIHA) framework to model long-term survival in pancreatic cancer, a malignancy characterized by late diagnosis, rapid progression, and poor prognosis. The Kaplan–Meier estimator was first employed to nonparametrically characterize empirical survival probabilities across the observed follow-up period, capturing censoring patterns and short-term mortality dynamics without imposing distributional assumptions. This step provided a transparent baseline representation of survival up to approximately four years post-diagnosis, where empirical data density remains sufficient for reliable estimation. To address the limitations of traditional Kaplan–Meier analysis in extrapolating beyond observed follow-up, the SAIHA framework was then applied using a Weibull survival model to propagate uncertainty, incorporate population heterogeneity, and generate probabilistic survival projections into the long-term horizon. The Weibull distribution was selected for its flexibility in modeling monotonic hazard functions commonly observed in aggressive cancers and for its interpretability within clinical contexts. Parameter uncertainty was explicitly modeled to reflect variability in disease progression and treatment response across patients. The combined model predicts a pronounced decline in survival beyond year four, with the most likely five-year survival probability estimated near 3% and a median six-year survival approaching 1.5%. These projections align with known epidemiological patterns of pancreatic cancer and underscore the persistent lethality of the disease despite advances in therapy. Importantly, the SAIHA framework provides full survival distributions rather than point estimates, enabling clinicians and researchers to assess uncertainty bounds and tail risks associated with long-term outcomes. Overall, the integrated Kaplan–Meier–SAIHA approach extends classical survival analysis by combining empirical rigor with stochastic, distribution-aware forecasting. This methodology offers a robust and interpretable framework for high-risk clinical prediction, supporting more informed decision-making in oncology research, population health modeling, and precision medicine applications.},
     year = {2025}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - Accurate Prediction of Survival Based on Kaplan–Meier Analytics
    AU  - Philip de Melo
    AU  - Michele DiLella
    AU  - Tameka Holman
    AU  - Shakira McElveen
    Y1  - 2025/12/29
    PY  - 2025
    N1  - https://doi.org/10.11648/j.crj.20251304.14
    DO  - 10.11648/j.crj.20251304.14
    T2  - Cancer Research Journal
    JF  - Cancer Research Journal
    JO  - Cancer Research Journal
    SP  - 173
    EP  - 185
    PB  - Science Publishing Group
    SN  - 2330-8214
    UR  - https://doi.org/10.11648/j.crj.20251304.14
    AB  - This study integrates Kaplan–Meier survival analysis with the Stochastic and Augmented Interpretable Health Analytics (SAIHA) framework to model long-term survival in pancreatic cancer, a malignancy characterized by late diagnosis, rapid progression, and poor prognosis. The Kaplan–Meier estimator was first employed to nonparametrically characterize empirical survival probabilities across the observed follow-up period, capturing censoring patterns and short-term mortality dynamics without imposing distributional assumptions. This step provided a transparent baseline representation of survival up to approximately four years post-diagnosis, where empirical data density remains sufficient for reliable estimation. To address the limitations of traditional Kaplan–Meier analysis in extrapolating beyond observed follow-up, the SAIHA framework was then applied using a Weibull survival model to propagate uncertainty, incorporate population heterogeneity, and generate probabilistic survival projections into the long-term horizon. The Weibull distribution was selected for its flexibility in modeling monotonic hazard functions commonly observed in aggressive cancers and for its interpretability within clinical contexts. Parameter uncertainty was explicitly modeled to reflect variability in disease progression and treatment response across patients. The combined model predicts a pronounced decline in survival beyond year four, with the most likely five-year survival probability estimated near 3% and a median six-year survival approaching 1.5%. These projections align with known epidemiological patterns of pancreatic cancer and underscore the persistent lethality of the disease despite advances in therapy. Importantly, the SAIHA framework provides full survival distributions rather than point estimates, enabling clinicians and researchers to assess uncertainty bounds and tail risks associated with long-term outcomes. Overall, the integrated Kaplan–Meier–SAIHA approach extends classical survival analysis by combining empirical rigor with stochastic, distribution-aware forecasting. This methodology offers a robust and interpretable framework for high-risk clinical prediction, supporting more informed decision-making in oncology research, population health modeling, and precision medicine applications.
    VL  - 13
    IS  - 4
    ER  - 

    Copy | Download

Author Information
  • Nursing and Allied Health, Norfolk State University, Norfolk, United States of America

  • Nursing and Allied Health, Norfolk State University, Norfolk, United States of America

  • Nursing and Allied Health, Norfolk State University, Norfolk, United States of America

  • Nursing and Allied Health, Norfolk State University, Norfolk, United States of America

  • Sections