Submitted:
25 August 2025
Posted:
26 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Generalized Age-Period-Cohort Models
2.1. Data and Notation
2.2. Age-Period-Cohort Structure
3. Tree-Based Machine Learning Models
- Decision tree model, the substructure of tree-based models,
- Random forest model, “ensemble” method constructs more than one decision tree,
- Gradient boosting model, “ensemble” method constructs decision trees sequentially,
- Extreme gradient boosting model, “ensemble” method constructs decision trees in parallel.
3.1. Decision Trees
3.2. Random Forest
3.3. Gradient Boosting
3.4. Extreme Gradient Boosting
4. ML Integrated Model Development
4.1. Improving the Accuracy of a GAPC Model
4.2. Evaluating the Forecasting Performance of a Model
5. A Procedure for Improving the Forecasting Ability
- 1.
- Reserve a hold-out testing period,
- 2.
- Fit a mortality model to the training data,
- 3.
- Extract fitted mortality rates,
- 4.
- Calculate for each age and year,
- 5.
- Forecast with same mortality model over the testing period,
- 6.
- Extract forecasted mortality rates and calculate model RMSE,
- 7.
- Calibrate with tree-based methods;
- a.
- Determine lower & upper limits of hyperparameters,
- b.
- Extract re-estimated series calculated with different set of hyperparameters,
- 8.
- Obtain each series using tree-based methods,
- 9.
- Calculate each series of over the testing period,
- 10.
- Identify the series that give less RMSE for testing period,
- 11.
- Find series used to forecast and calculate over training period,
- 12.
- Search for series that also give less RMSE for training period,
- 13.
- Repeat the steps for each mortality model.
6. Discussion
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
Appendix A.1. Test and Train RMSE of ML Integrated Models Below Pure Mortality Model’s RMSE in Use













Appendix A.2. Hyperparameter Set and Limits for Each ML Model
| Decision Tree | Random Forest | Gradient Boosting | Extreme Gradient Boosting | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| LC, RH | CBD | APC | M7, Plat | All | All | All | ||||
| cp | 0.001 | 0.001 | 0.01 | 0.001 | mtry | 1:4 | shrinkage | 0.01, 0.05, 0.1 | nrounds | 50, 100, 150 |
| minsplit | 1:10 | 1:10 | 1:10 | 1:5 | num.trees | 50,100,150,200,300 | n.trees | 50, 100, 150 | eta | 0.01, 0.05, 0.1 |
| maxdepth | 1:30 | 1:10 | 1:10 | 1:30 | min.node.size | 1:5 | interaction.depth | 1, 3, 5 | max_depth | 1, 3, 5 |
| minbucket | 1:20 | 1:10 | 1:10 | 10:40 | ||||||
Appendix B.1. Efficient Frontier of the Models






Appendix B.2. Male Version of Table 2.
| Pure | DT | RF | GB | XGB | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Test | Train | Min Test | Train | Min Train |
Min Test | Train | Min Train |
Min Test | Train | Min Train |
Min Test | Train | Min Train |
|
| LC | 2.1922 | 4.0438 | 2.1244 | 1.8127 | 1.7225 | 2.0977 | 2.1543 | 1.5052 | 2.1233 | 3.8256 | 3.3274 | 2.1167 | 2.8815 | 2.1907 |
| CBD | 2.0877 | 3.7642 | 2.0545 | 3.5016 | 2.8520 | 1.9631 | 1.7566 | 1.4695 | 2.0233 | 3.4770 | 3.3159 | 1.8876 | 3.6668 | 2.2994 |
| APC | 2.9781 | 3.9230 | 2.5461 | 3.6140 | 3.6140 | 2.3023 | 1.9350 | 1.4311 | 2.3170 | 3.3490 | 3.1862 | 2.2897 | 2.0956 | 2.0956 |
| RH | 2.2360 | 3.6468 | 2.2323 | 3.6204 | 3.2732 | 2.2293 | 3.1565 | 3.1402 | 2.2252 | 3.3834 | 3.3150 | 2.2074 | 2.8230 | 2.8230 |
| M7 | 2.1140 | 3.6614 | 2.0973 | 3.5582 | 3.3742 | 2.0878 | 1.9122 | 1.7146 | 2.0868 | 3.5766 | 3.3615 | 2.1051 | 3.5421 | 2.8256 |
| Plat | 2.3274 | 3.6173 | 1.9786 | 3.2534 | 3.1819 | 2.0428 | 2.2212 | 1.4345 | 2.1522 | 3.1389 | 3.1389 | 1.8804 | 3.2549 | 1.9184 |
References
- Richman, R. AI in Actuarial Science - A Review of Recent Advances - Part 2. Ann. Actuar. Sci. 2021, 15, 230–258. [CrossRef]
- Commentary on Title IV of Directive 2009/138/EC on the Taking up and Pursuit of the Business of Insurance and Reinsurance (Solvency II); 2009.
- Cairns, A.J.G.; Blake, D.; Dowd, K. Modelling and Management of Mortality Risk: A Review. Scand. Actuar. J. 2008, 1238, 79–113. [CrossRef]
- Lee, R.D.; Carter, L.R. Modeling and Forecasting U. S. Mortality. J. Am. Stat. Assoc. 1992, 87, 659. [CrossRef]
- Basellini, U.; Camarda, C.G.; Booth, H. Thirty Years on: A Review of the Lee–Carter Method for Forecasting Mortality. Int. J. Forecast. 2023, 39, 1033–1049. [CrossRef]
- Zeddouk, F.; Devolder, P. Mean Reversion in Stochastic Mortality: Why and How? Eur. Actuar. J. 2020, 10, 499–525. [CrossRef]
- Long-Term Budgetary Pressures and Policy Options; Washington, D.C., 1998.
- Brouhns, N.; Denuit, M.; Vermunt, J.K. A Poisson Log-Bilinear Regression Approach to the Construction of Projected Lifetables. Insur. Math. Econ. 2002, 31, 373–393. [CrossRef]
- Renshaw, A.E.; Haberman, S. A Cohort-Based Extension to the Lee-Carter Model for Mortality Reduction Factors. Insur. Math. Econ. 2006, 38, 556–570. [CrossRef]
- Hobcraft, J.; Menken, J.; Preston, S. Age, Period, and Cohort Effects in Demography: A Review. Popul. Index 1982, 48, 4–43. [CrossRef]
- Osmond, C. Using Age, Period and Cohort Models to Estimate Future Mortality Rates. Int. J. Epidemiol. 1985, 14.
- Currie, I.D. Smoothing and Forecasting Mortality Rates with P-Splines. 2006.
- Cairns, A.J.G.; Blake, D.; Dowd, K. A Two-Factor Model for Stochastic Mortality with Parameter Uncertainty: Theory and Calibration. J. Risk Insur. 2006, 73, 687–718. [CrossRef]
- Cairns, A.J.G.; Blake, D.; Dowd, K.; Coughlan, G.D.; Epstein, D.; Ong, A.; Balevich, I. A Quantitative Comparison of Stochastic Mortality Models Using Data From England and Wales and the United States. North Am. Actuar. J. 2009, 13, 1–35. [CrossRef]
- Plat, R. On Stochastic Mortality Modeling. Insur. Math. Econ. 2009, 45, 393–404. [CrossRef]
- Currie, I.D. On Fitting Generalized Linear and Non-Linear Models of Mortality. Scand. Actuar. J. 2016, 2016, 356–383. [CrossRef]
- Villegas, A.M.; Millossovich, P.; Kaishev, V.K. StMoMo: Stochastic Mortality Modeling in R. J. Stat. Softw. 2018, 84. [CrossRef]
- Hunt, A.; Blake, D. A General Procedure for Constructing Mortality Models. North Am. Actuar. J. 2014, 18, 116–138. [CrossRef]
- Hunt, A.; Blake, D. On the Structure and Classification of Mortality Models. North Am. Actuar. J. 2021, 25, S215–S234. [CrossRef]
- Haberman, S.; Renshaw, A. A Comparative Study of Parametric Mortality Projection Models. Insur. Math. Econ. 2011, 48, 35–55. [CrossRef]
- Booth, H.; Tickle, L. MORTALITY MODELLING AND FORECASTING: A REVIEW OF METHODS By H. Booth and L. Tickle. Ann. Actuar. Sci. H., L. Tickle, ‘MORTALITY Model. Forecast. A Rev. METHODS By H. Booth L. Tickle’, Ann. Actuar. Sci. 3/I/II (2008), 3–43 <https//openresearch-repository.anu.edu.au/bitstream/1885/55237/2/ 2008, 3, 3–43.
- Pitacco, E., Denuit, M., Haberman, S., Olivieri, A. Modelling Longevity Dynamics for Pensions and Annuity Business; Oxford University Press: Okford, 2009.
- Dowd, K.; Cairns, A.J.G.; Blake, D.; Coughlan, G.D.; Epstein, D.; Khalaf-Allah, M. Evaluating the Goodness of Fit of Stochastic Mortality Models. Insur. Math. Econ. 2010, 47, 255–265. [CrossRef]
- Zamzuri, Z.H.; Hui, G.J. Comparing and Forecasting Using Stochastic Mortality Models: A Monte Carlo Simulation. Sains Malaysiana 2020, 49, 2013–2022. [CrossRef]
- Redzwan, N.; Ramli, R. A Bibliometric Analysis of Research on Stochastic Mortality Modelling and Forecasting. Risks 2022, 10. [CrossRef]
- Nithya C; Saravanan V A Study of Machine Learning Techniques in Data Mining. Int. Sci. Ref. Res. J. © 2018 SISRRJ | 2018, 1, 31–34.
- Richman, R.; Wüthrich, M. V. A Neural Network Extension of the Lee-Carter Model to Multiple Populations. Ann. Actuar. Sci. 2021, 15, 346–366. [CrossRef]
- Levantesi, S.; Pizzorusso, V. Application of Machine Learning to Mortality Modeling and Forecasting. Risks 2019, 7, 1–19. [CrossRef]
- Deprez, P.; Shevchenko, P. V.; Wüthrich, M. V. Machine Learning Techniques for Mortality Modeling. Eur. Actuar. J. 2017, 7, 337–352. [CrossRef]
- Levantesi, S.; Nigri, A. A Random Forest Algorithm to Improve the Lee–Carter Mortality Forecasting: Impact on q-Forward. Soft Comput. 2020, 24, 8553–8567. [CrossRef]
- Bjerre, D.S. Tree-Based Machine Learning Methods for Modeling and Forecasting Mortality. ASTIN Bull. 2022, 52, 765–787. [CrossRef]
- Gyamerah, S.A.; Mensah, A.A.; Asare, C.; Dzupire, N. Improving Mortality Forecasting Using a Hybrid of Lee–Carter and Stacking Ensemble Model. Bull. Natl. Res. Cent. 2023, 47. [CrossRef]
- Qiao, Y.; Wang, C.W.; Zhu, W. Machine Learning in Long-Term Mortality Forecasting. Geneva Pap. Risk Insur. Issues Pract. 2024, 49, 340–362. [CrossRef]
- Levantesi, S.; Lizzi, M.; Nigri, A. Enhancing Diagnostic of Stochastic Mortality Models Leveraging Contrast Trees: An Application on Italian Data. Qual. Quant. 2024, 58, 1565–1581. [CrossRef]
- Friedman, J.H. Contrast Trees and Distribution Boosting. Proc. Natl. Acad. Sci. U. S. A. 2020, 117, 21175–21184. [CrossRef]
- Human Mortality Database Available online: www.mortality.org.
- McCullagh P., N.J.A. Generalized Linear Models; 2nd ed.; Chapman and Hall: London, 1989; ISBN 9780128186299.
- RStudio: Integrated Development for R. 2020.
- Gross, K. Tree-Based Models: How They Work in Plain English Available online: https://blog.dataiku.com/tree-based-models-how-they-work-in-plain-english.
- Breiman, L., Friedman, J. H., Olshen, R. A., Stone, C.J. Classification and Regression Trees; Boca Raton: CRC Press, 1984.
- Shalizi, C. Regression Trees; 2006.
- Breiman, L. Random Forests. Mach. Learn. 2001, 45, 32.
- Theobald, O. Machine Learning for Absolute Beginners; 2017; ISBN 1549617214.
- Friedman, J.H. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2001, 29, 1189–1232.
- Gradient-Boosting @ Www.Ibm.Com Available online: https://www.ibm.com/think/topics/gradient-boosting.
- Truong, H.P. Predicting Aircraft Availability: A Machine Learning Approach, California State University, 2022.
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer Texts in Statistics, 2023; ISBN 9780387781884.
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. Proc. ACM SIGKDD Int. Conf. Knowl. Discov. Data Min. 2016, 13-17-Augu, 785–794. [CrossRef]
- Sheridan, R.P.; Wang, W.M.; Liaw, A.; Ma, J.; Gifford, E.M. Extreme Gradient Boosting as a Method for Quantitative Structure-Activity Relationships. J. Chem. Inf. Model. 2016, 56, 2353–2360. [CrossRef]
- Bhuse, P.; Gandhi, A.; Meswani, P.; Muni, R.; Katre, N. Machine Learning Based Telecom-Customer Churn Prediction. Proc. 3rd Int. Conf. Intell. Sustain. Syst. ICISS 2020 2020, 1297–1301. [CrossRef]
- Pareto, V. Cours d’Économie Politique. In; 1986; Vol. 1.
- Chang, K.-H. Multiobjective Optimization and Advanced Topics; 2015; ISBN 9780123820389.





| Mortality Model | Short Definition | Structure |
|---|---|---|
| Lee-Carter (LC) | Static age function, an age-period term, no cohort effect. | |
| Renshaw-Haberman (RH) | Generalizes LC by adding cohort effect. | |
| Age-Period-Cohort (APC) | Basic form of age-period-cohort models. | |
| Cairns-Blake-Dowd (CBD) | Two age-period terms, no static age function, no cohort effect. | |
| M7 | CBD with quadratic age effect and a cohort effect. | |
| Plat | Hybrid version of LC and CBD. |
where, |
| Pure | DT | RF | GB | XGB | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Min Test | Min Train | Min Test | Train | Min Train |
Min Test | Train | Min Train |
Min Test | Train | Min Train |
Min Test | Train | Min Train |
|
| LC | 0.8084 | 2.1349 | 0.7615 | 1.9795 | 1.2229 | 0.7635 | 1.1311 | 0.8207 | 0.7668 | 1.9176 | 1.8477 | 0.7547 | 1.6795 | 1.3928 |
| CBD | 1.0778 | 2.6752 | 0.9048 | 1.9989 | 1.4155 | 1.0201 | 1.9784 | 1.8203 | 0.9776 | 2.0735 | 1.8669 | 1.0075 | 1.7899 | 1.4575 |
| APC | 1.7488 | 2.4273 | 1.3095 | 2.0338 | 2.0338 | 1.0152 | 1.1911 | 0.8452 | 1.0411 | 1.8730 | 1.8061 | 0.9701 | 1.3537 | 1.3537 |
| RH | 1.0053 | 1.9992 | 1.0001 | 1.4051 | 1.4051 | 0.8101 | 1.2610 | 0.8504 | 0.9330 | 1.8479 | 1.7672 | 0.7681 | 1.8907 | 1.3177 |
| M7 | 2.7858 | 1.9877 | 2.4128 | 1.9242 | 1.9198 | 2.4457 | 1.2030 | 0.8489 | 2.3781 | 1.7950 | 1.7824 | 1.9853 | 1.8368 | 1.2736 |
| Plat | 2.1286 | 2.0379 | 1.8430 | 1.9432 | 1.9277 | 1.9029 | 1.2281 | 0.8454 | 1.8686 | 1.7536 | 1.7536 | 1.5092 | 1.9064 | 1.2454 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).