Submitted:
01 July 2025
Posted:
01 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
3. Results
- COVID-19 confirmed cases per million population (monthly aggregation);
- Temporal decomposition (year as continuous variable, month as categorical one-hot encoding);
- Interpolated CPI data with temporal lags;
- Regularization analysis using Ridge and Lasso regression for feature selection.
3.1. Deep Machine Learning Models
- Feedforward Neural Network (XGBoost Top-10 Features): Feature-selected neural network using XGBoost importance rankings;
- XGBoost (Tabular): Gradient boosting with tabular data structure;
- BiLSTM + MultiHead Attention: Bidirectional LSTM with transformer-style attention mechanisms;
- Prophet (Seasonal Components Only): Facebook's Prophet algorithm utilizing solely seasonal patterns;
- BiLSTM + Attention: Bidirectional LSTM with standard attention layers.
3.2. Machine Learning Models
- Prophet: Seasonal expertise and external regressor integration. Prophet was specifically designed for business time series with strong seasonal effects and external influences - exactly matching tourism demand characteristics.
- Ridge: Regularized stability and interpretable baseline. Ridge provides a regularized linear baseline that prevents overfitting while offering interpretable coefficients for stakeholder communication.
- LightGBM: Nonlinear pattern recognition and feature interaction modeling. LightGBM excels at capturing complex nonlinear relationships and feature interactions that traditional time series models miss.
- Ensemble: Combines strengths while mitigating individual weaknesses. Model combination + Variance reduction.

4. Discussion
5. Conclusions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| CPI | Consumer Price Index |
| ML | Machine Learning |
| DML | Deep Machine Learning |
| ARIMA | Autoregressive Integrated Moving Average |
| MAE | Mean Absolute Error |
| RMSE | Root Mean Square Error |
| R2 | Coefficient of Determination |
| MAD | Mean Absolute Deviation |
| SMAPE | Symmetric Mean Absolute Percentage Error |
| ANN | Artificial Neural Network |
| SARIMAX | Seasonal Autoregressive Integrated Moving Average with Exogenous Regressors |
| LSTM | Long Short-Term Memory |
| BiLSTM | Bidirectional Long Short-Term Memory Network |
| XGBoost | eXtreme Gradient Boosting |
| LightGBM | Light Gradient Boosting Machine |
| 1 | MAE Difference: Absolute difference in forecast errors. |
| 2 | Significance Levels: p < 0.01 (Highly significant); p < 0.05 (Significant); p < 0.10 (Marginally significant). |
References
- Ammar Aamer, Luh Putu Eka Yani, and I Made Alan Priyatna. 2020. Data Analytics in the Supply Chain Management: Review of Machine Learning Applications in Demand Forecasting. Oper. Supply Chain Manag. Int. J. (December 2020), 1–13. [CrossRef]
- Marcos Álvarez-Díaz, Manuel González-Gómez, and María Otero-Giráldez. 2018. Forecasting International Tourism Demand Using a Non-Linear Autoregressive Neural Network and Genetic Programming. Forecasting 1, 1 (September 2018), 90–106. [CrossRef]
- Apostolos Ampountolas. 2021. Modeling and Forecasting Daily Hotel Demand: A Comparison Based on SARIMAX, Neural Networks, and GARCH Models. Forecasting 3, 3 (August 2021), 580–595. [CrossRef]
- Jason Brownlee. 2016. Time Series Prediction with LSTM Recurrent Neural Networks in Python with Keras. MachineLearningMastery.com. Retrieved June 30, 2025 from https://www.machinelearningmastery.com/time-series-prediction-lstm-recurrent-neural-networks-python-keras/.
- Geoffrey I. Crouch. 1994. Price Elasticities in International Tourism. Hosp. Res. J. 17, 3 (May 1994), 27–39. [CrossRef]
- Athanasia Dimitriadou, Gogas ,Periklis, and Theophilos and Papadimitriou. Tourism and uncertainty: a machine learning approach. Curr. Issues Tour. 0, 0 , 1–21. [CrossRef]
- G. Djurovic, V. Djurovic, and M.M. Bojaj. 2020. The macroeconomic effects of COVID-19 in Montenegro: a Bayesian VARX approach. Financ. Innov. 6, 1 (2020). [CrossRef]
- Yunxuan Dong, Ling Xiao, Jiasheng Wang, and Jujie Wang. 2023. A time series attention mechanism based model for tourism demand forecasting. Inf. Sci. 628, (May 2023), 269–290. [CrossRef]
- Noomesh Dowlut and Baby Gobin-Rahimbux. 2023. Forecasting resort hotel tourism demand using deep learning techniques – A systematic literature review. Heliyon 9, 7 (July 2023), e18385. [CrossRef]
- Yuntao Du, Jindong Wang, Wenjie Feng, Sinno Pan, Tao Qin, Renjun Xu, and Chongjun Wang. 2021. AdaRNN: Adaptive Learning and Forecasting of Time Series. [CrossRef]
- Martin Falk and Xiang Lin. 2018. Income elasticity of overnight stays over seven decades. Tour. Econ. 24, 8 (December 2018), 1015–1028. [CrossRef]
- Hanxi Fang, Song Gao, and Feng Zhang. 2023. Forecasting Inter-Destination Tourism Flow via a Hybrid Deep Learning Model. [CrossRef]
- Tsvetanka Georgieva-Trifonova and Olga Mancheva-Ali. 2024. Predicting Tourist Arrivals: A Google Trends-Based Model for Destination Management. TEM J. (August 2024), 1945–1951. [CrossRef]
- D. Girolimetto, G. Athanasopoulos, T. Di Fonzo, and R.J. Hyndman. 2024. Cross-temporal probabilistic forecast reconciliation: Methodological and practical issues. Int. J. Forecast. 40, 3 (2024), 1134–1151. [CrossRef]
- Ulrich Gunter, Egon Smeral, and Bozana Zekan. 2022. Forecasting Tourism in the EU after the COVID-19 Crisis. J. Hosp. Tour. Res. (September 2022), 10963480221125130. [CrossRef]
- Tomas Havranek and Ayaz Zeynalov. 2021. Forecasting tourist arrivals: Google Trends meets mixed-frequency data. Tour. Econ. 27, 1 (February 2021), 129–148. [CrossRef]
- Julien Herzen, Francesco Lässig, Samuele Giuliano Piazzetta, Thomas Neuer, Léo Tafti, Guillaume Raille, Tomas Van Pottelbergh, Marek Pasieka, Andrzej Skrodzki, Nicolas Huguenin, Maxime Dumonal, Jan Kościsz, Dennis Bader, Frédérick Gusset, Mounir Benheddi, Camila Williamson, Michal Kosinski, Matej Petrik, and Gaël Grosch. 2021. Darts: User-Friendly Modern Machine Learning for Time Series. (2021). [CrossRef]
- Shun-Chieh Hsieh. 2021. Tourism Demand Forecasting Based on an LSTM Network and Its Variants. Algorithms 14, 8 (August 2021), 243. [CrossRef]
- Mingming Hu and Haiyan Song. 2020. Data source combination for tourism demand forecasting. Tour. Econ. 26, 7 (November 2020), 1248–1265. [CrossRef]
- Indra Gunawan, Dwi Purnomo Putro, and Adhika Pramita Widyassari. 2023. Can Google Trends(GT) be used to predict tourist arrivals?: FB Prophet Machine Learning(ML) for Predicting Tourist Arrivals. Int. Conf. Digit. Adv. Tour. Manag. Technol. 1, 1 (December 2023), 132–142. [CrossRef]
- R. Shakir Al Jassim, Karan Jetly, Ahmad Abushakra, and Sh Al Mansori. 2023. A Review of the Methods and Techniques Used in Tourism Demand Forecasting. EAI Endorsed Trans. Creat. Technol. 9, 4 (January 2023), e1. [CrossRef]
- Dongbin Kim, Jinseong Park, Jaewook Lee, and Hoki Kim. 2024. Are Self-Attentions Effective for Time Series Forecasting? [CrossRef]
- Thanasis Kotsiopoulos, Panagiotis Sarigiannidis, Dimosthenis Ioannidis, and Dimitrios Tzovaras. 2021. Machine Learning and Deep Learning in smart manufacturing: The Smart Grid paradigm. Comput. Sci. Rev. 40, (May 2021), 100341. [CrossRef]
- Anna Kovacs-Györi, Dagmar Lahnsteiner, Johanna Schmitt, and Thomas Prinz. 2025. Spatiotemporal clustering based on international tourists’ overnight stay data in Salzburg, Austria: a seasonal analysis using space-time data cubes to enhance airport connectivity. Tour. Recreat. Res. (January 2025), 1–20. [CrossRef]
- Rob Law, Gang Li, Davis Ka Chio Fong, and Xin Han. 2019. Tourism demand forecasting: A deep learning approach. Ann. Tour. Res. 75, (March 2019), 410–423. [CrossRef]
- Geun-Cheol Lee. 2025. A Data-Driven Approach to Tourism Demand Forecasting: Integrating Web Search Data into a SARIMAX Model. Data 10, 5 (May 2025), 73. [CrossRef]
- Gang Li, Haiyan Song, and Stephen F. Witt. 2005. Recent Developments in Econometric Modeling and Forecasting. J. Travel Res. 44, 1 (August 2005), 82–99. [CrossRef]
- Hao Li, Gopi Krishnan Rajbahadur, Dayi Lin, Cor-Paul Bezemer, Zhen Ming, and Jiang. 2024. Keeping Deep Learning Models in Check: A History-Based Approach to Mitigate Overfitting. IEEE Access 12, (2024), 70676–70689. [CrossRef]
- Xin Li, Xu Zhang, Chengyuan Zhang, and Shouyang Wang. 2024. Forecasting tourism demand with a novel robust decomposition and ensemble framework. Expert Syst. Appl. 236, (February 2024), 121388. [CrossRef]
- YiFei Li and Han Cao. 2018. Prediction for Tourism Flow based on LSTM Neural Network. Procedia Comput. Sci. 129, (January 2018), 277–283. [CrossRef]
- A. Liu, V.S. Lin, G. Li, and H. Song. 2022. Ex Ante Tourism Forecasting Assessment. J. Travel Res. 61, 1 (2022), 64–75. [CrossRef]
- Wenxing Lu, Jieyu Jin, Binyou Wang, Keqing Li, Changyong Liang, Junfeng Dong, and Shuping Zhao. 2020. Intelligence in Tourist Destinations Management: Improved Attention-based Gated Recurrent Unit Model for Accurate Tourist Flow Forecasting. Sustainability 12, 4 (February 2020), 1390. [CrossRef]
- Christine A. Martin and Stephen F. Witt. 1989. Forecasting tourism demand: A comparison of the accuracy of several quantitative methods. Int. J. Forecast. 5, 1 (January 1989), 7–19. [CrossRef]
- Symi Nyns and Serge Schmitz. 2022. Using mobile data to evaluate unobserved tourist overnight stays. Tour. Manag. 89, (April 2022), 104453. [CrossRef]
- J.U. Obogo and F.F. Adedoyin. 2021. Data-Driven Business Analytics for the Tourism Industry in the UK: A Machine Learning Experiment Post-COVID. 2021. 78–86. [CrossRef]
- El Houssin Ouassou and Hafsa Taya. 2022. Forecasting Regional Tourism Demand in Morocco from Traditional and AI-Based Methods to Ensemble Modeling. Forecasting 4, 2 (April 2022), 420–437. [CrossRef]
- Deelip Patil and Kamal Alaskar. 2025. A hybrid approach for rainfall prediction: Leveraging prophet for seasonality and light GBM for residuals. In Progressive Computational Intelligence, Information Technology and Networking. CRC Press.
- Fotios Petropoulos, Daniele Apiletti, Vassilios Assimakopoulos, Mohamed Zied Babai, Devon K. Barrow, Souhaib Ben Taieb, Christoph Bergmeir, Ricardo J. Bessa, Jakub Bijak, John E. Boylan, Jethro Browell, Claudio Carnevale, Jennifer L. Castle, Pasquale Cirillo, Michael P. Clements, Clara Cordeiro, Fernando Luiz Cyrino Oliveira, Shari De Baets, Alexander Dokumentov, Joanne Ellison, Piotr Fiszeder, Philip Hans Franses, David T. Frazier, Michael Gilliland, M. Sinan Gönül, Paul Goodwin, Luigi Grossi, Yael Grushka-Cockayne, Mariangela Guidolin, Massimo Guidolin, Ulrich Gunter, Xiaojia Guo, Renato Guseo, Nigel Harvey, David F. Hendry, Ross Hollyman, Tim Januschowski, Jooyoung Jeon, Victor Richmond R. Jose, Yanfei Kang, Anne B. Koehler, Stephan Kolassa, Nikolaos Kourentzes, Sonia Leva, Feng Li, Konstantia Litsiou, Spyros Makridakis, Gael M. Martin, Andrew B. Martinez, Sheik Meeran, Theodore Modis, Konstantinos Nikolopoulos, Dilek Önkal, Alessia Paccagnini, Anastasios Panagiotelis, Ioannis Panapakidis, Jose M. Pavía, Manuela Pedio, Diego J. Pedregal, Pierre Pinson, Patrícia Ramos, David E. Rapach, J. James Reade, Bahman Rostami-Tabar, Michał Rubaszek, Georgios Sermpinis, Han Lin Shang, Evangelos Spiliotis, Aris A. Syntetos, Priyanga Dilini Talagala, Thiyanga S. Talagala, Len Tashman, Dimitrios Thomakos, Thordis Thorarinsdottir, Ezio Todini, Juan Ramón Trapero Arenas, Xiaoqian Wang, Robert L. Winkler, Alisa Yusupova, and Florian Ziel. 2022. Forecasting: theory and practice. Int. J. Forecast. 38, 3 (July 2022), 705–871. [CrossRef]
- Pyae-Pyae Phyo and Chawalit Jeenanunta. 2022. Advanced ML-Based Ensemble and Deep Learning Models for Short-Term Load Forecasting: Comparative Analysis Using Feature Engineering. Appl. Sci. 12, 10 (January 2022), 4882. [CrossRef]
- Greg Rafferty. 2021. Forecasting time series data with Facebook Prophet: build, improve, and optimize time series forecasting models using the advanced forecasting tool. Packt Publishing, Birmingham Mumbai.
- Ahmed Shoukry Rashad. 2022. The Power of Travel Search Data in Forecasting the Tourism Demand in Dubai. Forecasting 4, 3 (July 2022), 674–684. [CrossRef]
- Athanasios Salamanis, Georgia Xanthopoulou, Dionysios Kehagias, and Dimitrios Tzovaras. 2022. LSTM-Based Deep Learning Models for Long-Term Tourism Demand Forecasting. Electronics 11, 22 (November 2022), 3681. [CrossRef]
- Eleonora Santos. 2024. Sustainable Tourism Dynamics: Understanding the Impact of Tourist Stays on Regional Revenue and Development. Sustainability 16, 19 (September 2024), 8403. [CrossRef]
- Francesco Scotti, Andrea Flori, Piercesare Secchi, Marika Arena, and Giovanni Azzone. 2024. Exploring drivers of overnight stays and same-day visits in the tourism sector. Sci. Rep. 14, 1 (April 2024), 9840. [CrossRef]
- Zdravko Šergo, Jasmina Gržinić, and Anita Silvana Ilak Peršurić. 2021. The Effect of Tourism Overnight Stays on Croatia’s Extra Virgin Olive Oil Prices and Market Power: An Empirical Study. Interdiscip. Descr. Complex Syst. 19, 4 (December 2021), 526–541. [CrossRef]
- Jaime Serra, Antónia Correia, and Paulo M. M. Rodrigues. 2014. A comparative analysis of tourism destination demand in Portugal. J. Destin. Mark. Manag. 2, 4 (January 2014), 221–227. [CrossRef]
- Isabel Silva and Hugo Alonso. 2020. New developments in the forecasting of monthly overnight stays in the North Region of Portugal. J. Appl. Stat. 47, 13–15 (November 2020), 2927–2940. [CrossRef]
- Emmanouil Sofianos, Christos Alexakis, Periklis Gogas, and Theophilos Papadimitriou. 2025. Machine learning forecasting in the macroeconomic environment: the case of the US output gap. Econ. Change Restruct. 58, 1 (February 2025), 9. [CrossRef]
- Haiyan Song and Gang Li. 2008. Tourism demand modelling and forecasting—A review of recent research. Tour. Manag. 29, 2 (April 2008), 203–220. [CrossRef]
- Sean J. Taylor and Benjamin Letham. 2018. Forecasting at Scale. Am. Stat. 72, 1 (January 2018), 37–45. [CrossRef]
- Josip Tica and Ivan Kožić. 2015. Forecasting Croatian inbound tourism demand. Econ. Res.-Ekon. Istraživanja 28, 1 (January 2015), 1046–1062. [CrossRef]
- G. Tovmasyan. 2023. FACTORS THAT INFLUENCE DOMESTIC TOURISM DEMAND: EVIDENCE FROM ARMENIA. Econ. Sociol. 16, 2 (2023), 75–88. [CrossRef]
- Kevin K.F. Wong, Haiyan Song, and Kaye S. Chon. 2006. Bayesian models for tourism demand forecasting. Tour. Manag. 27, 5 (October 2006), 773–780. [CrossRef]
- Doris Chenguang Wu, Zheng Cao, Long Wen, and Haiyan Song. 2021. Scenario Forecasting for Global Tourism. J. Hosp. Tour. Res. 45, 1 (January 2021), 28–51. [CrossRef]
- Ke Xu, Junli Zhang, Junhao Huang, Hongbo Tan, Xiuli Jing, and Tianxiang Zheng. 2024. Forecasting Visitor Arrivals at Tourist Attractions: A Time Series Framework with the N-BEATS for Sustainable Tourism. Sustainability 16, 18 (September 2024), 8227. [CrossRef]
- Nan Yu and Jiaping Chen. 2022. Design of Machine Learning Algorithm for Tourism Demand Prediction. Comput. Math. Methods Med. 2022, (June 2022), 1–9. [CrossRef]
- Ying Yu, Yirui Wang, Shangce Gao, and Zheng Tang. 2017. Statistical Modeling and Prediction for Tourism Economy Using Dendritic Neural Network. Comput. Intell. Neurosci. 2017, (2017), 1–9. [CrossRef]
- Dong Zhang and Baozhuang Niu. 2024. Leveraging online reviews for hotel demand forecasting: A deep learning approach. Inf. Process. Manag. 61, 1 (January 2024), 103527. [CrossRef]
- Shuxin Zheng and Zhongguo Zhang. 2023. Adaptive tourism forecasting using hybrid artificial intelligence model: a case study of Xi’an international tourist arrivals. PeerJ Comput. Sci. 9, (November 2023), e1573. [CrossRef]
- Jin Zhou, Haiqi Li, and Wanling Zhong. 2021. A modified Diebold–Mariano test for equal forecast accuracy with clustered dependence. Econ. Lett. 207, (October 2021), 110029. [CrossRef]
- Diebold-Mariano Test. Retrieved June 28, 2025 from https://maggima.github.io/pages/stats/tests/forecasts/dm.html.


| Model | MAPE (%) | R² | MAE | RMSE |
| BiLSTM + Attention | 204,66 | 0,2812 | 970626,93 | 1046324,12 |
| Feedforward (Top10 XGBoost) | 53,78 | 0,1542 | 774433,87 | 1242585,89 |
| XGBoost (tabular) | 61,2 | 0,2014 | 802100,5 | 1314000 |
| Prophet (seasonal only) | 72,8 | 0,02 | 910000 | 1450000 |
| Model | MAE | RMSE | MAPE (%) | MAD | SMAPE (%) | Theil's U |
| Ensemble | 156,847 | 298,245 | 14.23 | 145,234 | 13.89 | 0.678 |
| Prophet | 174,592 | 327,891 | 16.85 | 162,567 | 15.67 | 0.743 |
| LightGBM | 168,934 | 315,672 | 15.94 | 156,789 | 14.78 | 0.712 |
| Ridge | 203,756 | 389,123 | 21.47 | 189,456 | 19.34 | 0.856 |
| Model | Ljung-Box p-value | ADF Test p-value | Normality p-value | Heteroscedasticity p-value |
| Ensemble | 0.234 | 0.001* | 0.156 | 0.089 |
| Prophet | 0.087 | 0.003** | 0.234 | 0.045** |
| LightGBM | 0.134 | 0.002** | 0.098 | 0.067 |
| Ridge | 0.023** | 0.001*** | 0.034** | 0.012** |
| Model Comparison | DM Statistic | p-value | Significance | Better Model | MAE Difference1 |
| Ensemble vs Prophet | -2.347 | 0.0189 | Yes | Ensemble | 17,745 |
| Ensemble vs LightGBM | -1.892 | 0.0585 | Marginal | Ensemble | 12,087 |
| Ensemble vs Ridge | -3.456 | 0.0005 | Yes | Ensemble | 46,909 |
| Prophet vs LightGBM | -0.673 | 0.5009 | No | Prophet | 5,658 |
| Prophet vs Ridge | -2.189 | 0.0286 | Yes | Prophet | 29,164 |
| LightGBM vs Ridge | -2.567 | 0.0103 | Yes | LightGBM | 34,822 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).