Submitted:
20 October 2025
Posted:
21 October 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Knowledge Graph Enhancement: The pharmaceutical knowledge graph is designed to capture the interrelationships between drugs and symptoms, thereby supporting more accurate pharmaceutical demand forecasting. Ablation analysis indicates that approximately half of the model’s performance improvement can be attributed to the relational information derived from the knowledge graph, highlighting its crucial role in enhancing predictive accuracy.
- Model Innovation: We propose a hybrid model by combining GCN and LSTM. This design allows the model to simultaneously capture inter-drug relational dependencies and temporal demand patterns, leading to more accurate and robust predictions of pharmaceutical demand.
- Empirical Validation: Extensive experiments on a real-world pharmaceutical sales dataset demonstrate that the proposed model significantly outperforms classical statistical and deep learning models across multiple metrics.
2. Literature Review
3. Methods
- Graph Data Construction via Knowledge Graph:
- 2.
- GCN-Based Relational Embedding:
- 3.
- LSTM-Based Temporal Modeling:
- 4.
- Prediction:
3.1. Graph Data Construction
3.2. Graph Convolutional Network for Relation Learning
- is the hidden representation at the -th layer, with
- is the trainable weight matrix.
- is a nonlinear activation function ReLU.
- is the symmetrically normalized adjacency matrix, which ensures numerical stability and avoids scaling issues during aggregation.
3.3. LSTM for Temporal Demand Forecasting
4. Results
4.1. Baselines for Comparison
4.2. Ablation Study
- (1)
- Analysis at the Component Level
- (2)
- Analyzing the Contribution of the Knowledge Graph
5. Conclusions
- Multi-feature data integration: Incorporating demographic data, pharmacy location, and holiday information may capture additional factors influencing demand and further improve prediction accuracy.
- Model interpretability: While the proposed model enhances prediction precision through the integration of knowledge graphs and deep learning, it still lacks interpretability in explaining inter-drug relationships and the mechanisms underlying demand fluctuations.
References
- Bilal, A. I.; Bititci, U. S.; Fenta, T. G. , Challenges and the Way Forward in Demand-Forecasting Practices within the Ethiopian Public Pharmaceutical Supply Chain. Pharmacy 2024, 12, 86. [Google Scholar] [CrossRef]
- Bilal, A. I.; Rostami-Tabar, B.; Hewage, H. H.; Bititci, U. S.; Fenta, T. G. , The missing puzzle piece: contextual insights for enhanced pharmaceutical supply chain forecasting. International Journal of Production Research 2025, 1–19. [Google Scholar] [CrossRef]
- Gers, F. A.; Schmidhuber, J.; Cummins, F. , Learning to forget: Continual prediction with LSTM. Neural computation 2000, 12, 2451–2471. [Google Scholar] [CrossRef]
- Zhao, L.; Song, Y.; Zhang, C.; Liu, Y.; Wang, P.; Lin, T.; Deng, M.; Li, H. , T-GCN: A temporal graph convolutional network for traffic prediction. IEEE transactions on intelligent transportation systems 2019, 21, 3848–3858. [Google Scholar] [CrossRef]
- Shi, Y.; Wang, Y.; Qu, Y.; Chen, Z. , Integrated GCN-LSTM stock prices movement prediction based on knowledge-incorporated graphs construction. International Journal of Machine Learning and Cybernetics 2024, 15, 161–176. [Google Scholar] [CrossRef]
- Yang, C.; Li, S.; Gou, Z. , Spatiotemporal prediction of urban building rooftop photovoltaic potential based on GCN-LSTM. Energy and Buildings 2025, 334, 115522. [Google Scholar] [CrossRef]
- Wei, M.; Wen, M.; Zhang, Y. , A novel spatial electric load forecasting method based on LDTW and GCN. IET Generation, Transmission & Distribution 2024, 18, 491–505. [Google Scholar]
- Wang, X.; Wang, X.; Yin, X.; Li, K.; Wang, L.; Wang, R.; Song, R. , Distributed LSTM-GCN-based spatial–temporal indoor temperature prediction in multizone buildings. IEEE Transactions on Industrial Informatics 2023, 20, 482–491. [Google Scholar] [CrossRef]
- Gilbert, K. , An ARIMA supply chain model. Management Science 2005, 51, 305–310. [Google Scholar] [CrossRef]
- Kareem, Y.; Majeed, A. R. In Monthly Peak-load Demand Forecasting for Sulaimany Governorate Using SARIMA, 2006 IEEE/PES Transmission & Distribution Conference and Exposition: Latin America, 2006; IEEE: 2006; pp 1-5.
- Jian, Y.; Zhu, D.; Zhou, D.; Li, N.; Du, H.; Dong, X.; Fu, X.; Tao, D.; Han, B. , ARIMA model for predicting chronic kidney disease and estimating its economic burden in China. BMC Public Health 2022, 22, 2456. [Google Scholar] [CrossRef] [PubMed]
- Vairagade, N.; Logofatu, D.; Leon, F.; Muharemi, F. In Demand forecasting using random forest and artificial neural network for supply chain management, Computational Collective Intelligence: 11th International Conference, ICCCI 2019, Hendaye, France, September 4–6, 2019, Proceedings, Part I 11, 2019; Springer: 2019; pp 328-339.
- Ji, S.; Wang, X.; Zhao, W.; Guo, D. , An application of a three-stage XGBoost-based model to sales forecasting of a cross-border e-commerce enterprise. Mathematical Problems in Engineering 2019, 2019. [Google Scholar] [CrossRef]
- Yani, L. P. E.; Aamer, A. , Demand forecasting accuracy in the pharmaceutical supply chain: a machine learning approach. International journal of pharmaceutical and healthcare marketing 2023, 17, 1–23. [Google Scholar] [CrossRef]
- Douaioui, K.; Oucheikh, R.; Benmoussa, O.; Mabrouki, C. , Machine Learning and Deep Learning Models for Demand Forecasting in Supply Chain Management: A Critical Review. Applied System Innovation (ASI), 2024; 7. [Google Scholar]
- Yildiz, O.; Sucuoglu, H. S. , Development of Real-Time IoT-Based Air Quality Forecasting System Using Machine Learning Approach. Sustainability 2025, 17, 8531. [Google Scholar] [CrossRef]
- Şen, S. G. , Machine Learning-Based Water Level Forecast in a Dam Reservoir: A Case Study of Karaçomak Dam in the Kızılırmak Basin, Türkiye. Sustainability 2025, 17, 1–23. [Google Scholar]
- Punia, S.; Nikolopoulos, K.; Singh, S. P.; Madaan, J. K.; Litsiou, K. , Deep learning with long short-term memory networks and random forests for demand forecasting in multi-channel retail. International journal of production research 2020, 58, 4964–4979. [Google Scholar] [CrossRef]
- Hou, Z.; Du, Z.; Yang, G.; Yang, Z. , Short-term passenger flow prediction of urban rail transit based on a combined deep learning model. Applied Sciences 2022, 12, 7597. [Google Scholar] [CrossRef]
- Jeon, Y.; Seong, S. , Robust recurrent network model for intermittent time-series forecasting. International Journal of Forecasting 2022, 38, 1415–1425. [Google Scholar] [CrossRef]
- Zha, W.; Liu, Y.; Wan, Y.; Luo, R.; Li, D.; Yang, S.; Xu, Y. , Forecasting monthly gas field production based on the CNN-LSTM model. Energy 2022, 260, 124889. [Google Scholar] [CrossRef]
- Liu, Y.; Du, M.; Ma, X.; Hu, S.; Tuo, Z. , Groundwater Level Prediction Using a Hybrid TCN–Transformer–LSTM Model and Multi-Source Data Fusion: A Case Study of the Kuitun River Basin, Xinjiang. Sustainability 2025, 17, 8544. [Google Scholar] [CrossRef]
- Ashtar, D.; Mohammadi Ziabari, S. S.; Alsahag, A. M. M. , Hybrid Forecasting for Sustainable Electricity Demand in The Netherlands Using SARIMAX, SARIMAX-LSTM, and Sequence-to-Sequence Deep Learning Models. Sustainability 2025, 17, 7192. [Google Scholar] [CrossRef]
- Fourkiotis, K. P.; Tsadiras, A. , Applying machine learning and statistical forecasting methods for enhancing pharmaceutical sales predictions. Forecasting 2024, 6, 170–186. [Google Scholar] [CrossRef]
- Zhu, X.; Ninh, A.; Zhao, H.; Liu, Z. , Demand forecasting with supply-chain information and machine learning: Evidence in the pharmaceutical industry. Production and Operations Management 2021, 30, 3231–3252. [Google Scholar] [CrossRef]
- Rathipriya, R.; Abdul Rahman, A. A.; Dhamodharavadhani, S.; Meero, A.; Yoganandan, G. , Demand forecasting model for time-series pharmaceutical data using shallow and deep neural network model. Neural Computing and Applications 2023, 35, 1945–1957. [Google Scholar] [CrossRef] [PubMed]
- Hapsari, I. , Cluster-specific Bi-LSTM models for improved pharmaceutical sales forecasting. Journal Industrial Servicess (JISS) 2024, 10, 259–270. [Google Scholar] [CrossRef]
- Mbonyinshuti, F.; Nkurunziza, J.; Niyobuhungiro, J.; Kayitare, E. , Health supply chain forecasting: a comparison of ARIMA and LSTM time series models for demand prediction of medicines. Acta Logistica 2024, 11, 269–280. [Google Scholar] [CrossRef]
- Priyadharshini, P.; Keerthana, S.; Devi, G. K. In Optimizing Pharmaceutical Supply Chains using a Hybrid CNN-LSTM Model for Accurate Medicine Demand Forecasting, 2025 3rd International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS), 2025; IEEE: 2025; pp 1525-1532.
- Amiri, B.; Haddadi, A.; Mojdehi, K. F. , A novel hybrid GCN-LSTM algorithm for energy stock price prediction: leveraging temporal dynamics and inter-stock relationships. IEEE Access 2025. [Google Scholar] [CrossRef]
- Zhang, Y.; Xu, S.; Zhang, L.; Jiang, W.; Alam, S.; Xue, D. , Short-term multi-step-ahead sector-based traffic flow prediction based on the attention-enhanced graph convolutional LSTM network (AGC-LSTM). Neural Computing and Applications 2024, 1–20. [Google Scholar] [CrossRef]
- Dara, O. N.; Ibrahim, A. A.; Mohammed, T. A. , Advancing medical imaging: detecting polypharmacy and adverse drug effects with Graph Convolutional Networks (GCN). BMC Medical Imaging 2024, 24, 174. [Google Scholar] [CrossRef] [PubMed]
- Gustriansyah, R. , Retail sales dataset of a pharmacy in Indonesia. In V1 ed.; Data, M., Ed. 2022.
- Knox, C.; Wilson, M.; Klinger, C. M.; Franklin, M.; Oler, E.; Wilson, A.; Pon, A.; Cox, J.; Chin, N. E.; Strawbridge, S. A. , DrugBank 6.0: the DrugBank knowledgebase for 2024. Nucleic acids research 2024, 52, (D1), D1265–D1275. [Google Scholar] [CrossRef]
- Hyndman, R. J.; Khandakar, Y. , Automatic time series forecasting: the forecast package for R. Journal of statistical software 2008, 27, 1–22. [Google Scholar] [CrossRef]
- Aghelpour, P.; Mohammadi, B.; Biazar, S. M. , Long-term monthly average temperature forecasting in some climate types of Iran, using the models SARIMA, SVR, and SVR-FA. Theoretical and Applied Climatology 2019, 138, 1471–1480. [Google Scholar] [CrossRef]
- Abbasi, R. A.; Javaid, N.; Ghuman, M. N. J.; Khan, Z. A.; Ur Rehman, S. ; Amanullah In Short term load forecasting using XGBoost, Workshops of the International Conference on Advanced Information Networking and Applications, 2019; Springer: 2019; pp 1120-1131.
- Tokgöz, A.; Ünal, G. In A RNN based time series approach for forecasting turkish electricity load, 2018 26th Signal processing and communications applications conference (SIU), 2018; IEEE: 2018; pp 1-4.
- Oreshkin, B. N.; Carpov, D.; Chapados, N.; Bengio, Y. , N-BEATS: Neural basis expansion analysis for interpretable time series forecasting. arXiv preprint arXiv:1905.10437, arXiv:1905.10437 2019.







| Model | Description | Key parameters |
|---|---|---|
| auto-ARIMA | ARIMA is a classical time series forecasting model that captures linear dependencies through autoregressive, differencing, and moving average components. | The parameters p, d, and q are selected by minimizing the AIC, with up to 100 iterations. |
| SVR | SVR is a machine learning regression method based on support vector machines, suitable for small-scale and nonlinear data. | Kernel = RBF, C=100, gamma=0.01, epsilon = 0.1 |
| XGBoost | XGBoost is a powerful gradient boosting framework using decision trees, known for its accuracy and handling of complex non-linear patterns. | max_depth=6, eta=0.1, subsample=0.8, n_estimators=100 |
| RNN | The RNN baseline models sequential dependencies using recurrent hidden states, making it suitable for time series forecasting. | hidden_size=64, num_layers=1, dropout=0.2, |
| CNN-LSTM | CNN-LSTM integrates CNN for short-term feature extraction and LSTM for long-term sequence modeling. | CNN kernel size = 3, filters = 64, LSTM hidden_size=64, dropout=0.2, |
| NBEATS (Neural Basis Expansion Analysis for Interpretable Time Series) |
N-BEATS is a deep learning architecture for time series forecasting that employs backward and forward residual blocks to capture both trend and seasonal components in a fully interpretable manner. | number of stacks = 3, block layers = 2, hidden units = 512, dropout = 0.2, batch size = 256 |
| KG-GCN-LSTM (Proposed Model) | KG-GCN-LSTM combines graph convolutional networks to model relational dependencies between drugs and LSTM to capture temporal dynamics. | GCN: layers = 2, hidden dimension = 64, LSTM:hidden_size=64, dropout=0.2, |
| Model | MAE | RMSE | SMAPE (%) |
|---|---|---|---|
| AutoARIMA | 67.8769 | 79.0307 | 10.63% |
| SVR | 71.1110 | 81.1767 | 11.02% |
| XGBoost | 68.1974 | 77.6257 | 10.56% |
| RNN | 67.3514 | 80.8032 | 10.68% |
| CNN-LSTM | 82.7919 | 96.0930 | 12.60% |
| NBEATS | 78.2468 | 96.3988 | 11.86% |
| KG-GCN-LSTM | 53.9080 | 65.4058 | 8.24% |
| Model | MAE | RMSE | SMAPE (%) |
| KG-GCN-MLP | 65.2841 | 80.7454 | 10.32% |
| LSTM | 72.6728 | 84.4461 | 11.21% |
| GCN-LSTM | 61.6119 | 72.4715 | 9.85% |
| KG-GCN-LSTM | 53.9080 | 65.4058 | 8.24% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).