Submitted:
21 April 2026
Posted:
22 April 2026
You are already at the latest version
Abstract

Keywords:
1. Introduction
2. Materials and Methods
2.1. Study Area and Data
2.2. Model Framework: Hybrid Forecasting System
2.3. Muskingum Method
2.4. Machine Learning Models
2.5. Feature Engineering and Model Training
2.6. Rolling Forecasting Method
- Initialization: At the initial time step of the test dataset, historical observations of upstream inflow and downstream discharge were available.
- Multi-step rolling procedure: The s-th forecasting (s = 1, 2, …, 6) corresponding to lead times of 4–24 h was utilized as an example.
- Physical baseline routing: The calibrated Muskingum model was first applied using upstream inflows reaching the current time to generate the baseline discharge Q(musk)(t). In operational practice, these observed inflows should be replaced with upstream inflow forecasts. The downstream discharge sequence was initialized using observations and subsequently updated with the prediction results from the previous step.
- Feature construction: Based on all information available at the current time step, a multidimensional feature vector was constructed following the same feature engineering rules used during training.
- Residual forecasting: After standardization, the constructed features were input into the trained machine learning models to obtain the predicted residual .
- Integrated output: The final discharge forecasting was determined by .
- State update: The predicted discharge at the current time step was subsequently stored as an observation for computing lagged features at the next time step, and the residual history sequence was updated accordingly. The procedure was iteratively repeated until the end of the test dataset.
3. Results
3.1. Muskingum Method Baseline Model
3.2. Comparative Analysis of Residual Forecasting
3.3. Comprehensive Performance Evaluation of Models
3.3.1. Overall Accuracy and Error Analysis
3.3.2. Comparison of Hydrological Feature Forecasting Capability
3.4. Optimal Model Analysis
| Lead time | Optimal model | Muskingum NSE | Optimal model NSE | Muskingum RMSE (m3/s) | Optimal model RMSE (m3/s) | Muskingum PE (%) | Optimal model PE (%) |
|---|---|---|---|---|---|---|---|
| 4 h | Ridge | 0.567 | 0.977 | 183.94 | 51.04 | −15.55 | −3.4 |
| 8 h | Ridge | 0.567 | 0.954 | 183.94 | 71.9 | −15.55 | −4.8 |
| 12 h | Ridge | 0.567 | 0.94 | 183.94 | 82.3 | −15.55 | −4.52 |
| 16 h | Lasso | 0.567 | 0.932 | 183.94 | 87.37 | −15.55 | −2.06 |
| 20 h | Lasso | 0.567 | 0.924 | 183.94 | 91.58 | −15.55 | −3.94 |
| 24 h | Lasso | 0.567 | 0.911 | 183.94 | 97.97 | −15.55 | −3.68 |
4. Discussion
4.1. Comparison with Other Decision Methods
4.2. Research Advantages and Implications
4.3. Limitations and Future Directions
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| PCHIP | Piecewise Cubic Hermite Interpolating Polynomial |
| L-BFGS-B | Limited-memory Broyden–Fletcher–Goldfarb–Shanno with Bound Constraints |
| LSTM | Long Short-Term Memory |
| RF | Random Forest |
| NSE | Nash–Sutcliffe Efficiency |
| RMSE | Root Mean Square Error |
| PE | Peak Error |
| ETP | Error in Time to Peak |
References
- Portalés-Julià, E.; Mateo-García, G.; Purcell, C.; Gómez-Chova, L. Global flood extent segmentation in optical satellite images. Scientific reports 2023, 13(1), 20316. [Google Scholar] [CrossRef]
- Meresa, H.; Tischbein, B.; Mekonnen, T. Climate change impact on extreme precipitation and peak flood magnitude and frequency: observations from CMIP6 and hydrological models. Natural Hazards 2022, 111(3), 2649–2679. [Google Scholar] [CrossRef]
- Nevo, S.; Morin, E.; Gerzi Rosenthal, A.; Metzger, A.; Barshai, C.; Weitzner, D.; Voloshin, D.; Kratzert, F.; Elidan, G.; Dror, G. Flood forecasting with machine learning models in an operational framework. Hydrology and Earth System Sciences 2022, 26(15), 4013–4032. [Google Scholar] [CrossRef]
- Salvati, A.; Moghaddam Nia, A.; Salajegheh, A.; Shirzadi, A.; Shahabi, H.; Ahmadisharaf, E.; Han, D.; Clague, J. J. A systematic review of Muskingum flood routing techniques. Hydrological Sciences Journal 2024, 69(6), 810–831. [Google Scholar] [CrossRef]
- Bindas, T.; Tsai, W. P.; Liu, J.; Rahmani, F.; Feng, D.; Bian, Y.; Lawson, K.; Shen, C. Improving river routing using a differentiable Muskingum-Cunge model and physics-informed machine learning. Water Resources Research 2024, 60(1), e2023WR035337. [Google Scholar] [CrossRef]
- Yang, D.; Yan, B.; Gu, D.; Chang, J.; Du, S. A hybrid deep learning-Muskingum framework for enhanced runoff prediction: Model coupling and hydrological process integration. Journal of Hydrology: Regional Studies 2026, 63, 103077. [Google Scholar] [CrossRef]
- Gauch, M.; Kratzert, F.; Klotz, D.; Nearing, G.; Lin, J.; Hochreiter, S. Rainfall–runoff prediction at multiple timescales with a single Long Short-Term Memory network. Hydrology and Earth System Sciences 2021, 25(4), 2045–2062. [Google Scholar] [CrossRef]
- Klotz, D.; Kratzert, F.; Gauch, M.; Keefe Sampson, A.; Brandstetter, J.; Klambauer, G.; Hochreiter, S.; Nearing, G. Uncertainty estimation with deep learning for rainfall–runoff modeling. Hydrology and Earth System Sciences 2022, 26(6), 1673–1693. [Google Scholar] [CrossRef]
- Akbari, R.; Hessami-Kermani, M.-R.; Shojaee, S. Flood routing: improving outflow using a new non-linear Muskingum model with four variable parameters coupled with PSO-GA algorithm. Water Resources Management 2020, 34(10), 3291–3316. [Google Scholar] [CrossRef]
- Lin, Y.; Wang, D.; Wang, G.; Qiu, J.; Long, K.; Du, Y.; Xie, H.; Wei, Z.; Shangguan, W.; Dai, Y. A hybrid deep learning algorithm and its application to streamflow prediction. Journal of Hydrology 2021, 601, 126636. [Google Scholar] [CrossRef]
- Li, W.; Liu, C.; Xu, Y.; Niu, C.; Li, R.; Li, M.; Hu, C.; Tian, L. An interpretable hybrid deep learning model for flood forecasting based on Transformer and LSTM. Journal of Hydrology: Regional Studies 2024, 54, 101873. [Google Scholar] [CrossRef]
- Xiang, Z.; Yan, J.; Demir, I. A rainfall-runoff model with LSTM-based sequence-to-sequence learning. Water resources research 2020, 56(1), e2019WR025326. [Google Scholar] [CrossRef]
- Zhang, Z.; Xiao, Y.; Chen, R.; Long, K.; Deng, H.; Zheng, Z.; Liao, J. Comparative assessment of machine learning models for daily streamflow prediction in a subtropical monsoon watershed. Scientific Reports 2026. [Google Scholar] [CrossRef]
- Fang, G.; Li, Z.; Yang, J.; Chen, Y.; Duan, W.; Amory, C.; Wang, Y. Changes in flooding in the alpine catchments of the Tarim River Basin, Central Asia. Journal of Flood Risk Management 2023, 16(1), e12869. [Google Scholar] [CrossRef]
- Qianjuan, S.; Hongbo, L.; Yong, L.; Guangpeng, Z.; Lianchao, Z.; Aihua, L. Extreme climate and human interventions driving flood dynamics in arid inland rivers: regional and global perspectives. Journal of Environmental Management 2025, 395, 127788. [Google Scholar] [CrossRef]
- Fritsch, F. N.; Carlson, R. E. Monotone piecewise cubic interpolation. SIAM Journal on Numerical Analysis 1980, 17(2), 238–246. [Google Scholar] [CrossRef]
- Frame, J. M.; Kratzert, F.; Raney, A.; Rahman, M.; Salas, F. R.; Nearing, G. S. Post-processing the national water model with long short-term memory networks for streamflow predictions and model diagnostics. JAWRA Journal of the American Water Resources Association 2021, 57(6), 885–905. [Google Scholar] [CrossRef]
- Farzin, S.; Singh, V. P.; Karami, H.; Farahani, N.; Ehteram, M.; Kisi, O.; Allawi, M. F.; Mohd, N. S.; El-Shafie, A. Flood routing in river reaches using a three-parameter Muskingum model coupled with an improved bat algorithm. Water 2018, 10(9), 1130. [Google Scholar] [CrossRef]
- Byrd, R. H.; Lu, P.; Nocedal, J.; Zhu, C. A limited memory algorithm for bound constrained optimization. SIAM Journal on scientific computing 1995, 16(5), 1190–1208. [Google Scholar] [CrossRef]
- Khorram, S.; Jehbez, N. Improving the streamflow prediction accuracy in sparse data regions: a fresh perspective on integrated hydrological-hydrodynamic and hybrid machine learning models. Engineering Applications of Computational Fluid Mechanics 2024, 18(1), 2387051. [Google Scholar] [CrossRef]
- Kumar, V.; Kedam, N.; Sharma, K. V.; Mehta, D. J.; Caloiero, T. Advanced machine learning techniques to improve hydrological prediction: A comparative analysis of streamflow prediction models. Water 2023, 15(14), 2572. [Google Scholar] [CrossRef]
- Wahba, M.; Essam, R.; El-Rawy, M.; Al-Arifi, N.; Abdalla, F.; Elsadek, W. M. Forecasting of flash flood susceptibility mapping using random forest regression model and geographic information systems. Heliyon 2024, 10(13). [Google Scholar] [CrossRef] [PubMed]
- Kumar, V.; Azamathulla, H. M.; Sharma, K. V.; Mehta, D. J.; Maharaj, K. T. The state of the art in deep learning applications, challenges, and future prospects: A comprehensive review of flood forecasting and management. Sustainability 2023, 15(13), 10543. [Google Scholar] [CrossRef]
- Galelli, S.; Castelletti, A. Tree-based iterative input variable selection for hydrological modeling. Water Resources Research 2013, 49(7), 4295–4310. [Google Scholar] [CrossRef]
- Talei, A.; Chua, L. H. Influence of lag time on event-based rainfall–runoff modeling using the data driven approach. Journal of hydrology 2012, 438, 223–233. [Google Scholar] [CrossRef]
- Liang, Z.; Tang, T.; Li, B.; Liu, T.; Wang, J.; Hu, Y. Long-term streamflow forecasting using SWAT through the integration of the random forests precipitation generator: case study of Danjiangkou Reservoir. Hydrology Research 2018, 49(5), 1513–1527. [Google Scholar] [CrossRef]
- He, M.; Xu, X.; Wu, S.; Kang, C.; Huang, B. Multi-step ahead forecasting of daily streamflow based on the transform-based deep learning model under different scenarios. Scientific Reports 2025, 15(1), 5451. [Google Scholar] [CrossRef]






| Station Name | Xiaota | Yimapashe | Alar |
|---|---|---|---|
| Start Time | 2021/1/1 8:00 | 2021/1/1 8:00 | 2021/1/1 8:00 |
| End Time | 2025/10/13 16:00 | 2025/10/13 16:00 | 2025/10/13 16:00 |
| Total Days | 1747 | 1747 | 1747 |
| 4-hour Data (days) | 811 | 812 | 829 |
| 4-hour Proportion (%) | 46.42 | 46.48 | 47.45 |
| Daily Data (days) | 932 | 932 | 915 |
| Daily Proportion (%) | 53.35 | 53.35 | 52.38 |
| Completely Missing Days | 4 | 3 | 3 |
| Event ID | K1_local | x1_local | K2_local | x2_local | RMSE | NSE | PE |
|---|---|---|---|---|---|---|---|
| 2021070916 | 7.98 | 0.01 | 44.93 | 0.49 | 38.96 | 0.77 | −6.61 |
| 2021071916 | 17.21 | 0.49 | 4.00 | 0.01 | 100.66 | 0.83 | −2.57 |
| 2022052908 | 24.10 | 0.30 | 72.00 | 0.01 | 32.83 | 0.88 | −2.05 |
| 2022061008 | 13.49 | 0.04 | 22.85 | 0.01 | 48.54 | 0.82 | −7.17 |
| 2022062700 | 18.74 | 0.26 | 20.86 | 0.01 | 72.06 | 0.92 | −10.35 |
| 2022091004 | 37.45 | 0.11 | 72.00 | 0.01 | 6.23 | 0.89 | −1.49 |
| 2023071712 | 72.00 | 0.02 | 34.45 | 0.45 | 30.25 | 0.03 | −7.12 |
| 2023072412 | 20.09 | 0.01 | 9.38 | 0.09 | 73.94 | 0.77 | −8.13 |
| 2023082312 | 4.00 | 0.01 | 4.00 | 0.01 | 75.64 | 0.63 | −1.29 |
| 2023082912 | 30.94 | 0.31 | 8.65 | 0.49 | 13.26 | 0.92 | 0.42 |
| 2023090120 | 36.47 | 0.26 | 37.21 | 0.49 | 16.69 | 0.94 | −2.3 |
| 2024071008 | 15.61 | 0.01 | 4.37 | 0.49 | 47.09 | 0.77 | 6.08 |
| 2024071620 | 14.70 | 0.37 | 7.71 | 0.01 | 110.51 | 0.82 | −1.45 |
| 2024091016 | 20.36 | 0.22 | 17.27 | 0.01 | 15.08 | −0.20 | −2.44 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.