Submitted:
22 June 2026
Posted:
23 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methods
2.1. Multimodel Partition Filter (MMPF)
2.2. MMPF – NARX
2.3. GARA
- μt = the mutation rate at generation t
- μ0 = the initial mutation rate
- tmax = the maximum number of generations
- = the residual mutation term, ensuring minimal diversity near convergence
2.4. PSO
- ω = the inertia weight
- c1, c2 = the acceleration coefficients
- r1,r2 ~ U(0,1).
- = the personal best position of particle
- = the global best position of the swarm
2.5. Data Preparation
- Moderate correlated noise: correlation coefficient , noise ratio = 0.25
- Heavy correlated noise: correlation coefficient , noise ratio = 0.50
3. Results
4. Discussion
4.1. Comparative Evaluation of Optimization Strategies
- The overall forecast accuracy regarding the noise level and the number of submodels implemented has risen from 2.4% to 14.8%. This is important in terms of % MAPE for the PSO over GARA.
- Similarly, in the “Heavy Noise” optimal configuration (i.e., in Table 1 for N=9), the PSO achieved a reduced 2.4% forecasting, indicating an accuracy improvement.
4.2. Non-Monotonic Relationship Between Filter Complexity and Accuracy
4.3. Robustness and Practical Considerations
4.4. Error Scatter Analysis and Symmetric Distribution Characteristics
-
PSO precision and concentration:
- This symmetric spread is smaller compared to GARA.
- PSO particles tend to move, “swarm” quickly toward the global best solution.
- This is happening because the particles follow their own experience as well as the group’s.
- This swarm movement keeps forecasts close to each other and reduces variation in errors.
-
GARA Balance Diversity.
- They appear more symmetrical balanced atop the horizontal axis than the PSO ones.
- The mutation process creates diversity of the Genetic Algorithm and prevents its collapse into one solution.
- As a result, the errors are more uniform or evenly symmetric spread.
- Generally, they have bigger absolute values than the respective PSO ones, although over time they are less systematically biased.
- Using this feature can help you avoid cumulative bias in longer-term operational horizons.
4.5. Optimization Dynamics and Convergence Behavior
-
PSO Smoothness and Stability
- Smoother and more stable convergence curves toward the global minimum.
- The error decreases more consistently.
- PSO updates particles using velocity and momentum [56], as described earlier.
- Each particle move toward their own best and global best position, causing the swarm to move smoothly.
- This prevents considerable fluctuation of the pattern.
-
GARA Oscillatory
- Between the 10th generation and the 30th generation, the way is more oscillatory.
- he peaks indicate that some errors rise sharply on occasion but fall afterwards.
- The Genetic Algorithm possesses a stochastic nature for this reason [57].
- The addition of a mutation operator assures diversity and avoids convergence to any one region for any long time, which helps in exploring better solutions in the search space. Also, it creates spikes in performance during training.
4.6. Computational Burden
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A

| Ref | Approach | Main Concept | Strength | Limitation / Gap |
| [1,2] | ARIMA / ARIMA + PSO | Linear time-series forecasting with parameter tuning | Clear structure, easy implementation | Limited performance under nonlinear demand |
| [8] | Random Forests, Gradient Boosting, RNN | Machine learning for nonlinear load patterns | Superior to classical ARIMA | Higher computational cost |
| [9] | Ensemble probabilistic forecasting | A combination of multiple forecasts | Improved scalability and precision | Model management complexity |
| [11] | Quantile Regression + KDE | Full probability distribution modeling | Better risk evaluation | Requires reliable density estimation |
| [14] | Hybrid with economic indicators | Use of early warning and industrial metrics | Improved performance near turning points | Depends on external data quality |
| [16] | Conditional hidden semi-Markov models | State-duration-based load modeling | Captures dynamic operating states | Sensitive to preprocessing |
| [18] | Gaussian filtering of smart meter data | Noise reduction before training | Cleaner input data | Additional preprocessing step |
| [19,20,21] | PSO + ANN | Swarm optimization for neural networks | Higher short-term accuracy | Possible local stagnation |
| [21,22] | GA-based MMPF weight optimization | Evolutionary adaptive filtering | Strong global search | Oscillatory convergence behavior |
| [22,23,24] | PSO + SVM | Swarm-based SVM tuning | Improved generalization | Increased training time |
| [25] | PSO + Grey / Wavelet models | Hybrid nonlinear forecasting | Better nonlinear handling | Layered model complexity |
| [26] | PSO for long-term forecasting | Swarm tuning for extended horizon | Stable convergence | Parameter sensitivity |
| [27] | PSO in power system parameter estimation | Optimization of system cost functions | Efficient high-dimensional search | Initialization sensitivity |
| [28] | PSO theoretical analysis | Exploration-exploitation balance | Simple structure | Trade-off not fully eliminated |
| [38,39] | Bias-variance analysis | Theoretical explanation of trade-off | Conceptual clarity | Does not prescribe the optimal method |
| [61] | ANN + Regression + SVR (Fuzzy logic) | A hybrid combination of statistical and AI predictors | Better than standalone models | Increased model complexity |
| [62] | Hybrid with intelligent optimization | Optimization for weight tuning and feature selection | Improved accuracy | Depends on optimizer efficiency |
References
- Marrero, L.; Garcia-Santander, L.; Carrizo, D.; Ulloa, F. An application of load forecasting based on ARIMA models and particle swarm optimization. In Proceedings of the 2019 11th International Symposium on Advanced Topics in Electrical Engineering (ATEE), 2019. [Google Scholar] [CrossRef]
- Xiao, L.; Xiao, L. Combined modeling for electrical load forecasting with particle swarm optimization. In Proceedings of the 2014 IEEE Workshop on Electronics, Computer and Applications, 2014; pp. 395–400. [Google Scholar] [CrossRef]
- Shiblee, M.F.H.; Koukaras, P. Short-term load forecasting in the Greek power distribution system: A comparative study of gradient boosting and deep learning models. Energies 2025, 18, 5060. [Google Scholar] [CrossRef]
- Gong, J.; Qu, Z.; Zhu, Z.; Xu, H.; Yang, Q. Ensemble models of TCN-LSTM-LightGBM based on ensemble learning methods for short-term electrical load forecasting. Energy 2025, 318, 134757. [Google Scholar] [CrossRef]
- Dakheel, F.; Çevik, M. Optimizing smart grid load forecasting via a hybrid long short-term memory-XGBoost framework: Enhancing accuracy, robustness, and energy management. Energies 2025, 18, 2842. [Google Scholar] [CrossRef]
- Li, K.; Yuan, L.; Qian, F.; Song, L.; Wu, X.; Wang, L.; Dai, J.; Shen, L. Short-Term Load Forecasting for Electricity Spot Markets Across Different Seasons Based on a Hybrid VMD-LSTM-Random Forest Model. Energies 2025, 18, 6097. [Google Scholar] [CrossRef]
- Rani, S.; Ahmed, U.; Mahmood, A.; Manzoor, S.; Razzaq, S.; Jamil, U.; Zia, M.F.; Benbouzid, M. Dual Attention-Empowered Bidirectional Long Short-Term Memory Network for Short-Term Electric Load Forecasting. Energy Technol. 2026, 14, e202500914. [Google Scholar] [CrossRef]
- Prashanthi, P.; Priyadarsini, K. A comparative study of the performance of machine learning based load forecasting methods. In Proceedings of the 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), 2021; pp. 132–136. [Google Scholar] [CrossRef]
- Landgraf, A.J. An ensemble approach to GEFCom2017 probabilistic load forecasting. Int. J. Forecast. 2019, 35, 1432–1438. [Google Scholar] [CrossRef]
- Gupta, T.; Bhatia, R.; Sharma, S. An ensemble-based enhanced short and medium term load forecasting using optimized missing value imputation. Sci. Rep. 2025, 15, 21857. [Google Scholar] [CrossRef]
- Wang, S.; Wang, S.; Wang, D. Combined probability density model for medium term load forecasting based on quantile regression and kernel density estimation. Energy Procedia 2019, 158, 6446–6451. [Google Scholar] [CrossRef]
- Liu, Y.; Mei, F.; Zhang, J.; Dai, X.; Li, W. Power Load Probabilistic Prediction Based on Multi-Value Quantile Regression and Timing Fusion Ensemble Learning Model. Entropy 2026, 28, 329. [Google Scholar] [CrossRef] [PubMed]
- Shah, S.A.H.; Ahmed, U.; Bilal, M.; Khan, A.R.; Razzaq, S.; Aziz, I.; Mahmood, A. Improved electric load forecasting using quantile long short-term memory network with dual attention mechanism. Energy Rep. 2025, 13, 2343–2353. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, Z.; Li, X.; Wang, Z.; Liu, R.; Tong, X. Medium and long-term load intelligent forecasting method based on the comprehensive and multiple early warning indicators. J. Phys. Conf. Ser. 2020, 1449, 012124. [Google Scholar] [CrossRef]
- Wang, X.; Wang, Z.; Yang, K.; Song, Z.; Bian, C.; Feng, J.; Deng, C. A survey on deep learning for cellular traffic prediction. Intell. Comput. 2024, 3, 0054. [Google Scholar] [CrossRef]
- Ji, Y.; Buechler, E.; Rajagopal, R. Data-driven load modeling and forecasting of residential appliances. In Proceedings of the 2021 IEEE Power & Energy Society General Meeting (PESGM), 2021; pp. 1–1. [Google Scholar] [CrossRef]
- Wu, Z.; Wang, C.; Wu, J.; Wang, X.; Li, M.; Dong, Y.; Zhu, H.; Zhang, H. Dynamic adaptive modeling for non-intrusive load monitoring with unknown loads. Energy Build. 2025, 329, 115246. [Google Scholar] [CrossRef]
- Rai, S.; De, M. Analysis of classical and machine learning based short-term and mid-term load forecasting for smart grid. Int. J. Sustain. Energy 2021, 40, 821–839. [Google Scholar] [CrossRef]
- Abdullah, A.G.; Suranegara, G.M.; Hakim, D.L. Hybrid PSO-ANN Application for Improved Accuracy of Short Term Load Forecasting. WSEAS Trans. POWER Syst. 2014, 9, 446–451. Available online: https://wseas.com/journals/articles.php?id=5330 (accessed on 1 June 2026).
- Shafiei Chafi, Z.; Afrakhte, H. Short-term load forecasting using neural network and particle swarm optimization (PSO) algorithm. Math. Probl. Eng. 2021, 2021, 1–10. [Google Scholar] [CrossRef]
- Ozerdem, O.C.; Olaniyi, E.O.; Oyedotun, O.K. Short term load forecasting using particle swarm optimization neural network. Procedia Comput. Sci. 2017, 120, 382–393. [Google Scholar] [CrossRef]
- Jiang, D. Study on short-term load forecasting method based on the PSO and SVM model. Int. J. Control Autom. 2015, 8, 181–188. [Google Scholar] [CrossRef]
- Ji, G.; Li, S.; Shi, Z.; Zhang, X.; Zhao, W. Regional power load forecasting based on PSOSVM. In Proceedings of the 2018 IEEE 4th Information Technology and Mechatronics Engineering Conference (ITOEC), 2018; pp. 1685–1688. [Google Scholar] [CrossRef]
- Ren, G.; Wen, S.; Yan, Z.; Hu, R.; Zeng, Z.; Cao, Y. Power load forecasting based on support vector machine and particle swarm optimization. In Proceedings of the 2016 12th World Congress on Intelligent Control and Automation (WCICA), 2016; pp. 2003–2008. [Google Scholar] [CrossRef]
- Bahrami, S.; Hooshmand, R.; Parastegari, M. Short term electric load forecasting by wavelet transform and grey model improved by PSO (particle swarm optimization) algorithm. Energy 2014, 72, 434–442. [Google Scholar] [CrossRef]
- AlRashidi, M.; EL-Naggar, K. Long term electric load forecasting based on particle swarm optimization. Appl. Energy 2010, 87, 320–326. [Google Scholar] [CrossRef]
- Alrashidi, M.R.; El-Naggar, K.M.; Al-Othman, A.K. Particle swarm optimization based approach for estimating the fuel-cost function parameters of thermal power plants with valve loading effects. Electr. Power Compon. Syst. 2009, 37, 1219–1230. [Google Scholar] [CrossRef]
- Coello Coello, C.A.; Reyes-Sierra, M. Multi-objective particle swarm optimizers: A survey of the state-of-the-Art. Int. J. Comput. Intell. Res. 2006, 2. Available online: https://www.researchgate.net/publication/216301306_Multi-Objective_Particle_Swarm_Optimizers_A_Survey_of_the_State-of-the-Art (accessed on 1 June 2026). [CrossRef]
- Beligiannis, G.; Skarlas, L.; Likothanassis, S. A generic applied evolutionary hybrid technique. IEEE Signal Process. Mag. 2004, 23, 28–38. [Google Scholar] [CrossRef]
- Katsikas, S.; Likothanassis, S.; Beligiannis, G.; Berkeris, K.; Fotakis, D. Genetically determined variable structure multiple model estimation. IEEE Trans. Signal Process. 2001, 49, 2253–2261. [Google Scholar] [CrossRef]
- Mustaffa, Z.; Sulaiman, M.H. Advanced forecasting of building energy loads with XGBoost and metaheuristic algorithms integration. Energy Storage Sav. 2025. [Google Scholar] [CrossRef]
- Camacho-Villalón, C.L.; Stützle, T.; Dorigo, M. Designing new metaheuristics: manual versus automatic approaches. Intell. Comput. 2023, 2, 0048. [Google Scholar] [CrossRef]
- Parvathareddy, S.; Yahya, A.; Amuhaya, L.; Samikannu, R.; Suglo, R.S. A hybrid machine learning and optimization framework for energy forecasting and management. Results Eng. 2025, 26, 105425. [Google Scholar] [CrossRef]
- Saber, M.; Ouahi, M.; Motahhir, S.; El Akchioui, N. Neural Network-Aided Linear Parameter-Varying Modeling and Finite-Time Control for Autonomous Driving. Intell. Comput. 2026, 5, 0227. [Google Scholar] [CrossRef]
- Hammam, I.M.; El-Kharbotly, A.K.; Sadek, Y.M. Adaptive demand forecasting framework with weighted ensemble of regression and machine learning models along life cycle variability. Sci. Rep. 2025, 15, 38482. [Google Scholar] [CrossRef] [PubMed]
- Avinash, G.; Mishra, S. Bayesian model averaging based deep learning forecasts of inpatient bed occupancy in mental health facilities. Sci. Rep. 2025, 15, 38294. [Google Scholar] [CrossRef] [PubMed]
- Pappas, S.; Ekonomou, L.; Karampelas, P.; Katsikas, S.; Liatsis, P. Modeling of the grounding resistance variation using ARMA models. Simul. Model. Pract. Theory 2008, 16, 560–570. [Google Scholar] [CrossRef]
- Pappas, S.; Ekonomou, L.; Karamousantas, D.; Chatzarakis, G.; Katsikas, S.; Liatsis, P. Electricity demand loads modeling using autoregressive moving average (ARMA) models. Energy 2008, 33, 1353–1360. [Google Scholar] [CrossRef]
- Pappas, S.S.; Karamousantas, D.C.; Chatzarakis, G.E.; Pappas, C.S. A New Hybrid Forecasting Strategy Applied to Mean Hourly Wind Speed Time Series. J. Wind Energy 2014, 2014, 683939. [Google Scholar] [CrossRef]
- Pappas, S.S.; Ekonomou, L.; Moussas, V.C.; Karampelas, P.; Katsikas, S.K. Adaptive load forecasting of the Hellenic electric grid. J. Zhejiang Univ.-Sci. A 2008, 9, 1724–1730. [Google Scholar] [CrossRef]
- Pappas, S.S.; Adam, S. Prediction of the long-term electrical energy consumption in Greece using adaptive algorithms. WSEAS Trans. Power Syst. 2018, 13, 291–299. [Google Scholar]
- Pappas, S. Application and comparison of evolutionary techniques for forecasting the Hellenic grid electricity load. Int. J. Power Energy Res. 2017, 1, 139–149. [Google Scholar] [CrossRef]
- Sakib, M.; Siddiqui, T.; Mustajab, S.; Alotaibi, R.M.; Alshareef, N.M.; Khan, M.Z. An ensemble deep learning framework for energy demand forecasting using genetic algorithm-based feature selection. PLoS ONE 2025, 20, e0310465. [Google Scholar] [CrossRef] [PubMed]
- Tsoulos, I.G.; Charilogis, V.; Tsalikakis, D. Introducing a new genetic operator based on differential evolution for the effective training of neural networks. Computers 2025, 14, 125. [Google Scholar] [CrossRef]
- Montgomery, D.C. Introduction to statistical quality control; John Wiley & Sons, 2019; Available online: https://www.wiley.com/en-us/shop/general-introductory-industrial-engineering/introduction-to-statistical-quality-control-8th-edition-p-9781119399308 (accessed on 1 June 2026).
- Zhang, Z.; Xu, W.; Gong, Q. Short-term power load forecasting based on particle swarm optimization long short-term memory neural network. In Proceedings of the 2023 IEEE 2nd International Conference on Electrical Engineering, Big Data and Algorithms (EEBDA), 2023; pp. 412–416. [Google Scholar] [CrossRef]
- Zhou, F.; Zhou, H.; Li, Z.; Zhao, K. Multi-step ahead short-term electricity load forecasting using VMD-TCN and error correction strategy. Energies 2022, 15, 5375. [Google Scholar] [CrossRef]
- Valentini, G.; Dietterich, T. Bias-Variance Analysis of Support Vector Machines for the Development of SVM-Based Ensemble Methods. J. Mach. Learn. Res. 2004, 5, 725–775. [Google Scholar] [CrossRef] [PubMed]
- Geman, S.; Bienenstock, E.; Doursat, R. Neural networks and the bias/Variance dilemma. Neural Comput. 1992, 4, 1–58. [Google Scholar] [CrossRef]
- Hosseini, E.; Møller, J.K.; Banaei, M.; Ebrahimy, R. Optimal weights and Heuristic Meta-Models: Learning-based multi-model strategies for energy forecasting. Array 2025, 100550. [Google Scholar] [CrossRef]
- Wu, C.; Zhang, Z. Surfing information: The challenge of intelligent decision-making. Intell. Comput. 2023, 2, 0041. [Google Scholar] [CrossRef]
- Wang, S.; Nikfar, M.; Agar, J.C.; Liu, Y. Stacked Deep Learning Models for Fast Approximations of Steady-State Navier–Stokes Equations for Low Re Flow. Intell. Comput. 2024, 3, 0093. [Google Scholar] [CrossRef]
- Tindall, J.R.; Rogage, K.; Doukari, O. Optimising deep learning for smart building energy prediction: A particle swarm approach with real-world validation. J. Build. Eng. 2025, 114652. [Google Scholar] [CrossRef]
- Lengler, J.; Offermann, T. Diversity-Preserving Exploitation of Crossover. In Proceedings of the 18th ACM/SIGEVO Conference on Foundations of Genetic Algorithms; 2025; pp. 154–165. [Google Scholar] [CrossRef]
- Khan, S.; Mazhar, T.; Shahzad, T.; Ali, T.; Ayaz, M.; Ghadi, Y.Y.; Aggoune, E.H.M.; Hamam, H. Optimizing load demand forecasting in educational buildings using quantum-inspired particle swarm optimization (QPSO) with recurrent neural networks (RNNs): a seasonal approach. Sci. Rep. 2025, 15, 19349. [Google Scholar] [CrossRef] [PubMed]
- Yang, B.; Zhang, Q.L. Parallelizing a modified particle swarm optimizer (PSO). Adv. Mater. Res. 2010, 163-167, 2404–2409. [Google Scholar] [CrossRef]
- Goldberg, D.E. Genetic algorithms in search, optimization, and machine learning; Addison-Wesley Professional, 1989; Available online: https://dl.acm.org/doi/10.5555/534133 (accessed on 1 June 2026).
- Slowik, A.; Kwasnicka, H. Evolutionary algorithms and their applications to engineering problems. Neural Comput. Appl. 2020, 32, 12363–12379. [Google Scholar] [CrossRef]
- Osaba, E.; Villar-Rodriguez, E.; Del Ser, J.; Nebro, A.J.; Molina, D.; LaTorre, A.; Ponnuthurai, N.S.; Coello, C.C.A.; Herrera, F. A tutorial on the design, experimentation and application of metaheuristic algorithms to real-world optimization problems. Swarm Evol. Comput. 2021, 64, 100888. [Google Scholar] [CrossRef]
- Črepinšek, M.; Liu, S.; Mernik, M. Exploration and exploitation in evolutionary algorithms. ACM Comput. Surv. 2013, 45, 1–33. [Google Scholar] [CrossRef]
- Keitsch, K.A.; Bruckner, T. Modular electrical demand forecasting framework - A novel hybrid model approach. In Proceedings of the 2016 13th International Multi-Conference on Systems, Signals & Devices (SSD), 2016; pp. 454–458. [Google Scholar] [CrossRef]
- Ghiasi, M.; Irani Jam, M.; Teimourian, M.; Zarrabi, H.; Yousefi, N. A new prediction model of electricity load based on hybrid forecast engine. Int. J. Ambient Energy 2017, 40, 179–186. [Google Scholar] [CrossRef]



























| Noise Scenario | # of NARX models (N) | MMPF-NARX | Behavior | |
| GARA | PSO | |||
| Medium | 7 | 2.70% | 2.30% | Under-Capacity |
| 8 | 2.42% | 2.08% | Improving | |
| 9 | 2.15% | 2.02% | Optimal | |
| 10 | 2.17% | 2.06% | Plateau | |
| Heavy | 7 | 3.1% | 2.88% | Under-Capacity |
| 8 | 2.88% | 2.58% | Improving | |
| 9 | 2.55% | 2.49% | Optimal | |
| 10 | 2.61% | 2.53% | Plateau | |
| Case | # of NARX models (N) | Behavior Description | |
|
7, 8 | Higher error rates are observed. The model cannot fully capture nonlinear oscillations and measurement noise. The number of subfilters is not enough. | |
|
9 | Best performance for both optimization methods. The diversity of subfilters is well balanced. This gives a more accurate representation of load dynamics. | |
|
10 | Error increases slightly (e.g., MAPE rises from 2.49% to 2.53% under heavy noise with PSO). Extra filters add unnecessary parameters. This may cause overfitting and reduce generalization ability. |
| Metric | GARA | PSO | Improvement (%) |
| Complexity per Iteration | ~25% Faster | ||
| Avg. Iterations to Convergence | 26 - 30 | 14 -18 | ~45% Faster |
| Floating Point Ops (FLOPs) | High (Sorting/ Randomizing) | Low (Vector Addition) |
- |
| Memory Overhead | Moderate (Population buffer) | Minimal (Velocity vectors) |
- |
| Stability (Figure 25, Figure 26 and Figure 27) | Stochastic (High Variance) |
Deterministic (Low Var.) | - |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).