Submitted:
17 February 2026
Posted:
25 February 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Related Works
1.2. Contributions
- A Transformer-based IL model predicts optimal refrigeration setpoint trajectories directly from historical price and temporal data, removing the need for intermediate load or production forecasts.
- Viability of IL-based prediction is proven within a systematic comparison to the optimal trajectory.
- Comprehensive evaluation of sequence, non-sequence, and linear SL models highlights the importance of temporal and non-linear modeling.
- Combined with a lower-level P controller, the model ensures stable operation while achieving cost-optimal setpoints, facilitating practical industrial implementation.
2. Methods
2.1. Data Acquisition and Processing
2.2. Industrial Warehouse Model
2.3. Mixed-Integer Linear Programming
2.4. Supervised Imitation Learning Models
- Electricity price at the current timestep.
- Hour of the day encoded as sine and cosine functions to capture daily periodicity.
- Weekday encoded as a one-hot vector to represent weekly patterns.
- Day of the year encoded as sine and cosine functions to capture seasonality.
2.5. Deep Reinforcement Learning
2.6. Heuristic Approaches
2.7. Ablation Study
2.8. Exploration of Additional Feature Candidates
- Ambient temperature (current and lagged)
- Thermal load (current and lagged)
3. Results and Discussion
3.1. Cost Savings
3.2. Prediction Quality
3.3. Computational Efficiency
3.4. Temperature Constraint Violations
3.5. Ablation Study
3.6. Exploration of Additional Feature Candidates
3.7. Practical Applicability, Limitations and Future Research Directions
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| DQN | Deep Q-network |
| EXAA | Energy exchange Austria |
| HVAC | Heating, ventilation and air conditioning |
| IL | Imitation Learning |
| LSTM | Long Short-Term Memory |
| MPC | Model predictive control |
| MAE | Mean absolute error |
| MILP | Mixed-Integer Linear Programming |
| P | Proportional |
| PI | Proportional-integral |
| RL | Reinforcement learning |
| RMSE | Root Mean Square Error |
| SL | Supervised Learning |
| sMAPE | Symmetric mean absolute percentage error |
Appendix A. Model Parameters
| Parameter | Value |
|---|---|
| 4 360 MJ/K | |
| 500 t | |
| 1 260 t | |
| 480 J/(kg K) | |
| 3 270 J/(kg K) | |
| 4.938 | |
| 202 511 W | |
| 0 W | |
| 5 °C | |
| 0 °C | |
| 60 s | |
| 500 000 W/K |
Appendix B. MILP Formulation
- is the set point temperature of the warehouse during the time period p.
- is the warehouse temperature at the time point t.
- is the output signal of the P-controller before saturation during the time period p.
- is a helper variable for calculating the saturation during the time period p.
- are binary variables to calculate the saturation during the time period p.
- is the electrical power consumption of the industrial refrigeration system p.
- is the price signal during the time period p.
- is the heat flow rate of the load during the time period p.
- is the length of a time period.
- is the initial warehouse temperature at the time point 0.
- is the thermal capacity of the warehouse.
- is the energy efficiency ratio of the industrial refrigeration system.
- proportional factor of the controller.
- is the minimum electrical power.
- is the maximum electrical power.
- is the minimum set point temperature.
- is the maximum set point temperature.
- N is the number of time periods.
- and are big M constraints.
- is the set of time period indices.
- is the set of time point indices.
- is a set to index of every hour of a day.
- is a set to index every minute in a hour.
Appendix C. Optimal Hyperparameters
| Model | Key hyperparameters |
|---|---|
| Transformer | model_size=2k, epochs=80 |
| LSTM | model_size=10k, epochs=60 |
| DecisionTree | max_depth=10, min_samples_split=15, min_samples_leaf=4, |
| max_features=None | |
| RandomForest | , max_depth=32, min_samples_split=3, |
| min_samples_leaf=1, max_features=None, bootstrap=True | |
| ExtraTrees | , max_depth=37, min_samples_split=8, |
| min_samples_leaf=1, max_features=None | |
| GradientBoosting | , learning_rate=0.080, max_depth=8, |
| subsample=0.951, min_samples_split=7, min_samples_leaf=4 | |
| AdaBoost | , learning_rate=0.032, loss=linear |
| XGBoost | , learning_rate=0.0205, max_depth=10, |
| subsample=0.854, colsample_bytree=0.860, | |
| reg_alpha=, reg_lambda=, | |
| min_child_weight=1.53 | |
| LightGBM | , learning_rate=0.0191, num_leaves=176, |
| subsample=0.765, colsample_bytree=0.734, | |
| reg_alpha=0.00742, reg_lambda=0.0423, | |
| min_child_samples=5 | |
| KNN | , weights=uniform, leaf_size=35, |
| SVR | kernel=RBF, , , scale |
| KernelRidge | kernel=laplacian, , |
| LinearRegression | – |
| LinearSVR | – |
| Ridge | |
| Lasso | |
| ElasticNet | , -ratio=0.970 |
| BayesianRidge | , , |
| , | |
| SGDRegressor | max_iter=1000, tol=, penalty=l1, |
| HuberRegressor | , |
| PassiveAggressive | max_iter=1000, tol=, , |
| PLSRegression | |
| DQN | num_episodes=10 000, batch_size=64, replay_memory=100 000, |
| eps_decay=10 000, network_depth=4, network_width=512 |
Appendix D. Hyperparameter Search Spaces
- model_size
- epochs (step size 20)
- max depth
- min samples split
- min samples leaf
- max features
- bootstrap
- max depth
- min samples split
- min samples leaf
- max features
- max depth
- min samples split
- min samples leaf
- max features
- learning rate (log)
- max depth
- subsample
- min samples split
- min samples leaf
- learning rate (log)
- loss
- learning rate (log)
- max depth
- subsample
- colsample_bytree
- regα, regλ (log)
- min child weight (log)
- tree method = hist
- learning rate (log)
- num leaves
- max depth
- subsample
- colsample_bytree
- regα, regλ (log)
- min child samples
- weights
- leaf size
- (Minkowski metric)
- kernel
- (log)
- (log)
- degree (only if kernel = poly)
- kernel
- (log)
- (log) (only if kernel = rbf or laplacian)
- degree (only if kernel = poly)
- coef0 (only if kernel = poly)
- (log)
- (log)
- (log)
- ratio
- (log)
- (log)
- penalty
- max_iter = 1 000, tol =
- (log)
- max_iter = 1 000
- (log)
- (log)
- max_iter = 1 000, tol =
- Number of training episodes
- Batch size
- Replay buffer size
- Exploration decay steps
- Network depth (number of hidden layers)
- Network width (neurons per hidden layer)
Appendix E. Detailed Cost and Performance Results
| Family | Model | Costs [EUR] | Costs vs Base [%] |
|---|---|---|---|
| Base | - | 141 705 | 0.00 |
| Global Optimum | MILP | 111 848 | 21.07 |
| Sequence | Transformer | 114 314 | 19.33 |
| LSTM | 114 388 | 19.28 | |
| Tree-based | DecisionTree | 118 561 | 16.33 |
| RandomForest | 117 730 | 16.92 | |
| ExtraTrees | 117 128 | 17.34 | |
| GradientBoosting | 117 402 | 17.15 | |
| AdaBoost | 123 655 | 12.74 | |
| XGBoost | 117 320 | 17.21 | |
| LightGBM | 117 388 | 17.15 | |
| Instance | KNN | 117044 | 17.40 |
| Kernel | SVR | 119 655 | 15.56 |
| KernelRidge | 117 231 | 17.27 | |
| Linear | LinearRegression | 128 117 | 9.59 |
| LinearSVR | 127 970 | 9.69 | |
| Ridge | 128 134 | 9.58 | |
| Lasso | 128 117 | 9.59 | |
| ElasticNet | 128 117 | 9.59 | |
| BayesianRidge | 128 136 | 9.58 | |
| SGD | 128 309 | 9.45 | |
| Huber | 128 012 | 9.66 | |
| PassiveAggressive | 129 351 | 8.72 | |
| PLS | 128 116 | 9.59 | |
| RL | DQN | 113 800 | 19.69 |
| Heuristics | DayNight | 146 884 | -3.65 |
| Mean | 120 784 | 14.76 | |
| Median | 123 226 | 13.04 | |
| MinMaxPrice | 121 258 | 14.43 |
Appendix F. Detailed Prediction Results
| Family | Model | MAE | RMSE | sMAPE [%] | |
|---|---|---|---|---|---|
| Sequence | Transformer | 0.77 | 1.19 | 0.42 | 31.95 |
| LSTM | 0.78 | 1.23 | 0.38 | 32.91 | |
| Tree-based | DecisionTree | 0.90 | 1.24 | 0.37 | 38.27 |
| RandomForest | 0.89 | 1.23 | 0.38 | 37.80 | |
| ExtraTrees | 0.88 | 1.21 | 0.40 | 37.77 | |
| GradientBoosting | 0.90 | 1.24 | 0.37 | 37.87 | |
| AdaBoost | 1.03 | 1.27 | 0.34 | 42.60 | |
| XGBoost | 0.89 | 1.23 | 0.38 | 37.73 | |
| LightGBM | 0.90 | 1.25 | 0.36 | 38.10 | |
| Instance | KNN | 0.88 | 1.20 | 0.41 | 38.17 |
| Kernel | SVR | 0.93 | 1.20 | 0.41 | 38.86 |
| KernelRidge | 0.96 | 1.30 | 0.31 | 40.46 | |
| Linear | LinearRegression | 1.08 | 1.34 | 0.26 | 48.33 |
| LinearSVR | 1.07 | 1.36 | 0.24 | 46.06 | |
| Ridge | 1.08 | 1.34 | 0.26 | 48.33 | |
| Lasso | 1.08 | 1.34 | 0.26 | 48.33 | |
| ElasticNet | 1.08 | 1.34 | 0.26 | 48.33 | |
| BayesianRidge | 1.08 | 1.34 | 0.26 | 48.33 | |
| SGD | 1.08 | 1.34 | 0.26 | 48.58 | |
| Huber | 1.08 | 1.34 | 0.26 | 48.01 | |
| PassiveAggressive | 1.09 | 1.35 | 0.25 | 47.97 | |
| PLS | 1.08 | 1.34 | 0.26 | 48.33 | |
| RL | DQN | 1.34 | 1.77 | -0.28 | 49.79 |
Appendix G. Computational Performance: Training and Inference
| Family | Model | Train time | Runtime per prediction |
|---|---|---|---|
| Global Optimum | MILP | - | 22.8 s |
| Sequence | Transformer | 3.003 s | 526 ns |
| LSTM | 3.931 s | 1 µs | |
| Tree-based | DecisionTree | 0.033 s | 47 ns |
| RandomForest | 2.437 s | 9 µs | |
| ExtraTrees | 0.752 s | 6 µs | |
| GradientBoosting | 12.210 s | 5 µs | |
| AdaBoost | 1.362 s | 4 µs | |
| XGBoost | 1.038 s | 4 µs | |
| LightGBM | 0.215 s | 25 µs | |
| Instance | KNN | 0.004 s | 15 µs |
| Kernel | SVR | 4.457 s | 222 µs |
| KernelRidge | 17.736 s | 121 µs | |
| Linear | LinearRegression | 0.003 s | 89 ns |
| LinearSVR | 0.057 s | 36 ns | |
| Ridge | 0.001 s | 17 ns | |
| Lasso | 0.369 s | 17 ns | |
| ElasticNet | 0.363 s | 16 ns | |
| BayesianRidge | 0.004 s | 12 ns | |
| SGD | 0.009 s | 19 ns | |
| Huber | 0.008 s | 9 ns | |
| PassiveAggressive | 0.005 s | 18 ns | |
| PLS | 0.005 s | 21 ns | |
| RL | DQN | 20 min | 2 ms |
References
- Ergun, S.; Dik, A.; Boukhanouf, R.; Omer, S. Large-Scale Renewable Energy Integration: Tackling Technical Obstacles and Exploring Energy Storage Innovations. Sustainability 2025, 17, 1311. [CrossRef]
- Panda, S.; Mohanty, S.; Rout, K.; Sahu, B.; Parida, S. M.; Samanta, I.; Bajaj, M.; Piecha, M.; Blažek, V.; Prokop, L. A Comprehensive Review on Demand Side Management and Market Design for Renewable Energy Support and Integration. Energy Reports 2023, Volume 10, 2228–2250. [CrossRef]
- Eyimaya, S. E.; Altin, N. Review of Energy Management Systems in Microgrids. Applied Sciences 2024, 14 (3), 1249. [CrossRef]
- Integer Programming: 2nd Edition. In Integer Programming; John Wiley & Sons, Ltd, 2020; pp 1–34. [CrossRef]
- Yaghoubi, E.; Yaghoubi, E.; Maghami, M. R.; Rahebi, J.; Zareian Jahromi, M.; Ghadami (Melisa Rahebi), R.; Yusupov, Z. A Systematic Review and Meta-Analysis of Model Predictive Control in Microgrids: Moving Beyond Traditional Methods. Processes 2025, 13 (7), 2197. [CrossRef]
- Vázquez-Canteli, J. R.; Nagy, Z. Reinforcement Learning for Demand Response: A Review of Algorithms and Modeling Techniques. Applied Energy 2019, 235, 1072–1089. [CrossRef]
- Bakare, M. S.; Abdulkarim, A.; Zeeshan, M.; Shuaibu, A. N. A Comprehensive Overview on Demand Side Energy Management towards Smart Grids: Challenges, Solutions, and Future Direction. Energy Informatics 2023, 6 (1), 4. [CrossRef]
- García, J.; Fernández, F. A Comprehensive Survey on Safe Reinforcement Learning. The Journal of Machine Learning Research. 2015, 16 (1), 1437–1480.
- Bakare, M. S.; Abdulkarim, A.; Shuaibu, A. N.; Muhamad, M. M. Energy Management Controllers: Strategies, Coordination, and Applications. Energy Informatics 2024, 7 (1), 57. [CrossRef]
- Elmouatamid, A.; Ouladsine, R.; Bakhouya, M.; EL KAMOUN, N.; Khaidar, M.; Zine-dine, K. Review of Control and Energy Management Approaches in Micro-Grid Systems. Energies 2020, 14, 168. [CrossRef]
- Yassuda Yamashita, D.; Vechiu, I.; Gaubert, J.-P. Two-Level Hierarchical Model Predictive Control with an Optimised Cost Function for Energy Management in Building Microgrids. Applied Energy 2021, 285, 116420. [CrossRef]
- Shan, Y.; Ma, L.; Yu, X. Hierarchical Control and Economic Optimization of Microgrids Considering the Randomness of Power Generation and Load Demand. Energies 2023, 16, 5503. [CrossRef]
- Villalón, A.; Rivera, M.; Salgueiro, Y.; Muñoz, J.; Dragičević, T.; Blaabjerg, F. Predictive Control for Microgrid Applications: A Review Study. Energies 2020, 13 (10). [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is All You Need. In Proceedings of the 31st International Conference on Neural Information Processing Systems—NIPS’17, Long Beach, CA, USA, 4–9 December 2017; Curran Associates Inc.: Red Hook, NY, USA, 2017; pp. 6000–6010.
- Liu, J.; Fu, Y. Renewable Energy Forecasting: A Self-Supervised Learning-Based Transformer Variant. Energy 2023, 284, 128730. [CrossRef]
- Tian, Z.; Liu, W.; Jiang, W.; Wu, C. CNNs-Transformer Based Day-Ahead Probabilistic Load Forecasting for Weekends with Limited Data Availability. Energy 2024, 293, 130666. [CrossRef]
- Moosbrugger, L.; Seiler, V.; Wohlgenannt, P.; Hegenbart, S.; Ristov, S.; Eder, E.; Kepplinger, P. Load Forecasting for Households and Energy Communities: Are Deep Learning Models Worth the Effort? arXiv July 18, 2025. [CrossRef]
- Wohlgenannt, P.; Moosbrugger, L.; Hegenbart, S.; Huber, G.; Kolhe, M.; Kepplinger, P. Optimal Sizing and Operation of a Battery Energy Storage System for Demand Response in a Food Processing Plant Using Deep Learning for Load Forecasting. In Data Science – Analytics and Applications. Proceedings of the 6th International Data Sciene Conference - iDSC2025; Springer Nature: Salzburg, Austria, 2025. [CrossRef]
- Wohlgenannt, P.; Moosbrugger, L.; Vetter, V.; Hegenbart, S.; Kolhe, M.; Eder, E.; Kepplinger, P. Energy Cost and Emission Optimization in Food Industry Using Deep Reinforcement Learning and Transformer-Based Load Forecasting. Energy Conversion and Management: X 2025, 28, 101364. [CrossRef]
- Yassuda Yamashita, D.; Vechiu, I.; Gaubert, J.-P. Two-Level Hierarchical Model Predictive Control with an Optimised Cost Function for Energy Management in Building Microgrids. Applied Energy 2021, 285, 116420. [CrossRef]
- Kepplinger, P.; Huber, G.; Petrasch, J. Autonomous Optimal Control for Demand Side Management with Resistive Domestic Hot Water Heaters Using Linear Optimization. Energy and Buildings 2015, 100, 50–55. [CrossRef]
- Ireshika, M. A. S. T.; Preissinger, M.; Kepplinger, P. Autonomous Demand Side Management of Electric Vehicles in a Distribution Grid. In 2019 7th International Youth Conference on Energy (IYCE); 2019; pp 1–6. [CrossRef]
- Ireshika, M. A. S. T.; Kepplinger, P. Uncertainties in Model Predictive Control for Decentralized Autonomous Demand Side Management of Electric Vehicles. Journal of Energy Storage 2024, 83, 110194. [CrossRef]
- Baumann, C.; Wohlgenannt, P.; Streicher, W.; Kepplinger, P. Optimizing Heat Pump Control in an NZEB via Model Predictive Control and Building Simulation. Energies 2025, 18, 100. [CrossRef]
- Seiler, V.; Moosbrugger, L.; Huber, G.; Kepplinger, P. Assessing Model Predictive Control for Energy Communities’ Flexibilities. In e-nova 2024 Intelligente Energie- und Klimastrategien: Energie - Gebäude - Umwelt BAND 27; Holzhausen, 2024. [CrossRef]
- Wohlgenannt, P.; Preißinger, M.; Kolhe, M. L.; Kepplinger, P. Demand Side Management of a Battery-Supported Manufacturing Process with On-Site Generation. In 2022 IEEE 7th International Energy Conference (ENERGYCON); IEEE: Riga, Latvia, 2022. [CrossRef]
- Wohlgenannt, P.; Huber, G.; Rheinberger, K.; Preißinger, M.; Kepplinger, P. Modelling of a Food Processing Plant for Industrial Demand Side Management. In HEAT POWERED CYCLES 2021 Conference Proceedings; Heat Powered Cycles: Bilbao, Spain, 2022; pp 638–649. [CrossRef]
- Wohlgenannt, P.; Huber, G.; Rheinberger, K.; Kolhe, M.; Kepplinger, P. Comparison of Demand Response Strategies Using Active and Passive Thermal Energy Storage in a Food-Processing Plant. Energy Rep. 2024, 12, 226–236. [CrossRef]
- Saffari, M.; de Gracia, A.; Fernández, C.; Belusko, M.; Boer, D.; Cabeza, L. F. Optimized Demand Side Management (DSM) of Peak Electricity Demand by Coupling Low Temperature Thermal Energy Storage (TES) and Solar PV. Applied Energy 2018, 211, 604–616. [CrossRef]
- Kepplinger, P.; Huber, G.; Petrasch, J. Field Testing of Demand Side Management via Autonomous Optimal Control of a Domestic Hot Water Heater. Energy and Buildings 2016, 127, 730–735. [CrossRef]
- Baumann, C.; Huber, G.; Alavanja, J.; Preißinger, M.; Kepplinger, P. Experimental Validation of a State-of-the-Art Model Predictive Control Approach for Demand Side Management with a Hot Water Heat Pump. Energy and Buildings 2023, 285, 112923. [CrossRef]
- Afroosheh, S.; Esapour, K.; Khorram-Nia, R.; Karimi, M. Reinforcement Learning Layout-Based Optimal Energy Management in Smart Home: AI-Based Approach. IET Gener. Transm. Distrib. 2024, 18, 2509–2520. [CrossRef]
- Muriithi, G.; Chowdhury, S. Deep Q-Network Application for Optimal Energy Management in a Grid-Tied Solar PV-Battery Microgrid. J. Eng. 2022, 2022, 422–441. [CrossRef]
- Jiang, Z.; Risbeck, M. J.; Ramamurti, V.; Murugesan, S.; Amores, J.; Zhang, C.; Lee, Y. M.; Drees, K. H. Building HVAC Control with Reinforcement Learning for Reduction of Energy Cost and Demand Charge. Energy Build. 2021, 239, 110833. [CrossRef]
- Vetter, V.; Wohlgenannt, P.; Kepplinger, P.; Eder, E. Deep Reinforcement Learning Approaches the MILP Optimum of a Multi-Energy Optimization in Energy Communities. Energies 2025, 18 (17), 4489. [CrossRef]
- Wohlgenannt, P.; Hegenbart, S.; Eder, E.; Kolhe, M.; Kepplinger, P. Energy Demand Response in a Food-Processing Plant: A Deep Reinforcement Learning Approach. Energies 2024, 17 (24), 6430. [CrossRef]
- Novak, M.; Dragicevic, T. Supervised Imitation Learning of Finite-Set Model Predictive Control Systems for Power Electronics. IEEE Transactions on Industrial Electronics 2021, 68 (2), 1717–1723. [CrossRef]
- Gao, S.; Xiang, C.; Yu, M.; Tan, K. T.; Lee, T. H. Online Optimal Power Scheduling of a Microgrid via Imitation Learning. IEEE Transactions on Smart Grid 2022, 13 (2), 861–876. [CrossRef]
- Dinh, H. T.; Kim, D. MILP-Based Imitation Learning for HVAC Control. IEEE Internet of Things Journal 2022, 9 (8), 6107–6120. [CrossRef]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A. A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A. K.; Ostrovski, G.; et al. Human-Level Control through Deep Reinforcement Learning. Nature 2015, 518, 529–533. [CrossRef]
- Watkins, C.J.C.H.; Dayan, P. Q-Learning. Mach. Learn. 1992, 8, 279–292. [CrossRef]
- van Hasselt, H.; Guez, A.; Silver, D. Deep Reinforcement Learning with Double Q-learning. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16), Phoenix, AZ, USA, 12–17 February 2016; pp.2094–2100. [CrossRef]
- Azuatalam, D.; Lee, W.-L.; de Nijs, F.; Liebman, A. Reinforcement Learning for Whole-Building HVAC Control and Demand Response. Energy AI 2020, 2, 100020. [CrossRef]
- DAY-AHEAD PREISE. Available online: https://markttransparenz.apg.at/de/markt/Markttransparenz/Uebertragung/EXAA-Spotmarkt (accessed on 23 September 2024).
- Lillicrap, T.P.; Hunt, J.J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous Control with Deep Reinforcement Learning. arXiv 2019, arXiv:1509.02971.
- Gurobi version 11.0. Available online: https://www.gurobi.com (accessed on 23 September 2024).
- scikit-learn version 1.6.0. Available online: https://scikit-learn.org/ (accessed on 1 December 2025).
- Pytorch version 2.1.1. Available online: https://pytorch.org (accessed on 23 September 2024).
- Optuna version 4.4. Available online: https://optuna.org (accessed on 1 December 2025).








| Family | Model | > +5.25 | < -0.25 °C | Sum | Max (°C) | Min (°C) |
|---|---|---|---|---|---|---|
| Base | - | 0 | 0 | 0 | 2.71 | 2.50 |
| Global Optimum | MILP | 0 | 0 | 0 | 5.17 | 0.08 |
| Sequence | Transformer | 0 | 0 | 0 | 5.22 | 0.14 |
| LSTM | 2 | 0 | 2 | 5.29 | 0.06 | |
| Tree-based | DecisionTree | 0 | 0 | 0 | 5.17 | 0.16 |
| RandomForest | 0 | 0 | 0 | 5.12 | 0.16 | |
| ExtraTrees | 0 | 0 | 0 | 5.12 | 0.14 | |
| GradientBoosting | 25 | 0 | 25 | 5.55 | 0.14 | |
| AdaBoost | 0 | 0 | 0 | 4.56 | 1.86 | |
| XGBoost | 1 | 0 | 1 | 5.27 | -0.13 | |
| LightGBM | 2 | 0 | 2 | 5.31 | -0.06 | |
| Neighbors | KNN | 0 | 0 | 0 | 5.14 | 0.10 |
| Kernel | SVR | 30 | 3 | 33 | 6.81 | -1.47 |
| KernelRidge | 205 | 27 | 232 | 6.69 | -1.68 | |
| Linear | LinearRegression | 32 | 0 | 32 | 5.43 | 1.43 |
| LinearSVR | 157 | 0 | 157 | 5.66 | 1.28 | |
| Ridge | 32 | 0 | 32 | 5.43 | 1.44 | |
| Lasso | 32 | 0 | 32 | 5.43 | 1.43 | |
| ElasticNet | 32 | 0 | 32 | 5.43 | 1.43 | |
| BayesianRidge | 32 | 0 | 32 | 5.42 | 1.44 | |
| SGD | 9 | 0 | 9 | 5.33 | 1.60 | |
| Huber | 49 | 0 | 49 | 5.46 | 1.43 | |
| PassiveAggressive | 51 | 0 | 51 | 5.45 | 1.63 | |
| PLS | 36 | 0 | 36 | 5.43 | 1.44 | |
| RL | DQN | 0 | 0 | 0 | 5.16 | 0.06 |
| Heuristics | DayNight | 0 | 0 | 0 | 5.10 | 0.06 |
| Mean | 0 | 0 | 0 | 5.19 | 0.06 | |
| Median | 0 | 0 | 0 | 4.26 | 0.06 | |
| MinMaxPrice | 0 | 0 | 0 | 5.13 | 0.06 |
| Variant | sMAPE [%] | Costs [€] | Cost penalty [%] |
|---|---|---|---|
| reference/all | 32.57 | 114 274 | - |
| no EXAA price | 36.35 | 119 078 | 4.20 |
| no weekdays | 34.11 | 115 589 | 1.15 |
| no day of the year | 32.23 | 114 754 | 0.42 |
| no hour of the day | 34.11 | 114 660 | 0.34 |
| Feature Set | sMAPE [%] | Predicted Costs [€] | Cost Penalty [%] |
|---|---|---|---|
| reference | 32.08 | 114 380 | - |
| with temperature | 32.06 | 114 388 | 0.01 |
| with lagged temperature | 32.09 | 114 463 | 0.07 |
| with thermal load | 32.04 | 114 474 | 0.08 |
| with lagged thermal load | 32.04 | 114 380 | 0.00 |
| with all candidates | 32.09 | 114 456 | 0.07 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).