Submitted:
17 December 2023
Posted:
29 December 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction


2. Related Work
2.1. Models for LSTF
2.2. temporal covariates
3. Preliminary

4. Informer with Season-aware Block
4.1. Model inputs
4.2. Season-aware Block
4.3. Informer
4.3.1. ProbSparse Self-attention
4.3.2. Encoder
4.3.3. Decoder
4.3.4. Loss Function
5. Results
5.1. Datasets
| Models | Ours | Informer | Autoformer | LogTrans | LSTM | TCN | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Metric | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | MAE | |
| ElecZJ | 48 | 0.062 | 0.166 | 0.088 | 0.216 | 0.098 | 0.229 | 0.097 | 0.224 | 0.189 | 0.268 | 0.181 | 0.259 |
| 96 | 0.071 | 0.197 | 0.097 | 0.213 | 0.108 | 0.230 | 0.109 | 0.226 | 0.201 | 0.312 | 0.198 | 0.305 | |
| 192 | 0.131 | 0.255 | 0.154 | 0.273 | 0.146 | 0.261 | 0.154 | 0.273 | 0.231 | 0.330 | 0.289 | 0.361 | |
| 336 | 0.176 | 0.273 | 0.204 | 0.316 | 0.189 | 0.301 | 0.214 | 0.346 | 0.372 | 0.369 | 0.378 | 0.389 | |
| 720 | 0.354 | 0.372 | 0.401 | 0.425 | 0.410 | 0.399 | 0.426 | 0.425 | 0.512 | 0.498 | 0.489 | 0.476 | |
| ETTm1 | 48 | 0.431 | 0.449 | 0.494 | 0.503 | 0.498 | 0.482 | 0.507 | 0.583 | 1.392 | 0.939 | 2.941 | 1.299 |
| 96 | 0.472 | 0.432 | 0.565 | 0.553 | 0.505 | 0.475 | 0.568 | 0.642 | 1.352 | 0.902 | 3.041 | 1.330 | |
| 192 | 0.503 | 0.451 | 0.733 | 0.763 | 0.553 | 0.496 | 0.989 | 0.857 | 1.532 | 1.059 | 3.072 | 1.339 | |
| 336 | 0.569 | 0.500 | 1.056 | 0.786 | 0.621 | 0.537 | 1.462 | 1.320 | 1.740 | 1.124 | 3.105 | 1.348 | |
| 720 | 0.629 | 0.501 | 1.192 | 0.926 | 0.671 | 0.561 | 1.669 | 1.461 | 2.736 | 1.555 | 3.135 | 1.354 | |
| ETTm2 | 48 | 0.168 | 0.234 | 0.344 | 0.423 | 0.195 | 0.266 | 0.536 | 0.501 | 1.453 | 1.223 | 0.69 | 0.579 |
| 96 | 0.165 | 0.262 | 0.355 | 0.462 | 0.205 | 0.293 | 0.768 | 0.642 | 1.523 | 1.346 | 0.751 | 0.642 | |
| 192 | 0.223 | 0.301 | 0.595 | 0.586 | 0.278 | 0.336 | 0.989 | 0.757 | 1.763 | 1.673 | 1.230 | 0.975 | |
| 336 | 0.304 | 0.331 | 1.270 | 0.871 | 0.343 | 0.379 | 1.334 | 0.872 | 1.985 | 1.879 | 2.334 | 1.332 | |
| 720 | 0.389 | 0.392 | 3.001 | 1.267 | 0.414 | 0.419 | 3.048 | 1.328 | 3.012 | 2.563 | 3.971 | 1.561 | |
| ETTh1 | 48 | 0.376 | 0.364 | 0.685 | 0.625 | 0.405 | 0.409 | 0.766 | 0.757 | 0.702 | 0.675 | 0.689 | 0.664 |
| 96 | 0.410 | 0.442 | 0.882 | 0.642 | 0.449 | 0.459 | 0.826 | 0.771 | 0.753 | 0.701 | 0.769 | 0.713 | |
| 192 | 0.473 | 0.463 | 0.989 | 0.852 | 0.500 | 0.482 | 1.103 | 0.871 | 1.252 | 0.893 | 1.312 | 0.921 | |
| 336 | 0.502 | 0.474 | 1.128 | 0.873 | 0.521 | 0.496 | 1.362 | 0.952 | 1.424 | 0.994 | 1.486 | 1.191 | |
| 720 | 0.506 | 0.489 | 1.215 | 0.896 | 0.514 | 0.512 | 1.397 | 1.291 | 1.96 | 1.322 | 4.192 | 1.991 | |
| ETTh2 | 48 | 0.302 | 0.319 | 1.457 | 1.001 | 0.324 | 0.334 | 1.806 | 1.034 | 1.671 | 1.221 | 1.703 | 1.239 |
| 96 | 0.345 | 0.366 | 1.861 | 1.102 | 0.358 | 0.397 | 2.806 | 1.234 | 2.553 | 1.432 | 2.430 | 1.319 | |
| 192 | 0.421 | 0.412 | 2.923 | 1.483 | 0.456 | 0.452 | 3.992 | 1.712 | 2.776 | 1.589 | 3.134 | 1.841 | |
| 336 | 0.459 | 0.432 | 3.489 | 1.515 | 0.482 | 0.486 | 4.070 | 1.763 | 3.434 | 1.549 | 3.539 | 1.864 | |
| 720 | 0.471 | 0.455 | 3.467 | 1.473 | 0.515 | 0.511 | 3.913 | 1.552 | 3.963 | 1.788 | 4.192 | 1.991 | |
| Electricity | 48 | 0.268 | 0.268 | 0.368 | 0.424 | 0.282 | 0.294 | 0.368 | 0.432 | 0.574 | 0.602 | 0.559 | 0.579 |
| 96 | 0.265 | 0.281 | 0.370 | 0.426 | 0.299 | 0.305 | 0.370 | 0.431 | 0.601 | 0.642 | 0.598 | 0.631 | |
| 192 | 0.289 | 0.298 | 0.391 | 0.435 | 0.320 | 0.336 | 0.388 | 0.450 | 0.920 | 0.806 | 1.012 | 0.891 | |
| 336 | 0.302 | 0.312 | 0.406 | 0.443 | 0.339 | 0.351 | 0.409 | 0.454 | 1.676 | 1.095 | 1.921 | 1.469 | |
| 720 | 0.324 | 0.345 | 0.460 | 0.548 | 0.355 | 0.381 | 0.477 | 0.589 | 1.591 | 1.128 | 2.011 | 1.691 | |
| Weather | 48 | 0.212 | 0.286 | 0.395 | 0.459 | 0.243 | 0.301 | 0.426 | 0.495 | 0.829 | 0.677 | 0.513 | 0.479 |
| 96 | 0.245 | 0.312 | 0.470 | 0.484 | 0.266 | 0.336 | 0.458 | 0.520 | 0.988 | 0.720 | 0.615 | 0.589 | |
| 192 | 0.273 | 0.330 | 0.658 | 0.584 | 0.307 | 0.367 | 0.738 | 0.671 | 1.475 | 0.925 | 0.629 | 0.600 | |
| 336 | 0.324 | 0.351 | 0.702 | 0.620 | 0.359 | 0.395 | 0.754 | 0.670 | 1.657 | 1.059 | 0.639 | 0.608 | |
| 720 | 0.376 | 0.398 | 0.831 | 0.731 | 0.419 | 0.428 | 0.885 | 0.773 | 1.536 | 1.109 | 0.650 | 0.610 | |
| Count | 70 | 6 | 6 | 0 | 0 | 0 | |||||||
5.2. Implement details
5.3. BaseLines
5.4. Accuracy
5.5. Train and Inference Speed
5.6. Ablation studies
5.7. Model Analysis
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
Appendix A.1. Fast Fourier Transform
References
- Zhou, H.; Zhang, S.; Peng, J.; Zhang, S.; Li, J.; Xiong, H.; Zhang, W. Informer: Beyond efficient transformer for long sequence time-series forecasting. Proceedings of the AAAI conference on artificial intelligence, 2021, Vol. 35, pp. 11106–11115.
- Torres, J.F.; Hadjout, D.; Sebaa, A.; Martínez-Álvarez, F.; Troncoso, A. Deep learning for time series forecasting: a survey. Big Data 2021, 9, 3–21. [Google Scholar] [CrossRef] [PubMed]
- Wu, H.; Xu, J.; Wang, J.; Long, M. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Advances in Neural Information Processing Systems 2021, 34, 22419–22430. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Advances in neural information processing systems 2017, 30. [Google Scholar]
- Kitaev, N.; Kaiser, Ł.; Levskaya, A. Reformer: The efficient transformer. arXiv preprint arXiv:2001.04451 2020.
- Zeng, A.; Chen, M.; Zhang, L.; Xu, Q. Are transformers effective for time series forecasting? Proceedings of the AAAI conference on artificial intelligence, 2023, Vol. 37, pp. 11121–11128.
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; others. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 2020.
- Salinas, D.; Flunkert, V.; Gasthaus, J.; Januschowski, T. DeepAR: Probabilistic forecasting with autoregressive recurrent networks. International Journal of Forecasting 2020, 36, 1181–1191. [Google Scholar] [CrossRef]
- Bai, S.; Kolter, J.Z.; Koltun, V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 2018.
- Wu, N.; Green, B.; Ben, X.; O’Banion, S. Deep transformer models for time series forecasting: The influenza prevalence case. arXiv preprint arXiv:2001.08317 2020.
- Li, S.; Jin, X.; Xuan, Y.; Zhou, X.; Chen, W.; Wang, Y.X.; Yan, X. Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting. Advances in neural information processing systems 2019, 32. [Google Scholar]
- Yu, F.; Koltun, V.; Funkhouser, T. Dilated residual networks. Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 472–480.
- Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 2014.
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778.
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural computation 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).