Submitted:
14 January 2025
Posted:
15 January 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Preprocessing Time Series Data
2.2. Generating Time Series Segments
2.3. Segment-Level Normalization and Denormalization
2.4. Time Series Segment Embedding
2.5. Asymmetric Autoencoder
2.6. Training
2.7. Evaluation
3. Comparative Experiments and Results
3.1. Benchmark Datasets
3.1.1. Multivariate datasets
| # sub-datasets | # dimensions | # training data points | # test data points | Anomaly ratio (%) | |
| MSL | 27 | 55 | 58,317 | 73,729 | 10.53 |
| SMAP | 54 | 25 | 138,004 | 435,826 | 12.84 |
| SMD | 28 | 38 | 708,405 | 708,420 | 4.16 |
| PSM | 1 | 25 | 132,481 | 87,841 | 27.76 |
| GECCO | 1 | 9 | 69,260 | 69,261 | 1.05 |
| SWAN-SF | 1 | 38 | 60,000 | 60,000 | 32.60 |
3.1.2. Univariate datasets
| # sub-datasets | # training data points | # test data points | Anomaly ratio (%) | |
| ABP | 42 | 1,036,746 | 1,841,461 | 0.37 |
| Acceleration | 7 | 38,400 | 62,337 | 1.71 |
| Air Temperature | 13 | 52,000 | 54,392 | 0.82 |
| ECG | 91 | 1,795,083 | 6,047,314 | 0.38 |
| EPG | 25 | 119,000 | 410,415 | 0.45 |
| Gait | 33 | 1,157,571 | 2,784,520 | 0.38 |
| NASA | 11 | 38,500 | 86,296 | 0.86 |
| Power Demand | 11 | 197,149 | 311,629 | 0.61 |
| RESP | 17 | 868,000 | 2,452,953 | 0.12 |
3.2. Baselines
3.3. Implementation Details
3.4. Main Results
3.5. Ablation Study
3.5.1. Segment-level normalization and denormalization
3.5.2. Positional Encoding
3.5.3. Preprocessing Time Series Data
3.6. Hyperparameter Sensitivity
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| 1D | one-dimensional |
| ABP | Arterial blood pressure benchmark |
| APE | Absolute Positional Encoding |
| CNN | Convolutional Neural Network |
| ECG | Electrocardiogram benchmark |
| EPG | Electrical Penetration Graph benchmark |
| GECCO | Genetic and Evolutionary Computation Conference benchmark |
| GRU | Gated Recurrent Unit |
| LSTM | Long Short Term Memory |
| MSL | Mars Science Laboratory benchmark |
| PA | Point Adjustment |
| PE | Positional Encoding |
| PSM | Pooled Server Metrics benchmark |
| RESP | Respiration benchmark |
| RevIN | Reversible Instance Normalization |
| RNN | Recurrent Neural Network |
| SegND | Segment-level Normalization-Denormalization |
| SMAP | Soil Moisture Active Passive satellite benchmark |
| SMD | Server Machine Dataset benchmark |
| SWAN-SF | Space Weather Analytics for Solar Flares benchmark |
| TCN | Temporal Convolutional Network |
Appendix A
| MSL | SMAP | SMD | PSM | GECCO | SWAN-SF | |
| # epochs | 1 | 1 | 6 | 4 | 4 | 5 |
| Batch size | 32 | 32 | 32 | 32 | 32 | 32 |
| # layers | 1 | 3 | 5 | 2 | 2 | 2 |
| # heads | 1 | 1 | 8 | 8 | 8 | 8 |
| Model dimension size | 96 | 16 | 128 | 256 | 128 | 512 |
| Feed-forward dimension size | 96 | 32 | 256 | 512 | 256 | 1024 |
| Window length | 25 | 50 | 300 | 30 | 10 | 400 |
| Training window stride | 100 | 100 | 300 | 30 | 10 | 200 |
| Test window stride | 25 | 50 | 300 | 30 | 10 | 400 |
| Dropout ratio | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.0 |
| Learning rate | 3e-4 | 3e-4 | 3e-4 | 3e-4 | 3e-4 | 3e-4 |
| Sub-dataset-level norm | - | - | - | - | - | - |
| Dataset-level norm | - | - | - | - | z-score | - |
| ABP | Acceleration | Air Temperature | ECG | EPG | Gait | NASA | Power Demand | RESP | |
| # epochs | 3 | 5 | 10 | 1 | 3 | 2 | 5 | 2 | 1 |
| Batch size | 32 | 32 | 32 | 32 | 32 | 32 | 8 | 32 | 32 |
| # layers | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 3 | 1 |
| # heads | 8 | 8 | 8 | 8 | 8 | 8 | 1 | 8 | 8 |
| Model dimension size | 512 | 256 | 256 | 256 | 256 | 256 | 32 | 128 | 256 |
| Feed-forward dimension size | 1024 | 512 | 512 | 512 | 512 | 512 | 32 | 256 | 512 |
| Window length | 100 | 200 | 50 | 100 | 30 | 350 | 100 | 100 | 100 |
| Training window stride | 100 | 400 | 50 | 100 | 30 | 350 | 100 | 100 | 100 |
| Test window stride | 100 | 200 | 50 | 100 | 30 | 350 | 100 | 100 | 100 |
| Dropout ratio | 0.0 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.0 | 0.1 |
| Learning rate | 5e-5 | 3e-5 | 3e-3 | 3e-4 | 5e-5 | 5e-5 | 5e-5 | 3e-3 | 3e-3 |
| Sub-dataset-level norm | minmax | minmax | - | minmax | minmax | minmax | - | minmax | minmax |
| Dataset-level norm | z-score | z-score | - | z-score | - | - | - | z-score | z-score |
| MSL | SMAP | SMD | PSM | GECCO | SWAN-SF | |||||||||||||
| P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 | |
| Anomaly Transformer | 0.921 | 0.952 | 0.936 | 0.942 | 0.994 | 0.967 | 0.894 | 0.955 | 0.923 | 0.969 | 0.989 | 0.979 | - | - | - | - | - | - |
| DCdetector | 0.937 | 0.997 | 0.966 | 0.956 | 0.989 | 0.970 | 0.836 | 0.911 | 0.872 | 0.971 | 0.987 | 0.979 | 0.383 | 0.597 | 0.466 | 0.955 | 0.596 | 0.734 |
| MEMTO | 0.921 | 0.968 | 0.944 | 0.938 | 0.996 | 0.966 | 0.891 | 0.984 | 0.935 | 0.975 | 0.992 | 0.983 | - | - | - | - | - | - |
| AnomalyLLM | 0.937 | 0.979 | 0.958 | 0.944 | 0.969 | 0.956 | 0.934 | 0.998 | 0.965 | 0.996 | 0.998 | 0.997 | 0.511 | 0.793 | 0.620 | 0.873 | 0.745 | 0.804 |
| Ours | 0.917 | 0.971 | 0.943 | 0.964 | 0.986 | 0.975 | 0.894 | 0.989 | 0.939 | 0.992 | 0.994 | 0.993 | 0.919 | 0.921 | 0.920 | 0.861 | 0.749 | 0.801 |
| ABP | Acceleration | Air Temperature | ECG | EPG | |||||||||||
| AP | AR | AF1 | AP | AR | AF1 | AP | AR | AF1 | AP | AR | AF1 | AP | AR | AF1 | |
| TS-TCC | 0.763 | 0.745 | 0.754 | 0.555 | 0.543 | 0.549 | 0.980 | 0.957 | 0.969 | 0.758 | 0.782 | 0.784 | 0.928 | 0.935 | 0.931 |
| THOC | 0.822 | 0.808 | 0.815 | 0.782 | 0.770 | 0.776 | 0.984 | 0.958 | 0.971 | 0.762 | 0.758 | 0.760 | 0.911 | 0.905 | 0.908 |
| NCAD | 0.802 | 0.786 | 0.794 | 0.855 | 0.842 | 0.849 | 0.762 | 0.747 | 0.758 | 0.737 | 0.732 | 0.735 | 0.795 | 0.783 | 0.789 |
| AnomalyLLM | 0.931 | 0.910 | 0.920 | 0.965 | 0.948 | 0.956 | 0.989 | 0.959 | 0.974 | 0.768 | 0.808 | 0.787 | 0.935 | 0.932 | 0.933 |
| Ours | 0.725 | 0.991 | 0.838 | 0.934 | 1.000 | 0.966 | 0.951 | 0.993 | 0.972 | 0.698 | 0.942 | 0.802 | 0.743 | 0.966 | 0.840 |
| Gait | NASA | Power Demand | RESP | Average | |||||||||||
| AP | AR | AF1 | AP | AR | AF1 | AP | AR | AF1 | AP | AR | AF1 | AP | AR | AF1 | |
| TS-TCC | 0.798 | 0.790 | 0.794 | 0.512 | 0.508 | 0.511 | 0.767 | 0.759 | 0.763 | 0.561 | 0.560 | 0.560 | 0.736 | 0.731 | 0.735 |
| THOC | 0.788 | 0.780 | 0.784 | 0.902 | 0.891 | 0.896 | 0.777 | 0.772 | 0.775 | 0.382 | 0.395 | 0.389 | 0.790 | 0.782 | 0.786 |
| NCAD | 0.864 | 0.852 | 0.858 | 0.869 | 0.853 | 0.861 | 0.724 | 0.723 | 0.723 | 0.613 | 0.612 | 0.613 | 0.780 | 0.770 | 0.776 |
| AnomalyLLM | 0.891 | 0.852 | 0.871 | 0.969 | 0.953 | 0.961 | 0.888 | 0.884 | 0.886 | 0.736 | 0.736 | 0.736 | 0.897 | 0.887 | 0.892 |
| Ours | 0.767 | 0.998 | 0.867 | 0.892 | 0.962 | 0.926 | 0.766 | 0.994 | 0.865 | 0.629 | 0.969 | 0.763 | 0.789 | 0.979 | 0.871 |
References
- Cook, A.A.; Mısırlı, G.; Fan, Z. Anomaly detection for IoT time-series data: A survey. IEEE Internet of Things Journal 2019, 7, 6481–6494. [Google Scholar] [CrossRef]
- Lai, K.H.; Zha, D.; Xu, J.; Zhao, Y.; Wang, G.; Hu, X. (2021, June). Revisiting time series outlier detection: Definitions and benchmarks. In Thirty-fifth conference on neural information processing systems datasets and benchmarks track (round 1).
- Zamanzadeh Darban, Z.; Webb, G.I.; Pan, S.; Aggarwal, C.; Salehi, M. Deep learning for time series anomaly detection: A survey. ACM Computing Surveys 2024, 57, 1–42. [Google Scholar] [CrossRef]
- Choi, K.; Yi, J.; Park, C.; Yoon, S. Deep learning for anomaly detection in time-series data: Review, analysis, and guidelines. IEEE access 2021, 9, 120043–120065. [Google Scholar] [CrossRef]
- Basu, S.; Meckesheimer, M. Automatic outlier detection for time series: An application to sensor data. Knowledge and Information Systems 2007, 11, 137–154. [Google Scholar] [CrossRef]
- Hyndman, R.J.; Athanasopoulos, G. (2018). Forecasting: Principles and Practice 2nd Edition, 2018. Available: Otexts. com/fpp2/.[Accessed 8 May 2018].
- Breunig, M.M.; Kriegel, H.P.; Ng, R.T.; Sander, J. (2000, May). LOF: Identifying density-based local outliers. In Proceedings of the 2000 ACM SIGMOD international conference on Management of data (pp. 93–104).
- Hahsler, M.; Bolaños, M. Clustering data streams based on shared density between micro-clusters. IEEE transactions on knowledge and data engineering 2016, 28, 1449–1461. [Google Scholar] [CrossRef]
- He, J.; Cheng, Z.; Guo, B. Anomaly detection in satellite telemetry data using a sparse feature-based method. Sensors 2022, 22, 6358. [Google Scholar] [CrossRef]
- Paffenroth, R.; Kay, K.; Servi, L. Robust pca for anomaly detection in cyber networks. arXiv 2018, arXiv:1801.01571. [Google Scholar]
- Ramaswamy, S.; Rastogi, R.; Shim, K. (2000, May). Efficient algorithms for mining outliers from large data sets. In Proceedings of the 2000 ACM SIGMOD international conference on Management of data (pp. 427–438).
- Chen, T.; Guestrin, C. (2016, August). Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining (pp. 785–794).
- Schmidl, S.; Wenig, P.; Papenbrock, T. Anomaly detection in time series: A comprehensive evaluation. Proceedings of the VLDB Endowment 2022, 15, 1779–1797. [Google Scholar] [CrossRef]
- Schmidt, M.; Simic, M. Normalizing flows for novelty detection in industrial time series data. arXiv 2019, arXiv:1906.06904. [Google Scholar]
- Hundman, K.; Constantinou, V.; Laporte, C.; Colwell, I.; Soderstrom, T. (2018, July). Detecting spacecraft anomalies using lstms and nonparametric dynamic thresholding. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery data mining (pp. 387–395).
- Su, Y.; Zhao, Y.; Niu, C.; Liu, R.; Sun, W.; Pei, D. (2019, July). Robust anomaly detection for multivariate time series through stochastic recurrent neural network. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery data mining (pp. 2828–2837).
- Malhotra, P.; Ramakrishnan, A.; Anand, G.; Vig, L.; Agarwal, P.; Shroff, G. LSTM-based encoder-decoder for multi-sensor anomaly detection. arXiv 2016, arXiv:1607.00148. [Google Scholar]
- Munir, M.; Siddiqui, S.A.; Dengel, A.; Ahmed, S. DeepAnT: A deep learning approach for unsupervised anomaly detection in time series. IEEE Access 2018, 7, 1991–2005. [Google Scholar] [CrossRef]
- Bashar, M.A.; Nayak, R. (2020, December). TAnoGAN: Time series anomaly detection with generative adversarial networks. In 2020 IEEE Symposium Series on Computational Intelligence (SSCI) (pp. 1778–1785). IEEE.
- Shen, L.; Li, Z.; Kwok, J. Timeseries anomaly detection using temporal hierarchical one-class network. Advances in Neural Information Processing Systems 2020, 33, 13016–13026. [Google Scholar]
- Shen, L.; Yu, Z.; Ma, Q.; Kwok, J.T. (2021, May). Time series anomaly detection with multiresolution ensemble decoding. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 35, No. 11, pp. 9567–9575).
- Ngu, H.C.V.; Lee, K.M. CL-TAD: A Contrastive-Learning-Based Method for Time Series Anomaly Detection. Applied Sciences 2023, 13, 11938. [Google Scholar] [CrossRef]
- Cherdo, Y.; Miramond, B.; Pegatoquet, A.; Vallauri, A. Unsupervised anomaly detection for cars CAN sensors time series using small recurrent and convolutional neural networks. Sensors 2023, 23, 5013. [Google Scholar] [CrossRef]
- Yue, Z.; Wang, Y.; Duan, J.; Yang, T.; Huang, C.; Tong, Y.; Xu, B. (2022, June). Ts2vec: Towards universal representation of time series. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 36, No. 8, pp. 8980–8987).
- Zhao, M.; Peng, H.; Li, L.; Ren, Y. Graph Attention Network and Informer for Multivariate Time Series Anomaly Detection. Sensors 2024, 24, 1522. [Google Scholar] [CrossRef] [PubMed]
- Deng, A.; Hooi, B. (2021, May). Graph neural network-based anomaly detection in multivariate time series. In Proceedings of the AAAI conference on artificial intelligence (Vol. 35, No. 5, pp. 4027–4035).
- Chen, Z.; Chen, D.; Zhang, X.; Yuan, Z.; Cheng, X. Learning graph structures with transformer for multivariate time-series anomaly detection in IoT. IEEE Internet of Things Journal 2021, 9, 9179–9189. [Google Scholar] [CrossRef]
- Geiger, A.; Liu, D.; Alnegheimish, S.; Cuesta-Infante, A.; Veeramachaneni, K. (2020, December). Tadgan: Time series anomaly detection using generative adversarial networks. In 2020 ieee international conference on big data (big data) (pp. 33–43). IEEE.
- Su, Y.; Zhao, Y.; Niu, C.; Liu, R.; Sun, W.; Pei, D. (2019, July). Robust anomaly detection for multivariate time series through stochastic recurrent neural network. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery data mining (pp. 2828–2837).
- Talukder, S.; Yue, Y.; Gkioxari, G. TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis. arXiv 2024, arXiv:2402.16412. [Google Scholar]
- Vaswani, A. (2017). Attention is all you need. Advances in Neural Information Processing Systems.
- Wang, C.; Xing, S.; Gao, R.; Yan, L.; Xiong, N.; Wang, R. Disentangled dynamic deviation transformer networks for multivariate time series anomaly detection. Sensors 2023, 23, 1104. [Google Scholar] [CrossRef]
- Lai, C.Y.A.; Sun, F.K.; Gao, Z.; Lang, J.H.; Boning, D. Nominality score conditioned time series anomaly detection by point/sequential reconstruction. Advances in Neural Information Processing Systems 2024, 36. [Google Scholar]
- Feng, Y.; Zhang, W.; Fu, Y.; Jiang, W.; Zhu, J.; Ren, W. (2024, August). Sensitivehue: Multivariate time series anomaly detection by enhancing the sensitivity to normal patterns. In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp. 782–793).
- Yue, W.; Ying, X.; Guo, R.; Chen, D.; Shi, J.; Xing, B.; ... Chen, T. Sub-Adjacent Transformer: Improving Time Series Anomaly Detection with Reconstruction Error from Sub-Adjacent Neighborhoods. arXiv 2024, arXiv:2404.18948.
- Liu, C.; He, S.; Zhou, Q.; Li, S.; Meng, W. Large language model guided knowledge distillation for time series anomaly detection. arXiv 2024, arXiv:2401.15123. [Google Scholar]
- Xu, J. Anomaly transformer: Time series anomaly detection with association discrepancy. arXiv 2021, arXiv:2110.02642. [Google Scholar]
- Song, J.; Kim, K.; Oh, J.; Cho, S. Memto: Memory-guided transformer for multivariate time series anomaly detection. Advances in Neural Information Processing Systems 2023, 36, 57947–57963. [Google Scholar]
- Yang, Y.; Zhang, C.; Zhou, T.; Wen, Q.; Sun, L. (2023, August). Dcdetector: Dual attention contrastive representation learning for time series anomaly detection. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp. 3033–3045).
- Kim, T.; Kim, J.; Tae, Y.; Park, C.; Choi, J.H.; Choo, J. (2021, May). Reversible instance normalization for accurate time-series forecasting against distribution shift. In International Conference on Learning Representations.
- Wang, X.; Pi, D.; Zhang, X.; Liu, H.; Guo, C. Variational transformer-based anomaly detection approach for multivariate time series. Measurement 2022, 191, 110791. [Google Scholar] [CrossRef]
- Ren, H.; Xu, B.; Wang, Y.; Yi, C.; Huang, C.; Kou, X.; ... Zhang, Q. (2019, July). Time-series anomaly detection service at microsoft. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery data mining (pp. 3009–3017).
- Garg, A.; Zhang, W.; Samaran, J.; Savitha, R.; Foo, C.S. An evaluation of anomaly detection and diagnosis in multivariate time series. IEEE Transactions on Neural Networks and Learning Systems 2021, 33, 2508–2517. [Google Scholar] [CrossRef] [PubMed]
- Kim, S.; Choi, K.; Choi, H.S.; Lee, B.; Yoon, S. (2022, June). Towards a rigorous evaluation of time-series anomaly detection. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 36, No. 7, pp. 7194–7201).
- Huet, A.; Navarro, J.M.; Rossi, D. (2022, August). Local evaluation of time series anomaly detection algorithms. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (pp. 635–645).
- Abdulaal, A.; Liu, Z.; Lancewicki, T. (2021, August). Practical approach to asynchronous multivariate time series anomaly detection and localization. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery data mining (pp. 2485–2494).
- Wu, R.; Keogh, E.J. Current time series anomaly detection benchmarks are flawed and are creating the illusion of progress. IEEE transactions on knowledge and data engineering 2021, 35, 2421–2429. [Google Scholar] [CrossRef]
- Goswami, M.; Challu, C.; Callot, L.; Minorics, L.; Kan, A. Unsupervised model selection for time-series anomaly detection. arXiv 2022, arXiv:2210.01078. [Google Scholar]
- Eldele, E.; Ragab, M.; Chen, Z.; Wu, M.; Kwoh, C.K.; Li, X.; Guan, C. Time-series representation learning via temporal and contextual contrasting. arXiv 2021, arXiv:2106.14112. [Google Scholar]
- Carmona, C.U.; Aubet, F.X.; Flunkert, V.; Gasthaus, J. Neural contextual anomaly detection for time series. arXiv 2021, arXiv:2107.07702. [Google Scholar]
- Kingma, D.P. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
- Hendrycks, D.; Mazeika, M.; Dietterich, T. Deep anomaly detection with outlier exposure. arXiv 2018, arXiv:1812.04606. [Google Scholar]
- Zhang, C.; Zhou, T.; Wen, Q.; Sun, L. (2022, October). TFAD: A decomposition time series anomaly detection architecture with time-frequency analysis. In Proceedings of the 31st ACM International Conference on Information Knowledge Management (pp. 2497–2507).
- Battaglia, P.W.; Hamrick, J.B.; Bapst, V.; Sanchez-Gonzalez, A.; Zambaldi, V.; Malinowski, M. ;... Pascanu, R. Relational inductive biases, deep learning, and graph networks. arXiv, 2018; arXiv:1806.01261. [Google Scholar]
- Audibert, J.; Michiardi, P.; Guyard, F.; Marti, S.; Zuluaga, M.A. (2020, August). Usad: Unsupervised anomaly detection on multivariate time series. In Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery data mining (pp. 3395–3404).
- Tuli, S.; Casale, G.; Jennings, N.R. Tranad: Deep transformer networks for anomaly detection in multivariate time series data. arXiv 2022, arXiv:2201.07284. [Google Scholar] [CrossRef]
- Wu, H.; Hu, T.; Liu, Y.; Zhou, H.; Wang, J.; Long, M. Timesnet: Temporal 2d-variation modeling for general time series analysis. arXiv 2022, arXiv:2210.02186. [Google Scholar]


| MSL | SMAP | SMD | PSM | GECCO | SWAN-SF | |
| Anomaly Transformer | 0.936 | 0.967 | 0.923 | 0.979 | - | - |
| DCdetector | 0.966 | 0.970 | 0.872 | 0.979 | 0.466 | 0.734 |
| MEMTO | 0.944 | 0.966 | 0.935 | 0.983 | - | - |
| AnomalyLLM | 0.956 | 0.965 | 0.958 | 0.997 | 0.620 | 0.804 |
| Ours | 0.943 | 0.975 | 0.939 | 0.993 | 0.920 | 0.801 |
| ABP | Acceleration | Air Temperature | ECG | EPG | |
| TS-TCC | 0.754 | 0.549 | 0.969 | 0.784 | 0.931 |
| THOC | 0.815 | 0.776 | 0.971 | 0.760 | 0.908 |
| NCAD | 0.794 | 0.849 | 0.758 | 0.735 | 0.789 |
| AnomalyLLM | 0.920 | 0.956 | 0.974 | 0.787 | 0.933 |
| Ours | 0.838 | 0.966 | 0.972 | 0.802 | 0.840 |
| Gait | NASA | Power Demand | RESP | Average | |
| TS-TCC | 0.794 | 0.511 | 0.763 | 0.560 | 0.735 |
| THOC | 0.784 | 0.896 | 0.775 | 0.389 | 0.786 |
| NCAD | 0.858 | 0.861 | 0.723 | 0.613 | 0.776 |
| AnomalyLLM | 0.871 | 0.961 | 0.886 | 0.736 | 0.892 |
| Ours | 0.867 | 0.926 | 0.865 | 0.763 | 0.871 |
| SegND | Positional Encoding | MSL | SMAP | SMD |
| ✗ | ✗ | 0.909 | 0.740 | 0.830 |
| ✗ | APE (sinusoid) | 0.846 | 0.734 | 0.794 |
| ✗ | APE (learnable) | 0.902 | 0.710 | 0.845 |
| ✓ | ✗ | 0.943 | 0.975 | 0.939 |
| ✓ | APE (sinusoid) | 0.914 | 0.966 | 0.904 |
| ✓ | APE (learnable) | 0.924 | 0.957 | 0.923 |
| SegND | Positional Encoding | ECG | Gait | RESP |
| ✗ | ✗ | 0.739 | 0.783 | 0.693 |
| ✗ | APE (sinusoid) | 0.772 | 0.768 | 0.702 |
| ✗ | APE (learnable) | 0.746 | 0.819 | 0.703 |
| ✓ | ✗ | 0.802 | 0.867 | 0.763 |
| ✓ | APE (sinusoid) | 0.752 | 0.820 | 0.729 |
| ✓ | APE (learnable) | 0.759 | 0.850 | 0.721 |
| Sub-Dataset-Level | Dataset-Level | MSL | SMAP | SMD |
| ✗ | ✗ | 0.943 | 0.975 | 0.939 |
| ✗ | min-max | 0.919 | 0.925 | 0.931 |
| ✗ | z-score | 0.887 | 0.707 | 0.877 |
| min-max | ✗ | 0.611 | 0.842 | 0.812 |
| min-max | min-max | 0.605 | 0.830 | 0.807 |
| min-max | z-score | 0.750 | 0.640 | 0.740 |
| z-score | ✗ | 0.923 | 0.692 | 0.811 |
| z-score | min-max | 0.922 | 0.823 | 0.808 |
| z-score | z-score | 0.887 | 0.655 | 0.799 |
| Sub-Dataset-Level | Dataset-Level | ECG | EPG | Gait | NASA |
| ✗ | ✗ | 0.672 | 0.840 | 0.725 | 0.926 |
| ✗ | min-max | 0.657 | 0.829 | 0.709 | 0.872 |
| ✗ | z-score | 0.670 | 0.812 | 0.712 | 0.892 |
| min-max | ✗ | 0.792 | 0.723 | 0.867 | 0.846 |
| min-max | min-max | 0.779 | 0.726 | 0.850 | 0.862 |
| min-max | z-score | 0.802 | 0.707 | 0.857 | 0.850 |
| z-score | ✗ | 0.735 | 0.667 | 0.825 | 0.914 |
| z-score | min-max | 0.756 | 0.677 | 0.815 | 0.883 |
| z-score | z-score | 0.764 | 0.711 | 0.831 | 0.870 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).