Submitted:
24 March 2025
Posted:
24 March 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
1.1. Traditional Supply Chain Anomaly Detection
2.1. Impact of Digitization and Machine Learning on Supply Chain Anomaly Detection
2.3. Analysis and Clustering of Large-Scale Time Series Data
3. Methodology
- Data preprocessing: Cleaning, transforming, and filtering raw data for effective anomaly detection.
- Anomaly detection: Identifying abnormal behavior in processed data using various algorithms or rules.
- Early warning: Notifying relevant personnel or systems of detected anomalies to facilitate timely corrective actions.
3.1. Algorithm Principle and Mathematical Formula
3.1.1. Z-Value
- X is the data point,
- μ is the mean,
- σ is the standard deviation.
- Q1 is the first quartile,
- Q2 is the third quartile.
3.1.2. DBSCAN Clustering Method
- ε (epsilon): Defines the neighborhood radius.
- MinPits: Minimum number of points required to form a dense region.
3.2. Regular Method
3.2.1. Fixed Threshold Method
3.2.2. Clustering Method
3.3. The Difference Between Anomaly Detection and Early Warning
- Anomaly detection refers to the process of identifying abnormal behavior in data through some algorithm or rule. The purpose of anomaly detection is to find out the different or inconsistent behavior compared with the normal behavior, to conduct subsequent early warning processing.
- Early warning refers to the process of alerting the identified abnormal behavior to the relevant person or system. The purpose of early warning is to remind relevant personnel or systems through alarm information so that appropriate measures can be taken to deal with abnormal behavior.
3.4. Application Scenarios of Anomaly Detection and Early Warning [11]
- Finance: Fraud detection, market volatility monitoring, credit risk assessment.
- Network security: Identifying cyberattacks, malware, and network anomalies.
- Manufacturing and logistics: Detecting equipment failures, inventory shortages, and supply chain disruptions.
- Healthcare: Disease diagnosis, medical resource allocation, and case management.
- Transportation: Identifying traffic congestion, cargo damage, and route deviations.
3.5. The Challenge of Anomaly Detection and Early Warning
- Data quality and integrity: Anomaly detection and early warning require high-quality and complete data, but there may be problems in the actual data such as missing, errors, and noise, which will affect the effect of detection and early warning.
- Algorithm complexity and efficiency: anomaly detection and early warning require efficient algorithms, but the complexity of the algorithm in practice may affect the real-time and accuracy.
- Model update and maintenance: Anomaly detection and early warning require regular updates and maintenance of the model to adapt to changing application scenarios and data characteristics.
- Human intervention and collaboration: Anomaly detection and early warning require human intervention and collaboration to ensure the effectiveness and timeliness of early warning.
4. Experiment and Conclusions
- Defining and detecting anomalous demand.
- Effectively handling detected anomalies.
- Therefore, dealing with anomalies in demand becomes an important step to improve the accuracy of demand forecasting. Specifically, handling exceptional requirements involves two key issues [10]:
- How to define and detect abnormal requirements?
- How to deal with abnormal demand effectively after it is detected?
4.1. IQR Model
- Use the IQR (quartile range) method to detect abnormal demand, smooth abnormal demand, and adjust it using mean, median, or moving average.
- The IQR method is used to detect the demand changes in the adjacent 7 periods, and the abnormal demand detected is smoothed.

4.2. Abnormal Detection of Warehouse Requirement Data

4.3. Demand Forecasting and Anomaly Detection Results Analysis
4.4. Prediction Model Evaluation
4.5. KPI Result Analysis
- The following is a comparison of KPI results based on the above two demand forecasting methods:
- Coefficient of variation: The first method has a higher coefficient of variation, indicating that the demand data without exception processing is more volatile; The second method is smoothed by moving average, and the coefficient of variation is significantly reduced, indicating that the stability of the data is improved.
- Root mean square error (RMSE%): The second method has a lower root mean square error, indicating that the forecast results are closer to the actual demand.
- Mean absolute value error (MAE%): The second method also performs better in terms of mean absolute value error, further demonstrating its superiority.
5. Conclusions
- Impact of Prediction Models on Forecast Accuracy: The choice of prediction model significantly affects the forecasting performance. In this experiment, we compared the method of “demand anomaly processing and prediction error-based anomaly detection” with “demand anomaly detection and moving average processing + prediction error-based anomaly detection.” The comparison shows differences in forecast accuracy, as observed from the root mean square error (RMSE) and mean absolute error (MAE):
- Root Mean Square Error (RMSE): The second method (demand anomaly detection and moving average processing) demonstrated lower prediction error (0.20184 < 0.213496), indicating better demand data prediction accuracy.
- Mean Absolute Error (MAE): The first method (no anomaly processing on demand) showed a slightly better performance (0.144684 < 0.146127), reflecting a more accurate representation of the actual data when anomalies were processed minimally.
- Role of Anomaly Detection and Processing: The smoothing of outliers in the demand data, particularly using the moving average method, can improve prediction results to a certain extent. Although the second method (with anomaly detection and moving average smoothing) performed better in terms of RMSE, the slight increase in MAE suggests that anomaly detection and processing, particularly when smoothing data, can result in some loss of information. The trade-off between improving short-term forecast accuracy and retaining key data features for long-term predictions remains an important consideration.
- Trade-off Between Short-Term and Long-Term Forecasting: While short-term accuracy is improved through smoothing techniques like the moving average, the smoothing process could interfere with long-term trends and seasonal patterns. For instance, demand forecasting that accounts for long-term growth or cyclical variations may be compromised if smoothing overly flattens the data. This paper calls for future research to examine how data smoothing techniques can be adapted to preserve long-term trends, ensuring that the model strikes an effective balance between short-term accuracy and long-term forecasting.
- Application of KPI Functions: By defining KPI functions, we can quantitatively evaluate the effectiveness of different forecasting methods in terms of RMSE and MAE. This allows for the flexible selection of anomaly detection methods depending on the requirements of the specific forecasting scenario, optimizing the model’s performance.
References
- Charavanan, Varun. Investigating the Financial Implications of Converting Manufacturing from Existing Portfolio to PPE Products During COVID-19 for Canadian Manufacturing Companies-Ontario Case Study. MS thesis. University of Windsor (Canada), 2023.
- Chen, Yuh-Min, Tsung-Yi Chen, and Jyun-Sian Li. “A Machine Learning-Based Anomaly Detection Method and Blockchain-Based Secure Protection Technology in Collaborative Food Supply Chain.” International Journal of e-Collaboration (IJeC)19.1 (2023): 1-24. [CrossRef]
- Glaser, Ana E., Jake P. Harrison, and David Josephs. “Anomaly Detection Methods to Improve Supply Chain Data Quality and Operations.” SMU Data Science Review 6.1 (2022): 3.
- Liu, H., Li, N., Zhao, S., Xue, P., Zhu, C., & He, Y. (2024). The impact of supply chain and digitization on the development of environmental technologies: Unveiling the role of inflation and consumption in G7 nations. Energy Economics, 108165.
- Gong, C., Zhong, Y., Zhao, S., & Liu, Y. (2024). Application of Machine Learning in Predicting Extreme Volatility in Financial Markets: Based on Unstructured Data. [CrossRef]
- Ding, R., Wang, Q., Dang, Y., Fu, Q., Zhang, H., & Zhang, D. (2015). Yading: Fast clustering of large-scale time series data. Proceedings of the VLDB Endowment, 8(5), 473-484. [CrossRef]
- Nguyen, H. Du, et al. “Forecasting and Anomaly Detection approaches using LSTM and LSTM Autoencoder techniques with the applications in supply chain management.” International Journal of Information Management 57 (2021): 102282. [CrossRef]
- Zeevenhoven, M. H. J. (2020). Anomaly detection to improve the quality of customer forecast data (Doctoral dissertation, Master’s thesis, Eindhoven University of Technology).
- Davuluri, M. (2023). Optimizing Supply Chain Efficiency Through Machine Learning-Driven Predictive Analytics. International Meridian Journal, 5(5).
- El Filali, Aicha, et al. “Machine Learning Applications in Supply Chain Management: A Deep Learning Model Using an Optimized LSTM Network for Demand Forecasting.” International Journal of Intelligent Engineering & Systems 15.2 (2022).
- Chuang, H. H. C., Chou, Y. C., & Oliva, R. (2021). Cross-item learning for volatile demand forecasting: An intervention with predictive analytics. Journal of Operations Management, 67(7), 828-852. [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).