Submitted:
23 September 2024
Posted:
24 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Abnormal Behaviors of Pedestrians in Public Places
2.1. Definition of Crowd Levels
2.2. Definition and Classification of Abnormal Behaviors
2.2.1. Definition of Abnormal Behaviors
2.2.2. Classification of Abnormal Behaviors
3. Challenges Brought by Crowd Density
3.1. Target Occlusion Problem
3.2. Motion Blur and Postural Diversity
4. Research Status of Crowd Abnormal Behavior Recognition
4.1. Traditional Methods
4.1.1. Methods Based on Statistical Models
4.1.2. Methods Based on Motion Features
4.1.3. Methods Based on Dynamic Models
4.1.4. Methods Based on Clustering Discrimination
4.2. Deep Learning-Based Methods
4.2.1. Methods Based on Convolutional Neural Network
4.2.2. Methods Based on Autoencoders

4.2.3. Methods Based on Generative Adversarial Networks
4.2.4. Methods Based on Long Short-Term Memory Network



4.2.5. Methods Based on Self-Attention Mechanism





4.3. Mainstream Software Tools
5. Comparative Analysis of Different Algorithm Experiments
5.1. Experimental Datasets
5.1.1. UCSD
5.1.2. UMN
5.1.3. ShanghaiTech
5.1.4. CUHK-Avenue
5.2. Experimental Evaluation Indicators
5.3. Experimental Comparative Analysis
6. Summary and Research Prospects
Acknowledgements
References
- Haghani, M.; Lovreglio, R. Data-based tools can prevent crowd crushes. Science 2022, 378, 1060–1061. [Google Scholar] [CrossRef] [PubMed]
- Luo, L.; Xie, S.; Yin, H.; Peng, C.; Ong, Y.-S. Detecting and Quantifying Crowd-level Abnormal Behaviors in Crowd Events. IEEE Trans. Inf. Forensics Secur. 2024, PP, 1–1. [Google Scholar] [CrossRef]
- Fadhel, M.A.; Duhaim, A.M.; Saihood, A.; Sewify, A.; Al-Hamadani, M.N.; Albahri, A.; Alzubaidi, L.; Gupta, A.; Mirjalili, S.; Gu, Y. Comprehensive systematic review of information fusion methods in smart cities and urban environments. Inf. Fusion 2024, 107. [Google Scholar] [CrossRef]
- Zhang Bingbing, Ge Shuyu, Wang Qilong, Li Peihua. Research on behavior recognition method based on multi-order information fusion. Acta Automatica Sinica, 2021, 47(3): 609-619. [CrossRef]
- Jiang Jun, Zhang Zhuojun, Gao Mingliang, et al. A crowd abnormal behavior detection algorithm based on pulse line flow convolutional neural network. Journal of Engineering Science and Technology, 2020, 52(6):215-222.
- Xiao Jinsheng, Shen Mengyao, Jiang Mingjun, Lei Junfeng, Bao Zhenyu. Abnormal behavior detection in surveillance videos by fusing bag attention mechanism. Acta Automatica Sinica, 2022, 48(12): 2951−2959. [CrossRef]
- Alam, E.; Sufian, A.; Dutta, P.; Leo, M. Vision-based human fall detection systems using deep learning: A review. Comput. Biol. Med. 2022, 146, 105626. [Google Scholar] [CrossRef]
- Yang Fan. Xiao Bin, Yu Zhiwen. Review of anomaly detection and modeling in surveillance videos. Journal of Computer Research and Development, 2021(012):058.
- Li Y C, Jia R S, Hu Y X, et al. A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration. IEEE/CAA Journal of Automatica Sinica, 2024, 11(4): 965-981.
- Zhao, R.; Dong, D.; Wang, Y.; Li, C.; Ma, Y.; Enriquez, V.F. Image-Based Crowd Stability Analysis Using Improved Multi-Column Convolutional Neural Network. IEEE Trans. Intell. Transp. Syst. 2021, 23, 5480–5489. [Google Scholar] [CrossRef]
- Zhao Rongyong, Wei Bingyu, Zhu Wenjie, et al. Review of pedestrian abnormal behavior recognition methods in public places. China Safety Science Journal, 2024, 34(2):83-93. [CrossRef]
- Wang Duorui, Du Yang, Dong Lanfang, Hu Weiming, Li Bing. Small sample learning algorithm based on feature transformation and metric network. Acta Automatica Sinica, 2024, 50(7): 1305−1314.
- Ng S H, Platow M J. The violent turn in non-violent collective action: What happens?. Asian Journal of Social Psychology, 2024.
- Chen Chong, Bai Shuo, Huang Lida, et al. Research on passenger flow monitoring and early warning in crowded places based on video analysis. Journal of Safety Science and Technology, 2020, 16 (04): 143-148.
- Wang, X.; Li, Y. Edge Detection and Simulation Analysis of Multimedia Images Based on Intelligent Monitoring Robot. Informatica 2024, 48. [Google Scholar] [CrossRef]
- Chen X, Yan B, Zhu J, et al. High-performance transformer tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45(7): 8507-8523.
- Farooq, M.U.; Saad, M.N.B.M.; Malik, A.S.; Ali, Y.S.; Khan, S.D. Motion estimation of high density crowd using fluid dynamics. Imaging Sci. J. 2020, 68, 141–155. [Google Scholar] [CrossRef]
- Zhou Zhiqiang, Sun Riming, Guo Chenglong, et al. Hierarchical motion estimation method for spatially unstable targets under Gaussian mixture model[J/OL]. Acta Optica Sinica:1-19[2024-05-26].http://kns.cnki.net/kcms/detail/31.1252.O4.20240517.1540.029.html.
- Zhou, Y.; Liu, C.; Ding, Y.; Yuan, D.; Yin, J.; Yang, S.-H. Crowd Descriptors and Interpretable Gathering Understanding. IEEE Trans. Multimedia 2024, 26, 8651–8664. [Google Scholar] [CrossRef]
- Alhothali, A.; Balabid, A.; Alharthi, R.; Alzahrani, B.; Alotaibi, R.; Barnawi, A. Anomalous event detection and localization in dense crowd scenes. Multimedia Tools Appl. 2022, 82, 15673–15694. [Google Scholar] [CrossRef]
- Nayan, N.; Sahu, S.S.; Kumar, S. Detecting anomalous crowd behavior using correlation analysis of optical flow. Signal, Image Video Process. 2019, 13, 1233–1241. [Google Scholar] [CrossRef]
- Aziz, Z.; Bhatti, N.; Mahmood, H.; Zia, M. Video anomaly detection and localization based on appearance and motion models. Multimedia Tools Appl. 2021, 80, 1–21. [Google Scholar] [CrossRef]
- Matkovic, F.; Ivasic-Kos, M.; Ribaric, S. A new approach to dominant motion pattern recognition at the macroscopic crowd level. Eng. Appl. Artif. Intell. 2022, 116. [Google Scholar] [CrossRef]
- Ganga, B.; B. T., L.; K.R., V. Object detection and crowd analysis using deep learning techniques: Comprehensive review and future directions. Neurocomputing 2024, 597. [Google Scholar] [CrossRef]
- Madan, N.; Ristea, N.-C.; Ionescu, R.T.; Nasrollahi, K.; Khan, F.S.; Moeslund, T.B.; Shah, M. Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 46, 525–542. [Google Scholar] [CrossRef]
- Chai, L.; Liu, Y.; Liu, W.; Han, G.; He, S. CrowdGAN: Identity-Free Interactive Crowd Video Generation and Beyond. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 44, 2856–2871. [Google Scholar] [CrossRef]
- Song, X.; Chen, K.; Li, X.; Sun, J.; Hou, B.; Cui, Y.; Zhang, B.; Xiong, G.; Wang, Z. Pedestrian Trajectory Prediction Based on Deep Convolutional LSTM Network. IEEE Trans. Intell. Transp. Syst. 2020, 22, 3285–3302. [Google Scholar] [CrossRef]
- Huang Y, Abeliuk A, Morstatter F, et al. Anchor attention for hybrid crowd forecasts aggregation. arXiv:2003.12447, 2020.
- He, F.; Xiang, Y.; Zhao, X.; Wang, H. Informative scene decomposition for crowd analysis, comparison and simulation guidance. ACM Trans. Graph. 2020, 39. [Google Scholar] [CrossRef]
- Afiq A A, Zakariya M A, Saad M N, et al. A review on classifying abnormal behavior in crowd scene. Journal of Visual Communication and Image Representation, 2019, 58: 285-303.
- Goel S, Koundal D, Nijhawan R. Learning Models in Crowd Analysis: A Review. Archives of Computational Methods in Engineering, 2024: 1-19.
- Rajasekaran, G.; Sekar, J.R. Abnormal Crowd Behavior Detection Using Optimized Pyramidal Lucas-Kanade Technique. Intell. Autom. Soft Comput. 2023, 35, 2399–2412. [Google Scholar] [CrossRef]
- Gündüz M Ş, Işık G. A new YOLO-based method for real-time crowd detection from video and performance analysis of YOLO models. Journal of Real-Time Image Processing, 2023, 20(1): 5.
- Miao, Y.; Yang, J.; Alzahrani, B.; Lv, G.; Alafif, T.; Barnawi, A.; Chen, M. Abnormal Behavior Learning Based on Edge Computing toward a Crowd Monitoring System. IEEE Netw. 2022, 36, 90–96. [Google Scholar] [CrossRef]
- Wang Kunlun, Liu Wencan, He Xiaohai, et al. A motion feature descriptor for abnormal behavior detection. Computer Science, 2020, 47(4):119-124. [CrossRef]
- Qin Yue, Shi Yuexiang. Human behavior recognition based on two-stream network fusion and spatio-temporal convolution. Computing Technology and Automation, 2021, 40 (02): 140-147. [CrossRef]
- Jiang, J.; Wang, X.; Gao, M.; Pan, J.; Zhao, C.; Wang, J. Abnormal behavior detection using streak flow acceleration. Appl. Intell. 2022, 52, 10632–10649. [Google Scholar] [CrossRef]
- Clarke K, C. Cellular automata and agent-based models[M]//Handbook of regional science. Berlin, Heidelberg: Springer Berlin Heidelberg, 2021: 1751-1766.
- Li Changhua, Bi Chenggong, Li Zhijie. Crowd evacuation model based on improved PSO algorithm. Journal of System Simulation, 2020, 32 (06): 1000-1008. [CrossRef]
- Chang, D.; Cui, L.; Huang, Z. A Cellular-Automaton Agent-Hybrid Model for Emergency Evacuation of People in Public Places. IEEE Access 2020, 8, 79541–79551. [Google Scholar] [CrossRef]
- Chebi H, Dalila A, Mohamed K. Dynamic detection of abnormalities in video analysis of crowd behavior with DBSCAN and neural networks. Advances in Science, Technology and Engineering Systems Journal, 2020, 1(5): 56-63.
- Tay N C, Connie T, Ong T S, et al. A robust abnormal behavior detection method using convolutional neural network[C]//Computational Science and Technology: 5th ICCST 2018, Kota Kinabalu, Malaysia, 29-30 August 2018. Springer Singapore, 2019: 37-47.
- Koonce B, Koonce B. Vgg network. Convolutional Neural Networks with Swift for Tensorflow: Image Recognition and Dataset Categorization, 2021: 35-50.
- Koonce B, Koonce B. ResNet 50. Convolutional neural networks with swift for tensorflow: image recognition and dataset categorization, 2021: 63-72.
- Behera N K S, Sa P K, Muhammad K, et al. Large-Scale Person Re-Identification for Crowd Monitoring in Emergency. IEEE Transactions on Automation Science and Engineering, 2023.
- Wang, S.; Pu, Z.; Li, Q.; Wang, Y. Estimating crowd density with edge intelligence based on lightweight convolutional neural networks. Expert Syst. Appl. 2022, 206. [Google Scholar] [CrossRef]
- Dai F, Huang P, Mo Q, et al. ST-InNet: Deep spatio-temporal inception networks for traffic flow prediction in smart cities. IEEE Transactions on Intelligent Transportation Systems, 2022, 23(10): 19782-19794.
- Liu W, Luo W, Li Z, et al. Margin Learning Embedded Prediction for Video Anomaly Detection with A Few Anomalies[C]//IJCAI. 2019, 3: 023-3.
- Bhuiyan, R.; Abdullah, J.; Hashim, N.; Al Farid, F.; Uddin, J. Hajj pilgrimage abnormal crowd movement monitoring using optical flow and FCNN. J. Big Data 2023, 10, 1–21. [Google Scholar] [CrossRef]
- Garg, S.; Sharma, S.; Dhariwal, S.; Priya, W.D.; Singh, M.; Ramesh, S. Human crowd behaviour analysis based on video segmentation and classification using expectation–maximization with deep learning architectures. Multimedia Tools Appl. 2024, 1–23. [Google Scholar] [CrossRef]
- Singh, K.; Rajora, S.; Vishwakarma, D.K.; Tripathi, G.; Kumar, S.; Walia, G.S. Crowd anomaly detection using Aggregation of Ensembles of fine-tuned ConvNets. Neurocomputing 2019, 371, 188–198. [Google Scholar] [CrossRef]
- Sabokrou, M.; Fayyaz, M.; Fathy, M.; Moayed, Z.; Klette, R. Deep-anomaly: Fully convolutional neural network for fast anomaly detection in crowded scenes. Comput. Vis. Image Underst. 2018, 172, 88–97. [Google Scholar] [CrossRef]
- Hu, X.; Dai, J.; Huang, Y.; Yang, H.; Zhang, L.; Chen, W.; Yang, G.; Zhang, D. A weakly supervised framework for abnormal behavior detection and localization in crowded scenes. Neurocomputing 2020, 383, 270–281. [Google Scholar] [CrossRef]
- Wang Hongyan, Zhou Mengxing. Crowd abnormal behavior detection based on optical flow and trajectory. Journal of Jilin University (Engineering Edition), 2020, 50 (06): 2229-2237. [CrossRef]
- Hu Xuemin, Chen Qin, Yang Li, et al. Crowd abnormal behavior detection and location based on deep spatio-temporal convolutional neural network. Application Research of Computers, 2020, 37(3):891-895. [CrossRef]
- Gong, M.; Zeng, H.; Xie, Y.; Li, H.; Tang, Z. Local distinguishability aggrandizing network for human anomaly detection. Neural Networks 2020, 122, 364–373. [Google Scholar] [CrossRef]
- Pinaya W H L, Vieira S, Garcia-Dias R, et al. Autoencoders[M]//Machine learning. Academic Press, 2020: 193-208.
- Tyagi, B.; Nigam, S.; Singh, R. A Review of Deep Learning Techniques for Crowd Behavior Analysis. Arch. Comput. Methods Eng. 2022, 29, 5427–5455. [Google Scholar] [CrossRef]
- Xu Tao, Tian Chongyang. Review of abnormal behavior detection of crowd based on deep learning. Computer Science, 2021, 48(9):125-134. [CrossRef]
- Zhang, X.; Ma, D.; Yu, H.; Huang, Y.; Howell, P.; Stevens, B. Scene perception guided crowd anomaly detection. Neurocomputing 2020, 414, 291–302. [Google Scholar] [CrossRef]
- Zhou, S.; Shi, R.; Wang, L. Extracting macroscopic quantities in crowd behaviour with deep learning. Phys. Scr. 2024, 99, 065213. [Google Scholar] [CrossRef]
- Nguyen T N, Meunier J. Anomaly detection in video sequence with appearance-motion correspondence[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2019: 1273-1283.
- Wei H, Li K, Li H, et al. Detecting video anomaly with a stacked convolutional LSTM framework[C]//International Conference on Computer Vision Systems. Cham: Springer International Publishing, 2019: 330-342.
- Xiao Jinsheng, Guo Haowen, Xie Honggang, et al. Probabilistic memory autoencoder network for abnormal behavior detection in surveillance videos. Journal of Software, 2023, 34(9): 4362-4377.
- Nawaratne, R.; Alahakoon, D.; De Silva, D.; Yu, X. Spatiotemporal Anomaly Detection Using Deep Learning for Real-Time Video Surveillance. IEEE Trans. Ind. Informatics 2019, 16, 393–402. [Google Scholar] [CrossRef]
- Roka, S.; Diwakar, M. Deep stacked denoising autoencoder for unsupervised anomaly detection in video surveillance. J. Electron. Imaging 2023, 32, 033015. [Google Scholar] [CrossRef]
- Li, J.; Huang, Q.; Du, Y.; Zhen, X.; Chen, S.; Shao, L. Variational Abnormal Behavior Detection With Motion Consistency. IEEE Trans. Image Process. 2021, 31, 275–286. [Google Scholar] [CrossRef]
- Ashfahani, A.; Pratama, M.; Lughofer, E.; Ong, Y.-S. DEVDAN: Deep evolving denoising autoencoder. Neurocomputing 2019, 390, 297–314. [Google Scholar] [CrossRef]
- Wang, J.; Xia, L. Abnormal behavior detection in videos using deep learning. Clust. Comput. 2018, 22, 9229–9239. [Google Scholar] [CrossRef]
- Wang, T.; Qiao, M.; Lin, Z.; Li, C.; Snoussi, H.; Liu, Z.; Choi, C. Generative Neural Networks for Anomaly Detection in Crowded Scenes. IEEE Trans. Inf. Forensics Secur. 2018, 14, 1390–1399. [Google Scholar] [CrossRef]
- Lü Chengkan, Shen Fei, Zhang Zhengtao, et al. Review of the Research Status of Image Anomaly Detection. Acta Automatica Sinica, 2022, 48(6):1402-1428. [CrossRef]
- Song, H.; Sun, C.; Wu, X.; Chen, M.; Jia, Y. Learning Normal Patterns via Adversarial Attention-Based Autoencoder for Abnormal Event Detection in Videos. IEEE Trans. Multimedia 2019, 22, 2138–2148. [Google Scholar] [CrossRef]
- Chen, D.; Yue, L.; Chang, X.; Xu, M.; Jia, T. NM-GAN: Noise-modulated generative adversarial network for video anomaly detection. Pattern Recognit. 2021, 116. [Google Scholar] [CrossRef]
- Tang, Y.; Zhao, L.; Zhang, S.; Gong, C.; Li, G.; Yang, J. Integrating prediction and reconstruction for anomaly detection. Pattern Recognit. Lett. 2019, 129, 123–130. [Google Scholar] [CrossRef]
- Lee S, Kim H G, Ro Y M. BMAN: Bidirectional multi-scale aggregation networks for abnormal event detection. IEEE Transactions on Image Processing, 2019, 29: 2395-2408.
- Schlegl T, Seeböck P, Waldstein S M, et al. f-AnoGAN: Fast unsupervised anomaly detection with generative adversarial networks. Medical image analysis, 2019, 54: 30-44.
- Meng B, Wang L, Li D W. Detection Method for Crowd Abnormal Behavior Based on Long Short-Term Memory Network[C]//Advances in Intelligent Information Hiding and Multimedia Signal Processing: Proceeding of the 16th International Conference on IIHMSP in conjunction with the 13th international conference on FITAT, November 5-7, 2020, Ho Chi Minh City, Vietnam, Volume 1. Springer Singapore, 2021: 305-313.
- Wu Guangli, Guo Zhenzhou, Li Leiting, et al. Video Anomaly Event Detection Fusing FCN and LSTM. Journal of Shanghai Jiao Tong University, 2021, 55(5): 607.
- Sabih, M.; Vishwakarma, D.K. Crowd anomaly detection with LSTMs using optical features and domain knowledge for improved inferring. Vis. Comput. 2021, 38, 1719–1730. [Google Scholar] [CrossRef]
- Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. Advances in Neural Information Processing Systems, 2017, 30.
- Ye Z, Li Y, Cui Z, et al. Unsupervised Video Anomaly Detection with Self-Attention Based Feature Aggregating[C]//2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC). IEEE, 2023: 3551-3556.
- Zhang Hongmin, Fang Xiaobing, Zhuang Xu. Video Human Abnormal Behavior Detection Model of Autoencoder Fusing Attention Mechanism. Laser Journal, 2023, 44 (02): 69-75. [CrossRef]
- Zhang J, Qi X, Ji G. Self Attention Based Bi-Directional Long Short-Term Memory Auto Encoder for Video Anomaly Detection[C]//2021 Ninth International Conference on Advanced Cloud and Big Data (CBD). IEEE, 2022: 107-112.
- Zhang, W.; Wang, G.; Huang, M.; Wang, H.; Wen, S. Generative Adversarial Networks for Abnormal Event Detection in Videos Based on Self-Attention Mechanism. IEEE Access 2021, 9, 124847–124860. [Google Scholar] [CrossRef]
- Singh, R.; Sethi, A.; Saini, K.; Saurav, S.; Tiwari, A.; Singh, S. Attention-guided generator with dual discriminator GAN for real-time video anomaly detection. Eng. Appl. Artif. Intell. 2024, 131. [Google Scholar] [CrossRef]
- Sharma, V.K.; Mir, R.N.; Singh, C. Scale-aware CNN for crowd density estimation and crowd behavior analysis. Comput. Electr. Eng. 2023, 106. [Google Scholar] [CrossRef]
- Zhao, R.; Wang, Y.; Jia, P.; Zhu, W.; Li, C.; Ma, Y.; Li, M. Abnormal Behavior Detection Based on Dynamic Pedestrian Centroid Model: Case Study on U-Turn and Fall-Down. IEEE Trans. Intell. Transp. Syst. 2023, 24, 8066–8078. [Google Scholar] [CrossRef]
- UCSD Anomaly Detection Dataset, http://www.svcl.ucsd.edu/projects/anomaly/dataset.htm.
- UMN Crowd Dataset, http://mha.cs.umn.edu/projevents.shtml#crowd.
- Cao, C.; Lu, Y.; Zhang, Y. Context Recovery and Knowledge Retrieval: A Novel Two-Stream Framework for Video Anomaly Detection. IEEE Trans. Image Process. 2024, 33, 1810–1825. [Google Scholar] [CrossRef]
- Wang, L.; Wang, X.; Li, M.; Liu, F. Anomaly Detection Method Based on Temporal Spatial Information Enhancement. Meas. Sci. Technol. 2023, 35. [Google Scholar] [CrossRef]





















| Crowd States | Density Range (p/m²) | Crowd Level |
|---|---|---|
| Free Flow | < 0.5 | Very Low (VL) |
| Restricted Flow | 0.50 - 0.80 | Low (L) |
| Dense Flow | 0.81 - 1.26 | Medium (M) |
| Very Dense Flow | 1.27 - 2.00 | High (H) |
| Congestion | > 2.00 | Very High (VH) |
| Method Categories | Design Ideas | Advantages | Limitations | References |
|---|---|---|---|---|
| Statistical Model | Build the background model through statistical methods | Strong pattern recognition ability and capable of handling the complex dynamics of time series data | Complex parameter estimation and sensitivity to initial values | [29,30,31,32,33] |
| Motion Feature | Identify abnormalities by analyzing the motion characteristics (such as speed, acceleration, etc.) of individuals or groups | Has high sensitivity and specificity for the recognition of motion patterns | Vulnerable to environmental factors interference (such as illumination changes, occlusion) | [34,35,36,37] |
| Dynamic Model | Build a microscopic model of group behavior by simulating the interaction between individuals and environmental constraints | Suitable for simulation and prediction, and model parameters can be adjusted to adapt to different scenarios | High model complexity and unable to fully simulate |
[38,39,40] |
| Clustering Discrimination | Group the behavior data into different clusters and identify abnormalities based on the differences between clusters | Unsupervised learning is applicable in the case of lacking labeled data | Poor processing ability for time series data | [41] |
| Method | Design Ideas | Advantages | Limitations | References |
|---|---|---|---|---|
| CNN | Through local receptive fields and pooling operations, capture the spatial hierarchical features of the original data | Good at extracting local features and combining them into more complex patterns | Lack of time series processing capabilities | [Error! Reference source not found.-Error! Reference source not found.] |
| AutoEncoder | An unsupervised learning method that captures the main features of the data by encoding and decoding the original data | Reduce the data dimension and extract key features | Easily lead to overfitting of training data | [Error! Reference source not found.-Error! Reference source not found.] |
| GNN | It is composed of a generator and a discriminator, and the two play a game. | Reconstruct and generate new samples close to real data | Prone to model collapse and non-convergence of training | [Error! Reference source not found.-Error! Reference source not found.] |
| LSTM | Introduces forget gates, input gates, and output gates to solve the problems of gradient vanishing and gradient explosion | Good at processing sequence data of any length | The structure is complex, and training and reasoning take a long time | [Error! Reference source not found.-Error! Reference source not found.] |
| Self-Attention | Automatically assigns different attention weights according to different parts of the input sequence | Capture global interdependencies | Easy to overfit on small datasets | [Error! Reference source not found.-Error! Reference source not found.] |
| Name | Scene Description | Scale | Resolution | Abnormal Behavior | Object | Limitation |
|---|---|---|---|---|---|---|
| UCSD [Error! Reference source not found.] | The crowd movement on the sidewalk from the perspective of the surveillance camera | Ped1 14000frame 34 Training segments 36 Test segments |
238*158 | Fast moving, Reverse driving Riding a bicycle, Driving a car, Sitting in a wheelchair, skateboarding, etc. | individual | Low resolution; few types of abnormal behaviors; relatively simple background |
| Ped2 4560 frame 12 Test segments |
360*240 | |||||
| UMN [Error! Reference source not found.] | Video clips of pedestrian activities in different backgrounds such as campuses, shopping malls, and streets | 8010 frame 11 Video segment |
320*240 | The crowd suddenly scattered, ran, and gathered. | group | Simple background; Limited abnormal types |
| Shanghai-Tech [Error! Reference source not found.] | 13 campus area scenes with complex lighting conditions and camera angles | 317398 frame 130 Video segment |
856*480 | Crowd gathering, fighting, running, cycling. | group | Abnormal events are repetitive; The annotations have errors |
| CUHK-Avenue [Error! Reference source not found.] | Video surveillance clips of outdoor public places | 30625 frame 16 Training segments 21 Test segments |
640*360 | Pedestrians fighting, throwing objects, running | individual Vehicle |
The shooting angle is single; The resolution is low |
| Classification | Method | Frame level AUC / EER / % | Year | |||||||||
| UCSDPed1 | UCSDPed2 | UMN | Shanghai Tech | Avenue | ||||||||
| EER | AUC | EER | AUC | EER | AUC | EER | AUC | EER | AUC | |||
| CNN | FCN [Error! Reference source not found.] | -- | -- | 11.00 | -- | -- | -- | -- | -- | -- | -- | 2019 |
| ABDL [Error! Reference source not found.] | 22.00 | -- | 16.00 | -- | 5.80 | 98.90 | -- | -- | 21.00 | 84.50 | 2020 | |
| TS-CNN [Error! Reference source not found.] | -- | -- | -- | -- | -- | 99.60 | -- | -- | -- | -- | 2020 | |
| DSTCNN [Error! Reference source not found.] | -- | 99.74 | -- | 99.94 | -- | -- | -- | -- | ---- | -- | 2020 | |
| LDA-Net [Error! Reference source not found.] | -- | -- | 5.63 | 97.87 | -- | -- | -- | -- | -- | -- | 2020 | |
| AE | CAE-UNet [Error! Reference source not found.] | -- | -- | -- | 96.20 | -- | -- | -- | -- | -- | 86.90 | 2019 |
| PMAE [Error! Reference source not found.] | -- | -- | -- | 95.90 | -- | -- | -- | 72.90 | -- | -- | 2023 | |
| ISTL [Error! Reference source not found.] | 29.80 | 75.20 | 8.90 | 91.10 | -- | -- | -- | -- | 29.20 | 76.8 | 2019 | |
| S2-VAE [Error! Reference source not found.] | 14.30 | 94.25 | -- | -- | -- | 99.81 | -- | -- | -- | 87.6 | 2019 | |
| GAN | Ada-Net [Error! Reference source not found.] | 11.90 | 90.50 | 11.50 | 90.70 | -- | -- | -- | -- | 17.60 | 89.20 | 2019 |
| NM-GAN [Error! Reference source not found.] | 15.00 | 90.70 | 6.00 | 96.30 | -- | -- | 17.00 | 85.30 | 15.30 | 88.60 | 2021 | |
| D-UNet [Error! Reference source not found.] | 84.70 | 96.30 | -- | -- | -- | 73.00 | 85.10 | 2019 | ||||
| BMAN [Error! Reference source not found.] | -- | -- | -- | 96.60 | -- | 99.60 | 76.20 | 90.00 | 2019 | |||
| LSTM | FocalLoss-LSTM [Error! Reference source not found.] | -- | -- | -- | -- | -- | 99.83 | -- | -- | -- | -- | 2021 |
| FCN-LSTM [Error! Reference source not found.] | -- | -- | -- | 98.20 | -- | 93.70 | -- | -- | -- | -- | 2021 | |
| CNN-LSTM [Error! Reference source not found.] | -- | 94.83 | 96.50 | -- | -- | -- | -- | -- | -- | 2022 | ||
| SA | SAFA [Error! Reference source not found.] | -- | -- | -- | 96.80 | -- | -- | -- | -- | -- | 87.30 | 2023 |
| SA-AE [Error! Reference source not found.] | -- | -- | -- | 95.69 | -- | -- | -- | -- | -- | 84.10 | 2023 | |
| SABiAE [Error! Reference source not found.] | -- | -- | 9.80 | 95.60 | -- | -- | -- | -- | 20.90 | 84.70 | 2022 | |
| SA-GAN [Error! Reference source not found.] | -- | -- | -- | -- | -- | -- | -- | 75.70 | -- | 89.20 | 2021 | |
| A2D-GAN [Error! Reference source not found.] | 9.70 | 94.10 | 5.10 | 97.40 | -- | -- | 25.20 | 74.20 | 9.00 | 91.00 | 2024 | |
| SA-CNN [Error! Reference source not found.] | -- | -- | -- | -- | -- | 99.29 | -- | -- | -- | -- | 2023 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).