Submitted:
28 July 2023
Posted:
01 August 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Volume: the amount of traffic data is enormous due to the number of vehicles crossing roads all day, which is the main reason the community is interested in solving this problem.
- Variety: available data range from videos to database records or textual logs, often depending on the specific task; also the available technologies (e.g., sensors and cameras) do not provide the same structure.
- Velocity: it goes without saying that traffic flow data is collected at a rapid pace.
- Vinculation: Streams of traffic data are strongly interrelated: any traffic lights, pedestrians, or accidents that come into play affect the whole scenario.
- Validity: it is assured by the technologies used. More specifically, cameras provide instantaneous feedback on what happens; for sensors, specific tests assure the good quality of the detected facts.
- At the urban or suburban level, several cameras spread along the main roads are already present, which can detect the passage of vehicles from pre-defined points of interest, also recognizing the specific vehicle identity based on their plate numbers. Additional data can be obtained from location sensors (e.g., based on GPS) carried by vehicles (especially public transportation ones) and pedestrians, that continuously track their position. The vehicle passage events generated by this network of sensors can be used to automatically create models of traffic behaviour, to be used for supervision and/or prediction. These models may concern normal behaviour or abnormal behaviours of interest. In the former case, the aim might be checking that everything is compliant with the normal model, and raising alarms in case of deviations. In the latter case, the aim might be recognizing anomalies and notifying the target users (e.g., a vehicle on highways carrying out burglary or other kinds of illegal activities).
- As said, inside towns, the network of streets allows for an extremely large number of path variations. So, a relevant issue here may be identifying automatically the most relevant locations in the town to be supervised, where a more fine-grained analysis of traffic behaviour can be carried out.
-
Given an (urban or suburban) place of interest, cameras can be placed not only to detect vehicle passage but also to monitor the behaviour of people and vehicles. Here, both the above-mentioned tasks can be needed:
- –
- The most relevant zones (places of interest) in the camera framing can be manually defined or automatically detected, in order to specifically check the vehicles or pedestrians traversing them.
- –
- Once more, models of normal or abnormal behaviour(s) in traversing these zones can be learned and used for supervision or prediction purposes.
Additionally, the same data coming from the camera can also be fed into an automated reasoning system that reasons on the detected events as long as they take place, interprets them according to the perspectives of interest to the system’s users, and raises warnings or notifications for noteworthy situations, also being able to explain why the situation was recognized and suggesting possible actions to handle them.
- understandability
- the outcomes of the AI system are expressed in human-level terms;
- interpretability
- the output of the AI system can be easily traced back to the input;
- explainability
- the full rationale connecting the output of the AI system to the input can be explicitly reported.
2. Related Works
- physics-based prediction: studying kinematics models to estimate the quality of the possible state transitions.
- pattern-based prediction: a cluster of trajectory data to determine trajectories or manoeuvres.
- planning-based prediction: learning optimal behavior by cost function estimation (Inverse Reinforcement Learning).
- symbolic rules, differently from visual operators, better describe the semantics of traffic scenes (a vehicle being a physical entity that must exist from its entrance in the scene until its exit; it cannot overlap with other vehicles, split or change shape or size, and so on);
- symbolic rules have a clear and simple syntax and are a natural interface with human experts, helping with decision-making processes;
- the reasoning module is independent of the specific facts and rules added; hence, the base can be incremented by simply adding new rules;
- the high-level module is able to correct errors of the low-level modules by exploiting temporal and scene consistency along the image sequence.
3. AI-based Framework Description
3.1. Computer Vision for Information Extraction from Traffic Videos
3.1.1. Vehicle Detection & Tracking
- Load Model: This function handles loading an object detection model. In our example, we used the YOLOv8 model based on the YOLO algorithm. This model was trained on the COCO128 dataset to identify objects in videos.
- Display Tracker Options: This function enables users to select options for object-tracking visualization. Users can choose whether to display tracking and select the tracking algorithm to use.
- Log Detection: This function records object detection information in a log. It takes input such as the bounding box coordinates of the object, object ID, object type, detection confidence, and a dedicated object for log writing.
-
Display Detected Frames: This function displays video frames with the detected objects (see Figure 3). It accepts input such as the detection confidence level, object detection model, frame to display, object for log writing, frame image, and other options for object tracking. The following operations are performed within this function:
- –
- Drawing RoI: made to allow users to directly highlight points of interest on the frame (if they want). It visualizes object tracking if specified by the user, otherwise predicting objects using the YOLOv8 model. Displaying detection results on the frame. Recording detections in the log using the "Log Detection" function.
- –
- Play Stored Video: This function manages video playback and analysis of detected objects. It takes inputs such as the detection confidence level, object detection model, start and stop buttons, and the video to be analysed.
3.1.2. Trajectory analysis and event detection
3.2. Clustering for the Identification of Noteworthy Areas
- DBSCAN
- [58] is a density-based algorithm that groups data points into dense regions, considering noise points as well.
- K-means
- [59] is a clustering algorithm that assigns data points to clusters based on their similarity.
- Agglomerative Clustering
- [60] is a hierarchical clustering algorithm that progressively combines data points into larger clusters.
- Gaussian Mixture Models (GMM)
- [61] is a statistical model that assumes data are generated from a set of Gaussian distributions.
- Spectral Clustering
- [62] is a graph-based clustering algorithm that utilizes the spectral representation of data.
- The Elbow Method
- [63] is based on inertia, which represents the sum of squared distances between data points and the nearest cluster centroid. This method involves plotting a graph of inertia against the number of clusters and identifying the point where inertia stops decreasing rapidly and becomes more gradual. This point, often referred to as the elbow in the graph, is considered the optimal number of clusters.
- The Silhouette Method
- [64] evaluates the cohesion within clusters and the separation between clusters. For each data point, the silhouette score is calculated, measuring how similar the object is to its own cluster compared to other clusters. The Silhouette Method entails plotting a graph of the silhouette score against the number of clusters and identifying the point where the score reaches its maximum. This point represents the optimal number of clusters.
- The External Validity Index
- [65] compares clustering results with known class labels. Measures of completeness, homogeneity, and V-measure are calculated to assess how well the clusters correspond to the known class labels. The optimal number of clusters can be determined by considering the maximum value of these measures.
- The Cohesion-Separation Index
- [66] evaluates the cohesion within clusters and the separation between clusters based on distances between data points. Cohesion is calculated as the sum of distances between points within the same cluster, while separation is calculated as the sum of distances between points in different clusters. The optimal number of clusters can be determined by considering the maximum value of the cohesion-separation index.
3.3. Process Mining for Automatic (Urban or Suburban) Behavioral Modeling, Supervision, Prediction & Classification
- A process corresponds to the behavior of vehicles along the road. Single-vehicle or overall traffic behaviors can be modeled.
- the agents are the vehicles.
- A case is a route on the road by one or more vehicles, from their first detection in to its last detection. More specifically, the first detection of the vehicle starts a case, that ends when the vehicle is not detected anymore.
- Activities are associated to the vehicle passing from the pre-defined places, e.g., under selected gates along the road or inside predefined RoIs; the driving behavior of the vehicle (in terms of direction and speed) may also be associated to activities.
- the event timestamp,
- the type of the event: begin or end of a process, begin or end of an activity (allowing to consider time span and concurrency), or context description,
- the name of the workflow the process refers to,
- a unique identifier for each process execution,
- the name of the activity,
- the progressive number of occurrences of that activity in that process,
- (optionally) the agent that carries out the activity.
- Model Learning
- WoMan’s learning module is called WIND (Workflow INDucer) which learns models for a case, after the acquisition of events. It usually does not need a large dataset to obtain a stable model.
- Conformance Check
-
WoMan’s supervision module WEST (Workflow Enactment Supervisor and Trainer) can check the compliance of each event in new process executions with respect to a given model. For each event, the system returns a compliance outcome among the following options:
- ok
- the event is compliant with the process model;
- warning
- the event is not compliant with the process model, for one of several reasons; each warning is associated with a degree of severity. Of course, warnings that encompass others have a greater severity degree than the others.
- error
- the event cannot be processed, being syntactically or semantically wrong.
After all the events of a case have been processed, the case can be used to refine the model. - Activity Prediction
- During process execution, WoMan can be asked to predict the next activities that will be carried out using the SNAP (Suggester of Next Action in Process) module. The candidate predictions are ranked based on a combination of several parameters.
- Process Prediction
- Given a case of an unknown workflow, the system may be asked for a classification of the process that is being executed, among a set of candidate processes whose models are available to the system. WoGue (Workflow Guesser) is the model in charge of making this prediction, based on a comparison of the events of the current process enactment to the candidate models.
3.4. Automated MultiStrategy Reasoning for Traffic Interpretation
- Deduction
- aims at making explicit knowledge that is implicit in the available knowledge but is a necessary consequence thereof.
- Abstraction
- reduces the amount of information conveyed by a set of facts. This reduces the computational load needed to process the set of facts, provided that the information that is relevant to the achievement of a goal is preserved.
- Abduction
- is devoted to coping with missing information, by guessing unknown facts that are not stated in the available knowledge but are needed to solve a given problem, provided that they satisfy some integrity constraints. Of course, there may be many plausible explanations for a given observation.
- Uncertainty
- The possibility of handling uncertainty may dramatically improve flexibility and robustness of reasoning.
- Argumentation
- deals with inconsistent knowledge, to distinguish which of several contrasting, but internally consistent, positions are justified, based on the relationships among the involved knowledge items and on their properties.
- Induction
- is the inference of general knowledge starting from specific observations.
- Ontological
- An ontology defines and describes the kinds of entities that are of interest in a domain, their properties and relationships. Typical ontology-based reasoning tasks are inheritance and consistency checking.
- Similarity-based
- computation between FOL descriptions is complex due to non-unique mapping between the descriptions.
- Analogy
- Analogy is the cognitive process of matching the characterizing features of two subjects, objects, situations, etc. After finding an analogy on some roles, the association can be extended to further features.
3.5. Visual Analytics for Traffic Understanding
4. Experiments on a Sample Scenario
4.1. Positioning-based Critical Regions Identification
4.2. Camera-based Critical Regions Identification
4.2.1. Suburban case: Two-way linear highway
4.2.2. Suburban case: highway junction
4.2.3. Urban case: Piazza Venezia in Rome
4.2.4. Urban case: Largo di Torre Argentina in Rome
4.3. Trajectory analysis and event detection
4.4. Process-based Traffic Modeling and Supervision
4.5. Automated Reasoning for Traffic Interpretation
- object(O,X0,X1,Y0,Y1,T)
- : object with identifier O is recognised in the scene at time T, enclosed in a bounding box with coordinates .
- next(T’,T”)
- : time follows time .
- move(O,X0,X1,Y0,Y1,T)
- : object O moved (by a considerable distance) at time T, where are the displacements of each coordinate of its bounding box.
- enter(O,P,T)
- : object O entered place P (a RoI) at time T.
- leave(O,P,T)
- : object O left place P (a RoI) at time T.
- still(O,T)
- : object O stopped at time T.
- halt(O,L,T)
- : object O stopped for a certain time period L starting from time T.
- stay(O,P)
- : object O stayed in place P (a RoI).
- placetime(O,P,T)
- : object O was in place P (a RoI) for a considerable time T.
- status(O,T,S)
- : S is the status of the object O at time T, where S can be `moving’, `still’ or someplace (RoI) identifier.
- meet(L,T,P)
- : objects in list L were in the same place P at time T.
- wait(X,Y,T)
- : object X was still at time T, but is now in the same place as object Y.
- distance(X00,X01,Y00,Y01,X10,X11,Y10,Y11,D)
- : D is the Euclidean distance between the coordinates and of two bounding boxes.
- closetimes(X,Y,T,L)
- : L is the last timestamp, starting from T, in which X and Y were close to each other.
- close(X,Y,T,L)
- : L is the amount of time for which X and Y were close to each other, starting from timestamp T.
- accomplices(X,Y,T,D)
- : objects X and Y are close to each other for a certain amount of time D in the `halt’ state, and so still from a certain considerable time, at time T.
- fast(O,T1,T2,D)
- : object O moved many times between timestamps and with distances greater than or equal to D.
- traffic jam;
- vehicle going faster than the maximum allowed speed;
- vehicle passing from forbidden zones of the road;
- vehicle stopping in places where the stop is forbidden;
- vehicle taking a wrong turn;
- vehicle going around the square in a loop;
4.6. Visual Analytics for Traffic Understanding
5. Conclusions
Author Contributions
Funding
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| KDD | Knowledge Discovery in Databases |
| GPS | Global Positioning System |
| CV | Computer Vision |
| VMOT | Visual multiple object tracking |
| VSD | Value Sensitive Design |
| BDI | Belief-Desire-Intention |
| KRR | Knowledge Representation and Reasoning |
| YOLO | You Only Look Once |
| FPS | frames per second |
| RoI | Regions of Interests |
| GMM | Gaussian Mixture Models |
| FOL | First-Order Logic |
| KB | Knowledge Base |
| WoMan | Workflow Manager |
| GEAR | General Engine for Automated Reasoning |
References
- The top 10 causes of death. https://www.who.int/en/news-room/fact-sheets/detail/the-top-10-causes-of-death. Accessed: 2023-02-27.
- Savithramma, R.; Sumathi, R.; Sudhira, H. Reinforcement learning based traffic signal controller with state reduction. Journal of Engineering Research 2023, 11, 100017.
- Elkin, D.; Vyatkin, V. IoT in Traffic Management: Review of Existing Methods of Road Traffic Regulation. In Applied Informatics and Cybernetics in Intelligent Systems, Volume 3; Silhavy, R., Ed.; Springer, 2020; Vol. 1226, Advances in Intelligent Systems and Computing, pp. 536–551. [CrossRef]
- Leuzzi, F.; Ferilli, S., Eds. Traffic Mining Applied to Police Activities - Proceedings of the 1st Italian Conference for the Traffic Police (TRAP-2017), Rome, Italy, October 25-26, 2017, Vol. 728, Advances in Intelligent Systems and Computing. Springer, 2018. [CrossRef]
- Loiseau, E.; Saikku, L.; Antikainen, R.; Droste, N.; Hansjürgens, B.; Pitkänen, K.; Leskinen, P.; Kuikman, P.; Thomsen, M. Green economy and related concepts: An overview. Journal of cleaner production 2016, 139, 361–371. [CrossRef]
- Sagiroglu, S.; Sinanc, D. Big data: A review. In Proceedings of the 2013 International Conference on Collaboration Technologies and Systems, CTS 2013, San Diego, CA, USA, May 20-24, 2013; Fox, G.C.; Smari, W.W., Eds. IEEE, 2013, pp. 42–47. [CrossRef]
- Tsai, C.; Lai, C.; Chao, H.; Vasilakos, A.V. Big data analytics: A survey. J. Big Data 2015, 2, 21. [CrossRef]
- Tarlochan, F.; Ibrahim, M.I.M.; Gaben, B. Understanding traffic accidents among young drivers in Qatar. International journal of environmental research and public health 2022, 19, 514. [CrossRef]
- Hammad, H.M.; Ashraf, M.; Abbas, F.; Bakhat, H.F.; Qaisrani, S.A.; Mubeen, M.; Fahad, S.; Awais, M. Environmental factors affecting the frequency of road traffic accidents: A case study of sub-urban area of Pakistan. Environmental Science and Pollution Research 2019, 26, 11674–11685. [CrossRef]
- Government, A. Impact of road trauma and measures to improve outcomes / Bureau of Infrastructure, Transport and Regional Economics; Department of Infrastructure and Regional Development, Bureau of Infrastructure, Transport and Regional Economics Canberra, ACT, 2014; pp. x, 71 pages :.
- Sofuoglu, S.E.; Aviyente, S. GLOSS: Tensor-based anomaly detection in spatiotemporal urban traffic data. Signal Processing 2022, 192, 108370. [CrossRef]
- Kong, X.; Gao, H.; Alfarraj, O.; Ni, Q.; Zheng, C.; Shen, G. Huad: Hierarchical urban anomaly detection based on spatio-temporal data. IEEE Access 2020, 8, 26573–26582. [CrossRef]
- Lindemann, B.; Maschler, B.; Sahlab, N.; Weyrich, M. A survey on anomaly detection for technical systems using LSTM networks. Computers in Industry 2021, 131, 103498. [CrossRef]
- Di Mauro, N.; Ferilli, S. Unsupervised LSTMs-based learning for anomaly detection in highway traffic data. In Proceedings of the 24th International Symposium on Foundations of Intelligent Systems, ISMIS 2018, Limassol, Cyprus, October 29–31, 2018, Proceedings 24. Springer, 2018, pp. 281–290.
- D’Andrea, E.; Marcelloni, F. Detection of traffic congestion and incidents from GPS trace analysis. Expert Systems with Applications 2017, 73, 43–56. [CrossRef]
- Ahmadi, S.A.; Ghorbanian, A.; Mohammadzadeh, A. Moving vehicle detection, tracking and traffic parameter estimation from a satellite video: A perspective on a smarter city. International journal of remote sensing 2019, 40, 8379–8394. [CrossRef]
- Dubská, M.; Herout, A.; Sochor, J. Automatic Camera Calibration for Traffic Understanding. In Proceedings of the British Machine Vision Conference, BMVC 2014, Nottingham, UK, September 1-5, 2014; Valstar, M.F.; French, A.P.; Pridmore, T.P., Eds. BMVA Press, 2014.
- Hamilton, J.D. Time series analysis; Princeton university press, 2020.
- Cryer, J.D. Time series analysis; Vol. 286, Duxbury Press Boston, 1986.
- Trinh, H.D.; Giupponi, L.; Dini, P. Mobile traffic prediction from raw data using LSTM networks. In Proceedings of the 2018 IEEE 29th annual international symposium on personal, indoor and mobile radio communications (PIMRC). IEEE, 2018, pp. 1827–1832.
- Feng, J.; Chen, X.; Gao, R.; Zeng, M.; Li, Y. Deeptp: An end-to-end neural network for mobile cellular traffic prediction. IEEE Network 2018, 32, 108–115. [CrossRef]
- Thiagarajan, A.; Ravindranath, L.; LaCurts, K.; Madden, S.; Balakrishnan, H.; Toledo, S.; Eriksson, J. Vtrack: Accurate, energy-aware road traffic delay estimation using mobile phones. In Proceedings of the Proceedings of the 7th ACM conference on embedded networked sensor systems, 2009, pp. 85–98.
- Xu, F.; Lin, Y.; Huang, J.; Wu, D.; Shi, H.; Song, J.; Li, Y. Big Data Driven Mobile Traffic Understanding and Forecasting: A Time Series Approach. IEEE Transactions on Services Computing 2016, 9, 796–805. [CrossRef]
- Wang, J.; Li, Y.; Gao, R.X.; Zhang, F. Hybrid physics-based and data-driven models for smart manufacturing: Modelling, simulation, and explainability. Journal of Manufacturing Systems 2022, 63, 381–391. [CrossRef]
- Dias, T.; Oliveira, N.; Sousa, N.; Praça, I.; Sousa, O. A Hybrid Approach for an Interpretable and Explainable Intrusion Detection System. In Proceedings of the 21st International Conference on Intelligent Systems Design and Applications (ISDA 2021); Abraham, A.; Gandhi, N.; Hanne, T.; Hong, T.; Rios, T.N.; Ding, W., Eds. Springer, 2021, Vol. 418, Lecture Notes in Networks and Systems, pp. 1035–1045. [CrossRef]
- De, T.; Giri, P.; Mevawala, A.; Nemani, R.; Deo, A. Explainable AI: A hybrid approach to generate human-interpretable explanation for deep learning prediction. Procedia Computer Science 2020, 168, 40–48. [CrossRef]
- Mehdizadeh, A.; Cai, M.; Hu, Q.; Alamdar Yazdi, M.A.; Mohabbati-Kalejahi, N.; Vinel, A.; Rigdon, S.E.; Davis, K.C.; Megahed, F.M. A review of data analytic applications in road traffic safety. Part 1: Descriptive and predictive modeling. Sensors 2020, 20, 1107. [CrossRef]
- Huang, C.; Li, Y.; Nevatia, R. Multiple target tracking by learning-based hierarchical association of detection responses. IEEE transactions on pattern analysis and machine intelligence 2012, 35, 898–910. [CrossRef]
- Santhosh, K.K.; Dogra, D.P.; Roy, P.P. Anomaly detection in road traffic using visual surveillance: A survey. ACM Computing Surveys (CSUR) 2020, 53, 1–26. [CrossRef]
- Karle, P.; Geisslinger, M.; Betz, J.; Lienkamp, M. Scenario understanding and motion prediction for autonomous vehicles-review and comparison. IEEE Transactions on Intelligent Transportation Systems 2022, 23, 16962–16982. [CrossRef]
- Umbrello, S.; Yampolskiy, R.V. Designing AI for explainability and verifiability: A value sensitive design approach to avoid artificial stupidity in autonomous vehicles. International Journal of Social Robotics 2022, 14, 313–322. [CrossRef]
- Friedman, B. Value-sensitive design. interactions 1996, 3, 16–23.
- Fichera, L.; Marletta, D.; Nicosia, V.; Santoro, C. Flexible robot strategy design using belief-desire-intention model. In Proceedings of the Research and Education in Robotics-EUROBOT 2010: International Conference, Rapperswil-Jona, Switzerland, May 27-30, 2010, Revised Selected Papers. Springer, 2011, pp. 57–71.
- Li, N.; Liu, S.; Liu, Y.; Zhao, S.; Liu, M. Neural Speech Synthesis with Transformer Network. In Proceedings of the The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27 - February 1, 2019. AAAI Press, 2019, pp. 6706–6713. [CrossRef]
- Dong, J.; Chen, S.; Zong, S.; Chen, T.; Labi, S. Image transformer for explainable autonomous driving system. In Proceedings of the 2021 IEEE International Intelligent Transportation Systems Conference (ITSC). IEEE, 2021, pp. 2732–2737.
- Shen, Y.; Jiang, S.; Chen, Y.; Yang, E.; Jin, X.; Fan, Y.; Campbell, K.D. To explain or not to explain: A study on the necessity of explanations for autonomous vehicles. arXiv preprint arXiv:2006.11684 2020.
- Levesque, H.J. Knowledge representation and reasoning. Annual review of computer science 1986, 1, 255–287.
- Xiao, L.; Gerth, J.; Hanrahan, P. Enhancing visual analysis of network traffic using a knowledge representation. In Proceedings of the 2006 IEEE symposium on visual analytics science and technology. IEEE, 2006, pp. 107–114.
- Hernández, J.; Cuena, J.; Molina, M. Real-time traffic management through knowledge-based models: The TRYS approach. ERUDIT Tutorial on Intelligent Traffic Management Models, Helsinki, Finland 1999.
- Cuena, J.; Hernández, J.; Molina, M. Knowledge-based models for adaptive traffic management systems. Transportation Research Part C: Emerging Technologies 1995, 3, 311–337. [CrossRef]
- Hülsen, M.; Zöllner, J.M.; Weiss, C. Traffic intersection situation description ontology for advanced driver assistance. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), 2011, pp. 993–999. [CrossRef]
- Choi, C.; Wang, T.; Esposito, C.; Gupta, B.B.; Lee, K. Sensored Semantic Annotation for Traffic Control Based on Knowledge Inference in Video. IEEE Sensors Journal 2021, 21, 11758–11768. [CrossRef]
- Toal, A.F.; Buxton, H. Spatio-temporal Reasoning within a Traffic Surveillance System. In Proceedings of the Second European Conference on Computer Vision - ECCV’92,, Santa Margherita Ligure, Italy, May 19-22, 1992; Sandini, G., Ed. Springer, 1992, Vol. 588, Lecture Notes in Computer Science, pp. 884–892. [CrossRef]
- Logi, F.; Ritchie, S.G. Development and evaluation of a knowledge-based system for traffic congestion management and control. Transportation Research Part C: Emerging Technologies 2001, 9, 433–459. [CrossRef]
- Manley, E.; Cheng, T. Exploring the role of spatial cognition in predicting urban traffic flow through agent-based modelling. Transportation Research Part A: Policy and Practice 2018, 109, 14–23. [CrossRef]
- Cucchiara, R.; Piccardi, M.; Mello, P. Image analysis and rule-based reasoning for a traffic monitoring system. IEEE transactions on intelligent transportation systems 2000, 1, 119–130. [CrossRef]
- Stockman, G.; Shapiro, L.G. Computer vision; Prentice Hall PTR, 2001.
- Rezaei, F.; Yazdi, M. Real-time crowd behavior recognition in surveillance videos based on deep learning methods. Journal of Real-Time Image Processing 2021, 18, 1669–1679. [CrossRef]
- Meynberg, O.; Cui, S.; Reinartz, P. Detection of high-density crowds in aerial images using texture classification. Remote Sensing 2016, 8, 470. [CrossRef]
- Meier, E.B.; Ade, F. Tracking cars in range image sequences. In Proceedings of the Proceedings of Conference on Intelligent Transportation Systems. IEEE, 1997, pp. 105–110.
- Jiang, P.; Ergu, D.; Liu, F.; Cai, Y.; Ma, B. A Review of Yolo algorithm developments. Procedia Computer Science 2022, 199, 1066–1073. [CrossRef]
- Lan, W.; Dang, J.; Wang, Y.; Wang, S. Pedestrian detection based on YOLO network model. In Proceedings of the 2018 IEEE international conference on mechatronics and automation (ICMA). IEEE, 2018, pp. 1547–1551.
- Aboah, A.; Wang, B.; Bagci, U.; Adu-Gyamfi, Y. Real-time multi-class helmet violation detection using few-shot data sampling technique and yolov8. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 5349–5357.
- Zhang, Y.; Sun, P.; Jiang, Y.; Yu, D.; Weng, F.; Yuan, Z.; Luo, P.; Liu, W.; Wang, X. ByteTrack: Multi-object Tracking by Associating Every Detection Box. In Proceedings of the 17th European Conference on Computer Vision - ECCV 2022, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXII; Avidan, S.; Brostow, G.J.; Cissé, M.; Farinella, G.M.; Hassner, T., Eds. Springer, 2022, Vol. 13682, Lecture Notes in Computer Science, pp. 1–21. [CrossRef]
- Aharon, N.; Orfaig, R.; Bobrovsky, B.Z. BoT-SORT: Robust Associations Multi-Pedestrian Tracking. arXiv preprint arXiv:2206.14651 2022.
- Amit, Y.; Felzenszwalb, P.; Girshick, R. Object detection. Computer Vision: A Reference Guide 2020, pp. 1–9.
- Omran, M.G.; Engelbrecht, A.P.; Salman, A. An overview of clustering methods. Intelligent Data Analysis 2007, 11, 583–605. [CrossRef]
- Khan, K.; Rehman, S.U.; Aziz, K.; Fong, S.; Sarasvady, S. DBSCAN: Past, present and future. In Proceedings of the The fifth international conference on the applications of digital information and web technologies (ICADIWT 2014). IEEE, 2014, pp. 232–238.
- Ahmed, M.; Seraj, R.; Islam, S.M.S. The k-means algorithm: A comprehensive survey and performance evaluation. Electronics 2020, 9, 1295. [CrossRef]
- Ackermann, M.R.; Blömer, J.; Kuntze, D.; Sohler, C. Analysis of agglomerative clustering. Algorithmica 2014, 69, 184–215. [CrossRef]
- Reynolds, D.A.; et al. Gaussian mixture models. Encyclopedia of biometrics 2009, 741.
- Jia, H.; Ding, S.; Xu, X.; Nie, R. The latest research progress on spectral clustering. Neural Computing and Applications 2014, 24, 1477–1486. [CrossRef]
- Cui, M.; et al. Introduction to the k-means clustering algorithm based on the elbow method. Accounting, Auditing and Finance 2020, 1, 5–8. [CrossRef]
- Dedeoğlu, Y.; Töreyin, B.U.; Güdükbay, U.; Çetin, A.E. Silhouette-based method for object classification and human action recognition in video. In Proceedings of the Computer Vision in Human-Computer Interaction: ECCV 2006 Workshop on HCI, Graz, Austria, May 13, 2006. Proceedings 9. Springer, 2006, pp. 64–77.
- Rendón, E.; Abundez, I.M.; Gutierrez, C.; Zagal, S.D.; Arizmendi, A.; Quiroz, E.M.; Arzate, H.E. A Comparison of Internal and External Cluster Validation Indexes. In Proceedings of the the 2011 American Conference on Applied Mathematics and the 5th WSEAS International Conference on Computer Engineering and Applications; World Scientific and Engineering Academy and Society (WSEAS): Stevens Point, Wisconsin, USA, 2011; AMERICAN-MATH’11/CEA’11, p. 158–163.
- Lee, K.M.; Lee, K.M.; Lee, C.H. Statistical cluster validity indexes to consider cohesion and separation. In Proceedings of the 2012 international conference on fuzzy theory and its applications (ifuzzy2012). IEEE, 2012, pp. 228–232.
- McCallum, D.; Avis, D. A linear algorithm for finding the convex hull of a simple polygon. Information Processing Letters 1979, 9, 201–206. [CrossRef]
- Ferilli, S.; Redavid, D. A process mining approach to the identification of normal and suspect traffic behavior. In Proceedings of the 1st Italian Conference for the Traffic Police (TRAP-2017): Traffic Mining Applied to Police Activities:. Springer, 2018, pp. 37–56.
- Ferilli, S. Woman: Logic-based workflow learning and management. IEEE Transactions on Systems, Man, and Cybernetics: Systems 2013, 44, 744–756. [CrossRef]
- Ferilli, S.; Esposito, F.; Redavid, D.; Angelastro, S. Predicting process behavior in woman. In Proceedings of the AI* IA 2016 Advances in Artificial Intelligence: XVth International Conference of the Italian Association for Artificial Intelligence, Genova, Italy, November 29–December 1, 2016, Proceedings XV. Springer, 2016, pp. 308–320.
- Ferilli, S.; Angelastro, S. Activity prediction in process mining using the WoMan framework. Journal of Intelligent Information Systems 2019, 53, 93–112. [CrossRef]
- Ferilli, S. GEAR: A General Inference Engine for Automated MultiStrategy Reasoning. Electronics 2023, 12. [CrossRef]
- Buono, P.; Legretto, A.; Ferilli, S.; Angelastro, S. A Visual Analytic Approach to Analyze Highway Vehicular Traffic. In Proceedings of the 22nd International Conference Information Visualisation (IV), 2018, pp. 204–209. [CrossRef]











Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).