1. Introduction
Sustainable mobility has emerged in response to the environmental and social challenges associated with urban growth and increased vehicular traffic. This paradigm seeks to transform modes of travel, promoting alternatives that reduce greenhouse gas emissions and minimize the impact on ecosystems. Research efforts have focused on a variety of fronts, from the development of more efficient and cleaner vehicle technologies to mobility-oriented urban planning. Sustainable mobility research has become an interdisciplinary and constantly evolving field, driven by the urgent need to find viable and sustainable solutions to the growing transportation demands in cities.
The most crucial research areas, revealing a diverse and complex landscape, have emerged as a crucial convergence with Intelligent Transportation Systems (ITS), marking a transition towards more advanced and effective solutions. These systems, supported by innovative technologies such as real-time data analytics and artificial intelligence, offer unprecedented opportunities to improve traffic management, facilitate urban planning, and encourage the adoption of sustainable modes of transportation.
Research and analysis of Intelligent Transportation Systems in urban areas have become essential today due to the complexity of this problem and its profound impact on society. The constant growth of the population in urban areas, as well as the increase in vehicular traffic, are obvious factors that require careful attention [
1].
Intelligent Transportation Systems emerge as an innovative and technological response to address these challenges in an efficient and sustainable manner. In their search for effective solutions to the challenges of urban mobility, they employ a variety of machine learning techniques to obtain practical applications and offer analytical approaches in the field of transportation.
Intelligent Transportation Systems use machine learning algorithms to detect patterns in vehicle behavior, such as regular congestion in certain areas or drivers’ preferred routes. This information is essential for congestion prediction, optimal route planning and real-time adaptation of traffic management strategies.
The applicability of these approaches is broad, ranging from real-time traffic management to long-term planning of transportation infrastructure. By better understanding traffic patterns and driver behaviors, intelligent systems can offer more effective solutions, such as traffic light optimization, public transport route management, and the implementation of sustainable mobility policies.
In this regard, the management of vehicular traffic in urban areas is of great importance due to the constant population growth and increase in vehicles, which poses significant challenges [
2]. This management must address multiple dimensions, including environmental impact and road safety. Traffic congestion is a recurring problem that affects the quality of life of citizens.
There are challenges in managing traffic congestion such as the lack of an accurate and uniform representation of vehicle trajectory data that makes early identification of congested areas difficult [
3]. Dispersion and incompleteness of data collection points are also common problems.
Efficient traffic management is essential to improve road flow, reduce travel time and reduce pollutant emissions. Traditional approaches may not adapt quickly to changing traffic conditions, which is essential given that congestion can vary significantly at different times of the day [
4].
Data streams, collected from various sources such as traffic sensors and GPS navigation systems, are essential for understanding the real-time behavior of vehicles and pedestrians in urban areas [
5,
6,
7]. Clustering techniques are valuable for representing these data streams effectively, allowing identification of traffic patterns, organization of data into clusters based on similarities, and prediction of future trends in urban traffic. These techniques are fundamental for traffic planning and management tailored to the specific needs of each area.
The analysis of vehicular trajectory data streams is a widely researched area [
8], and several studies have developed clustering techniques adapted to different domains [
9,
10,
11]. The study of various approaches has proven effective in identifying sets with shared attributes in the analysis of the joint behavior of vehicles [
12,
13].
Some researchers have adapted conventional clustering methods, such as k-means [
9] and DBSCAN [
14], by adapting methods and calculations designed specifically for trajectories [
15]. Several investigations have resorted to alternative representation [
16] of trajectories such as subdvision or cell representation to improve clustering results [
17,
18].
In some cases static vehicle analysis may be limited in its ability to capture real traffic dynamics. Because vehicle behavior can change over time [
19], dynamic analysis has become important for understanding the causes of congestion [
20]. In recent years, there has been an increase in artificial intelligence and machine learning approaches that add features such as memory, scalability and accuracy [
21,
22,
23]. Machine learning has proven its effectiveness by leveraging the use of historical information combined with information associated with vehicles and the road environment in which they travel [
24,
25,
26]. These combinations, enriched by the inclusion of data from Big Data, especially generated from social networks, have become an invaluable resource for detecting traffic congestion in real time [
27].
Several studies have developed methodologies and techniques to identify congested areas accurately, using a variety of traffic and environmental characteristics [
28,
29,
30,
31,
32,
33].
Several proposals with combined approaches focusing on traffic congestion assessment and the use of clustering algorithms constitute a highly promising field of research [
34,
35,
36], providing an effective method to closely examine vehicular flow in different scenarios [
28,
37,
38].
The paper proposed by Almeida et al. [
39] proposes a method for traffic congestion detection considering speed, traffic flow and road occupancy and then uses clustering techniques to detect various degrees of congestion in vehicular data. The paper proposed by Reyes et al. [
40] analyzes vehicular flow by identifying speed ranges with a constant update. Although it is a simplified view of vehicular traffic flow, in many cases it is beneficial to include additional data in order to enrich the study of vehicular traffic [
41].
A detailed understanding of how congestion manifests and evolves in different environments is critical to strategically plan mobility, alleviate congestion, and ensure more efficient and sustainable traffic flow.
This paper proposes a methodology to analyze vehicular flow by clustering vehicle trajectory data with GPS points. This methodology allows an accurate representation of the data, especially useful when points are scarce. It uses clusters to detect areas of congestion patterns. The constant updating of the clusters ensures up-to-date data and real congestion management. It also uses a congestion indicator to measure traffic saturation allowing a dynamic view of the traffic situation in different areas.
This article is organized as follows:
Section 2 describes the proposed methodology,
Section 3 present the obtained results,
Section 4 discusses the obtained results, and
Section 5 presents the conclusions and future lines of work.