Submitted:
09 May 2025
Posted:
15 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Methodology
3.1. Z-Score Normalization
3.2. Fuzzy Core Membership
3.3. Fuzzy DBSCAN Clustering
- is the distance between point and cluster
- is a parameter controlling the spread of the exponential function,
- is the total number of clusters.
4. Experimental Evaluation
4.1. Dataset
- Type of facility (categorical: education, health, or entertainment)
- Year of establishment (numerical)
- Geographic coordinates (latitude and longitude)
4.2. Results
4.3. GIS Visualization
5. Architecture and Application in Smart Cities
5.1. System Architecture
- Data Collection Layer: This layer ingests structured and semi-structured data from a variety of sources, including urban sensors (e.g., GPS trackers, environmental sensors), citizen mobile applications, public service directories, and historical databases. Integration is facilitated via standardized APIs and IoT gateways to support data acquisition in real time and batch modes.
- Data Preprocessing Layer: Raw inputs undergo data cleansing, transformation, and normalization. Missing values are handled through imputation, while Z-score normalization is applied to ensure comparability across heterogeneous attributes such as geographic coordinates, service categories, and timestamps. Technologies like Python Pandas or Apache Spark may be used to ensure scalability.
- Parameter Optimization Layer: This layer employs Particle Swarm Optimization (PSO) to automatically search for the optimal combination of DBSCAN parameters—specifically, Eps, , and . The fitness function is based on clustering quality metrics such as the silhouette score. This automation eliminates the need for manual tuning, which is often suboptimal in dynamic urban environments.
- Fuzzy Clustering Layer: Using the PSO-optimized parameters, a fuzzy-enhanced version of DBSCAN is executed. Unlike traditional DBSCAN, this method calculates membership degrees for each point, allowing partial assignment to multiple clusters. This fuzziness better captures the overlapping nature of urban services (e.g., education centers serving multiple districts).
- Analysis and Visualization Layer: The final output clusters are visualized using GIS tools, interactive dashboards, and spatial heatmaps. Platforms such as QGIS, Power BI, or Tableau are used to display clusters, highlight service coverage gaps, and enable real-time monitoring. These outputs are critical for urban planners, public service managers, and policymakers.
5.2. Urban Analytics Use Case
6. Conclusions
- Pre-2010 dataset: 0.82 vs. 0.67
- Post-2010 dataset: 0.75 vs. 0.66
References
- G. Schoier and G. Borruso. A Clustering Method for Large Spatial Databases. in Computational Science and Its Applications – ICCSA 2004, Springer, 2004, 1089–1095.
- R. R. Kumar et al. OPTCLOUD: An Optimal Cloud Service Selection Framework Using QoS Correlation Lens. Comput. Intell. Neurosci. 2022, 2022. [Google Scholar] [CrossRef]
- P. Das et al. [Retracted] Heterogeneous Network-Based Inductive Matrix Methods for Predicting Biomedical Gene Disease. Biomed. Res. Int. 2023, 2023, 7121514. [Google Scholar] [CrossRef]
- S. Nasir et al. Machine Learning Approach for Solid Waste Categorization in Ethiopia. Int. J. Innov. Sci. Res. Technol. 2020, 5, 283–288. [Google Scholar]
- S. Mehammed et al. Optimizing Multi-Dimensional Data-Index Algorithms for MIC Architectures. Int. J. Eng. Technol. 2022, 7, 366–372. [Google Scholar]
- M. N. Alam et al. Efficient MAC Protocol for Wireless Sensor Nodes to Lessen Hidden and Deaf Node Problem. Int. J. Electr. Electron. Eng. Technol. 2020, 11, 28–43. [Google Scholar]
- A. Ramachandran et al. Machine Learning Algorithms for Fall Detection Using Kinematic and Heart Rate Parameters – A Comprehensive Analysis. IAES Int. J. Artif. Intell. 2020, 9, 772–780. [Google Scholar] [CrossRef]
- N. S. Al-Blihed et al. Blockchain and Machine Learning in the Internet of Things: A Review of Smart Healthcare. IAES Int. J. Artif. Intell. 2023, 12, 995–1006. [Google Scholar] [CrossRef]
- N. Seman and N. A. Razmi. Machine Learning-Based Technique for Big Data Sentiments Extraction. IAES Int. J. Artif. Intell. 2020, 9, 473–479. [Google Scholar] [CrossRef]
- E. Mathew and S. Abdulla. The LSTM Technique for Demand Forecasting of E-Procurement in the Hospitality Industry in the UAE. IAES Int. J. Artif. Intell. 2020, 9, 584–590. [Google Scholar]
- S. Kumar and A. Vidhate. Internet of Things and Blockchain Integration for Security and Privacy. IAES Int. J. Artif. Intell. 2024, 13, 4037–4044. [Google Scholar] [CrossRef]
- M. Meeradevi et al. Evaluating the Machine Learning Models Based on Natural Language Processing Tasks. IAES Int. J. Artif. Intell. 2024, 13, 1954. [Google Scholar] [CrossRef]
- E. Barzizza et al. Machine Learning-Based Decision-Making Approach for Predicting Defects Detection: A Case Study. IAES Int. J. Artif. Intell. 2024, 13, 3052–3060. [Google Scholar] [CrossRef]
- G. P. Yee et al. K-Means Clustering Analysis and Multiple Linear Regression Model on Household Income in Malaysia. IAES Int. J. Artif. Intell. 2023, 12, 731–738. [Google Scholar] [CrossRef]
- S. Masrom et al. Machine Learning of Tax Avoidance Detection Based on Hybrid Metaheuristic Algorithms. IAES Int. J. Artif. Intell. 2022, 11, 1153–1163. [Google Scholar] [CrossRef]
- D. Ienco and G. Bordogna. Fuzzy Extensions of the DBSCAN Clustering Algorithm. Soft Comput. 2018, 22, 1719–1730. [Google Scholar] [CrossRef]
- H.-P. Kriegel and M. Pfeifle. Density-Based Clustering of Uncertain Data. in Proc. 11th ACM SIGKDD Int. Conf. Knowl. Disc. Data Min. (KDD) 2005, 672–677.
- E. N. Nasibov and G. Ulutagay. Robustness of Density-Based Clustering Methods with Various Neighborhood Relations. Fuzzy Sets Syst. 2009, 160, 3601–3615. [Google Scholar] [CrossRef]
- C. Guan et al. Particle Swarm Optimized Density-Based Clustering and Classification: Supervised and Unsupervised Learning Approaches. Swarm EComput. 2019, 44, 876–896. [Google Scholar]
- M. Ester et al. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. in Proc. 2nd Int. Conf. Knowl. Disc. Data Min. (KDD) 1996, 226–231.
- M. N. Alam and Y.-C. Kim. Efficient MAC Protocol for Hybrid Wireless Network with Heterogeneous Sensor Nodes. J. Sensors 2016, 2016, 7951965. [Google Scholar]
- M. N. Alam et al. Neighbor Initiated Approach for Avoiding Deaf and Hidden Node Problems in Directional MAC for Ad Hoc Networks. Wireless Netw. 2013, 19, 587–602. [Google Scholar]
- M. N. Alam et al. A Mechanism to Improve Spatiality in Directional MAC for Wireless Ad-Hoc Networks. in Proc. ICUFN 2013, 2013, 661–666.
- P. Khan et al. Performance Analysis of WBAN MAC Protocol under Different Access Periods. Int. J. Distrib. Sens. Networks 2015, 2015. [Google Scholar]
- R. Cheng et al. Evaluating Probabilistic Queries Over Imprecise Data. in Proc. ACM SIGMOD Int. Conf. Manage. Data 2003, 551–562.
- K. Aitdaraou et al. A Fuzzy Observer Synthesis to State and Fault Estimation for Takagi–Sugeno Implicit Systems. IAES Int. J. Artif. Intell. 2023, 12, 241. [Google Scholar] [CrossRef]
- R. F. Ningrum et al. Fuzzy Mamdani Logic Inference Model in the Loading of Distribution Substation Transformer SCADA System. IAES Int. J. Artif. Intell. 2021, 10, 298–305. [Google Scholar] [CrossRef]
- A. M. Abdu et al. Machine Learning for Plant Disease Detection: An Investigative Comparison Between SVM and Deep Learning. IAES Int. J. Artif. Intell. 2020, 9, 670–683. [Google Scholar] [CrossRef]
- A. A. Ojugo and D. O. Otakore. Intelligent Cluster Connectionist Recommender System Using Implicit Graph Friendship Algorithm for Social Networks. IAES Int. J. Artif. Intell. 2020, 9, 497–506. [Google Scholar] [CrossRef]
- T. Mohd et al. Machine Learning Building Price Prediction with Green Building Determinant. IAES Int. J. Artif. Intell. 2020, 9, 379–386. [Google Scholar] [CrossRef]
- D. Srinivas and E. V. Prasad. Adaptive density-based localization algorithm using particle swarm optimization and DBSCAN clustering approach. Int. J. Comput. Appl., 2021. [Online]. Available: https://search.proquest.com/openview/b82378649d6d60e7a55b347211baf3a0.
- H. El-Zeheiry, M. Elmogy, and N. Elaraby. Fuzzy C-mean and density-based spatial clustering for internet of things data processing. in Medical Big Data and Internet of Medical Things: Advances, Challenges, and Applications, CRC Press, 2018. [Online]. Available: https://www.taylorfrancis.com/chapters/edit/10.1201/9781351030380-7.
- R. M. Devadas, V. Hiremani, and R. V. Bidwe. Hybrid Clustering With Quantum Particle Swarm Optimization Initialization for Fuzzy C-Means and DBSCAN. IEEE, 2024. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/10690825.
- M. H. Rad and M. Abdolrazzagh-Nezhad. A fuzzy-based hybrid density-based clustering with Earthworm Optimization for data cube clustering. Soft Comput. 2020, 24. [Online]. Available: https://link.springer.com/article/10.1007/s00500-020-04881-0.
- U. Kazemi and S. Soleimani. A flexible game-theoretic adjustable DBSCAN (GTAD): A new approach for urban big data clustering. Soft Comput. 2025. [Online]. Available: https://link.springer.com/article/10.1007/s00500-025-10405-5.
- M. Chaudhry, I. Shafi, M. Mahnoor, and D. L. R. Vargas. Fuzzy, density-based, and metaheuristic clustering: A systematic literature review. Symmetry 2023, 15, 1679, [Online]. Available: https://www.mdpi.com/2073-8994/15/9/1679. [Google Scholar]
- A. Kousis and C. Tjortjis. Data mining algorithms for smart cities: A bibliometric analysis. Algorithms 2021, 14, 242, [Online]. Available: https://www.mdpi.com/1999-4893/14/8/242. [Google Scholar] [CrossRef]


| Method | Pre-2010 Data | Post-2010 Data |
| Proposed Method | 0.82 | 0.75 |
| Standard DBSCAN (best) | 0.67 | 0.66 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).