Submitted:
14 March 2026
Posted:
17 March 2026
You are already at the latest version
Abstract
Keywords:
Introduction

Data Sources and Collection
- ○
- GPS Data from Vehicles and Smartphones:
- ○
- Internet of Things (IoT) Sensors:
- ○
- Ride Sharing Platforms:
- ○
- Social Media Data:
- ○
- Public Transport Data:
Challenges in Data Collection and Integration
- ○
- Data Heterogeneity:
- ○
- Data Volume:
- ○
- Data Accuracy:
- ○
- Data Privacy:
Big Data Technologies for Smart Transportation
Recommended Big Data Tools and Frameworks
1. Apache Spark

Spark Architecture and Components

1. Driver Program
2. Cluster Manager

3. Executors
4. Resilient Distributed Dataset (RDD)
5. Streaming Module
6. MLlib (Machine Learning Library)
7. Spark SQL
8. Graphx
9. Apache Spark Core


Justification for Using Apache Spark
- ○
- Fast Processing
- ○
- Advanced Machine Learning Support
- ○
- Scalability
Conclusion
Conflicts of Interest
References
- Oladimeji, D.; Gupta, K.; Kose, N.A.; Gundogan, K.; Ge, L.; Liang, F. Smart transportation: An overview of technologies and applications. Sensors 2023, 23(8), 3880. [Google Scholar] [CrossRef]
- Fantin Irudaya Raj, E.; Appadurai, M. Internet of things-based smart transportation system for smart cities. In Intelligent Systems for Social Good: Theory and Practice; Springer Nature Singapore: Singapore, 2022; pp. 39–50. [Google Scholar]
- Singh, B. Federated learning for envision future trajectory smart transport system for climate preservation and smart green planet: Insights into global governance and SDG-9 (Industry, Innovation and Infrastructure). National Journal of Environmental Law 2023, 6(2), 6–17. [Google Scholar]
- Elassy, M.; Al-Hattab, M.; Takruri, M.; Badawi, S. Intelligent transportation systems for sustainable smart cities. Transportation Engineering 2024, 16, 100252. [Google Scholar] [CrossRef]
- Ray, S.K.; Pawlikowski, K.; Sirisena, H. A fast MAC-layer handover for an IEEE 802.16 e-based WMAN. International Conference on Access Networks, 2008, October; Springer Berlin Heidelberg: Berlin, Heidelberg; pp. 102–117. [Google Scholar]
- Kunjir, S.N.; Patil, S.S.; Hingane, B.S.; Pagariya, J.A.; Rashid, M. Managing smart urban transportation with the integration of big data analytic platform. 2023 6th International Conference on Contemporary Computing and Informatics (IC3I) 2023, Vol. 6, 1807–1811. [Google Scholar]
- Data analytics for intelligent transportation systems; Chowdhury, M., Dey, K., Apon, A., Eds.; Elsevier, 2024. [Google Scholar]
- Samaras, V.; Daskapan, S.; Ahmad, R.; Ray, S.K. An enterprise security architecture for accessing SaaS cloud services with BYOD. 2014 Australasian Telecommunication Networks and Applications Conference (ATNAC), 2014, November; IEEE; pp. 129–134. [Google Scholar]
- Niyazi, M.; Behnamian, J. Application of cloud computing and big data in three-stage dynamic modeling of disaster relief logistics and wounded transportation: A case study. Environmental Science and Pollution Research 2023, 30(13), 38121–38140. [Google Scholar] [CrossRef]
- Ushakov, D.; Dudukalov, E.; Kozlova, E.; Shatila, K. The Internet of Things impact on smart public transportation. Transportation Research Procedia 2022, 63, 2392–2400. [Google Scholar] [CrossRef]
- Dey, K.; Ray, S.; Bhattacharyya, P.K.; Gangopadhyay, A.; Bhasin, K.K.; Verma, R.D. Salicyladehyde 4-methoxybenzoylhydrazone and diacetylbis (4-methoxybenzoylhydrazone) as ligands for tin, lead and zirconium. J. Indian Chem. Soc. 1985, 62(11). [Google Scholar]
- Dritsas, E.; Trigka, M. Applying machine learning on big data with Apache Spark. In IEEE Access; 2025. [Google Scholar]
- Anwesa Chaudhuri, A.C.; Sanjib Ray, S.R. Antiproliferative activity of phytochemicals present in aerial parts aqueous extract of Ampelocissus latifolia (Roxb.) Planch. on apical meristem cells. Int. J. Pharma Bio Sci. 2015, 6, 99–107. [Google Scholar]
- Azeem, M.; Abualsoud, B.M.; Priyadarshana, D. Mobile big data analytics using deep learning and Apache Spark. Mesopotamian Journal of Big Data 2023, 2023, 16–28. [Google Scholar] [CrossRef]
- Alexakis, T.; Peppes, N.; Demestichas, K.; Adamopoulou, E. A distributed big data analytics architecture for vehicle sensor data. Sensors 2022, 23(1), 357. [Google Scholar] [CrossRef]
- Ray, S.K.; Sirisena, H.; Deka, D. LTE-Advanced handover: An orientation matching-based fast and reliable approach. 38th annual IEEE conference on local computer networks, 2013, October; IEEE; pp. 280–283. [Google Scholar]
- Ushakov, D.; Dudukalov, E.; Kozlova, E.; Shatila, K. The Internet of Things impact on smart public transportation. Transportation Research Procedia 2022, 63, 2392–2400. [Google Scholar] [CrossRef]
- Gul, O.M. Blockchain-enabled Internet of Things (IoTs) platforms for vehicle sensing and transportation monitoring. Machine Learning, Blockchain Technologies and Big Data Analytics for IoTs: Methods, Technologies and Applications 2022, 351–373. [Google Scholar]
- Fantin Irudaya Raj, E.; Appadurai, M. Internet of things-based smart transportation system for smart cities. In Intelligent Systems for Social Good: Theory and Practice; Springer Nature Singapore: Singapore, 2022; pp. 39–50. [Google Scholar]
- Shah, I.A.; Jhanjhi, N.Z.; Laraib, A. Cybersecurity and blockchain usage in contemporary business. In Handbook of Research on Cybersecurity Issues and Challenges for Business and FinTech Applications; IGI Global Scientific Publishing, 2023; pp. 49–64. [Google Scholar]
- Younas, F. Factors influencing consumer choice in ride sharing platform: A study on Uber and Bolt in Helsinki. 2025. [Google Scholar]
- Azeem, M.; Ullah, A.; Ashraf, H.; Jhanjhi, N.Z.; Humayun, M.; Aljahdali, S.; Tabbakh, T.A. Fog-oriented secure and lightweight data aggregation in iomt. IEEE Access 2021, 9, 111072–111082. [Google Scholar] [CrossRef]
- Ker, S.K.; Liu, S. Uber Freight: Assessment and Determination of Optimal Design Features for a Drop Trailer Service Offering and Network. Doctoral dissertation, 2022. [Google Scholar]
- Lee, S.; Abdullah, A.; Jhanjhi, N.Z. A review on honeypot-based botnet detection models for smart factory. International Journal of Advanced Computer Science and Applications 2020, 11(6). [Google Scholar] [CrossRef]
- Agarwal, P.; Alam, M. Open service platforms for IoT. In Internet of Things (IoT) Concepts and Applications; Springer International Publishing: Cham, 2020; pp. 43–59. [Google Scholar]
- Muzafar, S.; Jhanjhi, N.Z. Success stories of ICT implementation in Saudi Arabia. In Employing Recent Technologies for Improved Digital Governance; IGI Global Scientific Publishing, 2020; pp. 151–163. [Google Scholar]
- Li, D.; Zhang, Y.; Li, C. Mining public opinion on transportation systems based on social media data. Sustainability 2019, 11(15), 4016. [Google Scholar] [CrossRef]
- Endarnoto, S.K.; Pradipta, S.; Nugroho, A.S.; Purnama, J. Traffic condition information extraction & visualization from social media twitter for android mobile application. In Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, 2011, July; IEEE; pp. 1–4. [Google Scholar]
- Liu, D.; Guo, J.; Gu, Y.; King, M.; Han, L.D.; Brakewood, C. Analyzing Transit Systems Using General Transit Feed Specification (GTFS) by Generating Spatiotemporal Transit Networks. Information 2025, 16(1), 24. [Google Scholar] [CrossRef]
- Musa, A.A.; Malami, S.I.; Alanazi, F.; Ounaies, W.; Alshammari, M.; Haruna, S.I. Sustainable traffic management for smart cities using internet-of-things-oriented intelligent transportation systems (ITS): Challenges and recommendations. Sustainability 2023, 15(13), 9859. [Google Scholar] [CrossRef]
- JingXuan, C.; Tayyab, M.; Muzammal, S.M.; Jhanjhi, N.Z.; Ray, S.K.; Ashfaq, F. Integrating AI with robotic process automation (RPA): Advancing intelligent automation systems. 2024 IEEE 29th Asia Pacific Conference on Communications (APCC), 2024, November; IEEE; pp. 259–265. [Google Scholar]
- Abdulkarim, S.; Mohktar, S.; Babaji, N.A.; Maisalati, M.S.; Musa, M.K.; Pwara, K.E. Roles of road transport policy on economic developmentin federal capital territory Abuja, Nigeria: Atakeholders perception. Management 2021, 7(27), 179–197. [Google Scholar] [CrossRef]
- Abdulkarim, S.; Mohktar, S.; Babaji, N.A.; Maisalati, M.S.; Musa, M.K.; Pwara, K.E. Roles of road transport policy on economic developmentin federal capital territory Abuja, Nigeria: Stakeholders perception. Management 2021, 7(27), 179–197. [Google Scholar] [CrossRef]
- Javed, D.; Jhanjhi, N.Z.; Ashfaq, F.; Khan, N.A.; Das, S.R.; Singh, S. Student performance analysis to identify the students at risk of failure. 2024 International Conference on Emerging Trends in Networks and Computer Communications (ETNCC), 2024, July; IEEE; pp. 1–6. [Google Scholar]
- Tang, L.; Na, S. Comparison of machine learning methods for ground settlement prediction with different tunneling datasets. Journal of Rock Mechanics and Geotechnical Engineering 2021, 13(6), 1274–1289. [Google Scholar] [CrossRef]
- Bora, P.S.; Sharma, S.; Batra, I.; Malik, A.; Ashfaq, F. Identification and classification of rare medicinal plants. 2024 International Conference on Emerging Trends in Networks and Computer Communications (ETNCC), 2024, July; IEEE; pp. 1–6. [Google Scholar]
- Shreyas, S.K.; Dey, A. Application of soft computing techniques in tunnelling and underground excavations: State of the art and future prospects. Innovative Infrastructure Solutions 2019, 4(1), 46. [Google Scholar] [CrossRef]
- Badu-Marfo, G.; Farooq, B.; Patterson, Z. A perspective on the challenges and opportunities for privacy-aware big transportation data. Journal of Big Data Analytics in Transportation 2019, 1(1), 1–23. [Google Scholar] [CrossRef]
- Miao, Y.; Yang, Y.; Li, X.; Choo, K.K.R.; Meng, X.; Deng, R.H. Comprehensive survey on privacy-preserving spatial data query in transportation systems. IEEE Transactions on Intelligent Transportation Systems 2023, 24(12), 13603–13616. [Google Scholar] [CrossRef]
- Jabeen, T.; Jabeen, I.; Ashraf, H.; Ullah, A.; Jhanjhi, N.Z.; Ghoniem, R.M.; Ray, S.K. Smart wireless sensor technology for healthcare monitoring system using cognitive radio networks. Sensors 2023, 23(13), 6104. [Google Scholar] [CrossRef]
- Callegati, F.; Campi, A.; Melis, A.; Prandini, M.; Zevenbergen, B. Privacy-preserving design of data processing systems in the public transport context. Pacific Asia Journal of the Association for Information Systems 2015, 7(4), 4. [Google Scholar] [CrossRef]
- Singha, S.; Singha, R. Protecting data and privacy: Cloud-based solutions for intelligent transportation applications. Scalable computing: Practice and experience 2023, 24(3), 257–276. [Google Scholar] [CrossRef]
- Sui, P.; Li, X.; Bai, Y. A study of enhancing privacy for intelligent transportation systems: $ k $-correlation privacy model against moving preference attacks for location trajectory data. IEEE Access 2017, 5, 24555–24567. [Google Scholar] [CrossRef]
- Ying, Z.; Cao, S.; Liu, X.; Ma, Z.; Ma, J.; Deng, R.H. PrivacySignal: Privacy-preserving traffic signal control for intelligent transportation system. IEEE Transactions on Intelligent Transportation Systems 2022, 23(9), 16290–16303. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.