Submitted:
20 October 2023
Posted:
23 October 2023
You are already at the latest version
Abstract
Keywords:Â
1. Introduction
2. Related work
3. Methods and materials
3.1. The stage of data collection and filtering
- Data cleaning: The task consists of preparing the data for future processing and analysis. It is based on applying a linguistic extraction process as follows. First, a customized stop-words list is defined to filter the tweetsâ dataset. So, when there is a match between the list of stop-words and the words of tweets, these are removed. This stop-words list identifies critical terms on explorations and key terms useful for the following data exploration task. The stop-words list also contains pronouns, symbols, and everyday words usually reported in opinions. An example of stop-words is the protocol name (e.g., âhttpsâ), emotional expressions (e.g., âJajajaâ), and irrelevant verbs. On the other hand, some examples of keyword terms for data exploration are specific names of accounts, such as SSP (Spanish acronym of the Ministry of Public Security) and politiciansâ names, among others.
- Open data employing: Each stage mentioned above used open government data published by the Mexican government due to the impact produced by the gasoline crisis. These open datasets were collected in 2019 and made available in CSV format. The open data include information concerning gasoline clandestine pipelines and their location per state, the number of monthly occurred events, and the number of fuel thefts documented by the government in each particular state. Furthermore, a geospatial open dataset containing the country statesâ administrative boundaries has also been considered. This dataset was provided by the Mexican Institute of Statistics and Geography (INEGI) and is available in shapefile format, presenting information at the 1:250,000 scale. In addition, the geospatial data contains the following attributes: "<idState>", "<name>", and "<geometry>" with the WGS84 geo-referenced style.
3.2. The discovery insights
![]() |
3.3. The spatiotemporal insight analysis
- Dimensions: [State][Month][Year]
- Measure: [Sum(CrimeEvents)]
-
Dimensions: [idTweet][Topic][Date-Time],[Latitude][Longitude][idMexicanState]
- Measure: [Sum(ClusterSize)]
-
Dimensions: [idTweet][Topic][Date-Time],[Latitude][Longitude][idMexicanState][Polarity]
- Measure: [Count(Polarity)]
4. Experimental results
4.1. Results of the geographic and social topics distribution
4.2. Results of the geographic polarization distribution
5. Conclusions and future work
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Banda, K.K.; Cluverius, J. Elite polarization, party extremity, and affective polarization. Electoral Studies 2018, 56, 90â101. [Google Scholar] [CrossRef]
- Rathnayake, C.; Suthers, D.D. âEnclaves of exposureâ: A conceptual viewpoint to explore cross-ideology exposure on social network sites. The Social Science Journal 2019, 56, 145â155. [Google Scholar] [CrossRef]
- Nohuddin, P.N.; Coenen, F.; Christley, R.; Setzkorn, C.; Patel, Y.; Williams, S. Finding âinterestingâ trends in social networks using frequent pattern mining and self organizing maps. Knowledge-Based Systems 2012, 29, 104â113. [Google Scholar] [CrossRef]
- Jancsics, D. Interdisciplinary perspectives on corruption. Sociology Compass 2014, 8, 358â372. [Google Scholar] [CrossRef]
- Lan, X.; Li, W. Swiss watch cycles: Evidence of corruption during leadership transition in China. Journal of Comparative Economics 2018, 46, 1234â1252. [Google Scholar] [CrossRef]
- Ferraresi, M.; Kotsogiannis, C.; Rizzo, L. Decentralization and fuel subsidies. Energy Economics 2018, 74, 275â286. [Google Scholar] [CrossRef]
- Silvestre, H.C.; Gomes, R.C.; Lamba, J.R.; Correia, A.M. Implementation of Brazilâs energy policy through the national oil company: From institutional chaos to strategic order. Energy policy 2018, 119, 87â96. [Google Scholar] [CrossRef]
- Lee, C.; Shin, J.; Hong, A. Does social media use really make people politically polarized? Direct and indirect effects of social media use on political polarization in South Korea. Telematics and Informatics 2018, 35, 245â254. [Google Scholar] [CrossRef]
- Ali, M.; Azab, N.; Sorour, M.K.; Dora, M. Integration vs. polarisation among social media users: Perspectives through social capital theory on the recent Egyptian political landscape. Technological Forecasting and Social Change 2019, 145, 461â473. [Google Scholar] [CrossRef]
- StĂžttrup, J.G.; Kokkalis, A.; Brown, E.J.; Olsen, J.; Andersen, S.K.; Pedersen, E.M. Harvesting geo-spatial data on coastal fish assemblages through coordinated citizen science. Fisheries Research 2018, 208, 86â96. [Google Scholar] [CrossRef]
- Yao, H.; Xiong, M.; Zeng, D.; Gong, J. Mining multiple spatialâtemporal paths from social media data. Future Generation Computer Systems 2018, 87, 782â791. [Google Scholar] [CrossRef]
- Sinclair, M.; Ghermandi, A.; Moses, S.A.; Joseph, S. Recreation and environmental quality of tropical wetlands: A social media based spatial analysis. Tourism Management 2019, 71, 179â186. [Google Scholar] [CrossRef]
- Caverlee, J.; Cheng, Z.; Sui, D.Z.; Kamath, K.Y. Towards Geo-Social Intelligence: Mining, Analyzing, and Leveraging Geospatial Footprints in Social Media. IEEE Data Eng. Bull. 2013, 36, 33â41. [Google Scholar]
- Srivastava, A.N.; Sahami, M. Text mining: Classification, clustering, and applications; CRC press, 2009. [Google Scholar]
- Aggarwal, C.C.; Aggarwal, C.C. Mining text data; Springer, 2015. [Google Scholar]
- Miranda, C.A.; Rodriguez, R.C.; Zagal-Flores, R. Arquitectura Web para anĂĄlisis de sentimientos en Facebook con enfoque semĂĄntico. Res. Comput. Sci. 2014, 75, 59â69. [Google Scholar] [CrossRef]
- Specia, M. Long lines and guarded fuel: Mexicoâs gas crisis, explained. The New York Times 2019. [Google Scholar]
- Peschard Mariscal, J.; Salazar Rebolledo, M.G.; Olea GĂłmez, O.A. Âż QuĂ© sabemos del robo de combustible en MĂ©xico? Claroscuros de un delito que no cede. Revista mexicana de ciencias polĂticas y sociales 2021, 66, 245â280. [Google Scholar] [CrossRef]










Disclaimer/Publisherâs Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
