Submitted:
30 December 2022
Posted:
04 January 2023
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
- Utilising GPT-3 to gather mobile phone release prices on a large scale, which is used to estimate the socioeconomic status of the subscribers.
- The socioeconomic status distribution of a large social event attendees is evaluated.
- Mobile cells along the riverbank are clustered based on their attendees’ mobility indicators and socioeconomic status.
2. Literature Review
2.1. Call Detail Records
2.2. Mobility Indicators
2.2.1. Radius of Gyration
2.2.2. K-Radius of Gyration
2.2.3. Entropy
2.3. Socioeconomic Status Analysis
2.4. GPT-3
2.5. Clustering
3. Materials and Methods
3.1. Data Sources
3.2. Data Preprocessing
3.3. Data Collection with OpenAI’s GPT-3
3.4. Event Analysis
3.5. Clustering
4. Results and Discussion
4.1. Data Processing Framework
4.2. Large Social Event Analysis
4.3. Clustering
4.3.1. Mobility Clusters
- H++ - High radius of gyration and high entropy.
- H/E–R+ - High metrics overall, lower entropy and higher radius of gyration.
- H/E+R– - High metrics overall, lower radius of gyration and higher entropy.
- M - Medium metrics.
- L/E–R+ - Low metrics overall, lower entropy and higher radius of gyration.
- L/E+R– - Low metrics overall, lower radius of gyration and higher entropy.
4.3.2. Socioeconomic Status Clusters
4.3.3. Mobility and Socioeconomic Clusters
4.4. Limitations
4.5. Future Work
5. Conclusions
Author Contributions
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| API | Application Programming Interface |
| CDR | Call Detail Record |
| EU | European Union |
| EUR | Euro |
| GPS | Global Positioning System |
| GPT | Generative Pre-trained Transformer |
| GSM | Global System for Mobile Communications |
| HUF | Hungarian Forint |
| ID | Identification |
| IMEI | International Mobile Equipment Identity |
| SES | Social Economic Status |
| SIM | Subscriber Identity Module |
| TAC | Type Allocation Code |
| UEFA | Union of European Football Associations |
| USD | United States dollar |
References
- United Nations, Department of Economic and Social Affairs, Population Division. World Urbanization Prospects: The 2018 Revision (ST/ESA/SER.A/420), 2019.
- Mamei, M.; Colonna, M. Estimating attendance from cellular network data. International Journal of Geographical Information Science 2016, 30, 1281–1301. [Google Scholar] [CrossRef]
- Furletti, B.; Trasarti, R.; Cintia, P.; Gabrielli, L. Discovering and understanding city events with big data: the case of rome. Information 2017, 8, 74. [Google Scholar] [CrossRef]
- Hiir, H.; Sharma, R.; Aasa, A.; Saluveer, E. Impact of Natural and Social Events on Mobile Call Data Records – An Estonian Case Study. Complex Networks and Their Applications VIII; Cherifi, H.; Gaito, S.; Mendes, J.F.; Moro, E.; Rocha, L.M., Eds.; Springer International Publishing: Cham, 2020; pp. 415–426.
- Rotman, A.; Shalev, M. Using Location Data from Mobile Phones to Study Participation in Mass Protests. Sociological Methods & Research, 2020; 0049124120914926. [Google Scholar]
- Pintér, G.; Felde, I. Analyzing the Behavior and Financial Status of Soccer Fans from a Mobile Phone Network Perspective: Euro 2016, a Case Study. Information 2021, 12, 468. [Google Scholar] [CrossRef]
- Pintér, G.; Felde, I. Awakening City: Traces of the Circadian Rhythm within the Mobile Phone Network Data. Information 2022, 13, 114. [Google Scholar] [CrossRef]
- Dale, R. GPT-3: What’s it good for? Natural Language Engineering 2021, 27, 113–118. [Google Scholar] [CrossRef]
- Candia, J.; González, M.C.; Wang, P.; Schoenharl, T.; Madey, G.; Barabási, A.L. Uncovering individual and collective human dynamics from mobile phone records. Journal of physics A: mathematical and theoretical 2008, 41, 224015. [Google Scholar] [CrossRef]
- Gonzalez, M.C.; Hidalgo, C.A.; Barabasi, A.L. Understanding individual human mobility patterns. nature 2008, 453, 779–782. [Google Scholar] [CrossRef] [PubMed]
- Pappalardo, L.; Simini, F.; Rinzivillo, S.; Pedreschi, D.; Giannotti, F.; Barabási, A.L. Returners and explorers dichotomy in human mobility. Nature communications 2015, 6, 1–8. [Google Scholar] [CrossRef] [PubMed]
- Andrienko, N.; Andrienko, G. Designing visual analytics methods for massive collections of movement data. Cartographica: The International Journal for Geographic Information and Geovisualization 2007, 42, 117–138. [Google Scholar] [CrossRef]
- Calabrese, F.; Colonna, M.; Lovisolo, P.; Parata, D.; Ratti, C. Real-time urban monitoring using cell phones: A case study in Rome. IEEE transactions on intelligent transportation systems 2010, 12, 141–151. [Google Scholar] [CrossRef]
- Sagl, G.; Loidl, M.; Beinat, E. A visual analytics approach for extracting spatio-temporal urban mobility information from mobile network traffic. ISPRS International Journal of Geo-Information 2012, 1, 256–271. [Google Scholar] [CrossRef]
- Senaratne, H.; Mueller, M.; Behrisch, M.; Lalanne, F.; Bustos-Jiménez, J.; Schneidewind, J.; Keim, D.; Schreck, T. Urban mobility analysis with mobile network data: A visual analytics approach. IEEE Transactions on Intelligent Transportation Systems 2017, 19, 1537–1546. [Google Scholar] [CrossRef]
- Wirz, M.; Franke, T.; Roggen, D.; Mitleton-Kelly, E.; Lukowicz, P.; Tröster, G. Probing crowd density through smartphones in city-scale mass gatherings. EPJ Data Science 2013, 2, 1–24. [Google Scholar] [CrossRef]
- Barnett, I.; Khanna, T.; Onnela, J.P. Social and spatial clustering of people at humanity’s largest gathering. PLOS ONE 2016, 11, 1–12. [Google Scholar] [CrossRef] [PubMed]
- Hiir, H.; Sharma, R.; Aasa, A.; Saluveer, E. Impact of Natural and Social Events on Mobile Call Data Records – An Estonian Case Study. Complex Networks and Their Applications VIII. Springer, 2019, pp. 415–426.
- Xu, Y.; Belyi, A.; Bojic, I.; Ratti, C. Human mobility and socioeconomic status: Analysis of Singapore and Boston. Computers, Environment and Urban Systems 2018, 72, 51–67. [Google Scholar] [CrossRef]
- Stowe, K. An Introduction to Thermodynamics and Statistical Mechanics; Cambridge University Press, 2007. [Google Scholar]
- Song, C.; Qu, Z.; Blumm, N.; Barabási, A.L. Limits of predictability in human mobility. Science 2010, 327, 1018–1021. [Google Scholar] [CrossRef] [PubMed]
- Pappalardo, L.; Vanhoof, M.; Gabrielli, L.; Smoreda, Z.; Pedreschi, D.; Giannotti, F. An analytical framework to nowcast well-being using mobile phone data. International Journal of Data Science and Analytics 2016, 2, 75–92. [Google Scholar] [CrossRef]
- Pintér, G.; Nadai, L.; Bognár, G.; Felde, I. Evaluation of mobile phone signals in urban environment during a large social event. 2018 IEEE 12th International Symposium on Applied Computational Intelligence and Informatics (SACI). IEEE, 2018, pp. 247–250.
- Preoţiuc-Pietro, D.; Lampos, V.; Aletras, N. An analysis of the user occupational class through Twitter content. 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2015, pp. 1754–1764.
- Preoţiuc-Pietro, D.; Volkova, S.; Lampos, V.; Bachrach, Y.; Aletras, N. Studying user income through language, behaviour and affect in social media. PloS one 2015, 10, e0138717. [Google Scholar] [CrossRef] [PubMed]
- Lampos, V.; Aletras, N.; Geyti, J.K.; Zou, B.; Cox, I.J. Inferring the socioeconomic status of social media users based on behaviour and language. European conference on information retrieval. Springer, 2016, pp. 689–695.
- Huang, Q.; Wong, D.W. Activity patterns, socioeconomic status and urban spatial structure: what can social media data tell us? International Journal of Geographical Information Science 2016, 30, 1873–1898. [Google Scholar] [CrossRef]
- Soto, V.; Frias-Martinez, V.; Virseda, J.; Frias-Martinez, E. Prediction of socioeconomic levels using cell phone records. International conference on user modeling, adaptation, and personalization. Springer, 2011, pp. 377–388.
- Frias-Martinez, V.; Virseda, J. Cell phone analytics: Scaling human behavior studies into the millions. Information Technologies & International Development 2013, 9, 35–50. [Google Scholar]
- Blumenstock, J.; Cadamuro, G.; On, R. Predicting poverty and wealth from mobile phone metadata. Science 2015, 350, 1073–1076. [Google Scholar] [CrossRef] [PubMed]
- Almaatouq, A.; Prieto-Castrillo, F.; Pentland, A. Mobile communication signatures of unemployment. International conference on social informatics. Springer, 2016, pp. 407–418.
- Ding, S.; Huang, H.; Zhao, T.; Fu, X. Estimating Socioeconomic Status via Temporal-Spatial Mobility Analysis - A Case Study of Smart Card Data. 2019 28th International Conference on Computer Communication and Networks (ICCCN), 2019, pp. 1–9.
- Barbosa, H.; Hazarie, S.; Dickinson, B.; Bassolas, A.; Frank, A.; Kautz, H.; Sadilek, A.; Ramasco, J.J.; Ghoshal, G. Uncovering the socioeconomic facets of human mobility. Scientific reports 2021, 11, 1–13. [Google Scholar] [CrossRef] [PubMed]
- Sultan, S.F.; Humayun, H.; Nadeem, U.; Bhatti, Z.K.; Khan, S. Mobile phone price as a proxy for socio-economic indicators. Proceedings of the Seventh International Conference on Information and Communication Technologies and Development, 2015, pp. 1–4.
- Wijesinghe, W.; Kumarasinghe, C.; Mannapperuma, J.; Liyanage, K. Socioeconomic status classification of geographic regions in Sri Lanka through anonymized call detail records. EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing. Springer, 2020, pp. 299–311.
- Pintér, G.; Felde, I. Evaluating the Effect of the Financial Status to the Mobility Customs. ISPRS International Journal of Geo-Information 2021, 10, 328. [Google Scholar] [CrossRef]
- Brown, T.B.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language Models are Few-Shot Learners, 2020.
- Thiergart, J.; Huber, S.; Übellacker, T. Understanding Emails and Drafting Responses – An Approach Using GPT-3, 2021.
- Chintagunta, B.; Katariya, N.; Amatriain, X.; Kannan, A. Medically Aware GPT-3 as a Data Generator for Medical Dialogue Summarization. Proceedings of the 6th Machine Learning for Healthcare Conference; Jung, K.; Yeung, S.; Sendak, M.; Sjoding, M.; Ranganath, R., Eds. PMLR, 2021, Vol. 149, Proceedings of Machine Learning Research, pp. 354–372.
- Goyal, T.; Li, J.J.; Durrett, G. News Summarization and Evaluation in the Era of GPT-3, 2022.
- Wang, S.; Liu, Y.; Xu, Y.; Zhu, C.; Zeng, M. Want To Reduce Labeling Cost? GPT-3 Can Help, 2021.
- Bishop, C.M.; Nasrabadi, N.M. Pattern recognition and machine learning; Vol. 4, Springer, 2006.
- Estivill-Castro, V. Why so many clustering algorithms: a position paper. ACM SIGKDD explorations newsletter 2002, 4, 65–75. [Google Scholar] [CrossRef]
- Yang, P.; Zhu, T.; Wan, X.; Wang, X. Identifying significant places using multi-day call detail records. 2014 IEEE 26th International Conference on Tools with Artificial Intelligence. IEEE, 2014, pp. 360–366.
- Qin, S.; Zuo, Y.; Wang, Y.; Sun, X.; Dong, H. Travel trajectories analysis based on call detail record data. 2017 29th Chinese Control And Decision Conference (CCDC). IEEE, 2017, pp. 7051–7056.
- Nair, S.C.; Elayidom, M.S.; Gopalan, S. Call detail record-based traffic density analysis using global K-means clustering. International Journal of Intelligent Enterprise 2020, 7, 176–187. [Google Scholar] [CrossRef]
- Gil-Garcia, R.J.; Badia-Contelles, J.M.; Pons-Porrata, A. A general framework for agglomerative hierarchical clustering algorithms. 18th International Conference on Pattern Recognition (ICPR’06). IEEE, 2006, Vol. 2, pp. 569–572.
- Ghahramani, M.; Zhou, M.; Hon, C.T. Extracting significant mobile phone interaction patterns based on community structures. IEEE Transactions on Intelligent Transportation Systems 2018, 20, 1031–1041. [Google Scholar] [CrossRef]
- Li, M.; Gao, S.; Lu, F.; Zhang, H. Reconstruction of human movement trajectories from large-scale low-frequency mobile phone data. Computers, Environment and Urban Systems 2019, 77, 101346. [Google Scholar] [CrossRef]
- Wang, H.; Calabrese, F.; Di Lorenzo, G.; Ratti, C. Transportation mode inference from anonymized and aggregated mobile phone call detail records. 13th International IEEE Conference on Intelligent Transportation Systems. IEEE, 2010, pp. 318–323.
- Sultan, K.; Ali, H.; Zhang, Z. Call detail records driven anomaly detection and traffic prediction in mobile cellular networks. IEEE Access 2018, 6, 41728–41737. [Google Scholar] [CrossRef]
- Al-Zuabi, I.M.; Jafar, A.; Aljoumaa, K. Predicting customer’s gender and age depending on mobile phone data. Journal of Big Data 2019, 6, 1–16. [Google Scholar] [CrossRef]
- Lenormand, M.; Samaniego, H.; Chaves, J.C.; da Fonseca Vieira, V.; da Silva, M.A.H.B.; Evsukoff, A.G. Entropy as a measure of attractiveness and socioeconomic complexity in Rio de Janeiro metropolitan area. Entropy 2020, 22, 368. [Google Scholar] [CrossRef]
- Ghahramani, M.; Zhou, M.C.; Qiao, Y.; Wu, N.Q. Spatio-Temporal Analysis of Mobile Phone Network based on Self-Organizing Feature Map. IEEE Internet of Things Journal 2021. [Google Scholar]
- Zhao, T.; Huang, H.; Yao, X.; Luo, J.d.; Fu, X. Predicting individual socioeconomic status from mobile phone data: a semi-supervised hypergraph-based factor graph approach. International Journal of Data Science and Analytics 2020, 9, 361–372. [Google Scholar] [CrossRef]
- Research summary for the National Media and Infocommunications Authority. Electronic Communication Services Usage by Households and Individuals, 2014.
- Hungarian Central Statistical Office. Gazetteer of Hungary 1st January, 2014; Hungarian Central Statistical Office, 2014. [Google Scholar]
- OpenAI. API. https://openai.com/api, accessed on 15.12.2022.
- szifon.com. Elérhetoek a független iPhone 4S árak a magyar Apple Store-ban! https://szifon.com/2011/10/21/elerhetoek-a-fuggetlen-iphone-4s-arak-a-magyar-apple-store-ban, accessed on 20.12.2022.
- Protalinski, E. iPhone prices from the original to iPhone X. https://venturebeat.com/2017/09/12/iphone-prices-from-the-original-to-iphone-x, accessed on 14.02.2022.
- Sainani, M. GSMArena Mobile Phone Devices. https://www.kaggle.com/msainani/gsmarena-mobile-devices, accessed on 02.03.2022.
- Bholowalia, P.; Kumar, A. EBK-means: A clustering technique based on elbow method and k-means in WSN. International Journal of Computer Applications 2014, 105. [Google Scholar]
- Szabó, K.; Pintér, G.; Felde, I. Evaluating the Socioeconomic Status of a Large Social Event Attendees. 2022 IEEE 16th International Symposium on Applied Computational Intelligence and Informatics (SACI), 2022, pp. 77–80.
- Pappalardo, L.; Pedreschi, D.; Smoreda, Z.; Giannotti, F. Using big data to study the link between human mobility and socio-economic development. 2015 IEEE International Conference on Big Data (Big Data). IEEE, 2015, pp. 871–878.













Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).