Submitted:
18 December 2025
Posted:
22 December 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work and Conceptual Background
2.1. Authoritative Geospatial Models: EDGV
2.2. OpenStreetMap: A Collaborative Folksonomy
2.3. The Semantic Alignment Challenge
2.4. AI Paradigms and Ontological Reasoning
3. Methodology
3.1. Ontology Construction
3.2. Language Models and Dialogue Prompt
| Listing 1. Prompt used in all experiments. |
![]() |
3.3. Evaluation Protocol
- Completeness in Semantic Alignment: number of classes accurately associated in relation to the total of 87 classes, attributes, and domains of the EDGV represented in the input ontology, considering the readability of the EDGV ontology and the OSM wiki. This analysis assessed geospatial data quality, specifically the completeness dimension, drawing on ISO standards [34]. The criteria for determining a commission were whether or not the association was considered appropriate to the ontology class, and for omission, the failure to make an association to the ontology class. The metric for determining the appropriate association was based on the authors’ observation and the conceptual definition of the elements in their original schemas.
- Syntactic Conformity: the ability to generate valid OWL code while maintaining structural integrity, for example, preserving the existing class hierarchy, consistently creating new classes representing the OSM tags as subclasses of OSM_Tags, maintenance of the original object properties and instances, and use of ‘EquivalentTo’ notation as an element to semantic association.
- Complex reasoning: test of LLM’s ability to infer domain rules, by making associations at different hierarchical levels (e.g. all religious buildings correspond to ‘amenity=place_of_worship’). Additionally, the ability to classify relationships into multiple associations (1:1, 1:n, and n:1) and identify the classes involved in many-to-one relationships.
4. Results
4.1. OpenAI (ChatGPT)
4.2. DeepSeek
4.3. Google (Gemini)
5. Discussion
5.1. Completeness in Semantic Alignment
5.2. Syntactic Conformity and Structural Integrity
5.3. Reasoning Depth: Hierarchical Levels and Multiplicity
5.4. Theoretical Implications and Comparison with Related Work
5.5. Limitations and Directions for Future Research
6. Conclusions
Supplementary Materials
Data Availability Statement
Conflicts of Interest
Abbreviations
| EDGV | Estruturação de Dados Geoespaciais Vetoriais (Brazilian Portuguese) |
| LLM | Large Language Model |
| LRM | Large Reasoning Model |
| OSM | OpenStreetMap |
| OWL | Ontology Web Language |
| 1 | Access to ChatGPT: https://openai.com/index/chatgpt/
|
| 2 | Access to DeepSeek: https://www.deepseek.com/
|
| 3 | Access to Gemini: https://gemini.google.com/app
|
References
- Rosch, E. (1975). Cognitive representations of semantic categories. Journal of Experimental Psychology: General, 104(3), 192–233. [CrossRef]
- Rosch, E. (1973). Natural categories. Cognitive Psychology, vol.4.
- Petchenik, B. B. (1977). Cognition In Cartography. Cartographica: The International Journal for Geographic Information and Geovisualization, 14(1), 117–128. [CrossRef]
- Fremlin, G., & Robinson, A. H. (1998). What Is It That Is Represented on a Topographical Map? Cartographica: The International Journal for Geographic Information and Geovisualization, 35(1-2), 13-19.
- Janowicz, K.; Scheider, S.; Adams, B. (2013). A Geo-semantics Flyby. In: (Eds) Rudolph S.; Gottlob G.; Horrocks, I.; Van Harmelen; F. Reasoning Web: Semantic Technologies for Intelligent Data Access. Reasoning Web 2013. Lecture Notes in Computer Science, v. 8067. Springer, Berlin, Heidelberg, p. 230-250.
- Machado, A. A., & Camboim, S. P. (2024). Semantic Alignment of Official and Collaborative Geospatial Data: A Case Study in Brazil. Revista Brasileira de Cartografia, 76. Scopus. [CrossRef]
- Souza, F. A., da Silva, E. D. B., & Camboim, S. P. (2025). Explorando o Uso de Large Language Model (ChatGPT) para Alinhamento Semântico entre Esquemas Conceituais de Dados Geoespaciais. Rev. Bras. Cartogr, 77, 1.
- Souza, F. A., & Camboim, S. P. (2024). Advancing Geospatial Data Integration: The Role of Prompt Engineering in Semantic Association with chatGPT. Free and Open-Source Software for Geospatial 2024 (FOSS4G 2024), Belém,PA,Brazil, 02-08-December, 2024 (Session Academic Track, Part Full Papers, pp 87-92). https://zenodo.org/records/14250739.
- Souza, F. A., & Camboim, S. P. (2023). Semantic Alignment of Geospatial Data Models using chatGPT: preliminary studies. In da Fonseca Feitosa F. & Vinhas L. (Eds.), Proc. Brazilian Symp. GeoInformatics (pp. 399–404). National Institute for Space Research, INPE; Scopus. https://www.scopus.com/inward/record.uri?eid=2-s2.0-85181118913&partnerID=40&md5=45de9b24f4242bc1e4306f46b84a1ed0.
- Anand, S., Morley, J., Jiang, W., Du, H., & Hart, G. (2010). When worlds collide: Combining Ordnance Survey and Open Street Map data. In: AGI Geocommunity ‘10, London, UK.
- Ballatore, A., Bertolotto, M., & Wilson, D. C. (2013). Geographic knowledge extraction and semantic similarity in OpenStreetMap. Knowledge and Information Systems, 37(1), 61–81. [CrossRef]
- Du, H., Alechina, N., Jackson, M., & Hart, G. (2013). Matching Formal and Informal Geospatial Ontologies. In D. Vandenbroucke, B. Bucher, & J. Crompvoets (Eds.), Geographic Information Science at the Heart of Europe (pp. 155–171). Springer International Publishing. [CrossRef]
- Yu, L., Qiu, P., Liu, X., Lu, F., & Wan, B. (2018). A holistic approach to aligning geospatial data with multidimensional similarity measuring. International Journal of Digital Earth, 11(8), 845–862. [CrossRef]
- Romanenko, E., Calvanese, D., & Guizzardi, G. (2024). Evaluating quality of ontology-driven conceptual models abstractions. Data & Knowledge Engineering, 153, 102342. [CrossRef]
- Kang, Y., Gao, S., & Roth, R. (2024). Artificial intelligence studies in cartography: A review and synthesis of methods, applications, and ethics. Cartography and Geographic Information Science. [CrossRef]
- Mirzadeh, I., Alizadeh, K., Shahrokhi, H., Tuzel, O., Bengio, S., & Farajtabar, M. (2024). GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models (arXiv:2410.05229). arXiv. [CrossRef]
- Tucker, S. (2024). A systematic review of geospatial location embedding approaches in large language models: A path to spatial AI systems. arXiv, Jan. 12, 2024. doi: 10.48550. arXiv preprint arXiv.2401.10279 .
- Valmeekam, K., Stechly, K., & Kambhampati, S. (2024). LLMs Still Can’t Plan; Can LRMs? A Preliminary Evaluation of OpenAI’s o1 on PlanBench (arXiv:2409.13373). arXiv. [CrossRef]
- Concar. (2017). Comissão Nacional de Cartografia. Especificações Técnicas para Estruturação de Dados Geoespaciais Vetoriais (ET-EDGV 3.0). NCB-CC/E 0001B08. Versão 3.0.
- Grinberger, A. Y., Minghini, M., Juhász, L., Yeboah, G., & Mooney, P. (2022). OSM Science - The Academic Study of the OpenStreetMap Project, Data, Contributors, Community, and Applications. ISPRS International Journal of Geo-Information, 11(4), 230. [CrossRef]
- Kaur, J., Singh, J., Sehra, S. S., & Rai, H. S. (2017). Systematic Literature Review of Data Quality Within OpenStreetMap. 2017 International Conference on Next Generation Computing and Information Systems (ICNGCIS), 177–182. [CrossRef]
- OSM. (2025). OpenStreetMap: Map Features. https://wiki.openstreetmap.org/wiki/Map_features.
- Liang, B., Wang, Y., & Tong, C. (2025). AI Reasoning in Deep Learning Era: From Symbolic AI to Neural–Symbolic AI. Mathematics, 13(11), 1707. [CrossRef]
- Mira, J. M. (2008). Symbols versus connections: 50 years of artificial intelligence. Neurocomputing, 71(4-6), 671-680. [CrossRef]
- Santhanam, S.; Shaikh, S. (2019). A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions. Cornell University. https://arxiv.org/abs/1906.00500.
- Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is All you Need. 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA.
- Jozefowicz, R.; Vinyals, O.; Schuster, M.; Shazeer, N.; Wu, Y. (2016). Exploring the Limits of Language Modeling. https://arxiv.org/abs/1602.02410.
- Prince, S. J. D. (2023). Understanding Deep Learning. http://udlbook.com.
- Berners-Lee, T.; Hendler, J.; Lassila, O. (2001). The Semantic Web: A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities. Scientific American, a division of Nature America, Inc. Vol. 284, No. 5, pp. 34-43.
- Gruber, T. R. (1993). A translation approach to portable ontology specifications. Knowledge Acquisition, 5(2), 199–220. [CrossRef]
- Zhang, Y., Wei, C., He, Z., & Yu, W. (2024). GeoGPT: An assistant for understanding and processing geospatial tasks. International Journal of Applied Earth Observation and Geoinformation, 131. Scopus. [CrossRef]
- Mooney, P., Cui, W., Guan, B., & Juhász, L. (2023). Towards Understanding the Geospatial Skills of ChatGPT Taking a Geographic Information Systems (GIS) Exam. In S. Newsam, L. Yang, G. Mai, B. Martins, D. Lunga, & S. Gao (Eds.), Maynooth University (WOS:001152316700015; pp. 85–94). [CrossRef]
- Borges, K. A. V., Davis Jr., C. A. & Laender, A. H. F. (2001). OMT-G: An Object-Oriented Data Model for Geographic Applications. GeoInformatica 5, 221–260. [CrossRef]
- International Organization for Standardization. (2013). Geographic information – Data quality (ISO Standard No. 19157:2013[2014]). https://www.iso.org/standard/32575.html.
- Zhang, Q., Zhang, T., Zhai, J., Fang, C., Yu, B., Sun, W., & Chen, Z. (2024). A Critical Review of Large Language Model on Software Engineering: An Example from ChatGPT and Automated Program Repair. arXiv. http://arxiv.org/abs/2310.08879.
- Bravo, J. V. M. (2014). A Confiabilidade Semântica das Informações Geográficas Voluntárias como Função da Organização Mental do Conhecimento Espacial. Dissertação de Mestrado. 139 p. Universidade Federal do Paraná, Programa de Pós-Graduação em Ciências Geodésicas, Curitiba (PR).
- McCarthy, J. (2007). What is Artificial Intelligence? Computer Science Department / Stanford University. http://www-formal.stanford.edu/jmc/.












| Class | Attribute | Domain |
|---|---|---|
| Building | Name | - |
| Approximate geometry | - | |
| Operational | Yes / No / Unknown | |
| Approximate height | - | |
| Touristic | Yes / No / Unknown | |
| Cultural | Yes / No / Unknown | |
| Fuel station | - | - |
| Public toilets | - | - |
| Educational Building | - | - |
| Farming, plant extraction and/or fishing Building | Building type | Apiary Aviary Barn Pigsty Farm operational headquarters Plant nursery Aquaculture nursery |
| Commerce and/or Services Building | Finality | Commercial Residential Services |
| Building type | Newsstand Bank Shopping center Convention center Exhibition center Butcher shop Pharmacy Hotel Convenience store Building materials and/or hardware store Furniture store Clothing and/or fabric store Public marketplace Motel Car Repair Other businesses Other services Inn Greengrocer Restaurant Supermarket Dealership |
|
| Mineral extraction Building | - | - |
| Healthcare Building | Level of care | Primary Secondary Tertiary |
| Housing Construction | - | - |
| Indigenous Building | Collective | - |
| Isolated | - | |
| Residential Building | - | - |
| Building or Construction of a phenomenon measurement station | - | - |
| Building or Construction of leisure | Building type | Amphitheater Public Records Library Cultural Center Documentation center Circus Acoustic concert hall Conservatory Bandstand Various cultural facilities Event and/or cultural space Film screening space Stadium Gallery Gymnasium Museum Fishing platform Theater |
| Religious Building | Christian | - |
| Teaching | - | |
| Religion type | - | |
| Building type | Mortuary chapel Center Convent Church Mosque Monastery Synagogue Temple Afro-Brazilian religious temple (‘Terreiro’) |
| Metric/ Category |
ChatGPT | DeepSeek | Gemini | ||||
|---|---|---|---|---|---|---|---|
| 4o | o1.preview | 5.0 | V3 | R1 | 2.0 | 2.5 | |
|
Unassociated classes (Omission) |
83 (95.4%) |
73 (83.9%) |
60 (69.0%) | 77 (88.5%) | 53 (60.9%) |
25 (28.7%) |
25 (28.7%) |
|
Total classes associated |
04 | 14 | 27 | 10 | 34 | 62 | 62 |
| – Appropriate associations (True Positive) | 04 (4.6%) |
14 (16.1%) |
26 (29,9%) |
09 (10.4%) |
32 (36.8%) |
62 (71.3%) |
60 (69.0%) |
| – Inappropriate associations (False Positive) | 0 | 0 | 01 (1.1%) |
01 (1.1%) |
02 (2.3%) |
0 | 02 (2.3%) |
| Model Family & Version | Generated Tags (n) | Structural Integrity (Hierarchy & Classes) | Naming Convention | Ontology Components Preservation (Props/Instances) | Mapping Logic (Association Method) |
|---|---|---|---|---|---|
| OpenAI | |||||
| ChatGPT 4o | 4 | Failed. Lost hierarchy; retained only associated classes (flat structure). | key=value | Low. Lost properties and instances. | EquivalentTo + ObjectProperty (hasOSMTag) |
| ChatGPT o1-preview | 14 | Partial. Retained associated classes but lost global hierarchy. | key_value | Medium. Retained properties; lost annotations/instances. | EquivalentTo + ObjectProperty (mapsTo...) |
| ChatGPT 5.0 | 31 | High. Preserved full hierarchy and original classes. | OSM_key_value | High. Retained annotations and properties. Instances unlinked. | EquivalentTo + ObjectProperty (associated_with) |
| DeepSeek | |||||
| DeepSeek V3 | 11 | Failed (Inverted). Created OSM superclasses containing EDGV subclasses. | key | Low. Lost all components. | SubclassOf + ObjectProperty |
| DeepSeek R1 | 34 | Failed. Treated original classes as subclasses of OSM tags. | OSM_key_value | Low. Lost all components. | EquivalentTo (direct naming association) |
| Gemini 2.0 Flash | 62 | Failed (Inverted). Similar to V3; inverted hierarchy structure. | OSM_Tags_key_value | Low. Lost all components. | SubclassOf (direct) |
| Gemini 2.5 Pro | 62 | High (Renamed). Preserved hierarchy but renamed original classes. | key_value = Class | High. Preserved annotations, properties, and instances. | Mixed: EquivalentTo and SubclassOf (Inconsistent) |
| Taxonomic Level | ChatGPT | DeepSeek | Gemini | ||||
|---|---|---|---|---|---|---|---|
| 4o | o1 | 5.0 | V3 | R1 | 2.0 | 2.5 | |
| Unassociated classes (Omission) | 83 | 73 | 60 | 77 | 53 | 25 | 25 |
| Appropriate associations (Total) | 4 | 14 | 26 | 9 | 32 | 62 | 60 |
| – Superclass level | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| – Class level | 3 | 2 | 7 | 9 | 5 | 2 | 10 |
| – Attribute level | 0 | 0 | 0 | 0 | 2 | 0 | 0 |
| – Domain level | 1 | 12 | 19 | 0 | 25 | 59 | 50 |
| Mapping Multiplicity | ChatGPT | DeepSeek | Gemini | ||||
|---|---|---|---|---|---|---|---|
| 4o | o1 | 5.0 | V3 | R1 | 2.0 | 2.5 | |
| Unassociated classes (Omission) | 83 | 73 | 60 | 77 | 53 | 25 | 25 |
| Appropriate associations (Total) | 4 | 14 | 26 | 9 | 32 | 62 | 60 |
| – 1:1 mapping (One-to-One) | 4 | 14 | 22 | 6 | 19 | 56 | 43 |
| – 1:n mapping (One-to-Many) | 0 | 0 | 4 | 1 | 0 | 0 | 0 |
| – n:1 mapping (Many-to-One) | 0 | 0 | 0 | 2 | 13 | 6 | 17 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
