Submitted:
15 April 2024
Posted:
15 April 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methods
2.1. Recognition of Toponyms Derived from Proper Noun
2.1.1. Extraction of Proper Noun Derivatives
2.1.2. Extraction of Proper Noun Derivatives
2.2. Discrimination of Derivative Relation of Proper Noun
2.2.1. Feature Construction of Toponym Derivation
2.2.2. Spatial Topological Relations
2.2.3. Spatial Metric Relation
2.2.4. Place Name Lifetime
2.2.5. Patterns of Toponym Derivation
2.2.6. Morphological Topological Relations of Toponyms
2.3. Construction of the Supervised Dataset
- 1)
- Spatial topology rules. In the proper noun derivation relationship, the spatial topological feature values between the proper noun-derived place name entity and the original place name entity should be included in the spatial topological feature sequence of the proper noun derivation relationship in the derivation mode. In this paper, the statistical method is used to first count the frequency of the spatial topological relationship values between the place name entities in each derivation mode from the set of place names derived from the proper name, and then the mean value of each spatial topological relationship value is taken as the threshold value, and the spatial topological relationship value whose frequency is greater than the threshold value is added to the spatial topological feature sequence of the derived mode; when the spatial topological relationship values between the proper name-derived place name entity and the native place name entity are included in the spatial topological sequence under the derivation mode, the pair of place name entities satisfies the spatial topological rule.
- 2)
- Spatial measurement rules. In the toponym derivation relationship, there is a limit to the distance between the derived place name entity and the original place name entity, which is called the derivation distance of the original place name. The distance between the derived place name entity and the original place name entity should be smaller than the derivation distance of the original place name entity. In this paper, the method of Liu Hanyou et al [9] is used to estimate the derivation distance.
- 3)
- 4)
- Toponymic derivation mode rules. For the of derived toponyms and native toponyms in the toponym derivation relationship, this paper adopts the method of combining artificial prior knowledge to manually select the set of place name derivation modes that conform to the common sense of place name derivation from the place name category sequence of the set of derived toponyms. In the special noun derived toponyms dataset, the composed of the special noun derived place name category and the original place name category should be included in the artificially defined derivation mode set D.
- 5)
- Topological rules of toponyms. In the toponym derivation relationship, there is a strong similarity between the derived place name and the toponym part of the original place name. Therefore, the word morphology of the derived place name and the original place name should have a topological intersection relationship. In this paper, the character sequence of the place name word is taken as the morphological feature of the place name, and whether the two toponyms contain the same or similar words is used as the criterion for judging whether there is a topological intersection relationship between the two toponyms.
- 6)
- Construction rules for positive and negative samples. When both place name entities satisfy the spatial topology rules, spatial metric rules, place name survival rules, place name derivation mode rules, and place name morphological topology rules, then there is a toponym derivation relationship between the two place name entities, and the two place name entities are positive samples; conversely, the two toponyms are negative samples.
3. Results and Discussion
3.1. Recognition of Toponyms Derived from Proper Noun
3.2. Discrimination of Toponym Derivation
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Yuan, S.; Yang, D.; Liu, J.; Tian, S.; Liang, J.; Xiao, Y.; Xie, R. Causality-aware Concept Extraction based on Knowledge-guided Prompting. arxiv 2023, arXiv:2305.01876. [Google Scholar]
- He, Q.; Chen, G.; Song, W.; Zhang, P. Prompt-Based Word-Level Information Injection BERT for Chinese Named Entity Recognition. Applied Sciences 2023, 13. [Google Scholar] [CrossRef]
- Rozhkov, I.S.; Loukachevitch, N.V. Prompts in Few-Shot Named Entity Recognition. Pattern Recognition and Image Analysis 2023, 33, 122–131. [Google Scholar] [CrossRef]
- Ashok, D.; Lipton, Z.C. PromptNER: Prompting For Named Entity Recognition. arxiv 2023, arXiv:2305.15444. [Google Scholar]
- Chen, P.; Wang, J.; Lin, H.; Zhao, D.; Yang, Z. Few-shot biomedical named entity recognition via knowledge-guided instance generation and prompt contrastive learning. Bioinformatics 2023, 39, btad496. [Google Scholar] [CrossRef] [PubMed]
- Cheng, H. Derived Place Names and Their Translation Methods. Chinese scientific and technological terms 2016, 18, 5–8. [Google Scholar] [CrossRef]
- Ruwei, T. Native place names and derived place names. Chinese place name 2018, 30. [Google Scholar]
- Cosimo, P. THE CITIES OF THE AMERICAS IN MODERN TIMES: A CASE STUDY ON TOPONYMY. GEOGRAPHY 2016, 9, 28–46. [Google Scholar] [CrossRef]
- Hanyou, L.; Jizhou, W.; Xi, M.; Weijun, M. Mining of Complete Derived Place Names and Generic Derived Place Names. Surveying and mapping science 2022, 47, 176–181. [Google Scholar] [CrossRef]
- Hanyou, L. Automatic Recognition and Translation of English Derived Place Names for Global Mapping. Master’s Thesis, Liaoning Technical University, 2022. [Google Scholar] [CrossRef]
- Shi, P.; Lin, J. Simple BERT Models for Relation Extraction and Semantic Role Labeling. arXiv 2019, arXiv:1904.05255. [Google Scholar]
- Zhao, L.; Li, L.; Zheng, X. A BERT based Sentiment Analysis and Key Entity Detection Approach for Online Financial Texts. arXiv 2020, arXiv:2001.05326. [Google Scholar]
- Zhandong, W.; Shuqin, L.; Sheng, L.; Xinzhi, S. Multi-Class Disturbance Events Recognition Based on EMD and XGBoost in ϕ-OTDR. IEEE ACCESS 2020, 8. [Google Scholar] [CrossRef]
- Li, S.; Zhang, X. Research on orthopedic auxiliary classification and prediction model based on XGBoost algorithm. Neural Computing and Applications 2020, 32, 1971–1979. [Google Scholar] [CrossRef]
- Axing, Z.; Guinian, L.; Chenghu, Z.; Chengzhi, Q. Geographic Similarity: Third Law of Geography? Journal of Geoinformation Science 2020, 22, 673–679. [Google Scholar] [CrossRef]
- Li-li, S. A study on the linguistic dimensions of mountain names in China and America. Journal of Shangluo University 2021, 35, 48–54. [Google Scholar] [CrossRef]
- Jian, C.; Bin, Z.; Rupeng, L. Place Name: The "Locator" of Map Spatial Cognition. Geographic Information World 2012, 10, 6–9+13. [Google Scholar]
- Wu Jianxin, Fang Yu, C.B. Research status and development of topological spatial relation description theory. Geography and Geographic Information Science 2005, 1–4.
- Tobler, W.R. A Computer Movie Simulating Urban Growth in the Detroit Region. Economic Geography 1970, 46, 234–240. [Google Scholar] [CrossRef]
- Xingguang, W.; Ruijie, Z.; Yi, Z. Research on place name disambiguation method based on geographical association degree and evidence theory. Journal of Peking University (Natural Science Edition) 2017, 53, 344–352. [Google Scholar] [CrossRef]
- Jiangtao, B.; Wei, P.; Yongjian, H.; Yongqiang, Z.; Huihui, Y. Research on the Construction of Historical Place Name Comprehensive Information System Based on TGIS and Big Data Technology. Journal of Global Change Data 2021, 5, 363–372+520–529. [Google Scholar]
- Hui, D. Spatio-temporal attributes and cultural values of Chinese place names. Civil Administration of China 2017, 55. [Google Scholar]






| Input | The category of toponym is X. | |
| The toponym is Y. | ||
| The proper noun derivatives of toponym is [MASK]. | ||
| Example Type | Postive | Negtive |
| Input[X] | Lake Grove Club | Slichter Residence Hall |
| Input[Y] | Residential | Building |
| Output | Lake | None |
| Answer Map | Lake | Nothing |
| Model Name | Accuracy | Precision | Recall | F1 |
|---|---|---|---|---|
| 99.32% | 99.75% | 99.32% | 99.43% | |
| 94.98% | 97.34% | 94.98% | 95.63% | |
| 97.90% | 98.72% | 97.90% | 98.15% | |
| 91.54% | 92.58% | 91.55% | 91.30% | |
| 98.79% | 99.14% | 98.79% | 98.85% | |
| 98.36% | 97.94% | 98.36% | 98.10% |
| Model Name | Homogeneity | Completeness | V-measure | Adjusted Mutual Information | |
|---|---|---|---|---|---|
| DBSCAN | 84.19% | 82.17% | 83.17% | 82.94% | |
| GaussianMixture | 84.37% | 81.08% | 82.69% | 82.47% | |
| Kmeans | 81.50% | 77.52% | 79.46% | 79.20% | |
| Kmeans++ | 81.68% | 78.25% | 79.93% | 79.67% | |
| Rule Type | Rule |
|---|---|
| Positive example | 1)The spatial distance between geographical entities whose derivation mode is should be less than 2554761.8275.2)The distance between geographical entities whose derivation mode is or should be equal to 0. |
| 2)The spatial topological relation sequence of the original geographical names of County and State derived from their proper names is [Contains, Crosses].2)The spatial topological relationship sequence between the original place names of River and their derived names is [Disjoint]. | |
| 3)The difference between the beginning time and the end time of the survival period of the original place name and the derived place name is greater than 0. | |
| 4)Topological morphology relationship of place names of original and derived place names is Intersection. | |
| 5)The artificially defined specific name derivation modes are . | |
| Negative example | If any of the positive example rules is not met, it is a negative example. |
| Model Name | Accuracy | Precision | Recall | F1 |
|---|---|---|---|---|
| Xgboost | 99.32% | 99.33% | 99.11% | 99.55% |
| RandomForest | 98.07% | 98.00% | 98.21% | 98.10% |
| GaussianNB | 81.34% | 84.30% | 77.90% | 80.97% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).