Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Designing of a Geographical Entity Annotation System Using the BiLSTM+CRF+AGG Model

Version 1 : Received: 23 September 2023 / Approved: 25 September 2023 / Online: 26 September 2023 (13:59:37 CEST)

How to cite: Xu, A.; Ling, G.; Wang, C. Designing of a Geographical Entity Annotation System Using the BiLSTM+CRF+AGG Model. Preprints 2023, 2023091773. https://doi.org/10.20944/preprints202309.1773.v1 Xu, A.; Ling, G.; Wang, C. Designing of a Geographical Entity Annotation System Using the BiLSTM+CRF+AGG Model. Preprints 2023, 2023091773. https://doi.org/10.20944/preprints202309.1773.v1

Abstract

When analyzing user geospatial information through deep learning methods, it is typically necessary to annotate existing geospatial data. Currently, manual annotation methods are commonly employed, suffering from issues related to low efficiency and accuracy. This design is based on the TensorFlow deep learning framework and first realizes the BiLSTM+CRF+AGG deep learning model. Among the models, AGG is the aggregation layer, which is introduced to solve the problem of solid particle size equilibrium. Secondly, based on the characteristics of original data and professional data, an automatic labeling algorithm is proposed. The algorithm first preprocesses the acquired original data set and professional data set, The most valuable unlabeled data set which can make the training model converge quickly is selected from the original data set as the target data set. Sort the target data set based on preset rules and set annotation parameters for the sorted target data set. Based on the set annotation parameters, the corpus is synthesized and used as the annotation result. Thirdly, based on the active learning strategy, a manual annotation auxiliary scheme is proposed, and an Excel generation module that is convenient for manual annotation correction is designed and developed to further improve the efficiency and quality of annotation through iterative processing. The combination of the BiLSTM+CRF+AGG deep learning model and high-quality annotation can accurately identify non-standard, incomplete, or even incorrect geographical information entities. This annotation method has found successful practical application within the context of the research project and has been granted an invention patent.

Keywords

geospatial entities; annotation system; deep learning; BiLSTM+CRF+AGG model; active Learning; human-assisted

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.