Preprint Article Version 1 NOT YET PEER-REVIEWED

Spatiotemporal Information Extraction from a Historic Expedition Gazetteer

  1. Department of Computer Science, University of Cape Town, Cape Town, 7700, South Africa
  2. Department of Geo-Information Processing (GIP), Faculty of Geo-information Science and Earth Observation (ITC), University of Twente, 7522 NB Enschede, The Netherlands
  3. Regional Integrity Management Systems, ROSEN Europe B.V., 7575 EJ Oldenzaal, The Netherlands
Version 1 : Received: 21 October 2016 / Approved: 22 October 2016 / Online: 22 October 2016 (10:55:08 CEST)

A peer-reviewed article of this Preprint also exists.

Bekele, M.K.; de By, R.A.; Singh, G. Spatiotemporal Information Extraction from a Historic Expedition Gazetteer. ISPRS Int. J. Geo-Inf. 2016, 5, 221.

Journal reference: ISPRS Int. J. Geo-Inf. 2016, 5, 221
DOI: 10.3390/ijgi5120221

Abstract

Historic expeditions are events with exploratory, scientific, military or geographic characteristics. Such events are often documented in literature, journey notes or personal diaries. A typical historic expedition involves multiple site visits, and the descriptions of these visits contain spatiotemporal and attributive contexts. Expeditions involve movements in space that can be represented by triplet features (location, time and description). However, such features are implicit and innate parts of textual documents, so extracting the geospatial information from these documents requires understanding the contextualized entities in the text. To this end, we developed a semi-automated framework that combines multiple Information Retrieval and Natural Language Processing components to extract the spatiotemporal information from a two-volume historic expedition gazetteer. Our framework has three basic components: the Text Preprocessor, the Gazetteer Processing Machine and the JAPE (Java Annotation Pattern Engine) Transducer. We used the Brazilian Ornithological Gazetteer as an experimental dataset, extracted the spatial and temporal entities from entries that refer to three expeditioners’ site visits, and mapped the trajectory of each expedition using the extracted information. Finally, to assess the reliability of our framework, one of the mapped trajectories was manually compared with a historical reference map of that expedition, which had been manually prepared in previous research work by others.

Subject Areas

GIR; TIR; NLP; spatiotemporal information; temporal inference
