z-logo
Premium
Automated Geocoding of Textual Documents: A Survey of Current Approaches
Author(s) -
Melo Fernando,
Martins Bruno
Publication year - 2017
Publication title -
transactions in gis
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.721
H-Index - 63
eISSN - 1467-9671
pISSN - 1361-1682
DOI - 10.1111/tgis.12212
Subject(s) - geocoding , heuristics , computer science , discriminative model , geospatial analysis , geographic coordinate system , information retrieval , set (abstract data type) , probabilistic logic , cluster analysis , geolocation , feature (linguistics) , natural language processing , task (project management) , artificial intelligence , geography , cartography , world wide web , linguistics , philosophy , programming language , operating system , management , economics
This survey article describes previous research addressing text‐based document geocoding, i.e. the task of predicting the geospatial coordinates of latitude and longitude, that best correspond to an entire document, based on its textual contents. We describe (1) early document geocoding systems that use heuristics over place names mentioned in the text (e.g. names of cities and states), (2) probabilistic language modeling approaches, where generative models are built for different regions in the world (usually considering a discretization based on a rectangular grid) from the words occurring in a set of georeferenced training documents, which are then used to predict per‐region probabilities for previously unseen test documents, (3) combinations of different models and heuristics, including clustering procedures, feature selection approaches, and/or language models built from different sources, and (4) recent approaches based on discriminative classification models.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here