z-logo
Premium
Metadata Topic Harmonization and Semantic Search for Linked‐Data‐Driven Geoportals: A Case Study Using ArcGIS Online
Author(s) -
Hu Yingjie,
Janowicz Krzysztof,
Prasad Sathya,
Gao Song
Publication year - 2015
Publication title -
transactions in gis
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.721
H-Index - 63
eISSN - 1467-9671
pISSN - 1361-1682
DOI - 10.1111/tgis.12151
Subject(s) - metadata , computer science , information retrieval , geoportal , geospatial metadata , metadata modeling , linked data , world wide web , discoverability , geospatial analysis , semantic grid , semantic web , meta data services , latent dirichlet allocation , metadata repository , topic model , geography , cartography , gis and public health
Geoportals provide integrated access to geospatial resources, and enable both authorities and the general public to contribute and share data and services. An essential goal of geoportals is to facilitate the discovery of the available resources. Such a process relies heavily on the quality of metadata. While multiple metadata standards have been established, data contributers may adopt different standards when sharing their data via the same geoportal. This is especially the case for user‐generated content where various terms and topics can be introduced to describe similar datasets. While this heterogeneity provides a wealth of perspectives, it also complicates resource discovery. With the fast development of the Semantic Web technologies, there is a rise of Linked‐Data‐driven portals. Although these novel portals open up new ways to organize metadata and retrieve resources, they lack effective semantic search methods. This article addresses the two challenges discussed above, namely the topic heterogeneity brought by multiple metadata standards and the lack of established semantic search in Linked‐Data‐driven geoportals. To harmonize the metadata topics, we employ a natural language processing method, namely Labeled Latent Dirichlet Allocation (LLDA), and train it using standardized metadata from Data.gov . With respect to semantic search, we construct thematic and geographic matching features from the textual metadata descriptions, and train a regression model via a human participants experiment. We evaluate our methods by examining their performances in addressing the two issues. Finally, we implement a semantics‐enabled and Linked‐Data‐driven prototypical geoportal using a sample dataset from Esri's ArcGIS Online.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here