Constructing and Cleaning Identity Graphs in the LOD Cloud | Zendy

Joe Raad | Zendy; Wouter van Beek | Zendy; Frank van Harmelen | Zendy; Jan Wielemaker | Zendy; Nathalie Pernelle | Zendy; Fatiha Saïs | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Constructing and Cleaning Identity Graphs in the LOD Cloud

Author(s) -

Joe Raad,

Wouter van Beek,

Frank van Harmelen,

Jan Wielemaker,

Nathalie Pernelle,

Fatiha Saïs

Publication year - 2020

Publication title -

data intelligence

Language(s) - English

Resource type - Journals

eISSN - 2096-7004

pISSN - 2641-435X

DOI - 10.1162/dint_a_00057

Subject(s) - computer science , semantic web , linked data , identity (music) , graph , statement (logic) , world wide web , owl s , information retrieval , social semantic web , theoretical computer science , linguistics , physics , philosophy , acoustics

In the absence of a central naming authority on the Semantic Web, it is common for different data sets to refer to the same thing by different names. Whenever multiple names are used to denote the same thing, owl:sameAs statements are needed in order to link the data and foster reuse. Studies that date back as far as 2009, observed that the owl:sameAs property is sometimes used incorrectly. In our previous work, we presented an identity graph containing over 500 million explicit and 35 billion implied owl:sameAs statements, and presented a scalable approach for automatically calculating an error degree for each identity statement. In this paper, we generate subgraphs of the overall identity graph that correspond to certain error degrees. We show that even though the Semantic Web contains many erroneous owl:sameAs statements, it is still possible to use Semantic Web data while at the same time minimising the adverse effects of misusing owl:sameAs.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research