Research on the Method of Extracting Domain Knowledge From the Freebase RDF Dumps
Author(s) - Deyan Chen, Hong Zhao
Publication year - 2018
Publication title - IEEE Access
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.587
H-Index - 127
ISSN - 2169-3536
DOI - 10.1109/access.2018.2868516
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
When constructing an ontology-based domain semantic knowledge base, reusing existing domain knowledge bases both facilitates sharing, integration, and reuse of the resulting knowledge base and accelerates its construction. The open and fast-growing Freebase database is a good data source that can be reused for this purpose. However, extracting domain knowledge from the Freebase Resource Description Framework (RDF) dumps faces many challenges. For example, the dump package is too large to read or load directly, it contains many unnecessary and redundant facts, and some ill-formed triples may cause loading to fail. In response to these obstacles and the deficiencies of existing research, this paper proposes a method to extract domain knowledge quickly, accurately, and completely from the Freebase RDF dumps and to describe that knowledge using the semantic constructs of standard ontology description languages. Using the extraction of the ontology schema and instance data of the medicine domain, including the facts pointing to semantically related domains, as an example, the paper explains the principle and implementation process of the method in detail and describes the algorithms of the key processes. Finally, the method is evaluated and compared with related methods in terms of work objectives, software tools used, processing results, processing performance, accuracy, completeness, and reusability.
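The three obstacles named in the abstract (a dump too large to load, redundant facts, and ill-formed triples) can be illustrated with a minimal streaming filter. This is a sketch under stated assumptions, not the authors' method: it assumes the dump is a gzip-compressed file with one tab-separated triple per line terminated by `.`, and that domain predicates share the prefix `http://rdf.freebase.com/ns/medicine.`. The function names `parse_line`, `extract_domain`, and `open_dump`, and the file name in the usage note, are hypothetical.

```python
import gzip

# Assumed Freebase namespace prefix for all identifiers in the dump.
NS = "http://rdf.freebase.com/ns/"


def parse_line(line):
    """Return (subject, predicate, object), or None if the line is ill-formed.

    Assumes one triple per line: subject, predicate, object separated by
    tabs, followed by a terminating '.'.
    """
    line = line.strip()
    if not line.endswith("."):
        return None
    parts = line[:-1].rstrip().split("\t")
    if len(parts) != 3 or not all(parts):
        return None
    subject, predicate, obj = parts
    # Subject and predicate must be IRIs in angle brackets.
    for term in (subject, predicate):
        if not (term.startswith("<") and term.endswith(">")):
            return None
    return subject, predicate, obj


def extract_domain(lines, domain="medicine"):
    """Stream triples whose predicate belongs to the given domain.

    Ill-formed lines are skipped instead of aborting the run, and exact
    duplicates are dropped. The duplicate check keeps the kept triples in
    memory, so it only suits a filtered subset of modest size.
    """
    prefix = "<" + NS + domain + "."
    seen = set()
    for line in lines:
        triple = parse_line(line)
        if triple is None:
            continue  # ill-formed triple
        if not triple[1].startswith(prefix):
            continue  # fact outside the target domain
        if triple in seen:
            continue  # redundant fact
        seen.add(triple)
        yield triple


def open_dump(path):
    """Open a gzip-compressed dump as a text stream, read line by line."""
    return gzip.open(path, "rt", encoding="utf-8", errors="replace")
```

A possible usage, with a hypothetical file name, is `for s, p, o in extract_domain(open_dump("freebase-rdf-latest.gz")): ...`. Because the file is read line by line, memory use depends on the size of the extracted subset rather than on the size of the dump. The sketch filters on predicates only; following facts that point to semantically related domains, as the paper describes, would need additional passes.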