CN-DBpedia2: An Extraction and Verification Framework for Enriching Chinese Encyclopedia Knowledge Base
Author(s) -
Bo Xu,
Jiaqing Liang,
Chenhao Xie,
Bin Liang,
Lihan Chen,
Yanghua Xiao
Publication year - 2019
Publication title -
data intelligence
Language(s) - English
Resource type - Journals
eISSN - 2096-7004
pISSN - 2641-435X
DOI - 10.1162/dint_a_00017
Subject(s) - knowledge base , computer science , encyclopedia , construct (python library) , knowledge extraction , question answering , information retrieval , entity linking , knowledge based systems , information extraction , base (topology) , artificial intelligence , programming language , mathematics , mathematical analysis , library science
Knowledge base plays an important role in machine understanding and has been widely used in various applications, such as search engine, recommendation system and question answering. However, most knowledge bases are incomplete, which can cause many downstream applications to perform poorly because they cannot find the corresponding facts in the knowledge bases. In this paper, we propose an extraction and verification framework to enrich the knowledge bases. Specifically, based on the existing knowledge base, we first extract new facts from the description texts of entities. But not all newly-formed facts can be added directly to the knowledge base because the errors might be involved by the extraction. Then we propose a novel crowd-sourcing based verification step to verify the candidate facts. Finally, we apply this framework to the existing knowledge base CN-DBpedia and construct a new version of knowledge base CN-DBpedia2, which additionally contains the high confidence facts extracted from the description texts of entities.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom