Automatic Strengthening of Graph-Structured Knowledge Bases | Zendy

Vinay K. Chaudhri | Zendy; Nikhil Dinesh | Zendy; Stijn Heymans | Zendy; Michael Wessel | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Automatic Strengthening of Graph-Structured Knowledge Bases

Author(s) -

Vinay K. Chaudhri,

Nikhil Dinesh,

Stijn Heymans,

Michael Wessel

Publication year - 2014

Publication title -

lecture notes in computer science

Language(s) - English

Resource type - Book series

SCImago Journal Rank - 0.249

H-Index - 400

eISSN - 1611-3349

pISSN - 0302-9743

DOI - 10.1007/978-3-319-04534-4_12

Subject(s) - computer science , theoretical computer science , inference , directed acyclic graph , knowledge representation and reasoning , graph , rule of inference , taxonomy (biology) , artificial intelligence , algorithm , botany , biology

We consider the problem of identifying inherited content in knowledge representation structures called concept graphs (CGraphs). A CGraph is a visual representation of a concept; in the following, CGraphs and concepts are used synonymously. A CGraph is a node- and edge-labeled directed graph. Labeled (binary) edges represent relations between nodes, which are considered instances of the concepts in their node labels. CGraphs are arranged in a taxonomy (is-a hierarchy). The taxonomy is a directed acyclic graph, as multiple inheritance is allowed. A taxonomy and set of CGraphs is called a graph-structured knowledge base (GSKB). A CGraph can inherit content from other CGraphs intuitively, if C and D are CGraphs, then C may contain content inherited from D, i.e. labeled nodes and edges \"from D\" can appear in C, if D is a direct or indirect superconcept of C, or if C contains a node being labeled with either D or some subclass of D. In both cases, C is said to refer to D. This paper contains three contributions. First, we describe and formalize the problem from a logical point of view and give a first-order semantics for CGraphs. We show that the identification of inherited content in CGraphs depends on some form of hypothetical reasoning and is thus not a purely deductive inference task, as it requires unsound reasoning. Hence, this inference is different from the standard subsumption checking problem, as known from description logics (DLs) [1]. We show that the provenance problem (from where does a logical atom in a CGraph get inherited?) strongly depends on the solution to the co-reference problem (which existentials in the first-order axiomatization of concepts as formulas denote identical domain individuals?) We demonstrate that the desired inferences can be obtained from a so-called strengthened GSKB, which is an augmented variant of the input GSKB. We present an algorithm which augments and strengthens an input GSKB, using model-theoretic notions. Secondly, we are addressing the problem from a graph-theoretic point of view, as this perspective is closer to the actual implementation. We show that we can identify inherited content by computing so-called concept coverings, which induce inherited content from superconcepts by means of graph morphisms. We argue that the algorithm solves a challenging (NP-hard) problem. Thirdly, we apply the algorithm to the large-scale biological knowledge base from the AURA project [2], and present a preliminary evaluation of its performance.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research