z-logo
open-access-imgOpen Access
Concept Expansion Using Web Tables
Author(s) -
Chi Wang,
Kaushik Chakrabarti,
Yeye He,
Kris Ganjam,
ZhiMin Chen,
Philip A. Bernstein
Publication year - 2015
Publication title -
proceedings of the 24th international conference on world wide web
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2736277.2741644
Subject(s) - computer science , ambiguity , pagerank , information retrieval , leverage (statistics) , ranking (information retrieval) , set (abstract data type) , table (database) , probabilistic logic , focus (optics) , data mining , theoretical computer science , data science , machine learning , artificial intelligence , programming language , physics , optics
We study the following problem: given the name of an ad-hoc concept as well as a few seed entities belonging to the concept, output all entities belonging to it. Since producing the exact set of entities is hard, we focus on returning a ranked list of entities. Previous approaches either use seed entities as the only input, or inherently require negative examples. They suffer from input ambiguity and semantic drift, or are not viable options for ad-hoc tail concepts. In this paper, we propose to leverage the millions of tables on the web for this problem. The core technical challenge is to identify the ``exclusive'' tables for a concept to prevent semantic drift; existing holistic ranking techniques like personalized PageRank are inadequate for this purpose. We develop novel probabilistic ranking methods that can model a new type of table-entity relationship. Experiments with real-life concepts show that our proposed solution is significantly more effective than applying state-of-the-art set expansion or holistic ranking techniques.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom