Weblogs as a source for extracting general world knowledge
Author(s) -
Jonathan Gordon,
Benjamin Van Durme,
Lenhart K. Schubert
Publication year - 2009
Publication title -
ur research (university of rochester)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/1597735.1597774
Subject(s) - newspaper , computer science , general knowledge , knowledge extraction , world wide web , data science , information retrieval , artificial intelligence , media studies , sociology , psychology , social psychology
Knowledge extraction (KE) efforts have often used corpora of heavily edited writing and sources written to provide the desired knowledge (e.g., newspapers or textbooks). However, the proliferation of diverse, up-to-date, unedited writing on the Web, especially in weblogs, offers new challenges for KE tools. We describe our efforts to extract general knowledge implicit in this noisy data and examine whether such sources can be an adequate substitute for resources like Wikipedia.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom