Hyponym extraction from the web by bootstrapping
Author(s) - Tian Fang, Yuan Caixia, Ren Fuji
Publication year - 2012
Publication title - IEEJ Transactions on Electrical and Electronic Engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.254
H-Index - 30
eISSN - 1931-4981
pISSN - 1931-4973
DOI - 10.1002/tee.21696
Subject(s) - bootstrapping (finance) , ranking (information retrieval) , computer science , matching (statistics) , similarity (geometry) , set (abstract data type) , information retrieval , artificial intelligence , data mining , similitude , noise (video) , mathematics , statistics , image (mathematics) , econometrics , programming language
This paper proposes an effective method for automatically extracting Chinese hyponyms from the Web. Given a hypernym, the method extracts its hyponyms through weak supervision in two stages. In the first stage, a hypernym and a seed hyponym are submitted as a query to a Web search engine, and hyponyms matching a Chinese doubly anchored hyponymy pattern are extracted automatically from the Web by bootstrapping. To reduce noise during bootstrapping extraction, we propose a set of filtering rules that ensure the proper hypernym is matched in the extracted sentence. In the second stage, all extracted candidate hyponyms are ranked by an integrated ranking algorithm that takes into account both the linkage frequency between coordinate hyponyms and the semantic similarity between the hypernym and each candidate hyponym based on co-occurrence statistics. © 2011 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
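
The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: the English template "<hypernym> such as <known> and <new>" stands in for the Chinese doubly anchored hyponymy pattern, a small in-memory list of sentences stands in for Web search results, and the function names `bootstrap_hyponyms` and `rank_by_linkage` are hypothetical. The filtering rules and the co-occurrence-based semantic similarity measure are not modeled; candidates are ranked by linkage frequency alone.

```python
import re
from collections import Counter, deque


def bootstrap_hyponyms(corpus, hypernym, seed):
    """Collect candidate hyponyms by bootstrapping from one seed.

    Assumed pattern (English stand-in): "<hypernym> such as <known> and <new>".
    Both the hypernym and a known hyponym anchor the match; each newly
    found hyponym becomes the anchor of a later query.
    """
    known = {seed}
    queue = deque([seed])
    links = Counter()  # times a candidate was linked to a known hyponym

    while queue:
        anchor = queue.popleft()
        pattern = re.compile(
            rf"\b{re.escape(hypernym)} such as {re.escape(anchor)} and (\w+)"
        )
        for sentence in corpus:  # stand-in for querying a search engine
            for match in pattern.finditer(sentence):
                candidate = match.group(1)
                links[candidate] += 1
                if candidate not in known:
                    known.add(candidate)
                    queue.append(candidate)
    return links


def rank_by_linkage(links):
    """Order candidates by linkage frequency, breaking ties by name."""
    return sorted(links.items(), key=lambda item: (-item[1], item[0]))


# Toy corpus (invented for this example, not data from the paper).
corpus = [
    "fruits such as apple and banana are sweet",
    "fruits such as banana and cherry are popular",
    "fruits such as apple and cherry grow on trees",
    "brands such as apple and tesla are valuable",
]

ranking = rank_by_linkage(bootstrap_hyponyms(corpus, "fruits", "apple"))
print(ranking)
```

In this toy run the sentence about brands is ignored because the pattern is anchored on the hypernym as well as on the seed, which is the property that makes a doubly anchored pattern more selective than a pattern anchored on the seed alone.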
