Premium
Improving taxonomic relation learning via incorporating relation descriptions into word embeddings
Author(s) -
Huang Subin,
Luo Xiangfeng,
Huang Jing,
Wang Hao,
Gu Shengwei,
Guo Yike
Publication year - 2020
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.5696
Subject(s) - natural language processing , word embedding , artificial intelligence , computer science , relationship extraction , inference , similarity (geometry) , semantic relation , relation (database) , semantic similarity , embedding , word (group theory) , taxonomy (biology) , machine learning , information extraction , linguistics , data mining , ecology , biology , philosophy , image (mathematics) , cognition , neuroscience
Summary Taxonomic relations play an important role in various Natural Language Processing (NLP) tasks (eg, information extraction, question answering and knowledge inference). Existing approaches on embedding‐based taxonomic relation learning mainly rely on the word embeddings trained using co‐occurrence‐based similarity learning. However, the performance of these approaches is not quite satisfactory due to the lack of sufficient taxonomic semantic knowledge within word embeddings. To solve this problem, we propose an improved embedding‐based approach to learn taxonomic relations via incorporating relation descriptions into word embeddings. First, to capture additional taxonomic semantic knowledge, we train special word embeddings using not only co‐occurrence information of words but also relation descriptions (eg, taxonomic seed relations and their contextual triples). Then, using the trained word embeddings as features, we employ two learning models to identify and predict taxonomic relations, namely, offset‐based classification model and offset‐based similarity model. Experimental results on four real‐world domain datasets demonstrate that our proposed approach can capture additional taxonomic semantic knowledge and reduce dependence on the training dataset, outperforming the state‐of‐the‐art compared approaches on the taxonomic relation learning task.