MeSH-based disambiguation method using an intrinsic information content measure of semantic similarity
Author(s) -
Imen Gabsi,
Hager Kammoun,
Sarra brahmi,
Ikram Amous
Publication year - 2017
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2017.08.169
Subject(s) - computer science , automatic summarization , information retrieval , semantic similarity , search engine indexing , context (archaeology) , set (abstract data type) , natural language processing , similarity (geometry) , thesaurus , measure (data warehouse) , domain (mathematical analysis) , similarity measure , information extraction , term (time) , word (group theory) , task (project management) , artificial intelligence , semeval , data mining , image (mathematics) , paleontology , mathematical analysis , linguistics , philosophy , physics , mathematics , management , quantum mechanics , economics , biology , programming language
Word Sense Disambiguation represents a crucial task in many natural language processing applications such as information extraction, information retrieval and text summarization. It intends to identify the correct sense of an ambiguous term (target word) according to its context. In this paper, we present a biomedical knowledge-based disambiguation method for determining the adequate domain or sub-domain (sense) of an ambiguous biomedical term. This method uses the MeSH thesaurus as a knowledge resource and an intrinsic information content-based semantic similarity to measure the likeness between their different senses of the ambiguous term MeSH and its context. We define this latter as the set of unambiguous MeSH terms characterizing a document. We aim on one hand to minimize the time and computational complexity and on the other hand to increase the accuracy of the disambiguation by using such context. To evaluate this method, we involved it into a document indexing and retrieval system. The results of experiments, carried out on a sub-set of the OHSUMED collection, in terms of Mean Average Precision show that our method performs well.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom