Premium
Exploiting tag similarities to discover synonyms and homonyms in folksonomies
Author(s) -
Eynard Davide,
Mazzola Luca,
Dattolo Antonina
Publication year - 2013
Publication title -
software: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.437
H-Index - 70
eISSN - 1097-024X
pISSN - 0038-0644
DOI - 10.1002/spe.2150
Subject(s) - computer science , information retrieval , exploit , metadata , folksonomy , cluster analysis , context (archaeology) , semantic similarity , graph , natural language processing , artificial intelligence , world wide web , theoretical computer science , paleontology , computer security , biology
SUMMARY Tag‐based systems are widely available, thanks to their intrinsic advantages, such as self‐organization, currency, and ease of use. Although they represent a precious source of semantic metadata, their utility is still limited. The inherent lexical ambiguities of tags strongly affect the extraction of structured knowledge and the quality of tag‐based recommendation systems. In this paper, we propose a methodology for the analysis of tag‐based systems, addressing tag synonymy and homonymy at the same time in a holistic approach: in more detail, we exploit a tripartite graph to reduce the problem of synonyms and homonyms; we apply a customized version of Tag Context Similarity to detect them, overcoming the limitations of current similarity metrics; finally, we propose the application of an overlapping clustering algorithm to detect contexts and homonymies, then evaluate its performances, and introduce a methodology for the interpretation of its results. Copyright © 2012 John Wiley & Sons, Ltd.