z-logo
open-access-imgOpen Access
An Approach of Semantic Similarity Measure between Documents Based on Big Data
Author(s) -
Mohammed Erritali,
Abderrahim Beni-Hssane,
Marouane Birjali,
Youness Madani
Publication year - 2016
Publication title -
international journal of electrical and computer engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.277
H-Index - 22
ISSN - 2088-8708
DOI - 10.11591/ijece.v6i5.pp2454-2461
Subject(s) - computer science , wordnet , semantic similarity , search engine indexing , similarity (geometry) , information retrieval , semantic computing , big data , measure (data warehouse) , similarity measure , data mining , artificial intelligence , semantic web , image (mathematics)
Semantic indexing and document similarity is an important information retrieval system problem in Big Data with broad applications. In this paper, we investigate MapReduce programming model as a specific framework for managing distributed processing in a large of amount documents. Then we study the state of the art of different approaches for computing the similarity of documents. Finally, we propose our approach of semantic similarity measures using WordNet as an external network semantic resource. For evaluation, we compare the proposed approach with other approaches previously presented by using our new MapReduce algorithm. Experimental results review that our proposed approach outperforms the state of the art ones on running time performance and increases the measurement of semantic similarity.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here