Extraction of Interlingual Documents Clusters Based on Closed Concepts Mining | Zendy

Mohamed Chebel | Zendy; Chiraz Latiri | Zendy; Éric Gaussier | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Extraction of Interlingual Documents Clusters Based on Closed Concepts Mining

Author(s) -

Mohamed Chebel,

Chiraz Latiri,

Éric Gaussier

Publication year - 2015

Publication title -

procedia computer science

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.334

H-Index - 76

ISSN - 1877-0509

DOI - 10.1016/j.procs.2015.08.176

Subject(s) - computer science , comparability , cluster analysis , exploit , clef , information retrieval , natural language processing , artificial intelligence , vector space model , data mining , task (project management) , mathematics , computer security , management , combinatorics , economics

To address multilingual document classification in an effcient and effective manner, we claim that a synergy between classical IR techniques such as vector model and some advanced data mining methods, especially Formal Concept Analysis, is particularly appropriate. We propose in this paper, a new statistical approach for extracting inter-language clusters from multilingual documents based on Closed Concepts Mining and vector model. Formal Concept Analysis techniques are applied to extract Closed Concepts from comparable corpora; and, then, exploit these Closed Concepts and vector models in the clustering and alignment of multilin- gual documents. An experimental evaluation is conducted on the collection of bilingual documents French-English of CLEF’2003. The results confirmed that the synergy between Formal Concept Analysis and vector model is fruitful to extract bilingual classes of documents, with an interesting comparability score

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research