z-logo
open-access-imgOpen Access
Automatic Aspect Extraction using Lexical Semantic Knowledge in Code-Mixed Context
Author(s) -
Kavita Asnani,
Jyoti D. Pawar
Publication year - 2017
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2017.08.146
Subject(s) - computer science , latent dirichlet allocation , natural language processing , topic model , artificial intelligence , code (set theory) , context (archaeology) , language model , probabilistic logic , information retrieval , programming language , paleontology , set (abstract data type) , biology
We study the problem of automatic extraction of aspects from code-mixed social media data in the form of topic clusters. To address the same, we present the background and propose a code-mixed probabilistic topic model. Unlike the standard Latent Dirichlet Allocation (LDA) model, it updates the distribution of words to distribution of cross-lingual sets. This results in enhancing LDA to process code-mixed data to generate topic clusters by i) improving the relevance of aspect clusters by restricting insignificant words from inclusion in the clusters and ii) encouraging inclusion of coherent words which are semantically related to each other. This becomes possible by leveraging cross-lingual semantic information from a multilingual dictionary called BabelNet. We call our proposed model as code-mixed semantic LDA (cms-LDA) model. Our results indicate that cms-LDA substantially improves the coherence of aspects in topic clusters as compared to the standard topic modeling counterparts. In our experiments we compared the performance of our model using three forms of data i) monolingual where data is written in a single language and the language is known. ii) code-mixed data with automatic language identification and monolingual cluster representations of the same.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom