Large Scale Microblogging Intentions Analysis with Pattern Based Approach
Author(s) -
Mohamed Hamroun,
Mohamed Salah Gouider,
Lamjed Ben Saïd
Publication year - 2016
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2016.08.169
Subject(s) - computer science , microblogging , social media , process (computing) , field (mathematics) , scale (ratio) , domain (mathematical analysis) , slang , sentiment analysis , volume (thermodynamics) , world wide web , information retrieval , data science , artificial intelligence , physics , quantum mechanics , mathematical analysis , linguistics , philosophy , mathematics , pure mathematics , operating system
In recent years, social networks have become very popular. Twitter, a micro-blogging service, is estimated to have about 200 million registered users and these users create approximately 65 million tweets a day. Twitter constitutes a powerful medium today that people use to express their thoughts and intentions. The challenge is that each tweet is limited in 140 characters, and is hence very short. It may contain slang and misspelled words. Thus, it is difficult to apply traditional NLP techniques which are designed for working with formal languages, into Twitter domain. Another challenge is that the total volume of tweets is extremely high, and it takes a long time to process. In this paper, we describe a large-scale distributed system for intentions analysis process based on lexico semantic patterns using Hadoop Distributed File System (HDFS) and MapReduce functions. We conduct a case study of user intentions in the commercial field. The proposed method has stably performed data gathering and data loading. Besides, it has maintained stable load balancing of memory and CPU resources during data processing by the HDFS system. The proposed MapReduce functions have effectively performed intentions analysis in the experiments. Finally, obtained results show the importance and effectiveness of intentions detection using semantic patterns
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom