
A Novel Framework for Aspect Knowledgebase Generated Automatically from Social Media Using Pattern Rules
Author(s) -
Tuan Anh Tran,
Jarunee Duangsuwan,
Wiphada Wettayaprasit
Publication year - 2021
Publication title -
computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.145
H-Index - 5
eISSN - 2300-7036
pISSN - 1508-2806
DOI - 10.7494/csci.2021.22.4.4028
Subject(s) - computer science , automatic summarization , pruning , sentiment analysis , bigram , social media , information retrieval , benchmark (surveying) , artificial intelligence , similarity (geometry) , natural language processing , world wide web , trigram , geodesy , agronomy , image (mathematics) , biology , geography
One of the factors improving businesses in business intelligence is summarization systems which could generate summaries based on sentiment from social media. However, these systems could not produce automatically, they used annotated datasets. To automatically produce sentiment summaries without using the annotated datasets, we propose a novel framework using pattern rules. The framework has two procedures: 1) pre-processing and 2) aspect knowledgebase generation. The first procedure is to check and correct misspelt words (bigram and unigram) by a proposed method, and tag part-of-speech all words. The second procedure is to automatically generate aspect knowledgebase used to produce sentiment summaries by the sentiment summarization systems. Pattern rules and semantic similarity-based pruning are used to automatically generate aspect knowledgebase from social media. In the experiments, eight domains from benchmark datasets of reviews are used. The performance evaluation of our proposed approach shows the high performance when compared to other approaches.