z-logo
Premium
Spotting review spammer groups: A cosine pattern and network based method
Author(s) -
Zhang Lu,
He Gaofeng,
Cao Jie,
Zhu Haiting,
Xu Bingfeng
Publication year - 2018
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.4686
Subject(s) - spamming , computer science , ranking (information retrieval) , cosine similarity , spotting , exploit , artificial intelligence , data mining , value (mathematics) , machine learning , pattern recognition (psychology) , information retrieval , the internet , world wide web , computer security
Summary Nowadays, online product reviews strongly influence the purchase decision of consumers in e‐commerce platforms. Driven by the immense financial profits, review spammers deliberately post fake reviews to promote or demote their target products. Some spammers are even organized as groups to work together and try to take total control of the sentiment on their target products. To detect such spammer groups, most previous works exploit frequent itemset mining (FIM) to find spammer group candidates and then use unsupervised spamicity ranking methods to identify real spammer groups. However, these methods usually suffer from the problem of threshold setting, ie, high support value finding fewer groups while low support value leading to more coincidentally generated groups and computational inefficiency. Moreover, the unsupervised methods are not able to make good use of labeled instances which are actually obtainable in practice. In this paper, we propose CONSGD, a cosine pattern and heterogeneous information network–based spammer group detecting method. Specifically, the CONSGD uses cosine pattern mining (CPM) to discover tight spammer group candidates with a respective low support value, where the cosine threshold is utilized to avoid coincidentally generated groups. Moreover, CONSGD employs heterogeneous information network classification to identify the real spammer groups, which could utilize the labeled instances and do not rely to the assumption of independent instances. Experiments on real‐life dataset show that our proposed CONSGD is effective and outperforms the state‐of‐the‐art spammer group detection methods.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here