z-logo
open-access-imgOpen Access
Mining coherent anomaly collections on web data
Author(s) -
Hanbo Dai,
Feida Zhu,
EePeng Lim,
HweeHwa Pang
Publication year - 2012
Publication title -
singapore management university institutional knowledge (ink) (singapore management university)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2396761.2398472
Subject(s) - spamming , computer science , boom , disjoint sets , anomaly (physics) , identification (biology) , cluster analysis , data mining , anomaly detection , social media , information retrieval , the internet , world wide web , artificial intelligence , mathematics , engineering , physics , condensed matter physics , botany , combinatorics , environmental engineering , biology
The recent boom of weblogs and social media has attached increasing importance to the identification of suspicious users with unusual behavior, such as spammers or fraudulent reviewers. A typical spamming strategy is to employ multiple dummy accounts to collectively promote a target, be it a URL or a product. Consequently, these suspicious accounts exhibit certain coherent anomalous behavior identifiable as a collection. In this paper, we propose the concept of Coherent Anomaly Collection (CAC) to capture this kind of collections, and put forward an efficient algorithm to simultaneously find the top-K disjoint CACs together with their anomalous behavior patterns. Compared with existing approaches, our new algorithm can find disjoint anomaly collections with coherent extreme behavior without having to specify either their number or sizes. Results on real Twitter data show that our approach discovers meaningful and informative hashtag spammer groups of various sizes which are hard to detect by clustering-based methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom