z-logo
open-access-imgOpen Access
Fingerprint-Based Near-Duplicate Document Detection with Applications to SNS Spam Detection
Author(s) -
Phuc-Tran Ho,
SungRyul Kim
Publication year - 2014
Publication title -
international journal of distributed sensor networks
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.324
H-Index - 53
eISSN - 1550-1477
pISSN - 1550-1329
DOI - 10.1155/2014/612970
Subject(s) - computer science , fingerprint (computing) , spamming , similarity (geometry) , spambot , social media , tree (set theory) , social network (sociolinguistics) , world wide web , trie , data mining , information retrieval , computer security , internet privacy , artificial intelligence , data structure , the internet , mathematical analysis , mathematics , image (mathematics) , programming language
Social networking has been used widely by millions of people over the world. It has become the most popular way for people who want to connect and interact online with their friends. Currently, there are many social networking sites, for instance, Facebook, My Space, and Twitter, with a huge number of active users. Therefore, they are also good places for spammers or cheaters who want to steal the personal information of users or advertise their products. Recently, many proposed methods are applied to detect spam comments on social networks with different techniques. In this paper, we propose a similarity-based method that combines fingerprinting technique with trie-tree data structure and meet-in-the-middle approach in order to achieve a higher accuracy in spam comments detection. Using our proposed approach, we are able to detect around 98% spam comments in our dataset.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom