z-logo
open-access-imgOpen Access
Filtering Chinese Image Spam Using Pseudo‐OCR
Author(s) -
Xu Bin,
Li Ruiguang,
Liu Yashu,
Yan Hanbing,
Li Siyuan,
Zhang Honggang
Publication year - 2015
Publication title -
chinese journal of electronics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.267
H-Index - 25
eISSN - 2075-5597
pISSN - 1022-4653
DOI - 10.1049/cje.2015.01.022
Subject(s) - computer science , optical character recognition , artificial intelligence , character (mathematics) , image (mathematics) , key (lock) , feature (linguistics) , pattern recognition (psychology) , point (geometry) , chinese characters , computer vision , mathematics , computer security , linguistics , philosophy , geometry
For image spam filtering, the Optical character recognition(OCR) based methods often achieve a better performance due to the more complex structure of recognizing corresponding text. However, applying traditional OCR techniques usually introduced shortcomings like the expensive computational cost, vulnerability to image noises and artificial interferences, especially for Chinese image spam filtering. So, by optimizing recognition procedure of traditional OCR, we propose the idea of pseudo‐OCR more suitable for Chinese image spam filtering. During which discriminating the potential image spam character features from ham ones is sufficient, instead of recognizing them. What's more, a novel Chinese key‐point based character feature specific for pseudo‐OCR is also devised and extracted using a carefully designed algorithm, which outperforms classic corner detection methods in finding such key‐points. Experiment results show that our proposed system usually has a better performance than traditional OCR based method while maintaining a low false positive rate.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here