E-mail spam filtering by a new hybrid feature selection method using IG and CNB wrapper
Author(s) -
Seyed Mostafa Pourhashemi
Publication year - 2013
Publication title -
computer engineering and applications journal
Language(s) - English
Resource type - Journals
eISSN - 2252-5459
pISSN - 2252-4274
DOI - 10.18495/comengapp.v2i3.29
Subject(s) - feature selection , computer science , naive bayes classifier , artificial intelligence , discriminative model , support vector machine , classifier (uml) , pattern recognition (psychology) , machine learning , random forest , bayes error rate , word error rate , quadratic classifier , multinomial distribution , bayes classifier , data mining , mathematics , statistics
The growing volume of spam emails has resulted in the necessity for more accurate and efficient email classification system. The purpose of this research is presenting an machine learning approach for enhancing the accuracy of automatic spam detecting and filtering and separating them from legitimate messages. In this regard, for reducing the error rate and increasing the efficiency, the hybrid architecture on feature selection has been used. Features used in these systems, are the body of text messages. Proposed system of this research has used the combination of two filtering models, Filter and Wrapper, with Information Gain (IG) filter and Complement Naive Bayes (CNB) wrapper as feature selectors. In addition, Multinomial Naive Bayes (MNB) classifier, Discriminative Multinomial Naive Bayes (DMNB) classifier, Support Vector Machine (SVM) classifier and Random Forest classifier are used for classification. Finally, the output results of this classifiers and feature selection methods are examined and the best design is selected and it is compared with another similar works by considering different parameters. The optimal accuracy of the proposed system is evaluated equal to 99%.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom