z-logo
open-access-imgOpen Access
Junk Filtering through Naive Bayesian Algorithm
Author(s) -
Anushka Srivastava
Publication year - 2021
Publication title -
international journal for research in applied science and engineering technology
Language(s) - English
Resource type - Journals
ISSN - 2321-9653
DOI - 10.22214/ijraset.2021.36801
Subject(s) - naive bayes classifier , computer science , machine learning , algorithm , pace , the internet , artificial intelligence , precision and recall , filter (signal processing) , classifier (uml) , statistical classification , data mining , support vector machine , world wide web , geodesy , computer vision , geography
As the world is seamlessly developing at a very high pace, we have been seeing enormous growth in various sectors of Technology. Networking has played a crucial part in the exchange of technological culture around the globe, and the Internet being the sole medium of Network enhancement has taken over every aspect of our society. Today, most of the professional communications are done through emailing. As far as email has proven to be an efficient, professional and easy way of communication, it also comes with the disadvantage of unwanted bulk bombarding of spam content. This has been a critical concern for email users. Consequently, it has become very difficult for spam filters to efficiently filter the unwanted emails, since nowadays emails are written in such a manner that any existing algorithm cannot give 100% accuracy in predicting spam. This paper deals with Naive Bayesian Classifier that is a Machine Learning algorithm for antispam filtering, which gives satisfactory results by automatically constructing anti-spam filters with extended conduct. The review over the researched performance of Naive Bayes algorithm is done by the investigations of Spam ham csv datasets. The performance of the algorithm is evaluated based on the accuracy, recall and precision it shows on the mentioned datasets. This technique gives 96-97% accuracy and 89% precision on the investigated dataset. The result also highlights that the content of the email and the number of instances of the dataset has an apparent effect on the performance of the algorithm.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here