Open Access
Intrusion Detection System on KDD’99 Dataset with Imbalanced Classes
Author(s) -
Anupam Agrawal
Publication year - 2021
Publication title -
international journal of engineering and advanced technology
Language(s) - English
Resource type - Journals
ISSN - 2249-8958
DOI - 10.35940/ijeat.b3238.1211221
Subject(s) - computer science , naive bayes classifier , artificial intelligence , preprocessor , pattern recognition (psychology) , precision and recall , machine learning , intrusion detection system , classifier (uml) , principal component analysis , data mining , data pre processing , support vector machine
The paper describes a method of intrusion detection that keeps check of it with help of machine learning algorithms. The experiments have been conducted over KDD’99 cup dataset, which is an imbalanced dataset, cause of which recall of some classes coming drastically low as there were not enough instances of it in there. For Preprocessing of dataset One Hot Encoding and Label Encoding to make it machine readable. The dimensionality of dataset has been reduced using Principal Component Analysis and classification of dataset into classes viz. attack and normal is done by Naïve Bayes Classifier. Due to imbalanced nature, shift of focus was on recall and overall recall and compared with other models which have achieved great accuracy. Based on the results, using a self optimizing loop, model has achieved better geometric mean accuracy.