Network intrusion detection: a comparative study of four classifiers using the NSL-KDD and KDD’99 datasets



Open Access


Author(s) - Ananya Devarakonda, Nilesh Kumar Sharma, Prita Saha, S Ramya

Publication year - 2022

Publication title - Journal of Physics: Conference Series

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.21

H-Index - 85

eISSN - 1742-6596

pISSN - 1742-6588

DOI - 10.1088/1742-6596/2161/1/012043

Subject(s) - computer science, random forest, intrusion detection system, data mining, preprocessor, data pre-processing, classifier (uml), autoencoder, artificial intelligence, machine learning, feature selection, artificial neural network

As most of the population gains access to the internet, protecting online identity against threats to confidentiality, integrity, and availability becomes an increasingly important problem. A network intrusion detection system (IDS) identifies anomalous network traffic and classifies suspicious activity. It is a fundamental part of network security and provides a first line of defense by alerting an administrator or other appropriate personnel to possibly malicious network activity. Several academic publications propose artificial intelligence (AI) methods for accurate network intrusion detection. This paper outlines and compares four AI methods trained on two benchmark datasets: KDD’99 and NSL-KDD. Apart from model selection, data preprocessing plays a vital role in producing accurate solutions, and we therefore propose a simple yet effective data preprocessing method. We also evaluate and compare the accuracy and performance of four popular models: decision tree (DT), multi-layer perceptron (MLP), random forest (RF), and stacked autoencoder (SAE). Of the four, the random forest classifier showed the most consistent and accurate results.
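The abstract does not describe the proposed preprocessing method, so the following is only a generic sketch of the kind of step commonly applied to KDD-style records before training a classifier: one-hot encoding of categorical fields and min-max scaling of numeric fields. It is not the authors' method. The column names `protocol_type`, `flag`, `duration`, and `src_bytes` exist in KDD’99, but the toy records, the choice of columns, and the helper names `fit_preprocessor` and `transform` are assumptions made for illustration.

```python
from typing import Dict, List, Tuple, Union

Record = Dict[str, Union[str, float]]

# Assumed column subset for illustration; the full datasets have 41 features.
CATEGORICAL = ("protocol_type", "flag")
NUMERIC = ("duration", "src_bytes")

# Hypothetical toy training records (not taken from the datasets).
train: List[Record] = [
    {"protocol_type": "tcp", "flag": "SF", "duration": 0, "src_bytes": 100},
    {"protocol_type": "udp", "flag": "S0", "duration": 10, "src_bytes": 300},
]


def fit_preprocessor(
    records: List[Record],
) -> Tuple[Dict[str, List[str]], Dict[str, Tuple[float, float]]]:
    """Learn category vocabularies and numeric (min, max) from training data only."""
    vocab = {c: sorted({str(r[c]) for r in records}) for c in CATEGORICAL}
    bounds = {
        n: (
            min(float(r[n]) for r in records),
            max(float(r[n]) for r in records),
        )
        for n in NUMERIC
    }
    return vocab, bounds


def transform(
    record: Record,
    vocab: Dict[str, List[str]],
    bounds: Dict[str, Tuple[float, float]],
) -> List[float]:
    """Turn one record into a numeric feature vector."""
    row: List[float] = []
    # One-hot encode each categorical field; unseen categories become all zeros.
    for c in CATEGORICAL:
        row.extend(1.0 if record[c] == v else 0.0 for v in vocab[c])
    # Min-max scale each numeric field into [0, 1], clipping out-of-range values.
    for n in NUMERIC:
        lo, hi = bounds[n]
        x = float(record[n])
        scaled = 0.0 if hi == lo else (x - lo) / (hi - lo)
        row.append(min(1.0, max(0.0, scaled)))
    return row


vocab, bounds = fit_preprocessor(train)
features = [transform(r, vocab, bounds) for r in train]
```

Fitting the vocabularies and bounds on the training split alone, then reusing them for test records, keeps test-set statistics from leaking into the features. The resulting vectors could be passed to any of the four model families named in the abstract.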
