Performance Evaluation of Reinforcement Learning-Based Intrusion Detection Systems
Author(s) -
Imtithal A. Saeed,
Ali Selamat,
Foad Rohani,
Ondrej Krejcar
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3628626
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Reinforcement Learning (RL) offers an innovative approach to Intrusion Detection Systems (IDSs) by enabling agents to autonomously learn through dynamic interactions within network environments, without relying on pre-recorded datasets. Through RL, agents evaluate action values—rewards and penalties at each time step, aiming to converge toward optimal policies that enhance intrusion detection effectiveness. However, achieving reliable convergence in real-time environments remains uncertain, potentially impacting overall detection performance. This paper examines two prominent RL-based IDS architectures: a single-agent model and a game-theoretic multi-agent model, both employing the Deep Q-Network (DQN) algorithm, to highlight the challenges associated with convergence in real-time environments. Using OMNeT++ simulation, a semi-realistic network environment was implemented to test the performance of both architectures. Key performance metrics including accumulated Q-values, loss, epsilon decay, detection accuracy, and precision were analyzed to evaluate the two models. The findings reveal that both the single-agent and game-theoretic multi-agent architectures exhibit unstable convergence, leading to reduced detection accuracy and precision. The observed metrics highlight areas for improvement and underscore the challenges that must be addressed to achieve optimal real-time intrusion detection using reinforcement learning.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom