z-logo
open-access-imgOpen Access
A Multistage Game in Smart Grid Security: A Reinforcement Learning Solution
Author(s) -
Zhen Ni,
Shuva Paul
Publication year - 2019
Publication title -
ieee transactions on neural networks and learning systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.882
H-Index - 212
eISSN - 2162-2388
pISSN - 2162-237X
DOI - 10.1109/tnnls.2018.2885530
Subject(s) - reinforcement learning , computer science , computer security , smart grid , grid , cyber attack , set (abstract data type) , game theory , transmission (telecommunications) , artificial intelligence , engineering , telecommunications , geometry , mathematics , microeconomics , electrical engineering , economics , programming language
Existing smart grid security research investigates different attack techniques and cascading failures from the attackers' viewpoints, while the defenders' or the operators' protection strategies are somehow neglected. Game theoretic methods are applied for the attacker-defender games in the smart grid security area. Yet, most of the existing works only use the one-shot game and do not consider the dynamic process of the electric power grid. In this paper, we propose a new solution for a multistage game (also called a dynamic game) between the attacker and the defender based on reinforcement learning to identify the optimal attack sequences given certain objectives (e.g., transmission line outages or generation loss). Different from a one-shot game, the attacker here learns a sequence of attack actions applying for the transmission lines and the defender protects a set of selected lines. After each time step, the cascading failure will be measured, and the line outage (and/or generation loss) will be used as the feedback for the attacker to generate the next action. The performance is evaluated on W&W 6-bus and IEEE 39-bus systems. A comparison between a multistage attack and a one-shot attack is conducted to show the significance of the multistage attack. Furthermore, different protection strategies are evaluated in simulation, which shows that the proposed reinforcement learning solution can identify optimal attack sequences under several attack objectives. It also indicates that attacker's learned information helps the defender to enhance the security of the system.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom