A Multistage Game in Smart Grid Security: A Reinforcement Learning Solution | Zendy

Zhen Ni | Zendy; Shuva Paul | Zendy

AI Assistant Blog Pricing

Open Access

A Multistage Game in Smart Grid Security: A Reinforcement Learning Solution

Author(s) -

Zhen Ni,

Shuva Paul

Publication year - 2019

Publication title -

ieee transactions on neural networks and learning systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 2.882

H-Index - 212

eISSN - 2162-2388

pISSN - 2162-237X

DOI - 10.1109/tnnls.2018.2885530

Subject(s) - reinforcement learning , computer science , computer security , smart grid , grid , cyber attack , set (abstract data type) , game theory , transmission (telecommunications) , artificial intelligence , engineering , telecommunications , geometry , mathematics , microeconomics , electrical engineering , economics , programming language

Existing smart grid security research investigates different attack techniques and cascading failures from the attackers' viewpoints, while the defenders' or the operators' protection strategies are somehow neglected. Game theoretic methods are applied for the attacker-defender games in the smart grid security area. Yet, most of the existing works only use the one-shot game and do not consider the dynamic process of the electric power grid. In this paper, we propose a new solution for a multistage game (also called a dynamic game) between the attacker and the defender based on reinforcement learning to identify the optimal attack sequences given certain objectives (e.g., transmission line outages or generation loss). Different from a one-shot game, the attacker here learns a sequence of attack actions applying for the transmission lines and the defender protects a set of selected lines. After each time step, the cascading failure will be measured, and the line outage (and/or generation loss) will be used as the feedback for the attacker to generate the next action. The performance is evaluated on W&W 6-bus and IEEE 39-bus systems. A comparison between a multistage attack and a one-shot attack is conducted to show the significance of the multistage attack. Furthermore, different protection strategies are evaluated in simulation, which shows that the proposed reinforcement learning solution can identify optimal attack sequences under several attack objectives. It also indicates that attacker's learned information helps the defender to enhance the security of the system.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom

About

About Careers Publisher Partners Contact Us Our institutional solutions Get Organisational Trial or Quote

Learn

FAQs Blog Terms of Use Privacy Policy

Download the Zendy App

Discover

Explore

Home ZAIA Blog