| Literature DB >> 35432512 |
Mariam Ibrahim1, Ahmad Alsheikh1,2, Ruba Elhafiz1.
Abstract
Evaluating the resiliency of power systems against abnormal operational conditions is crucial for adapting effective actions in planning and operation. This paper introduces the level-of-resilience (LoR) measure to assess power system resiliency in terms of the minimum number of faults needed to produce a system outage (blackout) under sequential topology attacks. Four deep reinforcement learning (DRL)-based agents: deep Q-network (DQN), double DQN, the REINFORCE (Monte-Carlo policy gradient), and REINFORCE with baseline are used to determine the LoR. In this paper, three case studies based on IEEE 6-bus test system are investigated. The results demonstrate that the double DQN network agent achieved the highest success rate, and it was the fastest among the other agents. Thus, it can be an efficient agent for resiliency evaluation.Entities:
Mesh:
Year: 2022 PMID: 35432512 PMCID: PMC9010153 DOI: 10.1155/2022/2017366
Source DB: PubMed Journal: Comput Intell Neurosci
Recent studies on smart grid system security using RL and DRL.
| Reference | System | Method | Attack | Recovery action | Aim | Limitations |
|---|---|---|---|---|---|---|
| [ | Modified 9-bus system | Deep deterministic policy gradient (DDPG) | Multiswitch attacks and false data injection (FDI) attacks | Reclose the transmission lines lost in the cyber-attack by optimizing the reclosing time. | Reach the asynchrony in the power system by applying power blocking which will accelerate/decelerate the rotors of the generators | Owing to its continuous action space, it will not be suitable for topological resilience studies |
| [ | IEEE 9, 14 and 30-bus systems | Deep | Data integrity attacks | No recovery action | Evaluate the delay-alarm error rates, false-alarm error rates, and detect-failure rates for the systems | DQN suffers from overestimation |
| [ | IEEE 30-bus system | Deep | Coordinated cyber physical topology (CCPT) attacks | Control center can detect the line outage by using phasor measurement units (PMU) data | Investigate the coordinated topology attacks in smart grid which combine a physical topology attack and a cyber-topology attack | |
| [ | Wood & Wollenberg 6-bus system and IEEE 30-bus system |
| Sequential attacks | Automatic generation control (AGC) | Identify the minimum number of attacks/actions to reach blackout threshold |
|
| [ | IEEE 14-bus system | SARSA | False data injection (FDI), jamming, and denial of service (DoS) attacks | No recovery action | Formulation an online cyber-attack detection as a POMDP problem and propose a solution based on the model-free RL for POMDPs | |
| Our work | IEEE 6-bus system | Deep | Sequential attacks | Disconnecting the faulted transmission lines | Evaluating the resiliency of power systems against faults/attacks using DRL | Needs to investigate tabular methods such as |
Acronyms and notations used.
| Category | Items/symbols | Description |
|
| ||
| Acronyms | LoR | Level-of-resilience |
| PS | Power system | |
| DRL | Deep reinforcement learning | |
| DQN | Deep | |
| ML | Machine learning | |
| CIP | Critical infrastructure protection | |
| PV | Photovoltaic generator | |
| DDPG | Deep deterministic policy gradient | |
| FDA | False data injection | |
|
| State-action value | |
| (L–G) | Single line-to-ground fault | |
| (L–L) | Line-to-line fault | |
| (L–L–G) | Double line-to-ground | |
|
| ||
| Notations |
| Agent's policy |
| V(S) | Value function | |
| R | Reward | |
|
| Return | |
|
| The discounting factor | |
| S | State | |
| A | Action | |
|
| Probability of selecting an action | |
|
| Weights | |
|
| The value function target | |
| ∇ | Gradient | |
|
| Parameterized function with respect to | |
|
| The advantage function | |
|
| Actor policy | |
|
| Terminal state | |
|
| The learning rates | |
|
| The mode after | |
|
| A set of attack scenarios | |
|
| Number of faults/attacks | |
Figure 1Power system PS1.
Figure 3Power system PS3.
Figure 4Results of DRL agents for the case of symmetrical L–L–L–G fault scenarios.
Minimum number of faults under three-phase L–L–L–G fault scenarios.
| PS/agent | DQN | Double DQN | REINFORCE | REINFORCE with baseline |
|---|---|---|---|---|
|
| 6 | 6 | 6 | 6 |
|
| 6 | 5 | 6 | 6 |
|
| 7 | 7 | 7 | 7 |
Figure 5Results of DRL agents for the case of asymmetrical L–L–G fault scenarios.
Minimum number of faults under single-phase L–L–G fault scenarios.
| PS/agent | DQN | Double DQN | REINFORCE | REINFORCE with baseline |
|---|---|---|---|---|
|
| 8 | 7 | 10 | 8 |
|
| 7 | 7 | 8 | 7 |
|
| 10 | 10 | 11 | 10 |