| Literature DB >> 34204726 |
Juan Parras1, Maximilian Hüttenrauch2,3, Santiago Zazo1, Gerhard Neumann2,3.
Abstract
Recent advances in Deep Reinforcement Learning allow solving increasingly complex problems. In this work, we show how current defense mechanisms in Wireless Sensor Networks are vulnerable to attacks that use these advances. We use a Deep Reinforcement Learning attacker architecture that allows having one or more attacking agents that can learn to attack using only partial observations. Then, we subject our architecture to a test-bench consisting of two defense mechanisms against a distributed spectrum sensing attack and a backoff attack. Our simulations show that our attacker learns to exploit these systems without having a priori information about the defense mechanism used nor its concrete parameters. Since our attacker requires minimal hyper-parameter tuning, scales with the number of attackers, and learns only by interacting with the defense mechanism, it poses a significant threat to current defense procedures.Entities:
Keywords: Deep Reinforcement Learning; POMDP; SSDF attack; TRPO; backoff attack
Year: 2021 PMID: 34204726 DOI: 10.3390/s21124060
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576