| Literature DB >> 35632207 |
Mingxuan Song1, Chengyu Hu1, Wenyin Gong1, Xuesong Yan1.
Abstract
Reducing pollutant detection time based on a reasonable sensor combination is desirable. Clean drinking water is essential to life. However, the water supply network (WSN) is a vulnerable target for accidental or intentional contamination due to its extensive geographic coverage, multiple points of access, backflow, infrastructure aging, and designed sabotage. Contaminants entering WSN are one of the most dangerous events that may cause sickness or even death among people. Using sensors to monitor the water quality in real time is one of the most effective ways to minimize negative consequences on public health. However, it is a challenge to deploy a limited number of sensors in a large-scale WSN. In this study, the sensor placement problem (SPP) is modeled as a sequential decision optimization problem, then an evolutionary reinforcement learning (ERL) algorithm based on domain knowledge is proposed to solve SPP. Extensive experiments have been conducted and the results show that our proposed algorithm outperforms meta-heuristic algorithms and deep reinforcement learning (DRL).Entities:
Keywords: combinatorial optimization; domain knowledge; evolutionary reinforcement learning; sensor placement
Mesh:
Year: 2022 PMID: 35632207 PMCID: PMC9145701 DOI: 10.3390/s22103799
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Layout of the water quality sensors in a small WSN (9 nodes, 2 sources, 1 tank, 13 pipes).
Figure 2The framework of the domain knowledge-based ERL algorithm. Please note that the same serial number represents the same operation.
Figure 3The network researched in the experiment.
Figure 4Comparison of the best results of different algorithms.
Algorithm comparison.
| Algorithm | GA | DRL | ERL |
|---|---|---|---|
| Average detection time/5 min | 1195.606 | 1278.162 | 1192.254 |
Different learning configurations.
| Task Index | Domain Knowledge | Select with Probability | Multi-Threaded Search | EC Operator |
|---|---|---|---|---|
|
| No | Yes | Yes | Yes |
|
| Yes | No | Yes | Yes |
|
| Yes | Yes | No | Yes |
|
| Yes | Yes | Yes | No |
| For Comparison | Yes | Yes | Yes | Yes |
Algorithm comparison.
| Task Index | Average Detection Time/5 min | Optimal Algebra |
|---|---|---|
|
| 1237.8 | 174 |
|
| 1353.23 | 268 |
|
| 1255.95 | 416 |
|
| 1217.55 | 146 |
| For Comparison | 1192.25 | 137 |
Figure 5Domain knowledge comparison-based ERL.
Figure 6Selection strategy comparison based ERL.
Figure 7Population evolution comparison-based ERL.