| Literature DB >> 36015896 |
Feng Jing1,2,3, Hailin Zhang1,3, Mei Gao1,2,3, Bin Xue2,3,4, Kunrui Cao2.
Abstract
Signal detection is one of the most critical and challenging issues in ambient backscatter communication (AmBC) systems. In this paper, a multi-antenna AmBC signal detection method is proposed based on reconfigurable intelligent surface (RIS) and deep reinforcement learning. Firstly, an efficient multi-antenna AmBC system is developed based on RIS, which can achieve information transmission and energy collection simultaneously. Secondly, a smart twin delayed deep deterministic (TD3) AmBC signal detection method is presented, based on deep reinforcement learning. Extensive quantitative and qualitative experiments are performed, which show that the proposed method is more compelling than the outstanding comparison methods.Entities:
Keywords: ambient backscatter communication; deep reinforcement learning; multi-antenna; reconfigurable intelligent surface; signal detection
Mesh:
Year: 2022 PMID: 36015896 PMCID: PMC9414307 DOI: 10.3390/s22166137
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1RIS-assisted AmBC system.
Figure 2The TD3-based framework.
Simulation parameters.
| Parameter |
|
|
|---|---|---|
|
| learing rate for q_net | 0.00003 |
|
| learing rate for policy_net | 0.0003 |
|
| delayed steps for updating the policy network and target networks | 3.0 |
|
| range of action noise for exploration | 1.0 |
|
| range of action noise for evaluation of action value | 0.5 |
|
| buffer size for experience replay | 500,000 |
|
| the number of episodes | 5000 |
|
| the number of steps in each episode | 20,000 |
|
| the number of experiences in the mini-batch | 64 |
|
| discounted rate for future reward | 0.99 |
|
| Rician factor | 3 |
|
| the distance from the BS to the IR | 202 m |
|
| the distance from the RIS to the IR | 30 m |
|
| the distance from the BS to the RIS | 200 m |
Figure 3Median GRCD versus the number of antenna N at the reader with K = 4, Hidden_dim = 128.
Figure 4Convergence performance of the proposed algorithms with N = 49, Hidden_dim = 256.
Time consumed of various algorithms at N = 36 (unit: second).
| Algorithms | RIS-TD3 | RIS-DDPG | SCA | SDR |
|---|---|---|---|---|
| 1 | 0.4188 | 0.3231 | 0.6420 | 10.4801 |
| 2 | 0.3837 | 0.3349 | 0.5787 | 7.0571 |
| 3 | 0.3684 | 0.3371 | 0.4785 | 7.5617 |
| 4 | 0.3647 | 0.3241 | 0.5332 | 9.7516 |
| 5 | 0.3868 | 0.4239 | 0.4490 | 5.9844 |
| 6 | 0.382 | 0.3531 | 0.4675 | 7.6004 |
| 7 | 0.369 | 0.3291 | 0.4493 | 6.6297 |
| 8 | 0.3786 | 0.3142 | 0.4527 | 6.0317 |
| 9 | 0.3526 | 0.3331 | 0.4411 | 7.2425 |
| 10 | 0.361 | 0.3577 | 0.5159 | 7.2832 |
| Average | 0.3766 | 0.3430 | 0.5008 | 7.5622 |
Figure 5Effect of different hidden layer sizes on the algorithm performance with N = 36, .
Figure 6Effect of different exploration step lengths on the performance of the algorithm with N = 36, and Hidden_dim = 128.
Figure 7Effect of different discount rates on algorithm performance with N = 36, Explore_steps = 5000 and Hidden_dim = 128.
Figure 8Effect of different learning rates on algorithm performance; N = 36, Explore_steps = 5000, , and Hidden_dim = 128.