| Literature DB >> 32326250 |
Hyebin Park1, Yujin Lim1.
Abstract
Increased data traffic resulting from the increase in the deployment of connected vehicles has become relevant in vehicular social networks (VSNs). To provide efficient communication between connected vehicles, researchers have studied device-to-device (D2D) communication. D2D communication not only reduces the energy consumption and loads of the system but also increases the system capacity by reusing cellular resources. However, D2D communication is highly affected by interference and therefore requires interference-management techniques, such as mode selection and power control. To make an optimal mode selection and power control, it is necessary to apply reinforcement learning that considers a variety of factors. In this paper, we propose a reinforcement-learning technique for energy optimization with fifth-generation communication in VSNs. To achieve energy optimization, we use centralized Q-learning in the system and distributed Q-learning in the vehicles. The proposed algorithm learns to maximize the energy efficiency of the system by adjusting the minimum signal-to-interference plus noise ratio to guarantee the outage probability. Simulations were performed to compare the performance of the proposed algorithm with that of the existing mode-selection and power-control algorithms. The proposed algorithm performed the best in terms of system energy efficiency and achievable data rate.Entities:
Keywords: 5G; D2D communication; mode selection; power control; vehicle-to-vehicle communication; vehicular social network
Year: 2020 PMID: 32326250 PMCID: PMC7219230 DOI: 10.3390/s20082361
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1System model.
Figure 2Operation procedure.
Parameters used in the simulation.
| Parameter | Notation | Value |
|---|---|---|
| Noise power spectral density |
|
|
| Total bandwidth |
|
|
| SINR constraint |
|
|
| Maximum outage probability constraint |
|
|
| Minimum outage probability constraint |
|
|
| Circuit power of RRH |
|
|
| Slope of RRH |
|
|
| Circuit power of fronthaul transceiver and switch |
|
|
| Power consumption per bit/s |
|
|
| Transmission power of cellular device |
|
|
| Transmission power of RRH |
|
|
Figure 3Comparison of system energy efficiencies with various device-to-device (D2D) distances and system loads: (a) with light traffic; (b) with light traffic; (c) with heavy traffic; and (d) with heavy traffic.
Figure 4Comparison of average achievable data rate with various D2D distances and system loads: (a) with light traffic; (b) with light traffic; (c) 250 m with heavy traffic; and (d) with heavy traffic.
Figure 5Comparison of signal-to-interference noise ratio (SINR) with various D2D distances and system loads: (a) with light traffic; (b) with light traffic; (c) with heavy traffic; and (d) with heavy traffic.
Figure 6Convergence of system energy efficiency as a reward function with D2D distance = 350 m: (a) light traffic scenario; (b) heavy traffic scenario.
Average system energy efficiencies with system load variance,
| System Load (%) | 40 | 50 | 60 | 70 | 80 |
|---|---|---|---|---|---|
| Optimal | 4.8414 | 5.2831 | 4.9594 | 5.4548 | 5.5733 |
| BA | 1.9857 | 1.0919 | 0.8479 | 0.7086 | 0.6553 |
| Proposed | 3.8026 | 2.5863 | 1.9223 | 1.5863 | 1.4909 |
| Compare1 | 3.2371 | 1.9767 | 1.5212 | 1.2576 | 1.1491 |
| Compare2 | 1.9429 | 1.0723 | 0.8184 | 0.6992 | 0.6376 |
Average achievable data rate with system load variance,
| System Load (%) | 40 | 50 | 60 | 70 | 80 |
|---|---|---|---|---|---|
| Optimal | 183.1308 | 184.869 | 181.0795 | 180.4453 | 179.5396 |
| BA | 130.6982 | 124.3168 | 123.1323 | 122.5992 | 119.5717 |
| Proposed | 168.763 | 158.1163 | 138.6988 | 131.8081 | 123.8312 |
| Compare1 | 115.4847 | 98.18192 | 88.934 | 81.2835 | 72.47508 |
| Compare2 | 126.4065 | 122.9883 | 120.3971 | 119.4175 | 117.3408 |