| Literature DB >> 35626619 |
Wei Qi1, Hao Sun2, Lichen Yu3, Shuo Xiao2, Haifeng Jiang2.
Abstract
When an unmanned aerial vehicle (UAV) performs tasks such as power patrol inspection, water quality detection, field scientific observation, etc., due to the limitations of the computing capacity and battery power, it cannot complete the tasks efficiently. Therefore, an effective method is to deploy edge servers near the UAV. The UAV can offload some of the computationally intensive and real-time tasks to edge servers. In this paper, a mobile edge computing offloading strategy based on reinforcement learning is proposed. Firstly, the Stackelberg game model is introduced to model the UAV and edge nodes in the network, and the utility function is used to calculate the maximization of offloading revenue. Secondly, as the problem is a mixed-integer non-linear programming (MINLP) problem, we introduce the multi-agent deep deterministic policy gradient (MADDPG) to solve it. Finally, the effects of the number of UAVs and the summation of computing resources on the total revenue of the UAVs were simulated through simulation experiments. The experimental results show that compared with other algorithms, the algorithm proposed in this paper can more effectively improve the total benefit of UAVs.Entities:
Keywords: mobile edge computing; stackelberg game; unmanned aerial vehicle
Year: 2022 PMID: 35626619 PMCID: PMC9141349 DOI: 10.3390/e24050736
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1Task offloading system model in UAV network.
Figure 2Two-stage Stackelberg game model.
Figure 3Interaction process between agent and environment.
Experimental parameters of MADDPG task offloading algorithm.
| Parameter | Definition | Setting |
|---|---|---|
|
| Length of scene | 100 m |
|
| Power of communication | 20 W |
|
| Bandwidth | 100 MHz |
|
| Computing power of UAV | 1.5 GHz |
|
| Computing power edge node | 10 GHz |
|
| Power of Gaussian white noise | 2.5 × 10 −13 w |
|
| Fading factor of transmission channel | 4 |
|
| Volume of task data | 100~150 MB |
|
| Latest completion time of task | 5~20 s |
|
| Coefficient of CPU energy structure |
|
|
| Expenditure-sensitive factor | 0~0.5 |
|
| Time-sensitive factor | 0~0.5 |
Figure 4Curve of system utility and number of iterations.
Figure 5Curve of success rate of task.
Figure 6Average delay curve of different algorithms.
Figure 7Average energy consumption curve of different algorithms.
Figure 8Average utility curve of system with different algorithms.
Figure 9Average utility curve of UAV with different algorithms.