| Literature DB >> 35327828 |
Fei Luo1, Shuai Zheng1, Weichao Ding1, Joel Fuentes2, Yong Li1.
Abstract
In mobile edge computing systems, the edge server placement problem is mainly tackled as a multi-objective optimization problem and solved with mixed integer programming, heuristic or meta-heuristic algorithms, etc. These methods, however, have profound defect implications such as poor scalability, local optimal solutions, and parameter tuning difficulties. To overcome these defects, we propose a novel edge server placement algorithm based on deep q-network and reinforcement learning, dubbed DQN-ESPA, which can achieve optimal placements without relying on previous placement experience. In DQN-ESPA, the edge server placement problem is modeled as a Markov decision process, which is formalized with the state space, action space and reward function, and it is subsequently solved using a reinforcement learning algorithm. Experimental results using real datasets from Shanghai Telecom show that DQN-ESPA outperforms state-of-the-art algorithms such as simulated annealing placement algorithm (SAPA), Top-K placement algorithm (TKPA), K-Means placement algorithm (KMPA), and random placement algorithm (RPA). In particular, with a comprehensive consideration of access delay and workload balance, DQN-ESPA achieves up to 13.40% and 15.54% better placement performance for 100 and 300 edge servers respectively.Entities:
Keywords: access delay; edge computing; markov decision process; reinforcement learning; workload balance
Year: 2022 PMID: 35327828 PMCID: PMC8946978 DOI: 10.3390/e24030317
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Edge computing system model.
Figure 2Diagram with location changes of an edge server.
Figure 3Deep Q-network used in DQN-ESPA.
Figure 4Results with the placement of 100 edge servers at 3000 base station locations. (a) Access delay of DQN-ESPA with the variety of iterations; (b) Workload balance of DQN-ESPA with the variety of iterations; (c) Average access delay of the compared algorithms; (d) Average workload standard deviation of the compared algorithms.
Figure 5Results with the placement of 300 edge servers at 3000 base station locations. (a) Access delay of DQN-ESPA with the variety of iterations; (b) Workload balance of DQN-ESPA with the variety of iterations; (c) Average access delay of the compared algorithms; (d) Average workload standard deviation of the compared algorithms.
Overall performance indicators.
| No. | DQN-ESPA | SAPA | KMPA | TKPA | RPA |
|---|---|---|---|---|---|
| 100 | 0.8380 | 0.8529 | 0.8896 | 0.9589 | 0.9677 |
| 300 | 0.8144 | 0.8343 | 0.8593 | 0.9389 | 0.9643 |