| Literature DB >> 36016062 |
Armando Ordonez1, Oscar Mauricio Caicedo2, William Villota3, Angela Rodriguez-Vivas2, Nelson L S da Fonseca3.
Abstract
Reinforcement Learning (RL) comes with the promise of automating network management. However, due to its trial-and-error learning approach, model-based RL (MBRL) is not applicable in some network management scenarios. This paper explores the potential of using Automated Planning (AP) to achieve this MBRL in the functional areas of network management. In addition, a comparison of several integration strategies of AP and RL is depicted. We also describe an architecture that realizes a cognitive management control loop by combining AP and RL. Our experiments evaluate on a simulated environment evidence that the combination proposed improves model-free RL but demonstrates lower performance than Deep RL regarding the reward and convergence time metrics. Nonetheless, AP-based MBRL is useful when the prediction model needs to be understood and when the high computational complexity of Deep RL can not be used.Entities:
Keywords: automated planning; model based; network management; reinforcement learning
Mesh:
Year: 2022 PMID: 36016062 PMCID: PMC9416718 DOI: 10.3390/s22166301
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Integrating AP and RL.
Figure 2Automated Planning and Reinforcement Learning in the C-MAPE model.
Figure 3HTN Planning and Q-learning for Network Slicing Admission Control.
Figure 4Dyna framework [63].
HTN Model for Network Slicing Admission Control.
| task: optimize_edge_20 (edge,central,bw) |
| task: optimize_edge_40 (edge,central,bw) |
| task: optimize_central_20 (edge,central,bw) |
| task: optimize_edge_60 (edge,central,bw) |
Figure 5Results for different tasks.
Figure 6Convergence time—Reward vs. Episodes.
Figure 7DeepRL vs. Model-based RL.