| Literature DB >> 36162062 |
He Du1, Bing Hao1, Jianshuo Zhao1, Jiamin Zhang1, Qi Wang1, Qi Yuan2.
Abstract
Path planning is a major challenging problem for mobile robots, as the robot is required to reach the target position from the starting position while simultaneously avoiding conflicts with obstacles. This paper refers to a novel method as short and safe Q-learning to alleviate the short and safe path planning task of mobile robots. To solve the slow convergence of Q-learning, the artificial potential field is utilized to avoid random exploration and provides a priori knowledge of the environment for mobile robots. Furthermore, to speed up the convergence of the Q-learning and reduce the computing time, a dynamic reward is proposed to facilitate the mobile robot towards the target point. The experiments are divided into two parts: short and safe path planning. The mobile robot can reach the target with the optimal path length in short path planning, and away from obstacles in safe path planning. Experiments compared with the state-of-the-art algorithm demonstrate the effectiveness and practicality of the proposed approach. Concluded, the path length, computing time and turning angle of SSQL is increased by 2.83%, 23.98% and 7.98% in short path planning, 3.64%, 23.42% and 12.61% in safe path planning compared with classical Q-learning. Furthermore, the SSQL outperforms other optimization algorithms with shorter path length and smaller turning angles.Entities:
Mesh:
Year: 2022 PMID: 36162062 PMCID: PMC9512417 DOI: 10.1371/journal.pone.0275100
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Path planning problem formulation: (a) Schematic diagram of MR; (b) Direction of motion when there is an obstacle around the MR; (c) Short path length condition; (d) Safe path length condition. (Circle for MR; white grid for Sfree; black grid for Sobs; solid black line is the path planned by SSQL).
Fig 2The number of iterations that path length converges to the optimum.
Parameters setting of DFQL, CQL, PSO, GWO, DA, and MFO.
| Algorithms | Parameters Selection |
|
|
|---|---|---|---|
| DFQL | 100 | — | |
| CQL | — | ||
| PSO | 30 | ||
|
| |||
| GWO | |||
| DA |
| ||
|
| |||
|
| |||
|
| |||
|
| |||
| MFO |
where, Max is the maximum number of iterations, Pop is the population size; PSO: c1 and c2 are the learning factor, ω is the linearly decreasing weight (LDW) (Tian et al., 2018), ωmax is the maximum inertia weight, ωmin is the minimum inertia weight; GWO: a is a constant, the initial value is 2, and decreases linearly from 2 to 0 with the iteration of the algorithm, C is a random number between 0 and 2; DA: a is the alignment weight, c is the cohesion weight, e is the natural enemy weight factor, f is the prey weight factor, r is the neighborhood radius, and s is the separation weight; MFO: t is the path coefficient, b is the logarithmic spiral shape constant; rand() denotes a random number between [0,1], and Iter denotes the current iteration number.
Fig 3Short path planning (solution) for MR in different test environments (Each map shows the best path obtained by the SSQL algorithm, S is the starting point and T is the target point).
A comparison between SSQL, CQL, PSO, GWO, DA and MFO for short path planning.
The lowest (best) both mean, standard deviation, and the values of path length bigger than the level 0.05 of significance are highlighted.
| Env | Statistics | SSQL | CQL | PSO | GWO | DA | MFO | |
|---|---|---|---|---|---|---|---|---|
| M01 | Path Length | Mean |
| 290.18 | 346.61 | 274.56 | 353.05 | 316.54 |
| Std. Dev. |
| 38.59 | 12.88 | 5.96 | 16.32 | 27.86 | ||
| - | 6.99e-3 | 5.54e-26 | 2.57e-4 | 2.21e-23 | 3.74e-10 | |||
| Angle | Mean |
| 0.43 | 0.63 | 0.54 | 0.75 | 0.66 | |
| Std. Dev. | 0.26 | 0.44 | 0.19 | 0.42 | 0.28 |
| ||
| Time | Mean | 5.92 | 6.54 |
| 2.92 | 3.67 | 2.77 | |
| M02 | Path Length | Mean | 310.49 | 314.39 | 317.52 |
| 315.56 | 313.81 |
| Std. Dev. | 2.97 | 6.36 | 11.13 |
| 9.36 | 10.45 | ||
| - | 2.15e-3 |
| 1.52e-6 | 2.92e-7 |
| |||
| Angle | Mean | 1.10 | 1.66 |
| 0.77 | 0.86 | 0.84 | |
| Std. Dev. |
|
| 0.14 | 0.06 | 0.21 | 0.17 | ||
| Time | Mean | 5.21 | 5.79 | 3.32 | 3.37 | 4.06 |
| |
| M03 | Path Length | Mean |
| 297.01 | 308.14 | 299.94 | 327.87 | 309.12 |
| Std. Dev. |
| 6.36 | 12.59 | 9.52 | 14.38 | 9.89 | ||
| - | 1.16e-2 | 5.20e-7 | 1.28e-3 | 1.89e-14 | 7.80e-10 | |||
| Angle | Mean | 1.62 | 1.72 | 1.25 | 1.36 |
| 1.22 | |
| Std. Dev. | 0.16 | 0.17 | 0.17 | 0.17 | 0.32 |
| ||
| Time | Mean | 6.09 | 6.97 | 3.46 | 3.62 | 4.55 |
| |
| M04 | Path Length | Mean |
| 246.88 | 261.22 | 247.43 | 261.22 | 265.08 |
| Std. Dev. |
| 6.42 | 18.51 | 14.24 | 21.13 | 12.08 | ||
| - | 5.82e-3 | 6.54e-6 |
| 4.41e-5 | 1.21e-11 | |||
| Angle | Mean |
| 0.73 | 0.79 | 0.72 | 0.72 | 0.79 | |
| Std. Dev. |
| 0.24 | 0.22 | 0.18 | 0.16 | 0.24 | ||
| Time | Mean | 7.24 | 8.19 | 4.24 |
| 3.64 | 3.17 | |
| M05 | Path Length | Mean |
| 256.22 | 264.21 | 278.58 | 285.20 | 256.50 |
| Std. Dev. |
| 20.98 | 27.49 | 25.33 | 28.96 | 15.45 | ||
| - |
| 6.64e-3 |
| 4.41e-5 | 1.21e-11 | |||
| Angle | Mean |
| 1.42 | 1.65 | 1.63 | 1.53 | 1.71 | |
| Std. Dev. |
| 0.19 | 0.23 | 0.28 | 0.34 | 0.35 | ||
| Time | Mean | 6.17 | 6.51 | 3.78 | 3.88 | 4.21 |
| |
| M06 | Path Length | Mean |
| 253.25 | 276.18 | 277.02 | 277.02 | 269.56 |
| Std. Dev. |
| 6.35 | 28.26 | 26.98 | 26.98 | 20.71 | ||
| - | 2.54e-2 | 2.10e-5 | 6.64e-6 | 6.64e-6 | 1.65e-5 | |||
| Angle | Mean | 1.37 | 1.57 | 1.82 | 1.83 | 1.88 | 1.94 | |
| Std. Dev. |
| 0.33 | 0.62 | 0.47 | 0.46 | 0.41 | ||
| Time | Mean | 5.39 | 5.79 | 4.33 |
| 4.56 | 4.71 | |
| M07 | Path Length | Mean |
| 249.55 | 255.63 | 248.73 | 250.40 | 248.74 |
| Std. Dev. |
| 15.79 | 15.55 | 14.38 | 13.28 | 12.24 | ||
| - | 7.90e-4 | 1.53e-6 | 7.27e-4 | 4.88e-5 | 1.38e-4 | |||
| Angle | Mean |
| 1.58 | 1.58 | 1.52 | 1.51 | 1.50 | |
| Std. Dev. |
| 0.55 | 0.49 | 0.38 | 0.43 | 0.44 | ||
| Time | Mean | 6.18 | 6.52 | 3.49 | 3.91 | 4.66 |
| |
| M08 | Path Length | Mean |
| 299.19 | 297.54 | 301.95 | 294.78 | 313.55 |
| Std. Dev. |
| 21.41 | 22.26 | 25.27 | 19.50 | 31.35 | ||
| - | 7.32e-3 | 2.41e-2 | 4.82e-3 |
| 1.02e-4 | |||
| Angle | Mean |
| 1.51 | 1.85 | 1.53 | 1.68 | 1.68 | |
| Std. Dev. |
| 0.32 | 0.68 | 0.34 | 0.56 | 0.48 | ||
| Time | Mean | 6.94 | 7.51 |
| 3.92 | 3.51 | 3.47 | |
| M09 | Path Length | Mean |
| 317.16 | 326.26 | 319.64 | 321.30 | 319.37 |
| Std. Dev. |
| 20.11 | 15.96 | 12.37 | 13.67 | 12.50 | ||
| - |
| 7.84e-5 | 6.60e-3 | 2.41e-3 | 9.17e-3 | |||
| Angle | Mean |
| 1.52 | 1.83 | 1.84 | 1.78 | 1.79 | |
| Std. Dev. |
| 0.43 | 0.52 | 0.53 | 0.60 | 0.53 | ||
| Time | Mean | 6.15 | 6.35 |
| 3.97 | 4.28 | 4.12 | |
Fig 4Compared mean path length of various path planning algorithms for MR (The interval on each bar denotes the standard deviation of the path length).
Fig 5Safe path planning (solution) for MR in different test environments (Each map shows the best path obtained by the SSQL algorithm, S is the starting point and T is the target point).
A comparison between SSQL, CQL, PSO, GWO, DA and MFO for safe path planning.
The lowest (best) both mean, standard deviation, and the values of path length bigger than the level 0.05 of significance are highlighted.
| Env | Statistics | SSQL | CQL | PSO | GWO | DA | MFO | |
|---|---|---|---|---|---|---|---|---|
| M01 | Path Length | Mean |
| 296.43 | 347.39 | 297.79 | 352.86 | 321.23 |
| Std. Dev. |
| 42.16 | 11.98 | 35.66 | 14.14 | 28.18 | ||
| - | 1.34e-3 | 1.76e-26 | 1.32e-4 | 5.75e-25 | 4.58e-11 | |||
| Angle | Mean |
| 0.51 | 0.73 | 0.62 | 0.82 | 0.72 | |
| Std. Dev. |
| 0.47 | 0.28 | 0.46 | 0.31 | 0.30 | ||
| Time | Mean | 5.46 | 6.14 |
| 2.87 | 3.52 | 2.77 | |
| M02 | Path Length | Mean |
| 318.69 | 320.25 | 317.71 | 319.47 | 319.86 |
| Std. Dev. |
| 2.98 | 9.41 | 2.87 | 6.94 | 8.56 | ||
| - | 7.92e-4 | 3.35e-2 | 3.79e-2 | 2.37e-2 | 3.59e-2 | |||
| Angle | Mean | 1.20 | 1.71 | 1.22 |
| 0.99 | 1.07 | |
| Std. Dev. |
| 0.16 | 0.58 | 0.47 | 0.44 | 0.50 | ||
| Time | Mean | 6.29 | 6.77 | 3.76 | 3.43 | 3.92 |
| |
| M03 | Path Length | Mean |
| 302.48 | 308.53 | 304.82 | 325.13 | 310.29 |
| Std. Dev. |
| 7.16 | 12.15 | 9.49 | 13.54 | 8.19 | ||
| - |
| 9.81e-4 | 1.73e-2 | 1.60e-11 | 3.98e-7 | |||
| Angle | Mean | 1.59 | 1.72 | 1.42 | 1.48 |
| 1.42 | |
| Std. Dev. |
| 0.17 | 0.30 | 0.25 | 0.43 | 0.27 | ||
| Time | Mean | 6.22 | 6.57 | 3.64 |
| 4.17 | 3.51 | |
| M04 | Path Length | Mean |
| 261.10 | 264.76 | 259.26 | 268.69 | 278.38 |
| Std. Dev. |
| 9.89 | 17.44 | 10.48 | 17.24 | 16.71 | ||
| - | 6.71e-3 | 6.21e-3 |
| 2.20e-4 | 1.45e-8 | |||
| Angle | Mean | 1.57 | 1.90 | 1.51 |
| 1.43 | 1.53 | |
| Std. Dev. |
| 0.76 | 0.62 | 0.70 | 0.82 | 0.56 | ||
| Time | Mean | 5.92 | 6.28 |
| 3.69 | 5.12 | 4.21 | |
| M05 | Path Length | Mean |
| 281.67 | 279.62 | 284.58 | 296.93 | 283.94 |
| Std. Dev. |
| 22.96 | 24.28 | 18.20 | 20.21 | 19.97 | ||
| - | 1.93e-2 |
| 7.06e-4 | 1.22e-7 | 2.26e-3 | |||
| Angle | Mean |
| 1.42 | 1.59 | 1.58 | 1.56 | 1.67 | |
| Std. Dev. |
| 0.29 | 0.34 | 0.36 | 0.38 | 0.43 | ||
| Time | Mean | 5.71 | 6.29 | 4.14 |
| 4.56 | 3.72 | |
| M06 | Path Length | Mean |
| 273.41 | 293.69 | 293.59 | 284.20 | 286.96 |
| Std. Dev. |
| 25.25 | 25.76 | 27.76 | 28.74 | 27.96 | ||
| - | 1.92e-3 | 1.87e-8 | 8.35e-8 | 2.17e-5 | 3.26e-6 | |||
| Angle | Mean |
| 2.41 | 2.14 | 2.48 | 2.65 | 2.70 | |
| Std. Dev. |
| 0.76 | 0.73 | 0.69 | 0.53 | 0.57 | ||
| Time | Mean | 7.34 | 7.72 | 4.18 |
| 4.52 | 4.12 | |
| M07 | Path Length | Mean |
| 332.45 | 340.97 | 355.83 | 361.49 | 335.95 |
| Std. Dev. |
| 50.16 | 37.69 | 31.20 | 41.76 | 49.91 | ||
| - |
| 1.86e-2 | 7.10e-6 | 3.01e-5 |
| |||
| Angle | Mean |
| 3.33 | 3.60 | 3.48 | 3.83 | 4.00 | |
| Std. Dev. |
| 1.49 | 1.28 | 1.17 | 1.21 | 1.20 | ||
| Time | Mean | 5.41 | 5.87 |
| 4.15 | 4.52 | 5.51 | |
| M08 | Path Length | Mean |
| 325.94 | 333.84 | 329.88 | 318.64 | 326.12 |
| Std. Dev. |
| 30.24 | 27.60 | 29.52 | 32.23 | 30.39 | ||
| - | 4.57e-2 | 8.54e-4 | 8.81e-3 |
| 4.37e-2 | |||
| Angle | Mean |
| 1.56 | 1.85 | 1.59 | 1.63 | 1.75 | |
| Std. Dev. |
| 0.29 | 0.65 | 0.29 | 0.53 | 0.43 | ||
| Time | Mean | 6.63 | 7.52 |
| 3.82 | 4.91 | 4.51 | |
| M09 | Path Length | Mean |
| 324.33 | 329.28 | 322.92 | 328.99 | 324.75 |
| Std. Dev. |
| 19.86 | 13.24 | 11.27 | 9.39 | 12.20 | ||
| - | 2.64e-2 | 5.35e-6 | 1.82e-3 | 1.77e-8 | 4.07e-4 | |||
| Angle | Mean |
| 1.83 | 1.93 | 1.87 | 1.74 | 1.86 | |
| Std. Dev. |
| 0.40 | 0.47 | 0.52 | 0.53 | 0.45 | ||
| Time | Mean | 5.15 | 5.62 | 3.71 |
| 4.79 | 4.53 | |
Fig 6Compared mean path length of various path planning algorithms for MR (The interval on each bar denotes the standard deviation of the path length).
Comparison of algorithm performance between SSQL and CQL in short path planning.
| Path length vs CQL | Angle vs CQL | Time vs CQL | |
|---|---|---|---|
| M01 | 7.06% | 76.74% | 9.48% |
| M02 | 1.24% | 33.73% | 10.02% |
| M03 | 1.25% | 5.81% | 12.63% |
| M04 | 1.79% | 20.55% | 11.60% |
| M05 | 2.80% | 6.34% | 5.22% |
| M06 | 1.31% | 12.74% | 6.91% |
| M07 | 4.54% | 33.54% | 5.21% |
| M08 | 3.87% | 11.92% | 7.59% |
| M09 | 1.57% | 14.47% | 3.15% |
Comparison of algorithm performance between SSQL and CQL in safe path planning.
| Path length vs CQL | Angle vs CQL | Time vs CQL | |
|---|---|---|---|
| M01 | 9.22% | 84.31% | 53.19% |
| M02 | 0.74% | 29.82% | 7.09% |
| M03 | 0.84% | 7.56% | 5.33% |
| M04 | 2.43% | 17.37% | 5.73% |
| M05 | 4.10% | 16.90% | 9.22% |
| M06 | 5.76% | 9.54% | 4.92% |
| M07 | 3.15% | 3.90% | 7.84% |
| M08 | 3.93% | 14.10% | 11.84% |
| M09 | 2.62% | 27.32% | 8.36% |