| Literature DB >> 35062596 |
Lingli Yu1,2, Shuxin Huo1,2, Keyi Li1,2, Yadong Wei1,2.
Abstract
An intelligent land vehicle utilizes onboard sensors to acquire observed states at a disorderly intersection. However, partial observation of the environment occurs due to sensor noise. This causes decision failure easily. A collision relationship-based driving behavior decision-making method via deep recurrent Q network (CR-DRQN) is proposed for intelligent land vehicles. First, the collision relationship between the intelligent land vehicle and surrounding vehicles is designed as the input. The collision relationship is extracted from the observed states with the sensor noise. This avoids a CR-DRQN dimension explosion and speeds up the network training. Then, DRQN is utilized to attenuate the impact of the input noise and achieve driving behavior decision-making. Finally, some comparative experiments are conducted to verify the effectiveness of the proposed method. CR-DRQN maintains a high decision success rate at a disorderly intersection with partially observable states. In addition, the proposed method is outstanding in the aspects of safety, the ability of collision risk prediction, and comfort.Entities:
Keywords: collision relationship; decision-making; deep recurrent Q network; intelligent land vehicle; partially observable Markov decision process
Year: 2022 PMID: 35062596 PMCID: PMC8780178 DOI: 10.3390/s22020636
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Summary of the common decision-making method.
| Method | Reference | Application | |
|---|---|---|---|
|
| [ | Lane changing at congested, urban scenarios | |
| [ | Decision-making at an urban unsignalized intersection | ||
|
| [ | Parking | |
|
| [ | Decision-making in a vehicle sensor tracking system | |
| [ | Lane changing | ||
|
| Machine learning | [ | Driving style classification |
| [ | Lateral motion prediction | ||
| Deep reinforcement learning | [ | Decision-making at intersections | |
| [ | Lane changing in dynamic and uncertain highways | ||
Figure 1Description of collision relationship between an ego vehicle and one surrounding vehicle.
Figure 2Description of collision relationship between an ego vehicle and multiple surrounding vehicles.
Figure 3Construction of Driving Behavior Decision-making Method based on CR-DRQN.
Figure 4Training process based on CR-DRQN. (a) presents the change of the reward obtained by the intelligent vehicle with the increase of episodes. (b) presents the relationship between the loss and update steps.
Settings of CR-DRQN’s parameters and corresponding training results.
| SerialNumber | Network Layers | Network Parameters |
|
|
| Q-Value |
|---|---|---|---|---|---|---|
| 1 | 4 | 64/32/16/5 | −5 | −0.2 | 442.7 | 83.34 |
| 2 | 128/64/32/5 | 9 | −1.6 | 466.4 | 100.68 | |
| 3 | 256/128/64/5 | 12 | −1.4 | 447.8 | 100.26 | |
| 4 | 5 | 64/32/16/8/5 | 3 | −0.4 | 448.3 | 92.26 |
| 5 | 128/64/32/16/5 | −4 | −0.6 | 446.6 | 84.72 | |
| 6 | 256/128/64/32/5 | 3 | −6.4 | 444.8 | 85.56 | |
| 7 |
| 128/64/32/16/8/5 | 3 | −0.4 | 448.3 | 92.26 |
| 8 | 160/80/40/20/10/5 | 3 | −0.4 | 448.3 | 92.26 | |
|
|
|
|
| 485.5 |
| |
| 10 | 320/160/80/40/20/5 | 30 | −1 | 485.5 | 126.1 | |
| 11 | 7 | 256/128/64/32/16/8/5 | −15 | −11.6 | 439 | 61.2 |
| 12 | 320/160/80/40/20/10/5 | 15 | −4.3 |
| 121.68 | |
| 13 | 512/256/128/64/32/16/5 | 26 | −0.5 | 474.7 | 120.44 | |
| 14 | 640/320/160/80/40/20/5 | 6 | −0.4 | 443.4 | 94.26 | |
| 15 | 8 | 512/256/128/64/32/16/8/5 | −3 | −8.2 | 437 | 76.2 |
| 16 | 640/320/160/80/40/20/10/5 | −9 | −0.7 | 434.4 | 77.18 |
Figure 5Decision success rate under different probabilities of noise occurrence.
Figure 6Velocity of an ego vehicle through an intersection at different noise probabilities. (a–f) present the results at the noise probabilities of 10–60% respectively.
Figure 7Action chose by CR-DRQN at different probabilities of noise occurrence. (a–f) present the results at the noise probabilities of 10–60% respectively.
Average change times of the ego vehicle’s acceleration at different probabilities of noise occurrence after 30 experiments.
| Noise Probability | 10% | 20% | 30% | 40% | 50% | 60% |
|---|---|---|---|---|---|---|
| DQN | 33 | 36 | 28 | 36 | 40 | 25 |
| Prioritized-DQN | 40 | 37 | 40 | 33 | 41 | 44 |
| DDQN | 38 | 32 | 35 | 32 | 32 | 38 |
| D3QN | 22 | 23 | 32 | 29 | 32 | 40 |
|
|
|
|
|
|
|
|