| Literature DB >> 31615164 |
Jisun Park1, Mingyun Wen2, Yunsick Sung3, Kyungeun Cho4.
Abstract
Nowadays, deep learning methods based on a virtual environment are widely applied to research and technology development for autonomous vehicle's smart sensors and devices. Learning various driving environments in advance is important to handle unexpected situations that can exist in the real world and to continue driving without accident. For training smart sensors and devices of an autonomous vehicle well, a virtual simulator should create scenarios of various possible real-world situations. To create reality-based scenarios, data on the real environment must be collected from a real driving vehicle or a scenario analysis process conducted by experts. However, these two approaches increase the period and the cost of scenario generation as more scenarios are created. This paper proposes a scenario generation method based on deep learning to create scenarios automatically for training autonomous vehicle smart sensors and devices. To generate various scenarios, the proposed method extracts multiple events from a video which is taken on a real road by using deep learning and generates the multiple event in a virtual simulator. First, Faster-region based convolution neural network (Faster-RCNN) extracts bounding boxes of each object in a driving video. Second, the high-level event bounding boxes are calculated. Third, long-term recurrent convolution networks (LRCN) classify each type of extracted event. Finally, all multiple event classification results are combined into one scenario. The generated scenarios can be used in an autonomous driving simulator to teach multiple events that occur during real-world driving. To verify the performance of the proposed scenario generation method, experiments using real driving video data and a virtual simulator were conducted. The results for deep learning model show an accuracy of 95.6%; furthermore, multiple high-level events were extracted, and various scenarios were generated in a virtual simulator for smart sensors and devices of an autonomous vehicle.Entities:
Keywords: autonomous vehicle; deep learning; scenario generation; smart sensor and device
Year: 2019 PMID: 31615164 PMCID: PMC6833086 DOI: 10.3390/s19204456
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Proposed multi-event-based scenario generation approach.
Figure 2Multiple high-level events extraction process.
Figure 3High-level event area extraction process.
Figure 4Long-term recurrent convolution networks (LRCN)-based event classification model structure.
List of scenario elements.
| Elements | Symbols | Description |
|---|---|---|
| Scenario | s = (e list) | One scenario includes multiple events |
| Event | e = (o list, event class) | An event includes the list of objects included in the event and the class of the event |
| Object | o = object | Objects including persons, animals, or cars |
| High-level event class | c = event class | Types of events occurring in the driving video |
Figure 5Execution of multiple events in a virtual simulator through the scenario input.
Figure 6Virtual simulator environment for an autonomous vehicle to train.
List of event types.
| High-Level Event Class | No. of Video Clips |
|---|---|
| human_push_car | 36 |
| human_motocyling | 27 |
| human_hugging | 85 |
| vehicle_changeLane | 16 |
| human_wave_hand | 20 |
| human_pet_animal | 12 |
| vehicle_turn | 24 |
| human_checkVictim | 22 |
| vehicle_stop | 50 |
| human_walk | 53 |
| vehicle_pass_by | 70 |
| human_crossroad | 39 |
| human_run | 13 |
| human_wait | 81 |
| human_getoff | 15 |
| human_check Car | 5 |
| human_use_phone | 7 |
| human_fight | 10 |
| human_phone_call | 27 |
| human_talk | 12 |
| human_smoking | 53 |
| human_trash_collecting | 24 |
| human_sit | 24 |
| 23 | 725 |
List of object types.
| Object Types (Total Nine Types) |
|---|
| Person, car, bike, motorbike, bus, truck, bird, cat, dog |
Figure 7Faster-region based convolution neural network (RCNN) results.
Figure 8Faster-RCNN based event area integration algorithm results.
Confusion of estimates and actual values.
| Confusion Matrix | Actual Values | ||
|---|---|---|---|
| Positive | Negative | ||
|
|
| True Positive | False Positive |
|
| False Negative | True Negative | |
Figure 9Confusion matrix of the LRCN result.
Comparison results of classification models.
| LRCN [ | LRCN + Full Area | LRCN (Inception-v3) | |
|---|---|---|---|
| Classification Accuracy | 78.2 | 80.5 | 95.6 |
Scenario Generation Result #1.
| Input Data |
|---|
|
|
|
|
| s1 = {e1(human_motocyling), e2(vehicle_changeLane)} |
|
|
|
|
Scenario Generation Result #2.
| Input Data |
|---|
|
|
|
|
| s2 = {e1(vehicle_turn), e2(vehicle_stop)} |
|
|
|
|
Scenario Generation Result #3.
| Input Data |
|---|
|
|
|
|
| s3 = {e1(vehicle_stop), e2(human_motocyling e3(vehicle_stop)} |
|
|
|
|
Scenario Generation Result #4.
| Input data |
|---|
|
|
|
|
| s4 = {e1(human_pet_animal), e2(vehicle_stop)} |
|
|
|
|