| Literature DB >> 32554384 |
David Grethlein1,2, Flaura Koplin Winston1,3,4, Elizabeth Walshe3,5, Sean Tanner1,6, Venk Kandadai1, Santiago Ontañón2.
Abstract
BACKGROUND: A large Midwestern state commissioned a virtual driving test (VDT) to assess driving skills preparedness before the on-road examination (ORE). Since July 2017, a pilot deployment of the VDT in state licensing centers (VDT pilot) has collected both VDT and ORE data from new license applicants with the aim of creating a scoring algorithm that could predict those who were underprepared.Entities:
Keywords: accidents, traffic; adolescent; automobile driving; cause of death; child; humans; licensure; machine learning; motor vehicle; motor vehicles; on-road exam; simulated driving assessment; support vector machines
Mesh:
Year: 2020 PMID: 32554384 PMCID: PMC7333075 DOI: 10.2196/13995
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Workstation Setup: 1) Standard Monitor, 2) Standard Desktop Computer, 3) Off-the-shelf USB Steering Wheel and Pedals.
Figure 2Virtual driving test workstation.
Figure 3Sample derivation: data from 335 (7.2%) enrolled driver applicants were excluded from the final sample of 4308 because they either did not complete the VDT workflow or their assessment replay file was unavailable for analysis. VDT: virtual driving test.
Figure 4Time Series Clustering for highlighted event zone with cluster centers representing prototypical driving behaviors in that zone.
A confusion matrix, showing the 4 quadrants: FF (fail-fail), FP (fail-pass), PF (pass-fail), and PP (pass-pass).
| Confusion matrix | Fail OREa | Pass ORE |
| Fail VDTb | FF | FP |
| Pass VDT | PF | PP |
aORE: on-road examination.
bVDT: virtual driving test.
Summary results of the evaluation metrics obtained from 4 classifiers: variables+logistic regression, variables+support vector machine, time series clustering+logistic regression, and series time clustering+support vector machine.
| Classifier results | Standard method (variables) | Novel method (time series clustering) | ||
|
| Logistic regression | SVMa | Logistic regression | SVM |
| Accuracy, % | 76.1 | 75.4 | 76.2 | 74.9 |
| Fail rate, % | 6.6 | 3.4 | 3.5 | 3.6 |
| False alarm rate, % | 2.6 | 1.3 | 1.0 | 1.7 |
| Ratio of false alarms, % | 38.5 | 37.7 | 27.2 | 45.9 |
| Relative risk (95% CI) | 2.684 (2.409-2.991) | 2.581 (2.250-2.961) | 3.071 (2.747-3.434) | 2.223 (1.906-2.592) |
| True positive rate, % | 15.9 | 8.3 | 10.0 | 7.8 |
| False positive rate, % | 3.4 | 1.7 | 1.3 | 2.2 |
aSVM: support vector machine.
Figure 5Receiver operator characteristic curves for logistic regression using the variables and time series clustering feature sets for iterated logistic cutoff threshold values. Points in the bottom left represent models with the lowest thresholds (more people pass the virtual driving test), whereas points in the top right represent models with the highest thresholds (fewer people pass the virtual driving test).