| Literature DB >> 33659911 |
Sooji Ha1,2, Daniel J Marchetto3, Sameer Dharur4, Omar I Asensio3,5.
Abstract
The transportation sector is a major contributor to greenhouse gas (GHG) emissions and is a driver of adverse health effects globally. Increasingly, government policies have promoted the adoption of electric vehicles (EVs) as a solution to mitigate GHG emissions. However, government analysts have failed to fully utilize consumer data in decisions related to charging infrastructure. This is because a large share of EV data is unstructured text, which presents challenges for data discovery. In this article, we deploy advances in transformer-based deep learning to discover topics of attention in a nationally representative sample of user reviews. We report classification accuracies greater than 91% (F1 scores of 0.83), outperforming previously leading algorithms in this domain. We describe applications of these deep learning models for public policy analysis and large-scale implementation. This capability can boost intelligence for the EV charging market, which is expected to grow to US$27.6 billion by 2027.Entities:
Keywords: artificial intelligence; consumer behavior; deep learning; electric vehicles; machine learning; natural language processing; policy analysis; sustainable transportation; topic classification; transformers
Year: 2021 PMID: 33659911 PMCID: PMC7892356 DOI: 10.1016/j.patter.2020.100195
Source DB: PubMed Journal: Patterns (N Y) ISSN: 2666-3899
EV mobile app typology of user reviews
| Topic | Sub-topic examples |
|---|---|
| Functionality | general functionality, charger, screen, power level, connector type, card, reader, connection, time, error message, station, mobile application, customer service |
| Range anxiety | trip, range, location accessibility |
| Availability | number of stations available, ICE, general congestion |
| Cost | parking, charging, payment |
| User interactions | charger etiquette, anticipated time available, user tips |
| Location | general location, directions, staff, amenities, points of interest, user activity, signage |
| Service time | charging rate |
| Dealership | dealership charging experience, competing brand quality, relationship with dealers |
| Other | general experiences |
ICE refers to situations where a charging station is blocked by an internal combustion engine vehicle.
Overall model performance
| Accuracy % (SD) | F1 score (SD) | |
|---|---|---|
| BERT | 91.6 (0.13) | 0.83 (0.0037) |
| XLNet | 91.6 (0.07) | 0.84 (0.0015) |
| Majority classifier | 81.1 (0.00) | 0.45 (0.0000) |
| LSTM | 90.3 (0.17) | 0.80 (0.0036) |
| CNN | 90.9 (0.12) | 0.81 (0.0032) |
Models were trained and tested on expert annotated data.
Figure 1Topic level classification performance
(A) For the baseline model we use the majority classifier, which predicts the simple majority for a given topic. For higher values in accuracy, the majority classifier reflects more imbalance in the training and testing data. We find that the deep learning models outperform the majority classifier in model accuracy, particularly for more frequently occurring labels, the Functionality, Location, and Availability topics.
(B) We also compare the relative performance of the transformer models with CNN and LSTM classifiers. High F1 scores for imbalanced topics indicate strong detection of true positives. Our results indicate that transformer models, BERT and XLNet, which achieve similar performance, improve upon the CNN and LSTM benchmarks in the F1 score across all topics. The error bars represent upper and lower 95% confidence intervals.
See also Tables S2 and S3.
Ground truth evaluation of human performance versus transformer models
| Classifier | Training set | Accuracy % (SD) | F1 score (SD) |
|---|---|---|---|
| BERT | Expert annotated | 89.1 (4.09) | 0.82 (0.06) |
| BERT | Crowd annotated | 73.2 (3.85) | 0.53 (0.06) |
| XLNet | Expert annotated | 91.0 (4.70) | 0.85 (0.06) |
| XLNet | Crowd annotated | 74.2 (4.15) | 0.54 (0.07) |
| Crowd ( | – | 73.9 (6.06) | 0.61 (0.09) |
| Human experts ( | – | 86.0 (4.40) | 0.79 (0.07) |
Cross validation was for 10 runs.
Examples where expert-trained transformers exceed human benchmarks
| Ground truth | Human expert | Expert-trained transformers | |||||
|---|---|---|---|---|---|---|---|
| BERT | XLNet | ||||||
| Labels | Labels | Acc. (%) | Labels | Acc. (%) | Labels | Acc. (%) | |
“… unit says decommissioned but it will still release the charger after a long pause.” | Functionality | User interaction | 75 | Functionality | 100 | Functionality | 100 |
“Thanks very busy dealership but happy to allow use of qcdc.” | Functionality, Availability, Dealership | Functionality, Dealership | 87.5 | Functionality, Availability, Dealership | 100 | Functionality, Availability, Dealership | 100 |
“Charging on the quick charger - will be done by 12:15.” | Functionality, User interaction | Functionality, Location | 75 | User interaction | 87.5 | User interaction | 87.5 |
“Went from 18-82% in 27 min! First time DC charging and met another nice Leaf owner who showed me how to use the machine. Thanks for the charge!” | Functionality, Service time | Functionality, Availability, Location, User interaction, Dealership | 62.5 | Service time | 87.5 | Functionality, Service time, Dealership | 87.5 |
“The CHAdeMO charger does work …. Nissan Hill had to move an ICE for me to gain access, but did so quickly. The CHAdeMO did not cost me any $ Charged quick! Don't hesitate to use.” | Functionality, Availability, Cost, Dealership | Functionality, Availability, Cost, User interaction, Location, Service time, Dealership | 62.5 | Functionality, Cost, Dealership | 87.5 | Functionality, Cost, Service time, Dealership | 75 |
“So the dealer had all of their cars being serviced parked in every spot including the quick charger. I called and asked them for at least access to the quick charger and they agreed but never did anything so I left and drove to Larry h nissan. I was willing to pay because I was in a hurry and obviously the Toyota dealer doesn't want my business.” | Availability, Cost, Dealership | Functionality, Availability, User interaction, Location, Dealership | 50 | Availability, Dealership | 87.5 | Availability, Location, Dealership | 75 |
Figure 2Predicted discussion frequency of station availability for US metropolitan and micropolitan statistical areas
The map reveals areas with high and low discussion frequency for predicted Availability issues in all metropolitan statistical areas (e.g., population greater than 50,000). Micropolitan statistical areas (e.g., population 10,000–49,999) have higher Availability discussions in some states in the West and Midwest regions. The algorithms predict that many micropolitan statistical areas could be underserved with regard to station availability.