| Literature DB >> 33967349 |
Chenyang Wu1,2, Scott Le Vine2,3, Elizabeth Bengel3, Jason Czerwinski3, John Polak2.
Abstract
In recent years, there has been a scholarly debate regarding the decrease in automobile-related mobility indicators (car ownership, driving license holding, VMT, etc.). Broadly speaking, two theories have been put forward to explain this trend: (1) economic factors whose impacts are well-understood in principle, but whose occurrence among young adults as a demographic sub-group had been overlooked, and (2) less well-understood shifts in cultural mores, values and sentiment towards the automobile. This second theory is devilishly difficult to study, due primarily to limitations in standard data resources such as the National Household Travel Survey and international peer datasets. In this study we first compiled a database of lyrics to popular music songs from 1956 to 2015 (defined by inclusion in the annual "top 40"), and subsequently identified references to automobiles within this corpus. We then evaluated whether there is support for theory #2 above within popular music, by looking at changes from the 1950s to the 2010s. We demonstrate that the frequency of references to automobility tended for many years to increase over time, however there has more recently been a decline after the late 2000s (decade). In terms of the sentiment of popular music lyrics that reference automobiles, our results are mixed as to whether the references are becoming increasingly positive or negative (machine analysis suggests increasing negativity, while human analysis did not find a significant association), however a consistent observation is that sentiment of automobile references have over time become more positive relative to sentiment of song lyrics overall. We also show that sentiment towards automobile references differs systematically by genre, e.g. automobile references within 'Rock' lyrics are in general more negative than similar references to cars in other music genres). The data generated on this project have been archived and made available open access for use by future researchers; details are in the full paper.Entities:
Keywords: Natural language processing; Peak car; Popular music; Sentiment analysis
Year: 2021 PMID: 33967349 PMCID: PMC8088827 DOI: 10.1007/s11116-021-10189-1
Source DB: PubMed Journal: Transportation (Amst) ISSN: 0049-4488 Impact factor: 4.814
Summary of literature in peak car
| Socio-demographic features (i.e. income, gender, car access) | Built environment (i.e. population density, public transport access) | Values and attitudes (i.e. environment awareness, attitude towards cars) | ICT (i.e. smart phone and social media use) | Graduated Driving License |
|---|---|---|---|---|
McDonald and Trowbridge ( Raimond and Milthorpe ( Williams ( Licaj et al. ( Kuhnimhof et al. ( Sivak and Schoettle ( Delbosc and Currie ( Delbosc and Currie ( Delbosc and Currie ( Le Vine et al. ( Le Vine and Polak ( Schoettle and Sivak ( Tefft et al. ( Brown and Handy ( Ciari and Axhausen ( Curry et al. ( Baradaran et al. ( Hjorthol ( Delbosc and Nakanishi ( Thigpen and Handy ( Rérat ( Bayart et al. ( Vaca et al. ( | McDonald and Trowbridge ( Raimond and Milthorpe ( Williams ( Licaj et al. ( Sivak and Schoettle ( Delbosc and Currie ( Delbosc and Currie ( Le Vine et al. ( Le Vine and Polak ( Schoettle and Sivak ( Brown and Handy ( Baradaran et al. ( Hjorthol ( Thigpen and Handy ( Rérat ( Bayart et al. ( Vaca et al. ( | Williams ( Delbosc and Currie ( Delbosc and Currie ( Le Vine et al. ( Schoettle and Sivak ( Brown and Handy ( Fylan and Caveney ( Thigpen and Handy ( | Sivak and Schoettle ( Delbosc and Currie ( Delbosc and Currie ( Delbosc and Currie ( Le Vine et al. ( Schoettle and Sivak ( Brown and Handy ( Thigpen and Handy ( | Raimond and Milthorpe ( Sivak and Schoettle ( Delbosc and Currie ( Tefft et al. ( Thigpen and Handy ( |
Literature review of sentiment analysis in transport
| Study | Objective | Data source | Geography | Algorithm/software used | Key findings |
|---|---|---|---|---|---|
| Qi et al. ( | A framework to extract and analyze public opinions on transport service | Miami-Dade country, US | AFINN | After introducing sentiment variables, the prediction accuracy can rise 18.6% and reach ~ 90% | |
| Kinra et al. ( | Public opinion about the adoption of autonomous vehicles | Denmark | SentiStrengh | Text analytics can be used as a complement to surveys Safety, labor participation and congestion are the most important concerns | |
| Mondschein et al. ( | Customer sentiment towards parking | Yelp | Phoenix, US | Lexicon-based algorithm | Sentiment about parking is in general negative Parking sentiment is part of the overall perception of customers toward a business Districts with more parking spaces per business tend to have more positive parking sentiment Parking is viewed more positively when shared parking facilities are provided |
| El-Diraby et al. ( | Customers’ satisfaction on transit service | Vancouver, Canada | SentiStrength | Sentiment is in general negative Sentiments toward disruption, especially those related to public safety incidents, showed lower levels of negative sentiment The sentiment of the sub-network of the most influential players closely matched the topics and sentiment of the full network | |
| Mendez et al. ( | An approach to capture user satisfaction with public transport | Santiago, Chile | SentiStrength | The amount of bus stops and bus services covered by the proposed approach is larger than survey data The proposed approach is effective in diagnosing problems in a timely manner | |
| Pratt et al. ( | Public opinion regarding ridesharing service | US | Aylien | The number of negative tweets outweighs the number of positive ones about the service characteristics (like routing and travel time) Most tweets about other passengers feature “humor” about other passengers | |
| Rahim Taleqani et al. ( | Public opinion regarding dockless bikesharing | Multiple countries (primarily US) | Logistic regression, support vector machines, and naïve Bayes | The dockless bikesharing system receives more positive sentiments than negative ones The mostly mentioned sub-topics relevant to dockless bikesharing are electric scooters, private e-hailing companies, and blockage of sidewalks | |
| Haghighi et al. ( | A framework to analyze public opinion on transit service quality | Utah, US | Rsentiment proposed by (Bose et al. | The number of negative tweets is greater on weekends than weekdays Most negative tweets are related to transit routes with higher ridership There is potential to use social media data to analyze transit service quality | |
| Kulkarni et al. ( | A system that can analyze public opinion on transport | California, US | Valence Aware Dictionary and sEntiment Reasoner (VADER) | The quality of the system depends on the size of the dataset, the number of topics that are specific to the topic modelling algorithm, and the positive/negative thresholds of the sentiment analysis algorithm | |
| Pai and Liu ( | Predict vehicle sales by sentiment analysis | US | SentiStrength | Both social media sentiment and stock values have predictive power to forecast monthly total vehicle sales | |
| Ali et al. ( | Using sentiment analysis to monitor transportation activities | Not mentioned | SentiWordNet | The proposed approach can determine real-time traffic congestion mapping | |
| Baj-Rogowska ( | Public opinion regarding Uber (ridehailing) | Not mentioned | ProSuite | Sentiment analysis reflected events that affected the company’s reputation | |
| Wijnhoven and Plant ( | Predict car sales by sentiment analysis | Coosto, Twitter, Facebook, LinkedIn, YouTube, Google + , Hyves, Instagram and Pinterest | The Netherlands | Coosto | Social media sentiments have little predictive power towards car sales, while Google Trends data and social mention volume have significant predictive power |
| Effendy et al. ( | Public opinion regarding public transport | Indonesian | Support vector machines | The accuracy of sentiment analysis using support vector machine can reach 78% | |
| Fraedrich and Lenz ( | Public interest in autonomous driving | Online comments on newspaper articles | Germany and US | Qualitative content analysis | Response to autonomous driving in different countries and different types of media is different Sentiment towards autonomous driving is generally positive, however the authors report finding some negative sentiments |
| Giancristofaro and Panangadan ( | Public opinion of the California Department of Transportation | California, US | Support vector machines, naïve Bayes, and random forests | The precision of sentiment analysis can be improved if images and texts are combined | |
| Hao et al. ( | Public opinion towards the I-710 Corridor Project | California, US | API based on a naïve Bayes classifier | There are increasing twitter users participating the I-710 Corridor Project over time The number of comments from personal twitter accounts is positively correlated to the number of tweets from the organization account Twitter users are more likely to sent positive comments in the morning and negative comments in the afternoon towards I-710 Project Twitter users are more positive towards “Freeway Tunnel”, “Light Rail Transit” and improving the existing infrastructure, and more negative towards “Rapid Bus Transit” | |
| Das et al. ( | Users’ sentiment toward bikesharing | Washington DC, US | Lexicon-based algorithm | Most people view the bikesharing system positively | |
| Luong and Houston ( | Public opinion on light rail service | Los Angeles, US | Lexicon-based algorithm | Commuters mainly retweeted from other individuals or transit agencies, while schools and firms did not have strong retweet connections The Red Line was associated with the most positive tweets whereas the Blue Line has the most negative sentiments (of Los Angeles’ heavy rail lines) | |
| Schweitzer ( | Public opinion regarding public transit | US | Lexicon-based algorithm | Public transit receives the most negative comments Transit companies that respond to other social media users receive statistically more positive sentiments | |
| Collins et al. ( | Transit riders’ satisfaction | Chicago, US | SentiStrength | Riders express negative sentiments more than positive sentiments when an event occurs When an accident occurs, unusually high total social media engagement also occurs |
Fig. 1Percentage of music, by decade and genre
Fig. 2Of all references to cars, the percentage distribution by genre
Fig. 3Average word count per song, by decade and genre
Fig. 4Average duration per song in minutes, by decade and genre
Fig. 5Average number of automobile references per song
Fig. 6Average number of automobile references per 100 words
Fig. 7Percentage of tokens meeting various criteria (see label of each curve)
Fig. 8Percentage of tokens meeting each criterion in legend, by genre
Correlation matrix comparing coding outputs for car token sentiments
| Alchemy | Joy Watson | Human analyst 1a | Human analyst 2 | |
|---|---|---|---|---|
| Alchemy | 1.0 | 0.44 | 0.16 | 0.13 |
| Joy Watson | 1.0 | 0.13 | 0.09 | |
| Human analyst 1 | 1.0 | 0.60 | ||
| Human analyst 2 | 1.0 |
aSee acknowledgments for identification of human analysts
Fig. 9Average sentiment of songs and tokens
Correlation between sentiment and year (bold indicates significant at p < 0.05)
| Alchemy | Joy Watson | Human analyst 1 | Human analyst 2 | |||||
|---|---|---|---|---|---|---|---|---|
| Automobile references | All lyrics | Automobile references | All lyrics | Automobile references | All lyrics | Automobile references | All lyrics | |
| Pearson’s correlation between sentiment and year | 0.03 ( | |||||||
Fig. 10Average sentiment of tokens meeting each criterion in legend
Regression analysis for sentiment of automobile references
| Alchemy | Joy Watson | Human analyst 1 | Human analyst 2 | |||||
|---|---|---|---|---|---|---|---|---|
| Coefficients | Coefficients | Coefficients | Coefficients | |||||
| Constant | 7.92 | 0.01 | 6.42 | < 0.01 | 0.55 | < 0.01 | 0.43 | < 0.01 |
| Year | < 0.01 | < 0.01 | * | * | ||||
| Percentage of female artists | * | * | < 0.01 | * | ||||
| Genre: Pop | 0 (fixed) | 0 (fixed) | 0 (fixed) | 0 (fixed) | ||||
| Genre: Hip-Hop/Rap | * | 0.04 | * | 0.29 | < 0.01 | |||
| Genre: R&B/Soul | * | * | 0.02 | * | ||||
| Genre: Rock | 0.13 | 0.04 | < 0.01 | < 0.01 | ||||
| Genre: Country | * | * | * | * | ||||
| Genre: Other | * | * | * | * | ||||
| Criteria: Cars | * | * | * | * | ||||
| Criteria: Traffic | * | * | < 0.01 | < 0.01 | ||||
| Criteria: Driving | * | * | 0.14 | 0.07 | 0.09 | 0.13 | ||
| Criteria: Passenger | * | * | * | * | ||||
| Criteria: Stationary in Vehicle | * | * | * | * | ||||
| Criteria: Car Parts | * | * | * | 0.12 | 0.06 | |||
| Criteria: Brand | * | * | 0.21 | < 0.01 | 0.19 | < 0.01 | ||
| Criteria: Taxi/Hitching a Ride | * | 0.02 | * | * | ||||
| Adjusted | 0.01 | 0.05 | 0.10 | 0.15 | ||||
| Model significance | 0.03 | < 0.01 | < 0.01 | < 0.01 | ||||
*Factors with p value > 0.15 were excluded
Regression analysis for sentiment of songs (i.e. all-lyrics)
| Alchemy | Joy Watson | Human analyst 1 | Human analyst 2 | |||||
|---|---|---|---|---|---|---|---|---|
| Coefficients | Coefficients | Coefficients | Coefficients | |||||
| Constant | 7.98 | < 0.01 | 8.15 | < 0.01 | 8.07 | 0.08 | 0.16 | < 0.01 |
| Year | < 0.01 | < 0.01 | < 0.01 | * | ||||
| Percentage of female artists | * | * | * | * | ||||
| Automobile reference | * | * | * | * | ||||
| Genre: Pop | ||||||||
| Genre: Hip-Hop/Rap | * | * | 0.33 | 0.04 | 0.28 | 0.06 | ||
| Genre: R&B/Soul | 0.08 | 0.01 | * | 0.27 | < 0.01 | * | ||
| Genre: Rock | 0.04 | 0.05 | 0.07 | < 0.01 | ||||
| Genre: Country | * | * | * | * | ||||
| Genre: Other | 0.22 | 0.11 | * | * | * | |||
| Adjusted | 0.08 | 0.05 | 0.03 | 0.03 | ||||
| Model significance | < 0.01 | < 0.01 | < 0.01 | < 0.01 | ||||
* Factors with p value > 0.15 were excluded
Regression analysis for sentiment of automobile references (10-year band model)
| Alchemy | Joy Watson | Human analyst 1 | Human analyst 2 | |||||
|---|---|---|---|---|---|---|---|---|
| Coefficients | Coefficients | Coefficients | Coefficients | |||||
| Constant | 0.14 | 0.11 | 0.45 | < 0.01 | 0.55 | < 0.01 | 0.43 | < 0.01 |
| Decade | < 0.01 | < 0.01 | * | * | ||||
| Percentage of female artists | * | * | < 0.01 | * | ||||
| Genre: Pop | 0 (fixed) | 0 (fixed) | 0 (fixed) | 0 (fixed) | ||||
| Genre: Hip-Hop/Rap | 0.15 | 0.03 | * | 0.29 | < 0.01 | |||
| Genre: R&B/Soul | * | * | 0.02 | * | ||||
| Genre: Rock | < 0.01 | 0.06 | < 0.01 | < 0.01 | ||||
| Genre: Country | * | * | * | * | ||||
| Genre: Other | * | * | * | * | ||||
| Criteria: Cars | * | * | * | * | ||||
| Criteria: Traffic | * | * | < 0.01 | < 0.01 | ||||
| Criteria: Driving | 0.13 | 0.14 | 0.07 | 0.09 | 0.13 | |||
| Criteria: Passenger | * | * | * | * | ||||
| Criteria: Stationary in Vehicle | * | * | * | * | ||||
| Criteria: Car Parts | * | * | * | 0.12 | 0.06 | |||
| Criteria: Brand | * | * | 0.21 | < 0.01 | 0.19 | < 0.01 | ||
| Criteria: Taxi/Hitching a Ride | 0.06 | 0.01 | * | * | ||||
| Adjusted | 0.03 | 0.04 | 0.10 | 0.15 | ||||
| Model significance | < 0.01 | < 0.01 | < 0.01 | < 0.01 | ||||
*Factors with p value > 0.15 were excluded
Regression analysis for sentiment of automobile references (before-after peak car model)
| Alchemy | Joy Watson | Human Analyst 1 | Human Analyst 2 | |||||
|---|---|---|---|---|---|---|---|---|
| Coefficients | Coefficients | Coefficients | Coefficients | |||||
| Constant | Not significant | 0.36 | < 0.01 | 0.55 | < 0.01 | 0.43 | < 0.01 | |
| After peak car | < 0.01 | * | * | |||||
| Percentage of female artists | * | < 0.01 | * | |||||
| Genre: Pop | 0 (fixed) | 0 (fixed) | 0 (fixed) | |||||
| Genre: Hip-Hop/Rap | 0.01 | * | 0.29 | < 0.01 | ||||
| Genre: R&B/Soul | * | 0.02 | * | |||||
| Genre: Rock | 0.10 | < 0.01 | < 0.01 | |||||
| Genre: Country | * | * | * | |||||
| Genre: Other | * | * | * | |||||
| Criteria: Cars | * | * | * | |||||
| Criteria: Traffic | * | < 0.01 | < 0.01 | |||||
| Criteria: Driving | 0.14 | 0.07 | 0.09 | 0.13 | ||||
| Criteria: Passenger | * | * | * | |||||
| Criteria: Stationary in Vehicle | * | * | * | |||||
| Criteria: Car Parts | * | * | 0.12 | 0.06 | ||||
| Criteria: Brand | * | 0.21 | < 0.01 | 0.19 | < 0.01 | |||
| Criteria: Taxi/Hitching a Ride | 0.03 | * | * | |||||
| Adjusted | 0.04 | 0.10 | 0.15 | |||||
| Model significance | < 0.01 | < 0.01 | < 0.01 | |||||
*Factors with p value > 0.15 were excluded