| Literature DB >> 34188335 |
Tasnim M A Zayet1, Maizatul Akmar Ismail1, Kasturi Dewi Varathan1, Rafidah M D Noor2, Hui Na Chua3, Angela Lee3, Yeh Ching Low3, Sheena Kaur Jaswant Singh4.
Abstract
Social media is a pool of users' thoughts, opinions, surrounding environment, situation and others. This pool can be used as a real-time and feedback data source for many domains such as transportation. It can be used to get instant feedback from commuters; their opinions toward the transportation network and their complaints, in addition to the traffic situation, road conditions, events detection and many others. The problem is in how to utilize social media data to achieve one or more of these targets. A systematic review was conducted in the field of transportation-related research based on social media analysis (TRR-SMA) from the years between 2008 and 2018; 74 papers were identified from an initial set of 703 papers extracted from 4 digital libraries. This review will structure the field and give an overview based on the following grounds: activity, keywords, approaches, social media data and platforms and focus of the researches. It will show the trend in the research subjects by countries, in addition to the activity trends, platforms usage trend and others. Further analysis of the most employed approach (Lexicons) and data (text) will be also shown. Finally, challenges and future works are drawn and proposed. Supplementary Information: The online version contains supplementary material available at 10.1007/s11192-021-04046-2. © Akadémiai Kiadó, Budapest, Hungary 2021.Entities:
Keywords: Intelligent transportation system; Opinion mining; Sentiment analysis; Social media analysis; Systematic mapping review; Text mining; Traffic
Year: 2021 PMID: 34188335 PMCID: PMC8222706 DOI: 10.1007/s11192-021-04046-2
Source DB: PubMed Journal: Scientometrics ISSN: 0138-9130 Impact factor: 3.801
Fig. 1Systematic mapping construction methodology
Main research questions (RQs)
| Research questions (RQs) | Motivation |
|---|---|
| RQ1: How social media is used in transportation research based on social media analysis? | To identify the trends in the field, the used keywords, the used social media data, the used social media platforms and the used approaches |
| RQ2: What are the aims of transportation researches based on social media analysis? | To identify the targets of the researches and their trends in the world and by countries |
| RQ3: What are the challenges, principal findings and possible future works in the field? | To identify the challenges in the field and main findings from the analysis and draw the needs of the field |
Sub-questions of RQ1
| Sub-research questions drawn from RQ1 (s-RQ1) | Motivation |
|---|---|
| s1-RQ1: What is the distribution of the researches in terms of activity? | To identify the trend in the publication in the field in terms of years, countries, publishers and first authorship |
| s2-RQ1: What are the used keywords in the field? | To identify the most used keywords by authors in the researches |
| s3-RQ1: What are the social media data/attributes used by the researchers? | To identify the used social media attributes, the most used ones and the aim of using them |
| s4-RQ1: What are the rules of text data/text mining in the TRR-SMA field? | To identify the usages of the text data |
| S5-RQ1: What are the social media platforms used by the literature? | To identify the most used social media platforms and their usage trend |
| S6-RQ1: What are the datasets used by researchers? | To identify the datasets and the methods of collecting them |
| S7-RQ1: What are the approaches used to analyse social media data in transportation researches based on social media analysis? | To identify the most used methods for analysing social media data |
Sub-questions of RQ2
| Sub-research questions drawn from RQ2 (s-RQ2) | Motivation |
|---|---|
| s1-RQ2: Which subjects were targeted by researchers in the TRR-SMA field? | To identify the targets of the researches |
| s2-RQ2: What are the trended subjects in the world and by countries? | To identify the trends of research-subjects around the world and in the target countries |
| s3-RQ2: What are the social media attributes used for achieving the targets? | To identify the role of social media data in the research field to achieve each research target |
Fig. 2Search and selection process
Query terms and synonyms
| Transportation-related terms | Social media analysis-related terms |
|---|---|
| Transport | Social media analysis |
| Transportation | Opinion mining |
| Transport-related | Text mining |
| Intelligent transportation system | Sentiment analysis |
| Social network analysis |
The inclusion and exclusion criteria
| Inclusion terms (ICs) | Exclusion terms (ECs) |
|---|---|
| IC1: The work proposed a method for social media analysis for transportation-related subjects | EC1: The work is a thesis, book and other grey literature |
| IC2: The work is a journal paper or a paper in a conference proceeding/ peer-reviewed paper | EC2: Papers written in other languages, other than English |
| IC3: Clearly mentioned the dataset used | EC3: The dataset is not clear/mentioned |
| IC4: Workshop/journal papers that have been extended from conference papers | EC4: Conference papers that have been extended to journal/workshop papers |
| EC5: Works related to social media analysis but not for transportation-related subject or vice versa |
Examples of the excluded studies and the corresponding EC term
| The excluded studies | Exclusion term (EC) |
|---|---|
| Liu et al. ( | EC2 |
| Patel et al. ( | EC3 |
| Di Wang et al. ( | EC4 |
Fig. 3The classification scheme
Fig. 4The activity trend of TRR-SMA Field over the past decade
Fig. 5The activity distribution of the top active countries
The top publishers in the TRR-SMA field
| Publisher name | Number of publications |
|---|---|
| IEEE conference on intelligent transportation systems | 5 |
| IEEE Transactions on Intelligent Transportation Systems | 3 |
| IEEE International Conference on Big Data Computing Service and Applications | 2 |
| IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining | 2 |
| Industrial Management and Data Systems | 2 |
| International Conference on Information and Communication Technology | 2 |
| Transportation Research Part C: Emerging Technologies | 2 |
Fig. 6The most used keywords
Roles of Text Data in TRR-SMA Field
| Text Role | Location Detection | Sentiment Analysis | Topic Extraction/ Classification | Weather Detection | Routs Detection | Account/Company Detection | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Study | Location names | Sentimental words | Capitalization | Negation | Boosters | Intensifiers | Letter Repetition | Emotions | Punctuation | Keywords/Terms | Hashtags | Transport Mode | Features/Aspect s | Keywords | Origin–Destination | Mentions |
| (Abalı et al., | √ | |||||||||||||||
| (Alamsyah et al. | √ | |||||||||||||||
| (Alamsyahl & Rachmadiansyah, | ||||||||||||||||
| (Ali et al., | √ | |||||||||||||||
| (Buch et al., | √ | |||||||||||||||
| (Y. Chen et al., | √ | |||||||||||||||
| (Fiarni et al., | √ | √ | √ | |||||||||||||
| (Gal-Tzur et al., | √ | √ | ||||||||||||||
| (Gupta et al., | √ | |||||||||||||||
| (Haghighi et al., | √ | √ | √ | |||||||||||||
| (Hosseini et al., | √ | |||||||||||||||
| (Kaur & Balakrishnan, | √ | √ | √ | √ | √ | |||||||||||
| (Kovács-Győri et al. | √ | |||||||||||||||
| (Kulkarni et al., | √ | √ | ||||||||||||||
| (K. Lee & Yu, | √ | |||||||||||||||
| (Musaev et al., | ||||||||||||||||
| (Rane & Kumar, | ||||||||||||||||
| (Rybarczyk et al., | √ | √ | ||||||||||||||
| (Salas et al., | √ | |||||||||||||||
| (Samonte et al. | ||||||||||||||||
| (Saragih and Girsang 2018) | √ | √ | ||||||||||||||
| (Sdoukopoulos et al. | √ | √ | ||||||||||||||
| (Serna & Gasparovic, | √ | |||||||||||||||
| (Sternberg et al., | √ | |||||||||||||||
| (C. Wang et al. | ||||||||||||||||
| (Wayasti et al., | ||||||||||||||||
| (Windasari et al., | √ | √ | √ | |||||||||||||
| (Z. Zhang, Chen, et al., | ||||||||||||||||
| (Ali et al., | √ | √ | ||||||||||||||
| (AlSheikh et al., | ||||||||||||||||
| (Anastasia & Budi | √ | |||||||||||||||
| (Baj-Rogowska, | √ | |||||||||||||||
| (Casas & Delmelle, | √ | |||||||||||||||
| (Dutta Das et al., | ||||||||||||||||
| (Kuflik et al., | √ | |||||||||||||||
| (Lu et al. | √ | |||||||||||||||
| (Luckner et al., | √ | |||||||||||||||
| (Pournarakis et al. | √ | √ | √ | |||||||||||||
| ( Salas et al., | √ | √ | ||||||||||||||
| (Saldana-Perez et al., | ||||||||||||||||
| (Septiana et al., | √ | |||||||||||||||
| (Serna et al., | √ | |||||||||||||||
| (Sinha et al.. | √ | √ | √ | |||||||||||||
| (Suma et al., | √ | |||||||||||||||
| (Thelwall, | √ | √ | √ | √ | √ | √ | ||||||||||
| (D. Wang et al., | √ | |||||||||||||||
| (L. Zhang et al., | ||||||||||||||||
| (Kim et al. | ||||||||||||||||
| (S. Chen et al., | √ | √ | ||||||||||||||
| (Gao et al., | √ | |||||||||||||||
| (Giancristofaro & Panangadan, | √ | |||||||||||||||
| (Hoang et al., | √ | √ | √ | √ | ||||||||||||
| (Itoh et al., | √ | √ | ||||||||||||||
| (Lacic et al., | ||||||||||||||||
| (Liyang et al., | √ | √ | √ | |||||||||||||
| (Tse et al., | √ | |||||||||||||||
| (Ulloa et al., | √ | √ | ||||||||||||||
| (Yang & Anwar, | √ | √ | √ | |||||||||||||
| (B. Zhang, Kotkov, et al., | √ | |||||||||||||||
| (Candelieri & Archetti, | √ | √ | √ | |||||||||||||
| (D'Andrea et al., | √ | |||||||||||||||
| (Fu et al., | √ | |||||||||||||||
| (Georgiou et al., | √ | √ | ||||||||||||||
| (Rahman et al., | ||||||||||||||||
| (X. Zhang et al., | ||||||||||||||||
| (Adeborna & Siau, | √ | |||||||||||||||
| (Candelieri & Archetti, | ||||||||||||||||
| (Cao et al. | √ | √ | √ | |||||||||||||
| (Carpenter et al., | √ | |||||||||||||||
| (Gal-Tzur et al., | √ | |||||||||||||||
| (Kumar et al., | √ | √ | ||||||||||||||
| (Liau & Tan, | √ | √ | √ | √ | √ | |||||||||||
| (Daly et al., | √ | √ | ||||||||||||||
| (Mostafa, | √ | √ | ||||||||||||||
Fig. 7The usage trends of the social media platforms
Used and generated lexicons analysis
| Literature | Used lexicon type | Used lexicon/ tool name | Generated lexicon type | Generation method |
|---|---|---|---|---|
| (Adeborna & Siau, | SL-General | Bing Liu | SL-Domain | Modifications to the lexicon by adding domain-dependent words from WordNet and tweets using Correlated Topics Models (CTM) with Variational Expectation–Maximization (VEM) based on Airline Quality Rating (AQR) criteria resulted in 4 lexicons concerning 4 topics related to airlines |
| (Ali et al., | SL-General | SentiWordNet | SL-Domain | SentiWordNet for scoring with Fuzzy-Ontology and Semantic web rule language (SWRL) for rule-based decision-making |
| (Baj-Rogowska, | SL-General | CAT | – | – |
| (Buch et al., | SL-General | SentiStrength | – | – |
| (Cao et al., | SL-General | HowNet | SL-Domain | Generating lexicon by expanding seed words for Chinese |
| (Carpenter et al., | SL-General Dic-General | MPQA WordNet | SL-Domain | Using WordNet to expand the seed words through their synonyms |
| (Fiarni et al., | – | – | SL-Domain | Generated using rule-based approach with NB considering negation |
| (Gal-Tzur et al., | – | – | Dic-Domain | Constructed using 35 documents from research articles, websites and forums by term frequency and specialists |
| (Gupta et al., | SL-General | Bing Liu, MPQA, AFINN, SentiWordNet | – | – |
| (Haghighi et al., | SL-General | Rsentiment Package | – | – |
| (Hosseini et al., | – | – | Dic-Domain | Using 2 Thesaurus, Transportation Research Thesaurus (TRT) and the Australian Transport Index Thesaurus |
| (Kaur & Balakrishnan, | SL-General | – | Used general lexicon with letter repetition, intensification, capitalization, negation and exclamation mark to calculate the sentiment of the words | |
| (Kulkarni et al., | SL-General | VADER | – | – |
| (K. Lee & Yu, | SL-General | AFINN | – | – |
| (Liau & Tan, | SL-General | Bing Liu | SL-General | The Malay lexicon was created manually |
| (Mostafa, | SL-General | Bing Liu | – | – |
| (Rybarczyk et al., | SL-General | ANEW | – | – |
| (Salas et al., | SL-General | SentiStrength, TensiStrength | – | – |
| (Saragih and Girsang, | SL-General | Bing Liu, library of sentiment for Indonesian words | SL-General | The Indonesian lexicon was generated by translating Bing Liu lexicon |
| (Sdoukopoulos et al., | SL-General | NodeXL | – | – |
| (Serna et al., | Dic-General | WordNet | SL-Domain | WordNet used to expand word related to transportation and then used the total rank of the text to define the positive and negative words |
| (Thelwall, | SL-General | TensiStrength | – | – |
| (Yang & Anwar, | SL-General | SentiWordNet | – | – |
| (L. Zhang et al., | SL-General Dic-General | -VADER -WordNet | SL-Domain | Define seed set manually then extend it through wordnet iteratively. They used VADER to assign a level of sentiment |
| (Daly et al., | – | OpenStreet Maps | Dic-Location | – |
| (Kuflik et al., | – | – | Dic-Transport | 35 transport documents were used: stakeholders’ Web sites (e.g. of taxi services, transport magazines, etc.); research articles and white papers, transport Web forums, blogs and SM accounts |
| (Saldana-Perez et al., | – | – | Dic-Traffic | Using specified classes and TF to build traffic-related dictionary from tweets |
| (Tse et al., | – | – | Dic-Pollution | Most frequent keywords in the related posts |
| (D. Wang et al., | Dic-General Dic-Location | Google twitter frequent words—British Telecommunications (BT) Dictionary | – | – |
Fig. 8Targets classification. OT = Online Transport, EV = Electric Vehicle, BRT = Bus Rapid Transit
Fig. 9The trended subjects around the world in the past decade
Fig. 10The trended subjects in the top locations