| Literature DB >> 34988314 |
Abstract
Intelligent Transportation Systems (ITS) is not a new concept. Notably, ITS has been cited in various journal articles and proceedings papers around the world, and it has become increasingly popular. Additionally, ITS involves multidisciplinary science. The growing number of journal articles makes ITS reviews complicated, and research gaps can be difficult to identify. The existing software for systematic reviews still relies on highly laborious tasks, manual reading, and a homogeneous dataset of research articles. This study proposes a framework that can address these issues, return a comprehensive systematic review of ITS, and promote efficient systematic reviews. The proposed framework consists of Natural Language Processing (NLP) methods i.e., Named Entity Recognition (NER), Latent Dirichlet Allocation (LDA), and word embedding (continuous skip-gram). It enables this study to explore the context of research articles and their overall interpretation to determine and define the directions of knowledge growth and ITS development. The framework can systematically separate unrelated documents and simplify the review process for large dataset. To our knowledge, compared to prior research regarding systematic review of ITS, this study offers more thorough review.Entities:
Keywords: Continuous skip-gram; Custom named entity recognition; Intelligent transportation system; Latent dirichlet allocation; Natural language processing; Systematic review; Word embedding
Year: 2021 PMID: 34988314 PMCID: PMC8695271 DOI: 10.1016/j.heliyon.2021.e08615
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Figure 1The number of ITS publications from 1971–2020.
Figure 2The proposed framework.
Text preprocessing results (Example).
| Before | After | |
|---|---|---|
| Title | Abstract | Text Cleaned |
| Building Highway Systems With Computer Graphic Simulations. | A number of computer applications generate perspective drawings of proposed highway designs as a means of design evaluation. The use of this capability is rare in the United States …. technology are available. © 1974, IEEE All rights reserved. | [build highway systems computer graphic simulations number computer applications generate perspective draw propose highway design mean design evaluation use capability rare unite state …. available] |
Custom NER results (Example).
| Context | Named Entity | Label |
|---|---|---|
| implementation area traffic control surveillance system silicon emergence also technology new opportunities become available real time process traffic parameters objective paper consider several design issue implementation traffic monitor surveillance system silicon. | [('area traffic control', 15, 35, 'B_ITS′), ('traffic monitor surveillance system', 217, 252, 'B_ITS′)] | True |
| mk buy leave profitable alone centennial engineer specialize design construction management firm base armada colo found originally staff persons specialize transportation sixteen years later firm employees morrison knudsen corp purchase retain previous employees | [('sixteen years', 171, 184, 'DATE'), ('morrison knudsen corp', 206, 227, 'ORG')] | False |
Figure 3Best topic number (k).
Topic groups.
| Topic number | Bag of words | Interpretation |
|---|---|---|
| 1 | Traffic, vehicle, control, vehicles, drive, road, safety, driver, speed, signal | Traffic control and safety |
| 2 | Information, transport, service, management, data, public, development, provide, technology, research | ITS in general and public/urban transport issue |
| 3 | Detection, vehicle, propose, image, method, road, position, track, algorithm, feature | The detection system in ITS |
| 4 | Energy, control, design, model, power, railway, train, cost, electric, monitor | Emissions reduction and electric vehicles, energy resource substitutions, and intelligent systems for the train system |
| 5 | Network, communication, vehicular, vehicles, vehicle, propose, wireless, data, applications, service | Vehicle communication system |
| 6 | Traffic, model, time, data, network, propose, flow, algorithm, prediction, travel | Traffic optimization, traffic flow prediction, traffic networks and model, and traffic data |
Intrinsic evaluation results.
| Topic number | Pair of words | Results | Skip-Gram Hyperparameter |
|---|---|---|---|
| 1 | safe – centralize + dangerous | decentralize | min_count = 10, window = 2, embedding size = 50, sample = 6.10−5, and negative = 40, epoch = 30 |
| 2 | vehicle – road + passengers | bus | min_count = 5, window = 2, embedding size = 50, sample = 6.10−5, alpha = 0.01, and negative = 40, epoch = 30 |
| 3 | recognition – detection + detect | recognize | min_count = 20, window = 2, embedding size = 100, sample = 6.10−5, alpha = 0,01, min_alpha = 0,0007, and negative = 40, epoch = 30 |
| 4 | ∗/The study found that the articles in this topic cluster were less homogeneous. Therefore, we decided not to apply word embedding for this topic cluster. More explained in the section 'Topic Four: Emissions reduction and electric vehicles, energy resource substitutions, and intelligent system for the train system' | ||
| 5 | intelligent_transportation – system + communication | systems | min_count = 30, window = 2, embedding size = 50, sample = 6.10−5, alpha = 0,01, min_alpha = 0,0007, and negative = 40, epoch = 30 |
| 6 | advance_traveler - systems_ais + intelligent_transportation | systems | min_count = 30, window = 2, embedding size = 100, sample = 6.10−5, alpha = 0,01, min_alpha = 0,0007, and negative = 50, epoch = 40 |
Figure 4Topic distributions in documents according to the publication year.
Figure 5Subtopic distributions for topic one according to publication year.
Figure 6Subtopic distributions for topic three according to publication year.
Inaccurate results based on custom NER.
| Context | POS | Label | Theta |
|---|---|---|---|
| automatic guidance mobile robots two way | [('two', 33, 36, 'CARDINAL'), ('traffic method control', 41, 63, 'B_ITS′), ('two', 769, 772, 'CARDINAL')] | True | Topic 1 = 0,4 |