| Literature DB >> 28272315 |
Yingchi Mao1, Haishi Zhong2, Xianjian Xiao3, Xiaofang Li4.
Abstract
With the rapid spread of built-in GPS handheld smart devices, the trajectory data from GPS sensors has grown explosively. Trajectory data has spatio-temporal characteristics and rich information. Using trajectory data processing techniques can mine the patterns of human activities and the moving patterns of vehicles in the intelligent transportation systems. A trajectory similarity measure is one of the most important issues in trajectory data mining (clustering, classification, frequent pattern mining, etc.). Unfortunately, the main similarity measure algorithms with the trajectory data have been found to be inaccurate, highly sensitive of sampling methods, and have low robustness for the noise data. To solve the above problems, three distances and their corresponding computation methods are proposed in this paper. The point-segment distance can decrease the sensitivity of the point sampling methods. The prediction distance optimizes the temporal distance with the features of trajectory data. The segment-segment distance introduces the trajectory shape factor into the similarity measurement to improve the accuracy. The three kinds of distance are integrated with the traditional dynamic time warping algorithm (DTW) algorithm to propose a new segment-based dynamic time warping algorithm (SDTW). The experimental results show that the SDTW algorithm can exhibit about 57%, 86%, and 31% better accuracy than the longest common subsequence algorithm (LCSS), and edit distance on real sequence algorithm (EDR) , and DTW, respectively, and that the sensitivity to the noise data is lower than that those algorithms.Entities:
Keywords: GPS sensor; GPS trajectory; spatial-temporal data; trajectory similarity measure
Year: 2017 PMID: 28272315 PMCID: PMC5375810 DOI: 10.3390/s17030524
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Various point sampling methods of the same trajectory. (a) A trajectory of two sampling methods; and (b) two trajectories take the same point.
Figure 2A sub-trajectory segment and a natural sub-trajectory segment.
Figure 4Two computation methods of the proxy sub-trajectory distance. (a) The area enclosed by the two proxy trajectories; and (b) a simplified calculation method.
Figure 5Point-segment distance.
Figure 6Prediction distance.
Figure 7Segment-Segment Distance.
Figure 8The sensitivity analysis about the value of .
Figure 9Labomni dataset (a) and Cross dataset (b).
Figure 10Query results of trajectory sequences with the SDTW and DTW algorithms. (a) The original query trajectory; (b) Query results of trajectory sequences with the SDTW; (c) Query results of trajectory sequences with the DTW algorithms.
Figure 11Comparison of Clustering Error rates. (a) Clustering error rates based on the Cross dataset; and (b) clustering error rates based on the Labomni dataset.
Figure 12Comparison of noise effect on the algorithms. (a) Clustering error rates based on the Cross dataset; and (b) clustering error rates based on the Labomni dataset.
Error rate with various values.
| AHC | 0.061 | 0.019 | 0.013 | 0.006 | 0.006 | 0.006 | 0.006 | 0.007 | 0.008 | 0.010 | 0.010 |
| 0.217 | 0.061 | 0.009 | 0.009 | 0.008 | 0.009 | 0.01 | 0.01 | 0.01 | 0.011 | 0.011 |
Figure 13ER with various values.