| Literature DB >> 31783539 |
Antonio Candelieri1, Stanislav Fedorov1, Enza Messina1.
Abstract
This paper presents an efficient approach for subsequence search in data streams. The problem consists of identifying coherent repetitions of a given reference time-series, also in the multivariate case, within a longer data stream. The most widely adopted metric to address this problem is Dynamic Time Warping (DTW), but its computational complexity is a well-known issue. In this paper, we present an approach aimed at learning a kernel approximating DTW for efficiently analyzing streaming data collected from wearable sensors, while reducing the burden of DTW computation. Contrary to kernel, DTW allows for comparing two time-series with different length. To enable the use of kernel for comparing two time-series with different length, a feature embedding is required in order to obtain a fixed length vector representation. Each vector component is the DTW between the given time-series and a set of "basis" series, randomly chosen. The approach has been validated on two benchmark datasets and on a real-life application for supporting self-rehabilitation in elderly subjects has been addressed. A comparison with traditional DTW implementations and other state-of-the-art algorithms is provided: results show a slight decrease in accuracy, which is counterbalanced by a significant reduction in computational costs.Entities:
Keywords: data stream analysis; dynamic time warping; kernel learning; pattern query; subsequence search
Year: 2019 PMID: 31783539 PMCID: PMC6928775 DOI: 10.3390/s19235192
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1An illustration of accumulated cost matrix and associated optimal warping path when using DTW to align the two time-series in the picture.
Figure 2Illustration of the parallel computation of the components of the vector . Time-series in the figure are assumed multivariate.
Figure 3The first 10 best-matching subsequences were identified by four different algorithms on the Experiment 1. The reference pattern consists of 100 acceleration data points collected when a Sony AIBO robot was walking on a carpet. The data stream consists of 5000 data points when the same dog was walking on cement, then followed by 3000 walking on a carpet and then again on cement.
Figure 4First 25 best matching subsequences identified by four different algorithms on the Experiment 1. Reference pattern consists of 100 acceleration data points collected when a Sony AIBO robot was walking on a carpet. The data stream consists of 5000 data points when the same dog was walking on cement, followed by 3000 walking on a carpet and then again on cement.
Figure 5Distance between each pair of reference and data stream, respectively, for DTW-based sequence search (on the left) and kernel-based DTW approximation (on the right). The brighter the colour the lower the distance.
DTW-based subsequence search vs. kernel-based DTW approximation for subsequence search.
| Coherent Visually | ||
|---|---|---|
| Exercise | DTW-Based Subsequence Search | Kernel-Based DTW Approximation |
| Flexo-extension of the | Yes | Yes |
| Yes | Yes | |
| Yes | Yes | |
| No | Yes | |
| No | Yes | |
| Light squat, | Yes | No |
| Yes | No | |
| Yes | Yes | |
| Yes | - | |
| Yes | - | |
| Back extension of the | Yes | Yes |
| Yes | Yes | |
| Yes | Yes | |
| No | Yes | |
| No | Yes | |
| Rotate the torso, | Yes | Yes |
| Yes | Yes | |
| Yes | - | |
| Yes | - | |
| No | - | |
| Raise and lower the arms, | Yes | Yes |
| Yes | Yes | |
| Yes | Yes | |
| Yes | Yes | |
| Yes | Yes | |
| No | Yes | |
| Yes | Yes | |
| No | Yes | |
| Yes | - | |
| Yes | - | |