
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data.

David Hallac, Sagar Vare, Stephen Boyd, Jure Leskovec.

Abstract

Subsequence clustering of multivariate time series is a useful tool for discovering repeated patterns in temporal data. Once these patterns have been discovered, seemingly complicated datasets can be interpreted as a temporal sequence of only a small number of states, or clusters. For example, raw sensor data from a fitness-tracking application can be expressed as a timeline of a select few actions (e.g., walking, sitting, running). However, discovering these patterns is challenging because it requires simultaneous segmentation and clustering of the time series. Furthermore, interpreting the resulting clusters is difficult, especially when the data is high-dimensional. Here we propose a new method of model-based clustering, which we call Toeplitz Inverse Covariance-based Clustering (TICC). Each cluster in the TICC method is defined by a correlation network, or Markov random field (MRF), characterizing the interdependencies between different observations in a typical subsequence of that cluster. Based on this graphical representation, TICC simultaneously segments and clusters the time series data. We solve the TICC problem through alternating minimization, using a variation of the expectation-maximization (EM) algorithm. We derive closed-form solutions to efficiently solve the two resulting subproblems in a scalable way, through dynamic programming and the alternating direction method of multipliers (ADMM), respectively. We validate our approach by comparing TICC to several state-of-the-art baselines in a series of synthetic experiments, and we then demonstrate on an automobile sensor dataset how TICC can be used to learn interpretable clusters in real-world scenarios.
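The abstract names dynamic programming as the tool for the cluster-assignment subproblem but does not give details. As a rough illustration only, the sketch below assumes that each time step already has a per-cluster cost (for instance a negative log-likelihood under that cluster's model) and that a fixed penalty is charged whenever two consecutive time steps are assigned to different clusters. The function name `assign_segments`, the `cost` layout, and the `beta` parameter are hypothetical choices for this example and are not taken from the record above.

```python
def assign_segments(cost, beta):
    """Pick one cluster per time step by dynamic programming.

    cost[t][k] is an assumed, precomputed cost of putting time step t
    in cluster k; beta >= 0 is an assumed penalty paid each time the
    cluster changes between consecutive steps. Returns a list of
    cluster indices minimizing total cost plus switching penalties.
    """
    T, K = len(cost), len(cost[0])
    best = list(cost[0])   # best[k]: cheapest path ending in cluster k
    back = []              # back[t-1][k]: predecessor cluster at step t-1

    for t in range(1, T):
        # Switching from any other cluster costs the same beta, so only
        # the cheapest previous cluster can be the best switch source.
        j_min = min(range(K), key=lambda j: best[j])
        new_best, pointers = [], []
        for k in range(K):
            stay = best[k]
            switch = best[j_min] + beta
            if stay <= switch:
                new_best.append(stay + cost[t][k])
                pointers.append(k)
            else:
                new_best.append(switch + cost[t][k])
                pointers.append(j_min)
        best = new_best
        back.append(pointers)

    # Trace the cheapest path backwards from the final time step.
    k = min(range(K), key=lambda j: best[j])
    path = [k]
    for pointers in reversed(back):
        k = pointers[k]
        path.append(k)
    path.reverse()
    return path


# Toy costs: step 2 briefly favors cluster 1, steps 4-5 clearly do.
example_cost = [
    [0.1, 1.0],
    [0.2, 1.0],
    [0.9, 0.4],
    [0.1, 1.0],
    [1.0, 0.1],
    [1.0, 0.2],
]
print(assign_segments(example_cost, beta=0.0))  # [0, 0, 1, 0, 1, 1]
print(assign_segments(example_cost, beta=1.0))  # [0, 0, 0, 0, 1, 1]
```

With no penalty each step simply takes its cheapest cluster, so the brief blip at step 2 produces an isolated segment. With a penalty of 1.0 the blip is absorbed into the surrounding segment, which is the kind of joint segmentation and clustering behavior the abstract describes. The running time of this sketch is proportional to the number of time steps times the number of clusters.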


Year:  2017        PMID: 29770257      PMCID: PMC5951184          DOI: 10.1145/3097983.3098060

Source DB:  PubMed          Journal:  KDD        ISSN: 2154-817X


