Literature DB >> 32153346

A disk-aware algorithm for time series motif discovery.

Abdullah Mueen1, Eamonn Keogh1, Qiang Zhu1, Sydney S Cash2, M Brandon Westover3, Nima Bigdely-Shamlo4.   

Abstract

Time series motifs are sets of very similar subsequences of a long time series. They are of interest in their own right, and are also used as inputs in several higher-level data mining algorithms including classification, clustering, rule-discovery and summarization. In spite of extensive research in recent years, finding time series motifs exactly in massive databases is an open problem. Previous efforts either found approximate motifs or considered relatively small datasets residing in main memory. In this work, we leverage off previous work on pivot-based indexing to introduce a disk-aware algorithm to find time series motifs exactly in multi-gigabyte databases which contain on the order of tens of millions of time series. We have evaluated our algorithm on datasets from diverse areas including medicine, anthropology, computer networking and image processing and show that we can find interesting and meaningful motifs in datasets that are many orders of magnitude larger than anything considered before.

Entities:  

Keywords:  Bottom-up search; Pruning; Random references; Time series motifs

Year:  2010        PMID: 32153346      PMCID: PMC7062370          DOI: 10.1007/s10618-010-0176-8

Source DB:  PubMed          Journal:  Data Min Knowl Discov        ISSN: 1384-5810            Impact factor:   3.670


  9 in total

1.  PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals.

Authors:  A L Goldberger; L A Amaral; L Glass; J M Hausdorff; P C Ivanov; R G Mark; J E Mietus; G B Moody; C K Peng; H E Stanley
Journal:  Circulation       Date:  2000-06-13       Impact factor: 29.690

2.  EEG changes accompanying learned regulation of 12-Hz EEG activity.

Authors:  Arnaud Delorme; Scott Makeig
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2003-06       Impact factor: 3.802

3.  Learning recurrent behaviors from heterogeneous multivariate time-series.

Authors:  Florence Duchêne; Catherine Garbay; Vincent Rialle
Journal:  Artif Intell Med       Date:  2006-08-28       Impact factor: 5.326

4.  Knowledge construction from time series data using a collaborative exploration system.

Authors:  Thomas Guyet; Catherine Garbay; Michel Dojat
Journal:  J Biomed Inform       Date:  2007-10-09       Impact factor: 6.317

5.  Effective proximity retrieval by ordering permutations.

Authors:  Edgar Chavez; Karina Figueroa; Gonzalo Navarro
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2008-09       Impact factor: 6.226

6.  Independent component analysis using an extended infomax algorithm for mixed subgaussian and supergaussian sources.

Authors:  T W Lee; M Girolami; T J Sejnowski
Journal:  Neural Comput       Date:  1999-02-15       Impact factor: 2.026

7.  Functional uncoupling of hemodynamic from neuronal response by inhibition of neuronal nitric oxide synthase.

Authors:  Bojana Stefanovic; Wolfram Schwindt; Mathias Hoehn; Afonso C Silva
Journal:  J Cereb Blood Flow Metab       Date:  2006-08-02       Impact factor: 6.200

8.  Brain activity-based image classification from rapid serial visual presentation.

Authors:  Nima Bigdely-Shamlo; Andrey Vankov; Rey R Ramirez; Scott Makeig
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2008-10       Impact factor: 3.802

9.  80 million tiny images: a large data set for nonparametric object and scene recognition.

Authors:  Antonio Torralba; Rob Fergus; William T Freeman
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2008-11       Impact factor: 6.226

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.