Data smashing: uncovering lurking order in data.

Ishanu Chattopadhyay, Hod Lipson.   

Abstract

From automatic speech recognition to discovering unusual stars, underlying almost all automated discovery tasks is the ability to compare and contrast data streams with each other, to identify connections and spot outliers. Despite the prevalence of data, however, automated methods are not keeping pace. A key bottleneck is that most data comparison algorithms today rely on a human expert to specify what 'features' of the data are relevant for comparison. Here, we propose a new principle for estimating the similarity between the sources of arbitrary data streams, using neither domain knowledge nor learning. We demonstrate the application of this principle to the analysis of data from a number of real-world challenging problems, including the disambiguation of electro-encephalograph patterns pertaining to epileptic seizures, detection of anomalous cardiac activity from heart sound recordings and classification of astronomical objects from raw photometry. In all these cases and without access to any domain knowledge, we demonstrate performance on a par with the accuracy achieved by specialized algorithms and heuristics devised by domain experts. We suggest that data smashing principles may open the door to understanding increasingly complex observations, especially when experts do not know what to look for.

Year:  2014        PMID: 25401180      PMCID: PMC4223903          DOI: 10.1098/rsif.2014.0826

Source DB:  PubMed          Journal:  J R Soc Interface        ISSN: 1742-5662            Impact factor:   4.118


  3 in total

1.  Cardiac Comorbidity Risk Score: Zero-Burden Machine Learning to Improve Prediction of Postoperative Major Adverse Cardiac Events in Hip and Knee Arthroplasty.

Authors:  Dmytro Onishchenko; Daniel S Rubin; James R van Horne; R Parker Ward; Ishanu Chattopadhyay
Journal:  J Am Heart Assoc       Date:  2022-07-29       Impact factor: 6.106

2.  Feature identification in time series data sets.

Authors:  Justin Shaw; Marek Stastna; Aaron Coutino; Ryan K Walter; Eduard Reinhardt
Journal:  Heliyon       Date:  2019-05-23

3.  Reduced false positives in autism screening via digital biomarkers inferred from deep comorbidity patterns.

Authors:  Dmytro Onishchenko; Yi Huang; James van Horne; Peter J Smith; Michael E Msall; Ishanu Chattopadhyay
Journal:  Sci Adv       Date:  2021-10-06       Impact factor: 14.136
