Literature DB >> 22174245

Detecting novel associations in large data sets.

David N Reshef1, Yakir A Reshef, Hilary K Finucane, Sharon R Grossman, Gilean McVean, Peter J Turnbaugh, Eric S Lander, Michael Mitzenmacher, Pardis C Sabeti.   

Abstract

Identifying interesting relationships between pairs of variables in large data sets is increasingly important. Here, we present a measure of dependence for two-variable relationships: the maximal information coefficient (MIC). MIC captures a wide range of associations both functional and not, and for functional relationships provides a score that roughly equals the coefficient of determination (R(2)) of the data relative to the regression function. MIC belongs to a larger class of maximal information-based nonparametric exploration (MINE) statistics for identifying and classifying relationships. We apply MIC and MINE to data sets in global health, gene expression, major-league baseball, and the human gut microbiota and identify known and novel relationships.

Entities:  

Mesh:

Year:  2011        PMID: 22174245      PMCID: PMC3325791          DOI: 10.1126/science.1205438

Source DB:  PubMed          Journal:  Science        ISSN: 0036-8075            Impact factor:   47.728


  15 in total

1.  The mutual information: detecting and evaluating dependencies between variables.

Authors:  R Steuer; J Kurths; C O Daub; J Weise; J Selbig
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

2.  Estimating mutual information.

Authors:  Alexander Kraskov; Harald Stögbauer; Peter Grassberger
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2004-06-23

3.  Logic of the yeast metabolic cycle: temporal compartmentalization of cellular processes.

Authors:  Benjamin P Tu; Andrzej Kudlicki; Maga Rowicka; Steven L McKnight
Journal:  Science       Date:  2005-10-27       Impact factor: 47.728

4.  The human microbiome project.

Authors:  Peter J Turnbaugh; Ruth E Ley; Micah Hamady; Claire M Fraser-Liggett; Rob Knight; Jeffrey I Gordon
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

5.  Dealing with data. Challenges and opportunities. Introduction.

Authors: 
Journal:  Science       Date:  2011-02-11       Impact factor: 47.728

6.  Maternal education and child health: is there a strong causal relationship?

Authors:  S Desai; S Alva
Journal:  Demography       Date:  1998-02

7.  How does multiple testing correction work?

Authors:  William S Noble
Journal:  Nat Biotechnol       Date:  2009-12       Impact factor: 54.908

8.  Influence of life stress on depression: moderation by a polymorphism in the 5-HTT gene.

Authors:  Avshalom Caspi; Karen Sugden; Terrie E Moffitt; Alan Taylor; Ian W Craig; HonaLee Harrington; Joseph McClay; Jonathan Mill; Judy Martin; Antony Braithwaite; Richie Poulton
Journal:  Science       Date:  2003-07-18       Impact factor: 47.728

9.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

10.  Evolution of mammals and their gut microbes.

Authors:  Ruth E Ley; Micah Hamady; Catherine Lozupone; Peter J Turnbaugh; Rob Roy Ramey; J Stephen Bircher; Michael L Schlegel; Tammy A Tucker; Mark D Schrenzel; Rob Knight; Jeffrey I Gordon
Journal:  Science       Date:  2008-05-22       Impact factor: 47.728

View more
  362 in total

1.  Finding correlations in big data.

Authors: 
Journal:  Nat Biotechnol       Date:  2012-04-10       Impact factor: 54.908

2.  Use of big data in drug development for precision medicine.

Authors:  Rosa S Kim; Nicolas Goossens; Yujin Hoshida
Journal:  Expert Rev Precis Med Drug Dev       Date:  2016-04-28

Review 3.  A system biology approach to identify regulatory pathways underlying the neuroendocrine control of female puberty in rats and nonhuman primates.

Authors:  Alejandro Lomniczi; Hollis Wright; Juan Manuel Castellano; Kemal Sonmez; Sergio R Ojeda
Journal:  Horm Behav       Date:  2013-07       Impact factor: 3.587

4.  Microbial determinants of biochemical individuality and their impact on toxicology and pharmacology.

Authors:  Andrew D Patterson; Peter J Turnbaugh
Journal:  Cell Metab       Date:  2014-08-21       Impact factor: 27.287

5.  Exploratory Analysis in Time-Varying Data Sets: a Healthcare Network Application.

Authors:  Narine Manukyan; Margaret J Eppstein; Jeffrey D Horbar; Kathleen A Leahy; Michael J Kenny; Shreya Mukherjee; Donna M Rizzo
Journal:  Int J Adv Comput Sci       Date:  2013-07

6.  Genome-scale analysis of interaction dynamics reveals organization of biological networks.

Authors:  Jishnu Das; Jaaved Mohammed; Haiyuan Yu
Journal:  Bioinformatics       Date:  2012-05-09       Impact factor: 6.937

7.  Discovering and deciphering relationships across disparate data modalities.

Authors:  Joshua T Vogelstein; Eric W Bridgeford; Qing Wang; Carey E Priebe; Mauro Maggioni; Cencheng Shen
Journal:  Elife       Date:  2019-01-15       Impact factor: 8.140

8.  Machine learning-based differential network analysis: a study of stress-responsive transcriptomes in Arabidopsis.

Authors:  Chuang Ma; Mingming Xin; Kenneth A Feldmann; Xiangfeng Wang
Journal:  Plant Cell       Date:  2014-02-11       Impact factor: 11.277

9.  Assessing the similarity of ligand binding conformations with the Contact Mode Score.

Authors:  Yun Ding; Ye Fang; Juana Moreno; J Ramanujam; Mark Jarrell; Michal Brylinski
Journal:  Comput Biol Chem       Date:  2016-09-06       Impact factor: 2.877

Review 10.  Time domain measures of inter-channel EEG correlations: a comparison of linear, nonparametric and nonlinear measures.

Authors:  J D Bonita; L C C Ambolode; B M Rosenberg; C J Cellucci; T A A Watanabe; P E Rapp; A M Albano
Journal:  Cogn Neurodyn       Date:  2013-09-04       Impact factor: 5.082

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.