Equitability, mutual information, and the maximal information coefficient.

Justin B Kinney, Gurinder S Atwal.

Abstract

How should one quantify the strength of association between two random variables without bias for relationships of a specific form? Despite its conceptual simplicity, this notion of statistical "equitability" has yet to receive a definitive mathematical formalization. Here we argue that equitability is properly formalized by a self-consistency condition closely related to the Data Processing Inequality. Mutual information, a fundamental quantity in information theory, is shown to satisfy this equitability criterion. These findings are at odds with the recent work of Reshef et al. [Reshef DN, et al. (2011) Science 334(6062):1518-1524], which proposed an alternative definition of equitability and introduced a new statistic, the "maximal information coefficient" (MIC), said to satisfy equitability in contradistinction to mutual information. These conclusions, however, were supported only with limited simulation evidence, not with mathematical arguments. Upon revisiting these claims, we prove that the mathematical definition of equitability proposed by Reshef et al. cannot be satisfied by any (nontrivial) dependence measure. We also identify artifacts in the reported simulation evidence. When these artifacts are removed, estimates of mutual information are found to be more equitable than estimates of MIC. Mutual information is also observed to have consistently higher statistical power than MIC. We conclude that estimating mutual information provides a natural (and often practical) way to equitably quantify statistical associations in large datasets.
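
Illustrative note (not part of the published abstract): the Data Processing Inequality mentioned above says that applying a deterministic map f to X cannot increase mutual information, I[f(X);Y] <= I[X;Y], with equality when Y depends on X only through f(X). The minimal Python sketch below checks this on an exactly specified discrete joint distribution. The probability table, the function names `mutual_information` and `coarsen_rows`, and the two maps are hypothetical choices made for illustration. The sketch works with exact probabilities, not estimates from data, and it does not reproduce the authors' estimators or simulations.

```python
from math import log2


def mutual_information(joint):
    """Return I[X;Y] in bits for a joint probability table (list of rows)."""
    total = sum(sum(row) for row in joint)
    px = [sum(row) / total for row in joint]        # marginal of X (rows)
    py = [sum(col) / total for col in zip(*joint)]  # marginal of Y (columns)
    mi = 0.0
    for i, row in enumerate(joint):
        for j, value in enumerate(row):
            p = value / total
            if p > 0:  # zero-probability cells contribute nothing
                mi += p * log2(p / (px[i] * py[j]))
    return mi


def coarsen_rows(joint, labels):
    """Apply a deterministic map f to X by summing rows that share a label."""
    merged = [[0.0] * len(joint[0]) for _ in range(max(labels) + 1)]
    for row, label in zip(joint, labels):
        for j, value in enumerate(row):
            merged[label][j] += value
    return merged


# Hypothetical joint distribution: X takes 4 values (rows), Y is binary (columns).
# By construction, Y depends on X only through f(X) = X // 2.
joint = [
    [0.025, 0.225],
    [0.025, 0.225],
    [0.200, 0.050],
    [0.200, 0.050],
]

i_full = mutual_information(joint)                          # I[X;Y]
i_f = mutual_information(coarsen_rows(joint, [0, 0, 1, 1]))  # f(X) = X // 2
i_g = mutual_information(coarsen_rows(joint, [0, 1, 0, 1]))  # g(X) = X % 2

print(i_full, i_f, i_g)
```

In this example the map f keeps everything about X that is relevant to Y, so I[f(X);Y] matches I[X;Y]; the map g discards it, so I[g(X);Y] drops to zero. Both outcomes are consistent with the inequality.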

Year:  2014        PMID: 24550517      PMCID: PMC3948249          DOI: 10.1073/pnas.1309933111

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  17 in total

1.  Estimating mutual information.

Authors:  Alexander Kraskov; Harald Stögbauer; Peter Grassberger
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2004-06-23

2.  Analyzing neural responses to natural signals: maximally informative dimensions.

Authors:  Tatyana Sharpee; Nicole C Rust; William Bialek
Journal:  Neural Comput       Date:  2004-02       Impact factor: 2.026

3.  Mathematics. A correlation for the 21st century.

Authors:  Terry Speed
Journal:  Science       Date:  2011-12-16       Impact factor: 47.728

4.  Finding correlations in big data.

Authors: 
Journal:  Nat Biotechnol       Date:  2012-04-10       Impact factor: 54.908

5.  Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence.

Authors:  Justin B Kinney; Anand Murugan; Curtis G Callan; Edward C Cox
Journal:  Proc Natl Acad Sci U S A       Date:  2010-05-03       Impact factor: 11.205

6.  A universal framework for regulatory element discovery across all genomes and data types.

Authors:  Olivier Elemento; Noam Slonim; Saeed Tavazoie
Journal:  Mol Cell       Date:  2007-10-26       Impact factor: 17.970

7.  Relative performance of mutual information estimation methods for quantifying the dependence among short and noisy data.

Authors:  Shiraj Khan; Sharba Bandyopadhyay; Auroop R Ganguly; Sunil Saigal; David J Erickson; Vladimir Protopopescu; George Ostrouchov
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2007-08-14

8.  Minerva and minepy: a C engine for the MINE suite and its R, Python and MATLAB wrappers.

Authors:  Davide Albanese; Michele Filosi; Roberto Visintainer; Samantha Riccadonna; Giuseppe Jurman; Cesare Furlanello
Journal:  Bioinformatics       Date:  2012-12-14       Impact factor: 6.937

9.  Detecting novel associations in large data sets.

Authors:  David N Reshef; Yakir A Reshef; Hilary K Finucane; Sharon R Grossman; Gilean McVean; Peter J Turnbaugh; Eric S Lander; Michael Mitzenmacher; Pardis C Sabeti
Journal:  Science       Date:  2011-12-16       Impact factor: 47.728

10.  Systematic discovery of structural elements governing stability of mammalian messenger RNAs.

Authors:  Hani Goodarzi; Hamed S Najafabadi; Panos Oikonomou; Todd M Greco; Lisa Fish; Reza Salavati; Ileana M Cristea; Saeed Tavazoie
Journal:  Nature       Date:  2012-04-08       Impact factor: 49.962

  70 in total

1.  Putting things in order.

Authors:  Ning Sun; Hongyu Zhao
Journal:  Proc Natl Acad Sci U S A       Date:  2014-11-07       Impact factor: 11.205

2.  Limitations to Estimating Mutual Information in Large Neural Populations.

Authors:  Jan Mölter; Geoffrey J Goodhill
Journal:  Entropy (Basel)       Date:  2020-04-24       Impact factor: 2.524

3.  Jackknife approach to the estimation of mutual information.

Authors:  Xianli Zeng; Yingcun Xia; Howell Tong
Journal:  Proc Natl Acad Sci U S A       Date:  2018-09-17       Impact factor: 11.205

4.  Reply to Murrell et al.: Noise matters.

Authors:  Justin B Kinney; Gurinder S Atwal
Journal:  Proc Natl Acad Sci U S A       Date:  2014-05-27       Impact factor: 11.205

5.  Cleaning up the record on the maximal information coefficient and equitability.

Authors:  David N Reshef; Yakir A Reshef; Michael Mitzenmacher; Pardis C Sabeti
Journal:  Proc Natl Acad Sci U S A       Date:  2014-08-19       Impact factor: 11.205

6.  R2-equitability is satisfiable.

Authors:  Ben Murrell; Daniel Murrell; Hugh Murrell
Journal:  Proc Natl Acad Sci U S A       Date:  2014-04-29       Impact factor: 11.205

7.  Gene coexpression measures in large heterogeneous samples using count statistics.

Authors:  Y X Rachel Wang; Michael S Waterman; Haiyan Huang
Journal:  Proc Natl Acad Sci U S A       Date:  2014-10-06       Impact factor: 11.205

8.  Reply to Reshef et al.: Falsifiability or bust.

Authors:  Justin B Kinney; Gurinder S Atwal
Journal:  Proc Natl Acad Sci U S A       Date:  2014-08-19       Impact factor: 11.205

9.  Epileptic foci localization based on mapping the synchronization of dynamic brain network.

Authors:  Tian Mei; Xiaoyan Wei; Ziyi Chen; Xianghua Tian; Nan Dong; Dongmei Li; Yi Zhou
Journal:  BMC Med Inform Decis Mak       Date:  2019-01-31       Impact factor: 2.796

10.  Assessing the similarity of ligand binding conformations with the Contact Mode Score.

Authors:  Yun Ding; Ye Fang; Juana Moreno; J Ramanujam; Mark Jarrell; Michal Brylinski
Journal:  Comput Biol Chem       Date:  2016-09-06       Impact factor: 2.877
