A Bayesian Alternative to Mutual Information for the Hierarchical Clustering of Dependent Random Variables.

Guillaume Marrelec, Arnaud Messé, Pierre Bellec.

Abstract

The use of mutual information as a similarity measure in agglomerative hierarchical clustering (AHC) raises an important issue: some correction needs to be applied for the dimensionality of variables. In this work, we formulate the decision of merging dependent multivariate normal variables in an AHC procedure as a Bayesian model comparison. We found that the Bayesian formulation naturally shrinks the empirical covariance matrix towards a matrix set a priori (e.g., the identity), provides an automated stopping rule, and corrects for dimensionality using a term that scales up the measure as a function of the dimensionality of the variables. Also, the resulting log Bayes factor is asymptotically proportional to the plug-in estimate of mutual information, with an additive correction for dimensionality in agreement with the Bayesian information criterion. We investigated the behavior of these Bayesian alternatives (in exact and asymptotic forms) to mutual information on simulated and real data. An encouraging result was first derived on simulations: the hierarchical clustering based on the log Bayes factor outperformed off-the-shelf clustering techniques as well as raw and normalized mutual information in terms of classification accuracy. On a toy example, we found that the Bayesian approaches led to results that were similar to those of mutual information clustering techniques, with the advantage of an automated thresholding. On real functional magnetic resonance imaging (fMRI) datasets measuring brain activity, the Bayesian approaches identified clusters consistent with the established outcome of standard procedures. In this application, normalized mutual information had a highly atypical behavior, in the sense that it systematically favored very large clusters. These initial experiments suggest that the proposed Bayesian alternatives to mutual information are a useful new tool for hierarchical clustering.
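
The asymptotic relationship mentioned in the abstract can be illustrated with a minimal sketch. It assumes multivariate normal data and uses the standard BIC argument: the sample size times the plug-in mutual information between two blocks of variables, minus half the number of extra parameters (the cross-covariance entries) times the log of the sample size. The function names `gaussian_mi` and `bic_merge_score`, and the "merge when the score is positive" reading of the stopping rule, are assumptions made for illustration. The exact log Bayes factor of the paper also involves a prior on the covariance matrix, which is not reproduced here.

```python
import numpy as np


def gaussian_mi(data, idx_x, idx_y):
    """Plug-in mutual information (in nats) between two blocks of columns,
    assuming the columns of `data` are jointly Gaussian."""
    idx_x, idx_y = list(idx_x), list(idx_y)
    cov = np.cov(data, rowvar=False)  # empirical covariance, columns = variables
    joint = idx_x + idx_y
    # slogdet returns (sign, log|det|); only the log-determinant is needed
    _, logdet_x = np.linalg.slogdet(cov[np.ix_(idx_x, idx_x)])
    _, logdet_y = np.linalg.slogdet(cov[np.ix_(idx_y, idx_y)])
    _, logdet_xy = np.linalg.slogdet(cov[np.ix_(joint, joint)])
    return 0.5 * (logdet_x + logdet_y - logdet_xy)


def bic_merge_score(data, idx_x, idx_y):
    """BIC-style approximation of a log Bayes factor for merging two blocks:
    N * MI minus a penalty of (dim_x * dim_y / 2) * ln(N).
    This is an illustrative assumption, not the paper's exact expression."""
    n = data.shape[0]
    penalty = 0.5 * len(idx_x) * len(idx_y) * np.log(n)
    return n * gaussian_mi(data, idx_x, idx_y) - penalty


# Toy data: columns 0 and 1 share a common signal, column 2 is independent noise.
rng = np.random.default_rng(0)
shared = rng.standard_normal(500)
data = np.column_stack([
    shared + 0.1 * rng.standard_normal(500),
    shared + 0.1 * rng.standard_normal(500),
    rng.standard_normal(500),
])

score_dependent = bic_merge_score(data, [0], [1])
score_independent = bic_merge_score(data, [0], [2])
print("dependent pair:", score_dependent)
print("independent pair:", score_independent)
```

Under this reading, a pair of clusters would be merged only while the score stays positive, which is one way to picture the automated stopping rule. The penalty grows with the product of the two block dimensions, so larger clusters need proportionally more evidence of dependence before they are merged.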

Year: 2015; PMID: 26406245; PMCID: PMC4583305; DOI: 10.1371/journal.pone.0137278

Source DB: PubMed; Journal: PLoS One; ISSN: 1932-6203; Impact factor: 3.240


