Literature DB >> 33777285

A framework for stability-based module detection in correlation graphs.

Mingmei Tian1, Rachael Hageman Blair1, Lina Mu2, Matthew Bonner2, Richard Browne3, Han Yu4.   

Abstract

Graphs can be used to represent the direct and indirect relationships between variables, and elucidate complex relationships and interdependencies. Detecting structure within a graph is a challenging problem. This problem is studied over a range of fields and is sometimes termed community detection, module detection, or graph partitioning. A popular class of algorithms for module detection relies on optimizing a function of modularity to identify the structure. In practice, graphs are often learned from the data, and thus prone to uncertainty. In these settings, the uncertainty of the network structure can become exaggerated by giving unreliable estimates of the module structure. In this work, we begin to address this challenge through the use of a nonparametric bootstrap approach to assessing the stability of module detection in a graph. Estimates of stability are presented at the level of the individual node, the inferred modules, and as an overall measure of performance for module detection in a given graph. Furthermore, bootstrap stability estimates are derived for complexity parameter selection that ultimately defines a graph from data in a way that optimizes stability. This approach is utilized in connection with correlation graphs but is generalizable to other graphs that are defined through the use of dissimilarity measures. We demonstrate our approach using a broad range of simulations and on a metabolomics dataset from the Beijing Olympics Air Pollution study. These approaches are implemented using bootcluster package that is available in the R programming language.
© 2021 The Authors. Statistical Analysis and Data Mining published by Wiley Periodicals LLC.

Entities:  

Keywords:  Jaccard coefficient; clustering; graphical model; module detection; network; stability

Year:  2021        PMID: 33777285      PMCID: PMC7986843          DOI: 10.1002/sam.11495

Source DB:  PubMed          Journal:  Stat Anal Data Min        ISSN: 1932-1864            Impact factor:   1.051


  16 in total

1.  A stability based method for discovering structure in clustered data.

Authors:  Asa Ben-Hur; Andre Elisseeff; Isabelle Guyon
Journal:  Pac Symp Biocomput       Date:  2002

2.  Fast algorithm for detecting community structure in networks.

Authors:  M E J Newman
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2004-06-18

3.  Defining and identifying communities in networks.

Authors:  Filippo Radicchi; Claudio Castellano; Federico Cecconi; Vittorio Loreto; Domenico Parisi
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-23       Impact factor: 11.205

4.  Coexpression network analysis in chronic hepatitis B and C hepatic lesions reveals distinct patterns of disease progression to hepatocellular carcinoma.

Authors:  Danning He; Zhi-Ping Liu; Masao Honda; Shuichi Kaneko; Luonan Chen
Journal:  J Mol Cell Biol       Date:  2012-03-31       Impact factor: 6.216

5.  The joint graphical lasso for inverse covariance estimation across multiple classes.

Authors:  Patrick Danaher; Pei Wang; Daniela M Witten
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2014-03       Impact factor: 4.488

6.  Bagging to improve the accuracy of a clustering procedure.

Authors:  Sandrine Dudoit; Jane Fridlyand
Journal:  Bioinformatics       Date:  2003-06-12       Impact factor: 6.937

7.  Quantitative assessment of gene expression network module-validation methods.

Authors:  Bing Li; Yingying Zhang; Yanan Yu; Pengqian Wang; Yongcheng Wang; Zhong Wang; Yongyan Wang
Journal:  Sci Rep       Date:  2015-10-16       Impact factor: 4.379

8.  Correlation-Based Network Generation, Visualization, and Analysis as a Powerful Tool in Biological Studies: A Case Study in Cancer Cell Metabolism.

Authors:  Albert Batushansky; David Toubiana; Aaron Fait
Journal:  Biomed Res Int       Date:  2016-10-19       Impact factor: 3.411

9.  Metabolomics Profiling before, during, and after the Beijing Olympics: A Panel Study of Within-Individual Differences during Periods of High and Low Air Pollution.

Authors:  Lina Mu; Zhongzheng Niu; Rachael Hageman Blair; Han Yu; Richard W Browne; Matthew R Bonner; Tiffany Fanter; Furong Deng; Mya Swanson
Journal:  Environ Health Perspect       Date:  2019-05-29       Impact factor: 9.031

10.  WGCNA: an R package for weighted correlation network analysis.

Authors:  Peter Langfelder; Steve Horvath
Journal:  BMC Bioinformatics       Date:  2008-12-29       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.