Literature DB >> 23050260

Fast R Functions for Robust Correlations and Hierarchical Clustering.

Peter Langfelder1, Steve Horvath.   

Abstract

Many high-throughput biological data analyses require the calculation of large correlation matrices and/or clustering of a large number of objects. The standard R function for calculating Pearson correlation can handle calculations without missing values efficiently, but is inefficient when applied to data sets with a relatively small number of missing data. We present an implementation of Pearson correlation calculation that can lead to substantial speedup on data with relatively small number of missing entries. Further, we parallelize all calculations and thus achieve further speedup on systems where parallel processing is available. A robust correlation measure, the biweight midcorrelation, is implemented in a similar manner and provides comparable speed. The functions cor and bicor for fast Pearson and biweight midcorrelation, respectively, are part of the updated, freely available R package WGCNA.The hierarchical clustering algorithm implemented in R function hclust is an order n(3) (n is the number of clustered objects) version of a publicly available clustering algorithm (Murtagh 2012). We present the package flashClust that implements the original algorithm which in practice achieves order approximately n(2), leading to substantial time savings when clustering large data sets.

Entities:  

Year:  2012        PMID: 23050260      PMCID: PMC3465711     

Source DB:  PubMed          Journal:  J Stat Softw        ISSN: 1548-7660            Impact factor:   6.440


  8 in total

1.  Hierarchical organization of modularity in metabolic networks.

Authors:  E Ravasz; A L Somera; D A Mongru; Z N Oltvai; A L Barabási
Journal:  Science       Date:  2002-08-30       Impact factor: 47.728

2.  A general framework for weighted gene co-expression network analysis.

Authors:  Bin Zhang; Steve Horvath
Journal:  Stat Appl Genet Mol Biol       Date:  2005-08-12

3.  Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R.

Authors:  Peter Langfelder; Bin Zhang; Steve Horvath
Journal:  Bioinformatics       Date:  2007-11-16       Impact factor: 6.937

4.  Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target.

Authors:  S Horvath; B Zhang; M Carlson; K V Lu; S Zhu; R M Felciano; M F Laurance; W Zhao; S Qi; Z Chen; Y Lee; A C Scheck; L M Liau; H Wu; D H Geschwind; P G Febbo; H I Kornblum; T F Cloughesy; S F Nelson; P S Mischel
Journal:  Proc Natl Acad Sci U S A       Date:  2006-11-07       Impact factor: 11.205

5.  Integrating genetic and network analysis to characterize genes related to mouse weight.

Authors:  Anatole Ghazalpour; Sudheer Doss; Bin Zhang; Susanna Wang; Christopher Plaisier; Ruth Castellanos; Alec Brozell; Eric E Schadt; Thomas A Drake; Aldons J Lusis; Steve Horvath
Journal:  PLoS Genet       Date:  2006-07-05       Impact factor: 5.917

6.  Gene network interconnectedness and the generalized topological overlap measure.

Authors:  Andy M Yip; Steve Horvath
Journal:  BMC Bioinformatics       Date:  2007-01-24       Impact factor: 3.169

7.  WGCNA: an R package for weighted correlation network analysis.

Authors:  Peter Langfelder; Steve Horvath
Journal:  BMC Bioinformatics       Date:  2008-12-29       Impact factor: 3.169

8.  A robust measure of correlation between two genes on a microarray.

Authors:  Johanna Hardin; Aya Mitani; Leanne Hicks; Brian VanKoten
Journal:  BMC Bioinformatics       Date:  2007-06-25       Impact factor: 3.169

  8 in total
  353 in total

1.  Serum biomarkers associated with baseline clinical severity in young steroid-naïve Duchenne muscular dystrophy boys.

Authors:  Utkarsh J Dang; Michael Ziemba; Paula R Clemens; Yetrib Hathout; Laurie S Conklin; Eric P Hoffman
Journal:  Hum Mol Genet       Date:  2020-08-29       Impact factor: 6.150

2.  Discovering New Biology through Sequencing of RNA.

Authors:  Andreas P M Weber
Journal:  Plant Physiol       Date:  2015-09-09       Impact factor: 8.340

Review 3.  The Role of the Gut Microbiome in Predicting Response to Diet and the Development of Precision Nutrition Models-Part I: Overview of Current Methods.

Authors:  Riley L Hughes; Maria L Marco; James P Hughes; Nancy L Keim; Mary E Kable
Journal:  Adv Nutr       Date:  2019-11-01       Impact factor: 8.701

4.  Generation of a microglial developmental index in mice and in humans reveals a sex difference in maturation and immune reactivity.

Authors:  Richa Hanamsagar; Mark D Alter; Carina S Block; Haley Sullivan; Jessica L Bolton; Staci D Bilbo
Journal:  Glia       Date:  2017-06-15       Impact factor: 7.452

5.  Reply to Liu et al.

Authors:  Srinivas Nallandhighal; Tuan M Tran
Journal:  J Infect Dis       Date:  2019-07-02       Impact factor: 5.226

6.  lncRNA expression predicts mRNA abundance.

Authors:  Alicja Pacholewska; Myong-Hee Sung
Journal:  Epigenomics       Date:  2019-07-24       Impact factor: 4.778

7.  Gene expression profile of subcutaneous adipose tissue in BMI-discordant monozygotic twin pairs unravels molecular and clinical changes associated with sub-types of obesity.

Authors:  M Muniandy; S Heinonen; H Yki-Järvinen; A Hakkarainen; J Lundbom; N Lundbom; J Kaprio; A Rissanen; M Ollikainen; K H Pietiläinen
Journal:  Int J Obes (Lond)       Date:  2017-04-25       Impact factor: 5.095

8.  Gene Expression Correlated with Severe Asthma Characteristics Reveals Heterogeneous Mechanisms of Severe Disease.

Authors:  Brian D Modena; Eugene R Bleecker; William W Busse; Serpil C Erzurum; Benjamin M Gaston; Nizar N Jarjour; Deborah A Meyers; Jadranka Milosevic; John R Tedrow; Wei Wu; Naftali Kaminski; Sally E Wenzel
Journal:  Am J Respir Crit Care Med       Date:  2017-06-01       Impact factor: 21.405

9.  Systems genetic analysis of inversion polymorphisms in the malaria mosquito Anopheles gambiae.

Authors:  Changde Cheng; John C Tan; Matthew W Hahn; Nora J Besansky
Journal:  Proc Natl Acad Sci U S A       Date:  2018-07-09       Impact factor: 11.205

10.  The Systems Architecture of Molecular Memory in Poplar after Abiotic Stress.

Authors:  Elisabeth Georgii; Karl Kugler; Matthias Pfeifer; Elisa Vanzo; Katja Block; Malgorzata A Domagalska; Werner Jud; Hamada AbdElgawad; Han Asard; Richard Reinhardt; Armin Hansel; Manuel Spannagl; Anton R Schäffner; Klaus Palme; Klaus F X Mayer; Jörg-Peter Schnitzler
Journal:  Plant Cell       Date:  2019-01-31       Impact factor: 11.277

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.