Literature DB >> 33445666

K-Module Algorithm: An Additional Step to Improve the Clustering Results of WGCNA Co-Expression Networks.

Jie Hou1, Xiufen Ye1, Chuanlong Li1, Yixing Wang1.   

Abstract

Among biological networks, co-expression networks have been widely studied. One of the most commonly used pipelines for the construction of co-expression networks is weighted gene co-expression network analysis (WGCNA), which can identify highly co-expressed clusters of genes (modules). WGCNA identifies gene modules using hierarchical clustering. The major drawback of hierarchical clustering is that once two objects are clustered together, it cannot be reversed; thus, re-adjustment of the unbefitting decision is impossible. In this paper, we calculate the similarity matrix with the distance correlation for WGCNA to construct a gene co-expression network, and present a new approach called the k-module algorithm to improve the WGCNA clustering results. This method can assign all genes to the module with the highest mean connectivity with these genes. This algorithm re-adjusts the results of hierarchical clustering while retaining the advantages of the dynamic tree cut method. The validity of the algorithm is verified using six datasets from microarray and RNA-seq data. The k-module algorithm has fewer iterations, which leads to lower complexity. We verify that the gene modules obtained by the k-module algorithm have high enrichment scores and strong stability. Our method improves upon hierarchical clustering, and can be applied to general clustering algorithms based on the similarity matrix, not limited to gene co-expression network analysis.

Entities:  

Keywords:  connectivity; distance correlation; enrichment analysis; gene co-expression networks

Year:  2021        PMID: 33445666      PMCID: PMC7828115          DOI: 10.3390/genes12010087

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


  23 in total

1.  A general framework for weighted gene co-expression network analysis.

Authors:  Bin Zhang; Steve Horvath
Journal:  Stat Appl Genet Mol Biol       Date:  2005-08-12

2.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

Authors:  Da Wei Huang; Brad T Sherman; Richard A Lempicki
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

3.  A Genetic Algorithm to Optimize Weighted Gene Co-Expression Network Analysis.

Authors:  David Toubiana; Rami Puzis; Avi Sadka; Eduardo Blumwald
Journal:  J Comput Biol       Date:  2019-07-30       Impact factor: 1.479

4.  Normalized lmQCM: An Algorithm for Detecting Weak Quasi-Cliques in Weighted Graph with Applications in Gene Co-Expression Module Discovery in Cancers.

Authors:  Jie Zhang; Kun Huang
Journal:  Cancer Inform       Date:  2016-07-24

5.  A high-resolution association mapping panel for the dissection of complex traits in mice.

Authors:  Brian J Bennett; Charles R Farber; Luz Orozco; Hyun Min Kang; Anatole Ghazalpour; Nathan Siemers; Michael Neubauer; Isaac Neuhaus; Roumyana Yordanova; Bo Guan; Amy Truong; Wen-pin Yang; Aiqing He; Paul Kayne; Peter Gargalovic; Todd Kirchgessner; Calvin Pan; Lawrence W Castellani; Emrah Kostem; Nicholas Furlotte; Thomas A Drake; Eleazar Eskin; Aldons J Lusis
Journal:  Genome Res       Date:  2010-01-06       Impact factor: 9.043

6.  Unraveling inflammatory responses using systems genetics and gene-environment interactions in macrophages.

Authors:  Luz D Orozco; Brian J Bennett; Charles R Farber; Anatole Ghazalpour; Calvin Pan; Nam Che; Pingzi Wen; Hong Xiu Qi; Adonisa Mutukulu; Nathan Siemers; Isaac Neuhaus; Roumyana Yordanova; Peter Gargalovic; Matteo Pellegrini; Todd Kirchgessner; Aldons J Lusis
Journal:  Cell       Date:  2012-10-26       Impact factor: 41.582

Review 7.  Computational cluster validation in post-genomic data analysis.

Authors:  Julia Handl; Joshua Knowles; Douglas B Kell
Journal:  Bioinformatics       Date:  2005-05-24       Impact factor: 6.937

8.  An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

Authors:  Juan A Botía; Jana Vandrovcova; Paola Forabosco; Sebastian Guelfi; Karishma D'Sa; John Hardy; Cathryn M Lewis; Mina Ryten; Michael E Weale
Journal:  BMC Syst Biol       Date:  2017-04-12

9.  WGCNA: an R package for weighted correlation network analysis.

Authors:  Peter Langfelder; Steve Horvath
Journal:  BMC Bioinformatics       Date:  2008-12-29       Impact factor: 3.169

Review 10.  Network-Based Approaches to Explore Complex Biological Systems towards Network Medicine.

Authors:  Giulia Fiscon; Federica Conte; Lorenzo Farina; Paola Paci
Journal:  Genes (Basel)       Date:  2018-08-31       Impact factor: 4.096

View more
  4 in total

1.  Identification of biomarkers related to neutrophils and two molecular subtypes of systemic lupus erythematosus.

Authors:  Huiyan Li; Pingting Yang
Journal:  BMC Med Genomics       Date:  2022-07-20       Impact factor: 3.622

Review 2.  Molecular Regulatory Mechanisms Drive Emergent Pathogenetic Properties of Neisseria gonorrhoeae.

Authors:  Ashwini Sunkavalli; Ryan McClure; Caroline Genco
Journal:  Microorganisms       Date:  2022-04-28

3.  Distance correlation application to gene co-expression network analysis.

Authors:  Jie Hou; Xiufen Ye; Weixing Feng; Qiaosheng Zhang; Yatong Han; Yusong Liu; Yu Li; Yufen Wei
Journal:  BMC Bioinformatics       Date:  2022-02-21       Impact factor: 3.169

4.  Spatial Transcriptomic Analysis Using R-Based Computational Machine Learning Reveals the Genetic Profile of Yang or Yin Deficiency Syndrome in Chinese Medicine Theory.

Authors:  Cheng Zhang; Chi Wing Tam; Guoyi Tang; Yuanyuan Chen; Ning Wang; Yibin Feng
Journal:  Evid Based Complement Alternat Med       Date:  2022-03-16       Impact factor: 2.629

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.