Literature DB >> 17646346

A graph-based approach to systematically reconstruct human transcriptional regulatory modules.

Xifeng Yan1, Michael R Mehan, Yu Huang, Michael S Waterman, Philip S Yu, Xianghong Jasmine Zhou.   

Abstract

MOTIVATION: A major challenge in studying gene regulation is to systematically reconstruct transcription regulatory modules, which are defined as sets of genes that are regulated by a common set of transcription factors. A commonly used approach for transcription module reconstruction is to derive coexpression clusters from a microarray dataset. However, such results often contain false positives because genes from many transcription modules may be simultaneously perturbed upon a given type of conditions. In this study, we propose and validate that genes, which form a coexpression cluster in multiple microarray datasets across diverse conditions, are more likely to form a transcription module. However, identifying genes coexpressed in a subset of many microarray datasets is not a trivial computational problem.
RESULTS: We propose a graph-based data-mining approach to efficiently and systematically identify frequent coexpression clusters. Given m microarray datasets, we model each microarray dataset as a coexpression graph, and search for vertex sets which are frequently densely connected across [theta m] datasets (0 < or = theta < or = 1). For this novel graph-mining problem, we designed two techniques to narrow down the search space: (1) partition the input graphs into (overlapping) groups sharing common properties; (2) summarize the vertex neighbor information from the partitioned datasets onto the 'Neighbor Association Summary Graph's for effective mining. We applied our method to 105 human microarray datasets, and identified a large number of potential transcription modules, activated under different subsets of conditions. Validation by ChIP-chip data demonstrated that the likelihood of a coexpression cluster being a transcription module increases significantly with its recurrence. Our method opens a new way to exploit the vast amount of existing microarray data accumulation for gene regulation study. Furthermore, the algorithm is applicable to other biological networks for approximate network module mining. AVAILABILITY: http://zhoulab.usc.edu/NeMo/.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17646346     DOI: 10.1093/bioinformatics/btm227

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  32 in total

1.  Algorithm to identify frequent coupled modules from two-layered network series: application to study transcription and splicing coupling.

Authors:  Wenyuan Li; Chao Dai; Chun-Chi Liu; Xianghong Jasmine Zhou
Journal:  J Comput Biol       Date:  2012-06       Impact factor: 1.479

2.  An integrative network approach to map the transcriptome to the phenome.

Authors:  Michael R Mehan; Juan Nunez-Iglesias; Mrinal Kalakrishnan; Michael S Waterman; Xianghong Jasmine Zhou
Journal:  J Comput Biol       Date:  2009-08       Impact factor: 1.479

3.  Integrative analysis of many RNA-seq datasets to study alternative splicing.

Authors:  Wenyuan Li; Chao Dai; Shuli Kang; Xianghong Jasmine Zhou
Journal:  Methods       Date:  2014-02-28       Impact factor: 3.608

4.  Comparison of threshold selection methods for microarray gene co-expression matrices.

Authors:  Bhavesh R Borate; Elissa J Chesler; Michael A Langston; Arnold M Saxton; Brynn H Voy
Journal:  BMC Res Notes       Date:  2009-12-02

5.  A global meta-analysis of microarray expression data to predict unknown gene functions and estimate the literature-data divide.

Authors:  Jonathan D Wren
Journal:  Bioinformatics       Date:  2009-05-15       Impact factor: 6.937

6.  Enumeration of condition-dependent dense modules in protein interaction networks.

Authors:  Elisabeth Georgii; Sabine Dietmann; Takeaki Uno; Philipp Pagel; Koji Tsuda
Journal:  Bioinformatics       Date:  2009-02-11       Impact factor: 6.937

7.  Clique-based data mining for related genes in a biomedical database.

Authors:  Tsutomu Matsunaga; Chikara Yonemori; Etsuji Tomita; Masaaki Muramatsu
Journal:  BMC Bioinformatics       Date:  2009-07-01       Impact factor: 3.169

8.  Using pre-existing microarray datasets to increase experimental power: application to insulin resistance.

Authors:  Bernie J Daigle; Alicia Deng; Tracey McLaughlin; Samuel W Cushman; Margaret C Cam; Gerald Reaven; Philip S Tsao; Russ B Altman
Journal:  PLoS Comput Biol       Date:  2010-03-26       Impact factor: 4.475

9.  Meta Analysis of Gene Expression Data within and Across Species.

Authors:  Ana C Fierro; Filip Vandenbussche; Kristof Engelen; Yves Van de Peer; Kathleen Marchal
Journal:  Curr Genomics       Date:  2008-12       Impact factor: 2.236

10.  Transcription factor site dependencies in human, mouse and rat genomes.

Authors:  Andrija Tomovic; Michael Stadler; Edward J Oakeley
Journal:  BMC Bioinformatics       Date:  2009-10-16       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.