Literature DB >> 29036516

A two-stage approach of gene network analysis for high-dimensional heterogeneous data.

Sangin Lee1, Faming Liang2, Ling Cai3, Guanghua Xiao4.   

Abstract

Gaussian graphical models have been widely used to construct gene regulatory networks from gene expression data. Most existing methods for Gaussian graphical models are designed to model homogeneous data, assuming a single Gaussian distribution. In practice, however, data may consist of gene expression studies with unknown confounding factors, such as study cohort, microarray platforms, experimental batches, which produce heterogeneous data, and hence lead to false positive edges or low detection power in resulting network, due to those unknown factors. To overcome this problem and improve the performance in constructing gene networks, we propose a two-stage approach to construct a gene network from heterogeneous data. The first stage is to perform a clustering analysis in order to assign samples to a few clusters where the samples in each cluster are approximately homogeneous, and the second stage is to conduct an integrative analysis of networks from each cluster. In particular, we first apply a model-based clustering method using the singular value decomposition for high-dimensional data, and then integrate the networks from each cluster using the integrative $\psi$-learning method. The proposed method is based on an equivalent measure of partial correlation coefficients in Gaussian graphical models, which is computed with a reduced conditional set and thus it is useful for high-dimensional data. We compare the proposed two-stage learning approach with some existing methods in various simulation settings, and demonstrate the robustness of the proposed method. Finally, it is applied to integrate multiple gene expression studies of lung adenocarcinoma to identify potential therapeutic targets and treatment biomarkers.

Entities:  

Mesh:

Year:  2018        PMID: 29036516      PMCID: PMC5862270          DOI: 10.1093/biostatistics/kxx033

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  21 in total

1.  A Local Poisson Graphical Model for inferring networks from sequencing data.

Authors:  Genevera I Allen; Zhandong Liu
Journal:  IEEE Trans Nanobioscience       Date:  2013-08-15       Impact factor: 2.935

2.  A Selective Review of Group Selection in High-Dimensional Models.

Authors:  Jian Huang; Patrick Breheny; Shuangge Ma
Journal:  Stat Sci       Date:  2012       Impact factor: 2.901

3.  Statistical completion of a partially identified graph with applications for the estimation of gene regulatory networks.

Authors:  Donghyeon Yu; Won Son; Johan Lim; Guanghua Xiao
Journal:  Biostatistics       Date:  2015-04-01       Impact factor: 5.899

4.  Covariate-Adjusted Precision Matrix Estimation with an Application in Genetical Genomics.

Authors:  T Tony Cai; Hongzhe Li; Weidong Liu; Jichun Xie
Journal:  Biometrika       Date:  2012-11-30       Impact factor: 2.445

Review 5.  Protein phosphatase 2A: a target for anticancer therapy.

Authors:  Danilo Perrotti; Paolo Neviani
Journal:  Lancet Oncol       Date:  2013-05       Impact factor: 41.316

6.  Partial Correlation Estimation by Joint Sparse Regression Models.

Authors:  Jie Peng; Pei Wang; Nengfeng Zhou; Ji Zhu
Journal:  J Am Stat Assoc       Date:  2009-06-01       Impact factor: 5.033

7.  A functional copy-number variation in MAPKAPK2 predicts risk and prognosis of lung cancer.

Authors:  Bin Liu; Lei Yang; Binfang Huang; Mei Cheng; Hui Wang; Yinyan Li; Dongsheng Huang; Jian Zheng; Qingchu Li; Xin Zhang; Weidong Ji; Yifeng Zhou; Jiachun Lu
Journal:  Am J Hum Genet       Date:  2012-08-10       Impact factor: 11.025

8.  Nfib Promotes Metastasis through a Widespread Increase in Chromatin Accessibility.

Authors:  Sarah K Denny; Dian Yang; Chen-Hua Chuang; Jennifer J Brady; Jing Shan Lim; Barbara M Grüner; Shin-Heng Chiou; Alicia N Schep; Jessika Baral; Cécile Hamard; Martine Antoine; Marie Wislez; Christina S Kong; Andrew J Connolly; Kwon-Sik Park; Julien Sage; William J Greenleaf; Monte M Winslow
Journal:  Cell       Date:  2016-06-30       Impact factor: 41.582

Review 9.  Joining the cell survival squad: an emerging role for protein kinase CK2.

Authors:  Khalil Ahmed; Delphine A Gerber; Claude Cochet
Journal:  Trends Cell Biol       Date:  2002-05       Impact factor: 20.808

10.  Comparing statistical methods for constructing large scale gene networks.

Authors:  Jeffrey D Allen; Yang Xie; Min Chen; Luc Girard; Guanghua Xiao
Journal:  PLoS One       Date:  2012-01-17       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.