Literature DB >> 19930695

Regularized estimation of large-scale gene association networks using graphical Gaussian models.

Nicole Krämer1, Juliane Schäfer, Anne-Laure Boulesteix.   

Abstract

BACKGROUND: Grapn>hical Gaussian models are popular tools for the estimation of (undirected) gene association networks from microarray data. A key issue when the number of variables greatly exceeds the number of sampn>les is the estimation of the matrix of partial correlations. Since the (Moore-Penrose) inverse of the sampn>le covariance matrix leads to poor estimates in this scenario, standard methods are inapn>propriate and adequate regularization techniques are needed. Popular apn>proaches include biased estimates of the covariance matrix and high-dimensional regression schemes, such as the Lasso and Partial Least Squares.
p
an class="abstract_title">RESULTS:
In this article, we investigate a general framework for combining regularized regression methods with the estimation of Graphical Gaussian models. This framework includes various existing methods as well as two new approaches based on ridge regression and adaptive lasso, respectively. These methods are extensively compared both qualitatively and quantitatively within a simulation study and through an application to six diverse real data sets. In addition, all proposed algorithms are implemented in the R package "parcor", available from the R repository CRAN.
CONCLUSION: In our simulation studies, the investigated non-sparse regression methods, i.e. Ridge Regression and Partial Least Squares, exhibit rather conservative behavior when combined with (local) false discovery rate multiple testing in order to decide whether or not an edge is present in the network. For networks with higher densities, the difference in performance of the methods decreases. For sparse networks, we confirm the Lasso's well known tendency towards selecting too many edges, whereas the two-stage adaptive Lasso is an interesting alternative that provides sparser solutions. In our simulations, both sparse and non-sparse methods are able to reconstruct networks with cluster structures. On six real data sets, we also clearly distinguish the results obtained using the non-sparse methods and those obtained using the sparse methods where specification of the regularization parameter automatically means model selection. In five out of six data sets, Partial Least Squares selects very dense networks. Furthermore, for data that violate the assumption of uncorrelated observations (due to replications), the Lasso and the adaptive Lasso yield very complex structures, indicating that they might not be suited under these conditions. The shrinkage approach is more stable than the regression based approaches when using subsampling.

Entities:  

Mesh:

Year:  2009        PMID: 19930695      PMCID: PMC2808166          DOI: 10.1186/1471-2105-10-384

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  21 in total

1.  Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis.

Authors:  Katy C Kao; Young-Lyeol Yang; Riccardo Boscolo; Chiara Sabatti; Vwani Roychowdhury; James C Liao
Journal:  Proc Natl Acad Sci U S A       Date:  2003-12-23       Impact factor: 11.205

2.  Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks.

Authors:  Hongzhe Li; Jiang Gui
Journal:  Biostatistics       Date:  2005-12-02       Impact factor: 5.899

Review 3.  Partial least squares: a versatile tool for the analysis of high-dimensional genomic data.

Authors:  Anne-Laure Boulesteix; Korbinian Strimmer
Journal:  Brief Bioinform       Date:  2006-05-26       Impact factor: 11.622

4.  A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics.

Authors:  Juliane Schäfer; Korbinian Strimmer
Journal:  Stat Appl Genet Mol Biol       Date:  2005-11-14

5.  Identification of genetically mediated cortical networks: a multivariate study of pediatric twins and siblings.

Authors:  J E Schmitt; R K Lenroot; G L Wallace; S Ordaz; K N Taylor; N Kabani; D Greenstein; J P Lerch; K S Kendler; M C Neale; J N Giedd
Journal:  Cereb Cortex       Date:  2008-01-29       Impact factor: 5.357

Review 6.  A review of feature selection techniques in bioinformatics.

Authors:  Yvan Saeys; Iñaki Inza; Pedro Larrañaga
Journal:  Bioinformatics       Date:  2007-08-24       Impact factor: 6.937

7.  Reconstruction of genetic association networks from microarray data: a partial least squares approach.

Authors:  Vasyl Pihur; Somnath Datta; Susmita Datta
Journal:  Bioinformatics       Date:  2008-01-18       Impact factor: 6.937

8.  Combining partial correlation and an information theory approach to the reversed engineering of gene co-expression networks.

Authors:  Antonio Reverter; Eva K F Chan
Journal:  Bioinformatics       Date:  2008-09-10       Impact factor: 6.937

Review 9.  Stability and aggregation of ranked gene lists.

Authors:  Anne-Laure Boulesteix; Martin Slawski
Journal:  Brief Bioinform       Date:  2009-09       Impact factor: 11.622

10.  An Arabidopsis gene network based on the graphical Gaussian model.

Authors:  Shisong Ma; Qingqiu Gong; Hans J Bohnert
Journal:  Genome Res       Date:  2007-10-05       Impact factor: 9.043

View more
  58 in total

1.  Comparative analysis of false discovery rate methods in constructing metabolic association networks.

Authors:  Imhoi Koo; Sen Yao; Xiang Zhang; Seongho Kim
Journal:  J Bioinform Comput Biol       Date:  2014-08-07       Impact factor: 1.122

2.  Gene Network Reconstruction using Global-Local Shrinkage Priors.

Authors:  Gwenaël G R Leday; Mathisca C M de Gunst; Gino B Kpogbezan; Aad W van der Vaart; Wessel N van Wieringen; Mark A van de Wiel
Journal:  Ann Appl Stat       Date:  2017-03       Impact factor: 2.083

3.  Inference of gene regulatory networks from genetic perturbations with linear regression model.

Authors:  Zijian Dong; Tiecheng Song; Chuang Yuan
Journal:  PLoS One       Date:  2013-12-23       Impact factor: 3.240

4.  Incorporating prior knowledge into Gene Network Study.

Authors:  Zixing Wang; Wenlong Xu; F Anthony San Lucas; Yin Liu
Journal:  Bioinformatics       Date:  2013-08-16       Impact factor: 6.937

5.  Multilevel regularized regression for simultaneous taxa selection and network construction with metagenomic count data.

Authors:  Zhenqiu Liu; Fengzhu Sun; Jonathan Braun; Dermot P B McGovern; Steven Piantadosi
Journal:  Bioinformatics       Date:  2014-11-20       Impact factor: 6.937

6.  A novel genomic selection method combining GBLUP and LASSO.

Authors:  Hengde Li; Jingwei Wang; Zhenmin Bao
Journal:  Genetica       Date:  2015-02-06       Impact factor: 1.082

7.  Adjusted adaptive Lasso for covariate model-building in nonlinear mixed-effect pharmacokinetic models.

Authors:  Elham Haem; Kajsa Harling; Seyyed Mohammad Taghi Ayatollahi; Najaf Zare; Mats O Karlsson
Journal:  J Pharmacokinet Pharmacodyn       Date:  2017-01-31       Impact factor: 2.745

8.  The gene regulatory network of mESC differentiation: a benchmark for reverse engineering methods.

Authors:  Johannes Meisig; Nils Blüthgen
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2018-07-05       Impact factor: 6.237

9.  Evaluation of linkage disequilibrium in wheat with an L1-regularized sparse Markov network.

Authors:  Gota Morota; Daniel Gianola
Journal:  Theor Appl Genet       Date:  2013-05-10       Impact factor: 5.699

10.  A Multi-Method Approach for Proteomic Network Inference in 11 Human Cancers.

Authors:  Yasin Şenbabaoğlu; Selçuk Onur Sümer; Francisco Sánchez-Vega; Debra Bemis; Giovanni Ciriello; Nikolaus Schultz; Chris Sander
Journal:  PLoS Comput Biol       Date:  2016-02-29       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.