Sparse multivariate factor analysis regression models and its applications to integrative genomics analysis.

Yan Zhou, Pei Wang, Xianlong Wang, Ji Zhu, Peter X-K Song.

Abstract

The multivariate regression model is a useful tool for exploring complex associations between two kinds of molecular markers, which helps in understanding the biological pathways underlying disease etiology. For a set of correlated response variables, accounting for such dependency can increase statistical power. Motivated by integrative genomic data analyses, we propose a new methodology, the sparse multivariate factor analysis regression model (smFARM), in which correlations among response variables are assumed to follow a factor analysis model with latent factors. The proposed method not only addresses the challenge that the number of association parameters is larger than the sample size, but also adjusts for unobserved genetic and/or nongenetic factors that potentially conceal the underlying response-predictor associations. smFARM is implemented by the EM algorithm and the blockwise coordinate descent algorithm. The methodology is evaluated and compared with existing methods through extensive simulation studies. Our results show that accounting for latent factors through smFARM can improve the sensitivity of signal detection and the accuracy of sparse association map estimation. We illustrate smFARM with two integrative genomics analyses, one of a breast cancer dataset and one of an ovarian cancer dataset, assessing the relationship between DNA copy numbers and gene expression arrays to understand genetic regulatory patterns relevant to the disease. We identify two trans-hub regions: one in cytoband 17q12, whose amplification influences the RNA expression levels of important breast cancer genes, and the other in cytoband 9q21.32-33, which is associated with chemoresistance in ovarian cancer.
© 2016 WILEY PERIODICALS, INC.
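
The abstract does not state the model formally. As a rough sketch only, a sparse regression with factor-analytic errors of the kind described is commonly written as below. All symbols here (y_i, x_i, z_i, B, \Lambda, \Psi, P_\lambda) are notation assumed for illustration, not taken from the paper, and the exact penalty used by smFARM is not given in the abstract.

```latex
% Assumed notation: y_i is the q-vector of responses (e.g. expression levels),
% x_i the p-vector of predictors (e.g. copy numbers), z_i a k-vector of latent factors.
\begin{align*}
  y_i &= B^{\top} x_i + \Lambda z_i + \varepsilon_i,
      & z_i &\sim N(0, I_k), \quad
        \varepsilon_i \sim N(0, \Psi), \ \Psi \text{ diagonal}, \\
  y_i \mid x_i &\sim N\!\left(B^{\top} x_i,\ \Lambda \Lambda^{\top} + \Psi\right), \\
  (\hat{B}, \hat{\Lambda}, \hat{\Psi})
      &= \arg\min_{B, \Lambda, \Psi}
         \left\{ -\sum_{i=1}^{n} \log f\!\left(y_i \mid x_i;\, B, \Lambda, \Psi\right)
         + P_{\lambda}(B) \right\}.
\end{align*}
% P_lambda(B) stands for a generic sparsity-inducing penalty on the p x q
% coefficient matrix B; an l1 penalty, lambda * sum_{j,k} |b_{jk}|, is one example.
```

Under this reading, the latent factors z_i would be handled in the E-step of the EM algorithm, and the penalized update of B would be the step solved by blockwise coordinate descent. The marginal covariance \Lambda \Lambda^{\top} + \Psi is what captures the dependency among correlated responses.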

Keywords:  EM-blockwise coordinate descent; high-dimensional data; latent factors; regularization

Year:  2016        PMID: 27862229      PMCID: PMC5154917          DOI: 10.1002/gepi.22018

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


