Literature DB >> 29114920

Integrative sparse principal component analysis of gene expression data.

Mengque Liu1, Xinyan Fan1, Kuangnan Fang1, Qingzhao Zhang1,2, Shuangge Ma1,2,3.   

Abstract

In the analysis of gene expression data, dimension reduction techniques have been extensively adopted. The most popular one is perhaps the PCA (principal component analysis). To generate more reliable and more interpretable results, the SPCA (sparse PCA) technique has been developed. With the "small sample size, high dimensionality" characteristic of gene expression data, the analysis results generated from a single dataset are often unsatisfactory. Under contexts other than dimension reduction, integrative analysis techniques, which jointly analyze the raw data of multiple independent datasets, have been developed and shown to outperform "classic" meta-analysis and other multidatasets techniques and single-dataset analysis. In this study, we conduct integrative analysis by developing the iSPCA (integrative SPCA) method. iSPCA achieves the selection and estimation of sparse loadings using a group penalty. To take advantage of the similarity across datasets and generate more accurate results, we further impose contrasted penalties. Different penalties are proposed to accommodate different data conditions. Extensive simulations show that iSPCA outperforms the alternatives under a wide spectrum of settings. The analysis of breast cancer and pancreatic cancer data further shows iSPCA's satisfactory performance.
© 2017 WILEY PERIODICALS, INC.

Entities:  

Keywords:  contrasted penalization; gene expression data; integrative analysis; sparse PCA

Mesh:

Year:  2017        PMID: 29114920      PMCID: PMC5912177          DOI: 10.1002/gepi.22089

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  20 in total

1.  Identification of cancer genomic markers via integrative sparse boosting.

Authors:  Yuan Huang; Jian Huang; Ben-Chang Shia; Shuangge Ma
Journal:  Biostatistics       Date:  2011-10-31       Impact factor: 5.899

2.  Detecting functional connectivity in fMRI using PCA and regression analysis.

Authors:  Yuan Zhong; Huinan Wang; Guangming Lu; Zhiqiang Zhang; Qing Jiao; Yijun Liu
Journal:  Brain Topogr       Date:  2009-05-01       Impact factor: 3.020

3.  Integrative analysis and variable selection with multiple high-dimensional data sets.

Authors:  Shuangge Ma; Jian Huang; Xiao Song
Journal:  Biostatistics       Date:  2011-03-16       Impact factor: 5.899

4.  Incorporating network structure in integrative analysis of cancer prognosis data.

Authors:  Jin Liu; Jian Huang; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2012-11-17       Impact factor: 2.135

5.  Integrative analysis of high-throughput cancer studies with contrasted penalization.

Authors:  Xingjie Shi; Jin Liu; Jian Huang; Yong Zhou; BenChang Shia; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2014-01-06       Impact factor: 2.135

6.  Sparse principal component analysis for identifying ancestry-informative markers in genome-wide association studies.

Authors:  Seokho Lee; Michael P Epstein; Richard Duncan; Xihong Lin
Journal:  Genet Epidemiol       Date:  2012-04-16       Impact factor: 2.135

7.  Variable selection in the accelerated failure time model via the bridge method.

Authors:  Jian Huang; Shuangge Ma
Journal:  Lifetime Data Anal       Date:  2009-12-16       Impact factor: 1.588

8.  FKBP51 affects cancer cell response to chemotherapy by negatively regulating Akt.

Authors:  Huadong Pei; Liang Li; Brooke L Fridley; Gregory D Jenkins; Krishna R Kalari; Wilma Lingle; Gloria Petersen; Zhenkun Lou; Liewei Wang
Journal:  Cancer Cell       Date:  2009-09-08       Impact factor: 31.743

9.  Gene expression abnormalities in histologically normal breast epithelium of breast cancer patients.

Authors:  Anusri Tripathi; Chialin King; Antonio de la Morenas; Victoria Kristina Perry; Bohdana Burke; Gregory A Antoine; Erwin F Hirsch; Maureen Kavanah; Jane Mendez; Michael Stone; Norman P Gerry; Marc E Lenburg; Carol L Rosenberg
Journal:  Int J Cancer       Date:  2008-04-01       Impact factor: 7.396

10.  Eigengene networks for studying the relationships between co-expression modules.

Authors:  Peter Langfelder; Steve Horvath
Journal:  BMC Syst Biol       Date:  2007-11-21
View more
  1 in total

1.  Virtual reality for the observation of oncology models (VROOM): immersive analytics for oncology patient cohorts.

Authors:  Chng Wei Lau; Zhonglin Qu; Daniel Draper; Rosa Quan; Ali Braytee; Andrew Bluff; Dongmo Zhang; Andrew Johnston; Paul J Kennedy; Simeon Simoff; Quang Vinh Nguyen; Daniel Catchpoole
Journal:  Sci Rep       Date:  2022-07-05       Impact factor: 4.996

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.