Literature DB >> 19223452

Identification of differential gene pathways with principal component analysis.

Shuangge Ma1, Michael R Kosorok.   

Abstract

MOTIVATION: Development of high-throughput technology makes it possible to measure expressions of thousands of genes simultaneously. Genes have the inherent pathway structure, where pathways are composed of multiple genes with coordinated biological functions. It is of great interest to identify differential gene pathways that are associated with the variations of phenotypes.
RESULTS: We propose the following approach for detecting differential gene pathways. First, we construct gene pathways using databases such as KEGG or GO. Second, for each pathway, we extract a small number of representative features, which are linear combinations of gene expressions and/or their transformations. Specifically, we propose using (i) principal components (PCs) of gene expression sets, (ii) PCs of expanded gene expression sets and (iii) expanded sets of PCs of gene expressions, as the representative features. Third, we identify differential gene pathways as those with representative features significantly associated with the variations of phenotypes, particularly disease clinical outcomes, in regression models. The false discovery rate approach is used to adjust for multiple comparisons. Analysis of three gene expression datasets suggests that (i) the proposed approach can effectively identify differential gene pathways; (ii) PCs that explain only a small amount of variations of gene expressions may bear significant associations between gene pathways and phenotypes; (iii) including second-order terms of gene expressions may lead to identification of new differential gene pathways; (iv) the proposed approach is relatively insensitive to additional noises; and (v) the proposed approach can identify gene pathways missed by alternative approaches. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2009        PMID: 19223452      PMCID: PMC2732304          DOI: 10.1093/bioinformatics/btp085

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Principal component analysis for clustering gene expression data.

Authors:  K Y Yeung; W L Ruzzo
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

2.  A global test for groups of genes: testing association with a clinical outcome.

Authors:  Jelle J Goeman; Sara A van de Geer; Floor de Kort; Hans C van Houwelingen
Journal:  Bioinformatics       Date:  2004-01-01       Impact factor: 6.937

3.  The proliferation gene expression signature is a quantitative integrator of oncogenic events that predicts survival in mantle cell lymphoma.

Authors:  Andreas Rosenwald; George Wright; Adrian Wiestner; Wing C Chan; Joseph M Connors; Elias Campo; Randy D Gascoyne; Thomas M Grogan; H Konrad Muller-Hermelink; Erlend B Smeland; Michael Chiorazzi; Jena M Giltnane; Elaine M Hurt; Hong Zhao; Lauren Averett; Sarah Henrickson; Liming Yang; John Powell; Wyndham H Wilson; Elaine S Jaffe; Richard Simon; Richard D Klausner; Emilio Montserrat; Francesc Bosch; Timothy C Greiner; Dennis D Weisenburger; Warren G Sanger; Bhavana J Dave; James C Lynch; Julie Vose; James O Armitage; Richard I Fisher; Thomas P Miller; Michael LeBlanc; German Ott; Stein Kvaloy; Harald Holte; Jan Delabie; Louis M Staudt
Journal:  Cancer Cell       Date:  2003-02       Impact factor: 31.743

4.  [Tyrosine metabolism in leukemia].

Authors:  V D Ivanova; M M Kaverzneva
Journal:  Probl Gematol Pereliv Krovi       Date:  1971-02

5.  In vitro modulation of natural killer cell activity in non-Hodgkin's lymphoma patients after therapy.

Authors:  B A Mehta; M N Satam; S H Advani; J J Nadkarni
Journal:  Cancer Immunol Immunother       Date:  1989       Impact factor: 6.968

6.  Natural cell-mediated cytotoxicity in cutaneous T-cell lymphomas.

Authors:  B A Neilan; E C Vonderheid; K J O'Neill
Journal:  J Invest Dermatol       Date:  1983-08       Impact factor: 8.551

7.  High-throughput retroviral tagging for identification of genes involved in initiation and progression of mouse splenic marginal zone lymphomas.

Authors:  Min Sun Shin; Torgny N Fredrickson; Janet W Hartley; Takeshi Suzuki; Keiko Akagi; Keiko Agaki; Herbert C Morse
Journal:  Cancer Res       Date:  2004-07-01       Impact factor: 12.701

8.  Genome-wide association study and mouse model identify interaction between RET and EDNRB pathways in Hirschsprung disease.

Authors:  Minerva M Carrasquillo; Andrew S McCallion; Erik G Puffenberger; Carl S Kashuk; Nassim Nouri; Aravinda Chakravarti
Journal:  Nat Genet       Date:  2002-09-23       Impact factor: 38.330

Review 9.  Molecular control of the cell cycle in cancer: biological and clinical aspects.

Authors:  Michael Boe Møller
Journal:  Dan Med Bull       Date:  2003-05

10.  A general modular framework for gene set enrichment analysis.

Authors:  Marit Ackermann; Korbinian Strimmer
Journal:  BMC Bioinformatics       Date:  2009-02-03       Impact factor: 3.169

View more
  27 in total

Review 1.  Identification of aberrant pathways and network activities from high-throughput data.

Authors:  Jinlian Wang; Yuji Zhang; Catalin Marian; Habtom W Ressom
Journal:  Brief Bioinform       Date:  2012-01-27       Impact factor: 11.622

2.  Independent component analysis: mining microarray data for fundamental human gene expression modules.

Authors:  Jesse M Engreitz; Bernie J Daigle; Jonathan J Marshall; Russ B Altman
Journal:  J Biomed Inform       Date:  2010-07-07       Impact factor: 6.317

3.  Unsupervised Extraction of Stable Expression Signatures from Public Compendia with an Ensemble of Neural Networks.

Authors:  Jie Tan; Georgia Doing; Kimberley A Lewis; Courtney E Price; Kathleen M Chen; Kyle C Cady; Barret Perchuk; Michael T Laub; Deborah A Hogan; Casey S Greene
Journal:  Cell Syst       Date:  2017-07-12       Impact factor: 10.304

4.  Genetic and nongenetic variation revealed for the principal components of human gene expression.

Authors:  Anita Goldinger; Anjali K Henders; Allan F McRae; Nicholas G Martin; Greg Gibson; Grant W Montgomery; Peter M Visscher; Joseph E Powell
Journal:  Genetics       Date:  2013-09-11       Impact factor: 4.562

5.  Principal component analysis based methods in bioinformatics studies.

Authors:  Shuangge Ma; Ying Dai
Journal:  Brief Bioinform       Date:  2011-01-17       Impact factor: 11.622

6.  A network-based gene-weighting approach for pathway analysis.

Authors:  Zhaoyuan Fang; Weidong Tian; Hongbin Ji
Journal:  Cell Res       Date:  2011-09-06       Impact factor: 25.617

7.  Adaptive elastic-net sparse principal component analysis for pathway association testing.

Authors:  Xi Chen
Journal:  Stat Appl Genet Mol Biol       Date:  2011-10-24

Review 8.  Systems analysis of high-throughput data.

Authors:  Rosemary Braun
Journal:  Adv Exp Med Biol       Date:  2014       Impact factor: 2.622

9.  Integrative sparse principal component analysis of gene expression data.

Authors:  Mengque Liu; Xinyan Fan; Kuangnan Fang; Qingzhao Zhang; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2017-11-08       Impact factor: 2.135

10.  Detection of gene pathways with predictive power for breast cancer prognosis.

Authors:  Shuangge Ma; Michael R Kosorok
Journal:  BMC Bioinformatics       Date:  2010-01-01       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.