Literature DB >> 19168911

Sparse linear discriminant analysis for simultaneous testing for the significance of a gene set/pathway and gene selection.

Michael C Wu1, Lingsong Zhang, Zhaoxi Wang, David C Christiani, Xihong Lin.   

Abstract

MOTIVATION: Pathway and gene set-based approaches for the analysis of gene expression profiling experiments have become increasingly popular for addressing problems associated with individual gene analysis. Since most genes are not differently expressed, existing gene set tests, which consider all the genes within a gene set, are subject to considerable noise and power loss, a concern exacerbated in studies in which the degree of differential expression is moderate for truly differentially expressed genes. For a significantly differentially expressed pathway, it is also of substantial interest to select important genes that drive the differential expression of the pathway.
METHODS: We develop a unified framework to jointly test the significance of a pathway and to select a subset of genes that drive the significant pathway effect. To achieve dimension reduction and gene selection, we decompose each gene pathway into a single score by using a regularized form of linear discriminant analysis, called sparse linear discriminant analysis (sLDA). Testing for the significance of the pathway effect proceeds via permutation of the sLDA score. The sLDA-based test is compared with competing approaches with simulations and two applications: a study on the effect of metal fume exposure on immune response and a study of gene expression profiles among Type II Diabetes patients.
RESULTS: Our results show that sLDA-based testing provides a powerful approach to test for the significance of a differentially expressed pathway and gene selection. AVAILABILITY: An implementation of the proposed sLDA-based pathway test in the R statistical computing environment is available at http://www.hsph.harvard.edu/~mwu/software/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2009        PMID: 19168911      PMCID: PMC2732305          DOI: 10.1093/bioinformatics/btp019

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  Global functional profiling of gene expression.

Authors:  Sorin Draghici; Purvesh Khatri; Rui P Martins; G Charles Ostermeier; Stephen A Krawetz
Journal:  Genomics       Date:  2003-02       Impact factor: 5.736

2.  Analyzing gene expression data in terms of gene sets: methodological issues.

Authors:  Jelle J Goeman; Peter Bühlmann
Journal:  Bioinformatics       Date:  2007-02-15       Impact factor: 6.937

3.  Effects of glucose metabolism on the regulation of genes of fatty acid synthesis and triglyceride secretion in the liver.

Authors:  Núria Morral; Howard J Edenberg; Scott R Witting; Jennifer Altomonte; Tearina Chu; Matthew Brown
Journal:  J Lipid Res       Date:  2007-04-21       Impact factor: 5.922

4.  Purification and properties of liver fructose 1,6-bisphosphatase from C57BL/KsJ normal and diabetic mice.

Authors:  F Marcus; M M Hosey
Journal:  J Biol Chem       Date:  1980-03-25       Impact factor: 5.157

5.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

6.  Diagnosis of multiple cancer types by shrunken centroids of gene expression.

Authors:  Robert Tibshirani; Trevor Hastie; Balasubramanian Narasimhan; Gilbert Chu
Journal:  Proc Natl Acad Sci U S A       Date:  2002-05-14       Impact factor: 11.205

7.  Proteome analysis of skeletal muscle from obese and morbidly obese women.

Authors:  Dustin S Hittel; Yetrib Hathout; Eric P Hoffman; Joseph A Houmard
Journal:  Diabetes       Date:  2005-05       Impact factor: 9.461

8.  Insulin mediates the stimulation of pyruvate kinase by a dual mechanism.

Authors:  W C Parks; R L Drake
Journal:  Biochem J       Date:  1982-11-15       Impact factor: 3.857

9.  Serum gamma-glutamyltransferase and risk of metabolic syndrome and type 2 diabetes in middle-aged Japanese men.

Authors:  Noriyuki Nakanishi; Kenji Suzuki; Kozo Tatara
Journal:  Diabetes Care       Date:  2004-06       Impact factor: 19.112

10.  Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application.

Authors:  C Li; W Hung Wong
Journal:  Genome Biol       Date:  2001-08-03       Impact factor: 13.583

View more
  21 in total

1.  Subcellular localization of Gram-negative bacterial proteins using sparse learning.

Authors:  Zhonglong Zheng; Jie Yang
Journal:  Protein J       Date:  2010-04       Impact factor: 2.371

2.  Integrative linear discriminant analysis with guaranteed error rate improvement.

Authors:  Quefeng Li; Lexin Li
Journal:  Biometrika       Date:  2018-10-22       Impact factor: 2.445

3.  A regularized Hotelling's T2 test for pathway analysis in proteomic studies.

Authors:  Lin S Chen; Debashis Paul; Ross L Prentice; Pei Wang
Journal:  J Am Stat Assoc       Date:  2011-12       Impact factor: 5.033

4.  Power calculation for overall hypothesis testing with high-dimensional commensurate outcomes.

Authors:  Yueh-Yun Chi; Matthew J Gribbin; Jacqueline L Johnson; Keith E Muller
Journal:  Stat Med       Date:  2013-09-30       Impact factor: 2.373

5.  Graph-based sparse linear discriminant analysis for high-dimensional classification.

Authors:  Jianyu Liu; Guan Yu; Yufeng Liu
Journal:  J Multivar Anal       Date:  2018-12-17       Impact factor: 1.473

6.  Pathway analysis using random forests with bivariate node-split for survival outcomes.

Authors:  Herbert Pang; Debayan Datta; Hongyu Zhao
Journal:  Bioinformatics       Date:  2009-11-18       Impact factor: 6.937

7.  Sparsifying the Fisher Linear Discriminant by Rotation.

Authors:  Ning Hao; Bin Dong; Jianqing Fan
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2014-11-07       Impact factor: 4.488

8.  Analyzing LC/MS metabolic profiling data in the context of existing metabolic networks.

Authors:  Tianwei Yu; Yun Bai
Journal:  Curr Metabolomics       Date:  2013-01-01

9.  Prioritizing individual genetic variants after kernel machine testing using variable selection.

Authors:  Qianchuan He; Tianxi Cai; Yang Liu; Ni Zhao; Quaker E Harmon; Lynn M Almli; Elisabeth B Binder; Stephanie M Engel; Kerry J Ressler; Karen N Conneely; Xihong Lin; Michael C Wu
Journal:  Genet Epidemiol       Date:  2016-08-03       Impact factor: 2.135

10.  Optimal Feature Selection in High-Dimensional Discriminant Analysis.

Authors:  Mladen Kolar; Han Liu
Journal:  IEEE Trans Inf Theory       Date:  2015-02       Impact factor: 2.501

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.