
Integrative gene set analysis of multi-platform data with sample heterogeneity.

Jun Hu, Jung-Ying Tzeng.

Abstract

MOTIVATION: Gene set analysis is a popular method for large-scale genomic studies. Because genes that have common biological features are analyzed jointly, gene set analysis often achieves better power and generates more biologically informative results. With the advancement of technologies, genomic studies with multi-platform data have become increasingly common. Several strategies have been proposed that integrate genomic data from multiple platforms to perform gene set analysis. To evaluate the performance of existing integrative gene set methods under various scenarios, we conduct a comparative simulation analysis based on The Cancer Genome Atlas breast cancer dataset.
RESULTS: We find that existing methods for gene set analysis are less effective when sample heterogeneity exists. To address this issue, we develop three methods for multi-platform genomic data with heterogeneity: two non-parametric methods, multi-platform Mann-Whitney statistics and multi-platform outlier robust T-statistics, and a parametric method, multi-platform likelihood ratio statistics. Using simulations, we show that the proposed multi-platform Mann-Whitney statistics method has higher power for heterogeneous samples and comparable performance for homogeneous samples when compared with the existing methods. Our real data applications to two datasets of The Cancer Genome Atlas also suggest that the proposed methods are able to identify novel pathways that are missed by other strategies.
AVAILABILITY AND IMPLEMENTATION: http://www4.stat.ncsu.edu/~jytzeng/Software/Multiplatform_gene_set_analysis/
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
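
As a rough illustration of the rank-based idea named in RESULTS, the sketch below scores one gene set across several platforms with a standardized Mann-Whitney statistic per gene and assesses it by label permutation. This is not the authors' multi-platform Mann-Whitney statistic, whose exact definition is given in the paper and in the software at the URL above. The function names, the sum-of-squared-z combination rule, the normal approximation without tie correction, and the toy numbers are all assumptions made for illustration.

```python
import random


def mann_whitney_z(cases, controls):
    """Standardized Mann-Whitney U (normal approximation, no tie correction)."""
    n1, n2 = len(cases), len(controls)
    # U counts case/control pairs where the case value is larger; ties count 0.5.
    u = sum((x > y) + 0.5 * (x == y) for x in cases for y in controls)
    mu = n1 * n2 / 2.0
    sigma = (n1 * n2 * (n1 + n2 + 1) / 12.0) ** 0.5
    return (u - mu) / sigma


def gene_set_score(platforms, labels):
    """Sum of squared per-gene z-scores over all platforms (assumed combination rule).

    platforms: list of matrices; each matrix is a list of gene rows,
               each row a list of per-sample values in the same sample order.
    labels:    list of 0/1 group labels, one per sample.
    """
    score = 0.0
    for matrix in platforms:
        for row in matrix:
            cases = [v for v, g in zip(row, labels) if g == 1]
            controls = [v for v, g in zip(row, labels) if g == 0]
            score += mann_whitney_z(cases, controls) ** 2
    return score


def permutation_p_value(platforms, labels, n_perm=500, seed=0):
    """Permutation p-value: shuffle group labels and recompute the set score."""
    rng = random.Random(seed)
    observed = gene_set_score(platforms, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if gene_set_score(platforms, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


# Toy data (made-up numbers): 2 platforms x 2 genes x 10 samples.
labels = [1] * 5 + [0] * 5
expression = [
    [9, 8, 7, 9, 8, 1, 2, 1, 2, 3],
    [5, 4, 6, 5, 4, 5, 6, 4, 5, 6],
]
methylation = [
    [0.9, 0.8, 0.7, 0.9, 0.8, 0.1, 0.2, 0.1, 0.2, 0.3],
    [0.5] * 10,
]
p = permutation_p_value([expression, methylation], labels)
print(f"permutation p-value: {p:.3f}")
```

Permuting sample labels, rather than genes, keeps the correlation between genes and between platforms intact under the null, which is one common reason to prefer it for gene set tests.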

Year: 2014; PMID: 24489370; PMCID: PMC4029033; DOI: 10.1093/bioinformatics/btu060

Source DB: PubMed; Journal: Bioinformatics; ISSN: 1367-4803; Impact factor: 6.937


