| Literature DB >> 23815123 |
Irina Dinu1, Xiaoming Wang, Linda E Kelemen, Shabnam Vatanpour, Saumyadipta Pyne.
Abstract
BACKGROUND: Gene set analysis (GSA) methods test the association of sets of genes with a phenotype in gene expression microarray studies. Many GSA methods have been proposed, especially methods for use with a binary phenotype. Equally, if not more importantly however, is the ability to test the enrichment of a gene signature or pathway against the continuous phenotypes which are routinely and commonly observed in, for example, clinicopathological measurements. It is not always easy or meaningful to dichotomize continuous phenotypes into two classes, and attempting to do this may lead to the inaccurate classification of samples, which would affect the downstream enrichment analysis. In the present study, we have build on recent efforts to incorporate correlation structure within gene sets and pathways into the GSA test statistic. To address the issue of continuous phenotypes directly without the need for artificial discrete classification and thus increase the power of the test while ensuring computational efficiency and rigor, new GSA methods that can incorporate a covariance matrix estimator for a continuous phenotype may present an effective approach.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23815123 PMCID: PMC3717275 DOI: 10.1186/1471-2105-14-212
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Type I errors for four GSA methods: LCT, LCT, SAM-GS and Global-Test
| 10 | 10 | 10 | ||||||||||
| 20 | 20 | 20 | ||||||||||
| 5 | 5 | 5 | ||||||||||
| Method | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 |
| LCT | .005 | .003 | .003 | .002 | .012 | .012 | .008 | .009 | .045 | .049 | .044 | .052 |
| LCT2 | .006 | .007 | .004 | .003 | .013 | .013 | .009 | .009 | .048 | .057 | .048 | .046 |
| SAMGS | .004 | .005 | .006 | .004 | .014 | .012 | .007 | .008 | .044 | .051 | .048 | .050 |
| Global | .007 | .008 | .011 | .011 | .013 | .019 | .0117 | .015 | .053 | .052 | .054 | .053 |
| 20 | 20 | 20 | ||||||||||
| 100 | 100 | 100 | ||||||||||
| 20 | 20 | 20 | ||||||||||
| Method | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 |
| LCT | .010 | .009 | .010 | .010 | .012 | .014 | .014 | .016 | .056 | .053 | .051 | .058 |
| LCT2 | .011 | .011 | .009 | .007 | .014 | .017 | .017 | .014 | .056 | .055 | .055 | .054 |
| SAMGS | .010 | .011 | .010 | .007 | .012 | .017 | .018 | .014 | .056 | .053 | .052 | .051 |
| Global | .004 | .004 | .005 | .006 | .010 | .005 | .006 | .012 | .033 | .041 | .043 | .052 |
| 50 | 50 | 50 | ||||||||||
| 200 | 200 | 200 | ||||||||||
| 40 | 40 | 40 | ||||||||||
| Method | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 |
| LCT | .010 | .010 | .010 | .011 | .016 | .016 | .015 | .015 | .057 | .055 | .051 | .053 |
| LCT2 | .008 | .010 | .011 | .008 | .016 | .016 | .017 | .013 | .053 | .058 | .049 | .059 |
| SAMGS | .010 | .008 | .011 | .009 | .015 | .015 | .015 | .019 | .056 | .057 | .051 | .050 |
| Global | .004 | .004 | .004 | .001 | .009 | .012 | .012 | .010 | .040 | .059 | .052 | .051 |
| 100 | 100 | 100 | ||||||||||
| 400 | 400 | 400 | ||||||||||
| 60 | 60 | 60 | ||||||||||
| Method | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 | ρ = 0.0 | ρ = 0.3 | ρ = 0.6 | ρ = 0.9 |
| LCT | .008 | .006 | .003 | .005 | .012 | .010 | .010 | .007 | .049 | .046 | .043 | .034 |
| LCT2 | .009 | .006 | .004 | .004 | .012 | .011 | .011 | .010 | .055 | .054 | .042 | .034 |
| SAMGS | .006 | .007 | .005 | .008 | .012 | .009 | .010 | .011 | .047 | .050 | .047 | .040 |
| Global | .002 | .003 | .003 | .002 | .009 | .005 | .007 | .006 | .056 | .036 | .034 | .034 |
Figure 1Power comparison ( = 20 and = 100) between four GSA approaches: LCT, LCT, SAM-GS and Global Test.
Percentages of gene sets with p-values less than 0.005, 0.01, 0.05 and 0.10
| | ≤.005 | ≤.01 | ≤.05 | ≤.10 |
| LCT | 3 | 4 | 27 | 63 |
| LCT2 | 3 | 4 | 20 | 46 |
| SAM-GS | 1 | 2 | 14 | 27 |
| Global Test | 0 | 2 | 15 | 34 |
Gene sets and pathways associated with gene expression measurements
| NADLER_OBESITY_UP | 46 | 0 | 0.004 | 0.108 | 0.098 |
| HSA04920_ADIPOCYTOKINE_SIGNALING_PATHWAY | 68 | 0.003 | 0.003 | 0.042 | 0.032 |
| HSA04140_REGULATION_OF_AUTOPHAGY | 26 | 0.004 | 0.007 | 0.003 | 0.002 |
| HIF1_TARGETS | 32 | 0.006 | 0.025 | 0.027 | 0.03 |
| DORSEY_DOXYCYCLINE_UP | 29 | 0.011 | 0.063 | 0.174 | 0.174 |
| SHIPP_DLBCL_CURED_UP | 28 | 0.013 | 0.003 | 0.01 | 0.02 |
| JNK_UP | 24 | 0.015 | 0.026 | 0.083 | 0.074 |
| PROSTAGLANDIN_SYNTHESIS_REGULATION | 28 | 0.016 | 0.04 | 0.165 | 0.155 |
| CARDIACEGFPATHWAY | 16 | 0.019 | 0.018 | 0.02 | 0.01 |
| CITED1_KO_HET_UP | 23 | 0.022 | 0.023 | 0.036 | 0.031 |
| XU_CBP_DN | 32 | 0.022 | 0.027 | 0.06 | 0.064 |
| CHREBPPATHWAY | 16 | 0.027 | 0.029 | 0.029 | 0.023 |
| OXSTRESS_BREASTCA_UP | 24 | 0.027 | 0.046 | 0.044 | 0.047 |
| AGUIRRE_PANCREAS_CHR17 | 61 | 0.029 | 0.034 | 0.082 | 0.06 |
| ST_GAQ_PATHWAY | 27 | 0.031 | 0.047 | 0.109 | 0.083 |
| HSA04340_HEDGEHOG_SIGNALING_PATHWAY | 46 | 0.032 | 0.038 | 0.036 | 0.036 |
| NFATPATHWAY | 47 | 0.034 | 0.041 | 0.065 | 0.053 |
| HYPOXIA_REVIEW | 75 | 0.035 | 0.055 | 0.098 | 0.095 |
| HSA04614_RENIN_ANGIOTENSIN_SYSTEM | 16 | 0.04 | 0.108 | 0.087 | 0.076 |
| CPR_NULL_LIVER_DN | 16 | 0.041 | 0.047 | 0.038 | 0.036 |
| HSA00380_TRYPTOPHAN_METABOLISM | 49 | 0.043 | 0.055 | 0.11 | 0.1 |
| HSA04630_JAK_STAT_SIGNALING_PATHWAY | 135 | 0.045 | 0.065 | 0.173 | 0.167 |
| DIAB_NEPH_UP | 58 | 0.046 | 0.051 | 0.196 | 0.194 |
| TRYPTOPHAN_METABOLISM | 57 | 0.049 | 0.064 | 0.089 | 0.09 |
| INSULIN_SIGNALING | 93 | 0.049 | 0.068 | 0.187 | 0.18 |
| PASSERINI_GROWTH | 32 | 0.049 | 0.12 | 0.31 | 0.319 |
| TNFA_NFKB_DEP_UP | 18 | 0.05 | 0.07 | 0.152 | 0.169 |
| FRUCTOSE_AND_MANNOSE_METABOLISM | 24 | 0.055 | 0.02 | 0.039 | 0.041 |
| ANDROGEN_AND_ESTROGEN_METABOLISM | 21 | 0.058 | 0.028 | 0.046 | 0.042 |
| POMEROY_DESMOPLASIC_VS_CLASSIC_MD_DN | 38 | 0.091 | 0.048 | 0.13 | 0.116 |
| TCA | 15 | 0.101 | 0.058 | 0.042 | 0.037 |
| HSA03050_PROTEASOME | 21 | 0.103 | 0.113 | 0.051 | 0.048 |
| PROTEASOME_DEGRADATION | 27 | 0.122 | 0.082 | 0.028 | 0.029 |
The p-values from the four GSA methods, LCT, LCT2, SAM-GS and Global Test, are shown.