| Literature DB >> 22373100 |
Julius S Ngwa1, Alisa K Manning, Jonna L Grimsby, Chen Lu, Wei V Zhuang, Anita L Destefano.
Abstract
Genome-wide association studies often emphasize single-nucleotide polymorphisms with the smallest p-values with less attention given to single-nucleotide polymorphisms not ranked near the top. We suggest that gene pathways contain valuable information that can enable identification of additional associations. We used gene set information to identify disease-related pathways using three methods: gene set enrichment analysis (GSEA), empirical enrichment p-values, and Ingenuity pathway analysis (IPA). Association tests were performed for common single-nucleotide polymorphisms and aggregated rare variants with traits Q1 and Q4. These pathway methods were evaluated by type I error, power, and the ranking of the VEGF pathway, the gene set used in the simulation model. GSEA and IPA had high power for detecting the VEGF pathway for trait Q1 (91.2% and 93%, respectively). These two methods were conservative with deflated type I errors (0.0083 and 0.0072, respectively). The VEGF pathway ranked 1 or 2 in 123 of 200 replicates using IPA and ranked among the top 5 in 114 of 200 replicates for GSEA. The empirical enrichment method had lower power and higher type I error. Thus pathway analysis approaches may be useful in identifying biological pathways that influence disease outcomes.Entities:
Year: 2011 PMID: 22373100 PMCID: PMC3287852 DOI: 10.1186/1753-6561-5-S9-S18
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Genomic control (λ) based on association testing of all common variants (MAF ≥ 0.01)
| Trait | No ethnicity adjustment | Ethnicity (dummy variables) | Principal components | |||
| Replicate 1 | Replicate 100 | Replicate 1 | Replicate 100 | Replicate 1 | Replicate 100 | |
| Q1 | 2.525 | 2.654 | 1.649 | 1.639 | 1.040 | 1.086 |
| Q4 | 0.915 | 0.916 | 1.011 | 0.964 | 0.966 | 0.929 |
All models were adjusted for age, sex, and smoking with columns representing different ethnicity adjustments.
Rank and nominal p-value from the association test between Q1 and the genes in the VEGF pathway included in the simulating model for Q1
| Rank | 112 | 1,007 | 1 | 1,013 | 698 | 544 | 34 | 985 | 214 |
| 0.009 | 0.1359 | 2.4 × 10−13 | 0.1441 | 0.0885 | 0.0616 | 0.0017 | 0.1380 | 0.0186 |
The smallest p-value from either single SNP association of common variants or aggregate of rare variants within a gene was chosen to represent the gene. All models were adjusted for age, sex, smoking, and principal components.
Comparison of type I error and power calculations among all three methods
| Method | Power | Median type I error |
|---|---|---|
| GSEA | 0.912 | 0.0083 |
| Empirical enrichment | 0.429 | 0.0133 |
| IPA | 0.930 | 0.0072 |
Figure 1Histogram, across replicates, of the ranking of the VEGF pathway among all candidate pathways using Ingenuity pathway analysis and gene set enrichment analysis.