| Literature DB >> 25519358 |
Jeremy S Edwards1, Susan R Atlas2, Susan M Wilson3, Candice F Cooper1, Li Luo4, Christine A Stidley4.
Abstract
Genome wide association studies (GWAS) have been used to search for associations between genetic variants and a phenotypic trait of interest. New technologies, such as next-generation sequencing, hold the potential to revolutionize GWAS. However, millions of polymorphisms are identified with next-generation sequencing technology. Consequently, researchers must be careful when performing such a large number of statistical tests, and corrections are typically made to account for multiple testing. Additionally, for typical GWAS, the p value cutoff is set quite low (approximately <10(-8)). As a result of this p value stringency, it is likely that there are many true associations that do not meet this threshold. To account for this we have incorporated a priori biological knowledge to help identify true associations that may not have reached statistical significance. We propose the application of a pipelined series of statistical and bioinformatic methods, to enable the assessment of the association of genetic polymorphisms with a disease phenotype--here, hypertension--as well as the identification of statistically significant pathways of genes that may play a role in the disease process.Entities:
Year: 2014 PMID: 25519358 PMCID: PMC4143684 DOI: 10.1186/1753-6561-8-S1-S104
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Summary information on demographic and phenotypic variables
| Demographic or phenotypic variable | Summary measures |
|---|---|
| Sex (% female) | 57.9 |
| Age (median [min, max]) in years | 38 (16, 94) |
| Smoking status (% current smoker) | 21.7 |
| Taking blood pressure medication (%) | 10.0 |
| Ever hypertensive (%) | 40.0 |
| SBP (median [25th, 75th percentiles]) (mm Hg) | 118 (110, 130) |
| DBP (median [25th, 75th percentiles]) (mm Hg) | 72 (65, 78) |
| SBP (mean [std]) (mm Hg) | 0.95 (2.09) |
| DBP (mean [std]) (mm Hg) | 0.23 (1.27) |
Figure 1Genome wide association scans for 5 different phenotypes related to hypertension. For each of the 5 phenotypes, the -log10 of the p value associated with each SNV in (or near) the coding regions. Data from only odd-number chromosomes was provided as part of the GAW18 project; consequently, there is no information for any of the even-number chromosomes, which appear as blank regions on the plots. HTN, hypertension.
Figure 2Venn diagram of top genes for each phenotype. A list of all SNVs that had p values <0.01 was analyzed. All the genes in these regions were tabulated and a Venn diagram was constructed to identify which genes existed on each of the 5 different lists. DDBP, change in DBP overtime; HTN, hypertension; DSBP, change in SBP overtime
Figure 3Pathways generated by lists of genes from the association analysis. A, Pathway analysis of top 40 statistically significant genes (p <3.5 × 10−5) derived from the union of the genes across all 5 phenotypes, MIPA = 70. Pathway score = 77, corresponding to p = 10−77. This was the only statistically significant pathway obtained for this set of genes. Molecules in red correspond to genes (N = 30, 75%) from the input list of 40. Functional annotations for this pathway include cell death and survival, gastrointestinal disease, and inflammatory disease. B, Pathway analysis of the top 38 statistically significant genes (p <1.0 × 10−4) derived from the hypertension phenotype, MIPA = 70. Pathway score = 62, corresponding to p = 10−62. This was the only statistically significant pathway obtained for this set of genes. Molecules in red correspond to genes (N = 25, 66%) from the input list of 38. Functional annotations for this pathway include cellular assembly and organization, cellular function and maintenance, and cell death and survival.