| Literature DB >> 19161620 |
Andrew D Johnson1, Christopher J O'Donnell.
Abstract
BACKGROUND: The number of genome-wide association studies (GWAS) is growing rapidly leading to the discovery and replication of many new disease loci. Combining results from multiple GWAS datasets may potentially strengthen previous conclusions and suggest new disease loci, pathways or pleiotropic genes. However, no database or centralized resource currently exists that contains anywhere near the full scope of GWAS results.Entities:
Mesh:
Year: 2009 PMID: 19161620 PMCID: PMC2639349 DOI: 10.1186/1471-2350-10-6
Source DB: PubMed Journal: BMC Med Genet ISSN: 1471-2350 Impact factor: 2.103
Figure 1Genome-wide plots of available GWAS results for all associations P = 0.0001.
Figure 2Genome-wide plots of available GWAS results for all associations 5 × 10.
Figure 3Cumulative percent of GWAS association SNPs falling within specified distances from a protein-coding RefSeq gene. SNP frequency by distance is plotted for significant GWAS associations from studies that employed only Illumina of only Affymetrix platforms (in 10 kb bins).
Figure 4Cumulative percent of arrayed SNPs falling within specified distances from a protein-coding RefSeq gene. SNP frequency by distance is plotted for arrayed SNPs on Affymetrix or Illumina arrays (in 10 kb bins). The gene-centric arrays from each company are not displayed since relatively few studies analyzed here have published results based on these arrays. The Illumina Hap240, Hap300 and Hap650 are omitted because they show a highly similar pattern to the Hap550 array.
Densest regions of GWAS associations in the human genome based on analysis of 100 kb bins.
| Chr | Position | GWAS assoc. in bin | Density rank | LD adj. rank | Min. p value | Gene(s) | Phenotype(s) in decreasing order of occurrence |
| 6 | 32405288–32497626 | 177 | 1 | 1 | 5.47E-111 | RA (65), Type I (50), MS (26), Type II (17), combined WTCCC cases (10), CAD (8), Alzheimer's (1) | |
| 6 | 32703038–32798923 | 107 | 3 | 2 | 1.12E-307 | Type I (47), RA (30), MS (14), combined WTCCC cases (7), Celiac disease (3), SLE (2), Crohn's disease (1), AAT levels (1), Type II (1), Alzheimer's (1) | |
| 6 | 32806435–32896489 | 116 | 2 | 3 | 4.86E-79 | Type I (59), RA (25), MS (21), combined WTCCC cases (9), Early onset obesity (1), 2 hr glucose tolerance test (1) | |
| 2 | 60502647–60591731 | 77 | 5 | 4 | 6.70E-35 | HbF (75), Type II (1), Hypertension (1) | |
| 2 | 60415435–60494833 | 87 | 4 | 5 | 2.50E-12 | downstream from | HbF (75), Type II (7), CAD (1), Abdominal aortic calcification (1), combined WTCCC cases (1), Bipolar (1), Parkinson's (1) |
| 1 | 67406551–67498692 | 65 | 6 | 6 | 3.03E-23 | Crohn's disease (60), Psoriasis (2), RA (1), CAD (1), combined WTCCC cases (1) | |
| 9 | 89310993–89384465 | 46 | 9 | 7 | 2.80E-05 | Alzheimer's (19), HOMA-IR (17), Waist:height (5), Type II (2), Fasting insulin (2), RA (1) | |
| 6 | 151254904–151296407 | 17 | 91 | 8 | 2.90E-08 | CAD (10), Type 1 (2), combined WTCCC cases (1), Hypertension (1), FPG (1), Insulinogenic index (1), ALS (1) | |
| 11 | 5401462–5499595 | 29 | 23 | 9 | 9.46E-24 | MCV (8), HbA2 (7), RBC (3), HbA1C (1), CAD (3), Type II (3), Fat mass (2), Bipolar disorder (1), Crohn's disease (1) | |
| 16 | 55526649–55589312 | 22 | 43 | 10 | 1.00E-73 | HDL cholesterol (16), Type II (3), ApoAI (2), HTN (1) | |
| 9 | 116701486–116796674 | 31 | 22 | 11 | 1.61E-09 | Crohn's disease (29), 2 hr glucose tolerance test (2) | |
| 8 | 128116770–128194686 | 24 | 31 | 12 | 1.10E-12 | intergenic, 8q24 | Colorectal cancer (10), Prostate cancer (8), Bipolar disorder (3), CAD (2), Hypertension (1) |
| 8 | 19857092–19899552 | 42 | 11 | 13 | 2.00E-28 | Triglycerides (22), HDL (11), Triglyceride:HDL (4), CAD (4), RA (1), Hip circumference (1) | |
| 9 | 22102599–22126489 | 26 | 29 | 20 | 1.00E-20 | Type II (12), CAD (11), combined WTCCC cases (2), Coronary artery calcification (1) | |
| 10 | 123321680–123395011 | 28 | 24 | 21 | 2.00E-76 | Breast cancer (24), RA (2), Type II (1), combined WTCCC cases (1) | |
| 6 | 20706282–20796100 | 33 | 21 | 28 | 4.11E-11 | Type II (32), Insulin response (1) | |
| 16 | 49300832–49399578 | 47 | 8 | 31 | 2.51E-49 | Crohn's disease (44), RA (1), combined WTCCC cases (1), ApoB (1) | |
| 16 | 52327178–52389272 | 38 | 14 | 33 | 7.30E-14 | TypeII (15), Hip circumference (15), Early onset obesity (6), Fasting glucose (1), ALS (1) | |
| 10 | 114722872–114798892 | 39 | 13 | 34 | 1.00E-48 | Type II (36), Waist:height (2), BMI (1) | |
| 9 | 22007836–22093813 | 52 | 7 | 41 | 2.10E-19 | CAD (48), Coronary artery calcification (2), Type II (2), Type I (1), RA (1) | |
| 17 | 35304874–35382291 | 44 | 10 | 43 | 2.00E-23 | Childhood asthma (34), ORMDL3 expression (7), Crohn's disease (3) | |
| 1 | 228303905–228375529 | 24 | 32 | 55 | 7.00E-15 | Type II (9), Triglycerides (7), HDL (6), Crohn's disease (1), ALS (1) | |
| 16 | 49218265–49298963 | 34 | 19 | 73 | 4.71E-29 | Crohn's disease (33), Bipolar disorder (1) | |
| 11 | 41805501–41887387 | 34 | 20 | 74 | 5.70E-08 | 11p12, 800 kb from | Type II (33), Coronary spasm (1) |
| 1 | 109608806–109623689 | 23 | 37 | 92 | 1.20E-33 | LDL cholesterol (16), CAD (4), Total cholesterol (1), Bipolar disorder (1) | |
| 8 | 128404855–128498005 | 23 | 38 | 99 | 1.27E-14 | Prostate cancer (8), Colorectal cancer (13), Breast cancer (1), Hypertension (1) | |
| 5 | 40513272–40597115 | 40 | 12 | 264 | 8.73E-12 | intergenic, 5p31.1 | Crohn's disease (40) |
| 20 | 33403387–33489397 | 36 | 17 | 322 | 5.01E-12 | Height (35), Hypertension (1) | |
| 10 | 64100271–64194296 | 24 | 35 | 494 | 1.00E-10 | Crohn's disease (16), Type II (8) | |
| 17 | 41401810–41495480 | 38 | 15 | 526 | 3.53E-05 | Crohn's disease (26), Alzheimer's (11), Insulinogenic index (1) | |
| 10 | 44011247–44096316 | 23 | 39 | 647 | 9.46E-08 | intergenic, 10q11.21 | CAD (20), Diastolic blood pressure (2), Fasting glucose (1) |
| 17 | 41305238–41395352 | 37 | 16 | 746 | 2.78E-05 | Crohn's disease (19), Alzheimer's (18) | |
| 2 | 21223320–21252721 | 27 | 27 | 796 | 8.10E-09 | upstream of | LDL cholesterol (17), Total cholesterol (10) |
Densest regions of GWAS associations in the human genome previously highlighted in a GWAS study (based on analysis of 100 kb bins). Bin rankings are given based on simple counts of associations, and after adjusting for HapMap CEU LD (average pairwise r2 in the same region), out of 18,310 bins in the genome that contained 1 or more GWAS association. Bins with known, replicated associations ranked in the top 40 by density and/or after LD adjustment are listed.
Densest regions of GWAS associations in the human genome based on analysis of 100 kb bins.
| Chr | Position | GWAS assoc. in bin | Density rank | LD adj. rank | Min. p value | Gene(s) | Phenotype(s) in decreasing order of occurrence |
| 10 | 6316580–6390596 | 17 | 92 | 14 | 2.70E-06 | LDL cholesterol (4), Total cholesterol (4), Celiac disease (2), RA (2), Waist circumference (2), Bipolar disorder (2), Weight (1) | |
| 13 | 109616103–109677555 | 19 | 67 | 15 | 7.61E-06 | Weight (11), Type II (2), ALS (2), BMI (1), Height (1), Diastolic blood pressure (1), combined WTCCC cases (1) | |
| 8 | 4621850–4669277 | 27 | 25 | 16 | 1.12E-04 | HIV-1 viral load set point (25), Crohn's disease (1), combined WTCCC cases (1) | |
| 3 | 21847197–21874359 | 20 | 56 | 17 | 2.00E-05 | intergenic, 3p24.1 | Type II (10), Diastolic blood pressure (4), Bipolar disorder (2), Alzheimer's (2), Parkinson's (1), Crohn's disease (1), combined WTCCC cases (1) |
| 13 | 23522469–23566520 | 15 | 142 | 18 | 2.62E-04 | Fat mass (8), Type II (3), Alzheimer's (2), Crohn's (1), RA (1) | |
| 13 | 109706778–109769126 | 9 | 447 | 24 | 2.0E-05 | HbA1c (3), CAD (2), RA (1), ApoB (1), Crohn's (1), Parkinson's (1), Alzheimer's (1) | |
| 20 | 59002911–59076486 | 17 | 94 | 25 | 2.11E-05 | intergenic, 20q13.33 | CAD (6), Type II (2), combined WTCCC cases (2), HTN (2), RA (1), Crohn's disease (1), Breast cancer (1), Alzheimer's (1), ALS (1) |
| 2 | 121639597–121685155 | 15 | 143 | 26 | 4.00E-06 | intergenic, 2q14.2 | Alzheimer's (4), Triglyceride/HDL ratio (4), Triglycerides (3), HDL (1), Type I (1), RA (1), Insulinogenic index (1) |
| 4 | 178511559–178569631 | 10 | 448 | 30 | 6.90E-06 | ALS (3), Parkinson's (1), Bipolar disorder (1), Type II (1), Type I (1), combined WTCCC cases (1), mFPG (1), heart rate variability (1) | |
| 16 | 76400711–76498035 | 11 | 333 | 32 | 1.10E-04 | 2 hr insulin test (5), combined WTCCC cases (2), LV diastolic dimension (1), Fasting insulin (1), HOMA-IR (1), Alzheimer's (1) | |
| 3 | 2801545–2885980 | 11 | 334 | 35 | 2.48E-05 | Total cholesterol (3), HIV-1 viral load set point (2), combined WTCCC cases (1), RA (1), Waist:height squared (1), 2 hr glucose tolerance test (1), Alzheimer's (1), ALS (1) | |
| 22 | 48109439–48199236 | 9 | 583 | 42 | 3.76E-05 | intergenic, 22q13.32 | Type II (3), CAD (2), HTN (2), LDL (1), Small vessel stroke (1) |
| 7 | 77505244–77596871 | 17 | 95 | 44 | 1.38E-04 | BMI (5), Waist circumference (5), Waist:height squared (5), HTN (1), ALS (1) | |
| 16 | 6603645–6620427 | 15 | 145 | 45 | 2.31E-05 | HOMA-IR (6), Fasting insulin (4), 2 hr insulin test (3), RA (1), Nicotine dependence (1) | |
| 8 | 134608378–134674046 | 14 | 177 | 46 | 5.66E-05 | Type II (6), Alzheimer's (6), Crohn's disease (2) | |
| 11 | 113445679–113499715 | 13 | 221 | 47 | 1.25E-05 | Waist circumference (4), HDL (3), BMI (2), Weight (2), Type II (1), LDL (1) | |
| 8 | 4200701–4270026 | 10 | 450 | 51 | 2.79E-04 | HIV-1 (4), HTN (1), Systolic blood pressure (1), Type I (1), FPG (1), Colorectal cancer (1), Alzheimer's (1) | |
| 20 | 54803001–54891697 | 8 | 761 | 52 | 2.65E-04 | intergenic, 20q13.31 | ApoB (2), CAD (1), HTN (1), Fasting glucose (1), 2 hr glucose (1), Alzheimer's (1), Triglyceride/HDL ratio (1) |
| 13 | 109813629–109898019 | 15 | 146 | 53 | 1.89E-05 | Type II (10), CAD (2), HTN (1), ALS (1), Alzheimer's (1) | |
| 16 | 82802796–82891261 | 8 | 762 | 54 | 1.60E-05 | HIV-1 (2), Type II (1), Crohn's disease (1), Fat mass (1), Insulinogenic index (1), ALS (1), combined WTCCC cases (1) | |
| 6 | 164313234–164367157 | 22 | 44 | 135 | 3.84E-05 | intergenic, 6q26 | Insulinogenic index (14), LDL (3), HOMA-IR (2), Alzheimer's (1), Parkinson's (1), RA (1) |
| 13 | 63720215–63799016 | 24 | 33 | 148 | 7.46E-05 | intergenic, 13q21.31 | HDL cholesterol (8), Waist circumference (7), Weight (5), CAD (2), BMI (1), Gallstone disease (1) |
| 3 | 162876798–162898990 | 24 | 34 | 229 | 2.41E-05 | intergenic, 3q26.1 | BMI (7), Waist circumference (6), Waist:height (6), Weight (2), Early onset Type II (1), Hypertension (1), Crohn's disease (1) |
| 12 | 58400177–58492903 | 35 | 18 | 267 | 1.12E-05 | Crohn's disease (21), HDL (14) | |
| 21 | 40500455–40546492 | 22 | 46 | 326 | 9.05E-05 | Height (11), Weight (7), Waist circumference (4) | |
| 12 | 111841825–111894699 | 22 | 47 | 440 | 5.92E-05 | Crohn's disease (18), Type I (4) | |
| 7 | 86415282–86498669 | 23 | 40 | 1434 | 1.60E-04 | Type II (20), Triglyceride/HDL ratio (2), CAD (1) | |
| 4 | 35101778–35198927 | 23 | 41 | 1569 | 7.72E-05 | intergenic, 4p15.1 | Nicotine dependence (21), Type I (1), Hypertension (1) |
Densest regions of GWAS associations in the human genome not previously highlighted in a GWAS study (based on analysis of 100 kb bins). Bin rankings are given based on simple counts of associations, and after adjusting for HapMap CEU LD (average pairwise r2 in the same region), out of 18,310 bins in the genome that contained 1 or more GWAS association. Bins with associations not previously highlighted and ranked in the top 55 by density and/or after LD adjustment are listed.
Top repeated occurrences from 52,554 unique SNPs among 56,411 SNP-phenotype associations.
| SNPid | Repeat assoc. | Gene(s) | Trait(s) | P value range | Chr | Position | Gene size | SNP on platforms? | Perfect CEU LD proxies on other platforms? |
| rs4420638 | 11 | Lipids, Alzheimer's, | 1.7E-4 to 1.0E-60 | 19 | 50114786 | 4 kb | Affymetrix, Illumina | Y | |
| rs599839 | 8 | Lipids, CAD | 2.2E-5 to 1.2E-33 | 1 | 109623689 | 88 kb | Affymetrix, Illumina | Y | |
| rs693 | 8 | Lipids | 6.8E-4 to 1.0E-21 | 2 | 21085700 | 43 kb | Affymetrix, Illumina | Y | |
| rs17482753 | 6 | Lipids | 3.6E-5 to 5.9E-19 | 8 | 19876926 | 29 kb | Affymetrix | Y | |
| rs780094 | 6 | Triglycerides, | 6.0E-3 to 1.0E-15 | 2 | 27594741 | 27 kb | Affymetrix, Illumina | Y | |
| rs1801282 | 6 | TypeII, | 4.5E-2 to 1.7E-6 | 3 | 12368125 | 146 kb | Affymetrix, Illumina | Y | |
| rs910049 | 5 | MHC locus ( | TypeII, TypeI, MS, | 7.8E-4 to 3.0E-93 | 6 | 32423705 | 79 kb | Affymetrix, Illumina | Y |
| rs2076530 | 5 | MHC locus ( | MS, TypeI, RA | 3.2E-10 to 6.7E-74 | 6 | 32471794 | 12 kb | Affymetrix, Illumina | Y |
| rs7901695 | 5 | TypeII, | 2.6E-4 to 1.0E-48 | 10 | 114744078 | 216 kb | Affymetrix, Illumina | Y | |
| rs7903146 | 5 | TypeII | 5.5E-8 to 1.0E-48 | 10 | 114748339 | 216 kb | Illumina | N | |
| rs4506565 | 5 | TypeII, BMI | 8.6E-4 to 2.3E-31 | 10 | 114746031 | 216 kb | Affymetrix | N | |
| rs10503669 | 5 | Lipids | 9.1E-4 to 3.9E-22 | 8 | 19891970 | 29 kb | Affymetrix | Y | |
| rs562338 | 5 | Lipids | 2.2E-6 to 5.6E-22 | 2 | 21141826 | 43 kb | Affymetrix | Y | |
| rs11209026 | 5 | Crohn's, Psoriasis | 4.8E-4 to 6.6E-19 | 1 | 67478546 | 93 kb | Affymetrix, Illumina | Y | |
| rs8050136 | 5 | BMI, TypeII | 9.8E-7 to 7.3E-14 | 16 | 52373776 | 410 kb | Affymetrix, Illumina | Y | |
| rs9939609 | 5 | BMI, TypeII | 1.9E-6 to 9.0E-12 | 16 | 52378028 | 410 kb | Affymetrix | Y | |
| rs4970834 | 5 | Lipids, CAD | 3.4E-4 to 3.0E-11 | 1 | 109616403 | 26 kb | Affymetrix, Illumina | Y | |
| rs1111875 | 5 | TypeII | 1.4E-3 to 5.7E-10 | 10 | 94452862 | 6 kb | Affymetrix, Illumina | Y | |
| rs13266634 | 5 | TypeII | 3.3E-6 to 5.3E-8 | 8 | 118253964 | 42 kb | Illumina | N | |
| rs968671 | 5 | 3.3E-4 to 2.5E-6 | 15 | 25036654 | 650 kb | Affymetrix | N | ||
| rs7034356 | 5 | 3.7E-4 to 1.3E-5 | 9 | 133468838 | 161 kb | Affymetrix | Y | ||
| rs4740294 | 5 | 8.3E-4 to 1.3E-5 | 9 | 133462842 | 161 kb | Affymetrix | Y | ||
| rs481843 | 5 | Intergenic (near | Lipids, | 9.3E-4 to 2.1E-5 | 11 | 116031077 | - | Affymetrix | Y |
| rs2889849 | 5 | 8.8E-4 to 1.9E-4 | 20 | 32627938 | 117 kb | Affymetrix, Illumina | Y | ||
| rs10498015 | 5 | 9.7E-4 to 1.2E-4 | 2 | 214425824 | 165 kb | Affymetrix, Illumina | Y |
Highly repeated occurrences of unique SNPs among 56,411 SNP-phenotype associations. We report the presence of these SNPs and whether a perfect proxy LD SNP exists based on the HapMap CEU population, on both Affymetrix and Illumina platforms, and the other platform in the case of a SNP only genotyped on an array from one company. This information was determined using the SNAP tool [33] and demonstrates that signals from a small minority of these SNPs could be missed depending on the platform applied in each case. Underlined entries represent newly recognized loci or phenotype associations.
GOminer gene ontology analysis of GWAS results, indicating the most over-represented ontological categories.
| Including WTCCC/DGI results (n = 5966 genes) | Excluding WTCCC/DGI results (n = 2888 genes) | |||||
| Gene Ontology category | RefSeq genes in Category | RefSeq genes observed in GWAS | GO category enrichment | P-value for enrichment | FDR | P-value for enrichment |
| GO:0007155_cell_adhesion | 650 | 309 | 1.41 | 4.56E-14 | 0 | 1.99E-17 |
| GO:0051056_regulation_of_small_GTPase_mediated_signal_transduction | 161 | 94 | 1.73 | 9.73E-11 | 0 | 3.76E-06 |
| GO:0051179_localization | 2493 | 975 | 1.16 | 1.12E-10 | 0 | 2.32E-07 |
| GO:0006810_transport | 2128 | 837 | 1.17 | 1.13E-09 | 0 | 2.46E-06 |
| GO:0051234_establishment_of_localization | 2189 | 856 | 1.16 | 2.64E-09 | 0 | 2.92E-06 |
| GO:0006811_ion_transport | 682 | 297 | 1.29 | 2.39E-08 | 0 | 1.32E-08 |
| GO:0009966_regulation_of_signal_transduction | 408 | 189 | 1.37 | 5.37E-08 | 0 | 0.00132 |
| GO:0046578_regulation_of_Ras_protein_signal_transduction | 122 | 70 | 1.70 | 6.27E-08 | 0 | 0.00014 |
| GO:0007156_homophilic_cell_adhesion | 134 | 75 | 1.66 | 8.92E-08 | 0 | 6.35E-09 |
| GO:0016337_cell-cell_adhesion | 245 | 121 | 1.46 | 2.26E-07 | 0.000091 | 1.04E-07 |
| GO:0006468_protein_amino_acid_phosphorylation | 536 | 236 | 1.31 | 2.43E-07 | 0.000083 | 7.42E-07 |
| GO:0043687_post-translational_protein_modification | 1171 | 473 | 1.20 | 2.83E-07 | 0.000154 | 0.00097 |
| GO:0007265_Ras_protein_signal_transduction | 177 | 92 | 1.54 | 3.59E-07 | 0.000143 | 0.00013 |
| GO:0006464_protein_modification_process | 1399 | 555 | 1.18 | 3.91E-07 | 0.0002 | 0.00721 |
| GO:0007242_intracellular_signaling_cascade | 1210 | 486 | 1.19 | 4.26E-07 | 0.000188 | 0.00040 |
| GO:0006817_phosphate_transport | 83 | 50 | 1.79 | 6.31E-07 | 0.000176 | 2.10E-05 |
Potential criteria for standardized reporting of GWAS results.
| Methodological description (array(s), calling algorithm, DNA extraction method, DNA pooling) |
| Full disclosure of SNP list genotyped, with genomic build and position |
| Genotyped strand and call rates for each SNP |
| QC filtering criteria employed |
| Sample sizes for all analyzed populations and indication of genotyping conducted for each |
| Sample demographics: gender, mean age, ethnogeographic make-up, geographic coordinates |
| Specific phenotype description(s) |
| Description of analytical and statistical procedures applied (genetic models, imputation, etc.) |
| Full p-values and effects estimates (such as direction of effect with respect to alleles, beta coefficients and standard errors) for association in main GWAS scans, and for meta-analysis if done across scans |
| List of SNPs with genomic build and position for which replication was attempted |
| Appropriate measures to mask group and individual identification if data are posted publicly |
| Contact information for submitters and parties responsible for portions of the GWAS |