| Literature DB >> 23696881 |
Andrew R Wood1, John R B Perry, Toshiko Tanaka, Dena G Hernandez, Hou-Feng Zheng, David Melzer, J Raphael Gibbs, Michael A Nalls, Michael N Weedon, Tim D Spector, J Brent Richards, Stefania Bandinelli, Luigi Ferrucci, Andrew B Singleton, Timothy M Frayling.
Abstract
Genome-wide association (GWA) studies have been limited by the reliance on common variants present on microarrays or imputable from the HapMap Project data. More recently, the completion of the 1000 Genomes Project has provided variant and haplotype information for several million variants derived from sequencing over 1,000 individuals. To help understand the extent to which more variants (including low frequency (1% ≤ MAF <5%) and rare variants (<1%)) can enhance previously identified associations and identify novel loci, we selected 93 quantitative circulating factors where data was available from the InCHIANTI population study. These phenotypes included cytokines, binding proteins, hormones, vitamins and ions. We selected these phenotypes because many have known strong genetic associations and are potentially important to help understand disease processes. We performed a genome-wide scan for these 93 phenotypes in InCHIANTI. We identified 21 signals and 33 signals that reached P<5×10(-8) based on HapMap and 1000 Genomes imputation, respectively, and 9 and 11 that reached a stricter, likely conservative, threshold of P<5×10(-11) respectively. Imputation of 1000 Genomes genotype data modestly improved the strength of known associations. Of 20 associations detected at P<5×10(-8) in both analyses (17 of which represent well replicated signals in the NHGRI catalogue), six were captured by the same index SNP, five were nominally more strongly associated in 1000 Genomes imputed data and one was nominally more strongly associated in HapMap imputed data. We also detected an association between a low frequency variant and phenotype that was previously missed by HapMap based imputation approaches. An association between rs112635299 and alpha-1 globulin near the SERPINA gene represented the known association between rs28929474 (MAF = 0.007) and alpha1-antitrypsin that predisposes to emphysema (P = 2.5×10(-12)). Our data provide important proof of principle that 1000 Genomes imputation will detect novel, low frequency-large effect associations.Entities:
Mesh:
Year: 2013 PMID: 23696881 PMCID: PMC3655956 DOI: 10.1371/journal.pone.0064343
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic overview of imputation.
(A) Directly genotyped SNPs are phased to estimate haplotypes that alleles reside on. (B) Publically available reference haplotypes from projects including the International HapMap Project and the 1000 Genomes Project are downloaded that contain variants to be imputed. (C) Phased haplotypes from the study are assessed and likened to publically available haplotypes. The most likely genotypes are imputed into the study based on the alleles found in reference haplotype panel.
Basic characteristics of the 1210 InCHIANTI subjects at baseline.
| Characteristic | Mean (Range) or% |
| Age (years) | 68.2 (21–102) |
| Sex (% male) | 44.6% |
| BMI | 27.2 (18–46.6) |
| Current smokers (% case) | 18.8% |
| History of hypertension (% case) | 38.4% |
| Treatment for hypertension - former and current (% case) | 30.1% |
| History of diabetes (% case) | 9.5% |
| Treatment for diabetes - former and current (% case) | 6.58% |
| History of myocardial infarction (% case) | 3.6% |
| Treatment for myocardial infarction - former and current (% case) | 2.0% |
Figure 2Distributions of imputation quality (r 2) across different minor allele frequency (MAF) bins.
Note differences in Y axes scales.
Associations of circulating factors based on HapMap imputation ordered by minor allele frequency group (<5% or ≥5), chromosome and base-pair position.
| Trait | GC | Chr | SNP | BP (b37) | MAF |
| Nearest Gene | Annotation |
| Alkaline phosphatase | 1.055 | 12 | rs1880889 | 41721235 | 0.03 | 1.96E-08 |
| Known |
| Homocysteine | 1.043 | 1 | rs1801133 | 11856378 | 0.495 | 1.75E-11 |
| Known |
| Vitamin B6 | 1.017 | 1 | rs4654748 | 21786068 | 0.499 | 8.84E-09 |
| Known |
| Interleukin-6R | 1.134 | 1 | rs8192284 | 154426970 | 0.385 | 3.60E-53 |
| Known |
| C-reactive protein | 1.035 | 1 | rs12741825 | 159670145 | 0.301 | 1.08E-08 |
| Known |
| Oxidized LDL | 1.029 | 2 | rs676210 | 21231524 | 0.239 | 4.05E-30 |
| Known (other trait) |
| Macrophage inflammatory protein-1b | 1.016 | 3 | rs1500004 | 46150937 | 0.104 | 2.06E-08 |
| Novel |
| Lipoprotein a | 1.057 | 6 | rs10455872 | 161010118 | 0.056 | 1.18E-20 |
| Known |
| Insulin-like growth factor bp-3 | 1.038 | 7 | rs10260816 | 46010100 | 0.456 | 4.42E-14 |
| Known |
| 90K Protein | 1.016 | 8 | rs406652 | 16042507 | 0.159 | 1.56E-09 |
| Novel |
| Interleukin-18 | 1.047 | 11 | rs2250417 | 112085316 | 0.44 | 2.94E-14 |
| Known |
| Vitamin E alpha tocopherol | 1.066 | 11 | rs964184 | 116648917 | 0.152 | 1.33E-08 |
| Known |
| Triglycerides | 1.052 | 11 | rs964184 | 116648917 | 0.152 | 9.03E-09 |
| Known |
| Soluble transferrin receptor | 1.04 | 11 | rs1242229 | 117062370 | 0.129 | 6.55E-09 |
| Known |
| HDL cholesterol | 1.033 | 16 | rs1800775 | 56995236 | 0.486 | 8.01E-13 |
| Known |
| Alpha-2 globulin (%) | 1.058 | 16 | rs2000999 | 72108093 | 0.228 | 2.13E-08 |
| Known (other trait) |
| Lutein | 1.026 | 16 | rs11645428 | 81258896 | 0.468 | 1.17E-13 |
| Known |
| Beta carotene | 1.051 | 16 | rs6564851 | 81264597 | 0.364 | 1.70E-10 |
| Known |
| Free thyroxine (FT4) | 1.03 | 17 | rs7212734 | 6340460 | 0.444 | 1.10E-08 |
| Novel |
| Macrophage inflammatory protein-1b | 1.016 | 17 | rs1015673 | 34477386 | 0.443 | 1.25E-22 |
| Known |
| Vitamin E alpha tocopherol | 1.066 | 19 | rs10401969 | 19407718 | 0.057 | 9.32E-09 |
| Known (other trait) |
GC = genomic control factor used to adjust P-value, Chr = chromosome, BP (b37) = human genome build 37 base pair position, SNP = index SNP representing signal, MAF = minor allele frequency, P* = P-value derived after applying genomic control.
Annotations refer to whether associations represent known (present in NHGRI catalogue at P<5×10−8 or GWAS literature) signals for same trait, different traits or represent putative novel signals.
Genome-wide significant associations of circulating factors based on 1000 Genomes imputation ordered by minor allele frequency group (<5% or ≥5), chromosome and base-pair position.
| Trait | GC | Chr | SNP | BP (b37) | MAF |
| Nearest Gene | Annotation |
| Total cholesterol | 1.049 | 1 | rs116435220 | 46890686 | 0.011 | 2.18E-08 |
| Novel |
| Tumor necrosis factor-a | 1.024 | 2 | rs188141385 | 14975369 | 0.004 | 4.16E-10 |
| Novel |
| Beta globulins (%) | 1.034 | 4 | rs28645201 | 91528927 | 0.02 | 1.22E-09 |
| Novel |
| Vitamin B12 | 1.055 | 5 | rs146226203 | 150185826 | 0.012 | 3.01E-08 |
| Novel |
| Lipoprotein a | 1.057 | 6 | rs55730499 | 161005610 | 0.048 | 3.43E-25 |
| Known |
| Cortisol:DHEAS ratio | 1.006 | 7 | rs34670419 | 99130834 | 0.028 | 2.35E-08 |
| Known |
| Dehydroepiandrosterone | 1.023 | 7 | rs34670419 | 99130834 | 0.028 | 2.07E-09 |
| Known |
| Cl- | 1.075 | 7 | rs2371549 | 144172841 | 0.005 | 5.38E-09 |
| Novel |
| Alkaline phosphatase | 1.055 | 12 | rs1880889 | 41721235 | 0.029 | 3.08E-08 |
| Known |
| Alpha-1 globulin (%) | 1.054 | 14 | rs112635299 | 94838142 | 0.007 | 2.51E-12 |
| Novel |
| Alpha-2 globulin (%) | 1.058 | 17 | rs181929163 | 63740545 | 0.01 | 1.57E-08 |
| Novel |
| Homocysteine | 1.043 | 1 | rs1801133 | 11856378 | 0.497 | 2.01E-11 |
| Known |
| Vitamin B6 | 1.017 | 1 | rs4654932 | 21809436 | 0.417 | 7.39E-09 |
| Known |
| Interleukin-6R | 1.134 | 1 | rs12730935 | 154419892 | 0.377 | 2.28E-53 |
| Known |
| Oxidized LDL | 1.029 | 2 | rs676210 | 21231524 | 0.239 | 4.12E-30 |
| Known (other trait) |
| Macrophage inflammatory protein-1b | 1.016 | 3 | rs113341849 | 46384204 | 0.09 | 2.14E-11 |
| Novel |
| Insulin-like growth factor bp-3 | 1.038 | 7 | rs71550311 | 45987430 | 0.459 | 1.39E-14 |
| Known |
| 90K Protein | 1.016 | 8 | rs28491433 | 16050871 | 0.148 | 6.03E-10 |
| Novel |
| Beta Cryptoxanthin | 1.021 | 8 | rs75226183 | 77295977 | 0.087 | 1.69E-09 |
| Novel |
| Creatine phosphokinase | 1.029 | 10 | rs61871700 | 101828261 | 0.28 | 4.18E-08 |
| Known |
| Interleukin-18 | 1.047 | 11 | rs2250417 | 112085316 | 0.44 | 2.97E-14 |
| Known |
| Vitamin E alpha tocopherol | 1.066 | 11 | rs964184 | 116648917 | 0.151 | 6.66E-09 |
| Known |
| Triglycerides | 1.052 | 11 | rs964184 | 116648917 | 0.151 | 1.01E-08 |
| Known |
| Soluble transferrin receptor | 1.04 | 11 | rs7940310 | 117024481 | 0.13 | 7.17E-09 |
| Known |
| HDL cholesterol | 1.033 | 16 | rs711752 | 56996211 | 0.436 | 8.32E-13 |
| Known |
| Alpha-2 globulin (%) | 1.058 | 16 | rs2287997 | 72140553 | 0.227 | 1.45E-10 |
| Known (other trait) |
| Beta carotene | 1.051 | 16 | rs12926540 | 81258987 | 0.362 | 6.98E-11 |
| Known |
| Lutein | 1.026 | 16 | rs9708919 | 81260786 | 0.391 | 7.84E-14 |
| Known |
| Free thyroxine (FT4) | 1.03 | 17 | rs58926603 | 6341870 | 0.393 | 5.62E-09 |
| Novel |
| Macrophage inflammatory protein-1b | 1.016 | 17 | rs4796217 | 34819191 | 0.345 | 6.62E-21 |
| Known |
| Vitamin E alpha tocopherol | 1.066 | 19 | rs58542926 | 19379549 | 0.064 | 1.18E-08 |
| Known (other trait) |
| LDL cholesterol | 1.041 | 19 | rs1065853 | 45413233 | 0.075 | 1.76E-09 |
| Known |
| Gamma-glutamyltransferase | 1.043 | 22 | rs2006227 | 24995756 | 0.386 | 1.70E-08 |
| Known |
= associations that represent novel signals in both HapMap and 1000 Genomes-based analyses, P** = P-value derived after applying genomic control.
The 20 signals and loci found in both HapMap and 1000 Genomes ordered by minor allele frequency group (<5% or ≥5) and chromosome.
| Trait | Chr | HapMap-based Analysis | 1000 Genomes-based Analysis | BP Dist. | LD ( | ||||
| SNP | MAF |
| SNP | MAF |
| ||||
| Alkaline Phosphatase | 12 | rs1880889 | 0.03 | 1.96E-08 | rs1880889 | 0.029 | 3.08E-08 | – | – |
| Interleukin-6R | 1 | rs8192284 | 0.385 | 3.60E-53 | rs12730935 | 0.377 | 2.28E-53 | 7078 | 0.957 |
| Homocysteine | 1 | rs1801133 | 0.495 | 1.75E-11 | rs1801133 | 0.497 | 2.01E-11 | – | – |
| Vitamin B6 | 1 | rs4654748 | 0.499 | 8.84E-09 | rs4654932 | 0.417 | 7.39E-09 | 23368 | 0.744 |
| Oxidized LDL | 2 | rs676210 | 0.239 | 4.05E-30 | rs676210 | 0.239 | 4.12E-30 | – | – |
| Macrophage inflammatory protein-1b | 3 | rs1500004 | 0.104 | 2.06E-08 | rs113341849 | 0.09 | 2.14E-11 | 233267 | 0.667 |
| Lipoprotein a | 6 | rs10455872 | 0.056 | 1.18E-20 | rs55730499 | 0.048 | 3.43E-25 | 4508 | 0.888 |
| Insulin-like growth factor bp-3 | 7 | rs10260816 | 0.456 | 4.42E-14 | rs71550311 | 0.459 | 1.39E-14 | 22670 | 0.871 |
| 90K Protein | 8 | rs406652 | 0.159 | 1.56E-09 | rs28491433 | 0.148 | 6.03E-10 | 8364 | 0.932 |
| Soluble transferrin receptor | 11 | rs1242229 | 0.129 | 6.55E-09 | rs7940310 | 0.13 | 7.17E-09 | 37889 | 0.956 |
| Triglycerides | 11 | rs964184 | 0.152 | 9.03E-09 | rs964184 | 0.151 | 1.01E-08 | – | – |
| Vitamin E alpha tocopherol | 11 | rs964184 | 0.152 | 1.33E-08 | rs964184 | 0.151 | 6.66E-09 | – | – |
| Interleukin-18 | 11 | rs2250417 | 0.44 | 2.94E-14 | rs2250417 | 0.44 | 2.97E-14 | – | – |
| Alpha-2 globulin (%) | 16 | rs2000999 | 0.228 | 2.13E-08 | rs2287997 | 0.227 | 1.45E-10 | 32460 | 0.937 |
| Beta Carotene | 16 | rs6564851 | 0.364 | 1.70E-10 | rs12926540 | 0.362 | 6.98E-11 | 5610 | 0.954 |
| Lutein | 16 | rs11645428 | 0.468 | 1.17E-13 | rs9708919 | 0.391 | 7.84E-14 | 1890 | 0.53 |
| HDL cholesterol | 16 | rs1800775 | 0.486 | 8.01E-13 | rs711752 | 0.436 | 8.32E-13 | 975 | 0.774 |
| Macrophage inflammatory protein-1b | 17 | rs1015673 | 0.443 | 1.25E-22 | rs4796217 | 0.345 | 6.62E-21 | 341805 | 0.577 |
| Free thyroxine (FT4) | 17 | rs7212734 | 0.444 | 1.10E-08 | rs58926603 | 0.393 | 5.62E-09 | 1410 | 0.547 |
| Vitamin E alpha tocopherol | 19 | rs10401969 | 0.057 | 9.32E-09 | rs58542926 | 0.064 | 1.18E-08 | 28169 | 1 |
SNP = index SNP representing signal, MAF = minor allele frequency, P* = P-value derived after applying genomic control correction, BP Dist. = base pair distance between the HapMap and 1000 Genomes index SNPs, LD (r2) = linkage disequilibrium measure based on r 2 correlation.
Fourteen associations represent the same signals (r
2>0.8) of which 6 are represented by the same index SNP. The remaining 6 represent the same locus (0.2
The 13 signals identified as having an associated 1000 Genomes-based imputed SNP reaching genome-wide significance where the best HapMap SNP did not, ordered by minor allele frequency group (<5% or ≥5) and the minor allele frequency of the 1000 Genomes SNP.
| Trait | HapMap-based Analysis | 1000 Genomes-based Analysis | BP Dist. | LD ( | Annotation | |||||
| SNP | MAF |
| SNP | MAF |
| Beta | ||||
| Tumor necrosis factor-a | rs10206283 | 0.289 | 5.40E-03 | rs188141385 | 0.004 | 4.16E-10 | −2.101 | 218450 | 0.002 | Novel |
| Cl- | rs10262256 | 0.091 | 2.21E-02 | rs2371549 | 0.005 | 5.38E-09 | −2.059 | 508371 | 0.001 | Novel |
| Alpha-1 globulin (%) | rs10149388 | 0.054 | 2.25E-04 | rs112635299 | 0.007 | 2.51E-12 | −1.891 | 371308 | 0.034 | Novel |
| Alpha-2 globulin (%) | rs16959655 | 0.079 | 1.58E-03 | rs181929163 | 0.01 | 1.57E-08 | 1.472 | 656607 | 0.004 | Novel |
| Total cholesterol | rs17102098 | 0.021 | 1.16E-05 | rs116435220 | 0.011 | 2.18E-08 | −1.243 | 149475 | 0.526 | Novel |
| Vitamin B12 | rs3797617 | 0.146 | 1.58E-03 | rs146226203 | 0.012 | 3.01E-08 | −1.38 | 566327 | 0.009 | Novel |
| Beta globulins (%) | rs10516879 | 0.163 | 1.51E-03 | rs28645201 | 0.02 | 1.22E-09 | 1.116 | 211623 | 0.032 | Novel |
| Dehydroepiandrosterone | rs11761528 | 0.092 | 9.67E-07 | rs34670419 | 0.028 | 2.07E-09 | −0.78 | 12033 | 0.263 | Known |
| Cortisol:DHEAS ratio | rs10278040 | 0.045 | 1.17E-05 | rs34670419 | 0.028 | 2.35E-08 | 0.721 | 10539 | 0.588 | Known |
| LDL cholesterol | rs445925 | 0.114 | 3.60E-05 | rs1065853 | 0.075 | 1.76E-09 | −0.526 | 2407 | 0.714 | Known |
| Beta Cryptoxanthin | rs1464093 | 0.178 | 5.05E-06 | rs75226183 | 0.087 | 1.69E-09 | −0.514 | 821919 | 0.049 | Novel |
| Creatine phosphokinase | rs17112705 | 0.295 | 6.92E-07 | rs61871700 | 0.28 | 4.18E-08 | 0.257 | 108933 | 0.78 | Known |
| Gamma-glutamyltransferase | rs2017869 | 0.394 | 7.94E-08 | rs2006227 | 0.386 | 1.70E-08 | 0.287 | 1553 | 0.973 | Known |
- P-value derived after applying genomic control correction.
Details of the best HapMap SNP within a 2 MB window centred on the 1000 Genomes SNP are provided. Annotation “Novel” refers to signals not previously reported. Annotation “known” refers to signals not identified by HapMap but identified in the NHGRI GWAS catalogue.
Figure 3Locus zoom plots of 6 loci reaching p<5×10−8 in both the HapMap- and the 1000 Genomes imputed data and that represent known associations (r 2>0.2 in the NHGRI GWAS catalogue).
Figure 4Locus zoom plots of 6 loci reaching p<5×10−8 in both the HapMap- and the 1000 Genomes imputed data and that represent known associations (r 2>0.2 in the NHGRI GWAS catalogue).
Results from the univariable and multivariable analysis using the Index HapMap and 1000 Genomes SNPs for 13 traits across 14 loci where both index SNPs were genome-wide significant and differed from each other.
| Trait | Index HM SNP | HM Uni | HM Multi | Index 1 KG SNP | 1 KG Uni | 1 KG Multi |
| 90 K Protein | rs406652 | 2.33E-09 | 0.90 | rs28491433 | 4.29E-10 | 0.07 |
| Alpha-2 globulin (%) | rs2000999 | 2.79E-10 | 0.78 | rs2287997 | 1.01E-10 | 0.14 |
| Vitamin E alpha tocopherol | rs10401969 | 1.07E-08 | 0.80 | rs58542926 | 9.38E-09 | 0.58 |
| Beta carotene | rs6564851 | 1.26E-10 | 0.86 | rs12926540 | 4.73E-11 | 0.1625 |
| HDL cholesterol | rs1800775 | 1.98E-12 | 0.17 | rs711752 | 4.77E-13 | 0.0305 |
| Free thyroxine (FT4) | rs7212734 | 8.49E-09 | 0.04 | rs58926603 | 4.39E-09 | 0.0342 |
| Insulin-like growth factor bp-3 | rs10260816 | 2.07E-14 | 0.39 | rs71550311 | 6.16E-15 | 0.0774 |
| Interleukin-6R | rs8192284 | 8.84E-60 | 0.19 | rs12730935 | 4.87E-60 | 0.0882 |
| Lipoprotein a | rs10455872 | 5.06E-25 | 0.44 | rs55730499 | 2.34E-26 | 0.0097 |
| Lutein | rs11645428 | 5.96E-14 | 0.01 | rs9708919 | 4.03E-14 | 0.0072 |
| Macrophage inflammatory protein-1b | rs1015673 | 1.37E-23 | 0.002 | rs4796217 | 1.09E-21 | 0.2842 |
| Macrophage inflammatory protein-1b | rs1500004 | 9.33E-09 | 0.68 | rs113341849 | 1.37E-11 | 0.0003 |
| Soluble transferrin receptor | rs1242229 | 7.51E-09 | 0.71 | rs7940310 | 5.66E-09 | 0.4052 |
| Vitamin B6 | rs4654748 | 6.73E-09 | 0.20 | rs4654932 | 4.33E-09 | 0.1922 |
HM = HapMap, 1 KG = 1000 Genomes, Uni = P-value derived from univariable analysis after applying genomic control correction. Multi P* = P-value derived from multivariable analysis after applying genomic control correction.
Replication data from three biomarker phenotypes available in the BLSA and or TwinsUK studies.
| Trait | 1000 G Index SNP | MAF |
| Beta | BLSA Replication | TwinsUK Replication |
| TNF-a | rs188141385 | 0.004 | 4.20E-10 | 2.1 | 0.38 | 0.56 |
| Cl- | rs2371549 | 0.005 | 5.40E-09 | 2.06 | – | 0.9 |
| Vitamin B12 | rs146226203 | 0.01 | 3.00E-08 | 1.38 | 0.35 | 0.41 |
Beta = standard deviations from the double inverse-normalised phenotype values.
Figure 5Locus zoom plots of the associations between HapMap and 1000 genomes imputed variants and alpha-1 globulin.