| Literature DB >> 24497850 |
Susan K Service1, Tanya M Teslovich2, Christian Fuchsberger2, Vasily Ramensky1, Pranav Yajnik2, Daniel C Koboldt3, David E Larson3, Qunyuan Zhang3, Ling Lin3, Ryan Welch2, Li Ding3, Michael D McLellan3, Michele O'Laughlin3, Catrina Fronick3, Lucinda L Fulton3, Vincent Magrini3, Amy Swift4, Paul Elliott5, Marjo-Riitta Jarvelin6, Marika Kaakinen7, Mark I McCarthy8, Leena Peltonen9, Anneli Pouta10, Lori L Bonnycastle4, Francis S Collins4, Narisu Narisu4, Heather M Stringham2, Jaakko Tuomilehto11, Samuli Ripatti12, Robert S Fulton3, Chiara Sabatti13, Richard K Wilson3, Michael Boehnke2, Nelson B Freimer1.
Abstract
Genome-wide association studies (GWAS) have identified >500 common variants associated with quantitative metabolic traits, but in aggregate such variants explain at most 20-30% of the heritable component of population variation in these traits. To further investigate the impact of genotypic variation on metabolic traits, we conducted re-sequencing studies in >6,000 members of a Finnish population cohort (The Northern Finland Birth Cohort of 1966 [NFBC]) and a type 2 diabetes case-control sample (The Finland-United States Investigation of NIDDM Genetics [FUSION] study). By sequencing the coding sequence and 5' and 3' untranslated regions of 78 genes at 17 GWAS loci associated with one or more of six metabolic traits (serum levels of fasting HDL-C, LDL-C, total cholesterol, triglycerides, plasma glucose, and insulin), and conducting both single-variant and gene-level association tests, we obtained a more complete understanding of phenotype-genotype associations at eight of these loci. At all eight of these loci, the identification of new associations provides significant evidence for multiple genetic signals to one or more phenotypes, and at two loci, in the genes ABCA1 and CETP, we found significant gene-level evidence of association to non-synonymous variants with MAF<1%. Additionally, two potentially deleterious variants that demonstrated significant associations (rs138726309, a missense variant in G6PC2, and rs28933094, a missense variant in LIPC) were considerably more common in these Finnish samples than in European reference populations, supporting our prior hypothesis that deleterious variants could attain high frequencies in this isolated population, likely due to the effects of population bottlenecks. Our results highlight the value of large, well-phenotyped samples for rare-variant association analysis, and the challenge of evaluating the phenotypic impact of such variants.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24497850 PMCID: PMC3907339 DOI: 10.1371/journal.pgen.1004147
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Overview of quantitative trait loci investigated in this study.
| Locus | Chr | 5′ boundary (Build 37) (Mb) | Size (kb) | ROI | Genes Targeted/Total | # Validated Variant Sites | Associated Trait(s) | Array SNP MAF | Array SNP P-value |
|
| 1 | 109.656946 | 782.534 | 36.464 | 9/21 | 254 | LDL-C:rs646776 | .22 | 3.0E-17 |
| TC:rs646776 | 6.7E-13 | ||||||||
|
| 1 | 230.273866 | 164.123 | 4.530 | 1/1 | 67 | HDL-C:rs611229 | .41 | 3.9E-05 |
| TG:rs4846930 | 2.3E-02 | ||||||||
|
| 2 | 26.893100 | 1594.61 | 2.189 | 1/42 | 12 | FG:rs7588910 | .33 | 2.8E-02 |
| TC:rs780090 | .05 | 2.0E-03 | |||||||
| TG:rs1260326 | .35 | 1.2E-12 | |||||||
|
| 2 | 43.458071 | 819.384 | 38.183 | 7/7 | 231 | LDL-C:rs6756629 | .09 | 9.8E-06 |
| TC:rs6756629 | 1.2E-05 | ||||||||
|
| 2 | 169.312969 | 557.867 | 17.240 | 5/5 | 97 | FG:rs560887 | .31 | 1.1E-10 |
|
| 8 | 19.518908 | 458.35 | 3.747 | 1/3 | 43 | HDL-C:rs10096633 | .10 | 1.7E-06 |
| TG:rs10096633 | 5.4E-09 | ||||||||
|
| 9 | 107.543376 | 201.285 | 11.176 | 1/1 | 73 | HDL:rs2575875 | .30 | 3.4E-05 |
| TC:rs2740486 | .45 | 1.5E-03 | |||||||
|
| 10 | 91.343009 | 62.133 | 3.684 | 1/3 | 12 | FI:rs1075374 | .21 | 3.2E-05 |
|
| 11 | 45.706162 | 210.619 | 10.997 | 3/4 | 60 | FG:rs2696935 | .16 | 4.8E-04 |
|
| 11 | 46.273702 | 5320.50 | 47.317 | 15/50 | 325 | FG:rs2696935 | .16 | 4.8E-04 |
| HDL-C:rs7946766 | .19 | 2.5E-06 | |||||||
|
| 11 | 61.282504 | 446.559 | 9.273 | 3/10 | 43 | FG:rs2072114 | .29 | 5.6E-04 |
| HDL-C:rs509360 | .34 | 2.3E-03 | |||||||
| LDL-C:rs174546 | .43 | 1.0E-06 | |||||||
| TC:rs174546 | .43 | 5.7E-05 | |||||||
| TG:rs6591657 | .15 | 2.5E-03 | |||||||
|
| 11 | 92.664875 | 62.999 | 1.662 | 1/1 | 15 | FG:rs7121092 | .42 | 3.1E-08 |
|
| 11 | 116.462585 | 640.063 | 5.057 | 4/11 | 55 | HDL-C:rs12805061 | .27 | 1.2E-04 |
| LDL-C:rs11216267 | .46 | 8.3E-08 | |||||||
| TC:rs11216267 | .46 | 6.1E-07 | |||||||
| TG:rs12805061 | .27 | 8.4E-07 | |||||||
|
| 12 | 109.641978 | 474.532 | 6.093 | 2/7 | 54 | HDL-C:rs12314392 | .41 | 3.0E-03 |
|
| 15 | 58.541083 | 319.88 | 2.175 | 1/1 | 27 | HDL-C:rs1532085 | .44 | 1.0E-12 |
| TC:rs261336 | .23 | 2.6E-03 | |||||||
| TG:rs261336 | .23 | 3.4E-04 | |||||||
|
| 16 | 56.733941 | 383.464 | 16.686 | 5/5 | 148 | HDL-C:rs3764261 | .27 | 7.3E-38 |
| LDL-C:rs12445698 | .19 | 1.7E-02 | |||||||
| TC:rs12445698 | .19 | 1.4E-03 | |||||||
| TG:rs1561140 | .47 | 2.1E-02 | |||||||
|
| 19 | 19.250145 | 628.669 | 55.570 | 18/20 | 336 | LDL-C:rs12610185 | .06 | 2.3E-03 |
| TC:rs2228603 | .07 | 3.0E-05 | |||||||
| TG:rs2304130 | .06 | 1.3E-05 |
The loci are named according to the first gene in the region of interest, starting at the 5′ end of the region.
ROI = region of interest. Number of kb sequenced for the locus.
Total Genes is the number of genes in the locus; Targeted Genes is the number of genes investigated in this study.
Please see Table S3 for a complete listing of all variant sites, along with their MAF and annotation.
Phenotype abbreviations: TC = total cholesterol, LDL-C = low density lipoprotein cholesterol, HDL-C = high density lipoprotein cholesterol, TG = triglycerides, FG = Fasting Glucose, FI = Fasting Insulin. Previous association evidence for lipid traits is from Kathiresan et al. 2008, Willer et al. 2008 and Teslovich et al. 2010; for glucose Dupuis et al. 2010, and for insulin Sabatti et al. 2009. The “array SNP” is the GWAS array-genotyped SNP (not sequence variant) with the smallest p-value to indicated traits in this study and is not necessarily the same SNP highlighted in previous studies.
Array SNP MAF and p-values are taken from the current study.
Loci with multiple independent single-variant association signals to SNPs with MAF>0.1%.
| Trait/Locus | Gene | Variant | MAF | Allele | Type | Beta | P-value | PV array SNP only | PV array SNP+sequence variants |
|
| rs6756629 | .090 | A | array SNP | −0.15 | 1.3E-05 | |||
|
| rs145756111 | .011 | A | synonymous | 0.31 | 6.1E-04 | 1.2% | 1.4% | |
|
| rs560887 | .310 | A | array SNP | −0.15 | 9.0E-12 | |||
|
| rs138726309 | .014 | T | missense probably damaging | −0.41 | 2.6E-06 | 1.2% | 1.7% | |
|
| rs10096633 | .100 | A | array SNP | 0.15 | 6.8E-06 | |||
|
| rs268 | .018 | G | missense benign | −0.38 | 2.2E-07 | 0.72% | 1.2% | |
|
| rs10096633 | .100 | A | array SNP | −0.19 | 1.9E-08 | |||
|
| rs268 | .018 | G | missense benign | 0.31 | 9.2E-05 | 0.59% | 0.85% | |
|
| rs2575875 | .300 | A | array SNP | −0.08 | 8.4E-05 | |||
|
| rs2066718 | .015 | T | missense benign | 0.32 | 9.3E-05 | |||
|
| rs2066715 | .055 | T | missense benign | 0.15 | 5.1E-04 | 0.62% | 1.1% | |
|
| rs12805061 | .270 | G | array SNP | 0.004 | .86 | |||
|
| rs2266788 | .089 | G | 3′-UTR | 0.27 | 7.7E-13 | |||
|
| rs3135506 | .059 | C | coding-synonymous | 0.24 | 6.9E-08 | 0.4% | 1.6% | |
|
| rs11216267 | .460 | G | array SNP | 0.08 | 2.3E-05 | |||
|
| rs651821 | .085 | C | 5′-UTR | 0.13 | 2.2E-04 | 1.4% | 1.6% | |
|
| rs11216267 | .460 | G | array SNP | 0.08 | 1.3E-04 | |||
|
| rs651821 | .085 | C | 5′-UTR | 0.14 | 9.3E-05 | 1.1% | 1.4% | |
|
| rs1532085 | .440 | A | array SNP | 0.13 | 9.4E-12 | |||
|
| rs28933094 | .016 | T | missense probably damaging | 0.42 | 7.0E-08 | 1.2% | 1.8% | |
|
| rs261336 | .230 | G | array SNP | 0.09 | 1.4E-04 | |||
|
| rs28933094 | .016 | T | missense probably damaging | 0.33 | 2.7E-05 | 0.2% | 0.5% | |
|
| rs3764261 | .270 | A | array SNP | 0.27 | 1.6E-40 | |||
|
| rs5880 | .024 | C | missense probably damaging | −0.26 | 1.8E-05 | 3.3% | 3.6% |
FG and TG were LN transformed prior to analysis; the regression coefficient is on the LN scale.
Beta is the estimate of the regression coefficient, and provides the amount and direction the phenotype changes for each copy of the indicated allele.
The p-values come from separate multivariate models for each locus that include all variants listed below, and the first five PCs. P-values shown here represent the independent evidence for the specified variant, after conditioning on the array SNP.
The percent variance in the phenotype accounted for by just the array SNP.
The percent variance in the phenotype accounted for by the array SNP and the independent sequence variants.
Figure 1Schematic of rare (MAF<1%) non-synonymous variants used in the gene-level test of total cholesterol (TC) in gene ABCA1.
The x-axis scale (AA) is in amino acid positions. Numbers in parenthesis are the number of copies of the rare variant in persons with phenotype data. The mean TC level in persons possessing variants with bold naming is increased relative to persons without the variant, for all other variants the mean TC level in persons possessing variant alleles is decreased relative to persons without the variant.
Rare and common variants contribute to the association signal to HDL-C in gene ABCA1.
| Variant | MAF | MAC | Allele | Type | Beta | P-value |
| rs2575875 | .30 | 4209 | A | array SNP | −0.08 | 2.7E-04 |
| rs2066718 | .015 | 181 | T | missense benign | 0.32 | 1.0E-04 |
| rs2066715 | .055 | 674 | T | missense benign | 0.15 | 5.1E-04 |
| Chr9:107548661 | .00025 | 3 | G | missense probably damaging | −2.08 | 2.9E-04 |
| Chr9:107555091 | .00049 | 6 | C | missense benign | 1.50 | 6.9E-04 |
| Chr9:107555452 | .00016 | 2 | G | missense benign | −2.94 | 2.8E-05 |
P-values for rs2575875, rs2066718, and rs2066715 may be different from those recorded in Table 2 because of the addition of the three rare variants to the model. MAC: minor allele count.
Figure 2Schematic of rare (MAF<1%) non-synonymous variants used in the gene-level test HDL-C in gene CETP.
The x-axis scale (AA) is in amino acid positions. Numbers in parenthesis are the number of copies of the rare variant in persons with phenotype data. The mean HDL-C level in persons possessing variants with bold naming is increased relative to persons without the variant, for all other variants the mean HDL-C level in persons possessing variant alleles is decreased relative to persons without the variant.
Rare and common variants contribute to the association signal to HDL-C in gene CETP.
| Variant | MAF | MAC | Allele | Type | Beta | P-value |
| rs3764261 | .27 | 3913 | A | array SNP | 0.28 | 1.6E-40 |
| rs5883 | .040 | 490 | T | coding-synonymous | 0.22 | 5.2E-06 |
| rs5880 | .024 | 296 | C | missense probably damaging | −0.29 | 4.2E-05 |
| rs2303790 | .00057 | 7 | G | missense possibly damaging | 1.31 | 1.0E-03 |
P-values for rs3764261 and rs5880 may be different from those recorded in Table 2 because of the addition of rs2303790 and rs5883 to the model. MAC: minor allele count.