| Literature DB >> 24633158 |
Oddgeir L Holmen1, He Zhang2, Yanbo Fan2, Daniel H Hovelson3, Ellen M Schmidt3, Wei Zhou4, Yanhong Guo4, Ji Zhang4, Arnulf Langhammer5, Maja-Lisa Løchen6, Santhi K Ganesh7, Lars Vatten8, Frank Skorpen9, Håvard Dalen10, Jifeng Zhang4, Subramaniam Pennathur11, Jin Chen4, Carl Platou12, Ellisiv B Mathiesen13, Tom Wilsgaard6, Inger Njølstad6, Michael Boehnke14, Y Eugene Chen4, Gonçalo R Abecasis14, Kristian Hveem15, Cristen J Willer16.
Abstract
Blood lipid levels are heritable, treatable risk factors for cardiovascular disease. We systematically assessed genome-wide coding variation to identify new genes influencing lipid traits, fine map known lipid loci and evaluate whether low-frequency variants with large effects exist for these traits. Using an exome array, we genotyped 80,137 coding variants in 5,643 Norwegians. We followed up 18 variants in 4,666 Norwegians and identified ten loci with coding variants associated with a lipid trait (P < 5 × 10(-8)). One variant in TM6SF2 (encoding p.Glu167Lys), residing in a known genome-wide association study locus for lipid traits, influences total cholesterol levels and is associated with myocardial infarction. Transient TM6SF2 overexpression or knockdown of Tm6sf2 in mice alters serum lipid profiles, consistent with the association observed in humans, identifying TM6SF2 as a functional gene within a locus previously known as NCAN-CILP2-PBX4 or 19p13. This study demonstrates that systematic assessment of coding variation can quickly point to a candidate causal gene.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24633158 PMCID: PMC4169222 DOI: 10.1038/ng.2926
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Coverage of coding variation by exome array
| Percentage of variants discovered | ||||
|---|---|---|---|---|
| Variant type | Frequency | Number of variants | All samples | Average for each |
| > 5% | 174 | 66.7 | 65.7 | |
| 1–5% | 121 | 72.0 | 70.3 | |
| 6 copies - 1% | 875 | 56.5 | 57.0 | |
| 1–5 copies | 945 | 39.0 | 39.0 | |
| > 5% | 11,348 | 78.1 | 78.3 | |
| 1–5% | 7,976 | 77.5 | 77.3 | |
| 6 copies - 1% | 30,959 | 71.2 | 72.1 | |
| 1–5 copies | 27,739 | 45.0 | 45.0 | |
| Loss-of-function | ≥ 6 copies | 1,170 | 64.4 | 65.8 |
| Missense | ≥ 6 copies | 50,283 | 76.0 | 78.1 |
| Missense + LoF | ≥ 6 copies | 51,453 | 75.9 | 78.0 |
| All frequencies | 53/47/32/62 | 96.7 | - | |
This table shows the annotation and frequency of variants successfully genotyped. We estimated the coverage of variants in each category by exome-sequencing 152 Norwegians (average 45× coverage of target region). Lipid GWAS markers as published by the Global Lipids Genetics Consortium (2013)[4] were also genotyped. We separated variants with 6 or more copies (in 5,643 individuals) to demonstrate the characteristics of the genetic variants we examined using single variant association tests.
Loss-of-function (LoF) refers to splice, nonsense, and read-through.
Number of variants with a primary association with LDL cholesterol, HDL cholesterol, total cholesterol and triglycerides, respectively.
Variants in this column were categorized by their frequency in the discovery sample (N=5,643).
Variants in these two columns were categorized by their frequency in 152 samples as follows: > 5%, 1–5%, 2 copies – 1% and 1 copy.
Figure 1Power estimates for current study compared to estimated effect sizes for coding variants and GWAS index SNPs
This figure shows effect size estimates for the coding variants identified in this study and previous GWAS results. Estimated power curves are shown (as dotted lines) for the minimum standardized effect sizes (in standard deviation units) that could be identified for a given effect-allele frequency with 10% (pink), 50% (green), and 90% (blue) power assuming sample size 10,000 and alpha level 5×10−8. Observed coding variants reaching genome-wide significant association with Stage 1 lipid levels from Table 2 are shown in red. Previously known lipid marker effect sizes and frequencies as identified by the Global Lipid Genetic Consortium[4] are shown in black.
Coding variants thatreached genome-wide significant association with lipid levels
| Lipid Association | MI Association | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Stage 1 (N=5,643) | Stage 2 (N=4,666) | Combined (N=10,309) | Stage 1 or Stages 1+2 | ||||||||||||
| Gene | Position | Rsid | Allele | Variant | MAF, % | Effect (s.e.), | MAF, % | Effect (s.e.), | MAF, % | Effect (s.e.), | OR (95% CI) | ||||
| 19:45412079 | rs7412 | T/C | p.Arg176Cys | 7.9 | −21.2 (1.5) | 2.2×10−46 | 6.1 | −28.6 (2.0) | 7.3×10−48 | 7.1 | −24.3 (1.2) | 2.2×10−90 | 0.88 (0.79 – 0.98) | 0.017 | |
| 2: 21263900 | rs1367117 | A/G | p.Thr98Ile | 37.5 | 4.8 (0.9) | 3.0×10−8 | -- | -- | -- | -- | -- | -- | 1.02 (0.94 – 1.09) | 0.70 | |
| 2:44066247 | rs11887534 | C/G | p.Asp19His | 6.5 | −8.7 (1.7) | 1.3×10−7 | 6.3 | −6.5 (2.0) | 9.8×10−4 | 6.4 | −7.8 (1.3) | 8.0×10−10 | 0.93 (0.84 – 1.04) | 0.22 | |
| 2:44065090 | rs6756629 | A/G | p.Arg50Cys | 6.5 | −8.7 (1.7) | 1.5×10−7 | 6.3 | −6.5 (2.0) | 9.5×10−4 | 6.4 | −7.8 (1.3) | 8.3×10−10 | 0.93 (0.83 – 1.04) | 0.21 | |
| 16:57015091 | rs5880 | C/G | p.Ala390Pro | 5.4 | −5.4 (0.6) | 9.6×10−19 | -- | -- | -- | -- | -- | -- | 1.16 (0.99 – 1.36) | 0.075 | |
| 16:57016092 | rs5882 | G/A | p.Val422Ile | 31.7 | 2.2 (0.3) | 2.5×10−13 | -- | -- | -- | -- | -- | -- | 0.94 (0.87 – 1.02) | 0.13 | |
| 15:58855748 | rs113298164 | T/C | p.Thr405Met | 0.7 | 10.5 (1.7) | 5.6×10−10 | 0.8 | 8.9 (1.9) | 1.9×10−6 | 0.75 | 9.6 (1.2) | 1.4×10−14 | 0.91 (0.66 – 1.25) | 0.56 | |
| 18:47109955 | rs77960347 | G/A | p.Asn396Ser | 1.6 | 5.7 (1.1) | 4.0×10−7 | 1.2 | 6.1 (1.5) | 3.1×10−5 | 1.4 | 5.8 (0.9) | 3.6×10−11 | 0.87 (0.69 – 1.09) | 0.24 | |
| 19:45412079 | rs7412 | T/C | p.Arg176Cys | 7.9 | −18.0 (1.6) | 1.4×10−27 | 6.1 | −20.9 (2.1) | 3.3×10−23 | 7.1 | −19.4 (1.3) | 2.1×10−48 | 0.88 (0.79 – 0.98) | 0.017 | |
| 2:44066247 | rs11887534 | C/G | p.Asp19His | 6.5 | −10.0 (1.8) | 4.4×10−8 | 6.3 | −9.5 (2.1) | 4.3×10−6 | 6.4 | −10.0 (1.4) | 5.7×10−13 | 0.93 (0.84 – 1.04) | 0.22 | |
| 2:44065090 | rs6756629 | A/G | p.Arg50Cys | 6.5 | −10.0 (1.8) | 4.7×10−8 | 6.3 | −9.5 (2.1) | 5.3×10−6 | 6.4 | −10.0 (1.4) | 9.3×10−13 | 0.93 (0.83 – 1.04) | 0.21 | |
| 19:19379549 | rs58542926 | T/C | p.Glu167Lys | 8.9 | −8.1 (1.6) | 2.7×10−7 | 7.9 | −4.7 (1.9) | 0.010 | 8.4 | −6.6 (1.2) | 4.4×10−8 | 0.87 (0.79 – 0.95) | 5.0×10−3 | |
| 11:116662407 | rs3135506 | C/G | p.Ser19Trp | 6.1 | 24.8 (4.0) | 4.1×10−10 | -- | -- | -- | -- | -- | -- | 0.92 (0.80 – 1.08) | 0.33 | |
| 8:19819724 | rs328 | G/C | p.Ser474X | 9.6 | −19.7 (3.3) | 1.3×10−9 | -- | -- | -- | -- | -- | -- | 0.96 (0.84 – 1.08) | 0.47 | |
| 19:8429323 | rs116843064 | A/G | p.Glu40Lys | 3.2 | −32.1 (5.5) | 8.2×10−9 | 2.5 | −18.6 (7.5) | 0.014 | 2.9 | −26.9 (4.6) | 4.2×10−9 | 0.78 (0.66 – 0.92) | 3.1×10−3 | |
Variants that reached significant association (P < 5×10−8) in stage 1 or stage 1+2 are listed, with positions given for hg19, [chromosome]:[base pair position]. Alleles are reported as minor/major on the forward strand of the reference genome. The “Variant” column lists predicted changes in amino acid sequence using ANNOVAR[34]. Effects are reported for the minor allele in mg/dL (±s.e.). LDL, low-density lipoprotein cholesterol; HDL, high-density lipoprotein cholesterol; MAF, minor allele frequency; s.e, standard error; MI, myocardial infarction; OR, odds ratio. MI association results were determined from 2,350 medical-record confirmed MI cases and 2,318 healthy controls in our Stage 1 samples. At the APOA5 locus, the p.Ser19Trp variant showed weaker evidence for association with triglyceride levels than the GWAS index SNP, indicating that the coding variant may not be functional.
variants intended for follow-up which failed array design.
Figure 2Functional follow-up in C57BL/6J mice implicates TMF6SF2 in lipid metabolism
This figure demonstrates the functional impact of altered TM6SF2 levels in C57BL/6J mice. (a) Transient overexpression of human TM6SF2 in C57BL/6J mice demonstrate an increase in fasting total cholesterol (2.3 fold, P = 9×10−4), triglyceride levels (1.1 fold, P = 0.031), LDL cholesterol levels (5.8 fold, P = 4×10−4) and a decrease in HDL cholesterol levels (0.45 fold, P = 9×10−4) five days after tail-vein injection, compared to LacZ controls (8 mice in each group). (b) Western blot demonstrating increased expression of TM6SF2 in mice injected with Ad-TM6SF2 compared to the Ad-LacZ control mice from panel a. (c) The mean increase in expression of TM6SF2 was quantitated as 2.4-fold (P = 0.0066, 5 mice in each group). (d) Knockdown of Tm6sf2 by tail-vein injection of adenovirus containing shTm6sf2 in C57BL/6J mice (0.15 OD per mouse) showed decreased levels of fasting total cholesterol (0.81 fold, P = 0.013) six days after injection, compared to Ad-shLacZ controls (8 mice in each group). (e) Western blot demonstrating decreased expression of TM6SF2 in mice injected with Ad-shTM6SF2 compared to control mice. (f) The mean decrease in expression of TM6SF2 was quantitated as 0.51 fold (P = 0.0375, 3 mice in each group). P values from Mann-Whitney U-test. Standard error of the mean indicated. *, P < 0.05; **, P < 0.01.