| Literature DB >> 25050552 |
Claudia Lamina1, Margot Haun1, Stefan Coassin1, Anita Kloss-Brandstätter1, Christian Gieger2, Annette Peters3, Harald Grallert4, Konstantin Strauch5, Thomas Meitinger6, Lyudmyla Kedenko7, Bernhard Paulweber7, Florian Kronenberg1.
Abstract
Structural genetic variants as short tandem repeats (STRs) are not targeted in SNP-based association studies and thus, their possible association signals are missed. We systematically searched for STRs in gene regions known to contribute to total cholesterol, HDL cholesterol, LDL cholesterol and triglyceride levels in two independent studies (KORA F4, n = 2553 and SAPHIR, n = 1648), resulting in 16 STRs that were finally evaluated. In a combined dataset of both studies, the sum of STR alleles was regressed on each phenotype, adjusted for age and sex. The association analyses were repeated for SNPs in a 200 kb region surrounding the respective STRs in the KORA F4 Study. Three STRs were significantly associated with total cholesterol (within LDLR, the APOA1/C3/A4/A5/BUD13 gene region and ABCG5/8), five with HDL cholesterol (3 within CETP, one in LPL and one inAPOA1/C3/A4/A5/BUD13), three with LDL cholesterol (LDLR, ABCG5/8 and CETP) and two with triglycerides (APOA1/C3/A4/A5/BUD13 and LPL). None of the investigated STRs, however, showed a significant association after adjusting for the lead or adjacent SNPs within that gene region. The evaluated STRs were found to be well tagged by the lead SNP within the respective gene regions. Therefore, the STRs reflect the association signals based on surrounding SNPs. In conclusion, none of the STRs contributed additionally to the SNP-based association signals identified in GWAS on lipid traits.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25050552 PMCID: PMC4106801 DOI: 10.1371/journal.pone.0102113
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Gene regions for which STRs have been selected, reporting the magnitude of p-values taken from Teslovich et al. for all four investigated lipid phenotypes together with the respective lead SNP.
| Gene/Gene region | STR in or near gene | Magnitude of p-value from the association with the following traits | Lead-SNP | |||
| TC | LDL | HDL | TG | |||
|
| ABCG5 | 10 | 10 | rs4299376 | ||
|
| BUD13 | 10 | 10 | 10 | 10 | rs964184 |
|
| CETP_1 | 10 | 10 | 10 | 10 | rs3764261 |
|
| CETP_2 | 10 | 10 | 10 | 10 | rs3764261 |
|
| CETP_3 | 10 | 10 | 10 | 10 | rs3764261 |
|
| FRMD5 | 10 | rs2929282 | |||
|
| HNF1A | 10 | 10 | rs1169288 | ||
|
| HNF4A | 10 | 10 | rs1800961 | ||
|
| JMJD1C | 10 | rs10761731 | |||
|
| LDLR | 10 | 10 | rs6511720 | ||
|
| LIPG | 10 | 10 | rs7241918 | ||
|
| LPA | 10 | 10 | 10 | rs1564348 (TC, LDL), rs1084651 (HDL) | |
|
| LPL | 10 | 10 | rs12678919 | ||
|
| SCARB1 | 10 | rs838880 | |||
|
| TOP1 | 10 | 10 | rs6029526 | ||
|
| TRIB1 | 10 | 10 | 10 | 10 | rs2954029 |
according to Teslovich et al. [1]; with the exception of LPA, the reported lead SNP in each gene region is the same for all associated lipid phenotypes.
Characteristics and localization of selected STRs.
| STR in gene | Repeat | Position of STR according to HG build 19 | Position in gene | Minimum/Median/Maximum of number of repeats in | |||
| Chr | Start (bp) | End (bp) | KORA F4 | SAPHIR | |||
| ABCG5 | AAT | 2 | 44060592 | 44060473 | Intron1 | 6/12/18 | 8/12/17 |
| BUD13 | TAT | 11 | 116638744 | 116638685 | Intron 2 | 6/12/18 | 8/12/17 |
| CETP_1 | TAT | 16 | 56989702 | 56989761 | ∼5 kb upstream | 3/9/17 | 3/9/16 |
| CETP_2 | TTTA | 16 | 56997877 | 56997996 | Intron 2 | 7/12/16 | 7/12/16 |
| CETP_3 | GAAA | 16 | 56993722 | 56993961 | promoter | 37/48/56 | 38/48/56 |
| FRMD5 | GAT | 15 | 44379990 | 44379931 | Intron 1 | 11/15/20 | 11/15/20 |
| HNF1A | ATCT | 12 | 121425538 | 121425657 | Intron 1 | 7/11/14 | 8/11/14 |
| HNF4A | TTAT | 20 | 42982780 | 42982839 | ∼1.5 kb upstream | 6/10/13 | 7/10/12 |
| JMJD1C | TTG | 10 | 65151334 | 65151215 | Intron 1 | 8/12/15 | 8/12/14 |
| LDLR | TTG | 19 | 11202856 | 11202915 | Intron 1 | 7/10/18 | 7/10/17 |
| LIPG | AATA | 18 | 47083781 | 47083900 | ∼5 kb upstream | 5/9/11 | 5/9/11 |
| LPA | TTTTA | 6 | 161086738 | 161086619 | Intron 1 | 5/8/12 | 4/8/11 |
| LPL | TTAT | 8 | 19815455 | 19815574 | Intron 6 | 6/11/14 | 7/11/14 |
| SCARB1 | AAAGA | 12 | 125255821 | 125255702 | ∼6 kb downstream | 5/10/28 | 5/10/28 |
| TOP1 | AAAT | 20 | 39688036 | 39688095 | Intron 2 | 7/12/17 | 8/12/16 |
| TRIB1 | AAAC | 8 | 126440163 | 126440222 | ∼2 kb upstream | 7/11/14 | 7/11/14 |
Results of linear mixed models and linear models on the investigated lipid phenotypes for all STRs which are significantly associated with lipids in KORA F4 and SAPHIR combined: 1) regression of the sum of STR alleles on lipids in KORA F4 and SAPHIR combined, 2) regression of the sum of STR alleles on lipids in SAPHIR, 3) regression of the sum of STR alleles on lipids in KORA F4, 4) regression of the minor allele using the lead SNP (LS) in the gene region on lipids in KORA F4, 5) regression of the minor allele using the best SNP in the gene region (LS+/−100 kB) on lipids in KORA F4.
| STR/gene | SAPHIR & KORA F4 | SAPHIR study | KORA F4 study | KORA F4 study | r2 between lead SNP | KORA F4 study | |||||
| STR (sum of alleles) | STR (sum of alleles) | STR (sum of alleles) | Lead SNP | STR, adjusted for lead SNP | |||||||
| beta | p-value | beta | p-value | beta | p-value | beta | p-value | beta | p-value | ||
|
| |||||||||||
| LDLR | −1.3585 | 6.23E-07 | −1.1168 | 0.0232 | −1.4730 | 6.27E-06 | −8.0470 | 2.01E-06 | 0.8660 | −0.4522 | 0.6111 |
| BUD13 | 1.2125 | 1.42E-05 | 1.8036 | 1.04e-04 | 0.8646 | 0.0132 | 5.4035 | 3.56E-04 | 0.5668 | −0.0798 | 0.8800 |
| ABCG5 | 1.2864 | 6.51E-05 | 1.9165 | 1.93e-04 | 0.8436 | 0.0417 | 2.5913 | 0.0234 | 0.1556 | 0.5433 | 0.2277 |
|
| |||||||||||
| LDLR | −1.4080 | 9.96E-09 | −1.0408 | 0.0218 | −1.5796 | 4.79E-08 | −8.689 | 7.01E-09 | 0.8660 | −0.4325 | 0.5831 |
| ABCG5 | 1.1806 | 4.70E-05 | 1.6747 | 4.05e-04 | 0.8251 | 0.0246 | 1.8916 | 0.0622 | 0.1556 | 0.6331 | 0.1128 |
| CETP_1 | 0.6998 | 2.32E-05 | 0.6395 | 0.0199 | 0.7400 | 3.36E-04 | −2.2240 | 0.0265 | 0.3008 | 0.7265 | 0.0033 |
|
| |||||||||||
| CETP_1 | −0.4513 | 7.62E-12 | −0.3773 | 5.59e-04 | −0.5014 | 1.15E-09 | 4.0230 | 3.67E-24 | 0.3008 | −0.0715 | 0.4610 |
| CETP_2 | 0.5595 | 2.19E-14 | 0.5267 | 1.14e-05 | 0.5798 | 3.37E-10 | 4.0230 | 3.67E-24 | 0.0463 | 0.4015 | 1.55e-05 |
| CETP_3 | −0.3393 | 1.35E-28 | −0.3451 | 4.36e-11 | −0.3359 | 3.33E-19 | 4.0230 | 3.67E-24 | 0.8259 | −0.0085 | 0.9240 |
| LPL | 0.6805 | 1.08E-05 | 0.7685 | 0.0027 | 0.6140 | 0.0015 | 1.8011 | 0.0042 | 0.1609 | 0.4706 | 0.0259 |
| BUD13 | −0.3824 | 1.43E-04 | −0.7041 | 3.60e-05 | −0.1887 | 0.1278 | −1.2328 | 0.0216 | 0.5668 | 0.0275 | 0.884 |
|
| |||||||||||
| BUD13 | 0.0311 | 1.04E-15 | 0.0408 | 2.46e-10 | 0.0252 | 2.02E-07 | 0.1532 | 3.16e-13 | 0.5668 | −0.0011 | 0.8810 |
| LPL | −0.0208 | 5.34E-04 | −0.0272 | 0.0052 | −0.0167 | 0.0285 | −0.0963 | 9.88e-05 | 0.1609 | −0.0059 | 0.4772 |
All analyses are adjusted for age and sex.
according to Teslovich et al. [1]; beta effect estimate for the lead SNP refers to the minor allele, assuming an additive model.
Figure 1Regional plot showing the association of SNPs/STR in the LDLR region with LDL cholesterol.
LD refers to the lead SNP according to Teslovich et al. [1] (rs6511720); p-values of the STR in KORA F4 is marked as a star.
Figure 2Distribution of sum of LDLR-STR-alleles, separated for rs6511720 genotypes.
Figure 3Regional plot showing the association of SNPs/STR in the CETP region with HDL cholesterol.
LD refers to the lead SNP according to Teslovich et al. [1] (rs3764261); p-values of STRs in KORA F4 are marked as stars.
Figure 4Distribution of sum of STR-alleles, separated for genotypes:
A) Distribution of CETP_1, separated for rs3764261 genotypes, B) Distribution of CETP_3, separated for rs3764261 genotypes, C) Distribution of CETP_2, separated for rs3764261 genotypes, D) Distribution of CETP_2, separated for rs7203984 genotypes.