| Literature DB >> 27353450 |
Laura J Scott1, Michael R Erdos2, Jeroen R Huyghe1, Ryan P Welch1, Andrew T Beck1, Brooke N Wolford2, Peter S Chines2, John P Didion2, Narisu Narisu2, Heather M Stringham1, D Leland Taylor2,3, Anne U Jackson1, Swarooparani Vadlamudi4, Lori L Bonnycastle2, Leena Kinnunen5, Jouko Saramies6, Jouko Sundvall5, Ricardo D'Oliveira Albanus7, Anna Kiseleva7, John Hensley7, Gregory E Crawford8,9, Hui Jiang1, Xiaoquan Wen1, Richard M Watanabe10,11, Timo A Lakka12,13,14, Karen L Mohlke4, Markku Laakso15,16, Jaakko Tuomilehto17,18,19,20, Heikki A Koistinen5,21,22, Michael Boehnke1, Francis S Collins2, Stephen C J Parker7,23.
Abstract
Type 2 diabetes (T2D) results from the combined effects of genetic and environmental factors on multiple tissues over time. Of the >100 variants associated with T2D and related traits in genome-wide association studies (GWAS), >90% occur in non-coding regions, suggesting a strong regulatory component to T2D risk. Here to understand how T2D status, metabolic traits and genetic variation influence gene expression, we analyse skeletal muscle biopsies from 271 well-phenotyped Finnish participants with glucose tolerance ranging from normal to newly diagnosed T2D. We perform high-depth strand-specific mRNA-sequencing and dense genotyping. Computational integration of these data with epigenome data, including ATAC-seq on skeletal muscle, and transcriptome data across diverse tissues reveals that the tissue-specific genetic regulatory architecture of skeletal muscle is highly enriched in muscle stretch/super enhancers, including some that overlap T2D GWAS variants. In one such example, T2D risk alleles residing in a muscle stretch/super enhancer are linked to increased expression and alternative splicing of muscle-specific isoforms of ANK1.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27353450 PMCID: PMC4931250 DOI: 10.1038/ncomms11764
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Molecular profiling maps of skeletal muscle combined with dense phenotyping reveals insights about T2D.
(a) To understand the full spectrum of genetic variation and regulatory element usage in T2D-relevant tissue and across disease progression, we obtained skeletal muscle biopsies from the vastus lateralis of 271 well-phenotyped Finnish individuals with normal and impaired glucose tolerance, impaired fasting glucose or newly diagnosed, untreated T2D. (b) Correlogram of Spearman rank correlation coefficients for key metabolic traits. (c) Heatmap of GO terms for differentially expressed genes. For each trait, the 20 GO terms most significantly enriched for positive expression–trait association and the 20 GO terms most significantly enriched for negative expression–trait association were selected. GO terms were pruned to eliminate redundant terms. The terms were hierarchically clustered using the GO term enrichment beta. Darker red, stronger positive gene expression–trait association; darker blue, stronger negative association. Circle size represents number of significant GO terms.
Figure 2The genetic regulatory architecture of muscle-specific gene expression.
(a) Muscle cis-eQTL enrichment across chromatin states in diverse cell or tissue types11156364. (b) Genes that are both highly expressed in skeletal muscle and highly tissue-specific fall into muscle expression specificity index (mESI) decile 10; genes which are lowly and ubiquitously expressed fall in decile 1. (c) As mESI decile increases, genes have greater expression in skeletal muscle but not in other tissue types. (d,e) cis-eQTL genes stratified by mESI decile. cis-eQTLs which fall into muscle stretch enhancers (⩾3 kb) are significantly enriched for genes expressed highly and specifically in muscle versus cis-eQTLs in typical enhancers (≤800 bp).
Figure 3ATAC-seq maps in frozen skeletal muscle.
(a) ATAC-seq profiles in skeletal muscle replicates and combined compared with similar profiles in adipose21 and the lymphoblastoid cell line GM12878 (ref. 20), and to reference chromatin state maps in diverse cell or tissue types. (b) Skeletal muscle combined replicate ATAC-seq peak calls show enrichment for skeletal muscle-active chromatin states, which is pronounced at TSS-distal regions. (c) Example ATAC-seq footprint aggregate plots at CTCF and MYOD sites. (d) Relative genome coverage by different classes of skeletal muscle regulatory elements. (e) Skeletal muscle cis-eQTL enrichments at different mESI bins relative to the regulatory elements described in d.
Detection of independent genic cis-eQTLs associated with T2D and related traits.
| 2-h glucose | rs1019503 | A | 0.063 | 1.16 | 3 | 2.1 × 10−62 | 7.0 × 10−84 | ||
| 2-h glucose | rs1019503 | A | 0.063 | −0.97 | 7 | 8.1 × 10−36 | 7.8 × 10−39 | ||
| 2-h glucose | rs1019503 | A | 0.063 | −0.90 | 7 | 1.1 × 10−28 | 4.4 × 10−33 | ||
| Fasting glucose | rs11715915 | C | 0.012 | −0.61 | 1 | 1.3 × 10−10 | 5.8 × 10−32 | ||
| T2D | rs515071 | C | 1.18 | 1.01 | 10 | 1.9 × 10−19 | 2.0 × 10−24 | ||
| T2D | rs516946 | C | 1.09 | 1.01 | 10 | 1.9 × 10−19 | 2.0 × 10−24 | ||
| Fasting glucose | rs174550 | T | 0.022 | 0.77 | 3 | 4.2 × 10−18 | 3.2 × 10−20 | ||
| T2D | rs3132524 | G | 1.07 | −0.84 | 9 | 4.2 × 10−17 | 1.2 × 10−19 | ||
| T2D | rs3132524 | G | 1.07 | 0.79 | 2 | 4.6 × 10−15 | 1.9 × 10−14 | ||
| Fasting glucose | rs11715915 | C | 0.012 | −0.68 | 5 | 1.6 × 10−13 | 1.4 × 10−13 | ||
| T2D | rs849135 | G | 1.11 | −0.38 | 4 | 5.3 × 10−4 | 5.6 × 10−11 | ||
| T2D | rs5215 | C | 1.07 | −0.33 | NA | 1.5 × 10−2 | 1.4 × 10−10 | ||
| T2D | rs11787792 | A | 1.15 | 0.38 | 1 | 1.9 × 10−3 | 6.7 × 10−10 | ||
| Fasting glucose | rs340874 | C | 0.021 | 0.49 | 3 | 1.3 × 10−6 | 7.5 × 10−10 | ||
| T2D | rs340874 | C | 1.07 | 0.49 | 3 | 1.3 × 10−6 | 7.5 × 10−10 | ||
| T2D | rs9470794 | C | 1.12 | 0.90 | 9 | 5.1 × 10−8 | 1.4 × 10−9 |
2-h, 2 hour; eQTL, expression quantitative trait locus; GWAS, genome-wide association studies; SNP, single-nucleotide polymorphism; T2D, type 2 diabetes; TSS, transcription start site.
T2D and related trait GWAS and candidate gene-associated variants were tested for association with genes whose most distal TSS was within 1 Mb of the variant.
*We use GWAS to denote GWAS or candidate gene studies.
†T2D odds ratio (OR) or trait effect size for GWAS risk or higher trait level allele.
‡eQTL effect for GWAS risk or higher trait level allele.
§Decile 10 denotes most muscle-specific expression, NA denotes insufficient expression in the Illumina Body Map tissues to estimate specificity.
||15 most significant conditional cis-eQTL results (for the 4,545 tested GWAS-SNP–gene pairs) that also have an cis-eQTL q-value<0.05 (genome-wide).
¶See Supplementary Data 1 for ANK1 SNP rs515071 (high r2 with rs516946).
Figure 4T2D GWAS SNPs in a muscle-specific stretch enhancer of ANK1 provide mechanistic insights into T2D pathophysiology in skeletal muscle.
(a) Regional association plot showing the significance of SNPs (points) associated with expression of ANK1 (highlighted with red rectangle), where the best cis-eQTL rs516946 (purple point) is a T2D GWAS SNP. (b) UCSC genome browser view of chromatin states near ANK1 isoforms. The chromatin states between skeletal muscle and other T2D relevant cell types (adipose, liver, islets) are markedly different. ANK1 is associated with hereditary spherocytosis, a disease of the red blood cells, which is consistent with the transcribed chromatin states in K562, a myelogeneous leukaemia line of the erythroleukemia type. T2D and related trait GWAS SNPs (dark green) and SNPs in strong LD (r2⩾0.8; blue) are found within muscle-specific stretch enhancers. ANK1 expression is shown for each isoform and normalized so that the sum over all isoforms is 1. (c) The chromatin states across T2D relevant tissues or cells for SNPs in strong LD (r2⩾0.8) with T2D GWAS SNP rs516946. Location of the ATAC-seq peak is noted. Asterisks denote SNPs that reside in stretch (⩾3 kb) enhancers. Colour-coding of chromatin is as shown in b. (d) EMSA using human skeletal muscle cell (SkMC) nuclear extract demonstrates allele-specific binding (see lower horizontal arrow on the left side of the gel) for the non-risk allele (A) of rs508419. A supershift (see upper horizontal arrow on the left side of the gel) using the TR4 antibody shows that TR4 participates in the allele-specific binding.
Figure 5Truncated ANK1 isoforms are expressed in muscle and have differential splicing associated with rs508419 genotype.
(a) The four short isoforms of ANK1. (b) The mean expression of each isoform normalized by the total ANK1 expression is stratified by genotype; error bars represent ±1 s.d. N=8/82/174 for risk allele dosages 0/1/2. (c) The direction of effect (β) for splicing associated with the rs508419 risk allele (y-axis labels are the same as in b).