| Literature DB >> 30446528 |
Chun Jimmie Ye1, Jenny Chen2,3, Alexandra-Chloé Villani2,4, Rachel E Gate1,5, Meena Subramaniam1,5, Tushar Bhangale6, Mark N Lee2,4,7, Towfique Raj2,7,8, Raktima Raychowdhury2, Weibo Li2, Noga Rogel2, Sean Simmons2, Selina H Imboywa7, Portia I Chipendo7, Cristin McCabe2,8, Michelle H Lee7, Irene Y Frohlich7, Barbara E Stranger9, Philip L De Jager2,7,8, Aviv Regev2,10,11, Tim Behrens6, Nir Hacohen2,4.
Abstract
While genetic variants are known to be associated with overall gene abundance in stimulated immune cells, less is known about their effects on alternative isoform usage. By analyzing RNA-seq profiles of monocyte-derived dendritic cells from 243 individuals, we uncovered thousands of unannotated isoforms synthesized in response to influenza infection and type 1 interferon stimulation. We identified more than a thousand quantitative trait loci (QTLs) associated with alternate isoform usage (isoQTLs), many of which are independent of expression QTLs (eQTLs) for the same gene. Compared with eQTLs, isoQTLs are enriched for splice sites and untranslated regions, but depleted of sequences upstream of annotated transcription start sites. Both eQTLs and isoQTLs explain a significant proportion of the disease heritability attributed to common genetic variants. At the ERAP2 locus, we shed light on the function of the gene and how two frequent, highly differentiated haplotypes with intermediate frequencies could be maintained by balancing selection. At baseline and following type 1 interferon stimulation, the major haplotype is associated with low ERAP2 expression caused by nonsense-mediated decay, while the minor haplotype, known to increase Crohn's disease risk, is associated with high ERAP2 expression. In response to influenza infection, we found two uncharacterized isoforms expressed from the major haplotype, likely the result of multiple perfectly linked variants affecting the transcription and splicing at the locus. Thus, genetic variants at a single locus could modulate independent gene regulatory processes in innate immune responses and, in the case of ERAP2, may confer a historical fitness advantage in response to virus.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30446528 PMCID: PMC6280757 DOI: 10.1101/gr.240390.118
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Transcriptome changes in response to stimulation. (A) Scatter plot of log2 fold change of gene abundance (x-axis) versus log2 fold change of isoform usage percentage (y-axis) in flu-infected (left) and IFNB1-simulated (right) cells compared with baseline. Each dot represents one isoform. Isoforms that significantly differed in their usage (beta regression, FDR < 0.05) are highlighted in red. (B) Clustering of isoform usage ratios in baseline, flu-infected, and IFNB1-stimulated cells. Heatmap colors are row scaled (red indicates row maximum, blue indicates row minimum; left). Violin plots (right) summarize the usages of all isoforms within a cluster separated by condition. Only isoforms (one per gene) that most significantly changed (beta regression, FDR < 0.05) in usage are shown. (C,D) De novo constructed isoforms (top), gene abundance (second row), and isoform abundance (third row) and usage percentage (bottom) for TLR4 (C) and CASP8 (D).
Figure 2.Properties of local eQTLs and isoQTLs. (A) Frequency (y-axis) of the location of leading SNPs for local eQTLs and isoQTLs (permutation FDR < 0.05) with respect to meta gene structure (x-axis). Genes are normalized to five exonic regions (E1–E4 indicate exons 1 through 4; E* indicates exon 5 to the last exon) and four intronic regions (introns 1, 2, 3, and from intron 4 to the last intron). Upstream and downstream sequences are divided into 100-kb windows. (B) Log2 fold enrichment (x-axis) of leading SNPs for local eQTLs and isoQTLs (permutation FDR < 0.05) for genomic annotations. eQTL enrichments are calculated using a background set of SNPs matched for distance to TSS and allele frequency. IsoQTL enrichments are calculated with respect to a background set of eQTLs matched for distance to TSS and allele frequency. Multiple testing significance is indicated. (*) P < 0.05; (**) P < 0.01; (***) P < 0.001.
Figure 3.Comparison of local isoQTLs between conditions. (A) Correlation of effect sizes (R2) for significant local isoQTLs (permutation FDR < 0.05) between pairs of conditions. Transcripts are colored by differential expression (red indicates up-regulated in condition 2, y-axis; blue indicates up-regulated in condition 1, x-axis) for each pair of conditions. (B) Distributions of effect sizes (R2) for significant local isoQTLs (permutation FDR < 0.05) for each pair of conditions segregating genes based on expression in each condition. (C) De novo constructed transcript structure (top) and box-whisker plots (bottom three panels) between transcript quantitative traits (y-axis: log2(normalized gene abundance), log2(normalized isoform abundance), or isoform usage percentage) and genotype (x-axis) for three genes (IFI44L, WARS, and ZBP1) with risoQTLs.
Figure 4.GWAS enrichment of local eQTLs and isoQTLs. (A) Partitioned heritability analysis: proportion of SNP heritability explained (x-axis) for 28 traits (y-axis) by eQTLs (left) and isoQTLs (right). (DS) Depressive symptoms; (SWB) subject well-being; (IBD) inflammatory bowel disease; (SLE) systemic lupus erythematosus; (BMI) body mass index; (HDL) high density lipoprotein; (LDL) low density lipoprotein. (B) LocusZoom plots of the IRF7 region for SLE GWAS associations (top) and IRF7/Iso4 isoQTLs for baseline, flu-infected, or IFNB1-stimulated cells (bottom three panels). (y-axis) −log10(P-value) of association; (x-axis) genomic location. Points are colored based on LD to rs1061502. (C) Transcript structure (top) and box-whisker plots (bottom three panels) between IRF7 transcript quantitative traits (y-axis: log2(normalized gene abundance), log2(normalized isoform abundance), or isoform usage percentage) and rs1061502 genotype (x-axis). (D) Heatmap of genes distally associated (permutation FDR < 0.05) with risoQTL rs1061502. Heatmap colors are row-scaled TPM values (yellow indicates row maximum; purple indicates row minimum).
Figure 5.Genetics of ERAP2 regulation. (A) LocusZoom plot of Crohn's disease and ulcerative colitis associations at the ERAP2 locus. (y-axis) −log10(P-value) of association; (x-axis) genomic location. (B) Structures of transcripts derived from each haplotype (top) and box-whisker plots (bottom three panels) between ERAP2 transcript quantitative traits (y-axis: log2(normalized gene abundance), log2(normalized isoform abundance), or isoform usage percentage)) and genotype (x-axis). (C) Correlation between ERAP2/Iso1 (orange), ERAP2/Iso3 (light blue), and ERAP2/Iso4 (dark blue) abundance (y-axis) and abundance of flu segment 4 (x-axis) segregated by rs2248374 genotype (squares indicate aa; triangles, aG; circles, GG). (D) Western blot of MoDCs before and after flu-infection from five Haplotype B homozygotes and two heterozygotes. A full-length ERAP2 protein isoform is expected at 120 kDa. At least one flu-specific ERAP2 protein isoform is expected at 49 kDa. (E) A schematic of the hypothesized regulation and function of two ERAP2 haplotypes. (N) N terminal domain; (AP) amino peptidase domain; (H) hinge domain; (C) C terminal domain; (PTC) premature termination codon.