| Literature DB >> 26472073 |
Jennifer L Caswell1, Roman Camarda2, Alicia Y Zhou2, Scott Huntsman3, Donglei Hu3, Steven E Brenner4, Noah Zaitlen5, Andrei Goga2, Elad Ziv3.
Abstract
Genome-wide association studies have identified over 70 single-nucleotide polymorphisms (SNPs) associated with breast cancer. A subset of these SNPs are associated with quantitative expression of nearby genes, but the functional effects of the majority remain unknown. We hypothesized that some risk SNPs may regulate alternative splicing. Using RNA-sequencing data from breast tumors and germline genotypes from The Cancer Genome Atlas, we tested the association between each risk SNP genotype and exon-, exon-exon junction- or transcript-specific expression of nearby genes. Six SNPs were associated with differential transcript expression of seven nearby genes at FDR < 0.05 (BABAM1, DCLRE1B/PHTF1, PEX14, RAD51L1, SRGAP2D and STXBP4). We next developed a Bayesian approach to evaluate, for each SNP, the overlap between the signal of association with breast cancer and the signal of association with alternative splicing. At one locus (SRGAP2D), this method eliminated the possibility that the breast cancer risk and the alternate splicing event were due to the same causal SNP. Lastly, at two loci, we identified the likely causal SNP for the alternative splicing event, and at one, functionally validated the effect of that SNP on alternative splicing using a minigene reporter assay. Our results suggest that the regulation of differential transcript isoform expression is the functional mechanism of some breast cancer risk SNPs and that we can use these associations to identify causal SNPs, target genes and the specific transcripts that may mediate breast cancer risk.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26472073 PMCID: PMC4664170 DOI: 10.1093/hmg/ddv432
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Figure 1.Flowchart for determining splicing QTL associations. We identified 13 SNP-gene associations through exon, junction and whole-transcript association tests with risk-associated SNPs; several associations were identified by multiple methods. After excluding SNP-gene associations that could not be corroborated with other tests, that could be related to the presence of pseudogenes or paralogs or that could have derived from mapping bias to the reference genome, seven SNP-gene associations remained.
Splicing QTLs identified in ER-positive tumors at FDR < 0.05, after exclusions
| SNP rsID | Gene | β (ER+) | FDR (ER+) | Predicted splicing pattern associated with breast cancer risk | ||
|---|---|---|---|---|---|---|
| Component | ||||||
| rs6504950 | Longer exon 6 (6 bp 5′): ↑ use of 3′ acceptor site at chr17:53 076 993 and ↓ use of chr17:53 076 987 | |||||
| Exon 5:6 junction 1 | −0.73 | 5.5E − 24 | 8.3E − 20 | 1.9E − 11 | ||
| Exon 5:6 junction 2 | 0.59 | 1.9E − 23 | 1.4E − 19 | 1.6E − 07 | ||
| Transcript uc010dcc | −0.42 | 3.5E − 11 | 4.4E − 08 | |||
| rs11552449a | ↑ exon 2 inclusion | |||||
| Transcript uc001eei | −0.64 | 8.6E − 14 | 1.6E − 10 | 2.4E − 06 | ||
| Transcript uc001eeg | 0.26 | 2.2E − 10 | 2.3E − 07 | 4.4E − 05 | ||
| Exon 1:3 junction | −0.48 | 7.0E − 08 | 4.5E − 05 | 4.5E − 03 | ||
| Exon 2 | 0.24 | 2.7E − 08 | 2.1E − 05 | 6.1E − 06 | ||
| rs8170 | Longer exon 2 (38 bp 5′): ↑ use of 3′ acceptor site at chr19:17 379 565 | |||||
| Transcript uc002nfv | 0.47 | 3.0E − 08 | 2.1E − 05 | 8.2E − 06 | ||
| Transcript uc002nfu | −0.29 | 2.7E − 07 | 1.7E − 04 | 0.038 | ||
| rs11249433 | b | |||||
| Exon 2:3 junction | −0.39 | 1.3E − 08 | 1.2E − 05 | |||
| rs11552449a | ↑ inclusion of exons 1 and 2 | |||||
| Exon 1:2 junction | 0.47 | 3.0E − 08 | 2.1E − 05 | |||
| Exon 2 | 0.28 | 1.9E − 06 | 9.0E − 04 | |||
| rs616488 | ↓ transcript uc001arm (exons 1, 2, 6, 7) | |||||
| Exon 7 | −0.39 | 3.8E − 07 | 2.2E − 04 | 0.021 | ||
| Transcript uc001arm | −0.39 | 5.1E − 07 | 2.8E − 04 | |||
| rs999737 | ↓ transcript uc001xkf (exons 1–11, exon 14) | |||||
| Exon 15 | −0.33 | 2.3E − 05 | 9.1E − 03 |
β is for the effect of the breast cancer risk allele on the gene component.
bp, base pairs.
Transcripts are named according to UCSC ID. Genomic positions are for hg19 build.
P-values for ER-negative tumors displayed when is <0.05.
aSNP is associated with transcript expression of two different genes.
bSplicing QTL association not predicted to be linked to breast cancer given pattern of association at locus.
Figure 2.Alternative splice site usage in STXBP4 exon 6 based on rs6504950 genotype. (A) With the rs6504950 risk allele, virtually all STXBP4 exon 5–6 junction reads map to one junction, whereas with the non-risk allele, virtually all map to the other. (B) LocusZoom plot (23) displaying −log10 P-values for the association of each SNP within the window with STXBP4 exon 5–6 junction 1 by position. (C) The locus of rs11658717, the presumed causal SNP. The two alternative 3′ splice sites are highlighted in red. The minor allele of rs11658717 (G) is in high LD with the risk allele of rs6504950 (A). Screenshot from http://genome.ucsc.edu (24).
Figure 3.Alternative splice site usage in BABAM1 exon 2 based on rs8170 genotype. (A) Relative expression of RSEM reconstructed BABAM1 transcript uc002nfu decreases and uc002nfv increases with rs8170 risk genotype. (B) LocusZoom plot (23) displaying −log10 P-values for the association of each SNP within the window with BABAM1 uc002nfv transcript expression by position. (C) Locations of all candidate SNPs, defined as SNPs with r > 0.6 with rs8170, or LD unknown but splicing QTL association P-value of <1 × 10−6. SNPs are colored red if r > 0.8 and orange if r > 0.6. Screenshot from http://genome.ucsc.edu (24). (D) The locus of rs10424178, the presumed causal SNP. The two alternative 3′ splice sites are highlighted in red, and the two alternative branch points as identified by Human Splicing Finder (25) are highlighted in blue. The minor allele of rs10424178 (T) is in high LD with the risk allele of rs8170. Screenshot from http://genome.ucsc.edu (24). (E) Results of a six replicates of a minigene reporter vector assay, transfecting the major allele (C) or minor allele (T) of rs10424178. In each well, the lower band represents the shorter BABAM1 exon 2, as included in transcript uc002nfu, and the upper band represents the longer BABAM1 exon 2, as included in transcript uc002nfv; the identities of the bands were confirmed by sequencing. The percentages shown below each well are the intensity of the lower band divided by the sum of the intensities of the lower band and the upper band. In all six replicates, the percentage of the total bands represented by the shorter BABAM1 exon 2 is higher for the major allele than that for the minor (risk) allele.
Overlap of the posterior probabilities of the splicing QTL and case–control association statistics based on cumulative probability of case–control association of >0.95
| Target event | Best | Total number of SNPs | Number of SNPs to reach 0.95 probability case–control | Splicing QTL cumulative probability | |
|---|---|---|---|---|---|
| Locus 1 | 5.2 × 10−5 | 908 | 54 | 0.99 | |
| Locus 2 | 2.0 × 10−3 | 705 | 337 | 0.99 | |
| Locus 2 | 705 | 337 | >0.99 | ||
| Locus 3 | 5.1 × 10−3 | 588 | 452 | >0.99 | |
| Locus 4 | 4.0 × 10−8 | 37 | 3 | 4.3 × 10−9 | |
| Locus 5 | 8.3 × 10−4 | 537 | 90 | 0.60 | |
| Locus 5 | 537 | 90 | 0.97 | ||
| Locus 6 | 6.9 × 10−11 | 1151 | 11 | 9.2 × 10−8 | |
| Locus 6 | 1151 | 11 | 0.29 |
*P-value for BABAM1 was obtained from the ER-negative analysis from GAME-ON, because this locus was identified by a GWAS for ER-negative breast cancer.