| Literature DB >> 25888292 |
Yerbol Z Kurmangaliyev1,2, Alexander V Favorov3,4,5, Noha M Osman6,7, Kjong-Van Lehmann8, Daniel Campo9, Matthew P Salomon10, John Tower11, Mikhail S Gelfand12,13, Sergey V Nuzhdin14,15.
Abstract
BACKGROUND: Variation within splicing regulatory sequences often leads to differences in gene models among individuals within a species. Two alleles of the same gene may express transcripts with different exon/intron structures and consequently produce functionally different proteins. Matching genomic and transcriptomic data allows us to identify putative regulatory variants associated with changes in splicing patterns.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25888292 PMCID: PMC4373058 DOI: 10.1186/s12864-015-1415-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Genotype-specific transcriptomes. a: Common reference design. 81 inbred lines, which were derived from natural populations (natural lines), were crossed with one common line (tester line). Each heterozygous F1-hybrid had the same tester allele (green bars) and variable natural allele (dark blue bars). Transcriptomes of these F1-hybrids were sequenced and used to annotate and quantify genotype-specific splicing patterns. b: Annotation and quantification of alternative splicing events. We focused on two most common types of alternative splicing events, namely, alternative donor/acceptor sites and cassette exons. The counts of inclusion ( ) and exclusion ( ) junction reads were used to quantify each alternative splicing event in each F1-hybrid. Ψ-value represented fraction of the longer isoform among two alternate transcripts.
Figure 2Genotype-specific alternative acceptor site associated with -sQTL (a2016). A cryptic acceptor site activation and removal of a 206 bp segment that is associated with four cis-sQTL. All plots are shown for the most significant cis-sQTL. a-b: Exon/intron coverage plots of the genotype-specific splicing event are shown for the F1-hybrids that carried derived (a) and ancestral (b) variants of cis-sQTL on natural alleles. Tester alleles of all F1-hybrids carried ancestral variant of cis-sQTL. The coverage for each genomic positions is shown separately for exonic (blue) and intronic (yellow) parts of reads. The coverage from multiple genotypes that carry the same allele is merged. Exon/intron boundaries are marked by vertical dashed lines. The position of cis-sQTL is shown by the red arrow. The alternative gene models are shown under the coverage plots: blue rectangles and yellow lines correspond to exons and introns, respectively. c: The distributions of Ψ-values for alleles of this cis-sQTL. d: The Manhattan plot of association test log-p-values for all tested cis-SNPs. e: Q-Q plot for the observed (y-axis) and expected (x-axis) distributions of association p-values for the tested cis-SNPs (red dots) and 10000 random SNPs from other chromosomes (black dots). f: RT-PCR validation of gene models. The predicted lengths of PCR-products corresponding to the long and short isoforms were 337 bp and 131 bp, respectively. A/A: inbred line carrying only ancestral alleles (tester line); A/D: F1-hybrid carrying both ancestral and derived alleles; D/D: inbred line carrying only derived alleles (natural line).
Figure 3Genotype-specific alternative donor site associated with -sQTL in canonical splicing dinucleotide. De novo donor site and inclusion of an 84 bp segment that was associated with cis-sQTL in canonical splicing dinucleotide. a-b: Exon/intron coverage plots of the genotype-specific splicing event are shown for the F1-hybrids that carried derived (a) and ancestral (b) variants of cis-sQTL on natural alleles. Tester alleles of all F1-hybrids carried ancestral variant of cis-sQTL. c: The distributions of Ψ-values for alleles of this cis-sQTL. d: RT-PCR validation of gene models. The predicted lengths of PCR-products corresponding to the long and short isoforms were 234 bp and 150 bp, respectively. For other details see the legend to Figure 2.
Figure 4Allelic imbalance of genotype-specific cassette exon inclusion associated with -sQTL (a625). Genotype-specific cassette exon that exhibits strong allelic imbalance is shown. The allele-specific exon/intron coverage of genotype-specific splicing event are shown for the F1-hybrids that carried derived (ad) and ancestral (be) variants of cis-sQTL on natural alleles. Tester alleles of all F1-hybrids carried ancestral variant of cis-sQTL. The plots were created separately for the reads expressed from natural (ab) and tester alleles (de). Only genotypes with estimated ns-Ψ and ts-Ψ-values were considered. cf: The distributions of ns-Ψ (c) and ts-Ψ-values (c) for alleles of cis-sQTL. g: RT-PCR validation of gene models. The predicted lengths of PCR-products corresponding to the long and short isoforms are 231 bp and 102 bp, respectively. For other details see the legend to Figure 2.
Figure 5Distributions of allele-specific ΔΨ-values. ΔΨ-values as calculated for natural allele-specific reads (ns-ΔΨ), for tester allele-specific reads (ts-ΔΨ) and for all reads (ΔΨ). Values were calculated for cis-sQTL with sufficient coverage for calculation of both values for at least one F1-hybrid for each variant.
Comparison of genotype-specific splicing events with the overall set of alternative splicing events
|
|
|
| |
|---|---|---|---|
| Total number of events | 59 | 2744 | |
| Fraction of event types | |||
|
| 51% | 46% | n/s |
|
| 42% | 31% | n/s |
|
| 7% | 23% | 0.01 |
| Fraction of event in UTRs | 66% | 29% | 4.88 × 10−9 |
| Fraction of frame-preserving events* | 90% | 78% | n/s |
| Median length of alternative gene segment (bp)* | 9.5 | 20.5 | 2.2 × 10−3** |
| Fraction of unannotated events | 75% | 31% | 1.45 × 10−11 |
The percents represent fractions within the “Total number of alternative splicing events”. * – In case of alternative donor/acceptor sites, the lengths of alternative gene segments were calculated as distance between alternative splice sites. This is inapplicable for terminal alternative splicing events (i.e. alternative first you set and last exons). Thus, on the estimation of the fraction of frame-preserving splicing events and median length of alternative gene segments, we exclude splicing events with more than 500 bp between alternative splice sites. ** – p-value for Wilcoxon rank-sum test.