| Literature DB >> 21167051 |
Shengrong Lin1, Wenyi Wang, Curtis Palm, Ronald W Davis, Kara Juneau.
Abstract
BACKGROUND: A sensitive, high-throughput method for monitoring pre-mRNA splicing on a genomic scale is needed to understand the spectrum of alternatively spliced mRNA in human cells.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21167051 PMCID: PMC3022918 DOI: 10.1186/1471-2164-11-712
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1A schematic of the asMIP methodology. (A) The asMIP design. Unreacted probes terminate with 3'- and 5'-interrogation sequences (no. 1) abutted by two DraI cleavage sites (no. 2). Each probe contains two unique sequence tags (no. 3) and two common primer binding sites (no. 4). (B) The asMIP assay. Step 1, sample preparation: immobilized cDNA was reverse transcribed (arrow) from polyadenylated RNA (curved lines followed by AAAAAA) using oligo-dT primers (TTTTT), which were covalently attached to magnetic beads (gray circles). Following cDNA synthesis, RNA is digested and washed away. Step 2, asMIP hybridization: unreacted asMIP probes (flattened nicked circles) terminate with 3'- and 5'-interrogation sequences (gray and colored lines), which are homologous to the exon sequences that flank splice junctions on the cDNA. Probes quantitatively anneal to the appropriate exon-exon junction (colored probes). Probes that do not hybridize (gray probes) are washed away. Step 3, asMIP circularization: bound asMIPs are ligated into circles (small arrows). Step 4, asMIP amplification and quantitation: only the successfully ligated probes (contiguous circles) can be exponentially amplified using the common PCR primers (thin black lines). Each probe contains two unique sequence tags, which are amplified by PCR for multiplexed detection via array hybridization or high-throughput sequencing (bar graph).
asMIP targets
| Gene | tissue specificity | total | ||
|---|---|---|---|---|
| CAMK2D* | neural | 8 (4) | 5 (5) | 13 (9) |
| CLTB* | neural | 3 (3) | 3 (3) | 6 (6) |
| EHBP1* | neural | 8 (3) | 4 (4) | 12 (7) |
| ERC1* | neural | 8 (4) | 10 (10) | 18 (14) |
| FEZ2* | neural | 2 (2) | 9 (9) | 11 (11) |
| MARK4 | neural | 8 | 3 | 11 |
| MINK1 | neural | 11 | 10 | 21 |
| MLLT4 | neural | 9 | 10 | 19 |
| MYH10 | neural | 15 | 8 | 23 |
| MYO6 | neural | 10 | 6 | 16 |
| TPD52 | neural | 3 | 6 | 9 |
| TPM1 | muscle | 5 | 7 | 12 |
| TPM2 | muscle | 5 | 6 | 11 |
| TPM3 | muscle | 4 | 6 | 10 |
| ACTB | control | 5 | 0 | 5 |
| GAPDH | control | 8 | 0 | 8 |
| TUBA1B | control | 3 | 0 | 3 |
| totals: | 115 | 93 | 208 | |
Numbers in parentheses correspond to total junctions analyzed by qPCR.
1 The number of predicted constitutive exon junctions targeted by asMIPs indentified from RefSeq [21].
2 The number of potentially alternatively spliced exon junctions targeted identified from RefSeq [21], of which 58 were confirmed in the literature to be alternatively spliced in the tissues we studied [10,20].
* Genes that were assayed with both asMIPs and qPCR.
Figure 2The asMIP data correctly characterize splicing controls and correlate with qPCR measurements. Receiver Operating Characteristic (ROC) curves comparing asMIP data with (A) junctions known from the literature to be alternatively spliced and (B) qPCR data. Data from asMIPs quantified using sequencing is black, asMIP data from arrays is blue; R-score data are dashed lines; M-score data are solid lines. (A) An analysis of 157 tissue specific splicing events: 58 positive alternative splicing controls [10,20] and 99 predicted constitutive negative controls [21]. For this plot, only relevant pairs of tissues, known to show alternative splicing for the genes assayed, were included in the calculation of R- and M-scores; skeletal and stomach data were used for tropomyosin genes, stomach and frontal lobe for brain specific genes, placenta and frontal lobe for ERC1. Absolute values of these scores were used. To generate the standard for comparison, positive spicing controls were assigned a value of 1, negative constitutive controls were 0. (B) An analysis of the 235 splicing events that were studied using both qPCR and asMIPs. For this plot we transformed ternary splice scores (1, 0, and -1) into binary scores (non-zero versus zero). Because there were cases where the qPCR and asMIP scores had the opposite sign, the ROC curves do not reach the upper right-hand corner of the graph. Individual ROC curves for positive or negative splice calls are located in Additional File 4. The solid black circle marks the point on the M-score sequencing line that corresponds to the +/-1.3 cutoff, which was used to identify alternative splicing.
Evaluation of asMIPs using splicing controls and qPCR data
| Statistic | ||||
|---|---|---|---|---|
| AUC-controls1 | 0.90 | 0.96 | 0.89 | 0.96 |
| AUC-qPCR2 | 0.65 | 0.65 | 0.65 | 0.76 |
| CorrqPCR-controls3 ( | 0.84 | 0.86 | 0.63 | 0.82 |
| CorrqPCR-total2 ( | 0.44 | 0.64 | 0.29 | 0.59 |
The table includes results from both the R- and M-score analyses (R- and M-), as well as data from asMIPs quantified with either arrays (Array) or sequencing (Seq). AUC and Pearson correlation (Corr, ρ) values are reported.
1 asMIP data was compared to tissue specific, splicing controls from the literature: 58 tissue specific, alternatively spliced junctions [10,20] and 99 predicted constitutive junctions [21,34].
2 asMIP data was compared to the complete set of qPCR data, 235 tissue specific splice events were analyzed.
3 asMIP data from the set of splicing controls was compared to qPCR data from the same set of controls [10,20,21,34], 34 tissue specific splice events were analyzed.
Figure 3Tissue-specific alternative splicing of . Tissue-specific isoforms and splice-score data are color-coded; skeletal muscle is black, stomach is blue, and reference sample data is yellow. (A & B) Schematic representations of two muscle specific isoforms. Exons are represented by numbered boxes. Thin lines depict introns. (C & D) Graphs of the raw sequence counts for asMIPs interrogating the exon junctions specified (e.g. 4-5, 5-6 et cetera). Signal variation for constitutive junctions is due predominantly to sequence-based probe effects. (E & F) Graphs of the M-scores for each junction in which probe effects, gene expression differences and experimental variance have all been accounted for. The first five junctions listed for the gene are constitutively spliced and the remaining junctions are alternatively spliced. In (E & F) both the sequencing data (squares) and array data (diamonds) are presented for comparison. The gray dashed lines show the cutoff (+/-1.3) used to make splicing calls. The tissue-specific splicing of these control genes are clear; known constitutive junctions map near zero, alternatively spliced junctions all map above or below the cutoffs.
Figure 4Splice scores compare well with RT-PCR data. Tissue-specific alternative splicing data are shown for four genes: FEZ2 (A), TPD52 (B), CAMK2D (C) and MINK1 (D). In each panel schematic representations of the analyzed splice junctions are shown on top. Exons are depicted as boxes and numbered; lines connecting the exons specify exon junctions; arrows show the position of PCR primers used for the RT-PCR gel electrophoresis splicing analysis. Below the schematics are graphs that plot splice scores for three sets of data: sequenced asMIPs (squares), array quantified asMIPs (diamonds) and qPCR (triangles). The M-score data are plotted for five different tissues: placenta (PLA), skeletal muscle (SKM), stomach (STM), cerebellum (CEB) and frontal lobe (FRL). Analyzed exon junctions are color-coded and specified to the right of each graph (e.g. "5-6" corresponds to data for the junction between exons five and six). Gray dashed lines show the M-score cutoffs for qPCR (+/-2.7) and asMIP data (+/-1.3). Tissue-specific RT-PCR, gel electrophoresis data is pictured below each splice-score graph. PCR primers surrounding the analyzed exon junctions were used to amplify cDNA reverse transcribed from RNA extracted from the specified tissues. A reference sample (REF) containing reverse-transcribed total RNA from >20 human tissues is pictured to provide a pictorial approximation of baseline splicing for comparison with the graph. The exons contained within each PCR gel band are specified to the right of each gel.