Literature DB >> 22675076

Re-splicing of mature mRNA in cancer cells promotes activation of distant weak alternative splice sites.

Toshiki Kameyama1, Hitoshi Suzuki, Akila Mayeda.   

Abstract

Transcripts of the human tumor susceptibility gene 101 (TSG101) are aberrantly spliced in many cancers. A major aberrant splicing event on the TSG101 pre-mRNA involves joining of distant alternative 5' and 3' splice sites within exon 2 and exon 9, respectively, resulting in the extensive elimination of the mRNA. The estimated strengths of the alternative splice sites are much lower than those of authentic splice sites. We observed that the equivalent aberrant mRNA could be generated from an intron-less TSG101 gene expressed ectopically in breast cancer cells. Remarkably, we identified a pathway-specific endogenous lariat RNA consisting solely of exonic sequences, predicted to be generated by a re-splicing between exon 2 and exon 9 on the spliced mRNA. Our results provide evidence for a two-step splicing pathway in which the initial constitutive splicing removes all 14 authentic splice sites, thereby bringing the weak alternative splice sites into close proximity. We also demonstrate that aberrant multiple-exon skipping of the fragile histidine triad (FHIT) pre-mRNA in cancer cells occurs via re-splicing of spliced FHIT mRNA. The re-splicing of mature mRNA can potentially generate mutation-independent diversity in cancer transcriptomes. Conversely, a mechanism may exist in normal cells to prevent potentially deleterious mRNA re-splicing events.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22675076      PMCID: PMC3439910          DOI: 10.1093/nar/gks520

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Pre-mRNA splicing takes place precisely and efficiently in a spatially and temporally regulated manner to generate mRNAs for translation into active protein products. Defects in the splicing process are closely associated with the formation and development of cancers [reviewed in (1–3)]. Tumor susceptibility gene 101 (TSG101) was originally identified with a screen for potential tumor suppressors using a transformation assay in mouse NIH3T3 cells (4), but its role as a tumor suppressor gene remained controversial. As a component of the endosomal sorting complex required for transport I (ESCRT-I), the TSG101 protein mediates a variety of endosomal and non-endosomal functions, such as protein sorting, the biogenesis of multivesicular bodies, virus budding, stimulation of the cell cycle, regulation of transcription and the maintenance of epithelial cell polarity [reviewed in (5,6)]. The fragile histidine triad (FHIT) gene has also been identified as encoding a putative tumor suppressor, diadenosine triphosphate/tetraphosphate (Ap3A/Ap4A) hydrolase involved in the regulation of signaling pathways (7,8). Because ApA is a signaling molecule that responds to cellular stress and affects several cellular processes, FHIT protein plays an important role in the promotion of apoptosis and the control of the cell cycle in response to oxidative and replicative stress [reviewed in (9–11)]. TSG101 and FHIT pre-mRNAs are aberrantly spliced in many cancers, although no mutations have been found in the authentic splice sites of these pre-mRNAs in most cases [reviewed in (1,2)]. Some types of transcript variants have also been detected in normal tissues, but the overall complexity and frequency of their aberrant splicing are clearly increased in cancer tissues, suggesting a progressive loss of splicing fidelity during malignant transformation (7,12–24). Most of the aberrant mRNAs result from activation of distant weak splice sites accompanied by elimination of multiple exons. Considering that this aberrant splicing occurs via conventional one-step pathway, we do not yet know why the splicing machinery ignores many strong bona fide splice sites that exist between the activated 5′ and 3′ splice sites, which are conspicuously weak. By analyzing the typical cancer-specific aberrant splicing of TSG101 and FHIT pre-mRNAs, we have obtained considerable evidence that normal constitutive splicing precedes aberrant splicing, which clarifies the above issue. Our discovery of re-splicing of the mRNAs in cancer cells may reflect a general mechanism for alternative splicing between extremely distant 5′ and 3′ splice sites, which must be well controlled in normal cells.

MATERIALS AND METHODS

Full descriptions, including the detailed experimental conditions, are provided in Supplementary Materials and Methods.

Splice sites scoring methods

We calculated the 5′ and 3′ splice site scores using nine computer programs (Table 1), which are available online (the URLs are provided in Supplementary Materials and Methods). Detailed descriptions and the original references for the algorithms used have been given previously (25–27).
Table 1.

Strengths of the authentic and alternative splice sites in TSG101 pre-mRNA estimated with different scoring methods

5′ splice siteS&SaΔGbH-bondcNNdMAXENTeMMfMDDgWMMhSDi3′ splice siteS&SaNNdMAXENTeMMfWMMh
Intron 189.63−8.515.70.999.169.6213.9810.39−2.09Intron 190.890.907.857.418.19
Alternative67.27−0.610.70.786.336.249.483.264.05Intron 291.040.9911.5811.8613.87
Intron 286.25−7.216.10.9810.139.2214.089.29−2.17Intron 398.930.9811.2212.0014.07
Intron 354.14−5.514.90.928.397.3112.388.42−2.73Intron 487.150.967.638.9811.08
Intron 486.77−4.319.10.9910.169.7215.2810.22−2.99Intron 570.070.033.153.40−2.44
Intron 579.30−4.215.70.958.656.4913.187.50−2.96Intron 681.260.978.9011.4312.26
Intron 682.79−6.015.80.959.437.9813.488.23−2.43Intron 794.050.9913.0513.8114.67
Intron 787.40−7.318.71.0010.479.0915.089.00−2.27Intron 895.990.999.6210.519.43
Intron 876.53−4.214.00.988.015.5712.586.66−3.31Alternative80.740.316.676.415.76
Intron 977.74−4.112.30.927.646.0610.285.86−2.66Intron 982.220.896.828.4811.68

aS&S: Shapiro and Senapathy Score, a position-weight matrix.

bΔG: Free energy of the 5′ splice site–U1 snRNA duplex.

cH-Bond: An algorithm based on the hydrogen bonding of the 5′ splice site–U1 base pairing.

dNN: Neural network, a machine learning approach.

eMAXENT: Maximum entropy model, an algorithm that considers dependencies between positions.

fMM: First-order Markov model, an algorithm that considers dependencies between adjacent positions.

gMDD: Maximum dependence deposition, a decision-tree approach.

hWMM: Weight matrix model, a quantification of the relative likelihood of the candidate splice site sequence.

iSD: SD score, a common logarithm of the frequency of the 5′ splice sites used in the human genome.

Strengths of the authentic and alternative splice sites in TSG101 pre-mRNA estimated with different scoring methods aS&S: Shapiro and Senapathy Score, a position-weight matrix. bΔG: Free energy of the 5′ splice site–U1 snRNA duplex. cH-Bond: An algorithm based on the hydrogen bonding of the 5′ splice site–U1 base pairing. dNN: Neural network, a machine learning approach. eMAXENT: Maximum entropy model, an algorithm that considers dependencies between positions. fMM: First-order Markov model, an algorithm that considers dependencies between adjacent positions. gMDD: Maximum dependence deposition, a decision-tree approach. hWMM: Weight matrix model, a quantification of the relative likelihood of the candidate splice site sequence. iSD: SD score, a common logarithm of the frequency of the 5′ splice sites used in the human genome.

Construction of plasmids

The expression plasmids for the TSG101-EGFP fusion proteins (TSG101-EGFP, TSG101-EGFP[+] and TSG101-EGFP[−]; Figure 2C) were constructed by subcloning the corresponding polymerase chain reaction (PCR) fragments from the TSG101 gene together with PCR fragments from the pBI-EGFP plasmid (Clontech) into the pCXN2 mammalian expression vector (28). The FHIT-EGFP fusion plasmid (FHIT-EGFP; Figure 5C) was constructed with the same procedure. Overlap-extension PCR was performed with Ex Taq HS DNA polymerase (Takara Bio) and the corresponding primers (Invitrogen).
Figure 2.

Ectopically expressed intronless TSG101 gene (cDNA) is spliced and generates equivalent product as the endogenous aberrant mRNA in cancer cells. (A) The structure of the TSG101 mRNA. Red line and triangles indicate the postulated mRNA re-splicing via activated alternative 5′ and 3′ splice sites. Blue triangle indicates the PTC (1112–1114) generated by the postulated splicing. Black flags indicate the open reading frame (ORF; 127–1296). The positions of the PCR primers (P1–P4, P7–P10) are shown. (B) Using normal cells (HMEC) and cancer cells (MCF-7), endogenous TSG101 mRNAs were analyzed by RT-PCR (RT) and the genomic DNA was analyzed by PCR (Ge) with primer sets (P1–P4, P7–P10) annealed to the indicated exons. The black and red arrowheads denote the full-length and major aberrant mRNAs, respectively (Figure 1). (C) The structures of three reporter TSG101-EGFP plasmids (see (A) for the red and blue triangles). The postulated splicing-dependent EGFP expression of these plasmids is indicated with (−) and (+). The positions of the PCR primer sets (P1–P5, P3–P6) are shown. (D) Cancer cells (MCF-7) were transfected with the indicated TSG101-EGFP reporter plasmids and the EGFP expression was analyzed by fluorescence microscopy. (E) These ectopically expressed TSG101-EGFP transcripts were analyzed by RT-PCR. The black and red arrowheads indicate the unspliced and spliced products via the alternative splice sites, respectively. Two minor spliced products (indicated with * and **) were generated from different alternative 3′ splice sites, i.e. Δ190–776 (587-nt deletion) and Δ190–1236 (1047-nt deletion), respectively.

Figure 5.

Ectopically expressed intronless FHIT gene (cDNA) is spliced, generating a product equivalent to the endogenous aberrant mRNA in cancer cells. (A) Schematic representation of the FHIT pre-mRNA and the observed aberrant splicing in cancer cells (red line). The aligned sequences of the normal (full-length) and aberrant FHIT mRNAs reveal the skipping of exon 3 to exon 6. (B) Using normal cells (HMEC) and cancer cells (MCF-7), endogenous FHIT mRNAs (schematically shown) were analyzed by RT-PCR with the indicated primer sets (P25–P26, P25′–P26′). The full-length mRNA (black arrowhead) and aberrantly spliced mRNA (red arrowhead) are indicated. Black flags indicate the ORF (372–815) and the red line indicates the postulated mRNA re-splicing that skips the whole exon 3 to exon 6 region. (C) Cancer cells (MCF-7) were transfected with the FHIT-EGFP reporter plasmid (schematically shown) and the spliced products were analyzed by RT-PCR with the indicated primer sets (P25–P5, P25′–P6). The black and red arrowheads indicate the unspliced and spliced products, respectively. Sequence analysis mapped the spliced sites (red triangles) to the 5′ end of exon 3 and the 3′ end of exon 6 (data not shown).

Cell lines, transfection of cells and preparation of total RNA

Most of the indicated cell lines were purchased from Lonza, the Cell Resource Center for Biomedical Research (at the Tohoku University), the Cell Bank of the RIKEN BioResource Center and ATCC. MCF-7 cells were transiently transfected with indicated expression plasmids using Lipofectamine LTX (Invitrogen), according to the manufacturer’s instructions. At 24 h after transfection, the cells were examined for the expression of the green fluorescent protein (GFP) signal. At 48 h after transfection, their total cellular RNA was extracted with TRIzol reagent (Invitrogen) and digested with DNase I (Takara Bio).

RT-PCR detection of TSG101 and FHIT mRNAs

cDNA was synthesized from the total cellular RNA using PrimeScript reverse transcriptase (Takara Bio) and an oligo(dT) primer (Invitrogen), according to the manufacturers’ instructions. The sequences of all the indicated RT-PCR primers (Invitrogen) are listed in Supplementary Table S1. All the PCR products were analyzed by 2% agarose gel electrophoresis. The reaction mixture containing cDNA was used for the first PCR amplification (24 and 22 cycles for TSG101 and FHIT mRNAs, respectively) with Ex Taq HS DNA polymerase (Takara Bio) and the indicated primers (Figures 2A, B and 5B). After the amplified products were purified with Nucleospin Extract II (Macherey-Nagel), the second nested PCR was performed as described earlier, with the indicated inner primers (Figures 2A, B and 5B). The PCR was also performed with genomic DNA prepared from the cells, with the indicated primers (Figure 2A and B). To detect the spliced products from the TSG101-EGFP and FHIT-EGFP-transfected cells (Figures 2E and 5C), 1:100-diluted cDNA solutions, prepared with SuperScript III RNase H− reverse transcriptase (Invitrogen), were used for the first PCR (30 and 22 cycles, respectively) with the indicated primers (Figures 2C and 5C). After purification, as described earlier, 1:100-diluted PCR solutions were used for the second nested PCR, also with the indicated inner primers (Figures 2C and 5C).

Detection/identification of TSG101 and FHIT splicing products

The selective identification of lariat RNA products by RT-PCR of RNase R-digested samples was performed essentially as described previously (29). Total cellular RNA was digested with purified recombinant RNase R (provided by Dr A. Malhotra) at 37°C for 1 h, as described previously (29). All the PCR products were analyzed by 2% agarose gel electrophoresis. The linear RNA-free sample obtained was reverse transcribed using the PrimeScript reverse transcriptase (Takara Bio) or SuperScript III reverse transcriptase (Invitrogen) with random hexamer primers (Figure 3B) or random hexamer primers with lariat-specific primers (Figures 4A, 6 and 7A). The reaction mixture contained cDNA was used for the first PCR (35 or 36 and 40 cycles for the TSG101 and FHIT products, respectively) with PrimeSTAR Max or Ex Taq HS DNA polymerase (Takara Bio) and the indicated primers (Figures 3B, 4A, 6 and 7A). The amplified products were purified and used for the second nested PCR, as described earlier, with the corresponding inner primers. The isolated DNA fragments were subcloned into the pGEM-T Easy vector (Promega) and the sequences were verified.
Figure 3.

Detection of the lariat RNA consisting of only exons demonstrates the re-splicing of the endogenous TSG101 mRNA. (A) Schematic representation of the two postulated pathways leading to the generation of the aberrant mRNA, i.e. a conventional one-step direct aberrant splicing (right) and the proposed two-step process, including the re-splicing of the constitutively spliced mRNA (left). The pre-mRNA [1] and specific splicing products [2–5] from these two pathways were analyzed by RT-PCR with the indicated primers (arrows). (B) Detection of the specific splicing products [2–5] by RT-PCR using RNase R-digested (+) or RNase R-undigested (−) endogenous total RNA. The detected RT-PCR signals in the RNase R-digested sample (containing no linear RNAs) indicates an RNA species with either a 5′–2′ lariat or a 5′–3′ circular structure. However, the latter case was ruled out by the identification of a 5′–2′ branched structure (Figure 4A). Primers P19, P20, P21 and P22 anneal to introns 6, 7, 7 and 8, respectively. Primers P13/14 and P15/16 anneal to introns 7 and 8, respectively. Primers P11, P12, P17 and P18 anneal to exons 8, 10, 5 and 8, respectively. We used exactly the same PCR cycle numbers to amplify all these RNAs (Supplementary Materials and Methods).

Figure 4.

The lariat RNA containing TSG101 exon 2 to exon 9 was identified by sequencing. (A) Detection of the lariat exons by lariat-specific RT-PCR amplification across the branch point with the indicated primer sets (P23–P24, P23′–P24′). The RT-PCR signal remained in the RNase R-digested (+) RNA sample, revealing a closed lariat structure. (B) Sequencing analysis of the lariat-specific RT-PCR product. Electropherogram of the sequence containing the branch point (arrow), followed by the alternative 5′ splice site (arrow) in the lariat exons. (C) A sequence alignment of the PCR fragment (blue) and the TSG101 gene (black) reveals a 2′–5′ branched connection between the end of the alternative 5′ splice site (GT) and the branch point (A), which is located upstream from the alternative 3′ splice site (AG).

Figure 6.

Detection of a lariat RNA consisting only of exons demonstrates the re-splicing of the endogenous FHIT mRNA. RT-PCR analysis of the specific splicing products with the indicated primer sets (P27–P28, P29–P30, P31–P32) using RNase R-digested (+) or RNase R-undigested (−) endogenous total RNA. The RT-PCR signals in the RNase R-digested (+) sample (containing no linear RNAs) indicate either a lariat or circular structure, but a lariat structure was confirmed by the detection of the branched structure (Figure 7A). We used exactly the same number of PCR cycles to amplify all these RNAs (Supplementary Materials and Methods).

Figure 7.

The lariat RNA containing FHIT exon 3 to exon 6 was identified by sequencing. (A) Detection of the lariat exons by a lariat-specific RT-PCR across the branch point with the indicated primer sets (P33–P34, P33′–P34′). The RT-PCR signal remained in the RNase R-digested (+) RNA sample, revealing a closed lariat structure. (B) The sequence of the RT-PCR product is shown in an electropherogram. The alternative 5′ splice site is indicated. (C) A sequence alignment of the PCR fragment (blue) with the FHIT gene (black) reveals a branched connection between the end of the alternative 5′ splice site (GT) and the branch point (A), which is located upstream from the alternative 3′ splice site (AG). A gap at the branch point (indicated with hyphens) results from skipping that occurred during reverse transcription, as described previously (48).

RESULTS

Alternative splice sites are weaker than the authentic splice sites in TSG101 pre-mRNA

The most frequent aberrant splicing event on TSG101 pre-mRNA in cancers generates an mRNA with a 901-nucleotide (nt) deletion (Δ190–1090, formerly Δ154–1054) (12,13–18). It arises through activation of alternative 5′ and 3′ splice sites in distant exons 2 and 9, respectively, thus eliminating a large 37 834-nt span of sequence present in the pre-mRNA, which includes seven pairs of authentic splice sites (Figure 1) (16). Here, we analyzed the aberrant splicing that occurs in cancer cells, which can be classified as combinatorial alternative splicing, i.e. multi-exon skipping coupled to the selection of distant 5′ and 3′ splice sites (30).
Figure 1.

TSG101 pre-mRNA is aberrantly spliced in cancer cells by the activation of distant weak alternative splice sites. The structure of the TSG101 pre-mRNA and the major aberrant splicing often observed in various cancers are shown schematically. The sequences of the normal and aberrant TSG101 mRNAs are aligned with the encoded amino acids. The premature termination codon (PTC) generated in the aberrant mRNA is indicated.

TSG101 pre-mRNA is aberrantly spliced in cancer cells by the activation of distant weak alternative splice sites. The structure of the TSG101 pre-mRNA and the major aberrant splicing often observed in various cancers are shown schematically. The sequences of the normal and aberrant TSG101 mRNAs are aligned with the encoded amino acids. The premature termination codon (PTC) generated in the aberrant mRNA is indicated. The sequence-dependent strength of splice sites is a crucial determinant of their utilization (31,32). Therefore, we first statistically estimated the strengths of all the authentic and alternative splice sites on the TSG101 pre-mRNA with nine computer programs, which produced consistent results (Table 1). The scores indicated that the 5′ and 3′ alternative splice sites are significantly weaker than all the authentic constitutive splice sites, with the exception of one authentic 3′ splice site (numbers in bold, Table 1). Weak alternative splice sites activated when the adjacent strong authentic splice sites are destroyed by mutation, but are not otherwise utilized, are termed ‘cryptic’ splice sites (31,32). The adjacent authentic splice sites were not mutated in the TSG101 gene tested, and the aberrant mRNA was indeed detected in several tissues from fetuses (12). Therefore, the activated splice sites in cancers are not cryptic but alternative. We postulated that a natural mechanism that eliminates the competing authentic splice sites must precede activation of the alternative splice sites. Using the TSG101 pre-mRNA as a model substrate, we tested the hypothesis that the product of the normal constitutive splicing process is re-spliced in cancer cells.

The major aberrant TSG101 mRNA is detected in various cancer cells with different efficiencies

We first confirmed that the endogenous production of a reported major aberrant product with a 901-nt deletion (Figure 2A) was recapitulated in various cancer cell lines with varying efficiencies, whereas it was not detectable in normal cells derived from adult tissues (Figure 2B, lane ‘RT’, and Supplementary Figure S1A). In the series of breast cancer cell lines, a modest correlation was found between the tumor stage and the efficiency of aberrant splicing (Supplementary Figure S1B). These data suggest that this specific aberrant splicing event is under the control of certain trans-acting factor(s), which could be deficient or over-expressed to varying degrees in cancer cells. Ectopically expressed intronless TSG101 gene (cDNA) is spliced and generates equivalent product as the endogenous aberrant mRNA in cancer cells. (A) The structure of the TSG101 mRNA. Red line and triangles indicate the postulated mRNA re-splicing via activated alternative 5′ and 3′ splice sites. Blue triangle indicates the PTC (1112–1114) generated by the postulated splicing. Black flags indicate the open reading frame (ORF; 127–1296). The positions of the PCR primers (P1–P4, P7–P10) are shown. (B) Using normal cells (HMEC) and cancer cells (MCF-7), endogenous TSG101 mRNAs were analyzed by RT-PCR (RT) and the genomic DNA was analyzed by PCR (Ge) with primer sets (P1–P4, P7–P10) annealed to the indicated exons. The black and red arrowheads denote the full-length and major aberrant mRNAs, respectively (Figure 1). (C) The structures of three reporter TSG101-EGFP plasmids (see (A) for the red and blue triangles). The postulated splicing-dependent EGFP expression of these plasmids is indicated with (−) and (+). The positions of the PCR primer sets (P1–P5, P3–P6) are shown. (D) Cancer cells (MCF-7) were transfected with the indicated TSG101-EGFP reporter plasmids and the EGFP expression was analyzed by fluorescence microscopy. (E) These ectopically expressed TSG101-EGFP transcripts were analyzed by RT-PCR. The black and red arrowheads indicate the unspliced and spliced products via the alternative splice sites, respectively. Two minor spliced products (indicated with * and **) were generated from different alternative 3′ splice sites, i.e. Δ190–776 (587-nt deletion) and Δ190–1236 (1047-nt deletion), respectively. To evaluate the relative abundance of this particular aberrant mRNA and normal mRNA in these breast cancer cell lines, we performed single-round RT-PCR (no nested PCR) with specific primer sets that exclusively amplified either the aberrant mRNA or normal mRNA (Supplementary Figure S1C, lower gel picture and map). The semi-quantitative RT-PCR products revealed the substantial production of this aberrant mRNA in these breast cancer cell lines (MCF-7 and HCC series), whereas it was barely detectable in normal cells (HMEC) (Supplementary Figure S1C, upper histogram), consistent with previous reports (13,15,18). Notably, this particular aberrant product was often generated predominantly or even exclusively in the native tissues from patients with malignant or metastatic cancer (13–15,18). We also observed an overall increase in normal TSG101 mRNA production in breast cancer cells (MCF-7 and HCC series) compared with that in normal cells (HMEC), consistent with the finding that TSG101 expression is up-regulated in advanced stages of various human cancers (33–37). No processed TSG101 pseudogene was detected in the human genome; however, a pseudogene has been identified in mouse, although its expression is very unlikely (38,39). To rule out possible aberrant TSG101 products generated from a processed TSG101 pseudogene, we performed a PCR analysis of human genomic DNA. We detected no processed TSG101 pseudogene with a primer set covering all the exons, whereas we observed the authentic TSG101 gene with primer sets that targeted exon 1 and exon 9 to exon 10 with intron 9 (Figure 2B, lane ‘Ge’). We thus confirmed that the observed aberrant TSG101 transcripts did not arise from a deleted gene or a processed pseudogene.

Ectopically expressed TSG101 cDNA can be spliced in cancer cells

If the aberrantly spliced TSG101 mRNA arises through re-splicing of a mature mRNA as we hypothesized, ectopically expressed TSG101 mRNA in cancer cells must be a substrate for further splicing. Consistent with this assumption, we found that an aberrantly spliced product with a 901-nt deletion that is equivalent to that from the endogenous TSG101 gene could be generated in a breast cancer cell line (MCF-7) transfected with a plasmid expressing an intronless TSG101 gene (cDNA). To detect the aberrant splicing of the TSG101 cDNA transcripts in vivo, we constructed three reporter minigenes fused to an enhanced GFP (EGFP) cDNA, so that the aberrant splicing could be monitored visually (Figure 2C). MCF-7 cells transfected with the TSG101-EGFP plasmid, which included the full-length (exons 1–10) TSG101 mRNA, expressed GFP properly in the Golgi complex, as previously reported (Figure 2D, TSG101-EGFP) (40). The TSG101-EGFP[+] reporter was designed so that EGFP could not be translated unless the aberrant splicing event occurred. Remarkably, the transfected cells expressed GFP and the fused protein showed a diffuse cellular distribution (Figure 2D, TSG101-EGFP[+]). This demonstrates that in-frame aberrant splicing via the alternative splice sites took place on the transcript. The TSG101-EGFP[−] reporter was a frame-shifted negative control that did not result in the translation of EGFP after the aberrant splicing event. This reporter did not express GFP in the transfected cells (Figure 2D, TSG101-EGFP[−]), confirming that the GFP fluorescence in the TSG101-EGFP[+] cells was not attributable to leaky expression. The transcripts generated from these reporter minigenes, together with those generated from the endogenous TSG101 gene, were then analyzed by RT-PCR (Figure 2E). The verified sequence of the spliced product from the transfected TSG101-EGFP[+] construct exactly matched the endogenous aberrant TSG101 mRNA (Supplementary Figure S2). These results demonstrate that cancer cells (MCF-7) can potentially splice mature TSG101 mRNA via the alternative sites.

Spliced TSG101 mRNA from the endogenous TSG101 gene is re-spliced in cancer cells

We next analyzed the aberrantly spliced products (with a 901-nt deletion) from the endogenous TSG101 gene, which could have been generated through two possible pathways (Figure 3A): by conventional alternative splicing in one step, or by splicing in two steps with re-splicing of the mature mRNA. In an effort to distinguish between these pathways, we searched for the lariat RNA molecules predicted to be generated in each case. If one-step aberrant splicing occurred on the nascent pre-mRNA, a large lariat product consisting of both exonic and intronic sequences would be predicted to exist, albeit transiently ([5] in Figure 3A). However, if aberrant re-splicing occurs on the endogenous mature mRNA, then a lariat product consisting solely of exonic sequences should be generated ([4] in Figure 3A). Detection of the lariat RNA consisting of only exons demonstrates the re-splicing of the endogenous TSG101 mRNA. (A) Schematic representation of the two postulated pathways leading to the generation of the aberrant mRNA, i.e. a conventional one-step direct aberrant splicing (right) and the proposed two-step process, including the re-splicing of the constitutively spliced mRNA (left). The pre-mRNA [1] and specific splicing products [2-5] from these two pathways were analyzed by RT-PCR with the indicated primers (arrows). (B) Detection of the specific splicing products [2-5] by RT-PCR using RNase R-digested (+) or RNase R-undigested (−) endogenous total RNA. The detected RT-PCR signals in the RNase R-digested sample (containing no linear RNAs) indicates an RNA species with either a 5′–2′ lariat or a 5′–3′ circular structure. However, the latter case was ruled out by the identification of a 5′–2′ branched structure (Figure 4A). Primers P19, P20, P21 and P22 anneal to introns 6, 7, 7 and 8, respectively. Primers P13/14 and P15/16 anneal to introns 7 and 8, respectively. Primers P11, P12, P17 and P18 anneal to exons 8, 10, 5 and 8, respectively. We used exactly the same PCR cycle numbers to amplify all these RNAs (Supplementary Materials and Methods). To distinguish between the two pathways, we took advantage of a sensitive method that we previously developed to selectively detect splicing-specific lariat RNAs. The method uses RT-PCR with RNase R (E. coli 3′ to 5′ exoribonuclease)-treated total cellular RNA, which lacks the majority of linear RNA species (29). We demonstrated previously that RNase R thoroughly degrades the abundant linear RNAs, including rRNAs, tRNAs, pre-mRNAs and mRNAs, while preserving the loop portion of lariat RNAs and fully circular RNAs (29). Here, we show that the RT-PCR signal produced by the mature TSG101 mRNA was completely abolished by RNase R digestion before the RT-PCR ([2] in Figure 3A and B), whereas the signals detected from the excised lariat introns produced by the normal constitutive splicing of endogenous TSG101 pre-mRNA were fully resistant to RNase R digestion ([3] in Figure 3A and B). In this RNase R-treated RNA sample, we could not detect any RT-PCR signals corresponding to a large lariat RNA that included exons and flanking introns ([5] in Figure 3A and B). Therefore, the signals in the RNase R-untreated sample could be derived from the pre-mRNA [1]. In contrast, we did detect a lariat RNA consisting of exonic sequences only, because the RT-PCR signal remained after RNase R digestion ([4] in Figure 3A and B). These data provide compelling evidence that the two-step pathway involving a re-splicing event on the spliced TSG101 mRNA does indeed occur (Figure 3A). However, because the large lariat molecule predicted to be generated by the one-step pathway might be more susceptible to breakage, rendering it RNase R-sensitive, we cannot completely exclude the possibility it also occurs. In theory, the exonic lariat RNA ([4] in Figure 3A) could also be produced from the large lariat RNA [5] through re-splicing of its intron portions. If this were the case, the large lariat RNA [5] must exist, albeit transiently, as the precursor of the exonic lariats [4]. We therefore performed RT-PCR at very high cycle number (35 cycles followed by 35 cycles of nested PCR). As no signal for the large lariat RNA [5] was detected, whereas all other predicted lariat RNAs in the total RNA sample were detected (Figure 3B) (29), we believe it is highly unlikely that this is the pathway through which the exonic lariat is generated. Our finding that the ectopically expressed intronless TSG101 cDNA (Figure 2A) was spliced at exactly the same alternative splice sites in the same cells provides further evidence in support of a re-splicing event on the constitutively spliced TSG101 mRNA (Figure 2E and Supplementary Figure S2).

Exonic lariat was verified as a re-spliced product from TSG101 mRNA

Because RNase R-digested RNA samples might contain not only lariat RNAs but also circular RNAs (29), we further confirmed that the exonic lariat product has the canonical structure through RT-PCR amplification across the branch site, an established technique for identifying trace amounts of splicing-specific lariat molecules (29,41–43). We performed the assay using a DNA primer set that targeted the branched exon junction (Figure 4A, left panel). We observed a discrete RT-PCR signal of 112 bp, which was resistant to RNase R digestion (Figure 4A, right panel), indicating that it is derived from an intact lariat structure, not a cleaved lariat or Y-shaped structure (29). Sequencing of the PCR product revealed that the fragment contained the junction between the alternative 5′ splice site and the branch point ‘A’ nucleotide located 19-nt upstream from the alternative 3′ splice site (Figure 4B and C), verifying its identity as a bona fide exonic lariat RNA excised by re-splicing. The lariat RNA containing TSG101 exon 2 to exon 9 was identified by sequencing. (A) Detection of the lariat exons by lariat-specific RT-PCR amplification across the branch point with the indicated primer sets (P23P24, P23′–P24′). The RT-PCR signal remained in the RNase R-digested (+) RNA sample, revealing a closed lariat structure. (B) Sequencing analysis of the lariat-specific RT-PCR product. Electropherogram of the sequence containing the branch point (arrow), followed by the alternative 5′ splice site (arrow) in the lariat exons. (C) A sequence alignment of the PCR fragment (blue) and the TSG101 gene (black) reveals a 2′–5′ branched connection between the end of the alternative 5′ splice site (GT) and the branch point (A), which is located upstream from the alternative 3′ splice site (AG). Altogether, we have provided compelling evidence for an aberrant re-splicing event that occurs on the constitutively spliced TSG101 mRNA in cancer cells.

A second mRNA re-splicing event was identified on the FHIT pre-mRNA

One of the major aberrant splicing events on the FHIT pre-mRNA in cancers generates an mRNA with a 412-nt deletion, extending from exon 3 to exon 6 (Figure 5A and B). This aberrant splicing eliminates a huge 1 189 164-nt span of the pre-mRNA sequence, which includes four exons and five flanking huge introns (Figure 5A) (14,24). By applying the same strategy and logic used in the analysis of TSG101, we show that the aberrant FHIT mRNA is also the product of re-splicing. In this case, the initial constitutive splicing events greatly shorten the distance between the activated 5′ and 3′ splice sites joined to produce the final mRNA (to <1/2800). Ectopically expressed intronless FHIT gene (cDNA) is spliced, generating a product equivalent to the endogenous aberrant mRNA in cancer cells. (A) Schematic representation of the FHIT pre-mRNA and the observed aberrant splicing in cancer cells (red line). The aligned sequences of the normal (full-length) and aberrant FHIT mRNAs reveal the skipping of exon 3 to exon 6. (B) Using normal cells (HMEC) and cancer cells (MCF-7), endogenous FHIT mRNAs (schematically shown) were analyzed by RT-PCR with the indicated primer sets (P25P26, P25′–P26′). The full-length mRNA (black arrowhead) and aberrantly spliced mRNA (red arrowhead) are indicated. Black flags indicate the ORF (372–815) and the red line indicates the postulated mRNA re-splicing that skips the whole exon 3 to exon 6 region. (C) Cancer cells (MCF-7) were transfected with the FHIT-EGFP reporter plasmid (schematically shown) and the spliced products were analyzed by RT-PCR with the indicated primer sets (P25–P5, P25′–P6). The black and red arrowheads indicate the unspliced and spliced products, respectively. Sequence analysis mapped the spliced sites (red triangles) to the 5′ end of exon 3 and the 3′ end of exon 6 (data not shown). We first confirmed that the endogenous generation of this aberrant FHIT mRNA was recapitulated in various cancer cell lines, including MCF-7, CaSki, HepG2 and HEK293 cells, whereas it was not detected in normal cells (HMEC) (Figure 5B, right panel; data not shown). We then showed that breast cancer cells (MCF-7), transfected with an intronless FHIT-EGFP construct, were able to splice the FHIT-EGFP transcripts at the exonic 5′ and 3′ splice sites, exactly at the 5′ end of exon 3 and the 3′ end of exon 6, respectively (Figure 5C). These results indicate that cancer cells can potentially splice mature FHIT mRNA via exonic splice sites, which is consistent with but not sufficient to prove the occurrence of re-splicing on endogenous FHIT mRNA. Therefore, we assayed for two-step versus one-step pathway-specific splicing products from the endogenous FHIT gene by RT-PCR (Figure 6). We were able to detect an RNase R-resistant RT-PCR signal for the excised lariat intron 5 (middle panel), whereas no RNase R-resistant RT-PCR signal for the predicted large lariat RNA containing exon 5 and the flanking introns was observed (top panel). RT-PCR performed under the same conditions did allow detection of a lariat RNA consisting of the entire exon 3 to exon 6 region in the RNase R-treated sample (lower panel). This experiment also confirmed that there are no introns between these exons in the lariat form, because intron 3 (∼220 kb), intron 4 (∼286 kb) and intron 5 (∼523 kb) are huge. It is highly unlikely that a pathway exists that generates exonic lariats from a large lariat containing exons and introns because no such large lariat, which must exist as an intermediate, was detected, even when a large number of PCR cycles was used. Finally, the canonical lariat structure, and the absence of a circular structure, was verified by RT-PCR across the branch site, followed by sequencing (Figure 7). Taken together, these data indicate that a re-splicing event occurs on the spliced FHIT mRNA, while providing no evidence for one-step direct splicing of the FHIT pre-mRNA. Detection of a lariat RNA consisting only of exons demonstrates the re-splicing of the endogenous FHIT mRNA. RT-PCR analysis of the specific splicing products with the indicated primer sets (P27P28, P29–P30, P31–P32) using RNase R-digested (+) or RNase R-undigested (−) endogenous total RNA. The RT-PCR signals in the RNase R-digested (+) sample (containing no linear RNAs) indicate either a lariat or circular structure, but a lariat structure was confirmed by the detection of the branched structure (Figure 7A). We used exactly the same number of PCR cycles to amplify all these RNAs (Supplementary Materials and Methods). The lariat RNA containing FHIT exon 3 to exon 6 was identified by sequencing. (A) Detection of the lariat exons by a lariat-specific RT-PCR across the branch point with the indicated primer sets (P33P34, P33′–P34′). The RT-PCR signal remained in the RNase R-digested (+) RNA sample, revealing a closed lariat structure. (B) The sequence of the RT-PCR product is shown in an electropherogram. The alternative 5′ splice site is indicated. (C) A sequence alignment of the PCR fragment (blue) with the FHIT gene (black) reveals a branched connection between the end of the alternative 5′ splice site (GT) and the branch point (A), which is located upstream from the alternative 3′ splice site (AG). A gap at the branch point (indicated with hyphens) results from skipping that occurred during reverse transcription, as described previously (48). The discovery of a second case of mRNA re-splicing supports the generality of this pathway, and argues against the possibility that mis-splicing is unique to a single gene. An extensive analysis of exonic lariat molecules in the whole human transcriptome is underway.

DISCUSSION

Re-splicing of constitutively spliced mRNA is a novel pathway of multi-step splicing

Previously, two cases of multi-step splicing that occur on particular pre-mRNAs were reported (Figure 8, left column). (i) ‘Recursive splicing’ in the fruit fly Ultrabithorax (Ubx) gene and several other genes, i.e. the stepwise removal of introns by sequential re-splicing at composite 3′/5′ splice sites (0-nt length exons) (42–44); to date, recursive splicing has not been found in human or other vertebrate cells. Interestingly, the alternative use, but not the sequential use, of a composite 3′/5′ splice site, termed dual-specificity splice site, was reported in both human and mouse (45); in contrast to recursive splicing or the events described here, this is not multi-step splicing. (ii) ‘Intrasplicing’ in the human EPB41 gene (encoding the 4.1R protein), i.e. prerequisite internal splicing followed by specific external splicing (joining the far upstream promoter to an alternative 3′ splice site) (46).
Figure 8.

Different types of multi-step pre-mRNA splicing pathways. ‘Recursive splicing’ and ‘intrasplicing’ have been previously identified in Drosophila Ubx pre-mRNA and human EPB41 pre-mRNA, respectively (42,44,46). Novel multi-step splicing pathways were identified in human TSG101 and FHIT pre-mRNAs (right column), which are distinctive from the previously known multi-step splicing (left column). Each pathway of multi-step splicing is represented here with minimal numbers of exons/introns. 5′ and 3′ denote the 5′ splice site and the 3′ splice site, respectively. The black and gray colors indicate the active and inactive splice sites, respectively, in the each process.

Different types of multi-step pre-mRNA splicing pathways. ‘Recursive splicing’ and ‘intrasplicing’ have been previously identified in Drosophila Ubx pre-mRNA and human EPB41 pre-mRNA, respectively (42,44,46). Novel multi-step splicing pathways were identified in human TSG101 and FHIT pre-mRNAs (right column), which are distinctive from the previously known multi-step splicing (left column). Each pathway of multi-step splicing is represented here with minimal numbers of exons/introns. 5′ and 3′ denote the 5′ splice site and the 3′ splice site, respectively. The black and gray colors indicate the active and inactive splice sites, respectively, in the each process. In the known instances of pre-mRNA re-splicing, intronic sequences are partially removed to generate intermediates that are eventually spliced to produce the final mature mRNAs (Figure 8, left column). A specific intronic element, i.e. a composite 3′/5′ splice site, is essential for the subsequent splicing step in the case of ‘recursive splicing’. In contrast, we demonstrate that normal constitutive splicing occurs on TSG101 and FHIT pre-mRNAs and that the fully exonic mRNAs produced are substrates for subsequent aberrant re-splicing (Figure 8, right column). The constitutive splicing removes many competitive authentic splice sites, which should be important to allow activation of suboptimal exonic alternative splice sites on the mature mRNAs to produce the final aberrant mRNAs with internal deletions. All of the previously reported instances of pre-mRNA re-splicing take place on particular pre-mRNAs in normal cells, whereas mRNA re-splicing can be induced or activated on any pre-mRNAs under the abnormal or deregulated conditions in cancer cells. Considering the evidence together, we infer that the re-splicing of constitutively spliced mRNA is a distinct pathway of multi-step splicing. A fascinating question that remains to be answered is what causes these mature mRNAs to remain substrates for the splicing machinery, the identity of which is currently under investigation. Although both involve re-splicing, from the mechanistic viewpoint of splice-site activation, the pathways that generate aberrant TSG101 and FHIT mRNAs differ. In FHIT mRNA re-splicing, the new 5′ and 3′ splice sites are regenerated at the exon–exon junctions by the preceding constitutive splicing event (Figure 8, right column). Therefore, this process is analogous to ‘recursive splicing’ of the Ubx pre-mRNA, in which the first upstream splicing event creates a composite 5′ splice site with the joined upstream exon. In TSG101 mRNA re-splicing, the activated alternative splice sites are not altered by the preceding constitutive splicing event (right column); therefore, this process is similar to ‘intrasplicing,’ in which the distal splice site pair is already present in the pre-mRNA. All these types of multi-step splicing may partly share a common mechanism, which remains a fascinating issue to be explored.

Functional and physiological implications of mRNA re-splicing

The re-spliced TSG101 mRNA contains a premature termination codon (PTC) in exon 9, and the re-spliced FHIT mRNA lacks an initiation codon for the translation. Therefore, these particular aberrant mRNAs do not encode the corresponding proteins. Thus, the functional contribution of these aberrant mRNAs to tumorigenesis, if any, would most likely arise through negative regulation or partial silencing of the genes. Endogenous aberrant TSG101 mRNA, including the PTC, may be subject to nonsense-mediated mRNA decay (NMD), which might account for the low yield of aberrant TSG101 splicing we observed in cultured cells. The unprecedented pathway we have uncovered, re-splicing after constitutive splicing, might generally account for particular types of functional alternative splicing that occur over very long distances, eliminating multiple exons with their surrounding huge introns (30). The fact that aberrant mRNA re-splicing is induced in cancer cells implies that a mechanism controlling re-splicing operates in normal cells. Indiscriminate re-splicing in cancer cells could involve not only the deregulation of splicing but also deficiencies in NMD and mRNA export, which could allow further splicing of mRNAs containing potential splice sites. This scenario is consistent with the observation that aberrant proteins accumulate globally in cancer cells, irrespective of the corresponding genomic mutations [reviewed in (1,47)]. Our experimental system provides a potentially useful tool to investigate the presumptive mechanism that prevents ‘extra’ splicing of mRNAs, which could be important for the quality control of mRNAs to ensure that they can serve as proper blueprints for proteins.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Figures 1 and 2, Supplementary Table 1 and Supplementary Materials and Methods.

FUNDING

Grants-in-Aid for Challenging Exploratory Research, Japan Society for the Promotion of Science (JSPS) [21657033 and 23651198 to T.K., 20651051 to A.M.]; Grant-in-Aid for Scientific Research on Innovative Areas ‘Diversity and asymmetry achieved by RNA program’, Japan Ministry of Education, Culture, Sports, Science and Technology (MEXT) [23112722 to A.M.]; Program for the Strategic Research Foundation at Private Universities, MEXT (to A.M., in part). Funding for the open access charge: Grant-in-Aid, MEXT [23112722]. Conflict of interest statement. None declared.
  47 in total

1.  MAASE: an alternative splicing database designed for supporting splicing microarray applications.

Authors:  Christina L Zheng; Young-Soo Kwon; Hai-Ri Li; Kui Zhang; Gabriela Coutinho-Mansfield; Canzhu Yang; T Murlidharan Nair; Michael Gribskov; Xiang-Dong Fu
Journal:  RNA       Date:  2005-10-26       Impact factor: 4.942

2.  Subdivision of large introns in Drosophila by recursive splicing at nonexonic elements.

Authors:  James M Burnette; Etsuko Miyamoto-Sato; Marc A Schaub; Jamie Conklin; A Javier Lopez
Journal:  Genetics       Date:  2005-03-31       Impact factor: 4.562

3.  Stabilization and analysis of intron lariats in vivo.

Authors:  Jamie F Conklin; Aaron Goldman; A Javier Lopez
Journal:  Methods       Date:  2005-12       Impact factor: 3.608

Review 4.  Endosomal and non-endosomal functions of ESCRT proteins.

Authors:  Thomas Slagsvold; Krupa Pattni; Lene Malerød; Harald Stenmark
Journal:  Trends Cell Biol       Date:  2006-05-22       Impact factor: 20.808

5.  Significant increase of a specific variant TSG101 transcript during the progression of cervical neoplasia.

Authors:  R Klaes; M Kloor; F Willeke; P Melsheimer; M von Knebel Doeberitz; R Ridder
Journal:  Eur J Cancer       Date:  1999-05       Impact factor: 9.162

Review 6.  The connection between splicing and cancer.

Authors:  Anabella Srebrow; Alberto R Kornblihtt
Journal:  J Cell Sci       Date:  2006-07-01       Impact factor: 5.285

Review 7.  Aberrant and alternative splicing in cancer.

Authors:  Julian P Venables
Journal:  Cancer Res       Date:  2004-11-01       Impact factor: 12.701

Review 8.  Alterations of pre-mRNA splicing in cancer.

Authors:  Zane Kalnina; Pawel Zayakin; Karīna Silina; Aija Linē
Journal:  Genes Chromosomes Cancer       Date:  2005-04       Impact factor: 5.006

9.  Characterization of RNase R-digested cellular RNA source that consists of lariat and circular RNAs from pre-mRNA splicing.

Authors:  Hitoshi Suzuki; Yuhong Zuo; Jinhua Wang; Michael Q Zhang; Arun Malhotra; Akila Mayeda
Journal:  Nucleic Acids Res       Date:  2006-05-08       Impact factor: 16.971

10.  Aberrant 3' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization.

Authors:  Igor Vorechovský
Journal:  Nucleic Acids Res       Date:  2006-09-08       Impact factor: 16.971

View more
  18 in total

1.  Research progress on circularRNAs in pancreatic cancer: emerging but promising.

Authors:  Yi-Zhi Wang; Yang An; Bing-Qi Li; Jun Lu; Jun-Chao Guo
Journal:  Cancer Biol Ther       Date:  2019-05-28       Impact factor: 4.742

2.  Research progress of circular RNAs in lung cancer.

Authors:  Yi Ma; Xin Zhang; Yi-Zhi Wang; Hao Tian; Shun Xu
Journal:  Cancer Biol Ther       Date:  2018-11-07       Impact factor: 4.742

Review 3.  Normal and altered pre-mRNA processing in the DMD gene.

Authors:  Sylvie Tuffery-Giraud; Julie Miro; Michel Koenig; Mireille Claustres
Journal:  Hum Genet       Date:  2017-06-09       Impact factor: 4.132

Review 4.  Circular RNAs-one of the enigmas of the brain.

Authors:  Ivan B Filippenkov; Eugene O Kalinichenko; Svetlana A Limborska; Lyudmila V Dergunova
Journal:  Neurogenetics       Date:  2016-07-23       Impact factor: 2.660

5.  Novel transcription factor variants through RNA-sequencing: the importance of being "alternative".

Authors:  Margherita Scarpato; Antonio Federico; Alfredo Ciccodicola; Valerio Costa
Journal:  Int J Mol Sci       Date:  2015-01-13       Impact factor: 5.923

Review 6.  A view of pre-mRNA splicing from RNase R resistant RNAs.

Authors:  Hitoshi Suzuki; Toshifumi Tsukahara
Journal:  Int J Mol Sci       Date:  2014-05-26       Impact factor: 5.923

Review 7.  Recent Advances in the Potential Use of Circular RNA for the Diagnosis and Treatment of Pancreatic Cancer.

Authors:  Xinzhu Sun; Dongyan Liu; Nan Ge; Jintao Guo; Sheng Wang; Xiang Liu; Guoxin Wang; Siyu Sun
Journal:  Cancer Manag Res       Date:  2021-05-28       Impact factor: 3.989

8.  Rbm20 regulates titin alternative splicing as a splicing repressor.

Authors:  Shijun Li; Wei Guo; Colin N Dewey; Marion L Greaser
Journal:  Nucleic Acids Res       Date:  2013-01-09       Impact factor: 16.971

9.  Identification of TSG101 functional domains and p21 loci required for TSG101-mediated p21 gene regulation.

Authors:  Yu-Shiuan Lin; Yin-Ju Chen; Stanley N Cohen; Tzu-Hao Cheng
Journal:  PLoS One       Date:  2013-11-11       Impact factor: 3.240

10.  TSGΔ154-1054 splice variant increases TSG101 oncogenicity by inhibiting its E3-ligase-mediated proteasomal degradation.

Authors:  Huey-Huey Chua; Chiun-Sheng Huang; Pei-Lun Weng; Te-Huei Yeh
Journal:  Oncotarget       Date:  2016-02-16
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.