| Literature DB >> 24498620 |
Roberta Spinelli1, Alessandra Pirola2, Sara Redaelli2, Nitesh Sharma2, Hima Raman2, Simona Valletta2, Vera Magistroni2, Rocco Piazza2, Carlo Gambacorti-Passerini3.
Abstract
Point mutations in intronic regions near mRNA splice junctions can affect the splicing process. To identify novel splicing variants from exome sequencing data, we developed a bioinformatics splice-site prediction procedure to analyze next-generation sequencing (NGS) data (SpliceFinder). SpliceFinder integrates two functional annotation tools for NGS, ANNOVAR and MutationTaster and two canonical splice site prediction programs for single mutation analysis, SSPNN and NetGene2. By SpliceFinder, we identified somatic mutations affecting RNA splicing in a colon cancer sample, in eight atypical chronic myeloid leukemia (aCML), and eight CML patients. A novel homozygous splicing mutation was found in APC (NM_000038.4:c.1312+5G>A) and six heterozygous in GNAQ (NM_002072.2:c.735+1C>T), ABCC 3 (NM_003786.3:c.1783-1G>A), KLHDC 1 (NM_172193.1:c.568-2A>G), HOOK 1 (NM_015888.4:c.1662-1G>A), SMAD 9 (NM_001127217.2:c.1004-1C>T), and DNAH 9 (NM_001372.3:c.10242+5G>A). Integrating whole-exome and RNA sequencing in aCML and CML, we assessed the phenotypic effect of mutations on mRNA splicing for GNAQ, ABCC 3, HOOK 1. In ABCC 3 and HOOK 1, RNA-Seq showed the presence of aberrant transcripts with activation of a cryptic splice site or intron retention, validated by the reverse transcription-polymerase chain reaction (RT-PCR) in the case of HOOK 1. In GNAQ, RNA-Seq showed 22% of wild-type transcript and 78% of mRNA skipping exon 5, resulting in a 4-6 frameshift fusion confirmed by RT-PCR. The pipeline can be useful to identify intronic variants affecting RNA sequence by complementing conventional exome analysis.Entities:
Keywords: ABCC3; CML; DNAH9; GNAQ; HOOK1; KLHDC1; RNA-Seq; SMAD9; aCML; somatic mutations; splicing mutation; whole-exome sequencing
Year: 2013 PMID: 24498620 PMCID: PMC3865592 DOI: 10.1002/mgg3.23
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
Splicing prediction summary for CML and aCML samples
| Gene | Patient ID | Locus | Mutation | Absolute coverage T (N) | Mutation frequency T (N) | Mutation fraction T(N) (%) | Splicing prediction |
|---|---|---|---|---|---|---|---|
| GNAQ | Ph+001 | 9,79599197,79599198,1,C/T | c.735+1C>T | 23 (65) | 8 (0) | 35% (0%) | Donor lost |
| ABCC3 | Ph+004 | 17,46100788,46100789,1,G/A | c.1783-1G>A | 59 (36) | 30 (0) | 51% (1%) | Acceptor lost |
| KLHDC1 | Ph+005 | 14,49265391,49265392,1,A/G | c.568-2A>G | 93 (14) | 48 (1) | 52% (7%) | Acceptor lost |
| SMAD9 | Ph-005 | 13,36325812,36325813,1,C/T | c.1004-1C>T | 27 (16) | 14 (0) | 52% (0%) | Acceptor lost |
| HOOK1 | Ph-006 | 1,60103421,60103422,1,G/A | c.1662-1G>A | 186 (153) | 87 (0) | 47% (0%) | Acceptor lost |
| DNAH9 | Ph-007 | 17,11715832,11715833,1,G/A | c.10242+5G>A | 37 (43) | 19 (0) | 51% (0%) | Donor lost |
Description of six splicing mutations according to NCBI 36.1 nomenclature. T (N), tumor and matched normal sample.
Figure 1GNAQ (NM_002072.2:c.735+1C>T) splicing mutation near the 5’ donor splice site at position +1 in the intron between exons 5 and 6 in UCSC panel and somatic mutation frequency of 35% in IGV visualization. Below, the Sanger validation.
Figure 2(a) RNA-Seq read coverage of GNAQ (NM_002072) in Ph+001. (b) RNA-seq showed 28 junction reads between exon 4 and exon 6 resulting in exon 4 to exon 6 frameshift fusion (78% mutant sequence), five junction reads between exons 5 and 6, and three junction reads between exons 4 and exons 5 (22% wild-type sequence).
Point mutations within splice sites and their effect on mRNA splicing scored by MutationTaster, SSPNN, and NetGene2 (NCBI36.1 nomenclature)
| Gene | Patient ID | Mutation | MutationTaster | SSPNN w.type/mutant | NetGene2 w.type/mutant | Exon length (bp) | Reads counts (nr) | RNA-seq expression level (FPKM) | Comment |
|---|---|---|---|---|---|---|---|---|---|
| GNAQ | Ph+001 | c.735+1C>T | 1 | 1.00/– | 0.997/– | 2160 | 2843 | 32.938 | 22% wt RNA, 78% skipping ex5 |
| ABCC3 | Ph+004 | c.1783-1G>A | 1 | 0.91/– | 0.390/– | 5155 | 859 | 6.109 | 58% wt RNA, 37% intron retention, 5% activation cryptic site, out of frame |
| KLHDC1 | Ph+005 | c.568-2A>G | 1 | 0.93/– | 0.946/– | 2644 | 72 | 0.939 | Low expression |
| SMAD9 | Ph-005 | c.1004-1C>T | 1 | 0.98/– | 0.988/– | 5558 | 42 | 0.379 | Low expression |
| HOOK1 | Ph-006 | c.1662-1G>A | 1 | 0.94/– | 0.877/– | 5861 | 344 | 3.767 | 58% wt RNA, 42% intron retention, |
| DNAH9 | Ph-007 | c.10242+5G>A | 0.999 | 0.95/– | 0.949/– | 14087 | 0 | 0 | Not expression |
Transcript quantification by RNA-seq analysis. –, the constitutive splice site is not recognized in mutant sequence.
Data confirmed by RT-PCR.
Figure 3(a) PCR product of GNAQ (NM_002072) (from exon 4 to exon 6) from cDNA of tumor and matched remission Ph+001 sample showed the presence of different bands: the wild type one (417 bp) and the shorter aberrant one (287 bp) present only in the tumor sample. (b) Sanger sequencing of the aberrant GNAQ product showed complete loss of exon 5.
Figure 4(a) RNA-seq read coverage of HOOK1 (NM_015888) in Ph-006. RNA-seq showed seven junction reads between exon 17 and exon 18 (58% wild-type sequence) and five reads mapping on the acceptor site (42% intron retention) and carrying the splicing mutation. (b) Sanger sequencing of the aberrant HOOK1 product showed the presence of the mutated base (adenine) and the absence of the wild-type guanine in the retained intron sequence meaning that the intron retention is exclusively caused by the mutation.