| Literature DB >> 27683109 |
Taina T Nieminen1, Walter Pavicic1,2, Noora Porkka1, Matti Kankainen3, Heikki J Järvinen4, Anna Lepistö5, Päivi Peltomäki1.
Abstract
Allele-specific expression (ASE) of the Adenomatous Polyposis Coli (APC) gene occurs in up to one-third of families with adenomatous polyposis (FAP) that have screened mutation-negative by conventional techniques. To advance our understanding of the genomic basis of this phenomenon, 54 APC mutation-negative families (21 with classical FAP and 33 with attenuated FAP, AFAP) were investigated. We focused on four families with validated ASE and scrutinized these families by sequencing of the blood transcriptomes (RNA-seq) and genomes (WGS). Three families, two with classical FAP and one with AFAP, revealed deep intronic mutations associated with pseudoexons. In all three families, intronic mutations (c.646-1806T>G in intron 6, c.1408+729A>G in intron 11, and c.1408+731C>T in intron 11) created new splice donor sites resulting in the insertion of intronic sequences (of 127 bp, 83 bp, and 83 bp, respectively) in the APC transcript. The respective intronic mutations were absent in the remaining polyposis families and the general population. Premature stop of translation as the predicted consequence as well as co-segregation with polyposis supported the pathogenicity of the pseudoexons. We conclude that next generation sequencing on RNA and genomic DNA is an effective strategy to reveal and validate pseudoexons that are regularly missed by traditional screening methods and is worth considering in apparent mutation-negative polyposis families.Entities:
Keywords: APC; RNA-seq; allele-specific expression; familial adenomatous polyposis; pseudoexon
Mesh:
Substances:
Year: 2016 PMID: 27683109 PMCID: PMC5342583 DOI: 10.18632/oncotarget.12206
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Pedigrees of ASE families
Pedigrees of adenomatous polyposis families with ASE. Individuals with polyposis and/or colorectal cancer are indicated (see Table 1 for additional clinical details). Plus sign denotes carriers of deep intronic mutations associated with pseudoexons of APC. Index persons are marked with arrows.
Clinical and molecular characteristics of polyposis cases investigated
| Case ID | ASE status | Inheritance pattern | Number of polyps | Age at diagnosis | Extracolonic manifestations | Classification of family | Large rearrangement by MLPA | ||
|---|---|---|---|---|---|---|---|---|---|
| RESEARCH BASED | 42 | ASE | Sporadic | 100-1000 | 40 | No | FAP | No | No |
| 78 | N | Sporadic | 50 | 55 | No | AFAP | No | No | |
| 85-1 | ASE | dominant | 2000 | 38 | Yes | FAP | No | No | |
| 85-2 | ASE | dominant | 100-200 | 16 | No | FAP | No | No | |
| 85-3 | NI | dominant | 2000 | 44 | No | FAP | NA | No | |
| 85-4 | dominant | 100-200 | 12 | No | FAP | NA | No | ||
| 88 | N | sporadic | 100-1000 | 58 | No | FAP | No | No | |
| 92 | N | sporadic | 200 | 51 | No | FAP | No | No | |
| 96 | NI | sporadic | 561 | 48 | No | FAP | No | No | |
| 97 | N | sporadic | > 1000 | 58 | No | FAP | No | No | |
| 98 | dominant | 100-1000 | 30 | No | FAP | No | No | ||
| 100 | NI | sporadic | 30 | 62 | No | AFAP | No | No | |
| 103 | (ASE) | sporadic | > 100 | 51 | No | FAP | No | No | |
| 104 | N | dominant? | 210 | 54 | Yes | FAP | No | No | |
| 111 | NI | sporadic | 30-40 | 36 | No | AFAP | No | No | |
| 123 | NI | sporadic | 2100 | 37 | No | FAP | No | No | |
| 125 | N | sporadic | 300 | 31 | No | FAP | No | No | |
| CLINIC BASED | 134 | sporadic | 200-300 | 55 | No | FAP | No | No | |
| 136 | sporadic | >100 | 67 | Yes | FAP | No | No | ||
| 139 | sporadic | 100 | 71 | No | FAP | No | No | ||
| 145 | recessive | 20-50 | 61 | No | AFAP | No | No | ||
| 148 | sporadic | 150-200 | 50 | No | FAP | No | No | ||
| 158 | sporadic | 50 | 49 | Yes | AFAP | No | No | ||
| 159 | sporadic | 200 | 50 | No | FAP | No | No | ||
| 162 | sporadic | >50 | 52 | No | AFAP | No | No | ||
| 163 | (ASE) | sporadic | 10-20 | 16 | Yes | AFAP | No | No | |
| 165-1 | dominant? | Colon cancer x 2 | 50 | NA | AFAP | No | No | ||
| 165-2 | dominant? | 20-30 | 33 | Yes | AFAP | No | No | ||
| 168 | sporadic | 100 | 56 | Yes | FAP | No | No | ||
| 177 | sporadic | 100-200 | 52 | No | FAP | No | No | ||
| 179 | dominant? | >10 | 23 | No | AFAP | No | No | ||
| 180 | NA | >100 | 38 | NA | FAP | No | No | ||
| 1001 | dominant | 10 | 48 | NA | AFAP | No | No | ||
| 1003 | sporadic | 20-30 | 70 | NA | AFAP | No | No | ||
| 1005 | dominant | 10-20 | 68 | Yes | AFAP | No | No | ||
| 1006 | sporadic | 20 | 60 | No | AFAP | No | No | ||
| 1007 | sporadic | 20 | 30 | No | AFAP | No | No | ||
| 1010 | dominant | 5-10 | 68 | NA | AFAP | No | No | ||
| 1011 | NA | 60-100 | 31 | NA | FAP | No | No | ||
| 1013 | sporadic | >100 | 48 | NA | FAP | No | No | ||
| 1015 | sporadic | 10 | 47 | Yes | AFAP | No | No | ||
| 1017 | sporadic? | 10-20 | 57 | NA | AFAP | No | No | ||
| 1018 | sporadic | 20-30 | 74 | NA | AFAP | No | No | ||
| 1019 | sporadic | 2-3 | 30 | Yes | AFAP | No | No | ||
| 1020 | sporadic | 3 | 35 | Yes | AFAP | No | No | ||
| 1021 | sporadic | 30 | 72 | NA | AFAP | No | No | ||
| 1022 | dominant | 3 | 65 | Yes | AFAP | No | No | ||
| 1023 | sporadic | 40 | 33 | NA | AFAP | No | No | ||
| 1024 | sporadic | 20 | 72 | NA | AFAP | No | No | ||
| 1025 | sporadic | 20-30 | 67 | NA | AFAP | No | No | ||
| 1026 | sporadic | 10-20 | 51 | NA | AFAP | No | No | ||
| 1029 | sporadic | 20-30 | 56 | No | AFAP | No | No | ||
| 1030 | sporadic | >10 | 59 | No | AFAP | No | No | ||
| 1032 | NA | 8 | 63 | Yes | AFAP | No | No | ||
| 1034 | NA | >10 | 62 | No | AFAP | No | No | ||
| 1035 | sporadic | 20-30 | 71 | No | AFAP | No | No | ||
| 1036 | dominant | >10 | 61 | No | AFAP | No | No | ||
| 1037 | sporadic | ~10 | 52 | No | AFAP | No | No |
Identification number of family, followed by identification number of individual if several family members were studied.
ASE, shows allele-specific expression of APC; (ASE), putative ASE (see Materials and Methods); N, no ASE; NI, not informative (homozygous); blank, no RNA available
Polyposis or colorectal carcinoma, whichever comes first
Desmoids and duodenal adenomas in particular
Based on the number of intestinal adenomas with 100 as the cut-off
P043-C1 assay from MRC-Holland
ME001-C1 assay from MRC-Holland
NA, information not available
Figure 2RNA-seq (42, 85-2, and 163)
Sashimi plots to visualize splice junctions. IGV display of RNA-seq data is provided for an affected representative of each family and a healthy control individual for reference for each region. Sequence alignments are based on TopHat. The region between APC exons 6 and 7 (GRCh37/Hg19) is shown for FAP42 (Figure 2A) and that between exons 11 and 12 for FAP85 (individual 85-2) and AFAP163 (Figure 2B). The locations of pseudoexons are indicated by horizontal bars. A 54-bp in-frame insertion present in the normal reference sample, too, and not associated with any genomic change is denoted by a dashed bar (Figure 2B). The same insertion was discovered in an earlier investigation [12]. Numbers on the plots indicate APC exon coverages expressed as junction depth. Splice events corresponding to pseudoexons are boxed and those associated with the 54-bp insertion are underlined; the remaining ones represent canonical splicing.
Figure 3RT-PCR (42, 85-1, 163, 103)
RT-PCR analysis of samples from ASE families. RT-PCR products separated by gel electrophoresis are shown. Arrows denote fragments with intronic insertions (pseudoexons). Fragment 2 (upper panel) encompasses a 615-bp cDNA segment from exon 4 to exon 9 [6] and shows a heterozygous 127-bp insertion in family 42. The wild-type size of the exon 11 - exon 13 fragment (lower panel) is 246 bp (Supplementary Table S1). An identical 83-bp insertion in families 85 (case 85-1) and 163 is evident. The RT-PCR products from the index persons and healthy controls were cloned and sequenced to verify their DNA sequences. In the exon 11 - exon 13 fragment, a 54-bp in-frame insertion (see legend for Figure 2) accompanied the pseudoexon and wild-type sequences in a proportion (up to one-third) of all clones and likely contributed to the multiplicity of fragments seen after gel electrophoresis.
Figure 4WGS (42 and 85-2) + Sanger seq
Deep intronic mutations in APC. Upper panels provide IGV display of WGS data for intron 6 in FAP42 (Figure 4A) and intron 11 in FAP85, individual 85-2 (Figure 4B). Lower panels show Sanger sequence tracings of the mutations. In Figure 4B, the Sanger sequencing result of AFAP163 is also given (AFAP163 was not included in WGS analysis).
Figure 5Schematic diagrams of pseudoexons
Schematic diagrams of APC pseudoexons identified. The canonical splice sites at the exon/intron borders, pseudoexons (underlined), and the responsible deep intronic mutations (in bold) are highlighted.
Summary of the variants
| Family | Location in | Insertion length (bp) | Genomic variant | RNA alteration | Predicted protein alteration |
|---|---|---|---|---|---|
| FAP42 | intron 6 (5:112126337/5:112790640) | 127 | c.646-1806T>G | r.645_646ins646-1933_646-1807 | p.Arg216* |
| FAP85 | intron 11 (5:112158419/5:112822720) | 83 | c.1408+731C>T | r.1408_1409ins1408+647_1408+729 | p.Gly471Serfs*55 |
| AFAP163 | intron 11 (5:112158417/5:112822722) | 83 | c.1408+729A>G | r.1408_1409ins1408+647_1408+729 | p.Gly471Serfs*55 |
Note: An updated APC nomenclature based on 16 exons (NM_012583R138.4, ENST012583R10257430) was used for exon annotation.