| Literature DB >> 26821878 |
Lan Lin1, Peng Jiang2, Juw Won Park3,4,5, Jinkai Wang6, Zhi-Xiang Lu7, Maggie P Y Lam8, Peipei Ping9,10, Yi Xing11.
Abstract
BACKGROUND: Alu elements are major contributors to lineage-specific new exons in primate and human genomes. Recent studies indicate that some Alu exons have high transcript inclusion levels or tissue-specific splicing profiles, and may play important regulatory roles in modulating mRNA degradation or translational efficiency. However, the contribution of Alu exons to the human proteome remains unclear and controversial. The prevailing view is that exons derived from young repetitive elements, such as Alu elements, are restricted to regulatory functions and have not had adequate evolutionary time to be incorporated into stable, functional proteins.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26821878 PMCID: PMC4731929 DOI: 10.1186/s13059-016-0876-5
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Identification and analysis of putative coding Alu exons in human genes. a The bioinformatics work flow to identify and characterize putative coding Alu exons using RNA-seq and proteomics data. b The Venn diagram illustrating the number of putative coding Alu exons with different types of RNA-seq or proteomic evidence. c The splicing patterns and deduced peptide sequences of putative coding Alu exons with supporting peptide sequences from the PRIDE database. The stop codon of each isoform is indicated by a STOP sign
Putative coding Alu exons with significant splicing activities and supporting peptide evidence in proteomics databases
| Gene symbol | Gene ID | Exon coordinate (hg19) | Gene description | MaxJC ≥10 ψ ≥0.33 | MaxJC ≥10 |Δψ| ≥10 % | In PRIDE | In PeptideAtlas | |
|---|---|---|---|---|---|---|---|---|
| 1 | SUGT1 | ENSG00000165416 | chr13:53235609-53235705 | SGT1, suppressor of G2 allele of SKP1 (S. cerevisiae) | Yes | Yes | Yes | Yes |
| 2 | DNMT1 | ENSG00000130816 | chr19:10290862-10290910 | DNA (cytosine-5-)-methyltransferase 1 | Yes | Yes | Yes | Yes |
| 3 | C21orf7 | ENSG00000156265 | chr21:30463988-30464082 | Chromosome 21 open reading frame 7 | Yes | No | Yes | No |
| 4 | ADARB1 | ENSG00000197381 | chr21:46604388-46604508 | Adenosine deaminase, RNA-specific, B1 | Yes | Yes | Yes | No |
| 5 | ALG8 | ENSG00000159063 | chr11:77813919-77813994 | Asparagine-linked glycosylation 8, alpha-1,3-glucosyltransferase homolog (S. cerevisiae) | Yes | No | Yes | No |
| 6 | NEK4 | ENSG00000114904 | chr3:52783707-52783845 | NIMA (never in mitosis gene a)-related kinase 4 | Yes | No | Yes | No |
| 7 | ZNF415 | ENSG00000170954 | chr19:53618462-53618560 | Zinc finger protein 415 | Yes | Yes | Yes | No |
| 8 | TPRKB | ENSG00000144034 | chr2:73959710-73959827 | TP53RK binding protein | Yes | No | Yes | No |
| 9 | C21orf7 | ENSG00000156265 | chr21:30463821-30463897 | Chromosome 21 open reading frame 7 | Yes | No | Yes | No |
| 10 | C11orf80 | ENSG00000173715 | chr11:66523823-66523976 | Chromosome 11 open reading frame 80 | Yes | No | Yes | No |
| 11 | ZNF573 | ENSG00000189144 | chr19:38249115-38249236 | Zinc finger protein 573 | No | Yes | Yes | No |
| 12 | MCM3 | ENSG00000112118 | chr6:52130032-52130183 | Minichromosome maintenance complex component 3 | No | Yes | Yes | No |
| 13 | SLC3A2 | ENSG00000168003 | chr11:62639048-62639141 | Solute carrier family 3 (activators of dibasic and neutral amino acid transport), member 2 | Yes | No | Yes | No |
| 14 | EIF4E | ENSG00000151247 | chr4:99807604-99807697 | Eukaryotic translation initiation factor 4E | Yes | Yes | Yes | No |
| 15 | KCNRG | ENSG00000198553 | chr13:50592958-50593056 | Potassium channel regulator | Yes | No | Yes | No |
| 16 | FAM124B | ENSG00000124019 | chr2:225265097-225265222 | Family with sequence similarity 124B | Yes | Yes | Yes | No |
| 17 | B3GALNT2 | ENSG00000162885 | chr1:235659495-235659618 | Beta-1,3-N-acetylgalactosaminyltransferase 2 | Yes | No | Yes | No |
| 18 | ZNF195 | ENSG00000005801 | chr11:3382972-3383119 | Zinc finger protein 195 | Yes | Yes | Yes | No |
Fig. 2Ribo-seq data (HeLa cells) provide evidence for the translational activities of putative coding Alu exons. a Comparison of Alu exon inclusion levels in Ribo-seq and RNA-seq data of HeLa cells. Each dot represents an Alu exon. Seventy-six putative coding Alu exons with at least 10 reads mapped to one of the three splice junctions in both Ribo-seq and RNA-seq data are shown in the plot. Exons with PRIDE peptide sequences are indicated in red. b-e The UCSC genome browser view of the Ribo-seq and RNA-seq data of four representative Alu exons. The upstream junction read count (UJC), downstream junction read count (DJC), and skipping junction read count (SJC) are also indicated. The stop codon of each mRNA isoform is indicated by a STOP sign
Fig. 3A protein-coding Alu exon in SUGT1 supported by multiple lines of proteomics and Ribo-seq evidence. a The splicing pattern and deduced peptide sequence of a putative coding Alu exon in SUGT1 and its corresponding peptide evidence from PRIDE and PeptideAtlas. b Tandem mass spectrometry (MS/MS) spectrum of the peptide TSSDPPALDSQSAGITGADAN from PRIDE (experiment ID: 26855, spectrum ID: 7275). c The UCSC genome browser view of the Ribo-seq and RNA-seq data of the SUGT1 Alu exon. d RT-PCR analysis of the SUGT1 Alu exon in four different tissues in human (Hs), chimpanzee (Pt), and rhesus macaque (Rm). Error bars show standard error of the mean from at least three replicate experiments
Fig. 4Minigene splicing reporter analysis of the human-specific splicing change of SUGT1 protein-coding Alu exon (Exon 6). a Schematic diagrams of the pI-11-H3 minigene splicing reporter and the wild-type/mutant minigene constructs of SUGT1 Alu exon. The nucleotide differences between human and chimpanzee are indicated by asterisks on the chimpanzee sequence. b Representative gel image for fluorescently labeled RT-PCR analyses in Hela cells using the SUGT1 Alu exon wild-type and mutant minigene constructs
Fig. 5ADARB1 Alu exon inclusion isoform encodes an active RNA editing enzyme with altered editing activity. a The schematic diagram of the protein domain structure of ADARB1 isoforms and supporting peptide sequences from the PRIDE database. b The change in overall RNA editing levels of 7,618 RNA editing sites in HEK293 cells upon ectopic expression of the exon skipping (Short) or the exon inclusion (Long) ADARB1 isoform as compared to the empty vector (EV) control. Error bars show standard errors calculated based on the 7,618 known RNA editing sites used in this analysis. c Common and isoform-specific differentially edited sites upon ectopic expression of the Alu exon inclusion (Long) or skipping (Short) ADARB1 isoform. Isoform-specific differentially edited sites are further classified (low-confidence, high-confidence) based on the strength of the RNA-seq evidence