| Literature DB >> 32484558 |
Devlin C Moyer1, Graham E Larue2, Courtney E Hershberger1, Scott W Roy3, Richard A Padgett1.
Abstract
During nuclear maturation of most eukaryotic pre-messenger RNAs and long non-coding RNAs, introns are removed through the process of RNA splicing. Different classes of introns are excised by the U2-type or the U12-type spliceosomes, large complexes of small nuclear ribonucleoprotein particles and associated proteins. We created intronIC, a program for assigning intron class to all introns in a given genome, and used it on 24 eukaryotic genomes to create the Intron Annotation and Orthology Database (IAOD). We then used the data in the IAOD to revisit several hypotheses concerning the evolution of the two classes of spliceosomal introns, finding support for the class conversion model explaining the low abundance of U12-type introns in modern genomes.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32484558 PMCID: PMC7367187 DOI: 10.1093/nar/gkaa464
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Overview of the major steps of the intronIC algorithm. (B) Scatter plot of all classified introns in the human genome; gray: U2-type introns, red: U12-type introns with probability scores ≤84%; yellow: U12-type introns with probability scores from 84–90%, green: U12-type introns with probability scores >90%, our chosen scoring threshold. (C) Sequence logos of the 5′SS and BPS PWMs for GT-AG/AT-AC U2- and U12-type introns. (D) Balanced accuracy performance of the classifier with different values of hyperparameter C on test sets during the first round of the cross-validation process. (E) Histogram (with logarithmic scale y-axis) of probability scores for the human data shown in part B.
Figure 2.Phylogenetic distribution of U12-type introns in all species annotated by the Intron Annotation and Orthology Database (IAOD), U12DB (18), SpliceRack (35) and ERISdb (24). Blank entries in the table represent organisms not represented in the respective database. Counts of U12-type introns in the IAOD only represent introns flanked by coding exons. The NCBI Taxonomy Browser (70) and Integrative Tree of Life (71) were used to create the phylogenetic tree.
Figure 3.Phase distribution of introns within each class in all genomes annotated in the IAOD. Organisms are grouped by phylogeny. The bias against phase 0 U12-type introns is statistically significant in all organisms but G. max, O. sativa, X. tropicalis and Z. mays (chi-squared; P < 0.10). The bias toward phase 0 U2-type introns is statistically significant in all organisms but A. mellifera, D. melanogaster, S. cerevisiae and S. pombe (chi-squared; P < 0.10).
Figure 4.Percentages of introns with the specified nucleotide immediately upstream of the 5′ splice site in each phase of both classes of introns. Organisms are grouped by phylogeny.
Figure 5.Nucleotide biases at the -1 position relative to the 5′ splice site for all organisms (excluding those lacking U12-type introns) annotated in the IAOD, grouped by terminal dinucleotides and intron class.
Groups of orthologous introns of different classes. Introns with nothing in the gene column came from transcripts with no gene name annotated by Ensembl. Vertical bars in the sequence column denote splice sites, and the middle sequence flanked by ellipses is the putative branch point region as annotated by intronIC
| Organism | Gene | Intron Class | Phase | Sequence |
|---|---|---|---|---|
|
| U12-type | 0 |
| |
|
| U2-type | 0 |
| |
|
| U2-type | 0 |
| |
|
| U12-type | 0 |
| |
|
| U2-type | 1 |
| |
|
| U12-type | 1 |
| |
|
| U2-type | 1 |
| |
|
|
| U12-type | 1 |
|
|
|
| U12-type | 1 |
|
|
|
| U12-type | 1 |
|
|
|
| U2-type | 1 |
|
|
|
| U12-type | 1 |
|
|
|
| U12-type | 1 |
|
|
| U2-type | 2 |
| |
|
|
| U12-type | 2 |
|
|
|
| U12-type | 2 |
|
|
|
| U12-type | 2 |
|
|
|
| U12-type | 2 |
|
|
|
| U2-type | 2 |
|
Percentages of introns with various terminal dinucleotides in each class in all annotated genomes. Organisms are sorted in the phylogenetic order shown in Figure 2
| Intron class | U2-type | U12-type | ||||||
|---|---|---|---|---|---|---|---|---|
| Terminal dinucleotides | GT-AG | GC-AG | AT-AC | Other | GT-AG | GC-AG | AT-AC | Other |
|
| 912 | 2.7 | 0 | 5.7 | 0 | 0 | 0 | 0 |
|
| 100 | 0.12 | 0 | 0.01 | 0 | 0 | 0 | 0 |
|
| 98 | 1.6 | 0 | 0 | 76 | 0.01 | 23 | 0 |
|
| 99 | 1.0 | 0.01 | 0.08 | 73 | 0 | 26 | 1.1 |
|
| 99 | 0.50 | 0.05 | 0.41 | 86 | 0 | 13 | 1.2 |
|
| 97 | 0.30 | 0.01 | 2.5 | 74 | 0 | 22 | 3.8 |
|
| 99 | 0.64 | 0 | 0.18 | 0 | 0 | 0 | 0 |
|
| 99 | 0.65 | 0.01 | 0.03 | 91 | 0.71 | 8.6 | 0 |
|
| 99 | 0.75 | 0.01 | 0.07 | 47 | 5.3 | 47 | 0 |
|
| 100 | 0.26 | 0 | 0.09 | 88 | 4.2 | 8.3 | 0 |
|
| 91 | 0.70 | 0.04 | 8.7 | 65 | 0 | 23 | 12 |
|
| 97 | 2.2 | 0.01 | 0.56 | 77 | 0.97 | 18 | 3.4 |
|
| 85 | 0.90 | 0.07 | 14 | 78 | 0.22 | 8 | 14 |
|
| 97 | 1.2 | 0.02 | 1.4 | 75 | 0 | 22 | 2.7 |
|
| 86 | 1.4 | 0.03 | 13.0 | 77 | 0.80 | 12 | 11 |
|
| 91 | 5.7 | 0 | 3.4 | 79 | 4.3 | 12 | 5.0 |
|
| 98 | 0.50 | 0.01 | 1.1 | 82 | 0.20 | 14 | 4.1 |
|
| 94 | 0.89 | 0.03 | 5.1 | 69 | 0.81 | 21 | 10 |
|
| 95 | 0.86 | 0.02 | 3.7 | 69 | 0.35 | 20 | 11 |
|
| 97 | 0.78 | 0.02 | 2.1 | 69 | 0.49 | 23 | 6.5 |
|
| 99 | 0.78 | 0.01 | 0.16 | 69 | 0.47 | 25 | 5.4 |
|
| 97 | 3.4 | 0.01 | 0.02 | 78 | 2.8 | 17 | 2 |
|
| 97 | 2.8 | 0.01 | 0.14 | 72 | 1.7 | 21 | 4.7 |
|
| 99 | 0.77 | 0.01 | 0.15 | 68 | 0.61 | 26 | 5.2 |