| Literature DB >> 30813258 |
Zhixin Zhao1,2, Xiaohui Wu3, Guoli Ji4, Chun Liang5, Qingshun Quinn Li6,7,8.
Abstract
Pre-mRNA cleavage and polyadenylation is an essential step for almost all mRNA in eukaryotes. The cis-elements around the poly(A) sites, however, are very diverse among different organisms. We characterized the poly(A) signals of seven different species, and compared them with that of four well-studied organisms. We found that ciliates do not show any dominant poly(A) signal; a triplet (UAA) and tetramers (UAAA and GUAA) are dominant in diatoms and red alga, respectively; and green alga Ostreococcus uses UGUAA as its poly(A) signal. Spikemoss and moss use conserved AAUAAA signals that are similar to other land plants. Our analysis suggests that the first two bases (NN in NNUAAA) are likely degenerated whereas UAAA appears to be the core motif. Combined with other published results, it is suggested that the highly conserved poly(A) signal AAUAAA may be derived from UAA with an intermediate, putative UAAA, following a pathway of UAA→UAAA→AAUAAA.Entities:
Keywords: RNA processing; bioinformatic analysis; mRNA 3′-end formation; polyadenylation signal
Mesh:
Substances:
Year: 2019 PMID: 30813258 PMCID: PMC6413133 DOI: 10.3390/ijms20040958
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The single nucleotide profiles around poly(A) sites for the seven species. The −1 position is poly(A) site, “−” is designated as the upstream sequences (300 nt), and ‘‘+’’ represents the downstream sequences (100 nt). (A) diatom Thalassiosira pseudonana, (B) diatom Phaeodactylum tricornutum, (C) ciliate Tetrahymena thermophila, (D) green alga Ostreococcus lucimarinus, (E) red alga Cyanidioschyzon merolae, (F) spikemoss Selaginella moellendorffii, (G) moss Physcomitrella patens.
The conserved poly(A) signals in the near upstream element (NUE) region of the seven species.
| Species Name (Common Name) | Conserved Signal | Frequency (%) | Z-Score |
|---|---|---|---|
| UAA | 86.51 | 19.12 1 | |
| UAA | 78.21 | 6.94 1 | |
| - | - | - | |
| UGUAA | 30.4 | 30.21 1 | |
| UAAA | 86.45 | 15.33 1 | |
| GUAA | 43.23 | 8.78 1 | |
| AAUAAA | 7.83 | 7.18 3 | |
| AAUAAA | 7.25 | 15.46 3 |
The superscript 1 in Z-score coloum means order-1 Markov model, the superscript 3 denotes order-3 Markov model.
Figure 2The phylogenetic relations among the 11 species investigated in our study. The common names and scientific names are listed in parentheses close to the relevant clade names in the phylogenetic tree. The common names are all underlined. The phylogenetic tree is constructed according to Tree of Life Web Project (http://tolweb.org/tree/phylogeny.html).
The frequencies of UGUAA and AAUAAA and their single nucleotide variants.
| Species Name (Common Name) | Canonical Signal (Frequency) | Variant (Frequency) | Overall Frequency | |
|---|---|---|---|---|
|
| UGUAA (50.38%) | UGCAA (6.42%) | 72.86% | |
| UGUAA (30.40%) | UUUAA (10.47%) | 59.64% | ||
|
| AAUAAA (7.25%) | AGUAAA (2.73%) | 11.93% | |
| AAUAAA (7.83%) | UAUAAA (3.31%) | 14.41% | ||
|
| AAUAAA (8.59%) | UAUAAA (3.44%) | 13.71% | |
| AAUAAA (64.92%) | AUUAAA (16.68%) | 85.19% |
Figure 3The combined logos of mononucleotide variants from the UGUAA group (top) and two AAUAAA groups (bottom).
Figure 4The overall frequencies of UGUAA and AAUAAA in a 2-D coordinate in the 11 species. Abbreviations: chlamy, green alga C. reinhardtii; Ostreococcus, green alga O. lucimarinus; diatom_t, diatom T. peudonana; diatom_p, diatom P. tricornutum.