| Literature DB >> 17442103 |
Sabina Wodniok1, Andreas Simon, Gernot Glöckner, Burkhard Becker.
Abstract
BACKGROUND: The Viridiplantae (green algae and land plants) consist of two monophyletic lineages: the Chlorophyta and the Streptophyta. Most green algae belong to the Chlorophyta, while the Streptophyta include all land plants and a small group of freshwater algae known as Charophyceae. Eukaryotes attach a poly-A tail to the 3' ends of most nuclear-encoded mRNAs. In embryophytes, animals and fungi, the signal for polyadenylation contains an A-rich sequence (often AAUAAA or related sequence) 13 to 30 nucleotides upstream from the cleavage site, which is commonly referred to as the near upstream element (NUE). However, it has been reported that the pentanucleotide UGUAA is used as polyadenylation signal for some genes in volvocalean algae.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17442103 PMCID: PMC1868727 DOI: 10.1186/1471-2148-7-65
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Sources and general characteristics of the organismal datasets used
| Genbank | 1002 | 28 | A 36.3% | |
| C 9.5% | ||||
| G 19.6% | ||||
| U 37.6% | ||||
| TIGR | 31608 | 10508 | A 21.2% | |
| G 30,9% | ||||
| C 24,8% | ||||
| U 23,1% | ||||
| Genbank | 1229 | 359 | A 21.1% | |
| C 26,6% | ||||
| G 29,0% | ||||
| U 23,3% | ||||
| Genbank | 5906 | 292 | A 20.3% | |
| C 31.2% | ||||
| G 28.5% | ||||
| U 20.0% | ||||
| this study | 5034 | 1260 | A 24.2% | |
| C 21.5% | ||||
| G 23.3% | ||||
| U 27.0% | ||||
| Genbank | 6016 | 265 | A 20.1% | |
| C 23.8% | ||||
| G 31.2% | ||||
| U 24.9% | ||||
| [26], and this study | 1032 | 110 | A 25.7% | |
| C 26.0% | ||||
| G 26.8% | ||||
| U 21.5% | ||||
| Genbank | 1888 | 54 | A 22.3% | |
| C 22.3% | ||||
| G 28.6% | ||||
| U 26.8% | ||||
| Genbank | 1201 | 136 | A 3.6% | |
| C 27.6% | ||||
| G 33.3% | ||||
| U 35.5% | ||||
| this study | 5094 | 142 | A 27.2% | |
| C 18,9% | ||||
| G 22,6% | ||||
| U 31,3% | ||||
| this study | 4651 | 473 | A 27.8% | |
| C 20.5% | ||||
| G 25.6% | ||||
| U 26.1% | ||||
| [18] | 10395 | 1327 | A 26.9% | |
| C 20.6% | ||||
| G 23.1% | ||||
| U 29.4% |
1) No of non-redundant sequences with a poly(A)-tail analyzed, 2) within 200 nt upstream from the CS.
Figure 1Single-nucleotide profiles of the 3'UTR in various green algae. Single-nucleotide frequencies within the 200 nt upstream from the CS are shown. For clarity, smoothed curves using the weighted average of 5 neighbours method are shown. The original point graphs are depicted in supplementary Fig. 1.
Frequencies of penta- and hexanucleotide words within 50 nt upstream from the CS
| U 57.1% | UGUAA 71.4% | 2.52 | AUGUAA 42.9% | 1.34 | |
| C 14.3% | UUUGU 50.0% | 1.51 | UGUAAU 32.9% | 0.79 | |
| G 28.6% | |||||
| G 42.4% | UGUAA 49.8% | 3.27 | UGUAAC 20.0% | 3.33 | |
| C 31.5% | GUAAC 23.1% | 2.03 | CUGUAA 15.2% | 3.00 | |
| U 26.1% | |||||
| C 51.5% | UGUAA 47.3% | 3.29 | UGUAAG 16.4% | 3.06 | |
| T 24.6% | UUGUA 20.6% | 1.95 | UGUAAC 14.5% | 3.00 | |
| G 23.9% | |||||
| C 61.6% | UGUAA 56.8% | 4.12 | UGUAAC 32.5% | 4.27 | |
| G 19.2% | GUAAC 40.0% | 2.81 | CUGUAA 23.6% | 3.82 | |
| U 19.2% | |||||
| C 45.1% | UUUUG 17.0% | 1.22 | AAAAAA 6.0% | 1.94 | |
| G 28.4% | AUUUU 17.5% | 1.21 | UUUUUG 8.1% | 1.75 | |
| U 26.5% | |||||
| C 38.6% | GUAAC 68.7% | 4.12 | UGUAAC 52.1% | 4.89 | |
| G 34.1% | UGUAA 66.4% | 3.97 | GUAACA 36.6% | 4.44 | |
| U 27.3% | |||||
| C 62.7% | UGUAA 61.8% | 3.76 | UGUAAA 27.3% | 3.65 | |
| U 19,1% | UUGUA 19.1% | 2.13 | UUGUAA 16.3% | 3.15 | |
| G 18.2% | |||||
| C 61.1% | UGUAA 66.7% | 3.76 | UGUAAC 24.1% | 3.65 | |
| U 22.2% | GUAAC 31.9% | 2.42 | UUGUAA 24.1% | 3.15 | |
| G 16.7% | |||||
| C 41.9% | UGUAA 22.8% | 3.41 | AUUGUA 12.5% | 2.96 | |
| U 33.8% | AAUGU 17.6% | 3.36 | UAUAAU 8.8% | 2.66 | |
| G 24.3% | |||||
| C 38.0% | UUUUG 31.7% | 1.43 | UUUUUU 21.1% | 1.80 | |
| U 34.5% | UGUUU 19.6% | 1.33 | UGUUUU 17.6% | 1.91 | |
| G 27.5% | |||||
| C 41.3% | CCCCC 10.1% | 1.90 | CCCCCC 5.9% | 2.93 | |
| U 30.1% | CCCUU 13.3% | 1.07 | CCCCCU 5.5% | 2.30 | |
| G 28.6% | |||||
| C 54.8% | AAUAA 29.1% | 1.68 | AAUAAA 19.1% | 1.95 | |
| U 26.2% | AUAAA 28.0% | 1.53 | AAUUAA 15.6% | 1.62 | |
| G 19.0% | |||||
Figure 2Distribution of (putative) polyadenylation signals within 50 nt upstream from the CS in different chlorophyte and streptophyte algae. Distribution of the (putative) polyadenylation signals UGUAA, AAUAAA and U-rich within 50 nt upstream from the CS in different chlorophyte and streptophyte algae.
Number of expressed genes containing multiple putative poly(A) signals
| Organism | No of expressed genes containing at least one putative poly(A) signal | No of expressed genes containing two putative poly(A) signals | No of expressed genes containing two putative poly(A) signals for which different CS were found | Distance between poly(A) signal und CS for expressed genes with mRNA isoforms |
| 20 | 6 | 1 | 26/27 | |
| 5232 | 160 | 54 | 15–23 | |
| 124 | 2 | 2 | 18/17 19/23 | |
| 166 | 4 | 2 | 21/18 17/18 | |
| 182 | 5 | 0 | - | |
| 68 | 3 | 0 | - | |
| 36 | 1 | 0 | - | |
| 386 | 40 | 5 | 9–45 |
Figure 3Drawing showing the phylogenetic relationships for the organisms investigated. The structure of the putative organismal and the ancestral poly(A) signal is indicated. Loss of the AAUAAA-like signal is indicated with a red arrow and the gain of the UGUAA signal is indicated with a purple arrow.