| Literature DB >> 12716459 |
Debashish Bhattacharya1, Dawn Simon, Jian Huang, Jamie J Cannone, Robin R Gutell.
Abstract
BACKGROUND: We have studied spliceosomal introns in the ribosomal (r)RNA of fungi to discover the forces that guide their insertion and fixation.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12716459 PMCID: PMC156610 DOI: 10.1186/1471-2148-3-7
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Logo analysis of 50 nt upstream and downstream of insertion sites of 43 different spliceosomal rRNA introns. The information content of the 2 Gs of the intron proto-splice site is shown as is a line at p = 0.05 (95% quantile) that is based on simulations using random sequence data. This exon region contains a total of 6.91 bits of information.
Figure 2The distribution of SSU and LSU rRNA spliceosomal introns relative to the G-frequency in these genes. The raw G-frequencies are shown in the green circles, the smoothed loess curves for 50 nt and 100 nt smoothing windows are shown with the blue lines, and the positions of introns are shown with the vertical red lines.
Chi-Square Test of Association of Spliceosomal and Group I Introns with Conserved rRNA Sites
| sites | 178 | 175 | 116 | 1073 | 1542 | - |
| group I | 11 [4.85] | 5 [4.77] | 5 [3.16] | 21 [29.23] | 42 | 0.0106* |
| spliceosomal | 0 [3.00] | 3 [2.95] | 8 [1.96] | 15 [18.09] | 26 | <0.0000* |
| sites | 150 | 203 | 168 | 2383 | 2904 | - |
| group I | 10 [2.12] | 4 [2.82] | 4 [2.37] | 23 [33.64] | 41 | <0.0000* |
| spliceosomal | 3 [1.29] | 8 [1.75] | 2 [1.45] | 12 [20.51] | 25 | <0.0000* |
| sites | 355 | 156 | 80 | 951 | 1542 | - |
| group I | 17 [9.67] | 3 [4.25] | 1 [2.18] | 21 [25.90] | 42 | 0.0577 |
| spliceosomal | 4 [5.99] | 9 [2.63] | 2 [1.35] | 11 [16.04] | 26 | 0.0003* |
| sites | 595 | 349 | 283 | 1677 | 2904 | - |
| group I | 17 [8.40] | 5 [4.93] | 1 [4.00] | 18 [23.68] | 41 | 0.0059* |
| spliceosomal | 10 [5.12] | 3 [3.00] | 1 [2.44] | 11 [14.44] | 25 | 0.0969 |
Column headings: Introns are positioned relative to SSU and LSU rRNA sites for positions with a nucleotide in more than 95% of the sequences that are 1) 98–100%, 2) 90–97%, 3) 80–89%, and 4) either <80% conserved or positions that are present in <95% of the sequences in genes from; 3Dom2O, the three phylogenetic domains and two organelles; 3O, the three phylogenetic domains. Sites are the number of rRNA positions followed by group I and spliceosomal introns in each conservation class and the number of observed and expected introns (in brackets [under a null model of random insertion]) is shown for each gene. The P-values for each analysis are also shown. Significant probability values are marked with an asterisk.
Figure 3Distribution of Euascomycetes spliceosomal introns on a conservation diagram of fungal SSU rRNA overlaid on a secondary structure model of the Saccharomyces cerevisiae SSU rRNA. Spliceosomal introns are shown in large text with arrows denoting their positions. Positions with nucleotides in more than 95% of the 1042 sequences that were studied are shown as following: upper case, conserved at ≥ 95%, lower case, conserved at 90–94%, filled circle, conserved at 80–89%, and open circle, conserved at < 80%. Other regions are denoted as arcs. The numbers at the arcs show the upper and lower number of nucleotides that are found in these variable regions. The boxed regions are G-rich sequences upstream of intron insertion sites. Boxed filled circles indicate that the most frequent nucleotide at this site was a G in our alignment of 1434 fungal rRNAs that included both intron-containing and intron-less taxa.
Frequencies of Fungal Nucleotides at Sites of Spliceosomal Intron Insertion
| 265 | 336 | 99.8 | 100.0 | 99.8 | G | G | 95.0 | 85.7 | 95.3 | 4 |
| 297 | 369 | 65.3 | 97.8 | 63.9 | U | A | 100.0 | 100.0 | 99.5 | 5 |
| 298 | 370 | 100.0 | 100.0 | 99.5 | A | G | 100.0 | 100.0 | 100.0 | 2 |
| 299 | 371 | 100.0 | 100.0 | 100.0 | G | G | 99.8 | 100.0 | 99.8 | 12 |
| 300 | 372 | 99.8 | 100.0 | 99.8 | G | G | 99.7 | 100.0 | 99.6 | 1 |
| 330 | 402 | 99.1 | 100.0 | 99.1 | C | G | 99.5 | 100.0 | 99.4 | 15 |
| 331 | 403 | 99.5 | 100.0 | 99.4 | G | G | 99.7 | 100.0 | 99.7 | 8 |
| 332 | 404 | 99.7 | 100.0 | 99.7 | G | C | 99.6 | 100.0 | 99.6 | 1 |
| 333 | 405 | 99.6 | 100.0 | 99.6 | C | U | 91.5 | 100.0 | 91.1 | 1 |
| 337 | 409 | 76.7 | 87.0 | 76.3 | C | A | 99.8 | 100.0 | 99.8 | 1 |
| 390 | 461 | 99.3 | 97.5 | 99.4 | G | G | 93.1 | 100.0 | 92.9 | 2 |
| 393 | 464 | 99.6 | 100.0 | 99.6 | A | G | 99.6 | 100.0 | 99.5 | 10 |
| 400 | 471 | 99.6 | 100.0 | 99.6 | A | U | 97.6 | 95.0 | 97.7 | 1 |
| 514 | 561 | 99.5 | 100.0 | 99.4 | G | G | 99.5 | 100.0 | 99.4 | 1 |
| 674 | 885 | 99.7 | 100.0 | 99.7 | G | U | 99.8 | 100.0 | 99.8 | 4 |
| 882 | 1106 | 67.3 | 75.8 | 67.0 | U | G | 80.5 | 84.9 | 80.3 | 1 |
| 883 | 1107 | 80.5 | 84.9 | 80.3 | G | G | 99.4 | 100.0 | 99.4 | 6 |
| 939 | 1164 | 99.2 | 97.1 | 99.3 | G | G | 98.7 | 97.1 | 98.8 | 8 |
| 1057 | 1277 | 99.8 | 100.0 | 99.8 | G | G | 98.8 | 100.0 | 98.7 | 1 |
| 1071 | 1291 | 93.4 | 100.0 | 93.3 | G | G | 99.4 | 100.0 | 99.4 | 1 |
| 1083 | 1303 | 99.5 | 100.0 | 99.5 | U | G | 99.6 | 100.0 | 99.6 | 1 |
| 1226 | 1459 | 99.1 | 100.0 | 99.1 | C | A | 99.8 | 100.0 | 99.8 | 2 |
| 1229 | 1462 | 99.7 | 100.0 | 99.7 | G | C | 99.4 | 100.0 | 99.3 | 8 |
| 1514 | 1777 | 98.6 | 100.0 | 98.5 | G | G | 91.1 | 100.0 | 90.9 | 2 |
| 678 | 967 | 99.9 | 100.0 | 99.9 | G | A | 99.9 | 97.0 | 100.0 | 16 |
| 681 | 970 | 99.7 | 97.0 | 99.9 | G | G | 98.1 | 93.9 | 98.3 | 1 |
| 711 | 1000 | 99.6 | 97.0 | 99.7 | G | A | 98.2 | 81.8 | 99.0 | 3 |
| 775 | 1065 | 99.8 | 100.0 | 99.8 | G | G | 100.0 | 100.0 | 100.0 | 1 |
| 776 | 1066 | 100.0 | 100.0 | 100.0 | G | G | 100.0 | 100.0 | 100.0 | 5 |
| 777 | 1067 | 100.0 | 100.0 | 100.0 | G | G | 100.0 | 100.0 | 100.0 | 1 |
| 780 | 1070 | 100.0 | 100.0 | 100.0 | G | A | 100.0 | 100.0 | 100.0 | 1 |
| 783 | 1073 | 100.0 | 100.0 | 100.0 | A | G | 100.0 | 100.0 | 100.0 | 2 |
| 784 | 1074 | 100.0 | 100.0 | 100.0 | G | A | 100.0 | 100.0 | 100.0 | 3 |
| 786 | 1076 | 99.8 | 100.0 | 99.8 | C | U | 95.8 | 91.2 | 96.1 | 1 |
| 787 | 1077 | 95.8 | 91.2 | 96.1 | U | A | 98.7 | 91.2 | 99.1 | 1 |
| 824 | 1114 | 100.0 | 100.0 | 100.0 | U | C | 100.0 | 100.0 | 100.0 | 1 |
| 830 | 1120 | 99.8 | 100.0 | 99.8 | A | G | 99.8 | 100.0 | 99.8 | 1 |
| 858 | 1151 | 100.0 | 100.0 | 100.0 | G | G | 100.0 | 100.0 | 100.0 | 2 |
| 978 | 1306 | 99.3 | 95.7 | 99.6 | G | G | 100.0 | 100.0 | 100.0 | 3 |
| 1024 | 1351 | 98.6 | 100.0 | 98.5 | A | G | 99.3 | 100.0 | 99.3 | 1 |
| 1054 | 1387 | 97.6 | 100.0 | 97.4 | G | G | 100.0 | 100.0 | 100.0 | 4 |
| 1091 | 1424 | 100.0 | 100.0 | 100.0 | G | U | 99.3 | 100.0 | 99.2 | 1 |
| 1093 | 1426 | 100.0 | 100.0 | 100.0 | G | U | 100.0 | 100.0 | 100.0 | 1 |
| 1098 | 1431 | 100.0 | 100.0 | 100.0 | A | A | 99.3 | 100.0 | 99.2 | 1 |
| 1849 | 2367 | 100.0 | 100.0 | 100.0 | U | G | 100.0 | 100.0 | 100.0 | 1 |
| 1903 | 2404 | 97.3 | 100.0 | 97.1 | G | G | 100.0 | 100.0 | 100.0 | 1 |
| 1929 | 2430 | 97.3 | 100.0 | 97.1 | G | G | 97.3 | 100.0 | 97.1 | 1 |
| 2076 | 2576 | 100.0 | 100.0 | 100.0 | G | A | 100.0 | 100.0 | 100.0 | 1 |
| 2445 | 2972 | 100.0 | 100.0 | 100.0 | G | G | 100.0 | 100.0 | 100.0 | 1 |
Column headings: Intron Position, the sites of spliceosomal intron insertion in the SSU and LSU (below the broken line) rRNA genes. The homologous intron sites in the Escherichia coli (Ec, GenBank #J01695) and Saccharomyces cerevisiae (Sc, GenBank #U53879) genes are shown. The 5' and 3' nucleotides (5'-nt, 3'-nt) flanking the intron insertion sites (Insertion Site), the frequency of these nucleotides in the alignment of all fungal SSU and LSU rRNAs (All, 1434 and 880 sequences, respectively), of fungi containing spliceosomal introns (+ Int, 73 and 40 sequences, respectively), and of fungi lacking spliceosomal introns (- Int, 1361 and 840 sequences, respectively), and the number of taxa containing introns at each site (# Int) are shown.
Figure 4Analysis of rRNA intron distribution. A. The positions of introns mapped on the homologous sites in the primary structure of E. coli SSU and LSU rRNA. Group I and group II (underlined) introns are shown above the lines, whereas spliceosomal and archaeal (underlined) introns are shown below the lines. B. Results of the broken-stick analysis of rRNA intron distribution. The results of the simulations are shown as are the observed standard deviations for all introns or group I and spliceosomal introns individually for both SSU and LSU rRNA genes.