| Literature DB >> 19920122 |
Christian Koetschan1, Frank Förster, Alexander Keller, Tina Schleicher, Benjamin Ruderisch, Roland Schwarz, Tobias Müller, Matthias Wolf, Jörg Schultz.
Abstract
The internal transcribed spacer 2 (ITS2) is a widely used phylogenetic marker. In the past, it has mainly been used for species level classifications. Nowadays, a wider applicability becomes apparent. Here, the conserved structure of the RNA molecule plays a vital role. We have developed the ITS2 Database (http://its2.bioapps.biozentrum.uni-wuerzburg.de) which holds information about sequence, structure and taxonomic classification of all ITS2 in GenBank. In the new version, we use Hidden Markov models (HMMs) for the identification and delineation of the ITS2 resulting in a major redesign of the annotation pipeline. This allowed the identification of more than 160,000 correct full length and more than 50,000 partial structures. In the web interface, these can now be searched with a modified BLAST considering both sequence and structure, enabling rapid taxon sampling. Novel sequences can be annotated using the HMM based approach and modelled according to multiple template structures. Sequences can be searched for known and newly identified motifs. Together, the database and the web server build an exhaustive resource for ITS2 based phylogenetic analyses.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19920122 PMCID: PMC2808966 DOI: 10.1093/nar/gkp966
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Flow chart of the new pipeline for the ITS2 annotation.
Taxonomic breakdown of predicted ITS2 structures
| Structure | Partials | All | ||||
|---|---|---|---|---|---|---|
| Count | Percentage | Count | Percentage | Count | Percentage | |
| Alveolata | 1750 | 34.67 | 947 | 18.76 | 5048 | 53.43 |
| Amoebozoa | 19 | 13.01 | 9 | 6.16 | 146 | 19.18 |
| Apusozoa | 0 | 0.00 | 0 | 0.00 | 35 | 0.00 |
| Choanoflagellida | 0 | 0.00 | 0 | 0.00 | 1 | 0.00 |
| Cryptophyta | 25 | 38.46 | 17 | 26.15 | 65 | 64.62 |
| Environmental samples | 26 | 28.26 | 7 | 7.61 | 92 | 35.87 |
| Euglenozoa | 3 | 0.62 | 191 | 39.71 | 481 | 40.33 |
| Fornicata | 0 | 0.00 | 0 | 0.00 | 3 | 0.00 |
| Fungi | 79 251 | 59.14 | 28 124 | 20.99 | 134 005 | 80.13 |
| Fungi/Metazoa incertae sedis | 2 | 2.86 | 0 | 0.00 | 70 | 2.86 |
| Haptophyceae | 6 | 19.35 | 3 | 9.68 | 31 | 29.03 |
| Heterolobosea | 1 | 0.59 | 1 | 0.59 | 170 | 1.18 |
| Metazoa | 4754 | 20.14 | 1357 | 5.75 | 23 603 | 25.89 |
| Nucleariidae | 0 | 0.00 | 0 | 0.00 | 2 | 0.00 |
| Parabasalidea | 1 | 0.51 | 0 | 0.00 | 197 | 0.51 |
| Rhizaria | 12 | 2.66 | 2 | 0.44 | 451 | 3.10 |
| Rhodophyta | 27 | 3.52 | 28 | 3.65 | 768 | 7.16 |
| Stramenopiles | 4441 | 52.01 | 2537 | 29.71 | 8539 | 81.72 |
| Viridiplantae | 72 322 | 72.95 | 20 488 | 20.67 | 99 141 | 93.61 |
| Sum | 162 640 | 59.61 | 53 711 | 19.69 | 272 848 | 79.29 |
Figure 2.General ITS2 topology and visualization of plant HMM motifs for the secondary structure of D. brevis (gi: 31281745). Annotation from HMMs of 5.8S and 28S are displayed as dotted lines tracing the outline of their position, whereas the ITS2 motif HMMs are represented by coloured lines. In parts of these motifs, nucleotide frequencies are presented (21,22). Nucleotides are coloured yellow in unpaired regions, whereas paired nucleotides are blue. CBCs between secondary structures of D. brevis and D. scapigeroides (gi: 31281755) are shown in red.