| Literature DB >> 19821996 |
Frank Förster1, Chunguang Liang, Alexander Shkumatov, Daniela Beisser, Julia C Engelmann, Martina Schnölzer, Marcus Frohme, Tobias Müller, Ralph O Schill, Thomas Dandekar.
Abstract
BACKGROUND: Tardigrades represent an animal phylum with extraordinary resistance to environmental stress.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19821996 PMCID: PMC2768748 DOI: 10.1186/1471-2164-10-469
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
CLANS clusters of sequence similar proteins in published tardigrade sequences1
| Number/color | Cluster description | Sequences/percentage |
|---|---|---|
| 2 | Cytochrome c oxidase like (subunit 1, EC 1.9.3.1) | 425 (3.94%) |
| 3 | Uncharacterized protein U88/Glycosyltransferase 8 family | 302 (2.80%) |
| 5 | Proteins containing a Chitin binding domain | 191 (1.77%) |
| 6 | Proteins containing an IBR/Neuroparsin/DUF1096 domain | 189 (1.75%) |
| 7 | Fatty-acid binding protein (FABP) family | 127 (1.18%) |
| 8 | TSP2 (remote homology to Sericin 1) | 126 (1.17%) |
| 9 | Proteins containing a RNA polymerase Rpb3/Rpb11 dimerisation domain | 92 (0.85%) |
| 10 | Metallothionein superfamily (Type 15 family./Thioredoxin like) | 84 (0.78%) |
| 12 | GTP-binding elongation factor family. EF-Tu/EF-1A sub- family | 79 (0.72%) |
| 13 | GST superfamily. Sigma family | 78 (0.70%) |
| 14 | Ubiquitin family | 75 (0.69%) |
| 15 | Cathepsin family (EC 3.4.22.-) | 74 (0.67%) |
| 16 | Carboxypeptidase A inhibitor like | 72 (0.64%) |
| 17 | Trichohyalin/Translation initiation factor like | 69 (0.60%) |
| 18 | TSP2 | 65 (0.57%) |
| 19 | TSP2 | 61 (0.56%) |
| 20 | RNA/DNA-binding proteins | 60 (0.55%) |
| ... | ||
| 23 | Small Heat Shock Protein (HSP20) family | 53 (0.47%) |
| 24 | Diapause-specific proteins | 51 (0.44%) |
| ... | ||
| 38 | LEA type 1 family proteins | 31 (0.28%) |
| ... |
1 Shown are the number of proteins found for the specified cluster, their percentages and the corresponding cluster number in Figure 1. The full Table with all clusters and their color code matching to Figure 1 is given in [additional file 1: Table S1]. Clusters 1, 4 and 11 contain rRNA and are given in [additional file 1].
2 Tardigrade specific protein cluster
Highly represented protein functions in Tardigrades (COGs and KOGs).
| Information storage and processing |
|---|
| 75 Translation elongation factor EF-1 (COG5256) |
| 64 GTPases - translation elongation (COG0050) |
| 58 Peptide chain release factor RF-3 (COG4108) |
| Cellular processes and signaling |
| 31 Ubiquitin (COG5272) |
| 25 Membrane GTPase LepA (COG0481) |
| 21 Cysteine protease (COG4870) |
| Metabolism |
| 75 Heme/copper-type cytochrome/quinol oxidases (COG0843) |
| 67 GTPases - Sulfate adenylate transferase (COG2895) |
| Poorly characterized |
| 11 Dehydrogenases with different specificities (COG1028) |
| 11 Uncharacterized homolog of Blt101 (COG0401) |
| Information storage and processing |
| 77 Translation elongation factor EF-1 (KOG0052) |
| 71 Polypeptide release factor 3 (KOG0459) |
| 70 Elongation factor 1 alpha (KOG0458) |
| 53 Mitochondrial translation elongation factor Tu (KOG0460) |
| Cellular processes and signaling |
| 52 Glutathione S-transferase (KOG1695) |
| 46 Cysteine proteinase Cathepsin L (KOG1543) |
| 34 Apolipoprotein D/Lipocalin (KOG4824) |
| 31 Cysteine proteinase Cathepsin F (KOG1542) |
| Metabolism |
| 78 Cytochrome c oxidase (KOG4769) |
| 74 Fatty acid-binding protein FABP (KOG4015) |
| Poorly characterized |
| 31 Ubiquitin and ubiquitin-like proteins (KOG0001) |
| 16 GTPase Rab18, small G protein superfamily (KOG0080) |
| 15 Ras-related GTPase (KOG0394) |
| 15 GTPase Rab21, small G protein superfamily (KOG0088) |
1 Detailed data and all COG/KOG numbers are given on the WEB page http://waterbear.bioapps.biozentrum.uni-wuerzburg.de/cgi-bin/cog_stat.pl. Summarized here are the functions of those clusters of orthologous groups (COGs) occurring particularly often or suggesting tardigrade specific adaptations.
Identified DnaJ-family COGs/KOGs in Tardigrades and Milnesium tardigradum1.
| present in | ||||
|---|---|---|---|---|
| KOG/COG number | COG distribution | COG name | Tardigrades | |
| COG0484 | DnaJ-class molecular chaperone with C-terminal Zn finger domain | 5 | ||
| COG2214 | DnaJ-class molecular chaperone | 8 | ||
| KOG0550 | Molecular chaperone (DnaJ superfamily) | 3 | 2 | |
| KOG0691 | Molecular chaperone (DnaJ su perfamily) | 7 | 2 | |
| KOG0712 | Molecular chaperone (DnaJ su perfamily) | 8 | 2 | |
| KOG0713 | Molecular chaperone (DnaJ su perfamily) | 5 | 1 | |
1Shown are the number of proteins found for the specified COG/KOG number, the KOG distribution of the KOG in different eukaryotic species (see abbreviations) and the COG/KOG annotation.
Abbreviations: A Arabidopsis thaliana, C Caenorhabditis elegans, D Drosophila melanogaster, H Homo sapiens, Y Saccharomyces cerevisiae, P Schizosaccharomyces pombe, E Encephalitozoon cuniculi. *These include specific unpublished data from ongoing work on M. tardigradum
Regulatory elements in Hypsibius dujardini1 and Milnesium tardigradum2 mRNA sequences.
| Motiv |
|
|
|---|---|---|
| 15-LOX-DICE | 1528 (1269)3 | 46 (45)3 |
| ADH DRE | 60 (58)3 | 1 (1)3 |
| BRE | 1(1)3 | -- |
| Brd-Box | 152 (149)3 | 28 (22)3 |
| CPE | 21 (21)3 | 15 (15)3 |
| Elastin G3A | 1 (1)3 | -- |
| GLUT1 | 1 (1)3 | -- |
| GY-Box | 406 (372)3 | 21 (21)3 |
| IRE | 1 (1)3 | -- |
| IRES | 1353 (1353)3 | 90 (90)3 |
| K-Box | 469 (447)3 | 35 (33)3 |
| SECIS-1 | 1 (1)3 | -- |
| SECIS-2 | 6 (6)3 | -- |
| TGE | 5 (5)3 | 1 (1)3 |
| TOP | 50 (50)3 | 1 (1)3 |
1 We considered 5,378 ESTs in H. dujardini. 2 We considered 607 ESTs in M. tardigradum. 3 The number of hits is followed by the number of mRNAs with this hit in brackets to indicate multiple hits.
Figure 1Functional clusters by CLANS of sequence related proteins in tardigrades. All available [see additional file 1: Figure S5] tardigrade protein sequences were clustered in a 3D sphere according to their sequence distance and were projected to the paper plane. Individual protein functions are colored [for color code see additional file 1: Table S1] and all listed in Table 1. Functional clusters appear as patches of an individual color. Color code and clusters can be interactively examined at the Tardigrade workbench http://waterbear.bioapps.biozentrum.uni-wuerzburg.de and are given in [additional file 1 Table S1]. figure1.pdf
HSP90 proteins identified in Hypsibius dujardini using the Tardigrade analyzer1.
| Hit | Predicted function/name (Tardigrade analyzer) | Pattern matched | Start | End |
|---|---|---|---|---|
| position | ||||
| gi:37213462 | hsp902 | 68 | 77 | |
| gi:37213713 | hsp902 | 70 | 79 | |
1These are hits using the pattern hit option and the heat shock protein PROSITE entry PS00298 for pattern generation and recognition. The pattern has the signature of Y-x- [NQHD]- [KHR]- [DE]- [IVA]-F- [LM]-R- [ED]. 2 Predicted similarity to Q7PT10 (HSP83 ANOGA) from Swissprot