| Literature DB >> 16817973 |
Abstract
BACKGROUND: The epsilon proteobacteria, which include many important human pathogens, are presently recognized solely on the basis of their branching in rRNA trees. No unique molecular or biochemical characteristics specific for this group are known.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16817973 PMCID: PMC1557499 DOI: 10.1186/1471-2164-7-167
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Proteins that are uniquely present in most epsilon proteobacteria (Campylobacterales)
| WS0030 | 68 aa | Probable periplasmic protein, tat-domain | All significant hits from ε-Proteobacteria | |
| WS0086 | 181 aa | Putative helicase, gnl|CDD|14084, COG4951 | All significant hits from ε-Proteobacteria | |
| WS0133 | 397 aa | Putative integral membrane protein | All significant hits from ε-Proteobacteria | |
| WS0134 | 214 aa | Conserved hypothetical protein | All significant hits from ε-Proteobacteria | |
| WS0154 | 336 aa | Probable membrane protein | All significant hits from ε-Proteobacteria | |
| WS0159 | 203 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS0169 | 92 aa | Possible membrane protein (corresponds to Cj0692c and HP0748) | All significant hits from ε-Proteobacteria | |
| WS0172 | 675 aa | Putative membrane protein, similar to HP0358 | All significant hits from ε-Proteobacteria | |
| WS0180 | 74 aa | Related to Cbb3-type cytochrome oxidase, subunit 3, gnl|CDD|13876, COG4736 | All significant hits from ε-Proteobacteria | |
| WS0184 | 205 aa | Probable membrane protein | All significant hits from ε-Proteobacteria | |
| WS0185 | 163 aa | Related to FixH protein, gnl|CDD|23975 | All significant hits from ε-Proteobacteria | |
| WS0216 | 330 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteo; E value for | |
| WS0260 | 142 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria; E value for | |
| WS0266 | 271 aa | Conserved protein, | All significant hits from ε-Proteobacteria; Jonsson et al. (2004) | |
| WS0316 | 163 aa | Conserved protein related to the RDD family; gnl|CDD|25144, pfam06271 | Besides ε-Proteo, three other hits were below the threshold value; Large jump in E value from last ε-Proteo (6E-15) to the first of these hits (.003). | |
| WS0447 | 328 aa | Putative membrane protein, corresponds to antigen P44Hh9 of | All significant hits from ε-Proteobacteria | |
| WS0448 | 276 aa | Probable periplasmic protein | All significant hits from ε-Proteobacteria | |
| WS0476 | 77 aa | NuoE, Putative NADH Oxidoreductase I | All significant hits from ε-Proteobacteria | |
| WS0480 | 428 aa | Putative NADH Oxidoreductase I | All significant hits from ε-Proteobacteria | |
| WS0490 | 778 aa | Flagellar functional protein, Pf1a | All significant hits from ε-Proteobacteria | |
| WS0520 | 247 aa | TonB domain protein | All significant hits from ε-Proteobacteria | |
| WS0563 | 164 aa | Putative integral membrane protein; identified by similarity to PIR:B71953 | All significant hits from ε-Proteobacteria | |
| WS0575 | 217 aa | Putative lipoprotein, The | All significant hits from ε-Proteobacteria. | |
| WS0604 | 390 aa | Probable periplasmic protein | All significant hits from ε-Proteobacteria | |
| WS0802 | 333 aa | Probable lipoprotein; identified as plasminogen binding protein pgpB. | All significant hits from ε-Proteobacteria; Jonsson et al. (2004) | |
| WS0865 | 126 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria; missing in | |
| WS1039 | 156 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS1040 | 236 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS1235 | 412 aa | Putative periplasmic protein; COG5659 | All significant hits from ε-Proteobacteria; not found in | |
| WS1244 | 167 aa | Putative lipoprotein | All significant hits from ε-Proteobacteria | |
| WS1329 | 246 aa | Putative periplasmic protein | All significant hits from ε-Proteobacteria; absent in | |
| WS1344 | 123 aa | Putative periplasmic protein | All significant hits from ε-Proteobacteria | |
| WS1349 | 110 aa | Probable membrane protein | All significant hits from ε-Proteobacteria | |
| WS1485 | 89 aa | Probable integral membrane protein | All significant hits from ε-Proteobacteria | |
| WS1495 | 87 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteo; The E value for | |
| WS1496 | 208 aa | Probable periplasmic protein | All significant hits from ε-Proteobacteria | |
| WS1640 | 117 aa | Probable integral membrane protein | All significant hits from ε-Proteobacteria; absent in | |
| WS1730 | 183 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS1755 | 168 aa | Probable lipoprotein | All significant hits from ε-Proteobacteria | |
| WS1771 | 183 aa | Putative membrane protein | All significant hits from ε-Proteobacteria; absent in | |
| WS1773 | 351 aa | Putative membrane protein | All significant hits from ε-Proteobacteria | |
| WS1777 | 80 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS1814 | 85 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS1874 | 352 aa | HolA, DNA polymerase III, delta subunit; gnl|CDD|11180, COG1466 | All significant hits except one from ε-Proteo; E value for | |
| WS1965 | 121 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS1990 | 118 aa | Conserved domain DUF 177: COG1399 | All significant hits from ε-Proteobacteria | |
| WS2120 | 162 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS2123 | 246 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteobacteria | |
| WS2146 | 147 aa | Contains Sua5_yciO_yrdC domain involved in binding to dsRNA; gnl|CDD|15330 | All significant hits except one from ε-Proteo; E value changes from 3e-19 to 2e-4 for | |
| WS0230 | 432 aa | Show significant similarity to deacylase domain; gnl|CDD|12932, COG3608 | Besides ε-Proteo, homologs with very low E values also present in two | |
| WS2059 | 259 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteo; not found in | |
| WS1752 | 145 aa | Conserved hypothetical protein, unknown function | All significant hits from ε-Proteo; not found in | |
| WS1211 | 621 aa | Homologous to CiaB invasion antigen of | All significant hits from ε-Proteo; not found in |
The species distribution of these proteins was determined by BLASTp and PSI-BLAST searches as described in the Methods section. Unless otherwise indicated all of these proteins are uniquely found in the following sequenced genomes: H. pylori 26695, H. pylori J99, H. hepaticus ATCC 51449, C. jejuni (various strains: NCTC 11168, RM1221, HB93-13, 84-25, CF93-6, 260.94, 11168 and 81-176), C. lari, C. coli, C. upsaliensis, C. fetus, W. succinogenes DSM 1740 and Thiomicrospira denitrificans ATCC 33889.
Figure 1Partial nucleotide sequence alignment for an ε-proteobacterial specific protein WS0086. The initial part of this alignment, which is less conserved and some of which is also missing in C. lari, is not shown. The asterisks (*) denote residues that are completely conserved. A number of conserved regions that are suitable for designing PCR primers or other diagnostic probes are boxed.
Proteins specific for the Wolinella and Helicobacter species (Helicobacteraceae family)
| WS0068 | 79 | Hypothetical protein, unknown function | |
| WS0584 | 171 | Hypothetical protein, unknown function | |
| WS1041 | 277 | Hypothetical protein, unknown function | |
| WS1051 | 80 | Hypothetical protein, unknown function | |
| WS1084 | 81 | Hypothetical protein, unknown function | |
| WS2139 | 161 | Hypothetical protein, unknown function | |
| WS2156 | 137 | Hypothetical protein, unknown function | |
| WS0682a | 217 | Hypothetical protein, unknown function | |
| WS0805a | 110 | Hypothetical protein, unknown function | |
| WS0828a | 125 | Hypothetical protein, unknown function | |
| WS1624a | 221 | Hypothetical protein, unknown function |
Homologs showing significant similarities to these proteins are only detected in the sequenced Wolinella and Helicobacter genomes (W. succinogenes DSM 1740, H. pylori 26695, H. pylori J99 and H. hepaticus ATCC 51449).
a These proteins are only found in W. succinogenes and H. hepaticus.
Proteins specific for all sequenced Campylobacter species
| CJE0368 [ | hypothetical protein (398) | CJE1156 [ | putative membrane protein (154) |
| CJE0399 [ | hypothetical protein (140) | CJE1173 [ | outer membrane protein MapA (214) |
| CJE0627 [ | probable membrane protein (144) | CJE1222 [ | probable periplasmic protein (150) |
| CJE0751 [ | hypothetical protein (144) | CJE1367 [ | hypothetical protein (110) |
| CJE0754 [ | membrane protein, putative (164) | CJE1499 [ | acyl carrier protein, putative (74) |
| CJE0790 [ | membrane protein, putative (163) | CJE1574 [ | hypothetical protein (231) |
| CJE0888 [ | prevent-host-death family protein (71) | CJE1623 [ | putative ATP/GTP-binding protein (187) |
| CJE0986 [ | Putative periplasmic protein (156) | CJE1670 [ | hypothetical protein (142) |
| CJE1022 [ | putative periplasmic protein (244) | CJE1745 [ | hypothetical protein (230) |
These proteins are uniquely found in all of the following Campylobacter genomes, unless otherwise indicated: C. jejuni (various strains: NCTC 11168, RM1221, HB93-13, 84-25, CF93-6, 260.94, 11168 and 81-176), C. lari, C. coli, C. upsaliensis and C. fetus.
a – missing in C. upsaliensis
b – missing in C. lari.
Campylobacter-specific proteins that are missing in C. fetus
| Geneomic ID [Accession Number] | Possible Function (length) | Geneomic ID [Accession Number] | Possible Function (length) |
| CJE0037 [ | hypothetical protein (215) | CJE0959 [ | hypothetical protein (210) |
| CJE0039 [ | hypothetical protein (107) | CJE1180 [ | hypothetical protein (83) |
| CJE0193 [ | hypothetical protein (115) | CJE1221 [ | prepilin-type N-terminal cleavage/methylation domain protein (220) |
| CJE0455 [ | putative lipoprotein (299) | CJE1327 [ | putative periplasmic protein (268) |
| CJE0470 [ | membrane protein, putative (318) | CJE1351 [ | hypothetical protein (67) |
| CJE0476 [ | hypothetical protein (111) | CJE1378 [ | hypothetical protein (106) |
| CJE0867 [ | putative periplasmic protein (339) | CJE1572 [ | lipoprotein, putative (176) |
| CJE0899 [ | small hydrophobic protein (101) | CJE1849 [ | probable periplasmic protein (254) |
| CJE0929 [ | putative lipoprotein (161) | CJE1890 [ | Ribbon-helix-helix protein, copG family (82) |
These proteins are uniquely present in all of these species unless otherwise noted: C. jejuni (various strains: NCTC 11168, RM1221, HB93-13, 84-25, CF93-6, 260.94, 11168 and 81-176), C. lari, C. coli, and C. upsaliensis.
a – missing in C. upsaliensis
b – missing in some C. jejuni strains
Proteins uniquely found in C. jejuni, C. coli and C. upsalienesis
| Geneomic ID [Accession Number] | Possible Function (length) | Geneomic ID [Accession Number] | Possible Function (length) |
| CJE0052 [ | hypothetical protein (90) | CJE1095 [ | site-specific recombinase XerC, putative (86) |
| CJE0053 [ | hypothetical protein (67) | CJE1096 [ | hypothetical protein (76) |
| CJE0079 [ | hypothetical protein (34) | CJE1099 [ | hypothetical protein (43) |
| CJE0413 [ | hypothetical protein (83) | CJE1795 [ | membrane protein, putative (173) |
| CJE0761 [ | putative periplasmic protein (182) | CJE1803 [ | hypothetical protein (292) |
a – missing in some C. jejuni strains
Proteins unique to C. jejuni and C. coli
| Geneomic ID [Accession Number] | Geneomic ID [Accession Number] | Geneomic ID [Accession Number] | Geneomic ID [Accession Number] |
| CJE1150 [ | CJE1098 [ | CJE0425 [ | CJE1131 [ |
| CJE1153 [ | CJE1101 [ | CJE0387 [ | CJE1375 [ |
| CJE1154 [ | CJE1093 [ | CJE0388 [ | CJE1376 [ |
| CJE1125 [ | CJE0835 [ | CJE0389 [ | CJE1392 [ |
| CJE1126 [ | CJE0690 [ | CJE0266 [ | CJE1550 [ |
| CJE1104 [ | CJE0671 [ | CJE0053 [ | CJE1551 [ |
| CJE1105 [ | CJE0477 [ | CJE0067 [ | CJE1552 [ |
Proteins specific for Campylobacter jejuni
| CJE0392 [ | CJE0213 [ | CJE0257 [ | CJE1053 [ |
| CJE0602 [ | CJE0216 [ | CJE0259 [ | CJE1054 [ |
| CJE0668 [ | CJE0223 [ | CJE0267 [ | CJE1424 [ |
| CJE0669 [ | CJE0225 [ | CJE0268 [ | CJE1431 [ |
| CJE1760 [ | CJE0239 [ | CJE0271 [ | CJE1432 [ |
| CJE0204 [ | CJE0240 [ | CJE0574 [ | CJE1433 [ |
| CJE0205 [ | CJE0245 [ | CJE0946 [ | CJE1470 [ |
| CJE0206 [ | CJE0247 [ | CJE1046 [ | CJE1629 [ |
| CJE0208 [ | CJE0248 [ | CJE1048 [ | CJE1829 [ |
| CJE0211 [ | CJE0253 [ | CJE1052 [ | CJE1840 [ |
The first 5 proteins marked by * are present in all C. jejuni strains (NCTC 11168, RM1221, HB93-13, 84-25, CF93-6, 260.94, 11168 and 81-176). The other proteins are missing in one or more strains.
Figure 2Partial sequence alignments of the B protein from exinuclease ABC complex (A) and phenylalanyl-tRNA synthetase (B) showing two conserved indels that are specific for ε-Proteobacteria and not found in other organisms. The dashes (-) in the alignment show identity with the amino acid on the top line. The accession numbers of the sequences (second column) and position of the sequence in C. jejuni homolog (on top) are indicated. Sequence information for only representative species is shown.
Figure 3Partial sequence alignments of the FtsH protease (A) and RNA polymerase β' subunit (B) showing two conserved indels that are specific for the indicated subgroups of ε-Proteobacteria. The dashes (-) denote identity with the amino acid on the top line. Sequence information for only representative species is shown.
Figure 4Diagrammatic representation of the arrangements of two largest subunits of RNA polymerase, i.e. β subunit (RpoB) and β' subunit (RpoC) in different bacteria. In contrast to other bacteria where these proteins are made as distinct polypeptides, in Helicobacter and Wolinella a rare genetic event has led to fusion/joining of the genes for these proteins so that they are now made as a single large polypeptide.
Figure 5Phylogenetic trees based on (A) 16S rRNA and (B) concatenated sequences for 9 proteins (AlaRS, Gyrase A, Gyrase B, EF-Tu, EF-G, Hsp60, Hsp70, RpoB and RpoC) containing 7919 aligned positions. The sequences were bootstrapped either 100 (A) or 500 times (B) and bootstrap scores for all nodes above 50% are shown. (C) A model depicting the evolutionary stages where different Campylobacterales- (or ε-proteobacteria) specific proteins and other RGCs were introduced.