| Literature DB >> 19834620 |
Daniela Lepka1, Tobias Kerrinnes, Evelyn Skiebe, Birgitt Hahn, Angelika Fruth, Gottfried Wilharm.
Abstract
We report the nucleotide sequence of two novel cryptic plasmids (4357 and 14 662 base pairs) carried by a Yersinia enterocolitica biotype 1A strain isolated from pork. As distinguished from most biotype 1A strains, this isolate, designated 07-04449, exhibited adherence to eukaryotic cells. The smaller plasmid pYe4449-1 carries five attributable open reading frames (ORFs) encoding the first CcdA/CcdB-like antitoxin/toxin system described for a Yersinia plasmid, a RepA-like replication initiation protein, and mobilizing factors MobA and MobC. The deduced amino acid sequences showed highest similarity to proteins described in Salmonella (CcdA/B), Klebsiella (RepA), and Plesiomonas (MobA/C) indicating genomic fluidity among members of the Enterobacteriaceae. One additional ORF with unknown function, termed ORF5, was identified with an ancestry distinct from the rest of the plasmid. While the C+G content of ORF5 is 38.3%, the rest of pYe4449-1 shows a C+G content of 55.7%. The C+G content of the larger plasmid pYe4449-2 (54.9%) was similar to that of pYe4449-1 (53.7%) and differed from that of the Y. enterocolitica genome (47.3%). Of the 14 ORFs identified on pYe4449-2, only six ORFs showed significant similarity to database entries. For three of these ORFs likely functions could be ascribed: a TnpR-like resolvase and a phage replication protein, localized each on a low C+G island, and DNA primase TraC. Two ORFs of pYe4449-2, ORF3 and ORF7, seem to encode secretable proteins. Epitope-tagging of ORF3 revealed protein expression at 4 degrees C but not at or above 27 degrees C suggesting adaptation to a habitat outside swine. The hypothetical protein encoded by ORF7 is the member of a novel repeat protein family sharing the DxxGN(x)(n)DxxGN motif. Our findings illustrate the exceptional gene pool diversity within the species Y. enterocolitica driven by horizontal gene transfer events.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19834620 PMCID: PMC2760160 DOI: 10.1155/2009/398434
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Figure 1Yersinia enterocolitica biotype 1A strain 07-04449 adhering to eukaryotic cells. Rat intestinal IEC-6 cells (a) and human intestinal Int-407 cells (b) were infected with strain 07-04449 at a multiplicity of infection (MOI) of 50, stained with Giemsa and subjected to microscopy.
Figure 2Maps of plasmids pYe4449-1 and pYe4449-2 . Open reading frames (ORFs) with attributable functions and e-values below e-20 are indicated by black arrows, ORFs with significant homology to database entries (e-values below e-10) but unclear functions are indicated by grey arrows, and ORFs without significant homology (e-values above 0.1) to database entries are indicated by white arrows. Details about identified ORFs are given in Tables 1 and 2.
ORFs identified on plasmid pYe4449-1.
| Name1 | Position (bp) | Strand | Size (aa) | MW2 (kDa) | pI2 | Identities | Expect | Best BlastP match3/function | Accession number |
|---|---|---|---|---|---|---|---|---|---|
| 298–537 | + | 79 | 9 | 10.13 | 56/79 (70%) | 2e-26 | CdcA-like protein ( | gb∣AAV53396.1∣ | |
| 537–845 | + | 102 | 11.06 | 5.02 | 70/102 (68%) | 5e-36 | cytotoxic protein CcdB( | ref∣ZP_02659678.1∣ | |
| 848–2419 | − | 523 | 59.2 | 8.42 | 289/514 (56%) | 3e-133 | putative relaxase/mobilization nuclease MobA ( | emb∣CAB56514.1∣ | |
| 2409–2735 | − | 108 | 12.5 | 10.18 | 84/109 (77%) | 4e-37 | putative mobilization protein ( | emb∣CAB56514.1∣ | |
| ORF 5 | 2981–3481 | − | 166 | 18.7 | 6.41 | hypothetical protein | |||
| 3481–4227 | − | 248 | 28.2 | 8.83 | 184/244 (75%) | 9e-103 | replication initiation protein A ( | ref∣NP_620615.1∣ |
1Gene names according to putative conserved protein domains if available.
2Molecular weights and isoelectric points are calculated using the algorithm from ExPASy's Compute pI/Mw program.
3Only significant BlastP matches with e-values below e-10 are listed; all others are indicated as “hypothetical protein” and show e-values above 0.1.
ORFs identified on plasmid pYe4449-2.
| Name1 | Position (bp) | Strand | Size (aa) | MW2 (kDa) | pI2 | Identities | Expect | Best BlastP match3/function | Accession |
|---|---|---|---|---|---|---|---|---|---|
| ORF 1 | 446–913 | + | 155 | 16.8 | 5.3 | hypothetical protein | |||
| ORF 2 | 957–3410 | + | 817 | 86.4 | 9.9 | hypothetical protein | |||
| ORF 3 ( | 3933–4511 | − | 192 | 20.9 | 6.2 | protein with unknown function expressed at low temperature | |||
| ORF 4 | 4511–5146 | − | 211 | 23.2 | 5.4 | hypothetical protein | |||
| 5371–5997 | + | 208 | 23.3 | 9.6 | 92/208 (44%) | 2e-46 | Resolvase, N-terminal domain ( | ref∣YP_957859.1∣ | |
| ORF 6 | 6107–6301 | + | 64 | 7.1 | 10.8 | hypothetical protein | |||
| ORF 7 ( | 6361–6744 | + | 127 | 13.5 | 4.2 | 42/113 (37%) | 3e-10 | hypothetical protein BACOVA_04330 ( | ref∣ZP_02067325.1∣ |
| ORF8 | 6795–7478 | − | 227 | 25.3 | 5.3 | 52/127 (40%) | 8e-18 | hypothetical protein ( | gb∣AAR06625.1∣ |
| ORF 9 | 7643–8119 | + | 158 | 17.6 | 10.2 | hypothetical protein | |||
| ORF 10 | 8637–9917 | − | 426 | 48.6 | 8 | 157/386 (40%) | 2e-80 | predicted P-loop ATPase (Bacteriophage APSE-3) | gb∣ACJ10118.1∣ |
| ORF 11 | 10971–11462 | + | 163 | 18.2 | 6.1 | hypothetical protein | |||
| ORF 12 | 11798–12250 | − | 150 | 16.5 | 5.8 | hypothetical protein | |||
| ORF 13 | 12247–12534 | − | 95 | 9.9 | 4.5 | 72/95 (75%) | 1e-28 | hypothetical protein pKlebpneu12_p49 ( | ref∣YP_002286974.1∣ |
| 12592–13803 | − | 403 | 44.8 | 6.3 | 345/401 (86%) | 0 | hypothetical protein pKlebpneu12_p48 ( | ref∣YP_002286973.1∣ |
1Gene names according to putative conserved protein domains if available.
2Molecular weights and isoelectric points are calculated using the algorithm from ExPASy's Compute pI/Mw program.
3Only significant BlastP matches with e-values below e-10 are listed; all others show e-values above 0.1 and are either indicated as “hypothetical protein” or experimental confirmation is specified.
Figure 3ORF7 of plasmid pYe4449-2 encodes a putative new repeat protein with a high content of phosphorylatable amino acids. (a) Scheme of the arrangement of the ORF7-encoded hypothetical protein. The nonameric repeat module “RTDSLGNTY” occurring twice in each repeat is indicated by dark grey boxes. (b) Alignment of the protein repeat segments encoded by ORF7, residues identical in all three repeats are highlighted against a dark grey background and residues identical in two of the three repeats are highlighted against a light grey background. The repeat module “RTDSLGNTY” occurring twice in each repeat is indicated by a bar below the consensus sequence. Below these bars, the interspecies repeat motif “DxxGN(x)nDxxGN” is given as deduced from the closest homologues of ORF 7 (BACOVA_04330; NE2449; MPG2080_02246). Serine and threonine residues most likely to be phosphorylated in a eukaryotic background according to NetPhos 2.0, server predictions (score 0.968 and above) were indicated by circles.
BLAST search for proteins containing the DxxGN(x)DxxGN repeat motif.
| DxxGN(x)nDxxGN with | Number of repeats‡ | Reference, gene product, organism |
|---|---|---|
| 0 | 3 | CAQ39838.1, hypothetical protein, conserved in |
| 1 | 3 | ZP_03280828.1, hypothetical protein ENTCAN_00574, |
| 2 | 8 | YP_001743821.1, putative autotransporter, |
| 8 | XP_001554189.1, predicted protein, | |
| 3 | 4 | XP_002046888.1, GJ13133, |
| 3 | ZP_03508148.1, hypothetical protein RetlB5_24044, | |
| 4 | 5§ | XP_002224392.1, hypothetical protein BRAFLDRAFT_86726, |
| 4* | XP_002206131.1, hypothetical protein BRAFLDRAFT_68704, | |
| 5 | 7 | ZP_03166503.1, hypothetical protein RUMLAC_00154 |
| 6 | 7079543 LMHCC_2776, YD repeat protein, | |
| 6 | 7 | ZP_02067325.1, hypothetical protein BACOVA_04330, |
| 6 | 7079543 LMHCC_2776, YD repeat protein, | |
| 7 | 20 | YP_002441035.1, hypothetical protein PLES_34491, |
| 5 | XP_002226734.1, hypothetical protein BRAFLDRAFT_89165, | |
| 11¶ | 2 | XP_001635831.1, predicted protein, |
‡BLAST hits with highest number of repeats were selected.
§Precisely, there are 11 consecutive DxxGN(x)4 repeats.
*Three of the four repeats are part of the triple tandem DxxGN(x)4DxxGN(x)4DxxGN.
¶For spacer regions (x)n with n > 7, a more specific search strategy is required to obtain significant hits; the repeat used for the Nematostella hit is DxLGN(x)11DxLGN.
Figure 4Detection of epitope-tagged ORF3 expressed at 4°C. A transposon mutant derivative of isolate 07-04449 carrying plasmid pYe4449-2/ORF3::Tn5 was transformed with plasmid pACYC184/ORF3-RGS(H)6 and pACYC184, respectively, and cultured over night at 4°C in LB broth. Bacterial pellet fractions “P” and TCA-precipitated supernatant fractions “S” were subjected to denaturing SDS-PAGE. Subsequently, the gel was Western-blotted followed by chromogenic detection using a monoclonal antibody directed against the RGS(H)4 epitope. The arrow indicates the ORF3-RGS(H)6 product.