| Literature DB >> 19727445 |
Christophe Carnoy1, Claude-Alain Roten.
Abstract
In E. coli, 10 to 15% of growing bacteria produce dimeric chromosomes during DNA replication. These dimers are resolved by XerC and XerD, two tyrosine recombinases that target the 28-nucleotide motif (dif) associated with the chromosome's replication terminus. In streptococci and lactococci, an alternative system is composed of a unique, Xer-like recombinase (XerS) genetically linked to a dif-like motif (dif(SL)) located at the replication terminus. Preliminary observations have suggested that the dif/Xer system is commonly found in bacteria with circular chromosomes but that assumption has not been confirmed in an exhaustive analysis. The aim of the present study was to extensively characterize the dif/Xer system in the proteobacteria, since this taxon accounts for the majority of genomes sequenced to date. To that end, we analyzed 234 chromosomes from 156 proteobacterial species and showed that most species (87.8%) harbor XerC and XerD-like recombinases and a dif-related sequence which (i) is located in non-coding sequences, (ii) is close to the replication terminus (as defined by the cumulative GC skew) (iii) has a palindromic structure, (iv) is encoded by a low G+C content and (v) contains a highly conserved XerD binding site. However, not all proteobacteria display this dif/XerCD system. Indeed, a sub-group of pathogenic epsilon-proteobacteria (including Helicobacter sp and Campylobacter sp) harbors a different recombination system, composed of a single recombinase (XerH) which is phylogenetically distinct from the other Xer recombinases and a motif (dif(H)) sharing homologies with dif(SL). Furthermore, no homologs to dif or Xer recombinases could be detected in small endosymbiont genomes or in certain bacteria with larger chromosomes like the Legionellales. This raises the question of the presence of other chromosomal deconcatenation systems in these species. Our study highlights the complexity of dif/Xer recombinase systems in proteobacteria and paves the way for systematic detection of these components in prokaryotes.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19727445 PMCID: PMC2731167 DOI: 10.1371/journal.pone.0006531
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genome and dif features of a representative panel of proteobacteria.
| Genome features | putative | |||||||
| Species | size (bp) | G+C content | Maximum CGC skew | Nucleotide sequence | position on genome | G+C content | distance from GCG skew | intergenic location |
|
| ||||||||
| Caulobacterales | ||||||||
|
| 4016947 | 0.672 | 1930040 |
| 1946380 | 0.250 | 14159 | yes |
| Rhizobiales | ||||||||
|
| 2841490 | 0.593 | 1485983 |
| 1478815 | 0.250 | 7168 | yes |
|
| 1581384 | 0.387 | 724981 |
| 720906 | 0.179 | 4075 | yes |
|
| 9105828 | 0.640 | 4893406 |
| 4996172 | 0.286 | 102766 | yes |
|
| 2117136 | 0.571 | 955452 |
| 954740 | 0.321 | 712 | yes |
|
| 1177785 | 0.573 | 757557 |
| 758183 | 0.214 | 626 | yes |
|
| 7036071 | 0.627 | 203543 |
| 299619 | 0.321 | 96076 | yes |
| Rhodobacterales | ||||||||
|
| 3188599 | 0.690 | 1435088 |
| 1436843 | 0.321 | 1755 | yes |
|
| 943016 | 0.690 | 399979 |
| 371575 | 0.250 | 28404 | yes |
| Rhodospirillales | ||||||||
|
| 4967148 | 0.650 | 2616185 |
| 2610339 | 0.393 | 5846 | yes |
| Rickettsiales | ||||||||
|
| 1516355 | 0.274 | 766761 |
| 747982 | 0.143 | 18779 | yes |
|
| 1111523 | 0.290 | 628915 |
| 596105 | 0.179 | 32810 | yes |
| Sphingomonadales | ||||||||
|
| 3561584 | 0.652 | 2030402 |
| 2048172 | 0.179 | 17770 | yes |
|
| ||||||||
| Burkholderiales | ||||||||
|
| 4086189 | 0.677 | 2227724 |
| 2229069 | 0.214 | 1345 | yes |
|
| 3510148 | 0.681 | 1086094 |
| 1081309 | 0.214 | 4785 | hyp. prot |
|
| 2325379 | 0.689 | 1077185 |
| 1075135 | 0.286 | 2050 | yes |
|
| 3716413 | 0.670 | 2009173 |
| 2031219 | 0.250 | 22046 | yes |
|
| 4712337 | 0.598 | 2485317 |
| 2472550 | 0.250 | 12767 | yes |
| Hydrogenophilales | ||||||||
|
| 2909809 | 0.660 | 1440104 |
| 1430783 | 0.214 | 9321 | yes |
| Methylophilales | ||||||||
|
| 2971517 | 0.557 | 1573478 |
| 1564653 | 0.214 | 8825 | yes |
| Neisseriales | ||||||||
|
| 2272351 | 0.515 | 1231577 |
| 1229349 | 0.214 | 2228 | hyp. prot |
| Nitrosomonadales | ||||||||
|
| 2812094 | 0.507 | 964528 |
| 974219 | 0.143 | 9691 | yes |
| Rhodocyclales | ||||||||
|
| 4501104 | 0.592 | 2186143 |
| 2192508 | 0.286 | 6365 | yes |
|
| ||||||||
| Bdellovibrionales | ||||||||
|
| 3782950 | 0.506 | 1940732 |
| 1946858 | 0.286 | 6126 | yes |
| Desulfobacterales | ||||||||
|
| 3523383 | 0.468 | 2260306 |
| 2338241 | 0.250 | 77935 | yes |
| Desulfovibrionales | ||||||||
|
| 3570858 | 0.631 | 1735879 |
| 1754277 | 0.250 | 18398 | yes |
| Desulfuromonadales | ||||||||
|
| 3814139 | 0.609 | 1865942 |
| 1891880 | 0.286 | 25938 | yes |
| Myxococcales | ||||||||
|
| 5013479 | 0.749 | 1906268 |
| 1906477 | 0.357 | 209 | yes |
|
| 9139763 | 0.688 | 4547166 |
| 4489697 | 0.393 | 57469 | yes |
| Syntrophobacterales | ||||||||
|
| 3179300 | 0.514 | 1665473 |
| 1665861 | 0.250 | 388 | yes |
|
| ||||||||
| Campylobacterales | ||||||||
|
| 2201561 | 0,34 | 1135161 |
| 1122264 | 0.175 | 12897 | yes |
|
| ||||||||
| Aeromonadales | ||||||||
|
| 4744448 | 0.615 | 2494705 |
| 2514936 | 0.286 | 20231 | yes |
| Alteromonadales | ||||||||
|
| 2839318 | 0.470 | 1411650 |
| 1387623 | 0.179 | 24027 | yes |
|
| 4969795 | 0.459 | 2490130 |
| 2476928 | 0.286 | 13202 | yes |
| Chromatiales | ||||||||
|
| 3481691 | 0.503 | 1849931 |
| 1850410 | 0.214 | 479 | yes |
| Enterobacteriales | ||||||||
|
| 5064019 | 0.509 | 2552458 |
| 2532133 | 0.250 | 20325 | yes |
|
| 4639675 | 0.507 | 1549688 |
| 1588788 | 0.286 | 39100 | yes |
|
| 4171146 | 0.546 | 2461688 |
| 2471148 | 0.250 | 9460 | yes |
|
| 4653728 | 0.476 | 2562641 |
| 2562919 | 0.286 | 278 | yes |
| Methylococcales | ||||||||
|
| 3304553 | 0.635 | 1531625 |
| 1492525 | 0.214 | 39100 | yes |
| Oceanospirillales | ||||||||
|
| 7215267 | 0.538 | 3439027 |
| 3437061 | 0.214 | 1966 | yes |
| Pasteurellales | ||||||||
|
| 1830023 | 0.381 | 1474989 |
| 1473975 | 0.143 | 1014 | yes |
| Pseudomonadales | ||||||||
|
| 3598621 | 0.404 | 1847121 |
| 1848733 | 0.179 | 1612 | yes |
|
| 6264403 | 0.665 | 2428120 |
| 2443082 | 0.214 | 14962 | yes |
| Thiotrichales | ||||||||
|
| 1892819 | 0.322 | 950050 |
| 994689 | 0.143 | 44639 | yes |
| Vibrionales | ||||||||
|
| 2961116 | 0.476 | 1564264 |
| 1564118 | 0.250 | 146 | yes |
|
| 1072311 | 0.469 | 512448 |
| 507996 | 0.357 | 4452 | yes |
| Xanthomonadales | ||||||||
|
| 5076172 | 0.650 | 2442019 |
| 2441762 | 0.286 | 257 | yes |
The central nucleotide in bold defines the position of the dif sequence on the chromosome. The nucleotides involved in the palindrome are underlined.
hyp.prot. = dif inserted into a hypothetical protein-encoding gene.
Sulfurimonas denitrificans strain DSM 1251 = Thiomicrospira denitrificans ATCC 33889.
maximum of the GC skew.
Figure 1Nucleotide variability within dif-related sequences.
(A) Consensus sequence and dif nucleotide variability for 161 dif-related sequences from 137 proteobacterial species. Nucleotide sequence characters in bold represent the dif sequence (28-mer). If the nucleotide frequency represents more than 50%, it is written in upper case letters; if not, the nucleotide is written in lower case letters. The nucleotide variability at each position in the 28-mer was defined as 1–f, where f is the frequency of the most frequent nucleotide. Nucleotide frequencies at each position are given in Table S2. Black bars represent dif XerC and dif XerD nucleotides, whereas grey bars correspond to the the dif cent nucleotides. White bars represent nucleotides outside dif. (B) Degree of variability in the dif sequence in 21 multi-strain species and in 19 multi-chromosome species. The degree of variability was calculated for each nucleotide position, as described in the Methods section.
Figure 2Phylogeny of proteobacterial XerC and XerD recombinases.
Representative proteobacterial species of each taxon were selected for the analysis (Table 1). β-proteobacterial species are represented in blue, with γ in red, δ in green, α in magenta and ε in black. Amino acid sequence alignments were performed using Clustal W (MEGA 4 [60]). The evolutionary history was inferred by using the Neighbor-Joining method [61] conducted in MEGA4. Similar results were obtained using the Minimum Evolution method (data not shown). Only significant bootstrap values (≥90%) obtained with 1000 runs are indicated next to the branches (white with a grey background). The tree is drawn to scale, with branch lengths (below the branches) in the same units as those of the evolutionary distances used to infer the phylogenetic tree. Branch lengths below the value 0.05 are not shown. The evolutionary distances were computed using the Poisson correction method and are given as the number of amino acid substitutions per site.
Figure 3Correlation between the position of the dif sequence and the terminus of replication as defined by cumulative GC skew.
The analysis was performed on the 161 proteobacterial chromosomes from the 137 representative dif + species (Table S1). Chromosome of Wolbachia endosymbiont of Drosophila melanogaster and chromosome 2 of Pseudoalteromonas haloplanktis were not included in the analysis since no terminus of replication could be located for these species by the method of the cumulative GC skew. The equation of the plot and the coefficient of determination (R2) are given.
Figure 4Phylogenetic analysis of XerC, XerD, XerH and XerS recombinases.
XerH from the ε subgroup species (listed in Table 2) were compared with XerD and XerC recombinases from other ε species and representative bacteria from the α, β, δ and γ taxa (Table 1). XerS recombinases of S. pyogenes M1 GAS and L. lactis Il1403 [23] were added for comparison. Amino acid sequence alignment (with Clustal W) and phylogenetic analyses were performed in MEGA4 [60]. The phylogeny was built using the Neighbor-Joining method [61]. The tree is drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the Poisson correction method and are in the units of the number of amino acid substitutions per site. The size range of the recombinases (in amino acids) is indicated under the recombinase name, in brackets.
Features of the putative dif sequences of ε-proteobacteria.
| ε-proteobacteria species | chromosomal features | putative | |||||||
| size (bp) | G+C content | CGC skew | sequence (40-mer) | G+C content | position on chromosome | distance from GC skew (bp) | distance from | intergenic location | |
|
| |||||||||
|
| 2341251 | 0.270 | 1232417 |
| 0.100 | 1197716 | 34701 | 66 | yes |
|
| 2052006 | 0.394 | 971215 |
| 0.125 | 999842 | 28627 | 59459 | hyp. prot |
|
| 1971264 | 0.445 | 1008113 |
| 0.100 | 991768 | 16345 | 68030 | hyp. prot |
|
| 1773615 | 0.333 | 908630 |
| 0.150 | 886842 | 21788 | 35968 | yes |
|
| 1711272 | 0.317 | 875024 |
| 0.125 | 851021 | 24003 | 214673 | yes |
|
| 1845106 | 0.306 | 853771 |
| 0.125 | 892865 | 39094 | 217 | yes |
|
| 1777831 | 0.303 | 893799 |
| 0.150 | 888360 | 5439 | 215 | yes |
|
| 1553927 | 0.382 | 748099 |
| 0.225 | 747275 | 824 | 282 | yes |
|
| 1799146 | 0.359 | 1794500 |
| 0.175 | 1765790 | 28710 | 125 | yes |
|
| 1667867 | 0.388 | 813426 |
| 0.225 | 723517 | 89909 | 1981 | yes |
|
| 1877931 | 0.397 | 927614 |
| 0.125 | 1001399 | 73785 | 52 | yes |
|
| 2110355 | 0.484 | 1188027 |
| 0.200 | 1170384 | 17643 | 16 | yes |
| consensus sequence |
| ||||||||
|
| distance from XerC / XerD (bp) | ||||||||
|
| 2201561 | 0.345 | 1135161 |
| 0.175 | 1122264 | 12897 | 705981/193695 | yes |
|
| 2562277 | 0.439 | 1189307 |
| 0.150 | 1186929 | 2378 | 1109695/758085 | yes |
| consensus sequence |
| ||||||||
One representative per species.
All genomes are circular.
The position of the putative dif motif on the chromosome corresponds to the nucleotide in bold type, located between the two inverted repeats.
hyp. prot. = hypothetical protein.
maximum of the GC skew.
Underlined nucleotides correspond to the inverted repeats.
Sulfurimonas denitrificans strain DSM 1251 = Thiomicrospira denitrificans ATCC 33889.
Figure 5Alignment of dif and dif.
The dif sequence corresponds to the putative dif motif of H. pylori 26695 (Table 2), whereas dif was described by Le Bourgeois et al. [23]. Asterisks indicate the common nucleotides and arrows designate inverted repeats.