| Literature DB >> 20478826 |
Wolfgang Fischer1, Lukas Windhager, Stefanie Rohrer, Matthias Zeiller, Arno Karnholz, Reinhard Hoffmann, Ralf Zimmer, Rainer Haas.
Abstract
The availability of multiple bacterial genome sequences has revealed a surprising extent of variability among strains of the same species. The human gastric pathogen Helicobacter pylori is known as one of the most genetically diverse species. We have compared the genome sequence of the duodenal ulcer strain P12 and six other H. pylori genomes to elucidate the genetic repertoire and genome evolution mechanisms of this species. In agreement with previous findings, we estimate that the core genome comprises about 1200 genes and that H. pylori possesses an open pan-genome. Strain-specific genes are preferentially located at potential genome rearrangement sites or in distinct plasticity zones, suggesting two different mechanisms of genome evolution. The P12 genome contains three plasticity zones, two of which encode type IV secretion systems and have typical features of genomic islands. We demonstrate for the first time that one of these islands is capable of self-excision and horizontal transfer by a conjugative process. We also show that excision is mediated by a protein of the XerD family of tyrosine recombinases. Thus, in addition to its natural transformation competence, conjugative transfer of genomic islands has to be considered as an important source of genetic diversity in H. pylori.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20478826 PMCID: PMC2952849 DOI: 10.1093/nar/gkq378
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Comparison of the P12 genome with other complete H. pylori genomes
| Strain | P12 | 26695 | J99 | HPAG1 | Shi470 | G27 | B38 |
|---|---|---|---|---|---|---|---|
| Chromosome size (bp) | 1 673 813 | 1 667 867 | 1 643 831 | 1 596 366 | 1 608 548 | 1 652 982 | 1 576 758 |
| Plasmid size (bp) | 10 225 | – | – | 9370 | – | 10 031 | – |
| GC content | 38.8% | 38.9% | 39.2% | 39.1% | 38.9% | 38.9% | 39.2% |
| GC content plasmid | 35.1% | – | – | 36.4% | – | 34.9% | – |
| Number of CDS | 1567 + 11 | 1566c | 1491 | 1536 + 8 | 1569 | 1493 + 11 | 1528 |
| Average CDS length | 958 bp | 954 bp | 997 bp | 954 bp | 913 bp | 959 bp | 946 bp |
| Strain-specific CDS | 54 + 56 (7.0%) | 83 + 48 (8.4%) | 33 + 40 (4.9%) | 68 + 34 (6.6%) | 158 + 36 (12.4%) | 45 + 41 (5.8%) | 96 + 20 (7.6%) |
| Plasticity zones (PZ) | PZ1 (40.7kb) | PZ1 (38.9 kb) | PZ1 (39.4 kb) | ||||
| PZ2 (14.1kb) | PZ (31.0 kb) | PZ (45.4 kb) | PZ (4.1 kb) | PZ2 (7.9 kb) | PZ2 (12.0 kb) | PZ (18.1 kb) | |
| PZ3 (30.0kb) | |||||||
| No. of PZ CDS | 90 | 56 | 38 | 6 | 46 | 46 | 18 |
| PZ GC content | 33.4%h | 33.2% | 34.1% | n.d. | 32.5%i | 32.7%i | n.d. |
| Cag-PAI (TFS1) | + | + | + | + | + | + | – |
| ComB (TFS2) | + | + | + | + | + | + | + |
| TFS3/TFS4 | +/+ | +k/+k | −/+k | −/– | −/+ | −/+ | −/– |
aGenBank accession numbers of chromosome sequences: P12, CP001217; 26695, AE000511; J99, AE001439; HPAG1, CP000241; Shi470, CP001072; G27, CP001173; B38, FM991728.
bCoding sequences on chromosome + plasmid.
cRevised annotation according to (22).
dCDS that are absent from all other six genomes + additional CDS absent from at least four of the other genomes. Numbers in brackets indicate the percentage of these genes in relation to all CDS.
eParts of a single PZ2 that was split by a genome rearrangement and contains PZ1- and PZ3-like elements
fPZ2 that contains PZ1- and PZ3-like elements.
gExcluding IS elements.
hPZ1, 33.8%; PZ2, 33.9%; PZ3, 32.8%.
iPZ1-like regions only.
jPresence or absence of type IV secretion systems (TFS) is indicated by (+) or (–), respectively.
kOnly fragments present.
Figure 1.Strain-specific genes in the H. pylori P12 genome. (A) Circular representation of the genome. Genes predicted on the plus and minus strands are shown as bars on the outer circles. Third and fourth circles: Positions of strain-specific genes (being absent from at least three out of six complete genome sequences). Fifth and sixth circles: GC content and GC skew calculated with a window size of 10 000 and steps of 100 bp. Positions of the putative origin and terminus of replication, the cag pathogenicity island (cagPAI), the comB genes and the plasticity zones (PZ1-3) with their corresponding type IV secretion systems (tfs) are indicated. (B) Venn diagrams showing numbers of common and strain-specific genes for the complete genome sequences indicated. (C) Power law regression fit for the identification of new genes in H. pylori genomes [according to (1)]. The x-axis indicates the number N of complete genome sequences examined, and the y-axis the number n of newly identified genes in each genome. The straight green line, representing a regression fit of the mean numbers of n, indicates a power law progression (n ∼ N−α), and a power law coefficient (α = 0.339) <1 (dashed line) indicates that the rate of newly identified genes is decreasing very slowly and that H. pylori has thus an open pan-genome.
Analysis of integration sites of 101 strain-specific genes in strain P12
| No. of genes | No. of sites | Unique sitesa | Hot spotsb | |
|---|---|---|---|---|
| Syntenic regionsc | 20 | 16 | 7 | 9 |
| Synteny breakpointsd | 32 | 21 | 3 | 18 |
| Plasticity zones | 49 | 3 | 3 | 0 |
| Total | 101 | 40 | 13 | 27 |
aInsertion sites containing either a certain specific gene or no insertion at all.
bInsertion sites containing different strain-specific genes in different strains.
cInsertion sites within regions of gene synteny.
dInsertion sites at breakpoints of gene synteny between H. pylori strains or between H. pylori and H. acinonychis.
Figure 2.Type IV secretion systems in plasticity zones. (A) Gene arrangement in PZ1 and comparison with corresponding regions in other genome sequences. PZ1 of strain P12 is inserted into a restriction–modification system pseudogene (similar to gene hp464 in strain 26695). Genes encoding type IV secretion system components are indicated as black arrows, and frameshift mutations are indicated by asterisks. Genes encoding proteins with 90–95% sequence similarity to the P12 proteins are shown in full colour, genes encoding proteins with 50–75% sequence similarity are hatched. Note that tfs4 has been termed tfs3a (for strains Shi470 and G27) or tfs3b (for strain P12) previously (32). PeCan18B plasticity zone, GenBank accession AF487344.3. (B) Neighbor-joining tree showing relationships between PZ type IV secretion systems and other Helicobacter and Campylobacter type IV secretion systems, based on average distances of the corresponding VirB4, VirB9 and VirB10 homologs. The TFS4 systems of strains P12 (tfs4-HPP12), G27 (tfs4-G27) and Shi470 (tfs4-HPSH) are depicted individually to show their mutual relationships. pTet, C. jejuni 81–176 pTet plasmid; pVir, C. jejuni 81–176 pVir plasmid.
Figure 3.Gene content analysis of plasticity zones in H. pylori isolates by microarray hybridization. Fragmented and biotin-labeled genomic DNA preparations of the indicated strains were used as probes for array hybridization. Plasticity zone genes are indicated by their hpp12 gene numbers on the left, and putative gene functions are indicated on the right. The presence of individual genes is indicated in white, and their absence in black color. The tfs3 and tfs4 gene clusters are boxed.
Figure 4.Horizontal gene transfer mediated by the plasticity zone 1 T4SS. (A) DNA transfer experiments were performed in the presence of DNase with a virB4/topATFS4-reconstituted P12 donor strain containing a chloramphenicol resistance cassette inserted into an intergenic region of PZ1, and a recA deletion to render the donor strain non-transformable. As recipient strains, we used a P12 wild-type strain with a ΔmoeB(hpp12_765)::aphA-3 insertion conferring kanamycin resistance, or a tfs4 deletion variant with the same aphA-3 insertion. Growth of chloramphenicol/kanamycin double-resistant clones indicated a unidirectional transfer of (parts of) PZ1 to the recipient strains. (B) DNA transfer rates from co-cultivation experiments using the donor and recipient strains indicated. Data shown are mean values of at least four independent experiments including standard deviations (C) Chromosomal DNA of transconjugant clones was examined by PCR and sequencing for co-transfer of the intact (non-frameshifted) virB4 allele, indicating transfer of the whole PZ or the whole tfs4 system. Proportions of transconjugants containing intact virB4 alleles are expressed as percentages of at least 12 independent clones sequenced. (D) Detection of a circular PZ1 intermediate. PCR products spanning the junction sites of circular intermediates were generated using primers WS362 and WS363 from the donor strains indicated. (E) Empty-site PCR with primers WS429 and WS432 was used to obtain DNA fragments from chromosomal DNA prepared from a co-cultivation mixture. Sequences of empty-site PCR fragments and circular intermediates are shown.