| Literature DB >> 18442984 |
Armelle Ménard1, Antoine Danchin, Sandrine Dupouy, Francis Mégraud, Philippe Lehours.
Abstract
The present study concerns the identification of a novel coding sequence in a region of the Helicobacter pylori genome, located between JHP1069/HP1141 and JHP1071/HP1143 according to the numbering of the J99 and 26,695 reference strains, respectively, and spanning three different coding DNA sequences (CDSs). The CDSs located at the centre of this locus were highly polymorphic, as determined by the analysis of 24 European isolates, 3 Asian, and 3 African isolates. Phylogenetic and molecular evolutionary analyses showed that the CDSs were not restricted to the geographical origin of the strains. Despite a very high variability observed in the deduced protein sequences, significant similarity was observed, always with the same protein families, i.e. ATPase and bacteriophage receptor/invasion proteins. Although this variability could be explained by isotopic gene replacement via horizontal transfer of a gene with the same function but coming from a variety of sources, it seems more likely that the very high sequence variation observed at this locus is the result of a strong selection pressure exerted on the corresponding gene product. The CDSs identified in the present study could be used as strain specific markers.Entities:
Mesh:
Year: 2008 PMID: 18442984 PMCID: PMC2650637 DOI: 10.1093/dnares/dsn006
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Representation of the genomic area of interest, according to the genome sequences of the two Helicobacter pylori reference strains J99 and 26695, which contain the variable CDS identified in the present study. Each CDS is represented by an arrow with the direction indicating the translational direction. The numbering under each CDS corresponds to the number of CDS in H. pylori J99 (top line) and 26695 (bottom line). The function of each CDS is indicated according to the revised annotation of Boneca et al.17 Fmt, methionyl-tRNA formyltransferase; BirA, biotin ligase bifunctional protein; ParB, replication/partition-related protein; ParA, chromosome partition protein (Soj).
Figure 2Phylogenetic analysis of CDS, CDS1, CDS2, and CDS3, proteins generated with the neighbour-joining method. The phylogeny presented is based on the alignment of the entire deduced protein. The bootstrap values are indicated next to each node. Nucleotide and protein sequences are available in GenBank (EMBL) for the Helicobacter pylori strains B34, G2, 8038, TALLAN, 12001, 8033, 19A, and BAPOO1 under the accession numbers EF492441 (AM902682), EF492442 (AM902683), EU553483 (AM946633), EU553485 (AM946634), EU553505 (AM946635), EU553482 (AM946636), EU553481 (AM946637), and EU556504 (AM946638), respectively. Nucleotide and protein sequences of H. pylori reference strains 26695, J99, and HPAG1 are available in GenBank under the accession numbers AE000511, AE001439, and ABF85147, respectively.
Analysis of molecular distances and synonymous and non-synonymous nucleotide substitutions within CDSs, CDS1 (n = 4), CDS2 (n = 4), and CDS3 (n = 3), in different Helicobacter pylori strains
| CDS1 | CDS2 | CDS3 | |
|---|---|---|---|
| Mol. distance (nt) | 0.045 ± 0.003 | 0.080 ± 0.005 | 0.069 ± 0.004 |
| No. differences (nt) | 98.167 ± 7.246 | 182.167 ± 9.753 | 170.00 ± 9.334 |
| Ks | 0.088 ± 0.012 | 0.161 ± 0.015 | 0.123 ± 0.013 |
| Ka | 0.035 ± 0.004 | 0.061 ± 0.004 | 0.057 ± 0.004 |
| Ka/Ks | 0.398 ± 0.071 | 0.379 ± 0.043 | 0.463 ± 0.059 |
nt, nucleotides; Ks, synonymous substitutions; Ka, non-synonymous substitutions.
†P Z-Test < 0.001 for purifying selection hypothesis (Ka/Ks < 1).
&Value ± standard error.
The GenBank accession numbers of the sequences used in this study are listed in Fig. 2.