| Literature DB >> 30463511 |
Sujay Chattopadhyay1, Peter B Chi2, Vladimir N Minin3, Douglas E Berg4, Evgeni V Sokurenko5.
Abstract
BACKGROUND: Helicobacter pylori is a human stomach pathogen, naturally-competent for DNA uptake, and prone to homologous recombination. Extensive homoplasy (i.e., phylogenetically-unlinked identical variations) observed in H. pylori genes is considered a hallmark of such recombination. However, H. pylori also exhibits a high mutation rate. The relative adaptive role of homologous recombination and mutation in species diversity is a highly-debated issue in biology. Recombination results in homoplasy. While convergent mutation can also account for homoplasy, its contribution is thought to be minor. We demonstrate here that, contrary to dogma, convergent mutation is a key contributor to Helicobacter pylori homoplasy, potentially driven by adaptive evolution of proteins.Entities:
Keywords: Adaptive evolution; Convergent mutations; Gastric pathogen; Helicobacter pylori; Protein-coding genes
Mesh:
Year: 2018 PMID: 30463511 PMCID: PMC6249973 DOI: 10.1186/s12864-018-5231-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Phylogram of concatenated sequences of 7 housekeeping genes (atpA, efp, mutY, ppa, trpC, ureI, yphC) for 38 H. pylori strains analyzed, along with two H. cetorum strains used as outgroups. The red renctangle includes the strains from Amerindian population (representating the ‘local’ subset analyzed in Additional file 2: Figure S1). The red asterisks (*) beside the strain-names denote the representatives of the ‘global’ subset (analyzed in Additional file 2: Figure S1)
Comparison of nucleotide diversity in core genes and homoplasy in the encoded proteins of H. pylori
| Complete set (38 strains) | ‘Local’ (Amerindian) subset (12 strains) | ‘Global’ subset (12 strains) | |
|---|---|---|---|
| core genes analyzed | 992 | 1106 | 1200 |
| diversity, π | 3.81 ± 0.03% | 2.25 ± 0.02% | 4.21 ± 0.03% |
| dN | 1.44 ± 0.03% | 0.91 ± 0.02% | 1.74 ± 0.04% |
| dS | 14.14 ± 0.11% | 7.79 ± 0.08% | 15.23 ± 0.13% |
| genes showing homoplasy in encoded proteins | 971 | 963 | 1126 |
| amino acid positions with repeated changes | 7.85 ± 0.15% | 2.33 ± 0.06% | 5.10 ± 0.11% |
| recombinant genes | 488 | 258 | 490 |
| amino acid positions with repeated changes in encoded proteins of recombinant genes | 9.03 ± 0.20% | 2.83 ± 0.12% | 6.02 ± 0.18% |
| non-recombinant genes | 504 | 848 | 710 |
| amino acid positions with repeated changes in encoded proteins of non-recombinant genes | 6.66 ± 0.21% | 2.15 ± 0.07% | 4.40 ± 0.13% |
The mean values are denoted in percentages along with their standard error values
Fig. 2Average distribution of amino acid positions with different kinds of point mutations in encoded proteins of non-recombinant core genes with convergent changes in H. pylori
Fig. 3Comparison of convergent mutation-frequencies in encoded proteins of real and simulated datasets of H. pylori non-recombinant core genes
Fig. 4Average frequency distribution of mutations in the encoded proteins showing convergence in non-recombinant core genes of H. pylori
Fig. 5Enriched (or overrepresented) functional categories of proteins encoded by non-recombinant core genes with convergent mutations