| Literature DB >> 32843556 |
Iratxe Estibariz1,2,3, Florent Ailloud1,2,3, Sabrina Woltemate2,3, Boyke Bunk3,4, Cathrin Spröer3,4, Jörg Overmann3,4, Toni Aebischer5, Thomas F Meyer5, Christine Josenhans6,2,3, Sebastian Suerbaum6,2,3.
Abstract
Multiple studies have demonstrated rapid bacterial genome evolution during chronic infection with Helicobacter pylori In contrast, little was known about genetic changes during the first stages of infection, when selective pressure is likely to be highest. Using single-molecule, real-time (SMRT) and Illumina sequencing technologies, we analyzed genome and methylome evolution during the first 10 weeks of infection by comparing the cag pathogenicity island (cagPAI)-negative H. pylori challenge strain BCS 100 with pairs of H. pylori reisolates from gastric antrum and corpus biopsy specimens of 10 human volunteers who had been infected with this strain as part of a vaccine trial. Most genetic changes detected in the reisolates affected genes with a surface-related role or a predicted function in peptide uptake. Apart from phenotypic changes of the bacterial envelope, a duplication of the catalase gene was observed in one reisolate, which resulted in higher catalase activity and improved survival under oxidative stress conditions. The methylomes also varied in some of the reisolates, mostly by activity switching of phase-variable methyltransferase (MTase) genes. The observed in vivo mutation spectrum was remarkable for a very high proportion of nonsynonymous mutations. Although the data showed substantial within-strain genome diversity in the challenge strain, most antrum and corpus reisolates from the same volunteers were highly similar to each other, indicating that the challenge infection represents a major selective bottleneck shaping the transmitted population. Our findings suggest rapid in vivo selection of H. pylori during early-phase infection providing adaptation to different individuals by common mechanisms of genetic and epigenetic alterations.IMPORTANCE Exceptional genetic diversity and variability are hallmarks of Helicobacter pylori, but the biological role of this plasticity remains incompletely understood. Here, we had the rare opportunity to investigate the molecular evolution during the first weeks of H. pylori infection by comparing the genomes and epigenomes of H. pylori strain BCS 100 used to challenge human volunteers in a vaccine trial with those of bacteria reisolated from the volunteers 10 weeks after the challenge. The data provide molecular insights into the process of establishment of this highly versatile pathogen in 10 different human individual hosts, showing, for example, selection for changes in host-interaction molecules as well as changes in epigenetic methylation patterns. The data provide important clues to the early adaptation of H. pylori to new host niches after transmission, which we believe is vital to understand its success as a chronic pathogen and develop more efficient treatments and vaccines.Entities:
Keywords: DNA methylation; DNA modification; Helicobacter pylorizzm321990; MiSeq Illumina; PacBio; SMRT sequencing; adaptive mutations; genome analysis; methylome
Mesh:
Substances:
Year: 2020 PMID: 32843556 PMCID: PMC7448279 DOI: 10.1128/mBio.01803-20
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1(A) Schematic representation of the vaccine/challenge trial (28) which was the basis of the present study. Human volunteers were given a Salmonella Ty21a live vaccine control or a recombinant Ty21a vaccine expressing the H. pylori urease. Forty-two days later, human volunteers were challenged with the cagPAI-negative BCS 100 H. pylori strain, a heterogeneous strain from which several subclones (H1 to H16) were genome sequenced. Six and 10 weeks postchallenge (wpc), gastroscopies were performed and blood samples were collected. Reisolates were cultured from antrum and corpus biopsy specimens taken 10 wpc, and reisolates from 10 volunteers were subjected to SMRT sequencing technology to obtain complete genome sequences and methylome data. (B) Phylogenetic tree representing the genomes of all strains. The reisolates 12A3, 12C8, 81A1, 81C9, and 87C7 represent a distinct subpopulation compared to the clones obtained from the challenge strain H1 to H16. Blue, H1 to H16 clones. Red, volunteers infected by reisolates descending from distinct subpopulations of the inoculum. The scale bar indicates substitutions per site. Statistical branch support is indicated by posterior probability values and node circles.
Key characteristics of the genome sequences of H. pylori reisolates from human volunteers challenged with strain BCS 100
| Strain | Location | Genome | No. of SNPs | Mutation rate | ||
|---|---|---|---|---|---|---|
| Unique | Pair specific | Total | ||||
| H1 | Challenge | 1,563,305 | ||||
| 8A3 | Antrum | 1,563,267 | 1 | 1 | 2 | 6.67E−06 |
| 8C10 | Corpus | 1,563,260 | 1 | 1 | 2 | 6.67E−06 |
| 12A3 | Antrum | 1,563,420 | 1 | 0 | 1 | 3.34E−06 |
| 12C8 | Corpus | 1,563,409 | 1 | 0 | 1 | 3.34E−06 |
| 29A2 | Antrum | 1,563,333 | 1 | 0 | 1 | 3.34E−06 |
| 29C8 | Corpus | 1,561,020 | 2 | 0 | 2 | 6.68E−06 |
| 48A2 | Antrum | 1,563,280 | 0 | 1 | 1 | 3.34E−06 |
| 48C8 | Corpus | 1,563,290 | 0 | 1 | 1 | 3.34E−06 |
| 78A3 | Antrum | 1,563,274 | 2 | 0 | 2 | 6.67E−06 |
| 78C8 | Corpus | 1,563,292 | 0 | 0 | 0 | 0.00E+00 |
| 81A1 | Antrum | 1,563,405 | 1 | 0 | 1 | 3.34E−06 |
| 81C9 | Corpus | 1,563,438 | 0 | 0 | 0 | 0.00E+00 |
| 87A3 | Antrum | 1,563,302 | 3 | 0 | 3 | 1.00E−05 |
| 87C7 | Corpus | 1,563,411 | 0 | 0 | 0 | 0.00E+00 |
| 103A4 | Antrum | 1,563,266 | 1 | 0 | 1 | 3.34E−06 |
| 103C8 | Corpus | 1,566,721 | 3 | 0 | 3 | 9.98E−06 |
| 125A2 | Antrum | 1,563,276 | 1 | 0 | 1 | 3.34E−06 |
| 125C7 | Corpus | 1,563,239 | 2 | 0 | 2 | 6.67E−06 |
| 119A2 | Antrum | 1,563,232 | 2 | 0 | 2 | 6.67E−06 |
| 119C10 | Corpus | 1,563,317 | 1 | 0 | 1 | 3.34E−06 |
| Total | 23 | 2 | 25 | 4.50E−06 | ||
The letters A and C in the reisolate names indicates whether a strain was cultured from antrum (A) or corpus (C) biopsy specimens.
Unique and pair-specific SNPs in H. pylori reisolates from infected volunteers
| Functional | Strain(s) | Position | From > to | Type | ORF | HP no. |
|---|---|---|---|---|---|---|
| Energy metabolism | ||||||
| 1 | 78A3 | 317530 | c > t | Nonsynonymous | HP1166 | |
| 2 | 87A3 | 622512 | c > t | Nonsynonymous | HP0631 | |
| 3 | 8A3, 8C10 | 1407887 | c > t | Nonsynonymous | HP0056 | |
| Cell envelope, transport, and binding proteins | ||||||
| 4 | 12C8 | 33480 | a > g | Nonsynonymous | HP1503 | |
| 5 | 81A1 | 34327 | t > c | Nonsynonymous | HP1503 | |
| 6 | 29A2 | 231005 | g > a | Nonsynonymous | HP1243 | |
| 7 | 8C10 | 231370 | c > a | Stop | HP1243 | |
| 8 | 8A3 | 232590 | g > a | Nonsynonymous | HP1243 | |
| 9 | 87A3 | 896139 | c > a | Nonsynonymous | HP1251 | |
| 10 | 119C10 | 1185884 | g > a | Nonsynonymous | HP0279 | |
| 11 | 119A2 | 1211998 | g > a | Nonsynonymous | HP0251 | |
| 12 | 125A3 | 1213476 | g > a | Nonsynonymous | HP0250 | |
| 13 | 78A3 | 1213737 | g > t | Stop | HP0250 | |
| 14 | 87A3 | 1213789 | g > a | Nonsynonymous | HP0250 | |
| DNA metabolism | ||||||
| 15 | 12A3 | 815510 | t > c | Synonymous | HP0821 | |
| Protein synthesis | ||||||
| 16 | 29C8 | 1336072 | c > t | Nonsynonymous | HP0123 | |
| Regulatory functions | ||||||
| 17 | 103A4 | 1298936 | c > t | Nonsynonymous | HP0164 | |
| Unknown and hypothetical | ||||||
| 18 | 125C7 | 166082 | c > t | Nonsynonymous | NA | HP0953 |
| 19 | 29C8 | 332008 | a > g | Nonsynonymous | NA | HP1154 |
| 20 | 103C8 | 426225 | c > t | Nonsynonymous | NA | HP0394 |
| 21 | 103C8 | 1303279 | c > t | Nonsynonymous | HP0160 | |
| 22 | 48A2, 48C8 | 1331716 | c > t | Nonsynonymous | NA | HP0130 |
| 23 | 103C8 | 1560203 | g > a | Nonsynonymous | NA | HP1533 |
| Noncoding regions | ||||||
| 24 | 119A2 | 472706 | c > t | Intergenic | ||
| 25 | 125C7 | 925414 | c > t | Intergenic | ||
Position refers to the location of the SNPs in the reference genome of clone H1. HP no. refers to the reference strain H. pylori 26695. ORF, open reading frame; NA, not available.
FIG 2In vivo rearrangement in the reisolate 103C8 due to gene fusions and duplication of the katA gene. (A) Representation of the genomic context of katA in H1 and the duplication of katA and a fusion of duplicated fragments of the genes frpB and kapA in reisolate 103C8. (B) Transcription of the catalase gene in various H. pylori clones was measured by qPCR. katA transcript is shown as % relative to the H1 reference, which was set to 100%. The reisolate 103C8 with two katA copies showed significantly higher katA transcript amounts than H1. A mutant lacking the katA gene did not show transcript, as expected. All results were normalized to the respective 16S transcript amounts, also using strain H1 as reference. (C) Catalase activity was measured in bacterial lysates of strain H1, the reisolate 103C8, and H1ΔkatA (negative control) The reisolate 103C8, with two katA copies, displayed higher catalase activity than H1. The calculation of the catalase units/ml is defined as in the Megazyme catalase assay kit instructions. Test, one-way analysis of variance (ANOVA) (P < 0.05). (D) Survival experiment using paraquat (PQ) as oxidizing agent on live bacteria. Results are shown as relative survival values, where individual data sets were normalized to their respective bacterial counts obtained at time zero hours, which were set to 1. Strains H1 and 103C8 were treated with 10 μM PQ or left untreated (untr), and the number of colonies was counted 10 h postexposure. Strain H1 was less able to resist the oxidative stress, in contrast to reisolate 103C8, which was more resistant to PQ. Test, two-way ANOVA (P < 0.05). For panel B, dotted lines refer to the duplicated and fused regions from the kapA and frpB1 genes. For panels C and D, * = P < 0.05; ** = P < 0.01; ns, not significant.
FIG 3Graphical representation of the H. pylori oligopeptide transport (opp) gene cluster and genetic changes observed in BCS 100 and reisolates from infected volunteers. (A) Schematic representation of the opp gene clusters in E. coli and H. pylori. opp genes form a contiguous gene cluster in E. coli, while the four opp genes in H. pylori are located in two different loci. H. pylori does not have a homolog of oppF. (B) Graphical representation of the opp gene configurations found in strains whose genomes were sequenced in this study. Based on the nucleotide sequences, genes in blue are predicted to be truncated while genes in gray are likely to be active. Note that all reisolates and challenge strain clones H2 to H16 are predicted to express all four opp genes.
Methylated sequence motifs detected by SMRT sequencing in clone H1 of challenge strain BCS 100 and reisolates from human volunteers
| MTase | Modified | % motifs | No. of | H1# | 12A3 | 12C8 | 29C8 | 48C8 | 78A3 | 81A1 | 81C9 | 87C7 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| G | m6A | 100 | 9,890 | + | + | + | + | + | + | + | + | + |
| G | m6A | 100 | 5,202 | + | + | + | + | + | + | + | + | + |
| m6A | 100 | 2,412 | + | + | + | + | + | + | + | + | + | |
| GAAG | m6A | 100 | 4,446 | + | + | + | + | + | + | + | + | + |
| T | m4C | 99.98 | 4,446 | + | + | + | + | + | + | + | + | + |
| m6A | 100 | 3,732 | + | + | + | + | + | + | + | + | + | |
| C | m6A | 100 | 14,130 | + | + | + | + | + | + | + | + | + |
| m4C | 100 | 5,444 | + | + | + | + | + | + | + | + | + | |
| G | m6A | 100 | 572 | + | + | + | + | + | + | + | + | + |
| m6A | 100 | 1,449 | + | + | + | + | + | + | + | + | + | |
| m6A | 100 | 4,762 | + | + | + | + | + | + | + | + | + | |
| m6A | 100 | 530 | + | + | + | + | + | + | + | + | + | |
| m6A | 99.98 | 11,346 | + | + | + | + | + | + | + | + | + | |
| m4C | 99.94 | 3,358 | + | + | + | + | + | + | + | + | + | |
| G | m6A | 99.89 | 4,494 | + | + | + | + | + | + | + | + | + |
| CY | m6A | 99.87 | 3,712 | + | − | − | + | − | + | − | − | − |
| m6A | 99.53 | 856 | + | + | + | + | + | + | + | + | + | |
| G | m6A | 100 | 198 | + | + | + | + | + | + | + | + | + |
| m6A | 100 | 3,364 | + | + | + | + | + | + | + | + | + | |
| m6A | 99.95 | 6,357 | − | + | + | − | − | − | + | + | − | |
| *CG | m4C | − | + | − | + | − | + | − | − | − | ||
| **TGCAGA | m6A | − | − | − | + | − | − | − | − | − |
The % of motifs detected and the total number of motifs in the genome are based on the methylome of clone H1. The CCAAK motif quantitation is based on the 12C8 sequence, since the motif was not methylated in clone H1. + means methylation, − means absence of methylation, novel motifs are indicated in italic, and phase-variable MTases are shaded in gray. Modified bases are in bold. Underlined bases refer to the modified base in the complementary strand. *, the GCGC motif that cannot be reliably detected with SMRT sequencing. **, the motif TGCAGA was found only in 29C8 (62.78% of the motifs detected, 309 motifs within the genome), and it might probably be the motif TGCA. #, the following reisolates had methylation patterns identical to strain H1: 8A3, 8C10, 29A2, 48A2, 78C8, 87A3, 103A4, 103C8, 119A2, 119C10, 125A3, and 125C7.
FIG 4Circos plot displaying the distribution of methylated sequence motifs in H. pylori BCS 100 clone H1. Every circle of colored tracks represents one MTase target motif; the legend at the right side of the figure states the order of motifs represented by the 22 rings, from outer to inner circle. The motif CAAK, which is methylated in some reisolates, is not depicted, because it is not methylated in clone H1.