| Literature DB >> 25516619 |
Sandra Nell, Lynn Kennemann1, Sandra Schwarz1, Christine Josenhans, Sebastian Suerbaum2.
Abstract
UNLABELLED: Helicobacter pylori undergoes rapid microevolution during chronic infection, but very little is known about how this affects host interaction factors. The best-studied adhesin of H. pylori is BabA, which mediates binding to the blood group antigen Lewis b [Le(b)]. To study the dynamics of Le(b) adherence during human infection, we analyzed paired H. pylori isolates obtained sequentially from chronically infected individuals. A complete loss or significant reduction of Le(b) binding was observed in strains from 5 out of 23 individuals, indicating that the Le(b) binding phenotype is quite stable during chronic human infection. Sequence comparisons of babA identified differences due to mutation and/or recombination in 12 out of 16 strain pairs analyzed. Most amino acid changes were found in the putative N-terminal extracellular adhesion domain. One strain pair that had changed from a Le(b) binding to a nonbinding phenotype was used to study the role of distinct sequence changes in Le(b) binding. By transformations of the nonbinding strain with a babA gene amplified from the binding strain, H. pylori strains with mosaic babA genes were generated. Recombinants were enriched for a gain of Le(b) binding by biopanning or for BabA expression on the bacterial surface by pulldown assay. With this approach, we identified several amino acid residues affecting the strength of Le(b) binding. Additionally, the data showed that the C terminus of BabA, which is predicted to encode an outer membrane β-barrel domain, plays an essential role in the biogenesis of this protein. IMPORTANCE: Helicobacter pylori causes a chronic infection of the human stomach that can lead to ulcers and cancer. The bacterium can bind to gastric epithelial cells with specialized outer membrane proteins. The best-studied protein is the BabA adhesin which binds to the Lewis b blood group antigen. Since H. pylori is a bacterium with very high genetic variability, we asked whether babA evolves during chronic infection and how mutations or recombination in babA affect binding. We found that BabA-mediated adherence was stable in most individuals but observed a complete loss of binding or reduced binding in 22% of individuals. One strain pair in which binding was lost was used to generate babA sequences that were mosaics of a functional allele and a nonfunctional allele, and the mosaic sequences were used to identify amino acids critically involved in binding of BabA to Lewis b.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25516619 PMCID: PMC4271554 DOI: 10.1128/mBio.02281-14
Source DB: PubMed Journal: mBio Impact factor: 7.867
Le(b) binding and bab genotyping of sequential H. pylori isolates
| Stain set no.[ | Strain[ | Le(b) | Stat. sign.[ | Protein expression | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| BabA | BabB | Locus | Locus | Locus | BabA length | |||||
| 1 | NQ267 | 4.9 ± 1.8 | ns | + | − | B | A | B | 2,253 | 750 |
| NQ1624 | 3.5 ± 0.8 | + | + | B | A | B | 2,241 | 746 | ||
| 2 | NQ315 | 3.3 ± 1.0 | ns | + | + | A | A | B | 2,229 | 742 |
| NQ1712 | 4.5 ± 2.5 | + | + | A | A | B | 2,229 | 742 | ||
| 3 | NQ331 | 2.3 ± 0.7 | ns | + | + | AB | A | B | 2,235 | 744 |
| NQ1832 | 4.6 ± 1.7 | + | + | AB | A | B | 2,235 | 744 | ||
| 4 | NQ352 | 6.9 ± 2.6 | + | − | A | B | A | 2,211 | 736 | |
| NQ1701 | 1.3 ± 0.7 | + | + | A | B | A | 2,208 | 735 | ||
| 5 | NQ366 | 3.7 ± 0.7 | + | + | AB | A | B | 2,247 | 748 | |
| NQ1790 | 2.1 ± 0.4 | + | + | AB | A | B | 2,247 | 748 | ||
| 6 | NQ367 | 4.2 ± 0.8 | ns/ | + | + | A | B | A | 2,250 | 749 |
| NQ1671 | 3.3 ± 1.3 | + | + | A | B | A | 2,250 | 749 | ||
| NQ4191[ | 0.9 ± 0.1 | − | + | B | B | |||||
| 7 | NQ372 | 1.8 ± 0.6 | ns | + | + | A | A | B | 2,232 | 743 |
| NQ1886 | 1.7 ± 0.7 | + | + | A | A | B | 2,232 | 743 | ||
| 8 | LSU1010-1[ | 0.8 ± 0.3 | ns | − | + | B | B | |||
| LSU1010-3[ | 0.9 ± 0.2 | − | + | B | B | |||||
| 9 | LSU1013-2 | 4.4 ± 0.7 | ns | + | + | A | AB | ND | ||
| LSU1013-6 | 5.6 ± 2.1 | + | + | A | AB | A | 2,235 | 744 | ||
| 10 | LSU1014-1 | 3.8 ± 1.0 | + | + | A | B | A | 2,235 | 744 | |
| LSU1014-6 | 0.7 ± 0.1 | − | + | A | AB | A | 2,214 | 737 | ||
| 11 | LSU1016-1 | 6.6 ± 0.5 | + | − | A | AB | A | 2,238 | 745 | |
| LSU1016-5 | 2.4 ± 0.5 | + | + | A | AB | A | 2,238 | 745 | ||
| 12 | LSU1021-6 | 2.9 ± 1.0 | ns | + | + | A | AB | A | 2,220 | 739 |
| LSU1021-7 | 4.4 ± 2.3 | + | + | A | AB | A | 2,220 | 739 | ||
| 13 | LSU1027-1 | 1.0 ± 0.1 | ns | − | + | A | AB | A | 759 | 252 |
| LSU1027-3 | 1.0 ± 0.1 | − | + | A | AB | A | 759 | 252 | ||
| 14 | LSU1037-1[ | 0.9 ± 0.2 | ns | − | + | B | B | |||
| LSU1037-5[ | 0.9 ± 0.1 | − | + | B | B | |||||
| 15 | LSU1040-1[ | 0.8 ± 0.1 | ns | − | + | B | B | |||
| LSU1040-6[ | 1.0 ± 0.1 | − | + | B | B | |||||
| 16 | LSU1054-1 | 3.1 ± 1.8 | ns | + | − | A | AB | A | 2,226 | 741 |
| LSU1054-5 | 1.1 ± 0.2 | + | + | A | AB | A | 2,226 | 741 | ||
| 17 | LSU1062-1 | 0.9 ± 0.2 | ns | + | + | A | B | A | 2,220 | 739 |
| LSU1062-3 | 0.9 ± 0.2 | + | − | A | B | A | 2,220 | 739 | ||
| 18 | LSU1067-1 | 2.5 ± 1.1 | ns | + | + | A | B | A | 2,247 | 748 |
| LSU1067-5 | 3.2 ± 1.2 | + | + | A | B | A | 2,247 | 748 | ||
| 19 | LSU1074-1 | 0.9 ± 0.2 | ns | − | − | A | B | A | 1,638 | 545 |
| LSU1074-4 | 0.7 ± 0.4 | − | − | A | B | A | 1,638 | 545 | ||
| 20 | LSU2002-1[ | 0.8 ± 0.2 | ns | − | + | B | B | |||
| LSU2002-8[ | 0.7 ± 0.2 | − | + | B | B | |||||
| 21 | LSU2003-1 | 4.0 ± 1.4 | ns | + | + | A | AB | ND | ||
| LSU2003-7 | 5.0 ± 1.8 | + | − | A | A | ND | ||||
| 22 | LSU3001-1 | 1.9 ± 0.6 | ns | + | − | A | B | 2,214 | 737 | |
| LSU3001-6 | 1.7 ± 0.6 | + | − | A | A | B | 2,217 | 738 | ||
| 23 | LSU3005-1[ | 0.9 ± 0.0 | ns | − | + | B | B | |||
| LSU3005-4[ | 1.0 ± 0.4 | − | + | B | B | |||||
Sequential isolates were obtained from 23 individuals.
The origin of H. pylori isolates is indicated as follows: NQ for the isolates from Colombia and LSU for isolates from Louisiana in the United States.
Values represent mean ± SD of Le(b)/BSA binding ratios determined in at least three independent experiments. Strains are considered nonadherent if the Le(b)/BSA ratio was ≤1.5.
Stat. sign., statistical significance. The values for Le(b) binding of sequential H. pylori isolates were compared by Welch’s t test. The P value of statistically significant differences is indicated. ns, no statistically significant difference.
A, babA; B, babB; AB, babA and babB.
babA genes at the indicated locus were completely sequenced. The length of the encoded BabA proteins (in amino acids [aa]) was deduced from the corresponding gene sequences. ND, no complete babA sequence was determined.
No babA gene was identified for this strain.
FIG 1 Distribution of bab genotypes. Frequency of babA and babB genes at the chromosomal loci A and B. None of the strains (n = 47) possessed babC in any of the three loci or had a bab gene in locus C. AB, babA in locus A and babB in locus B; BA, babB in locus A and babA in locus B; AA, babA in loci A and B; BB, babB in loci A and B; AB A, babA and babB in locus A and babA in locus B; A AB, babA in locus A and babA and babB in locus B; - A, no bab gene in locus A and babA in locus B.
FIG 2 babA sequence variation between sequential isolates. Pairwise alignments of babA nucleotide sequences and deduced BabA amino acid sequences were performed using MUSCLE (45). The polymorphic sites in each nucleotide alignment were determined using the program PSFIND and graphically displayed with the program HAPPLOT. Polymorphic sites are displayed by vertical lines. Corresponding plots for the protein alignments were generated manually. Blue vertical lines indicate differences between sequential isolates. The signal sequence (S) as well as the predicted domain structure based on Phyre2 analysis are indicated. aa, amino acids.
Comparison of babA sequences between pairs of sequential isolates
| Strain | Strain | Interval between isolates | BabA protein | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Length | Sequence | No. of | No. of SNPs | Length | No. of aa | Sequence | |||||
| 1 | NQ267 | 36 | 2,253 | 91.41 | 215 | 1 CNP | 35.5 | 139.5 | 750 | 70 | 92.94 |
| NQ1624 | 2,241 | 746 | |||||||||
| 2 | NQ315 | 36 | 2,229 | 99.91 | 3 | 1 CNP | 3 | 0 | 742 | 0 | 100 |
| NQ1712 | 2,229 | 742 | |||||||||
| 3 | NQ331 | 36 | 2,235 | 99.95 | 1 | 1 SNP | 0 | 1 | 744 | 1 | 99.86 |
| NQ1832 | 2,235 | 744 | |||||||||
| 4 | NQ352 | 36 | 2,211 | 97.01 | 71 | 3 CNPs | 46.75 | 21.25 | 736 | 19 | 98.09 |
| NQ1701 | 2,208 | 735 | |||||||||
| 5 | NQ366 | 36 | 2,247 | 99.73 | 6 | 1 CNP | 5 | 1 | 748 | 1 | 99.86 |
| NQ1790 | 2,247 | 748 | |||||||||
| 6 | NQ367 | 36 | 2,250 | 99.15 | 19 | 1 SNP | 14 | 5 | 749 | 4 | 99.59 |
| NQ1671 | 2,250 | 1 CNP | 749 | ||||||||
| 7 | NQ372 | 36 | 2,232 | 100 | 0 | 0 | 0 | 743 | 0 | 100 | |
| NQ1886 | 2,232 | 743 | |||||||||
| 8 | LSU1014-1 | 24 | 2,235 | 98.03 | 42 | 3 CNPs | 12 | 11 | 744 | 23 | 96.63 |
| LSU1014-6 | 2,214 | 737 | |||||||||
| 9 | LSU1016-1 | 12 | 2,238 | 99.91 | 2 | 1 CNP | 0 | 2 | 745 | 2 | 99.86 |
| LSU1016-5 | 2,238 | 745 | |||||||||
| 10 | LSU1021-6 | 12 | 2,220 | 100 | 0 | 0 | 0 | 739 | 0 | 100 | |
| LSU1021-7 | 2,220 | 739 | |||||||||
| 11 | LSU1027-1 | 3 | 759 | 100 | 0 | 0 | 0 | 252 | 0 | 100 | |
| LSU1027-3 | 759 | 252 | |||||||||
| 12 | LSU1054-1 | 12 | 2,226 | 99.86 | 3 | 1 CNP | 1 | 2 | 741 | 2 | 99.73 |
| LSU1054-5 | 2,226 | 741 | |||||||||
| 13 | LSU1062-1 | 3 | 2,220 | 99.95 | 1 | 1 SNP | 0 | 1 | 739 | 1 | 99.86 |
| LSU1062-3 | 2,220 | 739 | |||||||||
| 14 | LSU1067-1 | 12 | 2,247 | 98.26 | 40 | 2 CNPs | 24 | 16 | 748 | 13 | 98.52 |
| LSU1067-5 | 2,247 | 748 | |||||||||
| 15 | LSU1074-1 | 6 | 1,638 | 100 | 0 | 0 | 0 | 545 | 0 | 100 | |
| LSU1074-4 | 1,638 | 545 | |||||||||
| 16 | LSU3001-1 | 24 | 2,214 | 98.15 | 44 | 1 CNP | 15 | 19 | 737 | 17 | 98.1 |
| LSU3001-6 | 2,217 | 738 | |||||||||
babA sequences were obtained from 16 pairs of sequential isolates.
The number of polymorphic sites (PS) was determined with the program PSFind based on pairwise sequence alignments (MUSCLE) and includes substitutions, insertions, and deletions.
SNP, single nucleotide polymorphism; CNP, cluster of nucleotide polymorphisms (polymorphisms are separated by <200 bp).
The average number of synonymous sites (S) or nonsynonymous sites (N) was determined with the program MEGA5.
FIG 3 Sequence variability of BabA from BabA-expressing NQ and LSU strains. The Shannon diversity index (H) for each site in the multiple-sequence alignment of BabA (see Fig. S2 in the supplemental material) was determined using the Protein Variability Server (PVS) (24). H can range from 0 (the same amino acid is present at that position in all sequences) to 4.322 (all 20 amino acids are equally represented in that position). The predicted domain structure based on Phyre2 analysis is indicated on top of the figure.
FIG 4 Analysis of babA mosaic alleles. (A) Schematic representation of differences in the deduced BabA protein sequence of strain pair LSU1014-1/-6. The signal sequence (S) is indicated as well as the domain structure as predicted by Phyre2 analysis (44). BabA of strain LSU1014-6 differs in 11 amino acids and in addition contains a distinct and shortened C terminus due to a frameshift mutation. (B) Schematic representation of BabA sequences from the generated mosaic alleles based on strain pair LSU1014-1/-6. The numbers above the schematic representation of BabA in strain LSU1014-1 indicate the positions of amino acid changes. The length of the BabA protein (in amino acids [aa]) and result of BabA expression determined by Western blotting (WB) are indicated to the right of the schematic representations of the Bab sequences. (C) Le(b) binding was determined by ELISA and is represented as mean + SD from at least four independent experiments. All strains except LSU1014-6 and clone 135 had Le(b)/BSA ratios of >1.5. Le(b) binding of recombinant clone 1 was set at 100%, and statistical significance versus the value for clone 1 was determined by Welch’s t test. Statistical significance is indicated as follows: *, P ≤ 0.05; **, P ≤ 0.01; ***, P ≤ 0.001; ns, not statistically significant.
FIG 5 Phyre2 structure prediction of BabA. The BabA sequences of strains LSU1014-1 and LSU1014-6 and clone 135 were subjected to Phyre2 analysis for structure prediction (see Materials and Methods). (A and B) The models are colored showing the secondary structure (A) or confidence of the predicted structure (B), ranging from high (red) to low (blue). (C) Structural model generated with the BabA sequence of strain LSU1014-1 [Le(b) binding] with the amino acids that differed in strains LSU1014-1 and LSU1014-6 (not binding) indicated in orange. Additionally, the location of the distinct C terminus is colored in pink.