| Literature DB >> 23462115 |
Dagmara I Kisiela1, Sujay Chattopadhyay, Veronika Tchesnokova, Sandip Paul, Scott J Weissman, Irena Medenica, Steven Clegg, Evgeni V Sokurenko.
Abstract
UNLABELLED: Salmonella and Escherichia coli mannose-binding type 1 fimbriae exhibit highly similar receptor specificities, morphologies, and mechanisms of assembly but are nonorthologous in nature, i.e., not closely related evolutionarily. Their operons differ in chromosomal location, gene arrangement, and regulatory components. In the current study, we performed a comparative genetic and structural analysis of the major structural subunit, FimA, from Salmonella and E. coli and found that FimA pilins undergo diverse evolutionary adaptation in the different species. Whereas the E. coli fimA locus is characterized by high allelic diversity, frequent intragenic recombination, and horizontal movement, Salmonella fimA shows structural diversity that is more than 5-fold lower without strong evidence of gene shuffling or homologous recombination. In contrast to Salmonella FimA, the amino acid substitutions in the E. coli pilin heavily target the protein regions that are predicted to be exposed on the external surface of fimbriae. Altogether, our results suggest that E. coli, but not Salmonella, type 1 fimbriae display a high level of structural diversity consistent with a strong selection for antigenic variation under immune pressure. Thus, type 1 fimbriae in these closely related bacterial species appear to function in distinctly different physiological environments. IMPORTANCE: E. coli and Salmonella are enteric bacteria that are closely related from an evolutionary perspective. They are both notorious human pathogens, though with somewhat distinct ecologies and virulence mechanisms. Type 1 fimbriae are rod-shaped surface appendages found in most E. coli and Salmonella isolates. In both species, they mediate bacterial adhesion to mannose receptors on host cells and share essentially the same morphology and assembly mechanisms. Here we show that despite the strong resemblances in function and structure, they are exposed to very different natural selection environments. Sequence analysis indicates that E. coli, but not Salmonella, fimbriae are subjected to strong immune pressure, resulting in a high level of major fimbrial protein gene shuffling and interbacterial transfer. Thus, evolutionary analysis tools can provide evidence of divergent physiological roles of functionally similar traits in different bacterial species.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23462115 PMCID: PMC3604780 DOI: 10.1128/mBio.00625-12
Source DB: PubMed Journal: MBio Impact factor: 7.867
FIG 1 Genetic organization of fim operons in S. enterica serovar Typhimurium and E. coli K-12 and of the sfm operon of E. coli K-12. The fimA (sfmA) and fimH (sfmH) genes are shown in pink and yellow, respectively. The remaining structural fimbrial genes are shown in light gray. The genes involved in regulation are shown in dark gray and tRNA-Arg genes in green. The percent identity for corresponding DNA and protein sequences was determined by pairwise alignment (BioEdit). Chromosomal locations of the operons are designated by numbers below the designations of the neighboring genes of the operons (boxed). “PDE” indicates gene STM0551 encoding phosphodiesterase. The phage-related genes are marked by double slashes (//).
Nucleotide diversity of fimA and 3-locus MLST in S. enterica and E. coli
| Gene | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Subsp. I | Subsp. I–VI | Entire species | B2 group | |||||||||
| π (%) | dS | dN | π (%) | dS | dN | π (%) | dS | dN | π (%) | dS | dN | |
| 3-locus housekeeping genes[ | 1.4 ± 0.02 | 0.055 | 0.001 | 3.2 ± 0.08 | 0.132 | 0.004 | 1.7 ± 0.05 | 0.075 | 0.001 | 0.5 ± 0.02 | 0.019 | 0.001 |
| 1.4 ± 0.04 | 0.035 | 0.007 | 3.9 ± 0.12 | 0.125 | 0.015 | 7.9 ± 0.14 | 0.222 | 0.041 | 7.8 ± 0.14 | 0.215 | 0.041 | |
Numbers in parentheses represent numbers of analyzed sequences.
Data represent the results determined for concatenated sequences of internal (450- to 500-bp) fragments of the housekeeping genes from S. enterica (thrA, aroC, and hisD) and E. coli (adk, fumC, and gyrB) MLST schemes.
FIG 2 Maximum-likelihood DNA phylograms of concatenated 3-locus MLST and fimA sequences of S. enterica. The MLST (A) and fimA (B) trees were constructed based on an alignment of 66 sequences obtained for S. enterica subsp. I to VI. The colored boxes mark strain clades with identical sequence types (STs). The S. enterica subsp. I clade is boxed (gray dashed line). The scale at the bottom of the phylograms indicates phylogenetic distance and corresponds to a 1-nucleotide difference.
FIG 3 Maximum-likelihood DNA phylograms of concatenated 3-locus MLST and fimA sequences of E. coli. The MLST (A) and fimA (B) trees were constructed based on an alignment of 53 sequences obtained for E. coli. The colored boxes mark strain clades with identical sequence types (STs). Cross-connecting lines indicate corresponding MLST strain clades that carry different fimA haplotypes. The E. coli clade of strains with very low MLST sequence diversity (the major clade in the MLST tree) is boxed (gray dashed line). The scale at the bottom of the phylograms indicates phylogenetic distance and corresponds to a 1-nucleotide difference.
Nucleotide diversity of fim and sfm in publicly available fully assembled genome sequences of S. enterica and E. coli
| Gene | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Entire species | B2 group | ||||||||
| π (%) | dS | dN | π (%) | dS | dN | π (%) | dS | dN | |
| 3-locus | 1.3 ± 0.05 | 0.049 | 0.001 | 1.8 ± 0.05 | 0.077 | 0.001 | 0.5 ± 0.04 | 0.02 | 0.001 |
| 6-locus | 1.2 ± 0.04 | 0.047 | 0.001 | 2.7 ± 0.05 | 0.117 | 0.002 | 1.3 ± 0.07 | 0.053 | 0.002 |
| 1.2 ± 0.07 | 0.031 | 0.006 | 7.8 ± 0.21 | 0.217 | 0.041 | 8.6 ± 0.62 | 0.246 | 0.045 | |
| 1.2 ± 0.05 | 0.032 | 0.006 | 1.3 ± 0.03 | 0.042 | 0.004 | 1.1 ± 0.01 | 0.034 | 0.004 | |
| Not applicable | 0.7 ± 0.06 | 0.025 | 0.002 | Not available | |||||
| Not applicable | 1.5 ± 0.06 | 0.047 | 0.005 | Not available | |||||
Numbers in parentheses represent numbers of analyzed sequences.
Data represent the results determined for concatenated sequences of internal (450- to 500-bp) fragments of the housekeeping genes (thrA, aroC, and hisD for S. enterica and adk, fumC, and gyrB for E. coli).
Data represent the results determined for concatenated sequences of internal (450- to 500-bp) fragments of the housekeeping genes (thrA, aroC, hisD, adk, fumC, and gyrB for both S. enterica and E. coli).
Data represent the results for 29 sfmA sequences determined in 44 E. coli strains.
Data represent the results for 28 sfmC sequences determined in 44 E. coli strains.
FIG 4 Distribution of amino acid variability along FimA sequence alignments of S. enterica subsp. I and E. coli. The amino acid variability across the FimA sequence is represented by the Shannon entropy plot as determined by BioEdit software. The entropy values refer to the complexity in amino acid composition at the position in the sequence alignment, taking into account the numbers and frequencies of different amino acids observed for each position. Red bars indicate sites of hot spot mutations. For Salmonella and E. coli, 24 and 28 FimA structural variants were analyzed, respectively.
FIG 5 Intragenic recombination in E. coli and S. enterica subsp. I fimA as determined by SiScan and MaxChi tests. Combined Z score plots (showing values higher than the threshold score of 1.96, i.e., P < 0.05) obtained for 10 sets of randomly chosen triplet sequences of E. coli and Salmonella are shown. The sequence triplets with recombination signal are shown in bold colors (red, orange, green, blue, and black). The sequence triplets without recombination signal are shown in light colors. Dashed lines indicate frequent recombination breakpoints. Graphs presenting Z score plots for each sequence triplet separately are shown in Fig. S3A and B in the supplemental material. The colors used here correspond to those used in the supplemental figures for these triplets.
FIG 6 Primary, secondary, and tertiary structures of Salmonella and E. coli FimA. (A) Primary sequence alignment with predicted (Phyre2) and solved (PDB 2JTY) secondary motifs of S. Typhimurium (S. Thm) and E. coli FimA. Beta strands are indicated as yellow highlights; α helices are marked in light brown. Cysteine residues are shown in blue. The sequences of N-terminal extensions and added donor strands are in red, and sequences of glycine linkers used to construct the self-complemented subunits are in green. (B) Ribbon representation of the 3D structure of E. coli FimA (PDB 2JTY) and the predicted Salmonella FimA model. The orange spheres represent C-α atoms of cysteine residues. (C) Distribution of structural mutations in tertiary structures of E. coli and Salmonella FimA. The amino acids at variable sites are shown as green spheres.