| Literature DB >> 25428893 |
Atsushi Iguchi1, Sunao Iyoda2, Taisei Kikuchi3, Yoshitoshi Ogura4, Keisuke Katsura5, Makoto Ohnishi2, Tetsuya Hayashi4, Nicholas R Thomson6.
Abstract
The O antigen constitutes the outermost part of the lipopolysaccharide layer in Gram-negative bacteria. The chemical composition and structure of the O antigen show high levels of variation even within a single species revealing itself as serological diversity. Here, we present a complete sequence set for the O-antigen biosynthesis gene clusters (O-AGCs) from all 184 recognized Escherichia coli O serogroups. By comparing these sequences, we identified 161 well-defined O-AGCs. Based on the wzx/wzy or wzm/wzt gene sequences, in addition to 145 singletons, 37 serogroups were placed into 16 groups. Furthermore, phylogenetic analysis of all the E. coli O-serogroup reference strains revealed that the nearly one-quarter of the 184 serogroups were found in the ST10 lineage, which may have a unique genetic background allowing a more successful exchange of O-AGCs. Our data provide a complete view of the genetic diversity of O-AGCs in E. coli showing a stronger association between host phylogenetic lineage and O-serogroup diversification than previously recognized. These data will be a valuable basis for developing a systematic molecular O-typing scheme that will allow traditional typing approaches to be linked to genomic exploration of E. coli diversity.Entities:
Keywords: E. coli; O serogroup; O-antigen biosynthesis gene cluster; genomic diversity; horizontal gene transfer
Mesh:
Substances:
Year: 2014 PMID: 25428893 PMCID: PMC4379981 DOI: 10.1093/dnares/dsu043
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.Sixteen Escherichia coli O-AGC groups identified in this study. Group members have different O serogroups in each group, but these share nearly identical or highly similar genetic organizations. Group names (Gp) are indicated at the left side. DNA sequence identities (%) between group members are indicated in each group.
Figure 2.Phylogenetic analysis of homologues of (A) Wzx and Wzy and (B) Wzm and Wzt from Escherichia coli O-serogroup reference strains based on the amino acid sequences. The group names are indicated outside of trees. The pair or groups of homologues with high DNA sequence identity (≥95%, mostly ≥97%) are indicated in red. The Wzx homologues of O96 and O170, which are indicated in blue and by an asterisk, showed 86% DNA sequence identity, but in all other proteins showed low-sequence homologies to each other (<70% identity). Note that while the DNA sequence identity between the wzx_O46 and wzx_O134 in Gp6 is 99.7%, the wzx_O46 has a 2-bp deletion at the 3′-region, causing a frame shift.
Figure 3.Correlation between the Escherichia coli evolutionary lineages and the distribution of O-AGCs. The phylogenetic tree was constructed based on the concatenated sequences of seven housekeeping genes from all 184 E. coli O-serogroup reference strains. The group names of O-AGCs (Gp1–Gp16) are indicated in the outermost region. Members in groups indicated in green were found to belong to the same or very closely related lineage, whereas members of the groups indicated in blue were found in distinct lineages. The outer circle next to the O serogroup names indicates the distribution of sugar synthesis gene sets identified in each O-AGC. The inner circle indicates the type of O-antigen processing system (wzx/wzy or wzm/wzt). Phylogenetic groups (A, B1, B2, D, and E) were determined by comparing the sequences of the strains tested with the known sequences from the ECOR collection (http://mlst.warwick.ac.uk/mlst/dbs/Ecoli).