| Literature DB >> 31439690 |
Yoko Arimizu1,2, Yumi Kirino3, Mitsuhiko P Sato1, Koichi Uno4, Toshio Sato4, Yasuhiro Gotoh1, Frédéric Auvray5, Hubert Brugere5, Eric Oswald5,6, Jacques G Mainil7, Kelly S Anklam8, Dörte Döpfer8, Shuji Yoshino9, Tadasuke Ooka10, Yasuhiro Tanizawa11, Yasukazu Nakamura11, Atsushi Iguchi12, Tomoko Morita-Ishihara13, Makoto Ohnishi13, Koichi Akashi2, Tetsuya Hayashi1, Yoshitoshi Ogura1.
Abstract
How pathogens evolve their virulence to humans in nature is a scientific issue of great medical and biological importance. Shiga toxin (Stx)-producing Escherichia coli (STEC) and enteropathogenic E. coli (EPEC) are the major foodborne pathogens that can cause hemolytic uremic syndrome and infantile diarrhea, respectively. The locus of enterocyte effacement (LEE)-encoded type 3 secretion system (T3SS) is the major virulence determinant of EPEC and is also possessed by major STEC lineages. Cattle are thought to be the primary reservoir of STEC and EPEC. However, genome sequences of bovine commensal E. coli are limited, and the emerging process of STEC and EPEC is largely unknown. Here, we performed a large-scale genomic comparison of bovine commensal E. coli with human commensal and clinical strains, including EPEC and STEC, at a global level. The analyses identified two distinct lineages, in which bovine and human commensal strains are enriched, respectively, and revealed that STEC and EPEC strains have emerged in multiple sublineages of the bovine-associated lineage. In addition to the bovine-associated lineage-specific genes, including fimbriae, capsule, and nutrition utilization genes, specific virulence gene communities have been accumulated in stx- and LEE-positive strains, respectively, with notable overlaps of community members. Functional associations of these genes probably confer benefits to these E. coli strains in inhabiting and/or adapting to the bovine intestinal environment and drive their evolution to highly virulent human pathogens under the bovine-adapted genetic background. Our data highlight the importance of large-scale genome sequencing of animal strains in the studies of zoonotic pathogens.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31439690 PMCID: PMC6724679 DOI: 10.1101/gr.249268.119
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Strains used in this study
Figure 1.Phylogenetic relationship between human and bovine commensal E. coli and distribution of stx- and LEE-positive isolates. (A) The NJ tree of 937 bovine and human commensal isolates with 36 completely sequenced reference strains based on the concatenated nucleotide sequences of seven housekeeping genes (adk, fumC, gyrB, icd, mdh, purA, and recA). The phylogeny was rooted using Escherichia fergusonii strain ATCC35469 and cryptic Escherichia clade 1 strain TW15838. Sources of sequences and phylogroups of strains are shown in the NJ tree. Pathotypes of reference strains are indicated by differently colored characters. (B) The core gene–based ML tree of 937 bovine and human commensal isolates with 197 human clinical isolates. Cryptic Escherichia clade I strain TW15838 was included as an outgroup. The tree was constructed based on 247,627 SNPs located on 1755 core genes. Country, BAPS group, phylogroup, ST, host and pathotype of each strain, and the presence of stx1, stx2, and LEE are shown. Positions of clinical isolates of major STEC serotypes are indicated.
Figure 2.Distribution of virulence genes among 937 bovine and human commensal E. coli strains. (A) The ML tree based on 262,788 SNP sites on 1958 core genes. The tree was rooted by cryptic Escherichia clade I strain TW15838. LEE-positive lineages/strains are highlighted by orange lines. Frequently observed serotypes (more than four strains) and major STEC serotypes are indicated. The presence of stx1, stx2, and LEE in each strain is shown. The presence of other STEC/EPEC virulence genes (B) and virulence genes associated with other E. coli pathotypes (C) is shown, respectively. For the functions and nucleotide sequences of each gene analyzed, see Supplemental Table S7.
Figure 3.Conservation of STEC/EPEC-related and other E. coli virulence genes in bovine and human commensal isolates. Conservation of the virulence genes of STEC/EPEC and those of other E. coli pathotypes in LEE-positive (n = 104) and LEE-negative (n = 833) strains (A), in stx-positive (n = 146) and stx-negative (n = 791) strains (B), and in the bovine-associated (n = 647) and human-associated (n = 290) lineages (C) are summarized. Statistical analyses were performed using the two-sided Fisher's exact test. Asterisks indicate that the differences were significant after the Bonferroni correction for multiple comparisons: (*) P < 0.05; (**) P < 1 × 10−6.
Figure 4.Co-occurrence network analysis of virulence genes. (A) Co-occurrence network analysis of virulence genes in bovine and human commensal isolates. Gene names are colored based on the virulence gene classification indicated in the box. The node sizes represent the number of strains in which each gene was conserved. Seven communities identified are indicated by differently colored edges (connections). Edge widths represent the number of co-occurrences. Only one co-occurrence between genes was excluded from the analysis. (B) Community members and their total numbers of co-occurrence within each community.
Figure 5.Bovine- or human-associated lineage-specific gene. (A) A scattered plot of gene conservation in the bovine- and human-associated lineages. Genes that were significantly (positively or negatively) associated with the bovine-associated lineage (Bonferroni P < 0.05) are indicated by blue dots. Among the positively associated (bovine-associated lineage-specific) and negatively associated (human-associated lineage-specific) genes, the top 50 genes with known or predictable functions (Bonferroni P < 1 × 10−135 and P < 1 × 10−141, respectively) are indicated in each group by purple and red dots, respectively. (B) Distribution of the bovine- or human-associated lineage-specific genes in the core gene–based ML tree (the same tree shown in Fig. 2). Presence and absence of each gene are indicated by purple and beige, respectively.