| Literature DB >> 35251780 |
Nicholas M Thomson1, Rachel Gilroy1, Maria Getino2,3, Ebenezer Foster-Nyarko1,4, Arnoud H M van Vliet3, Roberto M La Ragione3,5, Mark J Pallen1,3,6.
Abstract
The genus Escherichia has been extensively studied and it is known to encompass a range of commensal and pathogenic bacteria that primarily inhabit the gastrointestinal tracts of warm-blooded vertebrates. However, the presence of E. coli as a model organism and potential pathogen has diverted attention away from commensal strains and other species in the genus. To investigate the diversity of Escherichia in healthy chickens, we collected fecal samples from antibiotic-free Lohmann Brown layer hens and determined the genome sequences of 100 isolates, 81 of which were indistinguishable at the HC0 level of the Hierarchical Clustering of Core Genome Multi-Locus Sequence Typing scheme. Despite initial selection on CHROMagar Orientation medium, which is considered selective for E. coli, in silico phylotyping and core genome single nucleotide polymorphism analysis revealed the presence of at least one representative of all major clades of Escherichia, except for E. albertii, Shigella, and E. coli phylogroup B2 and cryptic clade I. The most frequent phylogenomic groups were E. coli phylogroups A and B1 and E. ruysiae (clades III and IV). We compiled a collection of reference strains isolated from avian sources (predominantly chicken), representing every Escherichia phylogroup and species, and used it to confirm the phylogeny and diversity of our isolates. Overall, the isolates carried low numbers of the virulence and antibiotic resistance genes typically seen in avian pathogenic E. coli. Notably, the clades not recovered are ones that have been most strongly associated with virulence by other studies.Entities:
Keywords: Chickens; Commensal; Cryptic clades; Escherichia; Genomic diversity; Phylogenomics
Year: 2022 PMID: 35251780 PMCID: PMC8896058 DOI: 10.7717/peerj.12935
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Summary of the major stages of sample collection, processing and data analysis and the primary programs and databases used.
Collection dates, age of birds and number of isolates recovered for each fecal sample.
| Faecal sample | Date collected | Age of birds (weeks) | GTG5 unique isolates | HC0 unique isolates | Phylogroups |
|---|---|---|---|---|---|
| 1 | 09/07/2018 | 84 | 5 | 5 | A, B1 |
| 2 | 09/07/2018 | 84 | 6 | 5 | A, B1, (II) |
| 3 | 09/07/2018 | 84 | 6 | 6 | A, B1, C |
| 4 | 09/07/2018 | 84 | 6 | 3 | A, Ef |
| 5 | 09/07/2018 | 84 | 6 | 4 | A, D, (V) |
| 6 | 13/08/2018 | 89 | 7 | 7 | A, B1, F, Ef, (III), (V) |
| 7 | 13/08/2018 | 89 | 5 | 4 | A, B1 |
| 8 | 13/08/2018 | 89 | 6 | 5 | A, Ef, (III), (V) |
| 9 | 13/08/2018 | 89 | 6 | 6 | A, B1, (V) |
| 10 | 13/08/2018 | 89 | 5 | 3 | B1, (III), (IV) |
| 11 | 10/09/2018 | 93 | 6 | 6 | A, B1, E, Ef, (III) |
| 12 | 10/09/2018 | 93 | 7 | 5 | A, B1, E, Ef |
| 13 | 10/09/2018 | 93 | 7 | 5 | A, B1, (III) |
| 14 | 10/09/2018 | 93 | 7 | 4 | A, (IV), (V) |
| 15 | 10/09/2018 | 93 | 6 | 4 | A, (III), (V) |
| 16 | 08/10/2018 | 97 | 1 | 1 | B1 |
| 17 | 08/10/2018 | 97 | 1 | 1 | (IV) |
| 18 | 08/10/2018 | 97 | 1 | 1 | (III) |
| 19 | 08/10/2018 | 97 | 1 | 1 | A |
| 20 | 08/10/2018 | 97 | 1 | 1 | D |
| 21 | 12/11/2018 | 102 | 1 | 1 | (V) |
| 22 | 12/11/2018 | 102 | 1 | 1 | D |
| 23 | 12/11/2018 | 102 | 1 | 1 | B1 |
| 24 | 12/11/2018 | 102 | 1 | 1 | A |
| 100 | 81 |
Note:
Following culturing on MacConkey agar and CHROMagar Orientation medium, 100 presumptive E. coli were isolated. Multiple colonies per fecal sample were screened by GTG5 PCR to reduce the collection of identical strains. Whole-genome sequence analysis revealed the large diversity of Escherichia present despite apparently selecting for E. coli, although only 81 of the initial strains were distinguishable at the HC0 level using HierCC clustering (see Materials and Methods). Phylogroups of E. coli are referred to by individual letters; cryptic clades are referred to by Roman numerals in parentheses; Ef, E. fergusonii.
Clustering of isolates by HierCC level HC0 and selection of representative isolate for each cluster.
| HC0 cluster | MLST | Isolates | Core SNP distance | Representative isolate |
|---|---|---|---|---|
| 148525 | 5643 | 021, 043 | 124 | 021 |
| 148529 | 155 | 007, 037 | 21 | 037 |
| 148537 | 48 | 024, 032 | 11 | 032 |
| 148541 | 1112 | 026, 067 | 6 | 067 |
| 148542 | 1638 | 006, 055 | 11 | 006 |
| 148547 | 5573 | 025, 041, 057, 073, 087 | See | 025 |
| 148548 | 1276 | 034, 089 | 18 | 034 |
| 148555 | 48 | 018, 042 | 20 | 042 |
| 148556 | 7059 | 045, 085 | 55 | 045 |
| 148564 | 1844 | 019, 050, 078 | See | 050 |
| 148571 | 6540 | 063, 080 | 257 | 063 |
| 148574 | 2456 | 068, 069, 071 | See | 068 |
| 148603 | 48 | 004, 081 | 2 | 004 |
| 148607 | 11513 | 020, 058 | 143 | 058 |
SNP distance matrices for HC0 clusters of >2 isolates.
| SNP distance matrix for Cluster HC0_148547 | ||||
|---|---|---|---|---|
|
| 041 | 057 | 073 | 087 |
| 025 | 9 | 9 | 9 | 4 |
| 041 | 0 | 5 | 11 | 11 |
| 057 | 0 | 13 | 8 | |
| 073 | 0 | 10 | ||
|
| ||||
| 050 | 078 | |||
| 019 | 16 | 15 | ||
| 050 | 0 | 14 | ||
|
| ||||
| 069 | 071 | |||
| 068 | 8 | 14 | ||
| 069 | 0 | 10 | ||
Note:
Table 2 provides SNP distances for 11 HC0 clusters comprising pairs of isolates. The remaining three clusters contained >2 isolates, requiring a matrix for full pairwise comparison.
Figure 2Core SNP maximum likelihood tree of the final 81 Escherichia isolates, including reference sequences for every Escherichia species and all phylogroups of E. coli.
Names of our isolates and their reference strains are coloured according to their species/phylogroup. Names of reference strains for clades not found in our samples are coloured black. Reference strains are named with their species and strain name, with phylogroups given in parentheses. Salmonella bongori and Salmonella. enterica ser. Typhi are included as outgroups.
Figure 3Anvi’o representation of the Escherichia pangenome showing 81 Escherichia isolates recovered from chicken faeces and 33 publicly available reference Escherichia isolates from poultry species.
Each layer represents a single genome, with black colouring signifying the presence of a gene cluster. Gene clusters are organised according to their distribution across the genome, with co-occurring genes shown closer together. The heatmap at the top-right of the image represents average nucleotide identity across all included genomes, with darker red colours indicating a higher percentage of average nucleotide identity. Assigned phylogroup is shown by the colour bar on the right of the image. Singleton gene clusters (present only in one genome; n = 3,641) are highlighted in blue while core gene clusters (present in all genomes; n = 2,449) are shown in green.
Figure 4Summary of the plasmid replicon contents of 81 Escherichia isolates.
(A) Number of isolates containing 0–7 distinct plasmid replicons from the PlasmidFinder database. (B) Frequency of each plasmid replicon identified from the PlasmidFinder database. Some isolates contained >1 of the same replicon type (see main text).
Figure 5Presence (solid squares) and absence (open squares) of predicted antibiotic resistance genes, virulence-associated genes and plasmid replicons in the final 81 isolates.
Isolates are arranged according to the phylogenetic tree shown in Fig. 2 with their names coloured according to phylogenetic group. Ew, E. whittamii; Em, E. marmotae; Er, E. ruysiae; Ef, E. fergusonii; A–F, E. coli phylogroups. Virulence-associated genes shown here are from the custom panel of APEC-associated genes. Members of the panel not represented here were not present in any isolates. Plasmid replicon and predicted antibiotic resistance gene data have been condensed to show major classes. Therefore, presence indicates ≥1 members of that class were detected. Full gene and plasmid detection data are available in Tables S5–S7.
Members of the panel of virulence-associated genes used to predict virulence potential of isolates in this study.
| Gene | Function | Protein | N° isolates | Ref. |
|---|---|---|---|---|
|
| Adhesion | Afimbrial adhesin | 0 | ( |
|
| Type 1 D-mannose specific adhesin | 70 | ( | |
|
| P fimbrial adhesin (type F11) | 0 | ( | |
|
| Outer membrane usher | 0 | ( | |
|
| P fimbrial tip adhesin | 0 | ( | |
|
| S fimbrial adhesin | 0 | ( | |
|
| Host evasion | Increased serum survival lipoprotein | 30 | ( |
|
| Polysialic acid transport protein | 15 | ( | |
|
| Outer membrane protease | 32 | ( | |
|
| Complement resistance protein | 19 | ( | |
|
| Iron acquisition | TonB-dependent heme receptor | 24 | ( |
|
| Siderophore yersiniabactin receptor | 9 | ( | |
|
| Iron-regulated outer membrane virulence protein | 1 | ( | |
|
| Siderophore salmochelin receptor | 8 | ( | |
|
| Yersiniabactin biosynthetic protein | 8 | ( | |
|
| L-lysine N6-monooxygenase (aerobactin synthesis) | 3 | ( | |
|
| Siderophore aerobactin receptor | 3 | ( | |
|
| Fe/Mn ABC transporter substrate binding | 26 | ( | |
|
| Secretion | type IVB secretion system protein | 23 | ( |
|
| Sugar transport | PTS fructose transporter subunit IIC | 1 | ( |
|
| Toxin | Heat-stable enterotoxin EAST1 | 15 | ( |
|
| SDR Family oxidoreductase | 8 | ( | |
|
| Temperature-sensitive haemagglutinin | 0 | ( | |
|
| Vacuolated autotransporter toxin | 0 | ( |