| Literature DB >> 35710940 |
George S Long1,2, Jennifer Klunk3,4,5, Ana T Duggan4,6, Madeline Tapson4,5, Valentina Giuffra7, Lavinia Gazzè8, Antonio Fornaciari9, Sebastian Duchene10, Gino Fornaciari9, Olivier Clermont11, Erick Denamur12,13, G Brian Golding3, Hendrik Poinar14,15,16.
Abstract
Escherichia coli - one of the most characterized bacteria and a major public health concern - remains invisible across the temporal landscape. Here, we present the meticulous reconstruction of the first ancient E. coli genome from a 16th century gallstone from an Italian mummy with chronic cholecystitis. We isolated ancient DNA and reconstructed the ancient E. coli genome. It consisted of one chromosome of 4446 genes and two putative plasmids with 52 genes. The E. coli strain belonged to the phylogroup A and an exceptionally rare sequence type 4995. The type VI secretion system component genes appears to be horizontally acquired from Klebsiella aerogenes, however we could not identify any pathovar specific genes nor any acquired antibiotic resistances. A sepsis mouse assay showed that a closely related contemporary E. coli strain was avirulent. Our reconstruction of this ancient E. coli helps paint a more complete picture of the burden of opportunistic infections of the past.Entities:
Mesh:
Year: 2022 PMID: 35710940 PMCID: PMC9203756 DOI: 10.1038/s42003-022-03527-1
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Fig. 1Characterization of both the physical sample and the authentication of the sequenced data.
a Liver and gallbladder of Giovani d'Avalos. The gallstones can be seen in the red rectangle. Note its dark brown coloring. Scale bar represents 1 cm b Gallbladder with a thickened wall (a) and Rokitansky-Aschoff sinuses (b) (Hematoxylin-eosin, 3X and 250X). The scale bar for (a) represents 2 cm and 100 μm for (b). c Damage plots of the and ends of mapped reads for E.coli. Colors refer to the digest, not damage type. Mapdamage 2.0[52] was used to calculate the damage rates. d Fragment length distribution of deduplicated mapped reads from E. coli. A scale is used to emphasize the differences between the digests. A minimum length of 30 bp was required for a read to be kept.
Fig. 2Read coverage metrics for the E. coli pan-genome in comparison to other genomes.
a Distribution of mean gene coverages with a CV ≤1 for the ancient genome. The dashed line indicates the detection threshold at 10×. The black rectangle on the right indicates the area where genes with a high copy number (as defined by ). b Coverage plot for FSIS11816402. A window of 1% was used for illustration purposes. c Coverage plot for CP019906.1. A window of 0.1% was used for illustration purposes. The first track indicates the coverage with the red line illustrating the overall mean. The second track indicates the number of SNPs over the same window while the third is the GC content. d Gene coverage of the T6SS in K. aerogenes using a 100 bp window. Gene names are included when available.
Read coverage metrics for the E. coli pan-genome, FSIS11816402, K-12 MG1655, CP019906.1, and CP012732.1 references.
| Sequence | % Positions with coverage ≥1 × | Mean coverage |
|---|---|---|
| Pan-Genome | 97.93% | 28.46[28.31, 28.62] × |
| FSIS11816402 | 77.84% | 18.00[16.17, 19.83] × |
| K-12 MG1655 | 78.30% | 20.94[20.93, 20.95] × |
| CP019906.1 | 80.62% | 20.95[20.80, 21.10] × |
| CP012732.1 | 64.01% | 16.46[16.28, 16.64] × |
FSIS11816402 used the mean of means greater than 1 × as it was an assembly. The 95% confidence interval is listed in the square brackets.
Fig. 3Maximum likelihood SNP phylogeny of E. coli.
a The global phylogeny with bootstrap values and phylogenetic groups with E. coli EC42405 as the out-group. b Phylogeny of the reduced subgroup A0 as defined by the Clermont genotype (+ - - -)[61]. Tip points represent the sequence type of the strain. IAI1 was the out-group of the phylogeny. c Phylogeny of the ST4995 strains. Red rectangles represent the 95% confidence interval for the topology; labels indicate the median date of divergence. The evolutionary rate for the phylogeny was 2.555 × 10−6[1.567 × 10−6, 3.992 × 10−6] subs/site/year. CFSAN051544 was the out-group. The brown arrows indicate the position of the ancient genome.
SNP distances of seven strains from the ST4995 phylogeny.
| Genomes | CFSAN051544 | Ancient Ecoli | FSIS11816402 | ATCC 11229 | ESC_SA9272AA | ESC_DB2295AA | ESC_CA2237AA |
|---|---|---|---|---|---|---|---|
| CFSAN051544 | 0 | 771 | 869 | 920 | 971 | 1012 | 1024 |
| Ancient Ecoli | 771 | 0 | 419 | 503 | 554 | 596 | 607 |
| FSIS11816402 | 869 | 419 | 0 | 588 | 639 | 681 | 692 |
| ATCC 11229 | 920 | 503 | 588 | 0 | 215 | 219 | 230 |
| ESC_SA9272AA | 971 | 554 | 639 | 215 | 0 | 308 | 319 |
| ESC_DB2295AA | 1012 | 596 | 681 | 219 | 308 | 0 | 323 |
| ESC_CA2237AA | 1024 | 607 | 692 | 230 | 319 | 323 | 0 |
CFSAN051544 was the out-group of the phylogeny.
Fig. 4Principal coordinate analyses (PCoA) of gene presence/absence data.
The accessory genome (a, c) and identified virulence genes (b, d) were clustered using a binary distance. Phylogroups were identified using Clermont Typing[59] while pathovars were determined using available metadata from the Patric[49] database. The ancient genome is indicated by the black arrow. In both binary distances were calculated prior to create the PCoA.
Identified virulence factors which contained genes not found in K-12 MG1655.
| Gene family | Genes | Mean coverage | Function | Pathovar |
|---|---|---|---|---|
| 13 | 38.21[32.31, 44.10] × | Type VI Secretion System[ | EAEC[ | |
| 4 | 31.08[28.27, 33.89] × | Fimbriae[ | EAEC, EHEC, ExPEC[ | |
| 3 | 36.02[22.49, 49.54] × | Type VI Secretion System[ | EAEC, EHEC, ExPEC[ | |
| 2 | 23.01[5.01, 41.02] × | Fimbriae[ | EAEC, EHEC, ExPEC[ | |
| 2 | 51.62[49.80, 53.44] × | Type VI Secretion System[ | EAEC[ | |
| 2 | 47.53[46.14, 48.91] × | Type VI Secretion System[ | EAEC[ |
EAEC enteroaggregative E. coli, EHEC enterohemorrhagic E. coli, ExPEC extraintestinal pathogenic E.coli.
Only families which had more than one distinct gene are present.
AMR profile of the ancient E. coli.
| Resistances | With efflux | No efflux |
|---|---|---|
| fluoroquinolone antibiotic | 17 | 1 |
| penam | 13 | 1 |
| tetracycline antibiotic | 10 | 1 |
| cephalosporin | 6 | 1 |
| peptide antibiotic | 5 | 3 |
| phenicol antibiotic | 5 | 1 |
| cephamycin | 4 | 1 |
| glycylcycline | 4 | 1 |
| rifamycin antibiotic | 4 | 1 |
| triclosan | 4 | 1 |
| carbapenem | 2 | 1 |
| penem | 2 | 1 |
| monobactam | 1 | 1 |
Antibiotic results from an RGI[29] search of the E. coli scaffolds. A total of 29 genes were identified, with genes appearing multiple times. Only drug classes were detected at least once without the use of antibiotic effluxes. A single gene can be present in multiple drug classes.