| Literature DB >> 32669379 |
Anna Maria Malberg Tetzschner1, James R Johnson2,3, Brian D Johnston3, Ole Lund1, Flemming Scheutz4.
Abstract
Extraintestinal pathogenic Escherichia coli (ExPEC) is the leading cause in humans of urinary tract infection and bacteremia. The previously published web tool VirulenceFinder (http://cge.cbs.dtu.dk/services/VirulenceFinder/) uses whole-genome sequencing (WGS) data for in silico characterization of E. coli isolates and enables researchers and clinical health personnel to quickly extract and interpret virulence-relevant information from WGS data. In this study, 38 ExPEC-associated virulence genes were added to the existing E. coli VirulenceFinder database. In total, 14,441 alleles were downloaded. A total of 1,890 distinct alleles were added to the database after removal of redundant sequences and analysis of the remaining alleles for open reading frames (ORFs). The database now contains 139 genes-of which 44 are related to ExPEC-and 2,826 corresponding alleles. Construction of the database included validation against 27 primer pairs from previous studies, a search for serotype-specific P fimbriae papA alleles, and a BLASTn confirmation of seven genes (etsC, iucC, kpsE, neuC, sitA, tcpC, and terC) not covered by the primers. The augmented database was evaluated using (i) a panel of nine control strains and (ii) 288 human-source E. coli strains classified by PCR as ExPEC and non-ExPEC. We observed very high concordance (average, 93.4%) between PCR and WGS findings, but WGS identified more alleles. In conclusion, the addition of 38 ExPEC-associated genes and the associated alleles to the E. coli VirulenceFinder database allows for a more complete characterization of E. coli isolates based on WGS data, which has become increasingly important considering the plasticity of the E. coli genome.Entities:
Keywords: ExPEC; in silicozzm321990; virulence typing; whole-genome sequencing
Mesh:
Substances:
Year: 2020 PMID: 32669379 PMCID: PMC7512150 DOI: 10.1128/JCM.01269-20
Source DB: PubMed Journal: J Clin Microbiol ISSN: 0095-1137 Impact factor: 5.948
Gene content of the ExPEC database downloaded from NCBI and added to the VirulenceFinder database
| Gene | Description | No. from NCBI | No. in database | Identity (%) |
|---|---|---|---|---|
| Transcriptional regulator | 14 | 4 | 68 | |
| Periplasmic chaperone | 12 | 3 | 69 | |
| Outer membrane usher protein | 13 | 11 | 69 | |
| Afimbrial adhesion | 62 | 37 | 46 | |
| Adhesin protein | 42 | 30 | 43 | |
| Colicin E1 | 132 | 23 | 63 | |
| Outer membrane hemin receptor | 423 | 79 | 75 | |
| Colicin Ia | 259 | 37 | 58 | |
| Colicin Ib | 24 | 6 | 98 | |
| Hybrid nonribosomal peptide/polyketide megasynthase | 270 | 77 | 97 | |
| Microcin C | 166 | 10 | 90 | |
| Putative type I secretion outer membrane protein | 169 | 18 | 47 | |
| S fimbrial/F1C minor subunit | 710 | 2 | 99 | |
| S fimbrial/F1C minor subunit | 2 | |||
| F1C adhesion | 9 | 2 | 99.8 | |
| S fimbrial/F1C minor subunit | 5 | 1 | ||
| Siderophore receptor | 465 | 98 | 97 | |
| Hemolysin F | 287 | 21 | 65 | |
| Heat-resistant agglutinin | 132 | 11 | 89 | |
| Invasin of brain endothelial cells | 369 | 66 | 97 | |
| High-molecular-weight protein 2 nonribosomal peptide synthetase | 1,033 | 346 | 97 | |
| Aerobactin synthetase | 335 | 47 | 95 | |
| Ferric aerobactin receptor | 350 | 71 | 70 | |
| Capsule polysaccharide export inner membrane protein | 54 | 21 | 49 | |
| Polysialic acid transport protein | 94 | 82 | 47 | |
| Bacteriocin microcin B17 | 949 | 2 | 98 | |
| Polysialic acid capsule biosynthesis protein | 961 | 68 | 45 | |
| Outer membrane protease (protein protease 7) | 3,564 | 314 | 66 | |
| Major pilin subunit | 116 | 42 | 54 | |
| Outer membrane usher P fimbriae | 786 | 40 | 50 | |
| S fimbrial/F1C minor subunit | 18 | 10 | 98 | |
| S fimbrial/F1C minor subunit | 3 | 1 | 98 | |
| Sialic acid-binding adhesion | 54 | 1 | 99.8 | |
| Iron transport protein | 369 | 56 | 91 | |
| Tir domain-containing protein | 24 | 3 | 99 | |
| Tellurium ion resistance protein | 126 | 25 | 52 | |
| Outer membrane protein complement resistance | 1,386 | 200 | 40 | |
| Uropathogenic-specific protein | 19 | 6 | 91 | |
| Fimbrial protein | 768 | 14 | 87.8 | |
| Total | 14,441 | 1,890 |
The three afaB genes are more than 67% identical to the five already included nfaE alleles in the original VirulenceFinder database.
Two sfaE and focC alleles were 100% identical and are called focC/sfaE in the database.
Including one fteA (F10), two feiA (F8), two fsiA (F16) one ffiA (F15), and one ffoA (F14).
Identity was below 60% for two new papA alleles (see Text S1 in the supplemental material).
Identity to the already included allele in the original VirulenceFinder database.
Comparison of the typing results for the nine control strains by PCR versus WGS typing by strain, serotype and ExPECJJ/UPECHM status
| Isolate no., serotype, ExPECjj/UPECHM status | Concordance, WGS and PCR | PCR only | WGS only | WGS additional genes |
|---|---|---|---|---|
| 11A, | ||||
| 2H16, | ||||
| 2H25, O18ac:K1:H7, ExPECjj/UPECHM | ||||
| 31A, ONT:H9, non-ExPECJJ/non-UPECHM | ||||
| 536, O6:K15:H31, ExPECjj/UPECHM | ||||
| J96, O4:K-:H5; F1C:F13, ExPECjj/UPECHM | ||||
| K-12, | ||||
| PM9, O9:K34:H-, ExPECJJ by WGS only/non-UPECHM | ||||
| V27, O2:K5:H1, ExPECjj/UPECHM |
These genes were sought by PCR.
These genes were not sought by PCR and only identified by use of the added ExPEC genes and alleles to the original VirulenceFinder database. See Table S1 for a complete list of genes.
This strain, isolated from calf diarrhea, is a typical ETEC strain and also positive for sta1 (heat-stabile enterotoxin ST-Ia).
This strain, isolated from urine, is a typical EAEC strain that is positive for aggR, aap, aar, aatA, pic, sat, sepA, and astA.
Strains were considered afa-draBC-positive by WGS if afaB or nfaE and afaC was present in the strain.
Strains were considered positive for papAH by WGS if papA was identified.
Strains were considered positive for the sfa-focDE operon by WGS if a combination of focC or sfaE and focI or sfaD was identified.
The strain K-12 was not subject to the PCR protocol used to identify ExPEC genes in the control strains. PCR findings were based on NCBI annotation and compared with the WGS findings in this study.
Concordance for the 288 evaluation strains by PCR versus WGS typing
| Virulence gene | No. of virulence genes identified by PCR, WGS, or both | ||
|---|---|---|---|
| Concordance, WGS and PCR (%) | PCR only | WGS only | |
| 249 (86.5) | 5 | 34 | |
| 265 (92.3) | 2 | 20 | |
| 277 (96.2) | 3 | 8 | |
| 281 (97.6) | 2 | 5 | |
| 276 (95.8) | 2 | 10 | |
| 275 (95.5) | 7 | 6 | |
| 287 (99.7) | 1 | 0 | |
| 271 (94.1) | 5 | 12 | |
| 282 (97.9) | 3 | 3 | |
| 275 (95.5) | 3 | 10 | |
| 266 (92.4) | 1 | 21 | |
| 275 (95.5) | 6 | 7 | |
| 288 (100.0) | 0 | 0 | |
| K1 | 271 (94.1) | 15 | 2 |
| K5 | 257 (88.9) | 1 | 30 |
| K15 | 288 (100.0) | 0 | 0 |
| 281 (97.6) | 0 | 7 | |
| 271 (94.1) | 8 | 9 | |
| 254 (88.2) | 7 | 27 | |
| 278 (96.5) | 3 | 7 | |
| 252 (87.5) | 3 | 33 | |
| 273 (94.8) | 2 | 13 | |
| Average | 272 (94.6) | ||
Concordance (%) is calculated based on agreement between PCR and WGS findings. If both methods identify or do not identify a gene, it is classified as concordance (see Table S3 in the supplemental material). PCR results are those after PCR was redone for 7 strains.