| Literature DB >> 32719405 |
Saskia-Camille Flament-Simon1, María de Toro2, Liubov Chuprikova3, Miguel Blanco1, Juan Moreno-González4, Margarita Salas4, Jorge Blanco1, Modesto Redrejo-Rodríguez5,6.
Abstract
Self-synthesizing transposons are integrative mobile genetic elements (MGEs) that encode their own B-family DNA polymerase (PolB). Discovered a few years ago, they are proposed as key players in the evolution of several groups of DNA viruses and virus-host interaction machinery. Pipolins are the most recent addition to the group, are integrated in the genomes of bacteria from diverse phyla and also present as circular plasmids in mitochondria. Remarkably, pipolins-encoded PolBs are proficient DNA polymerases endowed with DNA priming capacity, hence the name, primer-independent PolB (piPolB). We have now surveyed the presence of pipolins in a collection of 2,238 human and animal pathogenic Escherichia coli strains and found that, although detected in only 25 positive isolates (1.1%), they are present in E. coli strains from a wide variety of pathotypes, serotypes, phylogenetic groups and sequence types. Overall, the pangenome of strains carrying pipolins is highly diverse, despite the fact that a considerable number of strains belong to only three clonal complexes (CC10, CC23 and CC32). Comparative analysis with a set of 67 additional pipolin-harboring genomes from GenBank database spanning strains from diverse origin, further confirmed these results. The genetic structure of pipolins shows great flexibility and variability, with the piPolB gene and the attachment sites being the only common features. Most pipolins contain one or more recombinases that would be involved in excision/integration of the element in the same conserved tRNA gene. This mobilization mechanism might explain the apparent incompatibility of pipolins with other integrative MGEs such as integrons. In addition, analysis of cophylogeny between pipolins and pipolin-harboring strains showed a lack of congruence between several pipolins and their host strains, in agreement with horizontal transfer between hosts. Overall, these results indicate that pipolins can serve as a vehicle for genetic transfer among circulating E. coli and possibly also among other pathogenic bacteria.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32719405 PMCID: PMC7385651 DOI: 10.1038/s41598-020-69356-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Pipolin identification survey among E. coli strains from diverse origins and pathotypes.
| Source | Infection | Strains | Pipolins (%) |
|---|---|---|---|
| Human | Intestinal | 608 | 8 (1.3%) |
| Human | Extraintestinal | 1,061 | 9 (0.8%) |
| Swine | Intestinal | 323 | 5 (1.5%) |
| Avian | Extraintestinal | 246 | 3 (1.2%) |
| Total | 2,238 | 25 (1.1%) | |
Features of the 25 pipolin-harboring E. coli genomes from LREC dataset.
| Name | PhyG1 | ST2 | CC | Pathotype | Virulence genes (PCR)3 | Virulence genes4 | Plasmids5 | Prophages6 | Complete Integrons5 | CRISPR/Cas (Type)6 |
|---|---|---|---|---|---|---|---|---|---|---|
| 3-373-03_S1_C2 | A | 5,293 | 206 | – | IncFII, IncB/O/K/Z, Col(BS512) | – | – | 9/3 (I–E) | ||
| LREC237 | D* | 524 | 32 | aEPEC | IncFIB/IncX1, ColRNAI, NT | 4P, 1 M, 1S, 2U | CALINP | 9/1 (I–E) | ||
| LREC239 | C | 88 | 23 | – | IncFIB/IncFIC, NT, Col(MG828) | 1P, 1S, 3 M, 1U | In0P | 12/1 (I–E) | ||
| LREC240 | B1 | 156 | 156 | APEC | IncFIB/IncFIC, NT | 1S | 2 CALINP | 5/1 (I–E) | ||
| LREC241 | A | 48 | 10 | ETEC | NT, Col(MGD2), NT | 2P, 2S, 1 M, 2U | – | 9/4 (I–E) | ||
| LREC242 | A | 746 | 10 | ETEC | IncFIB(K)/IncFIA, IncFIB/IncFIC/IncFII, NT, NT, NT, NT, Col8282, NT, NT | 1 M, 2S | – | 9/7 (I–E) | ||
| LREC243 | A | 3,011 | 10 | ETEC | IncX1/IncR, NT, NT | 2P, 1U | 3 CALINP, CP | 10/2 | ||
| LREC244 | A | 10 | 10 | STEC | IncFIB/IncR, NT, IncX4, NT, Col156, NT, NT | 4 M, 5P, 3 M/P, 1S, 3U | In0P | 7/3 | ||
| LREC245 | A | 10888 N | NONE | – | IncFIB, NT, IncX1, NT, Col8282, Col156, NT | 1 M, 1S | CALINP, In0P, CP | 10/5 (I–E) | ||
| LREC246 | C | 10889 N | 23 | – | IncFIB | 1P, 3 M, 2S | – | 6/1 (I–E) | ||
| LREC247 | D* | 137 | 32 | aEPEC | IncFIB, NT, NT, Col8282 | 3P, 1 M, 2S, 1S/P, 1U | – | 8/2 (I–E) | ||
| LREC248 | A | 10,850 | 10 | NTEC | IncFIB, Col8282, Col156, NT | 1P, 4 M, 5S, 1U | – | 8/0 | ||
| LREC249 | D* | 32 | 32 | aEPEC | IncFIB | 3P, 2 M, 3S, 2U | – | 7/2 (I–E, I–D) | ||
| LREC250 | D* | 137 | 32 | aEPEC | IncFIB, NT, Col(MG828) | 3P, 1S, 1U | – | 7/2 (I–E) | ||
| LREC251 | D* | 32 | 32 | STEC | IncFIB | 1P, 3 M, 2S, 1U | – | 7/1 (I–E) | ||
| LREC252 | A | 48 | 10 | – | – | – | – | 12/4 (I–E) | ||
| LREC253 | A | 347 | NONE | – | – | 4 M, 1U | – | 6/3 (I–E) | ||
| LREC254 | B1 | 359 | 101 | APEC | IncFIB, NT, IncX1, NT | 2 S/P | – | 0 | ||
| LREC255 | C | 88 | 23 | APEC | IncY, NT, NT, NT | 1P, 4S | – | 10/2 (I–E, I–D) | ||
| LREC256 | C | 88 | 23 | – | IncFIB | 1P, 4 M, 2S | 1 | 8/1 (I–E) | ||
| LREC257 | C | 88 | 23 | ExPEC/ APEC | IncFIB, NT, NT, Col(MG828) | 4 M, 1U | – | 9/4 (I–E, I–D) | ||
| LREC258 | A | 46 | 46 | APEC | IncFIB, IncFIB, NT, NT | 2S, 1S/P, 1U | 1 | 8/2 (I–E) | ||
| LREC259 | C | 10890 N | 23 | APEC | IncFIB NT, NT | 3 M, 1U | 1 | 9/2 (I–E) | ||
| LREC260 | A | 10 | 10 | – | IncY, NT, NT, Col440II, ColRNAI | 4 M, 1S, 1 M/P | – | 7/0 | ||
| LREC261 | A | 8,233 | NONE | – | IncFIB, NT, NT, Col440II | 2 M, 1S | – | 6/2 (I–E) | ||
| LREC262 | B1 | 1,049 | 155 | – | NT, NT | 1S | – | 3/1 (I–E) |
For reference, the strain 3-373-03_S1_C2 was included in the analysis. See Table S1 for more detailed analysis.
PhyG: phylogenetic groups, where ”*” indicates strains with discrepancies between the assignation obtained with the quadruplex PCR of Clermont et al. (2014)[14] and the in silico assignation using ClermonTyping tool, showing phylogroup E by PCR, but phylogroup D in silico.
New sequence types (ST) are indicated with N.
Virulence genes determined by conventional PCR as detailed in Methods.
Virulence genes identified with VirFinder database (P indicates plasmid location): astA, EAST-1 heat-stable toxin; cba, Colicin B; cdtB, Cytolethal distending toxin B; cif, Type III secreted effector; cma, Colicin M; cnf1, Cytotoxic necrotizing factor; eae, Intimin; ehxA, Enterohaemolysin; espA, Type III secretion system; espB, Secreted protein B; espF, Type III secretion system; espI, Serine protease autotransporters of Enterobacteriaceae; espJ, Prophage-encoded type III secretion system effector; espP, Extracellular serine protease plasmid-encoded; etpD, Type II secretion protein; fanA, Involved in biogenesis of K99/F5 fimbriae; fasA, Fimbrial 987P/F6 subunit; fedF, Fimbrial adhesin AC precursor; gad, Glutamate decarboxylase; iha, Adherence protein; ireA, Siderophore receptor; iroN, Enterobactin siderophore receptor protein; iss, Increased serum survival; katP, Plasmid-encoded catalase peroxidase; lpfA, Long polar fimbriae; ltcA, Heat-labile enterotoxin A subunit; mchB, Microcin H47 part of colicin H; mchC, MchC protein; mchF, ABC transporter protein MchF; nleA, Non-LEE encoded effector A; nleB, Non-LEE encoded effector B; nleC, Non-LEE-encoded effector C; saa, STEC autoagglutinating adhesin; sepA, Serine protease autotransporters of Enterobacteriaceae; sta1, Heat-stable enterotoxin ST-Ia; stx2A, Shiga toxin 2 subunit A; stx2B, Shiga toxin 2 subunit B; tccP, Tir cytoskeleton coupling protein; tir, Translocated intimin receptor protein; toxB, Toxin B; tsh, Temperature-sensitive hemagglutinin.
Plasmids are enumerated according to their compatibility group. NT, not typed.
Prophages (analyzed with Phigaro[56] and Phaster[57]): P, Podoviridae; M, Myoviridae; S, Siphoviridae; U, Unknown; a slash (/) indicate an ambiguous family assignment.
Integrons: Integrons were analyzed with IntegronFinder[56] as indicated in Materials and Methods. C, complete, Int0, integron lacking attC site, CALIN, integron lacking functional integrase gene. P indicates plasmid location.
Number of Crispr units and associated proteins (Cas), as well as the element type determined with CRISPRCasFinder[58] are indicated.
Figure 1Maximum-likelihood tree generated from the core-genome data of new pipolin-harboring strains. Strain names are colored according to the phylogenetic groups as indicated. Previously described pipolin-harboring isolate 3-373-03_S1_C2 was included as a reference. The best-fit model was GTR + F + R2 for all considered criteria in ModelFinder[62]. Scale bar indicates substitution rate per site. The main features are indicated on the right: source, isolation date, pathotype, sequence type (ST) and clonal complex (CC). New ST combinations assigned at Enterobase are indicated with N.
Figure 2Genetic structure of new pipolins from LREC collection. Predicted protein-coding genes are represented by arrows, indicating the direction of transcription and colored following Prokka annotation as indicated in the legend. The greyscale on the right reflects the percent of amino acid identity between pairs of sequences. The image was generated by EasyFig software and re-annotated pipolins sorted according to the hierarchical clustering of the gene presence/absence matrix. Names of pipolin carrying strains are colored according to phylogroups as in Fig. 1.
Figure 3Maximum-likelihood tree of the new piPolB genes from the LREC dataset. As indicated, strain names are colored based on the phylogenetic group of strains. Previously described pipolin-harboring isolate 3-373-03_S1_C2 was included as a reference. The best-fit model was GTR + F + R2 for all considered criteria by ModelFinder[62]. Scale bar indicates the substitution rate per site.
Figure 4Cophylogeny of pipolins and host strains. Tanglegram representation of maximum-likelihood comparative phylogenies of piPolB and host strains core genome as hallmark of pipolins. Modelfinder Best-fit models were K3Pu + F + R2 and GTR + F + R7, respectively. Compared phylogenies are also displayed in Figures S1 and S3, respectively. Links between pipolins and E. coli strains are colored based on the phylogenetic groups.