Robert A Moran1, Haiyang Liu2,3,4, Emma L Doughty1, Xiaoting Hua2,3,4, Elizabeth A Cummins1, Tomas Liveikis1, Alan McNally1, Zhihui Zhou2,3,4, Willem van Schaik1, Yunsong Yu2,3,4. 1. Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK. 2. Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310016, PR China. 3. Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province, Hangzhou, Zhejiang, 310016, PR China. 4. Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, 310016, PR China.
Sequencing reads and the complete sequences of the chromosomes and plasmids of
DETAB-E227 and
DETAB-B39 are available from NBCI BioSample accessions SAMN18498586 and SAMN18498587, respectively, under BioProject accession PRJNA716893. Complete sequences can be found in the GenBank nucleotide database under accession numbers CP073060-CP073061 and CP072526-CP072529. Supplementary files are available on Figshare (https://doi.org/10.6084/m9.figshare.19494596) [1] .species are particularly well-adapted for persistence in hospital environments where they pose a life-threatening infection risk to the most clinically vulnerable patients. Plasmids with the potential to transfer multiple antibiotic resistance determinants between
species are therefore concerning, but most are not well-characterized. This work sheds further light on the poorly understood mobile gene pool associated with
. We show here that GR13 plasmids carry a small set of core genes but have access to a highly diverse set of accessory segments that might provide fitness advantages under certain conditions. The complex evolutionary dynamics of GR13 plasmids appear to be driven by the exchange of dif modules and by the actions of a diverse population of insertion sequences. The novel dif modules characterized here emphasize the broader importance of these elements to the dissemination of accessory genes in
. This study has improved our understanding of the diversity and distribution of dif modules, plasmids that carry them, and how both disseminate in the continuum of
populations that link hospitals and the wider environment.
Introduction
is a genus of Gram-negative coccobacilli that are typically found in soils and moist environments but are also well-adapted for persistence in hospital settings [2].
is the most prominent pathogenic species and can cause human infections with high mortality rates, particularly given some strains exhibit extensive antibiotic resistance that severely compromises treatment [2, 3]. While less commonly reported, drug-resistant infections caused by other
species are an emerging threat [4-8].Plasmids in
are typed according to replication initiation gene (rep) identity and have been divided into rep groups (GRs) [9]. A recent review listed 33 rep groups [10], and we have since described an additional group, GR34 [11]. Plasmids carrying clinically significant antibiotic resistance genes (ARGs) have been reported in
[12-16] and in other
species [7, 8, 17–19], clearly indicating their important role in the emergence and transmission of antimicrobial resistance in this genus. Few plasmid groups have been the subject of comparative analyses, so how the remaining types evolve or are distributed, geographically and throughout the
genus, is poorly understood and their genetic structures remain largely undescribed.Some
plasmids carry multiple pairs of recombination sites that resemble chromosomal dif sites, which are targets for XerC and XerD tyrosine recombinases [20]. These have been called plasmid-dif (pdif) sites [12], and have been shown to be recognized by
XerC and XerD [21]. ARGs have been found in pdif-flanked structures called dif modules, including the carbapenemase genes bla
OXA-24 [22], bla
OXA-58 [23] and bla
OXA-72 [24], the tetracycline resistance gene tet(39) [12], the macrolide resistance genes msr(E)-mph(E) [12], the aminoglycoside resistance gene aacC2d and the sulphonamide resistance gene sul2 [11]. Identical ARG-containing dif modules have been found in multiple contexts and in different types of plasmids [12]. Further dif modules, including those carrying genes for chromium resistance, a serine recombinase, RND efflux system and multiple toxin–antitoxin systems have also been described [11, 12, 25, 26]. Given the apparent importance of dif modules to the evolution of some
plasmids, it is important to understand the breadth of genetic cargo they carry and which types of plasmids can interact with them.We recently described the GR34 family of plasmids that share a 10 kbp core segment but can grow to as large as 190 kbp through the acquisition of dif modules [11]. The exemplar GR34 plasmid, pDETAB2, is from an
isolated in an intensive care unit (ICU) in Hangzhou, China, and carries six ARGs in a series of dif modules [11]. Here, we report two GR13-type plasmids found in unrelated
isolated 1 and 2 months later in that same ICU, one cryptic and the other carrying ARG-containing dif modules identical to ones in pDETAB2. GR13 plasmids have not been studied in detail since the group was defined using
plasmid p3ABAYE as a reference in 2010 [9]. In order to contextualize the differences between them, we undertook a detailed comparative analysis of the ICU GR13 plasmids and 41 complete GR13 plasmid sequences from GenBank. This facilitated the first evaluation of the distribution, gene content, structures and evolutionary characteristics of the GR13 plasmid family.
Methods
Ethics
Ethical approval and informed consent were obtained by the Sir Run Run Shaw Hospital local ethics committee (approval number 20190802–1).
Bacterial isolation and antibiotic susceptibility testing
DETAB-E227 was isolated from a cleaning cart surface swab and DETAB-P39 from a patient rectal swab in Sir Run Run Shaw Hospital Intensive Care Unit in Hangzhou, China in 2019. Both samples were cultured on CHROMagar (CHROMagar, Paris, France) containing 2 mg l−1 meropenem at 37 °C for 24 h. Isolated colonies of presumptive
were sub-cultured on Mueller–Hinton agar (MHA) (Oxoid, Hampshire, UK) and incubated at 37 °C for 24 h. MICs for imipenem, meropenem, tobramycin, gentamicin, ciprofloxacin, levofloxacin, ceftazidime, colistin and tigecycline were determined using broth microdilution with results interpreted according to CLSI 2019 guidelines.
Plasmid transfer assays
DETAB-E227 was filter-mated with a rifampicin-resistant derivative of
ATCC 17978 or
strain XH1816 as described previously [27]. XH1816 is a colistin-resistant, meropenem-sensitive clinical
strain that was isolated from a human urine sample in 2011. Transconjugants were selected on MHA supplemented with rifampicin (50 µg ml−1) and meropenem (8 µg ml−1). The identity of transconjugants was confirmed through PFGE fingerprinting after digestion of genomic DNA with ApaI. Transconjugants were tested for the presence of pDETAB4 and pDETAB5 by PCR with primers that target the replication genes of each plasmid (Table S1). ATCC 17978 transconjugants containing pDETAB2 were mated with XH1816 as above, with transconjugants selected on MHA supplemented with colistin (2 µg ml−1) and meropenem (8 µg ml−1).
S1 nuclease digestion, PFGE and Southern blot
To confirm transfer had occurred, plasmids were visualized following S1 nuclease treatment via PFGE, and the locations of resistance genes were confirmed via Southern blot as described previously [28]. Briefly, genomic DNA was digested with S1 nuclease (TaKaRa, Kusatsu, Japan) at 37 °C for 20 min. Treated DNA was loaded on a 1 % agarose Gold gel and PFGE was performed at 14 °C for 18 h, with 6 V/cm and pulse times from 2.16 to 63.8 s using the Bio-Rad CHEF-Mapper XA machine (Bio-Rad, CA, USA). DNA was transferred to a positively charged nylon membrane (Millipore, Billerica, MA, USA) by the capillary method and hybridized with digoxigenin-labelled bla
OXA-58 and bla
NDM-1-specific probes with an NBT/BCIP colour detection kit (Roche, Mannheim, Germany) according to the manufacturer’s instructions. XbaI-treated genomic DNA from
H9812 was used as a size marker.
Whole-genome sequencing and analysis
Genomic DNA was extracted from
DETAB-E227 and DETAB-P39 using a Qiagen minikit (Qiagen, Hilden, Germany) in accordance with the manufacturer’s instructions. Whole-genome sequencing was performed using both the Illumina HiSeq (Illumina, San Diego, USA) and the Oxford Nanopore GridION (Nanopore, Oxford, UK) platforms (Tianke, Zhejiang, China). Nanopore reads were trimmed with Filtlong v0.2.0 (https://github.com/rrwick/Filtlong) under default settings targeting approximately 100-fold genome coverage and Illumina sequence reads were trimmed with Shovill v1.1.0 under default (https://github.com/tseemann/shovill). De novo assembly of the Illumina and Nanopore reads was performed using Unicycler v0.4.8 [29]. MLST with the Pasteur and Oxford schemes was performed using mlst (https://github.com/tseemann/mlst) [30, 31].
Plasmid characterization
For alignment and visualization, all plasmids were opened in the same orientation and at the same position 48 bp upstream of the GR13 rep gene. ARGs and rep genes were identified using ABRicate v0.8.13 (https://github.com/tseemann/abricate) with the ResFinder [32] and pAci (File S1) databases, respectively. Insertion sequences were identified using the ISFinder database [33]. To screen the entire plasmid collection, an offline version of the ISFinder nucleotide database was constructed from an available version from October 2020 (https://github.com/thanhleviet/ISfinder-sequences). The database was used with abricate, initially with a minimum nucleotide identity threshold of 80 % and coverage threshold of 90 % to identify putative novel IS. Representative sequences with greater than 90 % coverage and between 80 and 95% nucleotide identity were validated manually and those that represented novel IS (Table S2) were submitted to ISFinder, named, and added to the database. The resulting database was used with 95 % identity and coverage thresholds, and sequences identified were considered isoforms of the representative IS or putative IS in accordance with ISFinder’s directions for isoform identification. Gene Construction Kit (Textco Biosoftware, Raleigh, USA) was used to annotate and examine plasmid sequences.
Plasmid pangenome analysis
Plasmids were annotated with Prokka 1.14.6 [34], using reference protein sequences to standardise annotations. Reference sequences were obtained from the NCBI Identical Protein Groups resource by querying ‘Acinetobacter[Organism] AND (uniprot[filter] OR refseq[filter])’. As insertion sequences were analysed separately, lines matching ‘transposase’ or ‘product=IS’ were removed from gff annotation files. Pangenomes and a core-gene alignment were constructed from these annotations using Panaroo 0.1.0 [35], reducing contamination-removal processes using --mode relaxed --no_clean_edges --min_trailing_support 0 min_edge_support_sv 0 --trailing_recursive 0 to reflect the use of complete sequences of highly mosaic plasmids. Functional annotation based on the eggNog orthology database version 5.0.2 [36] was performed with emapper-2.1.6–43-gd6e6cdf [37] using Diamond version 2.0.13 [38] for protein sequence alignments.
Core-gene analysis
Plasmid rep gene sequences were extracted manually, then aligned using mafft version 7 [39] with the GNS-i iterative refinement method and additional parameters, --reorder --anysymbol --maxiterate 2 --retree 1 –globalpair. Low-confidence residues in the alignment were masked with GUIDANCE2 [40]. Phylogenies were constructed from the rep gene alignment using RaxML version 8.2.12 [41] and the GTRGAMMA model with automated bootstrapping.For investigation of core-gene recombination, blastn was used to query all plasmid sequences with parA, parB and tyr13 from reference plasmid p3ABAYE and identify their homologues. The resulting sequences were aligned with mafft as described above and a neighbour-joining phylogeny was constructed. BAPS was used to partition all core-gene phylogenies and the highest level of BAPS discrimination was used to define distinct core-gene variants.
Results
Carbapenem-resistant DETAB-E227 carries a multidrug resistance GR13-type plasmid
DETAB-E227 was resistant to imipenem, meropenem, ceftazidime, gentamicin, tobramycin and ciprofloxacin, but susceptible to colistin and tigecycline (Table S3). The complete genome of DETAB-E227 includes a 3 749 086 bp chromosome and three plasmids, pDETAB4, pDETAB5 and pDETAB6 (Table 1). DETAB-E227 is a novel sequence type according to the Pasteur (STIP1554: cpn60-3, fusA-3, gtlA-2, pyrG-79, recA-3, rplB-4, rpoB-4) and Oxford (STOX2210: cpn60-1, gdhB-208, gltA-1, gpi-171, gyrB-231, recA-1, rpoD-153) MLST schemes.
*The DETAB-E227 chromosome contains the intrinsic genes oxaAb (OXA-24) and ampC (96.6 % identical to bla
ADC-25).
†The DETAB-P39 chromosome contains the intrinsic genes oxaAb (OXA-88) and ampC (95.9 % identical to bla
ADC-25).
DETAB-E227 and DETAB-P39 genome characteristicsIsolateDNA elementReplicon typeSize (bp)% GCAntibiotic resistance genesDETAB-E227chromosome–3 749 08639.1–*pDETAB4GR24113 68242.0sul2, tet(B)pDETAB5GR1397 03541.5bla
OXA-58, bla
NDM-1, ble
MBL, sul2, aacC2d, msr(E)-mph(E)pDETAB6Aci3714532.0–DETAB-P39chromosome–3 877 09338.9–†pDETAB13GR1391 08339.9–*The DETAB-E227 chromosome contains the intrinsic genes oxaAb (OXA-24) and ampC (96.6 % identical to bla
ADC-25).†The DETAB-P39 chromosome contains the intrinsic genes oxaAb (OXA-88) and ampC (95.9 % identical to bla
ADC-25).Nine antibiotic resistance genes were found in the DETAB-E227 genome (Table 1). Two of these, bla
ADC-25 and bla
OXA-424, are the native ampC and oxaAb β-lactamase genes found in the chromosome. The sul2 and tet(B) genes are in the 113 682 bp GR24-type plasmid pDETAB4 and the remaining resistance genes, bla
NDM-1, bla
OXA-58, ble
MBL, aacC2d [also called aac(3’)-IId], msr(E)-mph(E) and a second copy of sul2 are in the 97 035 bp GR13-type plasmid pDETAB5 (Table 1, Fig. 1a).
Fig. 1.
GR13 plasmid pDETAB5. (a) Circular map of pDETAB5 drawn from GenBank accession CP072528. Plasmid sequence is shown as a black line, with arrows inside representing ORFs. Coloured boxes represent IS. Black lines marked ‘C/D’ or ‘D/C’ represent pdif sites and indicate their orientations. Arrows filled with blue and labelled rep, parAB and tyr13 represent the GR13 genes of pDETAB5. (b) Linear maps of pDETAB2, pDETAB5 and pDETAB13, drawn to scale from GenBank accessions CP047975, CP072528 and CP073061. Near-identical sequences in pDETAB2 and pDETAB5 are bridged by grey shading and homologous regions of pDETAB5 and pDETAB13 are marked by lines labelled ‘a’ to ‘h’. IS are shown as coloured, labelled boxes and the locations of ARGs are indicated.
GR13 plasmid pDETAB5. (a) Circular map of pDETAB5 drawn from GenBank accession CP072528. Plasmid sequence is shown as a black line, with arrows inside representing ORFs. Coloured boxes represent IS. Black lines marked ‘C/D’ or ‘D/C’ represent pdif sites and indicate their orientations. Arrows filled with blue and labelled rep, parAB and tyr13 represent the GR13 genes of pDETAB5. (b) Linear maps of pDETAB2, pDETAB5 and pDETAB13, drawn to scale from GenBank accessions CP047975, CP072528 and CP073061. Near-identical sequences in pDETAB2 and pDETAB5 are bridged by grey shading and homologous regions of pDETAB5 and pDETAB13 are marked by lines labelled ‘a’ to ‘h’. IS are shown as coloured, labelled boxes and the locations of ARGs are indicated.In three independent conjugation experiments, pDETAB5 transferred from DETAB-E227 to
ATCC 17978 at a mean frequency of 6.96×10−7 transconjugants per donor (Table S4). The presence of pDETAB5 in ATCC 17978 transconjugants was confirmed using S1-PFGE, Southern blotting targeting the bla
NDM-1 and bla
OXA-58 genes, and PCR targeting the pDETAB5 rep gene (Fig. S1). pDETAB4 was not detected in transconjugants and pDETAB6 was not tested for. pDETAB5 did not transfer from DETAB-E227 to
XH1816 or from ATCC 17978 to XH1816 in three independent experiments.
pDETAB5 resembles the GR34 plasmid pDETAB2
The combination of ARGs in pDETAB5 resembles that in the GR34 plasmid pDETAB2, found in a STIP138
isolated in the same ICU 1 month prior to DETAB-E227 [11]. pDETAB5 contains 14 pdif sites (Fig. 1a) and pDETAB2 contains 16. Alignment of pDETAB5 and pDETAB2 reveals that approximately 63 kbp of the pDETAB2 sequence is present in pDETAB5 (Fig. 1b). The sequence they share includes multiple dif modules but not the region of pDETAB2 that has been identified as core to GR34 plasmids [11]. The aacC2d, bla
OXA-58 and msr(E)-mph(E)-containing dif modules in pDETAB2 and pDETAB5 are identical and their sul2-containing modules differ only through a 132 bp deletion in the copy of IS1006 in pDETAB2. Other dif modules shared by the plasmids encode HigAB-like and AbkAB-like toxin–antitoxins, a putative serine recombinase and a putative RND efflux system.The bla
NDM-1 and ble
MBL genes in pDETAB5 and pDETAB2 are in different contexts. In pDETAB2, the bla
NDM-1 and ble
MBL genes are in a complete copy of Tn125 inserted in a 696 bp dif module that contains putative toxin–antitoxin genes [11]. This module is uninterrupted in pDETAB5 (Fig. 1a) and instead, the bla
NDM-1 and ble
MBL genes are in a partial copy of Tn125 that retains one copy of ISAba125 and 3062 bp of the passenger segment (labelled red line in Fig. 1b). This indicates that, despite sharing a collection of dif modules that must have a common origin, pDETAB2 and pDETAB5 acquired bla
NDM-1 independently in distinct events.
pDETAB13 of carbapenem-sensitive DETAB-P39 is only distantly related to pDETAB5
The complete genome of DETAB-P39 includes a 3 877 093 bp chromosome and the 91 083 bp plasmid pDETAB13 (Fig. S2). DETAB-39 is STIP221 and STOX351. Despite growing on the initial meropenem-supplemented CHROMagar plate, DETAB-P39 was phenotypically sensitive to meropenem and to all other antibiotics tested (Table S3), and its genome does not contain any acquired antibiotic resistance genes.The rep gene of pDETAB13 is 99.4 % identical to that of the reference GR13 plasmid pA3ABAYE and 98.3 % identical to that of pDETAB5. pDETAB13 contains nine pdif sites and nine complete insertion sequences (ISs), including the novel ISAba62, ISAba63 and ISAba64 (Fig. 1b and S2). Excluding ISs, just 20 998 bp of pDETAB13 is homologous to pDETAB5, but the shared sequences are split across eight regions that range from 99–93% identical (Fig. 1b). The shared segments include the rep gene and putative partitioning genes parAB in a contiguous region (a and h in Fig. 1b), and determinants for several proteins with putative functions: a HipA-like toxin, a toxin–antitoxin system, formaldehyde dehydrogenase, UvrA-like excision endonuclease, and two tyrosine recombinases (also called integrases, Pfam: PF00589). The toxin–antitoxin genes in pDETAB13 are found in a 696 bp dif module (dif-696b) that is 93.6 % identical to the one in pDETAB5 (dif-696a). Accumulated SNPs differentiate the dif-696 variants, suggesting that the presence of these modules in pDETAB5 and pDETAB13 is not the result of a recent horizontal transfer event.Some notable ORFs in pDETAB13 are not shared by pDETAB5. A cluster of nine ORFs located in a 10 724 bp region, which we termed ADH, between ISAba62 and ISAba63 includes determinants for a putative transcriptional regulator, putative alcohol dehydrogenases and putative metabolic enzymes including a monooxygenase, amidotransferase, hydrolase, alkene reductase and oxidoreductase. A 2111 bp dif module, dif-2111, encodes a putative NAD(P)-dependent alcohol dehydrogenase and a LysR-family transcriptional regulator. Three other dif modules in pDETAB13 were not assigned functions as they encode hypothetical proteins.
GR13 plasmids have been collected from diverse sources
To characterize the GR13 plasmid family, we conducted a comparative analysis of publicly available sequences. The 1173 bp rep gene of reference GR13 plasmid p3ABAYE (GenBank accession CU459140) [9] was used to query the GenBank non-redundant nucleotide database (last search 9 August 2021), and 40 complete plasmid sequences containing rep genes that ranged from 99.1–97.3% identical to the query were found, along with a single plasmid with a rep gene 79.8 % identical to the query (Table S5). These GR13 plasmids are from
,
,
,
, A. soli, A. seifertii and A. radioresistans that were isolated from various cities in China as well as from Japan, Cambodia, Thailand, Vietnam, India, Pakistan, Australia, Chile, the USA, the Czech Republic, France, Germany and the Netherlands between 1986 and 2020 (Table S5). Sources of isolation ranged from human clinical specimens and hospital environments to terrestrial and marine animals and environments (Table S5). The plasmids range in size from 50 047 bp to 206 659 bp and three carry additional replication genes (Table S5), suggesting that they have formed cointegrates with plasmids from different rep groups.
The small GR13 core genome has been subject to recombination
To characterize the gene content of GR13 plasmids, a pangenome was constructed. This consisted of 1190 genes: two considered core (present in 43 plasmids), two soft-core (42 plasmids), 123 shell (7 to 40 plasmids) and 1063 cloud (1 to 6 plasmids).The four core genes were rep, the putative partitioning genes parA and parB, and a putative tyrosine recombinase gene that we will refer to as tyr13. Though parAB were found in only 42 of the 43 plasmids by the pangenome approach, using tblastn to query the remaining plasmid sequence (CP038259) with the amino acid sequences of ParA and ParB of pDETAB13 revealed equivalent genes with nucleotide identities of 78.8 and 79.8 %, respectively. The parAB genes were found adjacent to one another in all 43 plasmids and were usually adjacent to rep, but tyr13 was never found adjacent to rep or parAB.The conservation of core genes was investigated by using BAPS to place gene sequences into allelic groups that differed by few SNPs and exhibited common SNP patterns that likely arose cumulatively from a recent ancestor. The distribution of parAB and tyr13 allelic groups were visualized relative to a rep gene phylogeny (Fig. 2a) and instances of recombination were identified where phylogenetic clusters did not contain parA, parB and tyr13 from the same allelic groups. Substitution of tyr13 genes appears to have occurred on multiple occasions while a single example of parB allele substitution was seen in CP022299.
Fig. 2.
The GR13 plasmid family pangenome. (a) Plasmid core-gene allelic group identities displayed relative to a rep gene phylogeny. (b) GR13 pangenome displayed relative to the rep gene phylogeny. Cluster and lineage memberships are indicated to the left of the pangenome visualization.
The GR13 plasmid family pangenome. (a) Plasmid core-gene allelic group identities displayed relative to a rep gene phylogeny. (b) GR13 pangenome displayed relative to the rep gene phylogeny. Cluster and lineage memberships are indicated to the left of the pangenome visualization.
GR13 plasmid lineages have disseminated widely
The rep gene phylogeny was used to sub-type GR13 plasmids. The collection was partitioned into four broad-ancestry clusters of plasmids that, apart from CP022299, shared core replication and partitioning genes from the same allelic groups, reflecting their common ancestry. Further rep gene variation, as evident in the phylogenetic tree (Fig. 2b), indicated that clusters could be partitioned into epidemiologically relevant plasmid lineages. We have defined three GR13 lineages that are represented by four or more plasmids in this collection that are not separated by any SNPs in the rep gene phylogeny (Fig. S3), equivalent to a total rep identity of >99.8 % for lineage 1 (where the rep genes of three plasmids are missing a nucleotide in three different homopolymeric runs, likely due to sequencing errors) and 100 % for lineages 2 and 3. Plasmids in the same lineage share significant accessory gene content (Fig. 2b), consistent with them having descended from an ancestral plasmid that contained the same rep gene and a conserved set of accessory genes. The presence of accessory genes that differentiate some plasmids from other members of the same lineage highlight their capacity to diversify through gene acquisition.Host species and sources of isolation varied within lineages, indicating that they have disseminated internationally and between
species. The best-represented lineage in this collection, lineage 1, contains the reference GR13 plasmid p3ABAYE and 15 others. p3ABAYE is from a clinical
isolated in France in 2001, while other members of lineage 1 are from
,
and
strains from human clinical samples in various Chinese provinces, Australia, Colombia and Germany (Table S5). A single lineage 1 plasmid is derived from marine sediment. Lineage 1 has a well-conserved accessory genome, consisting of 39 core genes (present in all 16 plasmids), 58 shell genes (in two to 15 plasmids) and just two cloud genes (in one plasmid each). Lineage 2 plasmids include pDETAB5 and four other ARG-bearing plasmids. Representatives of lineage 2 have been found in
,
and an isolate of indeterminate
species derived from clinical samples, an ICU environment or sewage, but only in mainland China, Taiwan or Vietnam. In contrast, the four representatives of lineage 3 are from
or an indeterminate
that were isolated across wide geographic and temporal spans: soil from the USA in 1986, a spacecraft-associated clean room in the Netherlands in 2008, an intensive care unit sink in Pakistan in 2016 and bigeye tuna in China in 2018. Taken together, the distributions of lineage 1, 2 and 3 plasmids emphasize the capacity of GR13 plasmids for widespread dissemination and persistence.
Most accessory genes in GR13 plasmids are unique
There were 1090 gene families in the GR13 pangenome, with 4453 genes identified in total. Of the 1090 gene families, 745 could be assigned putative functions with our Prokka annotation and Panaroo approach (68.3 %), while 526 (48.2 %) and 442 (40.6 %) were assigned functions with the COG and KEGG schemes, respectively. COG placed gene families in broad functional categories, most commonly replication and repair (135/526, 25.7 %), transcription (83, 15.8 %) and inorganic transport and metabolism (78, 14.8 %) (File S2). KEGG categories offered more specific functional annotation and facilitated identification of the most common gene functions in the collection. Amongst the 50 most prevalent gene families that were assigned functions (File S2), families with putative metabolic functions were most common. The second most common gene families encode components of toxin–antitoxin systems, with HipA-like and Fic-like toxin-encoding genes the most abundant overall. Other functions of note included DNA integration and recombination (42 gene families, 194 genes, 43 plasmids), antimicrobial resistance (three gene families, six genes, five plasmids), heavy metal resistance (22 gene families, 51 genes, seven plasmids), alcohol tolerance (11 gene families, 172 genes, 32 plasmids), and phage defence (five gene families, 89 genes, 17 plasmids). Some functional groups contained multiple gene families, suggesting that genes with the same functions have been acquired on multiple occasions by different GR13 lineages.Of the 1186 accessory genes, 746 (62.9 %) were found in just a single plasmid each. These so-called ‘singleton genes’ were found in 25 of the 43 plasmids, where they accounted for between 0.7 and 61.4 % of total gene content. The pangenome network showed that many singleton genes were found adjacent to one another in long, contiguous sequences that were unique to the plasmids that carried them (File S3). In CP028560 and CP069498, an abundance of singleton genes coincided with the presence of additional rep genes of types GR34 and GR24, respectively, indicating that plasmid cointegrate formation was associated with the introduction of significant numbers of genes.
ARG-bearing dif modules are subject to rearrangement by insertion sequences
Lineage 2 plasmids and the only other ARG-containing plasmid, CP033563 from an
isolated in Taiwan in 2010, carry ARGs in dif modules. Only two plasmids from lineage 2 appear to have acquired additional ARGs: KT852971 has acquired a sul1-containing class 1 integron with cassette array aadB-arr-2-cmlA1-aadA1 and JX101647 has acquired sul2 and aphA1 in an insertion within an existing ARG-containing dif module, described below.To assess whether and how individual dif modules have evolved since they were acquired by the ancestor of lineage 2 plasmids, their ARG-containing dif modules were compared. The msr(E)-mph(E) module was unchanged between the four plasmids that carried it, but variation was seen across bla
OXA-58 (Fig. 3a), sul2 (Fig. 3b) and aacC2d-containing (Fig. 3c) modules. The novel IS elements ISAso1 and ISAso2, members of the uncharacterized ISNCY family, were acquired by both the bla
OXA-58 and sul2-containing modules in JX101647 (Fig. 3). Both IS inserted in the same orientation adjacent to the XerC binding ends of pdif sites that flank their respective dif modules (Fig. 3a, b), with the 5 bp immediately adjacent to pdif involved in the target site duplication generated by insertion. The IS6/IS26-family element IS1008 fused the bla
OXA-58-containing module of CP027245 to the remnant of a previously described RND efflux module [11] following a deletion of indeterminate length (Fig. 3a). IS1008 also deleted part of the sul2-containing module of CP028560 and brought the remainder adjacent to another sequence, possibly a remnant of a floR-containing module as IS1008 has truncated the floR gene (Fig. 3b). In JX101467 a partial deletion of the ISAba14-like element is associated with the acquisition of a 12213 bp segment bounded at one end by an ISOur1-like element and at the other by an ISAlw27-like element (Fig. 3c). The acquired segment includes sul2 and a truncated copy of Tn5393 that is interrupted by the aphA1-containing Tn4352. These examples highlight the capacity of IS to influence the accessory content of dif modules through insertion and by mediating deletion events.
Fig. 3.
ARG-containing dif module variants. Scaled diagrams of dif modules containing (a) bla
OXA-58, (b) sul2 and (c) aacC2d. The extents and orientations of ORFs are indicated by labelled horizontal arrows and IS are shown as labelled boxes. IS that are the same colour belong to the same family. Drawn to scale from GenBank accessions CP072528, JX101647, CP028560, KT852971 and CP027245.
ARG-containing dif module variants. Scaled diagrams of dif modules containing (a) bla
OXA-58, (b) sul2 and (c) aacC2d. The extents and orientations of ORFs are indicated by labelled horizontal arrows and IS are shown as labelled boxes. IS that are the same colour belong to the same family. Drawn to scale from GenBank accessions CP072528, JX101647, CP028560, KT852971 and CP027245.
Diverse accessory genes are found in dif modules
To further characterize their potential for mobilizing accessory genes other than the well-known ARGs, we examined the content and distribution of 17 dif modules identified in pDETAB5, pDETAB13, p3ABAYE, AP024799, CP022299 and CP068175 (Table S6). The sequences of these modules were used to query the GR13 collection with blastn and their distributions are shown in Fig. 4(a). Twelve dif modules were only carried by the plasmid or plasmid lineage that they were identified in, but five were found in multiple lineages, suggesting that they have been acquired independently.
Fig. 4.
Novel dif modules carrying diverse accessory genes. (a) Presence and absence of dif modules identified in GR13 plasmids. Plasmids are ordered as in the rep gene phylogeny shown in Fig. 2, and membership of lineages 1, 2 and 3 is indicated. The presence of variant BREX and ADH segments are indicated by different shades of colour (b) Scaled diagrams of selected dif modules. (c) Scaled diagrams of accessory gene segments. Sequences in parts (b) and (c) were drawn to scale from GenBank accessions CP072528, CP073061, CU459140, AP024799, CP022299 and CP068175.
Novel dif modules carrying diverse accessory genes. (a) Presence and absence of dif modules identified in GR13 plasmids. Plasmids are ordered as in the rep gene phylogeny shown in Fig. 2, and membership of lineages 1, 2 and 3 is indicated. The presence of variant BREX and ADH segments are indicated by different shades of colour (b) Scaled diagrams of selected dif modules. (c) Scaled diagrams of accessory gene segments. Sequences in parts (b) and (c) were drawn to scale from GenBank accessions CP072528, CP073061, CU459140, AP024799, CP022299 and CP068175.Three dif modules identified here (dif-2111, dif-6874 and dif-7136) encode putative alcohol dehydrogenases, which may be involved with alcohol tolerance and quorum sensing [42]. The largest module we identified, dif-28327, encodes putative copper resistance proteins, and dif-7932 encodes a set of putative metabolic proteins that appear to be involved with aromatic compound degradation (Fig. 4b). The dif-1769 module carries a putative sulphate permease determinant and is 88 % identical to part of a sulP module that has been described previously [26]. The dif-6620 module found only in the cointegrate plasmid CP022299 contains a rep gene 82.3 % identical to the reference GR26 rep (GenBank accession CP015365), as well as a putative mobilisation gene (Fig. 4b). This appears to be a small plasmid that has been integrated through recombination at pdif sites. The remaining modules could not be assigned putative functions, though one of these, dif-801, encodes a protein with a VRR-NUC domain (Pfam: PF08774), equivalents to which have been described in restriction endonucleases (43).Two sets of ORFs from the GR13 shell genome that appeared to be discrete units in the genome network were examined to determine whether they were found in well-conserved dif modules. The first set of ORFs resemble determinants for bacteriophage exclusion (BREX) systems [44], and likely have anti-phage functions. The BREX determinants are not part of an identifiable dif module. Instead, they are found in a 26 140 bp segment flanked by partial copies of ISAlw4 and ISAba12 (Fig. 4c). The same BREX segment is present in 14 of 16 plasmids from lineage 1, while variant sequences are present in four plasmids from elsewhere in the phylogeny (Fig. 4a). The second set of well-conserved ORFs correspond to the ADH segment of pDETAB13 that contains two putative alcohol dehydrogenase determinants, as well as several ORFs with expected metabolic functions and one for a putative transcriptional regulator (Fig. 4c). Variants of the ADH segment are found in 30 of the 43 GR13 plasmids studied here (Fig. 4a).
Diverse insertion sequences shape GR13 plasmid accessory content
We used a version of the ISFinder database to screen the GR13 plasmid collection and assess the abundance, diversity and richness of IS. Individual plasmids contained between one and 34 IS, with up to 19 different IS and up to six copies of the same IS found in a single plasmid (Fig. S4). Sixty-four different IS were found, representing 15 different IS families. These included 17 novel IS that differed from named sequences by greater than 5 % nucleotide identity and were characterized, submitted to ISFinder and assigned names as part of this study (Fig. S4, Table S2).Members of the IS3 (17 different IS) and IS5 (12 different IS) families dominated the IS population, with representatives of one or both found in all but two GR13 plasmids. The next best-represented family was IS6/IS26 (6 different IS), members of which are known to be strongly associated with antibiotic resistance genes [45]. The highest numbers of IS6/IS26 family elements were found in the ARG-containing lineage 2 plasmids, where many were associated with ARG-containing dif modules (green in Fig. 3), but these elements were also seen in 13 plasmids that do not contain ARGs. ISNCY-family IS (7 different IS; 26 copies) including ISAlw22, ISAso1, ISAso2 (Figs. 3 and 4B), and 4 more novel IS identified here are distributed throughout the GR13 family.
Discussion
Our discovery of two GR13-type plasmids in unrelated
strains isolated a month apart in the same ICU, one cryptic and the other conferring multi-drug resistance, prompted an investigation of the wider GR13 plasmid family, which had not been studied previously. GR13 plasmids are found in multiple
species from a diverse set of environments. The four-gene core of GR13 plasmids is associated with a diverse accessory genome influenced by the exchange of dif modules, the acquisition of translocatable elements and IS-mediated deletions. This characterization of a family of
plasmids that can carry clinically significant ARGs adds to a growing body of literature on the accessory genepool of this important human pathogen and the role of plasmids in generating diversity across this genus.
Diversity in GR13 plasmids: consequences for genomic surveillance and epidemiology
We identified three GR13 lineages on the basis of rep gene identity that we found share lineage-specific sets of accessory genes. Although rep or core-gene typing cannot account for all accessory gene diversity within GR13 lineages, we found that plasmids in the same lineage share significant gene content. Lineage-specific markers like rep and parAB might be used in targeted surveillance programmes to detect clinically relevant plasmids such as pDETAB5 and other members of lineage 2. Representatives of lineage 2 have so far only been seen in isolates from China or neighbouring Vietnam (Table S5), where the first example appeared in 2005, but it will be interesting to trace this lineage and monitor the dynamics of its dispersal in epidemiological studies. We have provided the sequences of the rep and parAB genes that can be used to identify plasmids from lineages 1, 2 and 3 in File S4. These can be used to track the dissemination of GR13 lineages across
.
How do GR13 plasmids spread horizontally?
Plasmids from the same GR13 lineages have been found in different host species that have been isolated from various sources and geographic locations. This is clear evidence for their widespread dissemination and ability to replicate in various
species. However, the mechanisms responsible for the horizontal transmission of GR13 plasmids remain unclear. In this study, pDETAB5 transferred from DETAB-E227 to
ATCC 17978 at a relatively low frequency, but failed to transfer from DETAB-E227 or ATCC 17978 to
strain XH1816.No candidate set of ORFs for a type IV secretion system that might be associated with conjugation were found in pDETAB5 or any of the GR13 plasmids examined here, so it appears they rely on alternative mechanisms for horizontal transfer. In contrast, other large
plasmids have been shown to carry conjugation determinants in conserved backbones [13, 16, 46] while small plasmids that carry an origin-of-transfer (oriT) and cognate mobilisation genes [25] or oriT alone [47] can be mobilized by co-resident conjugative plasmids. It is possible that the integration of small mobilisable plasmids through recombination at pdif sites contributes to the mobility of GR13 plasmids. An example of this is seen in CP022299 where all or part of a putatively mobilisable plasmid is present in the dif-6620 module (Fig. 4). The acquisition of oriT sequences through small plasmid integration has been described for large plasmids in
and
[48, 49], though in those cases integration did not involve pdif sites. Horizontal transfer in outer membrane vesicles has also been reported in
[50, 51] and this, or other passive DNA transfer mechanisms, might play a role in plasmid dispersal. The potential role of ATCC 17978’s resident plasmids in horizontal transfer of pDETAB5 was not examined experimentally here, so we cannot exclude the possibility that these played a role.
Site-specific recombination and tyrosine recombinase genes in GR13 plasmids
The importance of site-specific recombination to the evolution of some types of plasmids in
has become increasingly evident. XerC and XerD-mediated recombination at pdif sites is implicated in the movement of dif modules between plasmids of different types [12, 25, 52], and has been shown experimentally to generate cointegrate plasmids [53]. Recombination at pdif sites can also resolve cointegrates, potentially generating hybrids of the initial cointegrate-forming molecules [53]. This process likely explains the striking similarity of the GR13 plasmid pDETAB5 and GR34 plasmid pDETAB2 (Fig. 1b). Supporting this hypothesis, another plasmid examined here, CP028560, is a cointegrate with GR13 and GR34 replicons identical to those in pDETAB5 and pDETAB2, and appears to represent an evolutionary intermediate.Given pdif sites appear to play a major role in the evolution of some plasmids, it will be important to define the types of plasmids that carry them and can participate in XerC/D-mediated cointegration events or the exchange of dif modules. A recently characterized family of
plasmids has pangenome characteristics similar to the GR13 family, and representatives carry mosaic regions that were called ‘hotspots’ [54]. The movement of dif modules might explain the dynamics of these hotspot regions. It will be useful to identify and study specific pdif-containing plasmid lineages over sustained periods of time to track small-scale evolutionary changes and further our understanding of the evolutionary consequences of pdif carriage.
The dif module gene repertoire continues to grow
The first-described dif modules contained ARGs, but further studies have revealed that these mobile elements can carry a diverse array of passenger genes. Our characterization of selected dif modules in GR13 plasmids expands the known repertoire of genes associated with these elements, further highlighting their important role in the diversification of the
accessory genome.Many modules with diverse functions, including those expected to contribute to clinically relevant traits such as antibiotic resistance or alcohol tolerance, are accompanied by one or more dif modules carrying toxin–antitoxin genes [11, 12, 25]. These are expected to contribute to the stability of their host plasmids, and therefore to co-resident dif modules. ORFs with toxin–antitoxin functions made up 16 % of functionally annotated gene families in GR13 plasmids, suggesting that they play an important role in plasmid persistence. Diversity seen in toxin–antitoxin modules here and elsewhere support the hypothesis that these and other dif modules are ancient elements that have co-evolved with the plasmids of
[25].
Insertion sequences target and reshape dif module-containing plasmids
By definition IS do not encode proteins other than those required for their transposition, but their actions can profoundly influence the evolution of their host molecules [55]. In this study we found cases where IS that are expected to generate target site duplications on insertion are not flanked by them, suggesting that they have mediated deletion events. These deletion events have clearly been responsible for sequence loss from dif modules, or the fusion of dif modules to other sequences (Fig. 3). It appears IS-mediated deletion events can produce novel, hybrid dif modules, though whether these are mobile is likely to depend on the precise sequences of their new flanking pdif sites [52]. IS-mediated deletions might also remove pdif sites associated with dif modules, creating larger segments that might resemble the IS-flanked BREX and ADH segments (Fig. 4c).Two elements characterized here, ISAso1 and ISAso2, are distantly related to one another (encoding 71.0 % identical transposases), but inserted in the same orientation immediately adjacent to the XerC binding regions of pdif sites (Fig. 3). A further four elements characterized here (ISAba69, ISAba70, ISAjo5 and ISApi2) are related to ISAso1 and ISAso2 and appear to exhibit the same target specificity. Together with previous descriptions of ISAjo2-like elements [12, 25], our findings support the notion that this group of IS are ‘dif site hunters’. The presence of dif site hunters can be considered strongly indicative of the presence of pdif sites in
plasmids, and might aid in the identification of plasmid types that participate in the exchange of dif modules.
Conclusions
GR13 plasmids have the capacity to accumulate diverse accessory sequences that may provide fitness advantages in the array of environments inhabited by
species. Some accessory modules pose risks to human health as they might contribute to the persistence of
populations in hospital environments. GR13 plasmid lineages have disseminated internationally and amongst different
species. Genomic surveillance should be coupled with experimental characterization of these plasmids to better understand their contribution to the diversification and success of
, particularly in nosocomial settings.Click here for additional data file.
Authors: Haiyang Liu; Robert A Moran; Ying Chen; Emma L Doughty; Xiaoting Hua; Yan Jiang; Qingye Xu; Linghong Zhang; Jessica M A Blair; Alan McNally; Willem van Schaik; Yunsong Yu Journal: J Antimicrob Chemother Date: 2021-04-13 Impact factor: 5.790
Authors: María M Cameranesi; Jorgelina Morán-Barrio; Adriana S Limansky; Guillermo D Repizo; Alejandro M Viale Journal: Front Microbiol Date: 2018-01-26 Impact factor: 5.640
Authors: Gerry Tonkin-Hill; Neil MacAlasdair; Christopher Ruis; Aaron Weimann; Gal Horesh; John A Lees; Rebecca A Gladstone; Stephanie Lo; Christopher Beaudoin; R Andres Floto; Simon D W Frost; Jukka Corander; Stephen D Bentley; Julian Parkhill Journal: Genome Biol Date: 2020-07-22 Impact factor: 13.583
Authors: Robert A Moran; Haiyang Liu; Emma L Doughty; Xiaoting Hua; Elizabeth A Cummins; Tomas Liveikis; Alan McNally; Zhihui Zhou; Willem van Schaik; Yunsong Yu Journal: Microb Genom Date: 2022-06