| Literature DB >> 31653906 |
Helen Alexandra Shaw1,2, Ladan Khodadoost3, Mark D Preston4,5, Jeroen Corver6, Peter Mullany3, Brendan W Wren4.
Abstract
The major global pathogen Clostridium difficile (recently renamed Clostridioides difficile) has large genetic diversity including multiple mobile genetic elements. In this study, whole genome sequencing of 86 strains from the poorly characterised clade 3, predominantly PCR ribotype (RT)023, of C. difficile revealed distinctive surface architecture characteristics and a large mobile genetic island. These strains have a unique sortase substrate phenotype compared with well-characterised strains of C. difficile, and loss of the phage protection protein CwpV. A large genetic insertion (023_CTnT) comprised of three smaller elements (023_CTn1-3) is present in 80/86 strains analysed in this study, with genes common among other bacterial strains in the gut microbiome. Novel cargo regions of 023_CTnT include genes encoding a sortase, putative sortase substrates, lantibiotic ABC transporters and a putative siderophore biosynthetic cluster. We demonstrate the excision of 023_CTnT and sub-elements 023_CTn2 and 023_CTn3 from the genome of RT023 reference strain CD305 and the transfer of 023_CTn3 to a non-toxigenic C. difficile strain, which may have implications for the use of non-toxigenic C. difficile strains as live attenuated vaccines. Finally, we show that the genes within the island are expressed in a regulated manner in C. difficile RT023 strains conferring a distinct "niche adaptation".Entities:
Mesh:
Substances:
Year: 2019 PMID: 31653906 PMCID: PMC6814731 DOI: 10.1038/s41598-019-51628-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1PPEP-1 is inactive in RT023 resulting in stable anchoring of sortase substrates to the cell wall. PPEP-1 from RT023 is insoluble in E. coli and inactive in C. difficile. (a) Translated protein sequence alignment of PPEP-1 in 630 and CD305 showing high sequence identity (*) until truncation of the CD305 protein after the putative active site (blue box). (b) Structural prediction of PPEP-1 in 630 and CD305. (c) Expression of 6xHisTag PPEP-1 from 630 and CD305 in E. coli by Coomassie staining and immunoblotting (Mouse anti-His 1:2,000, 680IRDye anti-mouse 1:2,000). U, uninduced; W, whole cell lysate; S, soluble; I, insoluble; FL, full length; Tr, truncated. Samples normalised to an OD 20/ml. (d) Localisation of sortase substrate CD2831 in C. difficile strains 630 and CD305 by Coomassie staining and immunoblotting (Mouse anti-CD2831 1:2,000, 680IRDye anti-mouse 1:2,000). Sup, supernatant; WCL, whole cell lysate. Black arrow indicates CD2831. Samples normalised to OD 50/ml. Full length gels are provided in Supplementary Fig. S1.
Figure 2RT023 strains show an alteration of CwpV. CwpV contains an in frame stop codon in its signal sequence and has truncated repeats. (a) DNA sequence of first 102 bp of CwpV in 630 and CD305 genomes with adenosine deletion highlighted in red. Translated protein sequences represented in blue arrows with the frame shift represented as a break in CD305, # indicating a stop codon. The signal peptide cleavage site is indicated with a white arrow. (b) PCR of entire CwpV region in three strains of RT023 demonstrating the uniform length of CwpV representing two Type III repeats.
Putative transposable element insertion into clade 3 strains.
| CD305 Locus Tag | Product | CD305 Locus Tag | Product | CD305 Locus Tag | Product |
|---|---|---|---|---|---|
| CD305_02397 | Serine recombinase | CD305_02439 | Site-specific serine recombinase, resolvase family | CD305_02469 | Serine recombinase |
| CD305_02398 | Conjugal transfer protein | CD305_02440 | hypothetical protein | CD305_02470 | conjugal transfer protein/hypothetical protein |
| CD305_02399 | Transcriptional regulator | CD305_02441 | type II toxin-antitoxin system PemK/MazF, mRNA interferase EndoA | CD305_02471 | Helix-turn-helix protein |
| CD305_02400 | DNA-directed RNA polymerase sigma-70 factor | CD305_02442 | RNA polymerase sigma-70 factor, ECF subfamily | CD305_02472 | RNA polymerase sigma-70 factor |
| CD305_02401 | ABC transporter ATP-binding protein | CD305_02443 | hypothetical protein | CD305_02473 | signal transduction histidine kinase |
| CD305_02402 | ABC transporter ATP-binding protein | CD305_02444 | hypothetical protein | CD305_02474 | Transcriptional regulatory protein SpaR/DNA-binding response regulator |
| CD305_02403 | AraC family transcriptional regulator | CD305_02445 | ylaC, RNA polymerase sigma factor | CD305_02475 | hypothetical protein/nsuI protein |
| CD305_02404 | ABC transporter ATP-binding protein | CD305_02446 | hypothetical protein | CD305_02476 | lantibiotic protection ABC transporter permease subunit, MutG family protein |
| CD305_02405 | cobalt transporter, Ecf (energy-coupling factor) | CD305_02447 | hypothetical protein/ABC transporter permease | CD305_02477 | lantibiotic protection ABC transporter permease subunit, MutE/EpiE family protein |
| CD305_02406 | membrane protein | CD305_02448 | ABC transporter ATP binding protein | CD305_02478 | lantibiotic protection ABC transporter, ATP binding protein srtF |
| CD305_02407 | Thiazolinyl imide reductase | CD305_02449 | HTH DNA binding protein, XRE family | CD305_02479 | HTH DNA binding protein, XRE family |
| CD305_02408 | Saccharopine dehydrogenase | CD305_02450 | transglycosylase/CHAP domain protein | CD305_02480 | glutamine amidotransferase, DJ-1/Pfp1 family protein, YdeA |
| CD305_02409 | non-ribosomal peptide synthetase/pyochelin synthetase F | CD305_02451 | srtB | CD305_02481 | transcriptional regulator, deoR-like HTH DNA binding protein, YafY family transcriptional regulator |
| CD305_02410 | non-ribosomal peptide synthetase | CD305_02452 | hypothetical protein | CD305_02482 | hypothetical protein/conjugative transposon protein |
| CD305_02411 | 2,3-dihydrozybenzoate-AMP ligase | CD305_02453 | hypothetical protein/TraE family protein/TrsE protein | CD305_02483 | membrane hypothetical protein, |
| CD305_02412 | 4′-phosphopantetheinyl transferase | CD305_02454 | PrgI superfamily | CD305_02484 | transcriptional regulator, AbrB family domain protein |
| CD305_02413 | thioesterase | CD305_02455 | hypothetical protein | CD305_02485 | Conjugative transposon protein |
| CD305_02414 | 3-deoxy-7-phosphoheptulonate synthase | CD305_02456 | TraG/TraD family protein/TsrK family protein conjugal transfer protein | CD305_02486 | lysozyme like superfamily, peptidase NLPC_P60 superfamily |
| CD305_02415 | Salicylate synthase | CD305_02457 | hypothetical protein/ltrC-like protein | CD305_02487 | MFS transporter, transposon protein |
| CD305_02416 | transcriptional regulator (DtxR/MntR Manganese regulation) | CD305_02458 | hypothetical protein/PcfB family protein | CD305_02488 | AAA-like protein, ATP/GTP binding protein |
| CD305_02417 | CDGSH-type zinc finger/transposase | CD305_02459 | relaxase | CD305_02489 | ArdA, antirestriction family protein, Tn916 like |
| CD305_02418 | DEAD/DEAH box helicase/Type 1 restriction endonuclease subunit R | CD305_02460 | topoisomerase/ltrC-like protein | CD305_02490 | ArdA, antirestriction family protein, Tn916 like |
| CD305_02419 | Hypothetical protein/Putative nucleotide binding | CD305_02461 | hypothetical protein | CD305_02491 | alpha/beta hydrolase family protein |
| CD305_02420 | Type 1 restriction-modification protein subunit S | CD305_02462 | hypothetical protein | CD305_02492 | Hypothetical protein |
| CD305_02421 | SAM-dependent DNA methyltransferase | CD305_02463 | DNA methylase/DNA helicase | CD305_02493 | Putative conjugal transfer protein |
| CD305_02422 | Conjugal transfer protein | CD305_02464 | hypothetical protein | CD305_02494 | Cro/Cl family transcriptional regulator, XRE family transcriptional regulator, replication initiation protein |
| CD305_02423 | Peptidase P60, cell wall hydrolase | CD305_02465 | hypothetical protein | CD305_02495 | Cell division protein FtsK/SpoIIIE-family protien Tn916-like |
| CD305_02424 | Transposase/major facilitator superfamily | CD305_02466 | CnaB collagen binding protein/TonB-dependent receptor - LPXTG | CD305_02496 | MBL-fold metallo-hydrolase/beta-lactamase |
| CD305_02425 | ATP/GTP binding protein (CTn3) | CD305_02467 | chromosome partitioning protein parB | CD305_02497 | Conjugative transposon protein |
| CD305_02426 | conjugal transfer protein, tcpE family protein | CD305_02468 | chromosome partitioning protein parA/sporulation initiation inhibition soj_1 | CD305_02498 | Conjugative transposon protein |
| CD305_02427 | Hypothetical protein | CD305_02499 | Putative collagen binding - homologous to CD3392 | ||
| CD305_02428 | Hypothetical protein | ||||
| CD305_02429 | antirestriction protein ArdA | ||||
| CD305_02430 | hypothetical protein | ||||
| CD305_02431 | hypothetical protein | ||||
| CD305_02432 | hypothetical protein | ||||
| CD305_02433 | Cro/Cl family transcriptional regulator, XRE family transcriptional regulator, replication initiation protein | ||||
| CD305_02434 | Cell division protein FtsK/SpoIIIE-family protien Tn916-like | ||||
| CD305_02435 | Conjugal transfer protein | ||||
| CD305_02436 | Conjugal transfer protein | ||||
| CD305_02437 | collagen binding, Cna B domain, LPXTG protein | ||||
| CD305_02438 | hypothetical protein/DNA binding protein |
Figure 3RT023 strains can contain a large novel genomic insertion at the 630_CTn2 site. Analysis of genomic insertions shows a large transposable region in most strains of clade 3. (a) Schematic demonstrating the insertion site in strain CD305 and the empty site within strains 91 and 108698. Grey genes 02396 and 02500 are found within the core genome, with blue genes 02397 and 02499 representing the 5′ and 3′ termini of 023_CTnT. Sequence analysis of the empty site and 5′/3′ sequence of CD305 are shown. (b) Phylogenetic tree demonstrating the clustering of clade 3 strains from this study coloured according to presence (blue) and absence (red) of the transposon region.
Figure 4023_CTnT shows sequence identity with C. difficile and human microbiome genomes. Regions within 023_CTnT are found within other strains of C. difficile and human microbiome genomes. (a) Schematic of the three sequential putative transposons within 023_CTnT in CD305; 023_CTn1, 023_CTn2, 023_CTn3. Regions of homology with strain 630 and UHL-19 transposons are indicated below each RT023 transposon. Gene colours indicate putative functions: pale grey, serine recombinase; grey, transposable element/plasmid conjugation; blue, surface proteins and cell wall regulation; green, DNA associated and regulators; red, ABC transporters; purple, signal transduction; pink, biosynthesis/metabolism; yellow, various functional proteins; white, unknown function. (b) BLASTn analysis of sequence coverage of 023_CTnT. Each sub-element is represented by a shaded grey box with the serine recombinases shown above indicating the predicted junction between each sub-element. Sequence identities of each species is indicated by a black bar representing >70% sequence identity.
Figure 5023_CTnT elements are capable of excising. PCR analysis of CD305 genomic DNA confirmed localisation of 023_CTnT and demonstrated some elements are capable of excising from the genome. (a) Schematic illustrating primer binding to demonstrate element localisation, empty site and circularised sequences. Junctions could be amplified with 1 + 2, 3 + 4, 5 + 6 and 7 + 8. Empty sites could be amplified with 1 + 4, 3 + 6, 5 + 8 and 1 + 8. Circularisation could be amplified with 2 + 3, 4 + 5, 6 + 7 and 2 + 7. (b) PCR analysis of four junctions, empty site and circularisation for 023_CTn1, 023_CTn2, 023_CTn3 and the total site (023_CTnT). +, DNA positive; -, DNA negative. (c) Sequence analysis of the PCR products for 023_CTn3 and 023_CTnT. S, site; C, circularisation; L, left junction; R, right junction.
Frequency of conjugation per donor or recipient (Average of three technical replicates).
| Donor | Recipient | Frequency of conjugation/donor | Frequency of conjugation/donor without DNase | Frequency of conjugation/recipient | Frequency of conjugation/recipient |
|---|---|---|---|---|---|
| CD305 (clone 1) | CD37 | 1.42 × 10−7 | 4.53 × 10−7 | 5.4 × 10−7 | 6.6 × 10−7 |
| CD305 (clone 2) | CD37 | 5.6 × 10−7 | 2.4 × 10−7 | 6 × 10−7 | 2.8 × 10−7 |
Figure 6Genes within 023_CTnT are expressed in clade 3 strains. RNA extracted from exponential and stationary phase cultures of clade 3 strains CD305, CZ0502 and SLH89 show expression of fourteen genes within 023_CTnT. 16S PCRs were undertaken on RT+ and RT- samples to show uniform production of cDNA and an absence of gDNA respectively. L, ladder; 1, 2, 3 – exponential cultures; 4, 5, 6 – stationary phase cultures; 1, 4 – CD305; 2, 5 – CZ0502; 3, 6 – SLH89; G, genomic DNA from CD305; N, water negative control. CD305 gene ID and putative functions as indicated.