Literature DB >> 26008700

A Taxonomic Review of Clostridium difficile Phages and Proposal of a Novel Genus, "Phimmp04likevirus".

Katherine R Hargreaves1,2, Martha R J Clokie3.   

Abstract

Currently, only three phages that infect the medically important bacterium Clostridium difficile have been discussed by the International Committee of Viral Taxonomy (ICTV). They are all myoviruses, and have been assigned to the genus "phicd119likevirus". An additional nine phages have since been described in the literature with their genome data available. The Phicd119likevirus is named after the type species: the myovirus ΦCD119 which was the first C. difficile phage to be sequenced. The two additional myoviruses, ϕCD27 and φC2, also fall into this genus based on the similarity of their genome and morphological characteristics. The other nine phages have not been assigned to this genus, and four of them do not fit the criteria for the current taxonomic grouping. We have applied protein clustering analysis to determine their phylogenetic relationships. From these results we propose an additional myoviridae genus, that we term "phiMMP04likevirus".

Entities:  

Keywords:  Clostridium difficile; Myoviridae; Siphoviridae; bacteriophage; bacteriophage phylogeny; protein clustering; small myovirus; taxonomy

Mesh:

Substances:

Year:  2015        PMID: 26008700      PMCID: PMC4452919          DOI: 10.3390/v7052534

Source DB:  PubMed          Journal:  Viruses        ISSN: 1999-4915            Impact factor:   5.048


1. Introduction

The sequenced Clostridium difficile phages that have been described in the literature are all members of the Caudovirales order (tailed phages) and belong to either members of the Myoviridae or Siphoviridae families [1,2,3,4,5,6,7,8]. The myoviruses can be classified into three distinct morphological groupings based on the capsid diameter and tail lengths [9]: medium sized with capsid diameters between ~60–70 nm and tail lengths of 110–130 nm, long tailed with capsid diameter between ~60 to ~70 nm and tail lengths between 150–260 nm, and small myoviruses with capsid diameters of 40–60 nm and tail lengths of 105–110 nm (Table 1). The phages are all temperate as defined by the presence of predicted integrases, and their genome sizes range from ~31 to ~57 kbps in length with GC% from 28.4% to 30.8% (see Table S1 for genome accession numbers).
Table 1

Morphological types of sequenced phages.

MorphotypeSequenced phages of this morphologyCapsid diameter (nm)Tail length (nm)Genome size (kbp)
MMΦCD119, φC2, phiCDHM1, phiCDHM19,60–70110–13053–57
SMVΦMMP04, phiCDHM11, phiCDHM13, phiCDHM1440–58106~31
LTMϕCD27, ΦMMP0260–70150–260~51
Morphological types of sequenced phages. Previously, the criteria for inclusion to the myovirus genus phicd119likevirus is the presence of a cytosine-C5 specific DNA methylase (ΦCD119 protein YP_529611.1), and a three gene cassette containing a DnaD (ΦCD119 protein YP_529603.1), a hypothetical protein which is unannotated in the record of ΦCD119 and a single-stranded DNA binding protein (ΦCD119 protein YP_529604.1) [10]. Since this publication an additional six phages have been reported that have not been assigned to this genus, or do not meet these criteria [4,5,6,7,8]. Genome comparisons using either whole genome alignments at the nucleotide level or blastn similarity scores support the distinct groupings of these phages [8,9]. C. difficile phages have been the subject of interest due to their ability to further our understanding of the biology of this major human pathogen, and because of their possible exploitation as genetic tools, or their application as novel therapeutics. Indeed, the phages φC2, ϕCD27 and φCD38-2 have been found to modulate toxin production [5,11,12], and φC2 has been demonstrated to perform generalised transduction of a transposon containing antibiotic resistance genes [13]. Phages infecting C. difficile, and their products, have been studied for the purposes of developing novel treatments for C. difficile infections, this includes the sequenced ϕCD27 and its endolysin [12,14]. In order to make sense of both the aspects of fundamental biology that C. difficile phages impact, and to assist in their selection and development as therapeutics, an update on their taxonomic classification is needed. In order to determine suitable new genera and the defining characteristics we have referenced existing genome comparisons, and have applied protein clustering analysis to twelve C. difficile phages.

2. Results

Taxonomic Clusters and Protein Cluster Analysis

We performed a protein clustering based analysis to determine the core proteomes from the annotated Genbank files for the C. difficile phages (Figure 1). A total of 798 protein sequences from the twelve phage genomes (Supplementary Table S1 for details) were compared and 445 protein clusters were identified. The largest cluster included ten of the phages, which were all myoviruses (Figure 1A). This one shared protein cluster is based on a gene encoding an unknown hypothetical protein (reference ORF 3 in ΦCD119). A binary matrix derived from the protein cluster data was used to perform phylogenetic analysis using Bayesian inference in Mr. Bayes (Figure 1B and Supplementary Table S2).
Figure 1

Network of C. difficile phages based on conserved protein clusters. (A) Conserved protein clusters shown per phage, colour indicates grouping by morphology (blue for siphoviruses, purple for small myovirses, magenta for medium myoviruses and red for long tailed myoviruses). The y axis is cluster size (number of phages represented in each cluster), x axis is protein cluster, sorted by cluster size and z axis is each phage; (B) Tree generated using Bayesian phylogenetic inference with node probability based on standard data produced from protein clustering. Taxa clusters correspond to the proposed phage genus groups: phiCD119viruslike with ΦCD119, phiCDHM19, φC2, phiCDHM1, ϕCD27 and ΦMMP02, phiMMP04likevirus with ΦMMP04, phiCDHM13, phiCDHM14 and phiCDHM11. Branch colours correspond to phage particle morphology as in A.

Network of C. difficile phages based on conserved protein clusters. (A) Conserved protein clusters shown per phage, colour indicates grouping by morphology (blue for siphoviruses, purple for small myovirses, magenta for medium myoviruses and red for long tailed myoviruses). The y axis is cluster size (number of phages represented in each cluster), x axis is protein cluster, sorted by cluster size and z axis is each phage; (B) Tree generated using Bayesian phylogenetic inference with node probability based on standard data produced from protein clustering. Taxa clusters correspond to the proposed phage genus groups: phiCD119viruslike with ΦCD119, phiCDHM19, φC2, phiCDHM1, ϕCD27 and ΦMMP02, phiMMP04likevirus with ΦMMP04, phiCDHM13, phiCDHM14 and phiCDHM11. Branch colours correspond to phage particle morphology as in A. This analysis supports the proposed novel genus and additional members to the existing phicd119ikevirus group. There is a clade for the small myoviruses which is distinct from the medium and long tailed myoviruses with a branch probability of 0.8664 (Figure 1A). From the clustering analysis it is clear that that there is one protein cluster (PC_376, Table S3) which is common to all the myoviruses examined. This encodes a hypothetical protein 60 aa in length, located within the structural region of the phage genomes. A blastp search using the sequence of CDHM1_gp32 showed a high degree of conservation across multiple C. difficile strains, and has usually been annotated as a phage protein. It does share similarity (with an E value of 3e-24 and 85% identity) in a 60 aa region of a 771 aa protein in CD630 (CD630_29521, accession CAJ69777.3) which encodes a predicted ABC transport ATP binding protein. The region sharing similarity is not across the ATP binding predicted domains, or signature motifs, but it suggests that the phage protein could interact with the C. difficile cell membrane. This prediction is supported by its location in the phage genome, which is adjacent to other CDSs that are predicted as tail fibers and base plate proteins. Due to its conserved nature across the phage and prophage genomes, this protein may be a specific marker of phage access to C. difficile. Phylogenetic analysis using Maximum Likelihood (ML) reveals there is no strong signature across the phage morphotypes (Figure 2).
Figure 2

Phylogeny of shared C. difficile myovirus protein in cluster 396.

The nucleotide sequences of shared protein from the ten myoviruses were aligned using MUSCLE and phylogenetic analysis was performed using Maximum likelihood analysis with a Hasegawa-Kishino-Yano [15] nucleotide substitution model and a Gamma distribution rate. The distance between the sequences ranged from 0 to 0.223 and the scale bar indicates number of substitutions per site. The total length of the alignment was 183 positions. The taxa are generally not well supported by bootstrap values, with three clusters having greater than 50; one contains the sequences from the three highly related small myoviruses phiCDHM11, phiCDHM14 and phiCDHM13; the second contains those from ΦCD119 and ΦMMP04 and third expands this cluster to contain phiCDHM19. Phylogeny of shared C. difficile myovirus protein in cluster 396.

3. Discussion

3.1. PhiCD119likevirus Clade

The grouping of ΦCD119, φC2, ϕCD27, ΦMMP04, phiCDHM1 and phiCDHM19 into the phicd119likevirus genus is supported by their clustering with a branch probability of 0.9941. These phages are all represented in 1 protein cluster (PC_166, Table S4) which also encodes a protein of unknown function in the DNA replication region of the genomes. There are distinct subgroupings within this genus, with clades containing φC2 and phiCDHM1, which have 19 protein clusters in common (Table S5). These contain mostly proteins that are predicted to be part of the virion particle and an anti-repressor protein (ORF 55 in φC2). The clade containing ΦCD119 and phiCDHM19 also has 19 protein clusters in common (Table S6) which contain a mixture of structural proteins as well as those located in the DNA replication region of the genomes. The two long tailed myoviruses, ϕCD27 and ΦMMP02 cluster together and share 16 protein clusters (Table S7), also mainly comprising of proteins in the structural region with one located in the putative lysogenic conversion module (ORF 41 in ϕCD27).

3.2. PhiMMP04likevirus Clade

We propose a new genus, phiMMP04likevirus, which is named after the first published small myovirus, ΦMMP04 [16] and which now includes the three small myoviruses phiCDHM14, phiCDHM11 and phiCDHM13 [8]. This taxonomic grouping is supported by the clustering of the four small myoviruses together, supported by a branch probability of 1. These phages share 16 protein clusters in common (Table S8). The majority of these clusters contain sequences of predicted structural genes, but also three ORFS located in the DNA replication region of the genomes all encoding proteins of unknown functions.

3.3. Characteristic Genome Features of the New Genus

Previously, a gene cassette and putative DNA methylase gene was identified in all the genomes of the three current members of phicd119likevirus [17]. This DNA replication cassette is also present in phiCDHM1, phiCDHM19 and ΦMMP02 and supports the addition of these phages to this genus. However, the genomes of the four short tailed myoviruses do not carry this cassette or DNA methylase. The genomes of all four small myoviruses show a high degree of similarity to one another, and contain a putative ParA homolog. The role of this protein in these phages is unknown: it is located on the antisense strand, downstream of the lysis gene cassette which suggests it may form part of the lysogenic conversion module. ParA has a chromosome partitioning role in bacteria replication [17]. The phages amino acid sequence contains a CbiA protein domain (PF01656), and has homology to SpoOJ regulator protein of Clostridium sordellii (Accession CEP41774.1, E value of 4e−116 and identity of 63%), suggesting that it may alternatively influence bacterial sporulation. The two siphoviruses also carry homologs of the ParA-like gene. The small myoviruses and siphoviruses also have in common that their predicted integrases are located on the sense strand following the DNA replication region in the genomes. In contrast, the small myoviruses differ considerably to the siphoviruses across their structural regions and instead have similarity to ΦCD119 in this region. However, a defining feature is the presence of a gene encoding a predicted Clp protease (a family of serine peptidases, PfaM PF00574) which is not present in ΦCD119. We propose that the presence of these two genes, the ParA homolog and Clp protease, forms the genetic basis of their grouping into the novel genus phiMMP04likevirus, alongside particle morphology dimensions and a genome size ranging between 30–33 kb.

3.4. C. Difficile Phages Outside of the Established and Proposed Genera

The two siphoviruses do not fall within either of these genus groups. Between them, the two siphoviruses share four protein clusters in common (Table S9). These four clusters include the endolysin (ORF 28 in ΦCD6356) and holin (ORF 27 ΦCD6356) sequences as well as two from the DNA replication module, one encoding a protein of unknown functions (ORF 47 and ORF 49). The clustering of the two siphoviruses however is not supported by a branch probability, and it may be suitable in the future to further divide these into separate genera depending on future phage discovery. The two siphoviruses both encode the ParA homolog and the genes represented in the protein clusters encoding a protein of unknown function.

3.5. Phage and Prophage Sequences in NCBI

There are several C. difficile phage genomes in the NCBI database and many can be assigned these to these proposed genera based on their overall genome identity using blastn and presence of the characteristic genes described above. For example, phiCD111 (accession LN681535) and phiCD146 (accession LN681536) are both similar to the siphoviruses and contain a ParA homolog and the two hypothetical proteins corresponding to ORF 47 and ORF 49 in ΦCD6356. The phages phi481-1 (accession LN681538) and phiCD506 (accession LN681540) can be placed into the phiMMP04viruslike genus due to their genome size and the presence of both a Clp protease homolog and ParA. Where previously described, the morphological characteristics of these phages correspond to these assignments [6,16]. Other phage genomes and prophage sequences in shotgun genome sequences suggest there may be further diversity across phages infecting this species and it is likely that additional modifications to their taxonomic assignments may be needed in the future following their characterisation.

4. Materials and Methods

Phage sequences were accessed via NCBI or ENA and their genomes were visualised using Artemis v15.0.0 [18]. Protein clusters were generated in CD-HIT [19] and phylogenetic analysis was performed in MrBayes v3.2.4 [20]. Protein cluster statistics were generated in Microsoft Office Excel and phylogenetic tree imaged using Figtree v1.4.2 [21] and Inkscape v0.91 [22]. Phylogenetic analysis of the conserved amino acid sequences were performed in MEGA v6.0 [23], using alignment with MUSCLE and best Maximum likelihood (ML) model was determined for each analysis. Newick trees were visualised in Figtree. Phage genomes were searched using blastn against the NCBI nr/nt databases and individual genes using BLASTp and BLASTn.
  21 in total

1.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

2.  Prophage-stimulated toxin production in Clostridium difficile NAP1/027 lysogens.

Authors:  Ognjen Sekulovic; Mathieu Meessen-Pinard; Louis-Charles Fortier
Journal:  J Bacteriol       Date:  2011-03-25       Impact factor: 3.490

Review 3.  The bacterial ParA-ParB partitioning proteins.

Authors:  C Bignell; C M Thomas
Journal:  J Biotechnol       Date:  2001-09-13       Impact factor: 3.307

4.  Effect of phage infection on toxin production by Clostridium difficile.

Authors:  Shan Goh; Barbara J Chang; Thomas V Riley
Journal:  J Med Microbiol       Date:  2005-02       Impact factor: 2.472

5.  Genomic organization and molecular characterization of Clostridium difficile bacteriophage PhiCD119.

Authors:  Revathi Govind; Joe A Fralick; Rial D Rolfe
Journal:  J Bacteriol       Date:  2006-04       Impact factor: 3.490

6.  Evidence of in vivo prophage induction during Clostridium difficile infection.

Authors:  Mathieu Meessen-Pinard; Ognjen Sekulovic; Louis-Charles Fortier
Journal:  Appl Environ Microbiol       Date:  2012-08-24       Impact factor: 4.792

7.  Characterization of temperate phages infecting Clostridium difficile isolates of human and animal origins.

Authors:  Ognjen Sekulovic; Julian R Garneau; Audrey Néron; Louis-Charles Fortier
Journal:  Appl Environ Microbiol       Date:  2014-02-14       Impact factor: 4.792

8.  Phage ϕC2 mediates transduction of Tn6215, encoding erythromycin resistance, between Clostridium difficile strains.

Authors:  Shan Goh; Haitham Hussain; Barbara J Chang; Warren Emmett; Thomas V Riley; Peter Mullany
Journal:  mBio       Date:  2013-11-19       Impact factor: 7.867

9.  Classification of Myoviridae bacteriophages using protein sequence similarity.

Authors:  Rob Lavigne; Paul Darius; Elizabeth J Summer; Donald Seto; Padmanabhan Mahadevan; Anders S Nilsson; Hans W Ackermann; Andrew M Kropinski
Journal:  BMC Microbiol       Date:  2009-10-26       Impact factor: 3.605

10.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

View more
  13 in total

Review 1.  Surviving Between Hosts: Sporulation and Transmission.

Authors:  Michelle C Swick; Theresa M Koehler; Adam Driks
Journal:  Microbiol Spectr       Date:  2016-08

2.  High Prevalence and Genetic Diversity of Large phiCD211 (phiCDIF1296T)-Like Prophages in Clostridioides difficile.

Authors:  Julian R Garneau; Ognjen Sekulovic; Bruno Dupuy; Olga Soutourina; Marc Monot; Louis-Charles Fortier
Journal:  Appl Environ Microbiol       Date:  2018-01-17       Impact factor: 4.792

Review 3.  Clostridioides difficile phage biology and application.

Authors:  Joshua Heuler; Louis-Charles Fortier; Xingmin Sun
Journal:  FEMS Microbiol Rev       Date:  2021-09-08       Impact factor: 16.408

4.  Complete genome sequence of the newly discovered temperate Clostridioides difficile bacteriophage phiCDKH01 of the family Siphoviridae.

Authors:  Krzysztof Hinc; Monika Kabała; Adam Iwanicki; Gajane Martirosian; Alessandro Negri; Michał Obuchowski
Journal:  Arch Virol       Date:  2021-05-20       Impact factor: 2.685

5.  Complete Genome Sequence of the Novel Temperate Clostridium difficile Phage phiCDIF1296T.

Authors:  Johannes Wittmann; Thomas Riedel; Boyke Bunk; Cathrin Spröer; Sabine Gronow; Jörg Overmann
Journal:  Genome Announc       Date:  2015-08-20

6.  Two Novel Myoviruses from the North of Iraq Reveal Insights into Clostridium difficile Phage Diversity and Biology.

Authors:  Srwa J Rashid; Jakub Barylski; Katherine R Hargreaves; Andrew A Millard; Gurinder K Vinner; Martha R J Clokie
Journal:  Viruses       Date:  2016-11-16       Impact factor: 5.048

7.  Toward Understanding Phage:Host Interactions in the Rumen; Complete Genome Sequences of Lytic Phages Infecting Rumen Bacteria.

Authors:  Rosalind A Gilbert; William J Kelly; Eric Altermann; Sinead C Leahy; Catherine Minchin; Diane Ouwerkerk; Athol V Klieve
Journal:  Front Microbiol       Date:  2017-12-05       Impact factor: 5.640

Review 8.  Mechanistic Insights in the Success of Fecal Microbiota Transplants for the Treatment of Clostridium difficile Infections.

Authors:  Amoe Baktash; Elisabeth M Terveer; Romy D Zwittink; Bastian V H Hornung; Jeroen Corver; Ed J Kuijper; Wiep Klaas Smits
Journal:  Front Microbiol       Date:  2018-06-12       Impact factor: 5.640

Review 9.  The Regulatory Networks That Control Clostridium difficile Toxin Synthesis.

Authors:  Isabelle Martin-Verstraete; Johann Peltier; Bruno Dupuy
Journal:  Toxins (Basel)       Date:  2016-05-14       Impact factor: 4.546

10.  Bacteriophage Combinations Significantly Reduce Clostridium difficile Growth In Vitro and Proliferation In Vivo.

Authors:  Janet Y Nale; Janice Spencer; Katherine R Hargreaves; Anthony M Buckley; Przemysław Trzepiński; Gillian R Douce; Martha R J Clokie
Journal:  Antimicrob Agents Chemother       Date:  2015-12-07       Impact factor: 5.191

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.