| Literature DB >> 31133656 |
Przemyslaw Decewicz1, Lukasz Dziewit2, Piotr Golec1, Patrycja Kozlowska3, Dariusz Bartosik1, Monika Radlinska3.
Abstract
Bacteria of the genus Paracoccus inhabit various pristine and anthropologically-shaped environments. Many Paracoccus spp. have biotechnological value and several are opportunistic human pathogens. Despite extensive knowledge of their metabolic potential and genome architecture, little is known about viruses of Paracoccus spp. So far, only three active phages infecting these bacteria have been identified. In this study, 16 Paracoccus strains were screened for the presence of active temperate phages, which resulted in the identification of five novel viruses. Mitomycin C-induced prophages were isolated, visualized and their genomes sequenced and thoroughly analyzed, including functional validation of their toxin-antitoxin systems. This led to the identification of the first active Myoviridae phage in Paracoccus spp. and four novel Siphoviridae phages. In addition, another 53 prophages were distinguished in silico within genomic sequences of Paracoccus spp. available in public databases. Thus, the Paracoccus virome was defined as being composed of 66 (pro)phages. Comparative analyses revealed the diversity and mosaicism of the (pro)phage genomes. Moreover, similarity networking analysis highlighted the uniqueness of Paracoccus (pro)phages among known bacterial viruses.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31133656 PMCID: PMC6536676 DOI: 10.1038/s41598-019-44460-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sizes of heads and tails of the identified Paracoccus phages.
| Phage | Head width (nm)* | Head length (nm)* | Tail width (nm)* | Tail length (nm)* |
|---|---|---|---|---|
| vB_PbeS_Pben1 | 48.2 ± 8.3 | 49.7 ± 6.9 | 8.7 ± 0.5 | 144.0 ± 11.1 |
| vB_PkoS_Pkon1 | 49.3 ± 1.5 | 59.2 ± 2.5 | 8.8 ± 0.3 | 132.6 ± 9.0 |
| vB_PsuS_Psul1 | 56.6 ± 4.5 | 56.3 ± 0.9 | 8.4 ± 0.4 | 120.4 ± 5.1 |
| vB_PthS_Pthi1 | 53.4 ± 2.4 | 57.7 ± 3.4 | 7.7 ± 1.1 | 98.3 ± 8.3 |
| vB_PyeM_Pyei1 | 58.3 ± 3.0 | 59.3 ± 2.3 | 14.3 ± 1.5 | 135.8 ± 7.8 |
*The presented sizes are averages (±standard deviation) calculated from measurements of 10 randomly picked phage particles.
Figure 1Particle morphology and genome organization of phages vB_PbeS_Pben1, vB_PkoS_Pkon1, vB_PsuS_Psul1, vB_PthS_Pthi1 and vB_PyeM_Pyei1. (A) Transmission electron micrographs of the phage particles. A scale bar is shown below each micrograph. (B) Phage genome organization. Arrows indicate the transcriptional orientation of the genes. The distinguished genetic modules are indicated by black boxes.
General characteristics and features of Paracoccus phage genomes.
| Phage | Phage Family | Genome size (bp) | GC content (%) | No. of genes | Attachment site | |
|---|---|---|---|---|---|---|
| vB_PbeS_Pben1 |
| 39,879 | 64.3 | 71 | Intergenic region | GCGTCTCGTTTACACTGAGA |
| vB_PkoS_Pkon1 |
| 49,723 | 60.6 | 79 | Gene encoding OmpR family transcriptional regulator | GTTTCTCAA(G/C)CAT |
| vB_PsuS_Psul1 |
| 37,901 | 60.9 | 57 | tRNATrp(CCA) | CGGTCTCCAAAACCGAGGGTCGTGGGTTCGAGTCCCCCAACCCCTGCCAGT |
| vB_PthS_Pthi1 |
| 39,547 | 63.8 | 52 | tRNASer(GGA) | CCTCACCGTCCGCCA |
| vB_PyeM_Pyei1 |
| 50,161 | 65.5 | 75 | tRNAPro(TGG) | GACGGTTTTGGGTACCGTAGGCCGGAGGTTCGAATCCTCTCGCCCCGACCAG |
*Sequences are shown in the 5′ to 3′ orientation.
General properties of Paracoccus prophages identified in genomic sequences in the NCBI database.
| Host | Phage name | Family | Integration strategy* | Integration site | Genome size (bp) | % GC | No. of genes | Genome acc. no (coordinates) |
|---|---|---|---|---|---|---|---|---|
| vB_PamS_Pami1 |
| tyr | tRNA-Pro(TGG) | 43,882 | 61.6 | 57 | NC_022041 (725,048–768,929) | |
| vB_PamS_Pami2 |
| tyr | intergenic | 37,658 | 60.8 | 51 | NC_022041 (1,209,587–1,247,244) | |
| vB_PamS_Pami3 |
| tyr | tRNA-Arg(CCG) | 35,083 | 61.9 | 53 | NC_022041 (2,047,014–2,082,096) | |
| vB_PamS_Pami4 |
| tyr | partial lyase protein | 48,068 | 60.4 | 63 | NC_022041 (2,286,040–2,334,107) | |
| vB_PamS_Pami5 |
| tyr | tRNA-Gly(TCC) | 43,256 | 61.0 | 59 | NC_022041 (2,707,458–2,750,713) | |
| vB_PamS_Pami6 |
| ser | tRNA-Met(CAT) | 38,779 | 61.6 | 57 | NC_022041 (3,004,809–3,043,587) | |
| vB_PamS_PD1 |
| ser | not identified | 46,846 | 66.9 | 53 | NZ_FOPU01000032 (604–47,449) | |
| vB_PamP_PD2 |
| tyr | tRNA-Thr(CTG) | 42,749 | 63.7 | 80 | NZ_KQ955208 (101,365–144,113) | |
| vB_PamS_PD3 |
| ser | not identified | 41,904 | 67.7 | 48 | NZ_KQ955210 (20,997–179,000) | |
| vB_PcoS_PD4 |
| tyr | tRNA-Met(CAT) | 43,069 | 65.1 | 47 | NZ_CP020612 (535,143–578,211) | |
| vB_PcoS_PD5 |
| tyr | tRNA-Met(CAT) | 61,208 | 65.2 | 54 | NZ_CP020612 (1,101,277–1,162,484) | |
| vB_PcoS_PD6 |
| ser | not identified | 41,102 | 68.5 | 50 | NZ_CP020612 (2,589,532–2,630,633) | |
| vB_PcoS_PD7 |
| ser | not identified | 51,188 | 68.2 | 58 | NZ_CP020612 (2,662,598–2,713,785) | |
| vB_PdeP_PD8 |
| tyr | tRNA-Thr(TGT) | 43,858 | 64.6 | 73 | NZ_FNEA01000026, NZ_FNEA01000018 (58,894–61,814, 1–41,307) | |
| vB_PdeP_PD9 |
| tyr | tRNA-Thr(TGT) | 43,779 | 64.6 | 73 | NZ_FOYK01000026, NZ_FOYK01000018 (58,876–61,788, 1–40,867) | |
| vB_PdeS_PD10 |
| tyr | tRNA-Arg(TCT) | 41,212 | 65.1 | 61 | NZ_PPGA01000004 (118,900–160,111) | |
| vB_PdeS_PD11 |
| tyr | tRNA-Ser(GCT) | 42,731 | 64.4 | 61 | NC_008686 (318,321–361,051) | |
| vB_PdeP_PD12 |
| tyr | tRNA-Thr(TGT) | 43,827 | 64.6 | 75 | NC_008687 (875,986–919,812) | |
| vB_PhoS_PD13 |
| ser | not identified | 44,822 | 67.7 | 56 | NZ_FOHO01000007 (108,618–153,439) | |
| vB_PpaP_PD14 |
| tyr | tRNA-Arg(CTT) | 38,850 | 62.9 | 74 | NZ_FPKI01000005 (21–38,870) | |
| vB_PpaP_PD15 |
| tyr | putative ompR regulator | 42,547 | 61.9 | 72 | NZ_KI912520 (48,334–90,880) | |
| vB_PsaS_PD16 |
| tyr | tRNA-Gly(GCC) | 45,275 | 59.5 | 55 | NZ_FTOU01000011 (482–45,756) | |
| vB_PsaS_PD17 |
| tyr | not identified | 52,526 | 64.2 | 59 | NZ_JRKR01000006 (2–52,527) | |
| vB_PsaS_PD18 |
| tnp | not identified | 41,698 | 68.5 | 60 | NZ_JRKP01000001 (14,587–56,284) | |
| vB_PsaS_PD19 |
| tnp | not identified | 39,502 | 68.6 | 53 | NZ_JRKP01000010 (2–39,503) | |
| vB_PsaS_PD20 |
| ser | not identified | 41,827 | 68.4 | 48 | NZ_JRKT01000001 (8,678–50,504) | |
| vB_PsaS_PD21 |
| tyr | not identified | 38,594 | 65.3 | 46 | NZ_JRKT01000026 (2–38,595) | |
| vB_PsaS_PD22 |
| tyr | tRNA-Phe(GAA) | 53,314 | 66.0 | 59 | NZ_JRKQ01000001 (25,096–78,409) | |
| vB_PsaS_PD23 |
| ser | not identified | 61,802 | 66.5 | 70 | NZ_JRKQ01000003 (21,690–83,491) | |
| vB_PsaS_PD24 |
| tyr | tRNA-Met(CAT) | 49,592 | 65.1 | 52 | NZ_JRKQ01000004 (242–49,833) | |
| vB_PsaS_PD25 |
| tyr | not identified | 32,827 | 66.9 | 42 | NZ_JRKQ01000005 (3,432–36,258) | |
| vB_PsaS_PD26 |
| tnp | not identified | 44,577 | 68.5 | 56 | NZ_JRKQ01000008 (2–44,578) | |
| vB_PsaS_PD27 |
| tnp | not identified | 41,697 | 68.5 | 60 | NZ_FNNA01000001 (674,710–716,406) | |
| vB_PsaS_PD28 |
| tnp | not identified | 39,255 | 68.5 | 57 | NZ_FNNA01000006 (82,874–122,128) | |
| vB_PsaS_PD29 |
| tyr | tRNA-Met(CAT) | 51,410 | 65.3 | 58 | NZ_FNNA01000009 (123–51,532) | |
| vB_PseS_PD30 |
| tyr | tRNA-Met(CAT) | 48,532 | 64.7 | 48 | NZ_FZNM01000001 (93–48,624) | |
| vB_PsoS_PD31 |
| ser | not identified | 42,946 | 66.0 | 63 | NZ_FRCK01000001 (414,364–457,309) | |
| vB_PspS_PD32 |
| tyr | tRNA-Met(CAT) | 51,828 | 60.8 | 51 | NZ_CP025408 (420,882–472,709) | |
| vB_PspS_PD33 |
| tyr | tRNA-Met(CAT) | 50,035 | 63.6 | 58 | NZ_CP025408 (1,328,234–1,378,268) | |
| vB_PspS_PD34 |
| ser | not identified | 44,369 | 67.8 | 53 | NZ_CP025583 (807,987–852,355) | |
| vB_PspS_PD35 |
| ser | not identified | 50,207 | 63.2 | 54 | NZ_CP025583 (862,881–913,087) | |
| vB_PspS_PD36 |
| tyr | not identified | 44,896 | 64.7 | 53 | NZ_CP025583 (43,796–88,691) | |
| vB_PspS_PD37 |
| tyr |
| 40,213 | 63.6 | 65 | NZ_JAEN01000011 (46,163–86,375) | |
| vB_PspS_PD38 |
| tyr | tRNA-Pro(TGG) | 46,553 | 64.3 | 66 | NZ_AQUO01000001 (2,255,488–2,302,040) | |
| vB_PspS_PD39 |
| tyr | tRNA-Gly(CCC) | 37,866 | 62.7 | 55 | NZ_JXYF01000001 (25,265–63,130) | |
| vB_PspS_PD40 |
| tyr | Intergenic | 36,248 | 63.7 | 50 | NZ_JXYF01000039 (20–36,267) | |
| vB_PspP_PD41 |
| tyr | tRNA-Thr(GGT) | 48,266 | 64.6 | 63 | MEES01000006 (113,916–162,181) | |
| vB_PspS_PD42 |
| tyr | tRNA-Cys(GCA) | 42,396 | 63.7 | 55 | NZ_JPKW01000001 (529,373–571,768) | |
| vB_PspP_PD43 |
| tyr | tRNA-Lys(CAA) | 44,599 | 63.4 | 67 | NZ_JPKW01000003 (122,849–167,447) | |
| vB_PspS_PD44 |
| tyr | tRNA-Gln(TTC) | 41,926 | 63.9 | 71 | NZ_JPKW01000009 (114,600–156,525) | |
| vB_PspS_PD45 |
| tyr | tRNA-Met(CAT) | 58,117 | 66.0 | 59 | NZ_JRKS01000013 (81–58,197) | |
| vB_PveS_PD46 |
| ser | not identified | 41,696 | 67.6 | 49 | NZ_JRKO01000007 (57,139–98,834) | |
| vB_PyeS_PD47 |
| tyr | Intergenic | 56,744 | 61.8 | 59 | NZ_KK211402 (25,940–82,683) | |
| vB_PyeS_PD48 |
| ser | not identified | 43,101 | 68.1 | 55 | NZ_JHWH01000027 (8,448–51,578) | |
| vB_PyeS_PD49 |
| tyr | tRNA-Met(CAT) | 50,833 | 65.3 | 60 | NZ_KK211402 (315,002–365,834) | |
| vB_PyeM_PD50 |
| tyr | tRNA-Pro(TGG) | 54,200 | 65.0 | 80 | CP024422 (2,080,701–2,134,900) | |
| vB_PyeS_PD51 |
| tyr | tRNA-Asn(GTT) | 38,463 | 61.7 | 39 | CP024422 (2,509,005–2,547,467) | |
| vB_PyeS_PD52 |
| ser | not identified | 52,363 | 67.8 | 61 | CP024422 (2,660,340–2,712,702) | |
| vB_PyeS_PD53 |
| ser | not identified | 44,067 | 65.3 | 58 | CP024422.1 (2,725,351–2,769,417) |
*Names tyr, ser and tnp refer to tyrosine recombinase, serine recombinase and Mu-like transposase, respectively.
Figure 2Diversity of DNA MTases of Paracoccus (pro)phages and genomic location of their genes. (A) General diversity of identified MTases as a network of MTases (nodes) connected with lines (edges) that reflect at least 80% amino acid sequence identity over at least 75% sequence coverage. The colours of the nodes representing single MTases, reflect the target base of their methylation, except the half blue-half green nodes, which additionally indicates the presence of a ParB domain at the N-terminus. Several nodes are also marked with stars to indicate experimental verification of their specificity. The labels/numbers give the names of the phage from which each MTase originates (e.g. Pami4 from vB_PamS_Pami4, 11 from vb_PdeS_PD11). This corresponds to data presented in Table 3. Where more than one MTase gene is present within a (pro)phage genome, the suffixes “a”, “b” or “c” are added to the label/number, corresponding to their order in that genome. (B) Simplified schematic representation of phage genomes showing virus-specific gene modules and the location of the MTase genes. MT blocks are coloured according to MTase target base specificity. The prophages that share each genome arrangement are listed on the right side of the genome diagrams.
Figure 3Comparison of Paracoccus (pro)phage genomes. Whole-genome similarity analysis was performed using Circoletto with e-100 as the threshold. The ribbon colours reflect the percentage identity of particular genomic regions. The bars within the first ring represent subsequent phages. The next ring, comprised of histograms, shows the frequency of hits in certain regions of the analyzed genomes. The outer-most ring reflects the (pro)phage classification: orange – Siphoviridae, green – Myoviridae and violet – Podoviridae. Outer-most, gray curves indicate polylysogenic host strains: 1 – P. aminophilus JCM 7686, 2 – P. aminovorans HPD-2, 3 – P. contaminans RKI, 4 - P. denitrificans PD1222, 5 – P. sanguinis 39524, 6 – P. sanguinis 4681, 7 – P. sanguinis 5503, 8 – P. sanguinis DSM 29303, 9 – Paracoccus sp. BM15, 10 – Paracoccus sp. CBA4604, 11 – Paracoccus sp. S4493, 12 – Paracoccus sp. SCN 68–21, 13 – P. yeei ATCC BAA-599, 14 – P. yeei TT13.
Figure 4Protein-based similarity network of Paracoccus (pro)phages. The general clustering of (pro)phages based on their summarized proteomes (A), integrases (B), large terminase subunits (C) and major capsid proteins (D). Nodes represent a single (pro)phage, while edges correspond to the summarized quantity of reciprocally similar proteins. Orphan nodes are made transparent for better visibility. On (A) the size of the node corresponds to the number of prophages with which they share proteins. The nodes represent the following (pro)phages (Paracoccus strain): Pben1 – vB_PbeS_Pben1 (P. bengalensis); Pkon1 – vB_PkoS_Pkon1 (P. kondratieve); Psul1 – vB_PsuS_Psul1 (P. sulfuroxidans); Pthi1 – vB_PthS_Pthi1 (P. thiocyanatus); Pyei1 – vB_PyeM_Pyei1 (P. yeei CCUG 32053); IMEP1 – vB_PmaS_IMEP1 (P. marcusii); Shpa – vB_PmaS_Shpa (P. marinus); Pami1-Pami6 – vB_PamS_Pami1-vB_PamS_Pami6 (P. aminophilus); 1 – vB_PamS_PD1 (P. aminovorans DSM 8537); 2–3 – vB_PamP_PD2-vB_PamS_PD3 (P. aminovorans HPD-2); 4–7 – vB_PcoS_PD4-vB_PcoS_PD7 (P. contaminans); 8 – vB_PdeP_PD8 (P. denitrificans DSM 413); 9 – vB_PdeP_PD9 (P. denitrificans DSM 415); 10 – vB_PdeS_PD10 (P. denitrificans ISTOD1); 11–12 – vB_PdeS_PD11-vB_PdeP_PD12 (P. denitrificans PD1222); 13 – vB_PhoS_PD13 (P. homiensis); 14 – vB_PpaP_PD14 (P. pantotrophus DSM 1403); 15 – vB_PpaP_PD15 (P. pantotrophus J46); 16 – vB_PsaS_PD16 (P. saliphilus); 17 – vB_PsaS_PD17 (P. sanguinis 10990); 18–19 – vB_PsaS_PD18-vB_PsaS_PD19 (P. sanguinis 39524); 20–21 – vB_PsaS_PD20-vB_PsaS_PD21 (P. sanguinis 4681); 22–26 – vB_PsaS_PD22-vB_PsaS_PD26 (P. sanguinis 5503); 27–29 – vB_PsaS_PD27-vB_PsaS_PD29 (P. sanguinis DSM 29303); 30 – vB_PseS_PD30 (P. sediminis); 31 – vB_PsoS_PD31 (P. solventivorans); 32–33 – vB_PspS_PD32-vB_PspS_PD33 (Paracoccus sp. BM15); 34–36 – vB_PspM_PD34-vB_PspS_PD36 (Paracoccus sp. CBA4604); 37 – vB_PspS_PD37 (Paracoccus sp. J39); 38 – vB_PspS_PD38 (Paracoccus sp. N5); 39–40 – vB_PspS_PD39-vB_PspS_PD40 (Paracoccus sp. S4493); 41–44 – vB_PspP_PD41-vB_PspS_PD44 (Paracoccus sp. SCN 68–21); 45 – vB_PspS_PD45 (P. sphaerophysae); 46 – vB_PveS_PD46 (P. versutus); 47–49 – vB_PyeS_PD47-vB_PyeS_PD49 (P. yeei ATCC BAA-599); 50–53 – vB_PyeM_PD50-vB_PyeS_PD53 (P. yeei TT13).
Figure 5Protein-based similarity network analysis of Paracoccus (pro)phages and other bacteriophages retrieved from the NCBI Viruses database. (A) Overall similarity network of known bacterial phages. Nodes are coloured based on the taxonomy of the phage host (at phylum level, except Proteobacteria where classes are considered). Paracoccus (pro)phages are distinguished within the network. The host taxonomy is based on manually-curated qualifiers in the source section and organism name of the virus GenBank files. (B) Magnified image of Alphaproteobacteria (pro)phage network. The colour scheme is based on the host genus classification of the phages. The topology of the clustering of Paracoccus phages is the same as the one presented on Fig. 4, where (pro)phages were coloured based on their classification to Sipho-, Myo- and Podoviridae families.