| Literature DB >> 29270250 |
Ananda Shankar Bhattacharjee1,2, Amir Mohaghegh Motlagh1,3, Eddie B Gilcrease4, Md Imdadul Islam1, Sherwood R Casjens4,5, Ramesh Goel1.
Abstract
A lytic bacteriophage RG-2014 infecting a biofilm forming multidrug resistant bacterium Delftia tsuruhatensis strain ARB-1 as its host was isolated from a full-scale municipal wastewater treatment plant. Lytic phage RG-2014 was isolated for developing phage based therapeutic approaches against Delftia tsuruhatensis strain ARB-1. The strain ARB-1 belongs to the Comamonadaceae family of the Betaproteobacteria class. RG-2014 was characterized for its type, burst size, latent and eclipse time periods of 150 ± 9 PFU/cell, 10-min, <5-min, respectively. The phage was found to be a dsDNA virus belonging to the Podoviridae family. It has an isometric icosahedrally shaped capsid with a diameter of 85 nm. The complete genome of the isolated phage was sequenced and determined to be 73.8 kbp in length with a G + C content of 59.9%. Significant similarities in gene homology and order were observed between Delftia phage RG-2014 and the E. coli phage N4 indicating that it is a member of the N4-like phage group.Entities:
Keywords: Bacteriophage; Biofilm; Biofouling; Delftia tsuruhatensis; Genome; Multidrug resistant; Podoviridae
Year: 2017 PMID: 29270250 PMCID: PMC5735904 DOI: 10.1186/s40793-017-0290-y
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Negative strain transmission electron micrographs of (a) RG-2014 virions (scale bar represents 100 nm), (b) RG-2014 infecting D. tsuruhatensis ARB1 (scale bar represents 1 μm) and (c) Pulsed field electrophoresis gel strained with acridine orange; Lane 1, Molecular weight marker (numbers shown are in kbp); Lane 2, 2 μg of DNA from phage RG-2014 virions; lane 3, same as lane 2 with 0.5 μg of phage DNA
Classification and general features of Delftia tsuruhatensis ARB-1 bacteriophage RG-2014
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain Viruses | TAS [ | |
| Kingdom Viruses | TAS [ | ||
| Phylum: unassigned | TAS [ | ||
| Class: dsDNA viruses, no RNA phase | TAS [ | ||
| Order: | TAS [ | ||
| Family: | TAS [ | ||
| Genus: N4likevirus | TAS [ | ||
| Species: unassigned | |||
| (Type) strain: RG-2014 (KM879221.2) | |||
| Gram stain | Not applicable | TAS [ | |
| Virion shape | Icosahedral | IDA | |
| Motility | non-motile | IDA | |
| Sporulation | Not reported | IDA | |
| Temperature range | 20–38 °C | IDA | |
| Optimum Temperature | 37 °C | IDA | |
| pH range; Optimum | 6.5–7.6 | IDA | |
| Carbon Source | Host cell | IDA | |
| MIGS-6 | Habitat | Wastewater | IDA |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen | Facultative aerobic | IDA |
| MIGS-15 | Biotic relationship | Obligate intracellular parasite of | IDA |
| MIGS-14 | Pathogenicity | Infective phage of | IDA |
| MIGS-4 | Geographic location | Central Valley Water Reclamation Facility, UT, USA | IDA |
| MIGS-5 | Sample collection time | 02/01/2011, 11:00 AM | IDA |
| MIGS-4.1 | Latitude | 40.7056 | IDA |
| MIGS-4.2 | Longitude | 111.913953 | IDA |
| MIGS-4.3 | Depth | Surface | IDA |
| MIGS-4.4 | Altitude | 0 m |
aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature)
Fig. 2Phylogenetic tree highlighting the position of major coat protein of phage RG-2014 relative to major coat proteins of other hosts. The corresponding GenBank accession numbers for each phage coat protein is indicated in parenthesis. Eleven other types of Podoviridae are included below the N4-like group for comparison. The tree construction used MUSCLE model to align the protein sequences by MEGA (v.5), and the Maximum-likelihood algorithm to construct a distance matrix based on alignment model positions using bootstrap method with 1000 bootstrap replications
Project information of Delftia tsuruhatensis ARB-1 bacteriophage RG-2014
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Closed |
| MIGS-28 | Libraries used | N/A |
| MIGS-29 | Sequencing platforms | Illumina MiSeq Benchtop |
| MIGS-31.1 | Fold coverage | 20× |
| MIGS-30 | Assemblers | CLC genomics workbench v. 7.0.3 |
| MIGS-32 | Gene calling method | GeneMarkS |
| Locus Tag | RG2014 | |
| Genome database release | Genbank | |
| Genbank ID | KM879221.2 | |
| Genbank Date of Release | Oct, 8, 2014; Mar, 17, 2017 (Corrected genome release date) | |
| GOLD ID | Go0332698 | |
| BIOPROJECT | PRJNA287956 | |
| MIGS 13 | Source Material Identifier | Personal culture collection |
| Project relevance | Virulence, Bacteriophage based biocontrol |
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 73,882 | 100.00 |
| DNA Coding (bp) | 69,793 | 93.90 |
| DNA G + C (bp) | 44,247 | 59.90 |
| DNA scaffold | 0 | 0.00 |
| Total genes | 88 | 100.00 |
| Protein-coding genes | 88 | 100.00 |
| RNA genes | 0 | 0.00 |
| Pseudo genes | 0 | 0.00 |
| Genes in internal clusters | 0 | 0.00 |
| Genes with function prediction | 21 | 23.86 |
| Genes assigned to COGs | 10 | 9.09 |
| Genes with Pfam domains | 12 | 13.64 |
| Genes with signal peptides | 2 | 2.27 |
| Genes with transmembrane helices | 13 | 14.77 |
| CRISPR repeats | 0 | 0.00 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with the 25 general COG functional categories
| Code | Value | % agea | Description |
|---|---|---|---|
| J | 0 | 0 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 2 | 2.27 | Transcription |
| L | 2 | 2.27 | Replication, recombination and repair |
| B | 0 | 0 | Chromatin structure and dynamics |
| D | 0 | 0 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 0 | 0 | Defense mechanisms |
| T | 0 | 0 | Signal transduction mechanisms |
| M | 1 | 1.14 | Cell wall/membrane biogenesis |
| N | 1 | 1.14 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 0 | 0 | Intracellular trafficking and secretion |
| O | 0 | 0 | Posttranslational modification, protein turnover, chaperones |
| C | 0 | 0 | Energy production and conversion |
| G | 0 | 0 | Carbohydrate transport and metabolism |
| E | 0 | 0 | Amino acid transport and metabolism |
| F | 2 | 2.27 | Nucleotide transport and metabolism |
| H | 0 | 0 | Coenzyme transport and metabolism |
| I | 0 | 0 | Lipid transport and metabolism |
| P | 0 | 0 | Inorganic ion transport and metabolism |
| Q | 0 | 0 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 2 | 2.27 | General function prediction only |
| S | 1 | 1.14 | Function unknown |
| – | 77 | 87.5 | Not in COGs |
aThe total is based on the total number of protein coding genes in the annotated genome
Delftia phage RG-2014 gene prediction
| Gene | Strand | Number of codons | Predicted function | Organism with best match | N4 genea | Gene accession no. | % Idb | E-valueb |
|---|---|---|---|---|---|---|---|---|
| 1 | + | 101 | Conserved hypothetical protein |
| – | AIU44254 | 32 | 0.002 |
| 2 | + | 139 | Conserved hypothetical protein |
| 2 | AHC56518 | 36 | 2e-21 |
| 3 | + | 121 | Hypothetical protein | – | – | – | – | – |
| 4 | + | 122 | Conserved hypothetical protein |
| 14 | CBW47037 | 57 | 3e-45 |
| 5 | + | 109 | Hypothetical protein | – | – | – | – | – |
| 6 | + | 115 | Hypothetical protein | – | – | – | – | – |
| 7 | – | 104 | Hypothetical protein | – | – | – | – | – |
| 8 | + | 105 | Hypothetical protein | – | – | – | – | – |
| 9 | + | 50 | Hypothetical protein | – | – | – | – | – |
| 10 | + | 69 | Hypothetical protein | – | – | – | – | – |
| 11 | + | 186 | Conserved Hypothetical protein |
| – | YP 009001006 | 32 | 3e-22 |
| 12 | + | 285 | Conserved hypothetical protein |
| – | CYTR01000018 | 38 | 2e-26 |
| 13 | + | 108 | Conserved hypothetical protein |
| 3 | YP_007348409 | 29 | 3e-04 |
| 14 | + | 137 | Hypothetical protein | – | – | – | – | – |
| 15 | + | 89 | Hypothetical protein | – | – | – | – | – |
| 16 | + | 44 | Hypothetical protein | – | – | – | – | – |
| 17 | + | 77 | Hypothetical protein | – | – | – | – | – |
| 18 | + | 142 | Conserved hypothetical protein |
| – | YP_009148381 | 55 | 3e-37 |
| 19 | + | 193 | Hypothetical protein | – | – | – | – | – |
| 20 | + | 76 | Conserved hypothetical protein |
| – | YP_008873216 | 40 | 5e-09 |
| 21 | + | 217 | Hypothetical protein | – | – | – | – | – |
| 22 | + | 272 | RNA polymerase I subunit |
| 15 | ANJ65251 | 54 | 1e-102 |
| 23 | + | 432 | RNA polymerase II subunit |
| 16 | AAL71577 | 47 | 1e-135 |
| 24 | + | 181 | Virion decoration protein |
| 17 | YP_009208670 | 36 | 1e-10 |
| 25 | + | 157 | Hypothetical protein | – | – | – | – | – |
| 26 | + | 155 | Hypothetical protein | – | – | – | – | – |
| 27 | + | 122 | Hypothetical protein | – | – | – | – | – |
| 28 | + | 82 | Hypothetical protein | – | – | – | – | – |
| 29 | + | 80 | Hypothetical protein | – | – | – | – | – |
| 30 | + | 115 | Hypothetical protein | – | – | – | – | – |
| 31 | + | 242 | Hypothetical protein | – | – | – | – | – |
| 32 | + | 209 | Hypothetical protein | – | – | – | – | – |
| 33 | + | 359 | Conserved hypothetical protein |
| 24 | AHI60096 | 46 | 7e-104 |
| 34 | + | 127 | Conserved hypothetical protein |
| – | YP009208682 | 41 | 7e-11 |
| 35 | + | 92 | Hypothetical protein | – | – | – | – | – |
| 36 | + | 405 | Conserved hypothetical protein |
| 25 | ABK54394 | 39 | 1e-86 |
| 37 | + | 170 | dCTP deaminase |
| 26 | AHN83412 | 51 | 4e-53 |
| 38 | + | 78 | Hypothetical protein | – | – | – | – | – |
| 39 | + | 124 | Hypothetical protein | – | – | – | – | – |
| 40 | + | 140 | Hypothetical protein | – | – | – | – | – |
| 41 | + | 169 | Hypothetical protein | – | – | – | – | – |
| 42 | + | 121 | Hypothetical protein | – | – | – | – | – |
| 43 | + | 103 | Hypothetical protein | – | – | – | – | – |
| 44 | + | 73 | Hypothetical protein | – | – | – | – | – |
| 45 | + | 317 | Thymidylate synthase |
| – | KU726251 | 48 | 2e-101 |
| 46 | + | 104 | Conserved hypothetical protein |
| 35 | YP_950513 | 59 | 1e-36 |
| 47 | + | 135 | Conserved hypothetical protein |
| – | YP_008129928 | 66 | 5E-54 |
| 48 | + | 197 | Nucleotide pyrophospho-hydrolase |
| – | YP_009199508 | 33 | 3e-13 |
| 49 | + | 436 | DNA helicase |
| 37 | AHC56567 | 48 | 4e-137 |
| 50 | + | 172 | Conserved hypothetical protein |
| 38 | YP_009004756 | 34 | 2e-27 |
| 51 | + | 884 | DNA polymerase |
| 39 | ABK54408 | 60 | 0.0 |
| 52 | + | 127 | Hypothetical protein | – | – | – | – | – |
| 53 | + | 286 | Conserved Hypothetical protein |
| – | YP 009037286 | 47 | 1e-12 |
| 54 | + | 327 | Conserved hypothetical protein |
| 41 | AEL79653 | 45 | 7e-97 |
| 55 | + | 724 | DNA primase |
| 42 | ALA45517 | 62 | 0.0 |
| 56 | + | 249 | Conserved hypothetical protein |
| 43 | ABK54413 | 57 | 3e-100 |
| 57 | + | 253 | Single-stranded DNA-binding protein |
| 44 | AEJ81593 | 38 | 5e-37 |
| 58 | + | 372 | Conserved hypothetical protein |
| – | YP_008240188 | 43 | 4e-24 |
| 59 | + | 61 | Hypothetical protein | – | – | – | – | – |
| 60 | + | 65 | Hypothetical protein | – | – | – | – | – |
| 61 | + | 235 | Hypothetical protein | – | – | – | – | – |
| 62 | + | 102 | Hypothetical protein | – | – | – | – | – |
| 63 | + | 59 | Hypothetical protein | – | – | – | – | – |
| 64 | + | 98 | Conserved Hypothetical protein |
| – | YP 007003301 | 40 | 3e-10 |
| 65 | + | 288 | Possible transcriptional regulator |
| AEY69538 | 38 | 5e-44 | |
| 66 | + | 110 | Hypothetical protein | – | – | – | – | – |
| 67 | + | 172 | Hypothetical protein |
| – | YP_003359016 | 39 | 1e-10 |
| 68 | – | 3413 | Virion RNA polymerase |
| 50 | ALA45523 | 42 | 0.0 |
| 69 | – | 712 | Lysozyme-like domain virion structural protein |
| 51 | AFR52010 | 25 | 5e-18 |
| 70 | – | 135 | Conserved hypothetical protein |
| – | AHC56583 | 75 | 2e-38 |
| 71 | – | 921 | Conserved hypothetical protein |
| 53 | ALA45526 | 36 | 1e-168 |
| 72 | – | 300 | Virion structural protein |
| 54 | AAO24827 | 50 | 2e-101 |
| 73 | – | 265 | Conserved hypothetical protein |
| 55 | ALA45528 | 38 | 3e-47 |
| 74 | – | 411 | Major capsid protein |
| 56 | ALA45529 | 66 | 0.0 |
| 75 | – | 281 | Conserved hypothetical protein |
| 57 | AFV29058 | 38 | 7e-42 |
| 76 | – | 116 | Hypothetical protein |
| 58 | YP_007005822 | 71 | 0.006 |
| 77 | – | 138 | Conserved Hypothetical protein |
| – | CBW47064 | 28 | 0.002 |
| 78 | – | 766 | Portal protein |
| 59 | ANJ65209 | 59 | 0.0 |
| 79 | – | 170 | Lysis / possible Rz-like spanin |
| 60 | AHC94031 | 40 | 4e-21 |
| 80 | – | 201 | Lysis / N-acetylmuramidase |
| 61 | AEL79672 | 52 | 7e-71 |
| 81 | – | 108 | Conserved hypothetical protein |
| 63 | ABK54424 | 34 | 1e-17 |
| 82 | – | 416 | Conserved hypothetical protein |
| 64 | ALA45537 | 64 | 0.0 |
| 83 | – | 1388 | Tail sheath and receptor binding virion protein |
| 65 | ALA45538 | 51 | 0.0 |
| 84 | – | 140 | Hypothetical protein | – | – | – | – | – |
| 85 | – | 234 | Possible virion appendage protein |
| 66 | AHI60147 | 44 | 2e-67 |
| 86 | – | 536 | Large terminase subunit |
| 68 | AFR52033 | 61 | 0.0 |
| 87 | – | 228 | Conserved hypothetical protein |
| 69 | ABK54430 | 46 | 2e-61 |
| 88 | – | 340 | Conserved hypothetical protein |
| 49 | ALA45543 | 34 | 1e-17 |
a E. coli phage N4 is the best characterized and therefore the prototypical member of this phage group
b% identity and e-value determined by BLASTp at NCBI web site; unless otherwise noted, values are listed if the patch of similarity includes ≥60% of the protein
cAll phages in this column are in the N4-like group except AH2, øW-14, SEGD1, PG1, PaMx74 and PPpw-3
dSequence similarity only in N-terminal region
eSequence similarity only in C-terminal region
Delftia phage RG 2014 annotation using conserved domain database*
| Gene | Evidence | E value | Bit Score | Accession |
|---|---|---|---|---|
| 4 | cl10259 superfamily | 2.22E-55 | 167.72 | Cl10259 |
| 12 | MTTB superfamily | 0.004977 | 36.9774 | Cl15385 |
| 15 | MDR superfamily | 0.0037317 | 33.0936 | Cl16912 |
| 22 | Pha00452 | 1.96E-05 | 44.2438 | Pha00452 |
| 23 | RNA_pol superfamily | 4.77E-09 | 56.9554 | Cl20211 |
| 24 | Big_2 | 0.00242 | 34.2896 | Pfam02368 |
| 24 | Big_2 superfamily | 3.49E-07 | 39.6898 | Cl02708 |
| 24 | Cog5492 | 3.72E-09 | 53.2664 | Cog5492 |
| 33 | Aaa | 6.72E-05 | 41.3627 | Cd00009 |
| 33 | ABC_atpase superfamily | 6.72E-05 | 41.3627 | Cl21455 |
| 36 | Vwfa | 0.0009673 | 38.3158 | Cd00198 |
| 36 | Vwfa superfamily | 1.20E-20 | 85.5169 | Cl00057 |
| 36 | DUF2201_N superfamily | 9.26E-31 | 117.611 | Cl16157 |
| 37 | Trimeric_dutpase | 3.53E-13 | 60.5857 | Cd07557 |
| 37 | Trimeric_dutpase superfamily | 2.62E-23 | 89.0534 | Cl00493 |
| 45 | TS_Pyrimidine_hmase | 5.70E-91 | 268.76 | Cd00351 |
| 45 | TS_Pyrimidine_hmase superfamily | 5.17E-137 | 387.525 | Cl19097 |
| 48 | NTP-ppase superfamily | 0.002418 | 35.1816 | Cl16941 |
| 49 | ABC_atpase superfamily | 2.77E-17 | 78.3824 | Cl21455 |
| 49 | Uvrd_C_2 | 8.48E-08 | 47.1547 | Pfam13538 |
| 49 | Uvrd_C_2 superfamily | 8.48E-08 | 47.1547 | Cl22491 |
| 49 | Aaa_30 | 2.77E-17 | 78.3824 | Pfam13604 |
| 49 | Cog1112 | 9.34E-05 | 43.4113 | Cog1112 |
| 51 | DNA_pol_A superfamily | 1.80E-26 | 110.198 | Cl02626 |
| 51 | DNAq_like_exo superfamily | 0.0005505 | 40.4172 | Cl10012 |
| 55 | Prict_1 | 1.28E-07 | 47.6526 | Pfam08708 |
| 55 | Prict_1 superfamily | 1.28E-07 | 47.6526 | Cl07362 |
| 56 | ABC_atpase superfamily | 0.0005674 | 38.4072 | Cl21455 |
| 63 | Prk14085 | 0.0005556 | 34.1837 | Prk14085 |
| 64 | DUF2829 | 1.18E-16 | 66.5176 | Pfam11195 |
| 64 | DUF2829 superfamily | 1.18E-16 | 66.5176 | Cl12744 |
| 65 | Parbc | 0.0004658 | 37.3039 | Pfam02195 |
| 65 | Parbc superfamily | 0.0003729 | 37.2839 | Cl02129 |
| 66 | DUF1178 | 0.0021343 | 34.0766 | Pfam06676 |
| 67 | Extradiol_Dioxygenase_3B_like superfamily | 0.0057676 | 34.7714 | Cl00599 |
| 69 | Lt_gewl | 1.36E-18 | 80.5286 | Cd00254 |
| 69 | Lysozyme_like superfamily | 1.36E-18 | 80.5286 | Cl00222 |
| 70 | Polyadenylate-binding_protein_3 | 0.0067594 | 34.0122 | Tigr01628 |
| 72 | DUF3584 | 0.0060894 | 36.9891 | Pfam12128 |
| 74 | Hypothetical_protein | 5.26E-76 | 237.638 | Tigr04387 |
| 74 | P22_coatprotein superfamily | 5.26E-76 | 237.638 | Cl22542 |
| 78 | Cog4913 | 0.001198 | 41.1603 | Cog4913 |
| 79 | Prk09039 | 0.000734 | 37.6381 | Prk09039 |
| 80 | Glyco_hydro_108 superfamily | 9.31E-23 | 86.0288 | Cl09583 |
| 80 | PG_binding_3 superfamily | 0.0001066 | 38.2277 | Cl09627 |
| 86 | COG5362 superfamily | 3.02E-08 | 51.3532 | Cl02216 |
| 88 | Phage_gp49_66 | 2.28E-21 | 85.3759 | Pfam13876 |
| 88 | Phage_gp49_66 superfamily | 2.28E-21 | 85.3759 | Cl10351 |
*Evidence of gene functions provided by blast analysis using conserved domain database (e-value ≤10−5)
Fig. 3Whole genome comparison of Delftia phage RG-2014 (KM872991.2) phage to E. coli phage N4 (NC_008720). The Figure was generated with Easyfig [38]. Genomes were aligned using Easyfig [38]. The functions of genes in phage N4 are shown above and predicted functions of RG-2014 genes are indicated below the maps
Fig. 4Dotplot of N4-like phage genomes. Phage genomes were arranged in the same orientation and a dot plot was constructed by Gephard [39] with a word length setting of 11. The phages in the figure include the current extant diversity among the N4-like phages; those that are not included are very similar to one of the phages that is included (their sequences are all in GenBank and can be retrieved by searching with their names). In the plot thin red lines separate the phage genomes, and thick red lines separate the most clearly delineated subtypes. At the right, the genus (red text), family (black text) and class (blue text) of each phage’s host bacteria are indicated; vertical very thick red lines on the right indicate phages that infect the same host genus, and very thick blue lines mark host families