| Literature DB >> 28106814 |
Marina de Leeuw1, Maayan Baron2, Asher Brenner3, Ariel Kushmaro4,5,6.
Abstract
Bacteriophages are viruses that infect bacteria, and consequently they have a major impact on the development of a microbial population. In this study, the genome of a novel broad host range bacteriophage, Aquamicrobium phage P14, isolated from a wastewater treatment plant, was analyzed. The Aquamicrobium phage P14 was found to infect members of different Proteobacteria classes (Alphaproteobacteria and Betaproteobacteria). This phage contains a 40,551 bp long genome and 60% of its genes had blastx hits. Furthermore, the bacteriophage was found to share more than 50% of its genes with several podoviruses and has the same gene order as other polyvalent bacteriophages. The results obtained in this study led to the conclusion that indeed general features of the genome of the Aquamicrobium phage P14 are shared with other broad host range bacteriophages, however further analysis of the genome is needed in order to identify the specific mechanisms which enable the bacteriophage to infect both Alphaproteobacteria and Betaproteobacteria.Entities:
Keywords: Aquamicrobium; bacteriophage; genome; polyvalent; wastewater
Year: 2017 PMID: 28106814 PMCID: PMC5295034 DOI: 10.3390/genes8010040
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Negatively stained transmission electron microscopy image of the Aquamicrobium phage P14.
Figure 2The genome of the Aquamicrobium phage P14. Open reading frames (ORFs) with blastx hits [29] are indicated in color. Yellow: hypothetical proteins; orange: lysozyme; blue: phage packaging (assembly); purple: internal virion proteins; green: structure proteins; and red: all other proteins.
Open reading frames on the genome of the Aquamicrobium phage P14 and their blastx [29] hits. Sources of similar proteins that are phages are marked in bold. Only blastx matches with an E-value smaller than 0.00001 are listed.
| ORF | Start | End | Strand | Protein Length | Blastx Match | E-Value | Source | Accession No. |
|---|---|---|---|---|---|---|---|---|
| ORF1 | 40092 | 149 | + | 202 | lysozyme | 1.00 × 10−24 | WP_043138231.1 | |
| putative lysozyme | 4.00 × 10−22 | YP_005098158.1 | ||||||
| ORF2 | 133 | 342 | + | 69 | none | |||
| ORF3 | 395 | 508 | + | 37 | none | |||
| ORF4 | 2014 | 2211 | + | 65 | none | |||
| ORF5 | 2232 | 2795 | + | 187 | hypothetical protein | 2.00 × 10−39 | YP009151803.1 | |
| ORF6 | 2818 | 2982 | + | 54 | none | |||
| ORF7 | 3027 | 3191 | + | 54 | none | |||
| ORF8 | 3188 | 3394 | + | 68 | hypothetical protein | 1.00 × 10−18 | YP008859401.1 | |
| ORF9 | 3559 | 4107 | + | 182 | seryl/threonyl protein kinase | 7.00 × 10−5 | YP008766718.1 | |
| ORF10 | 4367 | 4549 | + | 60 | none | |||
| ORF11 | 4551 | 4742 | + | 63 | none | |||
| ORF12 | 4718 | 4843 | + | 41 | none | |||
| ORF13 | 4919 | 5215 | + | 98 | none | |||
| ORF14 | 5218 | 5475 | + | 85 | none | |||
| ORF15 | 5486 | 5863 | + | 125 | none | |||
| ORF16 | 6011 | 7024 | + | 337 | none | |||
| ORF17 | 7028 | 7534 | + | 168 | none | |||
| ORF18 | 7531 | 7692 | + | 53 | none | |||
| ORF19 | 7689 | 7919 | + | 76 | none | |||
| ORF20 | 8101 | 8727 | + | 208 | putative DNA primase | 2.00 × 10−53 | CDL65258.1 | |
| ORF21 | 8697 | 10010 | + | 437 | putative DNA helicase | 4.00 × 10−146 | ADD21652.1 | |
| ORF22 | 10012 | 10680 | + | 222 | none | |||
| ORF23 | 10677 | 13109 | + | 810 | DNA polymerase A family protein | 0.0 | KGV49475.1 pfam00476 | |
| putative DNA polymerase | 0.0 | BAP15824.1 | ||||||
| ORF24 | 13106 | 13984 | + | 292 | hypothetical protein | 8.00 × 10−60 | AJY44207.1 | |
| hypothetical protein | 1.00 × 10−46 | YP005098178.1 | ||||||
| ORF25 | 13984 | 14958 | + | 324 | putative exonuclease | 3.00 × 10−76 | CDK30092.1 | |
| ORF26 | 15181 | 14993 | − | 62 | DNA endonuclease P40 | 3.00 × 10−17 | YP002922654.1 | |
| ORF27 | 15292 | 15573 | + | 93 | none | |||
| ORF28 | 15552 | 16331 | + | 259 | RNase H superfamily protein | 3.00 × 10−119 | WP_060050861.1 pfam13482 | |
| DNA exonuclease | 2.00 × 10−102 | YP008853863.1 | ||||||
| ORF29 | 16345 | 16809 | + | 154 | hypothetical protein | 4.00 × 10−67 | WP061238191.1 | |
| hypothetical protein | 3.00 × 10−29 | NP818474.1 | ||||||
| ORF30 | 16806 | 17015 | + | 69 | none | |||
| ORF31 | 17015 | 17860 | + | 281 | ATP-dependent DNA ligase | 8.00 × 10−67 | YP008859411.1 | |
| ORF32 | 17869 | 20283 | + | 804 | DNA-dependent RNA polymerase | 0.0 | ALF01667.1 | |
| ORF33 | 20400 | 20588 | + | 62 | hypothetical protein | 8.00 × 10−5 | ALF01668.1 | |
| ORF34 | 20677 | 21033 | + | 118 | hypothetical protein | 1.00 × 10−14 | ADD21663.1 | |
| ORF35 | 21404 | 22930 | + | 508 | head-to-tail joining protein | 2.00 × 10−172 | ALF01671.1 | |
| ORF36 | 22934 | 23722 | + | 262 | hypothetical protein | 2.00 × 10−21 | EIP87426.1 | |
| scaffold-like protein | 7.00 × 10−20 | ADD21667.1 | ||||||
| ORF37 | 23820 | 24815 | + | 331 | major capsid protein | 2.00 × 10−98 | ADD21668.1 | |
| ORF38 | 24897 | 25502 | + | 201 | tail tuber protein A | 4.00 × 10−48 | CDK30105.1 | |
| ORF39 | 25499 | 28045 | + | 848 | tail tubular protein B | 0.0 | ADD21670.1 | |
| ORF40 | 28057 | 28875 | + | 272 | hypothetical protein | 1.00 × 10−10 | ADD21671.1 | |
| internal virion protein | 7.00 × 10−7 | YP008859423.1 | ||||||
| ORF41 | 28885 | 31227 | + | 780 | hypothetical protein | 2.00 × 10−43 | ADD21672.1 | |
| internal virion protein | 1.00 × 10−38 | YP008858912.1 | ||||||
| ORF42 | 31250 | 35209 | + | 1319 | internal virion protein | 0.0 | ADD21673.1 | |
| ORF43 | 35267 | 37195 | + | 642 | tail fiber protein | 5.00 × 10−32 | ADD21674.1 | |
| ORF44 | 37195 | 37605 | + | 136 | tail fiber assembly protein | 3.00 × 10−24 | BAQ90231.1 | |
| ORF45 | 37625 | 37795 | + | 56 | none | |||
| ORF46 | 37782 | 38057 | + | 91 | putative DNA maturase A | 1.00 × 10−10 | ADD21677.1 | |
| ORF47 | 38057 | 39814 | + | 585 | terminase large subunit | 0.0 | ALF01684.1 | |
| ORF48 | 39831 | 40088 | + | 85 | hypothetical protein | 4.00 × 10−4 | YP005098203.1 |
Figure 3Genomic organization of the Aquamicrobium phage P14 compared to the Pseudomonas phage Bf7 (NC_016764.1), the Burkholderia phage Bp-AMP1 (HG793132.1), Xylella phage Prado (NC_022987.1) and Pseudomonas phage phiKMV (NC_005045.1). Coding sequences encoding proteins with similar functions are colored in the same color. Hypothetical proteins are marked by diagonal lines. A match between the proteins was either defined by similarity of the amino acid sequences (blastx hits with E-value < 0.00001) or proteins with the same function as defined in their entry.
Figure 4Phylogenetic tree constructed based on the DNA sequences encoding the tail fiber proteins of several phages. Phages that are also present in Table 2 are highlighted in orange. The best model for the phylogenetic tree construction was found by jmodeltest 2.1.10 [35] to be GTR+G+I. The model was then constructed using Mega6 [36] with 500 bootstrap replications.
CoreGenes3.0 correlation values for Aquamicrobium phage P14 versus several phages from Table 1.
| Phage | Hosts | CoreGenes3.0 Correlation | Genome Length | GC Content | Classification |
|---|---|---|---|---|---|
| - | 40,551 bp | 57.8% | |||
| 47.92% | 42,409 bp | 61.8% | |||
| 47.92% | 42,112 bp | 61.8% | |||
| 54.17% | 41,581 bp | 61.2% | |||
| 47.27% | 44,773 bp | 60.9% | |||
| 22.92% | 39,860 bp | 48.6% | |||
| 47.92% | 40,058 bp | 58.4% | |||
| 50.00% | 44,360 bp | 60.9% | |||
| 20.83% | 44,080 bp | 55.6% | |||
| 47.92% | 43,869 bp | 60.2% | |||
| 52.08% | 43,940 bp | 63.0% |
Figure 5Phylogenetic relationship between selected terminase large subunit DNA sequences. Phages that are classified as phiKMV-like are highlighted in magenta. The best model for the phylogenetic tree construction was found by jmodeltest 2.1.10 [35] to be GTR + G + I. The model was then constructed using Mega6 [36] with 500 bootstrap replications.
Figure 6Alignment using tblastx (minimum E-value: 0.00001, minimum aligned query length: 10 amino acids, and minimum identity cutoff: 25%) of the Aquamicrobium phage P14 genome against the genomes of Pseudomonas phage Bf7 (red; NC_016764.1); Xylella phage Prado (green; NC_022987.1) and the Burkholderia phage Bp-AMP1 (blue; HG793132.1); The outer circle shows the coding sequences marked by arrows. The two inner circles show the GC content (black); and GC skew + (green) and the GC skew − (magenta).