| Literature DB >> 25547158 |
Julianne H Grose1, Garrett L Jensen, Sandra H Burnett, Donald P Breakwell.
Abstract
BACKGROUND: The Bacillus genus of Firmicutes bacteria is ubiquitous in nature and includes one of the best characterized model organisms, B. subtilis, as well as medically significant human pathogens, the most notorious being B. anthracis and B. cereus. As the most abundant living entities on the planet, bacteriophages are known to heavily influence the ecology and evolution of their hosts, including providing virulence factors. Thus, the identification and analysis of Bacillus phages is critical to understanding the evolution of Bacillus species, including pathogenic strains.Entities:
Year: 2014 PMID: 25547158 PMCID: PMC4464726 DOI: 10.1186/1471-2164-15-1184
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of reported phages with complete genome sequences
| Sub. | Phage Name | Host | Size (bp) | GC% | ORFS | tRNA | Accession # | Family | Ref. |
|---|---|---|---|---|---|---|---|---|---|
| A1 |
| A | 14319 | 36.84 | 27 | 0 | NC_022094 |
| [ |
| A1 | AP50 | A | 14398 | 38.65 | 31 | 0 | NC_011523 |
| [ |
| A2 | GIL16c | T | 14844 | 39.72 | 32 | 0 | NC_006945 |
| [ |
| A2 | Bam35c | T | 14935 | 40.08 | 31 | 0 | AY257 |
| [ |
| A2 | pGIL01 | T | 14931 | 39.73 | 30 | 0 | AJ536073 |
| [ |
| B1 |
| S | 19282 | 39.99 | 27 | 0 | EU771092 |
| |
| B1 | PZA | S | 19366 | 39.66 | 27 | 0 | M11813 |
| [ |
| B2 | B103 | S | 18630 | 37.66 | 17 | 0 | NC_004165 |
| [ |
| B2 | Nf | S | 18753 | 37.32 | 27 | 0 | EU622808 |
| [ |
| B3 | GA-1 | B | 21129 | 34.66 | 35 | 1 | X96987 |
| [ |
| C1 |
| M | 39844 | 40.70 | 48 | 0 |
|
| [ |
| C1 | Poppyseed | M | 39874 | 40.71 | 50 | 0 | KF669657 | ||
| C1 | Page | M | 39874 | 40.71 | 50 | 0 |
|
| [ |
| D1 |
| T | 37456 | 37.80 | 56 | 0 | NC_011645 |
| [ |
| D1 | BMBtp2 | T | 36932 | 37.79 | 53 | 0 | NC_019912 |
| [ |
| D1 | ProCM3 | T | 43278 | 37.36 | 66 | 0 | KF296717 |
| [ |
| E1 |
| A | 37373 | 35.13 | 53 | 0 | DQ289556 |
| [ |
| E1 | γ isolate 51 | A | 37253 | 35.22 | 53 | 0 | DQ222853 |
| [ |
| E1 | Wβ | A | 40867 | 35.26 | 53 | 0 | DQ289555 |
| [ |
| E1 | Gamma | A | 37253 | 35.22 | 53 | 0 | NC_007458 |
| [ |
| E1 | Cherry | A | 36615 | 35.27 | 51 | 0 | DQ222851 |
| [ |
| E1 | γ isolate 53 | A | 38067 | 35.10 | 50 | 0 | DQ222855 |
| [ |
| E1 | Fah | A | 37974 | 34.95 | 50 | 0 | NC_007814 |
| [ |
| E2 | phiCM3 | T | 38772 | 35.48 | 56 | 0 | NC_023599 |
| [ |
| E2 | phIS3501 | T | 44401 | 34.86 | 53 | 1 | JQ062992 |
| [ |
| E2 | BtCS33 | T | 41992 | 35.22 | 57 | 0 | NC_018085 |
| [ |
| E3 | BceA1 | C | 42932 | 35.66 | 63 | 0 | HE614282 |
| [ |
| F1 |
| C | 53104 | 36.42 | 86 | 0 | EU874396 |
| [ |
| F1 | 250 | C | 56505 | 36.44 | 54 | 0 | GU229986 |
| [ |
| G1 |
| P | 49259 | 41.91 | 79 | 0 | NC_020478 |
| [ |
| G1 | Gemini | P | 49362 | 41.9 | 79 | 0 | KC330681 |
| [ |
| G1 | Glittering | P | 49246 | 42.05 | 78 | 0 | NC_022766 |
| [ |
| G1 | Curly | P | 49425 | 41.82 | 77 | 0 | NC_020479 |
| [ |
| G1 | Eoghan | P | 49458 | 42.21 | 76 | 0 | NC_020477 |
| [ |
| G1 | Taylor | P | 49492 | 42.29 | 75 | 0 | KC330682 |
| [ |
| G1 | Riggi | P | 49836 | 41.46 | 79 | 0 | NC_022765 |
| [ |
| G1 | Blastoid | P | 50354 | 42.23 | 79 | 0 | NC_022773 |
| [ |
| G1 | Finn | P | 50161 | 41.69 | 77 | 0 | NC_020480 |
| [ |
| H1 | Staley | M | 81656 | 35.35 | 113 | 0 |
|
| [ |
| H1 | Slash | M | 80382 | 35.23 | 112 | 0 |
|
| [ |
| H2 | Basilisk | C | 81790 | 33.9 | 141 | 2 | KC595511 |
| [ |
| I1 |
| S | 132562 | 39.97 | 204 | 5 | NC_011421 |
| [ |
| I1 | CampHawk | S | 146193 | 40.2 | 229 | 2 | NC_022761 |
| [ |
| I2 | Shanette | C | 138877 | 40.8 | 223 | 3 | KC595513 |
| [ |
| I2 | JL | C | 137918 | 40.8 | 222 | 4 | KC595512 |
| [ |
| J1 |
| S | 155631 | 42.12 | 215 | 4 | NC_021856 | [ | |
| J1 | Grass | S | 156648 | 42.25 | 242 | 3 | NC_022771 |
| [ |
| J2 | SI0phi** | S | 146698 | 39.02 | 206 | 0 | KC699836 |
| |
| J3 | phiAGATE | P | 149844 | 49.97 | 204 | 4 | NC_020081 |
| [ |
| J4 | Bastille | T | 153962 | 38.14 | 273 | 7 | JF966203 |
| [ |
| J4 | Evoli | T | 159656 | 38.06 | 294 | 8 | KJ489398 |
| |
| J4 | Hoody T | T | 159837 | 38.01 | 270 | 8 | KJ489400 |
| |
| J4 | CAM003 | T | 160541 | 38.03 | 287 | 8 | KJ489397 |
| |
| J5 | B4 | C | 162596 | 37.71 | 277 | 0 | JN790865 |
| [ |
| J5 | Troll | T | 163019 | 37.83 | 289 | 0 | NC_022088 |
| |
| J5 | Spock | T | 164297 | 37.62 | 283 | 0 | NC_022763 |
| |
| J5 | BigBertha | T | 165238 | 37.77 | 287 | 0 | NC_022769 |
| [ |
| J5 | Riley | T | 162816 | 37.78 | 290 | 0 | NC_024788 |
| |
| J5 | B5S | C | 162598 | 37.71 | 272 | 0 | JN797796 |
| [ |
| J6 | BCP78 | C | 156176 | 39.86 | 227 | 18 | JN797797 |
| [ |
| J6 | BCU4 | C | 154371 | 39.86 | 223 | 19 | JN797798 |
| [ |
| J7 | BCP1 | C | 152778 | 39.76 | 227 | 17* | KJ451625 |
| |
| J7 | Bc431v3 | C | 158621 | 39.98 | 239 | 21 | JX094431 |
| [ |
| J8 | Hakuna | T | 158100 | 38.70 | 294 | 0 | KJ489399H |
| |
| J8 | Megatron | T | 158750 | 38.80 | 290 | 0 | KJ4894011H |
| |
| J8 | BPS10C | C | 159590 | 38.74 | 271 | 0 | NC_023501 |
| [ |
| J8 | BPS13 | C | 158305 | 38.75 | 268 | 0 | JN654439 |
| [ |
| J8 | W.Ph. | C | 156897 | 36.45 | 274 | 0 | HM144387 |
| [ |
| Single | MG-B1 | W | 27190 | 30.75 | 42 | 0 | NC_021336 |
| [ |
| Single | BV1 | B | 35055 | 44.85 | 54 | 0 | DQ840344 | ||
| Single | phBC6A52 | C | 38472 | 34.72 | 49 | 0 | NC_004821 | [ | |
| Single | phi105 | S | 39325 | 42.69 | 51 | 0 | NC_004167 |
| [ |
| Single | BCJA1C | Bc | 41092 | 41.74 | 59 | 0 | NC_006557 |
| [ |
| Single | PBC1 | C | 41164 | 41.68 | 50 | 0 | JQ619704 |
| [ |
| Single | SPP1 | S | 44010 | 43.72 | 81 | 0 | NC_004166 |
| [ |
| Single | PM1 | S | 50861 | 41.29 | 86 | 0 | NC_020883 |
| [ |
| Single | phBC6A51 | T | 61395 | 37.69 | 75 | 0 | NC_004820 | [ | |
| Single | BCD7 | C | 93839 | 38.04 | 140 | 0 | JN712910 |
| |
| Single | SPBc2 | S | 134416 | 34.64 | 185 | 0 | NC_001884 |
| [ |
| Single | SP10 | S | 143986 | 40.49 | 236 | 0 | NC_019487 |
| [ |
| Single | BanS-Tsamsa | A | 168876 | 34.32 | 272 | 19 | NC_023007 |
| [ |
| Single | 0305phi8-36 | T | 218948 | 41.8 | 247 | 2 | NC_009760 |
| [ |
| Single | G | M | 497513 | 29.93 | 675 | 18 | JN638751 |
|
The subcluster and cluster designation is given followed by the phage name. The founding phage for each cluster is in bold-italics. Hosts are the bacterial hosts on which the phages were isolated (not the host range) and are abbreviated as Bacillus anthracis (A), Bacillus cereus (C), Bacillus sp. (B), Bacillus clarkii (Bc), Bacillus megaterium (M), Bacillus pumilus (P), Bacillus subtilis (S), Bacillus thuringiensis (T), and Bacillus weihenstephanensis (W). ORFs are the number of Open Reading Frames predicted to be encoded by the genome as provided in the reported annotation. Family is Myoviridae (M), Siphoviridae (S) or Podoviridae (P). A reference (Ref) for the GenBank sequence is provided when available.
*tRNA predicted in this study using Aragorn and DNAMaster.
**Phage SI0phi is reported as an incomplete genome but is included in this analysis because it was complete enough to clearly assign it to a cluster.
Figure 1Nucleotide and amino acid dot plot analysis of 83 fully sequenced phages reveals 10 clusters (A-J) and 15 singletons. Nucleotide dot plot of Bacillus genomes of less than (A) or greater than (D) 100 kb organized by similarity reveals 10 clusters of related phages. Amino acid dot plot of Bacillus genomes of less than (C) or greater than (D) 100 kb organized by similarity reveals 10 clusters of related phages. Thick lines indicate cluster assignments, which are provided on the Y-axis (A-J). Dot plots were produced using Gepard [57] and whole genome amino acid sequences were retrieved from Phamerator [34].
Summary of cluster phage characteristics
| Cluster | Sub. | Phages | Hosts | Genome size | % GC | # ORFS (tRNA) | Type |
|---|---|---|---|---|---|---|---|
| A | 2 | 5 | A, T | 14685 ± 302 | 39.0 ± 1,3 | 30.2 ± 1.9(0) | T |
| B | 3 | 5 | B, S | 19432 ± 1001 | 37.9 ± 2.1 | 26.6 ± 6.3(0.3 ± 0.6) | P |
| C | 1 | 3 | M | 39864 ± 17 | 40.7 ± 0.0 | 49.3 ± 1.2(0) | P |
| D | 1 | 3 | C, T | 39222 ± 3522 | 37.7 ± 0.3 | 48.7 ± 10.2(0) | S |
| E | 3 | 11 | A, C, T | 39409 ± 2677 | 35.2 ± 0.2 | 53.8 ± 3.7(0.09 ± 0.3) | S |
| F | 1 | 2 | C | 54805 ± 2405 | 36.4 ± 0.0 | 70.0 ± 22.6(0) | S |
| G | 1 | 9 | M, P | 49621 ± 402 | 42.0 ± 0.3 | 77.7 ± 1.5(0) | S |
| H | 2 | 3 | C, M | 81276 ± 777 | 34.8 ± 0.8 | 122 ± 16.8(0.7 ± 1.2) | S |
| I | 2 | 4 | C, S | 138886 ± 5607 | 40.4 ± 0.4 | 220 ± 10.8(3.5 ± 1.3) | M |
| J | 8 | 23 | C, P, S, T | 158129 ± 4580 | 39.3 ± 2.7 | 261.0 ± 30.4(4.5 ± 6.6) | M |
Characteristics given are cluster assignment, number of subclusters (Sub.), number of phages in the cluster, host species from which the phages were isolated, the average genome size, average percent GC content, average number of ORFS with average number tRNA in parenthesis, and the morphotype. Averages are given with the standard deviation. Species abbreviations are Bacillus anthracis (A), Bacillus cereus (C), Bacillus sp. (B), Bacillus megaterium (M), Bacillus pumilus (P), Bacillus subtilis (S), Bacillus thuringiensis (T), and Bacillus westenstephanensis MG1, (W). Family/ Morphotype abbreviations are Tectiviridae (T), Podoviridae (P), Siphoviridae (S), and Myoviridae (M).
Figure 2Analysis of fully sequenced phage genomes belonging to clusters A through G reveals 12 subclusters. Subcluster divisions are provided by blue lines and are indicated on the Y-axis when there are more than one per cluster. Individual phages are separated by red lines. Phage names are provided on the X-axis and Y-axis with host abbreviation from which the phages were isolated indicated on the Y-axis. The founding phage for each cluster is bolded. Hosts abbreviations are Bacillus anthracis (A), Bacillus cereus (C), Bacillus sp. (B), Bacillus megaterium (M), Bacillus pumilus (P), Bacillus subtilis (S), Bacillus thuringiensis (T), and Bacillus weihenstephanensis MG1, (W). Dot plots were produced using Gepard [57]. Phage Andromeda is abbreviated (Andro.).
Figure 3Analysis of fully sequenced phage genomes belonging to clusters H through J reveals 12 subclusters. Subcluster divisions are provided by blue lines and are indicated on the Y-axis when there are more than one per cluster. Phages are separated by red lines. Phage names are provided on the X-axis and Y-axis with host abbreviation from which the phages were isolated provided first. The founding phage for each cluster is bolded. Hosts abbreviations are Bacillus anthracis (A), Bacillus cereus (C), Bacillus sp. (B), Bacillus megaterium (M), Bacillus pumilus (P), Bacillus subtilis (S), Bacillus thuringiensis (T), and Bacillus weihenstephanensis MG1, (W). Dot plots were produced using Gepard [57].
Figure 4Single gene amino acid dot plot analysis using the large terminase mirrors whole genome cluster assignment of phages. Bacillus phage clusters A-J are indicated on both the X-and Y-axis. Sequences for comparison were chosen by annotated large terminase gene products or a BlastP alignment to the closest relative when unannotated. Dot plots were produced using Gepard [57].
Figure 5A neighbor-joining tree analysis of the terminase mirrors whole genome cluster assignments. Phage names are colored by whole genome subcluster assignment, and this subcluster assignment is indicated on the right. Putative replication strategies for phages are also indicated when known. Abbreviations are direct, terminal repeats (DTR) and cohesive ends (cos). The phylogenetic tree was constructed using a MUSCLE [58] alignment and the neighbor-joining method in Mega5 [59]. Bootstrapping was set to 2000 and the unrooted tree was collapsed at a less than 50% bootstrap value.
Figure 6A comparison of gene content and order within the phage clusters reveals modularity and great diversity. Genome maps for representative phages from the subclusters within Bacillus phage clusters A-F are provided along with singleton MG-B1. Phages were mapped using Phamerator [34], where purple lines between phages denote regions of high nucleotide similarity and the ruler corresponds to genome base pairs. Boxes for gene products are labeled with predicted function, occasionally numbered, and colored to indicate similarity between the phages (E-value <1e - 4). Abbreviations are adenosine triphosphatase (ATPase), DnaB helicase (DNAB), double-stranded DNA binding (dsDNA binding), 2’-deoxyuridine 5’-triphosphatase (dUTPase), major capsid protein (MCP), N-acetyl-muramyl-L-alanine amidase (NAM amidase), pyrophosphate reductase (PP reductase) RNA polymerase (RNAP), sigma factor (σ factor), large terminase (TerL), small terminase (TerS), tape measure protein (TMP), pilus specific protein, ancillary protein involved in adhesion (SpaF1), single-stranded binding protein (SSB), single-strand recombinase (SS recombinase).
phage superclusters describe distantly related phages sharing significant proteome conservation
| Supercluster | Phages | % proteome conserved* |
|---|---|---|
| Gamma d’Herelle like | Clusters D, E and F, phage PBC1 | ~21% (D, E and F) |
| ~32% (F and PBC1) | ||
| SPO1-like | Clusters I, Cluster J | ~27% |
*Percent proteome conserved is the percentage conserved between two phages within different clusters as determined by CoreGenes.
Common, conserved phage proteins of predicted function with at least 20 homologues
| Pham # | Domain/Function | # Members | # Phages | Phages (cluster or phage name) |
|---|---|---|---|---|
|
| ||||
| 236 | DNA Polymerase I | 41 | 26 | I, J1, J2, J3, J4, J5, J6, J8, SP10, Bc431v3 |
| 274 | Chromosome Seg. ATPase | 38 | 25 | H2, J, BanS-Tsamsa |
| 247 | Class I RNR | 36 | 33 | H, I1, J, 0305φ836, BCD7, SP10, BanS-Tsamsa, SPB2 |
| 261 | Helicase | 28 | 28 | I, J, SP10 |
| 256 | SbcC-like Nuclease | 28 | 28 | I, J, SP10 |
| 257 | Endonuclease (SbcD) | 27 | 27 | I, J1, J2, J3, J4, J5, J6, J7, BPS10C, SP10, W.Ph., Hakuna, Megatron |
| 101 | Dihydrofolate Reductase | 27 | 27 | H2, J, BCD7, BanS-Tsamsa, G |
| 98 | Thymidylate Synthase | 24 | 24 | J, BCD7 |
| 99 | dNMP Kinase | 23 | 23 | J |
| 195 | Fumerate Reductase | 22 | 23 | J |
| 222 | Mre11 Nuclease | 23 | 23 | J |
| 232 | DNA Polymerase III | 23 | 23 | J |
| 238 | Histone | 23 | 23 | J |
| 252 | DUT | 23 | 23 | J |
| 258 | Replicative DNA Helicase | 23 | 23 | J |
| 229 | RecA-like | 23 | 23 | J |
| 254 | DNA primase | 23 | 23 | J |
| 246 | RNR beta subunit | 22 | 22 | H1, I1, J4, J5, J6, J8, 0305phi8-36 |
| 33 | Chromosome Seg. Protein | 22 | 22 | J1, J2, J4, J5, J6, J7, J8 |
| 244 | Glutaredoxin-like | 20 | 19 | J1, J2, J4, J5, J6, J8, SPBc2, BPS10c, BPS13, Megatron, W.Ph |
|
| ||||
| 278 | Tail Assembly Chaperone | 46 | 23 | J |
| 89 | Phage Terminase Large | 34 | 28 | I, J, SP10 |
| 264 | Adsorption Tail | 29 | 27 | I, J1, J3, J4, J5, J6, J7, J8 |
| 291 | MCP | 29 | 28 | I, J, SP10 |
| 276 | Tail Lysin | 29 | 28 | I, J, SP10 |
| 295 | Portal protein | 28 | 28 | I, J, SP10 |
| 266 | Baseplate | 28 | 28 | I, J, SP10 |
| 283 | Structural Protein | 28 | 28 | I, J, SP10 |
| 284 | Tail Sheath | 28 | 28 | I, J, SP10 |
| 293 | Prohead Protease | 28 | 27 | I, J2, J3, J4, J5, J6, J7, J8, phiNIT1, SP10 |
| 265 | Baseplate J | 27 | 23 | J |
| 273 | Tail Lysin | 27 | 24 | J, BanS-Tsamsa |
| 277 | Major Capsid L1 Protein | 24 | 24 | J, SP10 |
| 88 | Terminase | 23 | 23 | J |
|
| ||||
| 276 | Cell Wall Hydrolase | 28 | 28 | I, J, SP10 |
| 282 | Murein Transglycosylase | 28 | 28 | I1, J, 0305φ8-36, SP10, BanS-Tsamsa |
| 226 | Holin | 23 | 23 | J |
| 198 | Holin | 23 | 23 | J |
| 267 | Lysozyme-like | 23 | 23 | J |
|
| ||||
| 35 | Bacterial SH3-like | 25 | 25 | C, E1, F, J5, J8, BanS-Tsamsa, PhiCM3 |
| 370 | FtsK DNA Translocase | 23 | 23 | C, E1, G, BV1, PM1, phIS3501, phBC6A51 |
| 260 | cAMP Regulatory Protein | 23 | 23 | J |
| 269 | MisL-like transporter | 23 | 23 | J |
| 292 | Membrane Protein | 23 | 23 | J |
| 155 | Metallo-beta-lactamase | 21 | 20 | J4, J5, J6, J7, J8, BanS-Tsamsa |
Abbreviations include deoxynucleotide monophosphate kinase (dNMP kinase), chromosome segregation adenosine triphosphatase (chromosome seg ATPase), ribonucleotide diphosphate reductase (RNR), deoxyuridine nucleotidylhydrolase (DUT), and major capsid protein (MCP). Gene products are given and are organized by basic function (DNA Replication/Metabolism, Virion Structure and Assembly, Cell Lysis Proteins or Gene Regulation/Host Functions).
*Pham #’s are specific to this analysis due to assignment by Phamerator [34].
Common, conserved phage proteins of uncharacterized function with over 20 homologues
| Pham # | # Members | # Phages | Phages (cluster or phage name) |
|---|---|---|---|
|
| |||
| 248 | 28 | 28 | I, J, SP10 |
| 289 | 28 | 28 | I, J, SP10 |
| 263 | 24 | 24 | I1, J1, J3, J4, J5, J6, J7, J8 |
| 288 | 24 | 23 | J |
| 235 | 23 | 23 | J |
| 268 | 23 | 23 | J |
| 269 | 23 | 23 | J |
| 272 | 23 | 23 | J1, J2, J4, J5, J6, J7, J8, BanS-Tsamsa |
| 92 | 23 | 23 | J |
| 194 | 23 | 23 | J |
| 208 | 23 | 23 | J |
| 239 | 23 | 23 | J |
| 286 | 23 | 23 | J |
| 227 | 23 | 23 | J |
| 302 | 23 | 23 | J |
| 304 | 23 | 23 | J |
| 200 | 22 | 22 | J1, J2, J3, J4, J5, J6, J7, Hakuna, Megatron, BPS10C, W. Ph. |
| 225 | 22 | 22 | J1, J2, J4, J5, J6, J7, J8 |
| 228 | 22 | 22 | Grass, J2, J3, J4, J5, J6, J7, J8 |
| 255 | 22 | 22 | J1, J2, J4, J5, J6, J7, J8 |
| 287 | 22 | 22 | J1, J2, J3, J4, J5, J6, J7, Hakuna, Megatron, BPS10C, W. Ph. |
| 290 | 22 | 22 | J1, J2, J4, J5, J6, J7, J8 |
| 296 | 22 | 22 | J1, J2, J3, J4, J5, J6, J7, Hakuna, Megatron, BPS10C, W. Ph. |
| 301 | 22 | 22 | Grass, J2, J3, J4, J5, J6, J7, J8 |
| 97 | 20 | 20 | J3, J4, J5, J6, J7, J8 |
| 122 | 20 | 20 | J3, J4, J5, J6, J7, J8 |
*Pham #’s are specific to this analysis due to assignment by Phamerator [34].