| Literature DB >> 27738559 |
Casey Stamereilers1, Lucy LeBlanc2, Diane Yost1, Penny S Amy1, Philippos K Tsourkas1.
Abstract
American Foulbrood Disease, caused by the bacterium Paenibacillus larvae, is one of the most destructive diseases of the honeybee, Apis mellifera. Our group recently published the sequences of 9 new phages with the ability to infect and lyse P. larvae. Here, we characterize the genomes of these P. larvae phages, compare them to each other and to other sequenced P. larvae phages, and putatively identify protein function. The phage genomes are 38-45 kb in size and contain 68-86 genes, most of which appear to be unique to P. larvae phages. We classify P. larvae phages into 2 main clusters and one singleton based on nucleotide sequence identity. Three of the new phages show sequence similarity to other sequenced P. larvae phages, while the remaining 6 do not. We identified functions for roughly half of the P. larvae phage proteins, including structural, assembly, host lysis, DNA replication/metabolism, regulatory, and host-related functions. Structural and assembly proteins are highly conserved among our phages and are located at the start of the genome. DNA replication/metabolism, regulatory, and host-related proteins are located in the middle and end of the genome, and are not conserved, with many of these genes found in some of our phages but not others. All nine phages code for a conserved N-acetylmuramoyl-L-alanine amidase. Comparative analysis showed the phages use the "cohesive ends with 3' overhang" DNA packaging strategy. This work is the first in-depth study of P. larvae phage genomics, and serves as a marker for future work in this area.Entities:
Keywords: American Foulbrood; DNA packaging strategy; N-acetylmuramoyl-L-alanine amidase; Paenibacillus larvae; Phamerator; bacteriophage; comparative genomics; comparative proteomics; endolysin; large terminase; siphoviridae
Year: 2016 PMID: 27738559 PMCID: PMC5056774 DOI: 10.1080/21597081.2016.1220349
Source DB: PubMed Journal: Bacteriophage ISSN: 2159-7073
Geographical origin and isolation source of P. larvae phages.
| Phage name | Geographical location | Isolation Source |
|---|---|---|
| Diane | OH | Infected larva, ATCC culture 25747 |
| Fern | USDA lab Germantown MD | Infected larva, P. larvae wild strain 2231 |
| Harrison | Gilcrease Orchards, N. Las Vegas, NV | Soil |
| Hayley | Gilcrease Orchards, N. Las Vegas, NV | Soil |
| Paisley | PA | Soil |
| Vadim | NV | Lip balm |
| Vegas | NV | Lip balm |
| Willow | Near Bremerton, WA | Soil |
| Xenia | USDA lab Germantown MD | Infected larva |
Lysogenic phage from ATCC culture 25747 isolated in Ohio by White from an infected insect
Lysogenic phage from P. larvae wild strain 2231 isolated from an infected larva scale
Isolated from commercial products purchased in NV
Figure 1.Scanning electron micrographs of phages (A) Diane, (B) Fern, and (C) Hayley.
Accession numbers and genome assembly results of our P. larvae phages.
| Phage name | GenBank accession number | Genome length | Av. coverage depth | Min. coverage depth | GC content | Percent coding |
|---|---|---|---|---|---|---|
| Diane | KT361657 | 45,653 | 67 | 1 | 43.7 | 95.7 |
| Fern | KT361649 | 37,995 | 502 | 98 | 41.9 | 93.7 |
| Harrison | KT361651 | 44,247 | 291 | 61 | 40.2 | 93.6 |
| Hayley | KT361655 | 44,256 | 43 | 1 | 43.5 | 95.4 |
| Paisley | KT361653 | 44,172 | 350 | 58 | 40.0 | 93.5 |
| Vadim | KT361656 | 45,653 | 94 | 1 | 43.7 | 95.7 |
| Vegas | KT361654 | 45,653 | 128 | 1 | 43.7 | 95.7 |
| Willow | KT361650 | 37,994 | 122 | 50 | 41.9 | 93.7 |
| Xenia | KT361652 | 41,149 | 123 | 41 | 41.5 | 93.2 |
Comparative genomics of our P. larvae phages.
| Phage name | No. of genes | No. of genes with BLAST E-value <0.001 | No. of genes with putative function | Genes found in non- | Genes found only in | Gene unique to this phage |
|---|---|---|---|---|---|---|
| Diane | 86 | 83 | 45 | 22 | 64 | 0 |
| Fern | 68 | 65 | 36 | 18 | 50 | 0 |
| Harrison | 84 | 75 | 38 | 23 | 60 | 1 |
| Hayley | 84 | 81 | 43 | 21 | 63 | 0 |
| Paisley | 84 | 75 | 38 | 23 | 60 | 1 |
| Vadim | 86 | 83 | 45 | 22 | 64 | 0 |
| Vegas | 86 | 83 | 45 | 22 | 64 | 0 |
| Willow | 68 | 65 | 36 | 18 | 50 | 0 |
| Xenia | 77 | 72 | 43 | 20 | 48 | 9 |
Figure 2.Dotplot of the genomes for 9 new P. larvae phages. A black dot is placed where there is nucleotide identity between 2 phages.
Figure 3.Percent nucleotide sequence identity matrix for all 17 sequenced P. larvae phages. Phages are classified into clusters and subclusters based on nucleotide sequence identity.
Figure 4.Genome maps of our phages obtained from Phamerator (first half). Boxes represent genes, with boxes of the same color indicating genes in the same pham. Genes in a pham of their own are uncolored. Shaded areas indicate regions of high nucleotide sequence similarity between phages, with purple indicating the highest degree of similarity, and red the lowest.
Figure 5.Genome maps of our P. larvae phages obtained from Phamerator (second half). Boxes represent genes, with boxes of the same color indicating genes in the same pham. Genes in a pham of their own are uncolored. Shaded areas indicate regions of high nucleotide sequence similarity between phages, with purple indicating the highest degree of similarity, and red the lowest.
P. larvae phage genes with statistically significant BLAST and/or CDD matches (E-value < 1E-3) to proteins with known function. The gene product number is shown in the first row of each cell, and the pham number is shown in the second row, italicized in parentheses. Rows are colored according to protein function. We classify phage proteins into 6 functional categories: 1) virion particle (blue), 2) virion assembly (burgundy), 3) host lysis (purple), 4) DNA replication/metabolism (tan), 5) gene regulation, including putative transcription factors (green), and 6) host-related functions (yellow). Gene products whose function cannot be classified into these 6 categories due to lack of sufficient information or conflicting information are left uncolored. Instances where there are 2 or more unrelated functions with statistically significant matches are marked with a footnote, with the more statistically significant (or with higher bit score) function listed in the table, and the less statistically significant (or with lower bitscore) function listed in the footnotes at the end of the table.
| Diane/Vadim/Vegas | Hayley | Harrison | Paisley | Fern/Willow | Xenia | |
|---|---|---|---|---|---|---|
| Small terminase | gp1 | gp1 | gp1 | gp1 | gp1 | gp1 |
| Large terminase | gp2 | gp2 | gp2 | gp2 | gp2 | gp2 |
| Portal protein | gp3 | gp3 | gp3 | gp3 | gp3 | gp3 |
| Clp protease ClpP | gp4 | gp4 | gp4 | gp4 | gp4 | gp4 |
| Major capsid protein | gp5 | gp5 | gp5 | gp5 | gp5 | gp5 |
| Head-tail connector protein | gp7 | gp7 | gp7 | gp7 | gp7 | gp7 |
| Head-tail adaptor protein | gp8 | gp8 | gp8 | gp8 | gp8 | gp8 |
| Head-tail joining protein | gp9 | gp9 | gp9 | gp9 | ||
| Prohead protease | gp10 | gp10 | gp10 | gp10 | ||
| Tail protein | gp10 | gp10 | ||||
| Major tail protein | gp11 | gp11 | gp11 | gp11 | gp11 | gp11 |
| Tail tape measure protein | gp14 | gp14 | gp14 | gp14 | gp14 | gp14 |
| Tail protein | gp15 | gp15 | gp15 | gp15 | gp15 | gp15 |
| Endopeptidase tail protein | gp16 | gp16 | gp16 | gp16 | gp16 | gp16 |
| Tail protein | gp17 | gp17 | gp17 | gp17 | gp17 | gp17 |
| Bacteriocin biosynthesis protein | gp20 | gp20 | gp20 | gp20 | gp20 | gp20 |
| N-acetylmuramoyl- | gp21 | gp21 | gp21 | gp21 | gp21 | gp21 |
| Acetyltransferase | gp22 | gp22 | ||||
| Transposase | gp23 | gp23 | gp23 | gp23 | gp22 | gp22 |
| ABC-like transporter protein | gp25 | gp25 | ||||
| Amidase domain protein | gp26 | |||||
| Transposase | gp30 | gp29 | ||||
| Transposase | gp31 | |||||
| Toxin-like protein | gp26 | |||||
| YolD-like protein | gp28 | gp35 | ||||
| Toxin | gp30 | gp30 | ||||
| Ankyrin-repeat containing protein | gp31 | |||||
| Arc-like DNA binding protein | gp32 | gp30 | ||||
| Transposase | gp36 | |||||
| Transposase | gp37 | |||||
| Transposase | gp38 | |||||
| Stress protein | gp35 | gp33 | ||||
| Integrase | gp38 | gp36 | gp32 | |||
| Toxin | gp39 | gp37 | ||||
| Transcriptional regulator | gp35 | gp35 | ||||
| Telomeric repeat binding factor 2 | gp37 | gp37 | ||||
| Accessory gene regulator B | gp39 | gp39 | ||||
| Dipeptidyl aminopeptidase/ acylaminoacyl-peptidase | gp41 | gp41 | ||||
| Serine recombinase | gp44 | gp44 | ||||
| Helix-turn-helix domain XRE family transcriptional regulator | gp40 | gp38 | ||||
| Helix-turn-helix domain XRE family transcriptional regulator | gp41 | gp39 | gp45 | gp45 | gp33 | |
| Helix-turn-helix Cro/C1 family transcriptional regulator | gp34 | |||||
| Helix-turn-helix domain transcriptional regulator | gp35 | |||||
| Helix-turn-helix domain XRE family transcriptional regulator | gp46 | gp46 | ||||
| Rha-like antirepressor | gp41 | |||||
| Transcriptional regulator | gp42 | |||||
| Helix-turn-helix domain XRE family transcriptional regulator | gp43 | |||||
| Helix-turn-helix domain XRE family transcriptional regulator | gp44 | |||||
| Restriction endonuclease | gp43 | gp41 | ||||
| DNA binding antirepressor | gp45 | gp43 | gp37 | gp47 | ||
| Excisionase | gp46 | gp44 | ||||
| Antirepressor | gp53 | gp53 | gp41 | gp51 | ||
| AAA domain ATPase | gp45 | gp55 | ||||
| DNA recombinational protein RecT | gp47 | gp57 | ||||
| Metallo-hydrolase | gp48 | gp58 | ||||
| Primosome component-like protein | gp50 | gp60 | ||||
| Replicative DNA helicase | gp51 | gp61 | ||||
| yopX family protein | gp55 (16645 | |||||
| DNA methylase | gp61 | gp59 | ||||
| Single-stranded DNA binding protein, ERF superfamily | gp61 | gp61 | ||||
| Single-stranded DNA binding protein | gp62 | gp62 | ||||
| AbrB family transcriptional regulator | gp64 | gp62 | ||||
| Replication terminator protein | gp65 | gp63 | ||||
| Primosome, DnaD subunit | gp65 | |||||
| DNA damage-inducible protein DnaD | gp65 | |||||
| DNA replication protein | gp66 | gp66 | ||||
| Chromosome segregation protein SMC | gp67 | gp65 | ||||
| AAA domain ATPase | gp68 | gp66 | ||||
| DEAD/DEAH box helicase | gp70 | gp68 | ||||
| ABC-type transport system, ATP binding protein | gp71 | gp69 | ||||
| DNA primase | gp72 | gp70 | ||||
| Endodeoxyribonuclease RusA | gp69 | gp69 | ||||
| Crossover junction endodeoxyribonuclease RusA | gp75 | gp73 | ||||
| DNA–cytosine methyltransferase | gp72 | gp72 | gp63 | |||
| RuvC-like resolvase | gp62 | gp66 | ||||
| RNA polymerase sigma factor | gp77 | gp75 | ||||
| ArpU family/ArpU-like transcriptional regulator | gp78 | gp78 | gp64 | gp68 | ||
| RinA transcriptional activator-like protein | gp79 | gp77 | ||||
| Serine recombinase | gp70 | |||||
| Phosphomannomutase | gp80 | gp78 | gp71 (16819) | |||
| Toxin HicA | gp81 | gp79 | ||||
| Antitoxin HicB | gp82 | gp80 | gp80 | gp80 | gp65 | gp72 |
| Toxin HicA | gp81 | gp81 | gp66 | gp73 | ||
| Transglycosylase | gp83 | gp81 | gp83 | gp83 | gp67 | gp 76 |
| HNH endonuclease | gp86 | gp84 | gp84 | gp84 | gp68 | gp77 |
Also has equally strong BLAST and CDD matches to DNA packaging protein
Also has equally strong BLAST and CDD matches to head-tail joining protein
CDD matches only (Evalue=1E-123)
Also has strong BLAST and CDD matches to bhlA protein
Also has equally strong BLAST matches to DNA methyltransferase
Also has strong BLAST matches to holin
Also has equally strong BLAST matches to peptidase domain
Also has equally strong BLAST and CDD matches to integrase
Also has strong BLAST matches to toxin-like protein, DNA Smf single strand binding protein, transcriptional regulatory protein YclJ, phosphatase, transposase
Also has equally strong BLAST matches to integrase, ATPase, resolvase, invertase
Also has strong BLAST and CDD matches to peptidase
Also has strong BLAST and CDD matches to excisionase
Also has strong BLAST matches to Xre-like protein
Also has strong BLAST matches to repressor
Also has equally strong BLAST and CDD matches to Rha family transcriptional regulator
Also has equally strong BLAST and CDD matches to chromosomal replication initiator protein DnaA
Also has equally strong BLAST matches to DNA recombination protein RecF
Also has strong BLAST matches to oxidoreductase, putative DNA helicase, putative RecA NTPase, ATP-dependent Lon protease
Also has equally strong BLAST matches to RecA familyATPase
Also has equally strong BLAST and CDD matches to ycfA-like protein
Figure 6.Multiple alignment and percent amino acid sequence identity matrix of P. larvae phage large terminases. With the exception of Lily, all P. larvae phages have a large terminase that is either identical to that of Diane (Diane, Hayley, Vadim, Vegas, Harrison, Paisley), or Xenia (Xenia, Fern, Willow, Diva, Rani, Redbud, Shelly, Sitara, HB10c2, phiIBB_Pl23).
Figure 7.Multiple alignment and average amino acid identity matrix of P. larvae phage N-acetylmuramoyl-L-alanine amidases. There are 5 distinct P. larvae phage N-acetylmuramoyl-L-alanine amidases, with phages in the same group having an identical N-acetylmuramoyl-L-alanine amidase. Group 1 consists of phages Diane, Vadim, Vegas and Hayley, Group 2 consists of phages Harrison, Paisley and phiIBB_Pl23, Group 3 consists of phages Willow and Fern, Group 4 consists of phages Xenia, Shelly, Diva, and Sitara, and Group 5 consists of phages HB10c2, Rani, and Redbud.