| Literature DB >> 35923395 |
Henrique G Ribeiro1,2, Anna Nilsson3, Luís D R Melo1,2, Ana Oliveira1,2.
Abstract
Paenibacillus larvae is the etiological agent of American Foulbrood (AFB), a highly contagious and worldwide spread bacterial disease that affects honeybee brood. In this study, all complete P. larvae genomes available on the NCBI database were analyzed in order to detect presence of prophages using the PHASTER software. A total of 55 intact prophages were identified in 11 P. larvae genomes (5.0 ± 2.3 per genome) and were further investigated for the presence of genes encoding relevant traits related to P. larvae. A closer look at the prophage genomes revealed the presence of several putative genes such as metabolic and antimicrobial resistance genes, toxins or bacteriocins, potentially influencing host performance. Some of the coding DNA sequences (CDS) were present in all ERIC-genotypes, while others were only found in a specific genotype. While CDS encoding toxins and antitoxins such as HicB and MazE were found in prophages of all bacterial genotypes, others, from the same category, were provided by prophages particularly to ERIC I (enhancin-like toxin), ERIC II (antitoxin SocA) and ERIC V strains (subunit of Panton-Valentine leukocidin system (PVL) LukF-PV). This is the first in-depth analysis of P. larvae prophages. It provides better knowledge on their impact in the evolution of virulence and fitness of P. larvae, by discovering new features assigned by the viruses.Entities:
Keywords: Paenibacillus larvae; bacterial evolution; bacterial fitness; bacterial virulence; prophage
Year: 2022 PMID: 35923395 PMCID: PMC9341999 DOI: 10.3389/fmicb.2022.903861
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
Paenibacillus larvae strains and respective reference name, accession number, genome sequencing method and coverage, genotype classification, GC content, genome size, number of contigs, and respective sizes.
| Host | Strain | Reference strain | Accession no. | Sequencing method | Genome coverage | ERIC genotype | GC% | Size (Mbp) | No. of contigs | Size range of contigs (Kbp) | Prophages (validated) | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total | Intact | Defective | |||||||||||
| 3* | DSM 25719 | DSM 25719 | NZ_ADFW00000000.1 | Sanger dideoxy sequencing; 454; Illumina | 94 | I | 44.1 | 4.58 | 8 | 8.1–3,664 | 21 | 8 | 13 |
| 5 | MEX14 | NZ_LAWY00000000 | 454 | 50 | I | 44.0 | 4.19 | 139 | 0.5–213.6 | 17 | 3 | 14 | |
| 6 | ATCC 9545 | ATCC 9545 | NZ_CP019687.1 | PacBio | 147.4 | I | 44.2 | 4.29 | NA | NA | 13 | 5 | 8 |
| 7 | ATCC 13537 | ATCC 13537 | NZ_CP019794.1 | PacBio | 56.4 | IV | 44.3 | 4.41 | NA | NA | 16 | 3 | 13 |
| 8 | CCM 38 | CCUG 7429 | NZ_CP020327.1 | PacBio | 150.8 | IV | 44.3 | 4.33 | NA | NA | 15 | 5 | 10 |
| 9* | SAG 10367 | SAG 10367 | NZ_CP020557.1 | PacBio | 214.2 | II | 44.1 | 4.67 | NA | NA | 18 | 7 | 11 |
| 10 | ERIC_I | DSM 7030 | NZ_CP019651.1 | PacBio; Illumina HiSeq2500 | 193 | I | 44.2 | 4.29 | NA | NA | 15 | 5 | 10 |
| 11* | ERIC_III | LMG 16252 | NZ_CP019655.1 | PacBio; Illumina HiSeq2500 | 114 | III | 44.2 | 4.49 | NA | NA | 18 | 7 | 11 |
| 12 | ERIC_IV | LMG 16247 | NZ_CP019659.1 | PacBio; Illumina HiSeq2500 | 113 | IV | 44.3 | 4.27 | NA | NA | 15 | 3 | 12 |
| 13 | DSM 25430; ERIC_II | DSM 25430 | NZ_CP019652.1 | PacBio; Illumina HiSeq2500 | 153 | II | 45.0 | 4.02 | NA | NA | 12 | 1 | 11 |
| 14* | ERIC_V | DSM 106052 | CP019717.1 | PacBio; Illumina HiSeq2500 | 257 | V | 44.1 | 4.67 | NA | NA | 21 | 8 | 13 |
| Average GC content | 44.3 | Total validated | 181 | 55 | 126 | ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|
|
|
| |||||||||||
| 1 | BRL-230010 | NZ_AARF00000000.1 | 454 | 50 | I | 44.1 | 3.98 | 646 | 0.25–58.6 | 23 | 0 | 23 | |
| 2 | B-3650 | LMG 16245 | NZ_ADZY00000000.3 | Sanger; Illumina | 1; 100 | I | 44.1 | 4.35 | 353 | 0.05–331.7 | 9 | 0 | 9 |
| 4 | DSM 25430 | DSM 25430 | NC_023134.1 | Sanger dideoxy sequencing; 454; Illumina | 64 | II | 45 | 4.05 | NA | NA | 8 | 0 | 8 |
The last three columns refer to the number of total, intact and defective prophages present in each strain, after manual validation. *Hosts with genome size >4.49 Mbp and with ≥7 no. of intact prophages. The second table details data from strains excluded from the analysis.
Host 5 (MEX14) classified as ERIC I without experimental validation; homology and position in the ERIC I branch of phylogenetic tree available on NCBI database.
The high number of contigs available restricted an accurate analysis.
Sequence reported by Djukic et al. (2014), identical to the obtained latter (re-sequenced) by Beims et al. (2020). NA, not applicable.
Figure 1Prophage prevalence in Paenibacillus larvae genomes: (A) Percentage of hosts with ≥ one and ≥ five intact prophages and ≥ one and ≥ eight defective prophages. (B) Whisker plots of prophage frequency per bacterial genome (total, defective and intact) before and after manual curing. Raw data provided directly from PHASTER, cured data results from manual verification. The horizontal line of each box represents the average prophages per genome and the external edges to the minimum/maximum number. (C) Average of total, defective, and intact prophages present per host genome. The error bars indicate the SD. Statistically significant, if value of p < 0.05 (*).
Figure 2Number of prophages per size of host Paenibacillus larvae genomes: (A) total (B) intact (C) defective.
Figure 3Average number of prophages (intact and defective) present per ERIC genotype (ERIC I-V). The error bars indicate the SD. Statistically significant, if value of p < 0.05 (*).
Figure 4List of categories Cluster of Orthologous Groups (COG). Frequency (%) of prophage-derived CDS with a given function per COG.
Coding DNA sequences (CDS) identified in prophages potentially influencing host virulence or fitness.
| ERIC I | ERIC II | ERIC III | ERIC IV | ERIC V | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| CDS | Host 3 | Host 5 | Host 6 | Host 10 | Host 9 | Host 13 | Host 11 | Host 7 | Host 8 | Host 12 | Host 14 |
| TetR family transcriptional regulator (TetR/AcrR) | |||||||||||
| β-Lactamase superfamily domain [metal β-lactamases (MBL) fold metallo-hydrolase Yycl] | |||||||||||
| β-Lactamase inhibitory proteins (BLIP) | |||||||||||
| ATP-binding cassette (ABC) group | |||||||||||
| Aromatic acid exporter family protein | * | ||||||||||
| Major facilitator superfamily (MFS) transporter | |||||||||||
| Multidrug efflux small multidrug resistance (SMR) transporter | |||||||||||
| Efflux transporter-like protein | * | ||||||||||
| Iron–sulfur (Fe-S) cluster assembly protein SufB | * | * | |||||||||
| NifU family iron–sulfur (Fe-S) cluster assembly protein—SUF system | * | * | |||||||||
| Metal β-lactamases (MBL) fold metallo-hydrolase | |||||||||||
| Phosphomannomutase | |||||||||||
| Transglycosylase | |||||||||||
| Toxin HicA | |||||||||||
| Antitoxin HicB | |||||||||||
| Antitoxin MazE | |||||||||||
| Antitoxin SocA | * | ||||||||||
| AbrB/MazE/SpoVT family DNA-binding domain-containing protein | |||||||||||
| Bacterial toxin 44 | * | ||||||||||
| Toxin | |||||||||||
| Toxin-like protein | * | ||||||||||
| Bacteriocin biosynthesis protein | * | ||||||||||
| Epsilon-toxin type B (EtxB) | |||||||||||
| Leukocidin LukF-PV precursor = leukotoxin | |||||||||||
| Closticin | * | ||||||||||
| DNA internalization-related competence protein ComEC/Rec2 | * | * | |||||||||
| Segregation and condensation protein B (ScpB) | |||||||||||
| YopX family protein | |||||||||||
| Enhancin / phosphohydrolase | * | ||||||||||
| Virulence-associated protein E | |||||||||||
| ImmA/IrrE family metallo-endopeptidase | |||||||||||
| Esterase family protein | * | ||||||||||
| S8 family serine peptidase | |||||||||||
| Histidine kinase-like protein | * | ||||||||||
| Pyruvate dehydrogenase E1 component subunit alpha | * | ||||||||||
| Metallopeptidase | |||||||||||
| FtsX-like permease family protein | * | ||||||||||
| GlcNAc-chitin binding protein (GbpA) | * | ||||||||||
| Dipeptidyl aminopeptidase/acylaminoacyl-peptidase | |||||||||||
| GTP pyrophosphokinase-like protein | * | ||||||||||
| Host-nuclease inhibitor protein Gam (Gam) | |||||||||||
| MazG-like nucleotide pyrophosphohydrolase | * | ||||||||||
| VRR-NUC domain-containing protein | |||||||||||
| YifB family Mg chelatase-like AAA ATPase | * | * | * | ||||||||
| YncE family protein | * | ||||||||||
| YxeA family protein | |||||||||||
| Coat protein | |||||||||||
| Iron-containing alcohol dehydrogenase | * | ||||||||||
| Ketopantoate hydroxymethyltransferase | |||||||||||
| Membrane protein | * | ||||||||||
| Outer spore coat protein (CotE) | * | ||||||||||
| Spore protease YyaC | * | * | * | * | |||||||
| Sporulation protein YhbH | * | ||||||||||
| Stress protein | * | * | * | * | |||||||
| DNA mismatch repair protein MutS | * | ||||||||||
| ERF superfamily protein | |||||||||||
| Acetyltransferase | |||||||||||
| dCMP deaminase family protein | * | * | |||||||||
| Murein transglycosylase-like protein | * | ||||||||||
| NTP-binding protein | * | ||||||||||
| Peptidase domain | * | ||||||||||
| Phenylalanine racemase | |||||||||||
| Phosphatidylglycerophosphatase A (PgpA) | * | ||||||||||
| Phosphatidylserine decarboxylase | * | ||||||||||
| STAS-like domain-containing protein | * | ||||||||||
| Thiamin pyrophosphokinase | * | ||||||||||
| Thioredoxin reductase | * | ||||||||||
| YqaE/Pmp3 family membrane protein | |||||||||||
The gray color identifies the corresponding CDS in the respective ERIC genotype. *CDS only found in this genotype.
Taxonomic classification of prophages based on structural proteins present (Y: protein present).
| Prophage | TSP | MTP/TTP | TAP | TAP | TMP | Family | Most common phage (PHASTER) | BLASTn parameters | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Homolog phage | Coverage (%) | Identity (%) | |||||||||
| H3_R2 | Y |
| Vegas ( | Dragolir | 67 | 99,92 | 0 | ||||
| H3_R3 | Y# | Y | Y | Y |
| Tripp ( | Heath | 38 | 90,61 | 0 | |
| H3_R5 | Y# | Y | Y | Y |
| Fern ( | Jacopo | 87 | 99,15 | 0 | |
| H3_R6 | Y | Y | Y |
| Tripp ( | Tripp | 75 | 99,45 | 0 | ||
| H3_R11 | Y | Y# | Y | Y |
| Lily ( | Lily | 41 | 84,28 | 0 | |
| H3_R14 | Y# | Y | Y |
| Vegas ( | Hayley | 72 | 99,98 | 0 | ||
| H3_R15 | Y | Y* | Y | HP | Y |
| Jimmer1 ( | Yerffej | 11 | 99,37 | 0 |
| H3_R18 | Y# | Y |
| Vegas ( | Vegas | 29 | 99,47 | 0 | |||
| H5_R2 | Y | Y* | Y | HP | Y |
| Harrison ( | Harrison | 25 | 99,97 | 0 |
| H5_R6 | Y# | Y |
| Vegas ( | Hayley | 32 | 99,47 | 0 | |||
| H5_R13 | Y# | Y | Y | Y |
| Fern ( | Leyra | 38 | 97,91 | 0 | |
| H6_R1 | Y# | Y |
| Vegas ( | Vegas | 29 | 99,47 | 0 | |||
| H6_R3 | Y# | Y | Y |
| Harrison ( | Paisley | 96 | 99,98 | 0 | ||
| H6_R6 | Y | Y | Y |
| Tripp ( | Heath | 61 | 90,57 | 0 | ||
| H6_R7 | Y | Y* | Y | Y |
| Jimmer1 ( | Harrison | 9 | 96,25 | 0 | |
| H6_R8 | Y# | Y | Y |
| Vegas ( | Hayley | 72 | 99,98 | 0 | ||
| H7_R1 | Y | Y |
| Tripp ( | Scottie | 94 | 99,91 | 0 | |||
| H7_R6 | Y# | Y |
| Vegas ( | Dragolir | 36 | 96,08 | 0 | |||
| H7_R10 | Y# | Y | Y | Y |
| Rani ( | Diva | 18 | 84,37 | 0 | |
| H8_R2 | Y | Y* | Y | Y |
| Jimmer1 ( | Harrison | 11 | 94,73 | 0 | |
| H8_R6 | Y | Y* | Y | HP | Y |
| Abouo ( | Dragolir | 22 | 93,18 | 0 |
| H8_R7 | Y | Y# | Y | Y |
| Lily ( | Lily | 79 | 91,09 | 0 | |
| H8_R8 | Y# | Y |
| Vegas ( | Dragolir | 36 | 96,09 | 0 | |||
| H8_R12 | Y | Y |
| Tripp ( | Scottie | 94 | 99,97 | 0 | |||
| H9_R3 | Y | Y* | Y | Y |
| Jimmer1 ( | Wanderer | 11 | 89,43 | 0 | |
| H9_R4 | Y | Y* | Y | Y |
| Jimmer1 ( | Paisley | 9 | 86,88 | 0 | |
| H9_R5 | Y# | Y | Y |
| Vegas ( | Diane | 41 | 92,1 | 0 | ||
| H9_R8 | Y | Y | Y |
| Tripp ( | C7Cdelta | 66 | 91,32 | 0 | ||
| H9_R10 | Y | Y | Y |
| Tripp ( | Ley | 55 | 89,51 | 0 | ||
| H9_R14 | Y | Y* | Y | Y |
| Jimmer1 ( | Sitara | 15 | 86,09 | 0 | |
| H9_R15 | Y | Y* | Y | Y |
| Jimmer1 ( | Leyra | 7 | 97,86 | 0 | |
| H10_R4 | Y# | Y | Y |
| Vegas ( | Vadim | 99 | 99,95 | 0 | ||
| H10_R5 | Y | Y* | Y | Y |
| Jimmer1 ( | Yerffej | 11 | 99,37 | 0 | |
| H10_R7 | Y | Y | Y |
| Tripp ( | Heath | 74 | 90,61 | 0 | ||
| H10_R10 | Y# | Y | Y |
| Harrison ( | Paisley | 96 | 99,99 | 0 | ||
| H10_R12 | Y# | Y |
| Vegas ( | Hayley | 34 | 99,46 | 0 | |||
| H11_R3 | Y# | Y | Y |
| Fern ( | Likha | 18 | 85,1 | 0 | ||
| H11_R8 | Y | Y |
| Tripp ( | Scottie | 94 | 99,98 | 0 | |||
| H11_R10 | Y | Y* | Y | Y |
| Jimmer1 ( | Harrison | 9 | 94,91 | 0 | |
| H11_R12 | Y | Y* | Y | Y |
| Abouo ( | Harrison | 11 | 94,73 | 0 | |
| H11_R13 | Y | Y* | Y | HP | Y |
| Abouo ( | Dragolir | 23 | 93,18 | 0 |
| H11_R14 | Y | Y# | Y | Y |
| Lily ( | Lily | 79 | 91,09 | 0 | |
| H11_R15 | Y# | Y |
| Vegas ( | Dragolir | 36 | 96,08 | 0 | |||
| H12_R3 | Y# | Y | Y |
| Fern ( | Yerffej | 19 | 84,49 | 0 | ||
| H12_R8 | Y | Y |
| Tripp ( | Scottie | 94 | 99,98 | 0 | |||
| H12_R13 | Y# | Y |
| Vegas ( | Dragolir | 36 | 96,09 | 0 | |||
| H13_R11 | Y | Y |
| Tripp ( | Scottie | 60 | 88,03 | 0 | |||
| H14_R2 | Y | Y |
| Tripp ( | Tripp | 52 | 89,95 | 0 | |||
| H14_R3 | Y | Y# | Y | Y |
| Lily ( | phiERICV | 94 | 100 | 0 | |
| H14_R7 | Y | Y* | Y | HP | Y |
| Jimmer2 ( | Sitara | 11 | 87,86 | 0 |
| H14_R8 | Y | Y* | Y | HP | Y |
| Jimmer2 ( | Paisley | 17 | 86,26 | 0 |
| H14_R9 | Y | Y |
| Tripp ( | Heath | 67 | 89,07 | 0 | |||
| H14_R10 | Y | Y* | Y | HP | Y |
| Jimmer1 ( | phiERICV | 7 | 94,99 | 0 |
| H14_R14 | Y | Y |
| Tripp ( | Heath | 76 | 89,11 | 0 | |||
| H14_R15 | Y | Y* | Y | HP | Y |
| Harrison ( | phiERICV | 12 | 95,37 | 0 |
TSP, tail sheath protein; MTP, major tail tube protein; TTP, tail tube protein; TAP, tail assembly protein; TMP, tail tape measure protein; Y#, one of both proteins present; Y*, only TTP present; HP, hypothetical protein; —prophages from this family based on phage homology of BLASTn; Siphoviridae—prophages with the four typical proteins; —three of the four typical proteins. Myoviridae*—prophages homologous to Brevibacillus laterosporus phages; Myoviridae—prophages homologous to other siphovirus phages.
Figure 5Phylogenetic analysis of Paenibacillus larvae phages. Whole genomes based on shared CDS content (nucleotide), obtained with Geneious. Database: Paenibacillus larvae reported phages (n = 50) and intact Paenibacillus larvae prophages (n = 55), identified here. Clusters have ≥60% of shared CDS and were highlighted by colored rectangles.