| Literature DB >> 28218261 |
Angela B Brueggemann1, Caroline L Harrold1, Reza Rezaei Javan1, Andries J van Tonder1, Angus J McDonnell1, Ben A Edwards1.
Abstract
Bacteriophages (phages) infect many bacterial species, but little is known about the diversity of phages among the pneumococcus, a leading global pathogen. The objectives of this study were to determine the prevalence, diversity and molecular epidemiology of prophages (phage DNA integrated within the bacterial genome) among pneumococci isolated over the past 90 years. Nearly 500 pneumococcal genomes were investigated and RNA sequencing was used to explore prophage gene expression. We revealed that every pneumococcal genome contained prophage DNA. 286 full-length/putatively full-length pneumococcal prophages were identified, of which 163 have not previously been reported. Full-length prophages clustered into four major groups and every group dated from the 1930-40 s onward. There was limited evidence for genes shared between prophage clusters. Prophages typically integrated in one of five different sites within the pneumococcal genome. 72% of prophages possessed the virulence genes pblA and/or pblB. Individual prophages and the host pneumococcal genetic lineage were strongly associated and some prophages persisted for many decades. RNA sequencing provided clear evidence of prophage gene expression. Overall, pneumococcal prophages were highly prevalent, demonstrated a structured population, possessed genes associated with virulence, and were expressed under experimental conditions. Pneumococcal prophages are likely to play a more important role in pneumococcal biology and evolution than previously recognised.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28218261 PMCID: PMC5317160 DOI: 10.1038/srep42976
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Graph depicting the number of prophages in each category: partial, putatively full-length or full-length.
Bars represent the number of prophages relative to the length of the prophage sequence that was identified.
Figure 2Description of nucleotide sequence similarity, phylogenetic clustering and shared genes among the 66 representative full-length prophages identified in the pneumococcal genome dataset.
(a) Heat map depicting the percentage nucleotide sequence identity shared between pairs of full-length prophage sequences. Groups of similar prophages are marked A-E and correspond to the clusters seen in part b. (b) Clusters of prophage sequences based upon nucleotide sequence identity. Bootstrap values are marked on branches of each cluster. (c) Venn diagram depicting genes unique to each prophage cluster and genes shared between clusters.
Figure 3Nucleotide sequence alignments of representative full-length prophages from clusters A and B.
The coloured bar at the top of each cluster indicates the mean pairwise nucleotide sequence identity over all pairs in the column: bright green = 100% identity; green-brown = <100% but >30% identity; and red = <30% identity. Prophage genes are coloured based on putative or known function. The names of each prophage are given, followed by the year of isolation (in brackets) of the oldest known pneumococcus harbouring that prophage.
Figure 4Nucleotide sequence alignments of representative full-length prophages from clusters C, D and E.
The coloured bar at the top of each cluster indicates the mean pairwise nucleotide sequence identity over all pairs in the column: bright green = 100% identity; green-brown = <100% but >30% identity; and red = <30% identity. Prophage genes are coloured based on putative or known function. The names of each prophage are given, followed by the year of isolation (in brackets) of the oldest known pneumococcus harbouring that prophage.
Pneumococcal genes flanking either side of the 93 integrated full-length prophage sequences.
| Cat | N | Genes upstream of phage integrase | PI | Genes downstream of phage amidase | ||||
|---|---|---|---|---|---|---|---|---|
| a | 35 | P-loop-containing kinase | transporter | putative sporulation transcription regulator whiA | I | pyridine nucleotide-disulfide oxidoreductase | ABC transporter permease | lipoprotein |
| transposase | recombination regulator RecX | adenylosuccinate synthetase | hypothetical protein | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | |||
| transposase | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | hypothetical protein | cytidine/deoxycytidylate deaminase | |||
| IS1167 transposase | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | |||
| IS1167 transposase | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | hypothetical protein | cytidine/deoxycytidylate deaminase | |||
| tRNA-Asn | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | |||
| tRNA-Asn | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | hypothetical protein | cytidine/deoxycytidylate deaminase | |||
| transposase | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | cytidine/deoxycytidylate deaminase | phage integrase | |||
| assembly gap | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | hypothetical protein | cytidine/deoxycytidylate deaminase | |||
| transposase | recombination regulator RecX | adenylosuccinate synthetase | hypothetical protein | arginine deiminase | arginine deiminase | |||
| transposase-like protein, IS630 | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | |||
| transposase-like protein, IS630 | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | hypothetical protein | cytidine/deoxycytidylate deaminase | |||
| tRNA-Asn | recombination regulator RecX | adenylosuccinate synthetase | DNA-binding protein | integrase | assembly gap | |||
| — | — | assembly gap | DNA-binding protein | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | |||
| recombination regulator RecX | adenylosuccinate synthetase | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | phosphoglycerate mutase | DNA repair protein RadA | |||
| phage lytic amidase | DNA-binding protein | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | phosphoglycerate mutase | DNA repair protein RadA | |||
| DNA-binding protein | hypothetical protein | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | phosphoglycerate mutase | DNA repair protein RadA | |||
| recombination regulator RecX | adenylosuccinate synthetase | cytidine/deoxycytidylate deaminase | deoxyuridine 5’-triphosphate nucleotidohydrolase | phosphoglycerate mutase | DNA repair protein RadA | |||
| superoxide dismutase | competence protein CglA | competence protein CglB | hypothetical protein | competence protein CglC | competence protein CglD | |||
| leucyl-tRNA synthetase | acetyltransferase | GNAT family acetyltransferase | DNA-binding protein | Holliday junction DNA helicase RuvB | hypothetical protein ProS | |||
aCat = pneumococcal genome insertion site category. 13 examples of genomes with miscellaneous prophage integration sites, including assembly gaps on either side of the prophage sequence, are not shown here. A more detailed list of the integration sites and flanking genes may be found in Table S5, which includes the flanking genes associated with the 193 putatively full-length prophage sequences.
bN = number of prophages.
cPI = prophage integrase group, determined by the nucleotide sequence of the integrase.
dTwo pneumococcal genomes had assembly gaps after the prophage amidase.
Figure 5Illustration of the number of different full-length and putatively full-length prophages identified in each of the major pneumococcal clonal complexes.
Clonal complexes are labelled on the y-axis followed by brackets containing the number of pneumococcal genomes within that complex and the years of isolation of those pneumococci. See Table 2 and S2 for details of the specific prophages identified within each clonal complex.
Associations between full-length and putatively full-length prophages and pneumococcal genetic lineages.
| Prophage frequency (years of isolation) | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Clonal complex | IPP32 (1977–1988) | MM1 (1984–2002) | IPPX302 (1978–1985) | SPN195_1 (1993–2008) | IPPX3 (1978–1984) | IPPX4 (1978–1987) | 3059_06_phage (1978–1985) | IPPX300 (1999–2003) | IPP57 (1992–2005) | IPP12 (1948–2002) | IPP29 (1978–2002) | IPPX328 (1978) | IPP34 (1939–1999) | IPP62 (1999–2006) | IPP67 (1957–1962) | SPN_1873 (1999–2000) | IPPX215 (1999–2006) | IPP8 (1989–2000) | IPPX100 (1965–1966) | Total |
| 1094 | 7 | 6 | 6 | 5 | 24 | |||||||||||||||
| 41/1605 | 14 | 3 | 17 | |||||||||||||||||
| 81 | 11 | 11 | ||||||||||||||||||
| 156/162 | 7 | 7 | ||||||||||||||||||
| 199 | 5 | 1 | 6 | |||||||||||||||||
| 124 | 1 | 4 | 5 | |||||||||||||||||
| 180 | 5 | 5 | ||||||||||||||||||
| 2090 | 3 | 1 | 4 | |||||||||||||||||
| 3122/5977 | 3 | 3 | ||||||||||||||||||
| 113 | 3 | 3 | ||||||||||||||||||
| 15 | 3 | 3 | ||||||||||||||||||
| 439 | 2 | 2 | ||||||||||||||||||
| Singleton | 1 | 1 | 2 | |||||||||||||||||
| 66 | 2 | 2 | ||||||||||||||||||
| 217 | 2 | 2 | ||||||||||||||||||
| 615 | 2 | 2 | ||||||||||||||||||
| 236/271/320 | 1 | 1 | ||||||||||||||||||
| 7232 | 1 | 1 | ||||||||||||||||||
| 68/1656 | 1 | 1 | ||||||||||||||||||
| Total | 16 | 11 | 7 | 7 | 6 | 6 | 6 | 5 | 5 | 4 | 4 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 101 |
aOnly the full-length and putatively full-length prophages present ≥3 times in the study dataset were included here. The years of isolation refer to when the host pneumococci were isolated.
Figure 6Heat maps describing the results of the RNA-seq experiment.
Prophage genes are depicted by rows and differential expression levels at each of five time points are presented in columns. An asterisk to the left of a cell indicates a statistically significant differential level of expression (p < 0.05). Prophage gene expression levels are given for two full-length prophages, Sp195_1 (a) and Sp195_2 (b), and one partial prophage sequence, IPPX439 (c). Mitomycin C was added to the broth culture after 3 h of incubation.