| Literature DB >> 31072320 |
Hugo Oliveira1, Marta Sampaio2, Luís D R Melo2, Oscar Dias2, Welkin H Pope3, Graham F Hatfull3, Joana Azeredo2.
Abstract
BACKGROUND: Bacteriophages are the most abundant and diverse entities in the biosphere, and this diversity is driven by constant predator-prey evolutionary dynamics and horizontal gene transfer. Phage genome sequences are under-sampled and therefore present an untapped and uncharacterized source of genetic diversity, typically characterized by highly mosaic genomes and no universal genes. To better understand the diversity and relationships among phages infecting human pathogens, we have analysed the complete genome sequences of 205 phages of Staphylococcus sp.Entities:
Keywords: Bacteriophages; Clusters; Endolysin; Genomes; Phams; Staphylococcus
Mesh:
Substances:
Year: 2019 PMID: 31072320 PMCID: PMC6507118 DOI: 10.1186/s12864-019-5647-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Diversity of staphylococcal phage genomes. a) Splitstree 3D representation into 2D space of the 205 staphylococcal phages illustrating shared phams generated from a total of 20,579 predicted genes. A total of 2139 phams (a group of genes with related sequences) of which 745 orphams (a single gene without related sequences) were identified. b) The assignment of A) clusters and B) subclusters are shown in coloured circles. The scale bar indicates 0.01 substitution. The spectrum of diversity reveals four clusters and 31 subclusters (A1-A2, B1-B21, C1-C6 and D1-D2) and one singleton (phage SPbeta-like). A Venn diagram was also included to visualize the amount of proteins allocated and shared across each cluster. Common phams among different clusters that are represented by intersections of the circles. There is no universal pham in staphylococci phage genomes
Fig. 2Phage relationship under gene content dissimilarity index. GCD scores given by each pairwise comparison for a) all staphylococcal, b) S. aureus phage genomes or c) S. epidermidis phage genomes (where GCD = 1 meaning 100% dissimilar, GCD = 0 meaning 100% similar). d MaxGCDGap relationships for all staphylococcal phages ordered by median (where higher MaxGCDGap mean most diverse and lower MaxGCDGap mean less diverse, relative to the groups analysed). MaxGCDGap relationships for e) cluster of phages (a to d) or for f) subclusters of phages (A1-A2, B1-B21, C1-C6, D1-D2) and the singletons, where each data point represents a single phage genome. Horizontal lines show the MaxGCDGap mean per cluster and subclusters. Cluster and subclusters with less than five members were omitted from the analysis in e and f
Fig. 3Diversity of staphylococcal phage integrases. Maps of the lysis cassettes, virulence determinants, and integration cassettes for six Staphylococcus phages were constructed using Phamerator, genes are labelled with their putative functions where applicable
Staphylococcal cluster B phage integrases. The dataset includes 205 staphylococcal phages, of which 132 belong to the cluster B Siphoviridae. Phams related to integration functions and virulence determinants are represented to phage member, clusters and protein domains
| Pham | Function | Alternative nomenclaturea | Number of members | Domainsb | Conserved, accessory or unique pham |
|---|---|---|---|---|---|
| Integrases | |||||
| 148 | Y-Int | Sa3, Sa9, Sa10, Sa11 | 38 | pfam14659; pfam00589 | Conserved (B9); Accessory (B2, B3, B4, B5, B6, B7, B10); Unique (B8, B12, B14, B15, B16, B17) |
| 280 | Y-Int | Sa1, Sa5 | 27 | pfam14657; pfam14659; pfam00589 | Conserved (B1); Unique (B7); Accessory (B2, B3) |
| 288 | S-Int | Sa7, Se1, Se12 | 25 | pfam00239; pfam07508 | Accessory (B2, B3, B4); Unique (B6, B10, B11, B13) |
| 1656 | Y-Int | – | 1 | pfam14659; pfam00589 | Unique (B8) |
| 1661 | Y-Int | Sa2, Sa6 | 40 | pfam00589 | Accessory (B3, B5, B6, B7) |
| Virulence determinants | |||||
| 297 |
| 1 | pfam05272 | Unique (B5) | |
| 529 |
| 12 | pfam16935 | Accessory (B6, B7); Unique (B5) | |
| 555 | PVL ( | 26 | pfam07968 | Accessory (B5, B6, B7) | |
| 914 |
| 17 | pfam11546 | Accessory (B6, B7); Unique (B3) | |
| 1259 |
| 10 | pfam02452 | Accessory (B2, B3); Unique (B5) | |
| 1270 |
| 23 | pfam05272 | Accessory (B5); Unique (B15) | |
| 1322 |
| 1 | pfam16935 | Unique (B6) | |
| 1460 |
| 16 | pfam02821 | Accessory (B6, B7); Unique (B8) | |
| 1579 |
| 8 | pfam02452 | Accessory (B6) | |
| 1597 |
| 1 | Pfam03372 | Unique (B7) | |
| 1903 |
| 5 | pfam13365 | Accessory (B3); Unique (B2) | |
| 1939 | PVL ( | 27 | pfam07968 | Accessory (B5, B6, B7) | |
| 2064 |
| 7 | pfam01123; pfam02876 | Accessory (B6) | |
| 2122 |
| 10 | pfam11434 | Accessory (B6, B7) | |
a An alternative integrase nomenclature system is provided as in Goerke et al. 2009 (29)
bPham descriptions: pfam14659: Phage integrase, N-terminal SAM-like domain; pfam00589: Phage integrase family; pfam14657: AP2-like DNA-binding integrase domain; pfam00239: Resolvase, N terminal domain; pfam07508: Recombinase; pfam02899: Phage integrase, N-terminal SAM-like domain; pfam13495: Phage integrase, N-terminal SAM-like domain; pfam01123: Staphylococcal/Streptococcal toxin, OB-fold domain; pfam02876: Staphylococcal/Streptococcal toxin, beta-grasp domain; pfam02821: Staphylokinase/Streptokinase family; pfam11434: Chemotaxis-inhibiting protein CHIPS; pfam11546: Staphylococcal complement inhibitor SCIN; pfam05272: Virulence-associated protein E; pfam16935: Putative Holin-like Toxin (Hol-Tox); pfam07968: Leukocidin /Hemolysin toxin family; pfam02452: PemK-like, MazF-like toxin of type II toxin-antitoxin system; pfam13365: Trypsin-like peptidase domain; pfam03372: Endonuclease/Exonuclease/phosphatase family
Acronyms of integrase and virulence genes: Y-Int and S-Int, integrase of tyrosine or serine type; virE, virulence-associated protein E; PVL, Panton-Valentine leucocidin, that is activated by two polypeptide-enconding genes (lukS-PV, lukF-PV); scn, staphylococcal complement inhibitor; pemK, endoribonuclease toxin PemK; sak, plasminogen activator staphylokinase; mazF, endoribonuclease toxin MazF; hlb, β-hemolysin; eta, exfoliative toxin A; sea, staphylococcal enterotoxin A; chp, chemotaxis inhibitory protein
Note: The holin-toxin gene is different from the holin gene that participates in the lytic cassette. For instance, in phage P954, gp20 is the holin-toxin, gp21 is the holing and gp22 is the endolysin
Fig. 4Organization strategies of staphylococcal phage endolysins. a) Four endolysin synthesis strategies found in staphylococcal phage genomes (n = 205), 175 phages encode lysins with single genes, 20 phage lysin genes exhibit group I introns, nine phages encode lysins using two adjacent genes; one phage, 2638A, uses a single gene with an inter-lytic-domain secondary translational start site, described here [35]; b) Schematic representation of a S. aureus bacteria cell wall and peptidoglycan. The red arrow indicates the conserved glycine peptidoglycan bridge recognized by the SH3 binding domain. Black arrows indicate possible cutting sites of the catalytic domains: 1) N-acetyl-β-D-glucosaminidase, 2) N-acetyl- β -D-muramidase, 3) lytic transglycosylase, 4) N-acetylmuramoyl- L -alanine amidase, 5) L -alanyl-D-glutamate endopeptidase, 6) D -alanyl-glycyl endopeptidase, 7) L-alanoyl-D-glutamate endopeptidase. Endolysins of class 4) and 6) have been experimental validated [60]. Acronyms: CHAP, cysteine/histidine-dependent amidohydrolase/peptidase domain (PF05257/IPR007921); AMI-2, Amidase_2 domain (PF01510/IPR002502); AMI-3, Amidase_3 domain (PF01520/IPR002508); SH3–5, SH3_5 domain (PF08460/IPR013667); NAG, β-1,4-linked monomers of N-acetyl glucosamine; NAM; N-acetyl muramic acid