| Literature DB >> 29099783 |
Julia Villarroel1, Mette Voldby Larsen2, Mogens Kilstrup3, Morten Nielsen4,5.
Abstract
Phage therapy has regained interest in recent years due to the alarming spread of antibiotic resistance. Whilst phage cocktails are commonly sold in pharmacies in countries such as Georgia and Russia, this is not the case in western countries due to western regulatory agencies requiring a thorough characterization of the drug. Here, DNA sequencing of constituent biological entities constitutes a first step. The pyophage (PYO) cocktail is one of the main commercial products of the Georgian Eliava Institute of Bacteriophage, Microbiology and Virology and is used to cure skin infections. Since its first production in the 1930s, the composition of the cocktail has been periodically modified to add phages effective against emerging pathogenic strains. In this paper, we compared the composition of three PYO cocktails from 1997 (PYO97), 2000 (PYO2000) and 2014 (PYO2014). Based on next generation sequencing, de novo assembly and binning of contigs into draft genomes based on tetranucleotide distance, thirty and twenty-nine phage draft genomes were predicted in PYO97 and PYO2014, respectively. Of these, thirteen and fifteen shared high similarity to known phages. Eleven draft genomes were found to be common in the two cocktails. One of these showed no similarity to publicly available phage genomes. Representatives of phages targeting E. faecalis, E. faecium, E. coli, Proteus, P. aeruginosa and S. aureus were found in both cocktails. Finally, we estimated larger overlap of the PYO2000 cocktail to PYO97 compared to PYO2014. Using next generation sequencing and metagenomics analysis, we were able to characterize and compare the content of PYO cocktails separated by 17 years in time. Even though the cocktail composition is upgraded every six months, we found it to remain relatively stable over the years.Entities:
Keywords: PYO phage cocktail; human phage therapy; metagenomics
Mesh:
Year: 2017 PMID: 29099783 PMCID: PMC5707535 DOI: 10.3390/v9110328
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1The four batches of pyophage (PYO) cocktail investigated here. The glass ampoules are dated (a) 1997; (b) 2000; (c) 2010 and (d) 2014.
Reads statistics.
| Sample | # Reads | # Reads after Trimming | # Reads after Removing PhiX174 |
|---|---|---|---|
| PYO97 | 5,228,884 (1,280,000 kbp) | 2,035,496 (420,000 kbp) | 1,965,233 (410,000 kbp) |
| PYO2000 | 1,648,430 (410,000 kbp) | 1,366,749 (300,000 kbp) | 1,110,522 (240,000 kbp) |
| PYO2014 | 18,240,556 (4,470,000 kbp) | 6,660,081 (1,380,000 kbp) | 6,577,613 (1,370,000 kbp) |
# Means “number of”.
Figure 2Krona map based on reads from PYO97 (a) and PYO2014 (b) mapped to the Kraken Virus database. Interactive charts can be found at https://julvi.github.io/PYO97_krona.html and https://julvi.github.io/PYO2014_krona.html for the respective samples.
Summary of the assembly results.
| Sample | # Contigs | Longest and Shortest Contig | # Contigs Longer than 2 kbp–Percentage of Reads Mapping to the Contigs |
|---|---|---|---|
| PYO97 | 3005 (4403 kbp) | 169,029 bp (128 bp) | 179 (3034 kbp)–97.9% |
| PYO2014 | 4165 (4865 kbp) | 282,352 bp (128 bp) | 270 (2759 kbp)–99.1% |
# Means “number of”.
Number of bins yielded by MetaBAT and number and percentage of binned bp out of the total number of bp in contigs larger than 2000 bp.
| Sample | # Bins | # Binned bp (Percentage of Binned bp) |
|---|---|---|
| PYO97 | 33 | 2,735,811 (90.16%) |
| PYO2014 | 31 | 2,494,104 (90.39%) |
# Means “number of”.
Figure 3Blast Atlases. (a) Example of an original bin, PYO97_10 (coverage 13.9 ± 0.015), that was split into two bins: one made of the contiguous contigs closest to the reference in the middle until the orange contig, bin PYO97_10_85.139.47.48.59.38.5.35.15.78.55.44.14 (coverage 12.4 ± 0.014), and the second one, bin PYO97_10_3.8.10.28.42 (coverage 16.9 ± 0.025), containing the remaining contigs; (b) Merging of three bins PYO97_22, PYO97_3 and PYO97_14 into one PYO97_22.3.14 which covers the entire reference genome; (c) Collapsed bin in PYO2014; these three contigs have been grouped together to form a collapsed bin. The difference between a normal bin and a collapsed bin is the presence of overlapping contigs in the latter probably derived from shared sequences between species of the same phage family; (d) Corresponding draft genomes from the two samples aligning to the reference, Pseudomonas phage TL; (e) PYO97_27.21 and PYO2014_28 are highly similar and do not resemble any known sequence; (f) Alignment of PYO2014_28 to PYO97_27.21.
The bins to be merged are indicated in the first three columns. The fourth and fifth columns show the resulting merged bin and the closest reference, respectively. Coverage values are in parentheses.
| Bin to be Merged 1 | Bin to be Merged 2 | Bin to be Merged 3 | Merged Bin | Shared Reference |
|---|---|---|---|---|
| PYO97_17 (44.67 ± 0.05) | PYO97_30 (53.82 ± 0.07) | PYO97_17.30 (46.65 ± 0.04) | ||
| PYO97_3 (57.92 ± 0.08) | PYO97_14 (66.42 ± 0.14) | PYO97_22 (71.55 ± 0.14) | PYO97_22.3.14 (65.41 ± 0.08) | |
| PYO2014_3 (2294.3 ± 3.8) | PYO2014_16 (2294.98 ± 3.81) | PYO2014_29 (2222.82 ± 2.54) | PYO2014_3.16.29 (1659.65 ± 2.65) | |
| PYO2014_13 (109.82 ± 0.21) | PYO2014_26 (120.43 ± 0.16) | PYO2014_26.13 (118.21 ± 0.14) |
Number of phage draft genomes from the two samples which HostPhinder predicted to infect the respective host. Only results having a score higher than the reliability threshold of 0.1 are reported.
| Bacterial Host | # Representative Phage Draft Genomes in PYO97 | # Representative Phage Draft Genomes in PYO2014 |
|---|---|---|
| 2 | 3 | |
| 1 | 1 | |
| 4 | 7 | |
| 1 | 0 | |
| 2 | 1 | |
| 3 | 4 | |
| 2 | 0 | |
| 2 | 0 | |
| 0 | 1 | |
| 1 | 1 |
# Means “number of”.
Overview of correspondent draft genomes between PYO97 and PYO2014 and the reciprocal ANI. The last column displays the targeted host as predicted by HostPhinder. Bins 10_85…, and 27_42… in the table correspond to bin 10_85.139.47.48.59.38.5.35.15.78.55.44.14 and 27_42.21.133.90.116.20.14.109.73, respectively.
| Bin from PYO97 | Bin from PYO2014 | Reciprocal ANI (%) | Predicted Targeted Bacterial Host |
|---|---|---|---|
| 16 | 30 | 99.9 | |
| 27.21 | 28 | 98.6 | |
| 11 | 26.13 | 97.2 | |
| 12 | 1 | 98.8 | |
| 25 | 3.16.29 | 99.5 | |
| 15 | 12 | 98.7 | |
| 29 | 21 | 96.2 | |
| 4 | 27_42… | 98.1 | |
| 8 | 25 | 88.4 | |
| 10_85… | 31 | 89.6 | |
| 23 | 20 | 85.4 |
* Indicates that the prediction by HostPhinder had a low score and was hence unreliable. ** In this case, HostPhinder predicted a different host for each draft genome.
Count of draft genomes/bins belonging to each category.
| Class | PYO97 | PYO2014 |
|---|---|---|
| (1) Near-complete draft genome with high resemblance to reference phage and counterpart in the other sample. | 6 | 4 |
| (2) Near-complete draft genome with high resemblance to reference phage, but no counterpart in the other sample. | 5 | 8 |
| (3) Partial draft genome with low/medium resemblance to reference phage and counterpart in the other sample. | 1 | 1 |
| (4) Partial draft genome with no resemblance to reference phage and no counterpart in the other sample. | 11 | 8 |
| (5) Collapsed bins. | 2 | 4 |
| (6) Special cases, including highly fragmented bins and bins classified as non-phages. | 5 | 4 |
Figure A1Blast Atlas of PYO2014_21 towards the reference Shigella phage SHFML-11.
Overview of the phage draft genomes and bins of PYO97 indicating the most likely taxonomic group they belong to. PYO97_10_85… and PYO97_10_3… correspond to PYO97_10_85.139.47.48.59.38.5.35.15.78.55.44.14 and PYO97_10_3.8.10.28.42, respectively.
| Bin Name | # Contigs | Size (bp) | Closest Relative in the Database | ANI (%) | Most Likely Taxonomic Group |
|---|---|---|---|---|---|
| PYO97 | |||||
| PYO97 near-complete draft genomes with high resemblance to reference phage and counterpart in PYO2014. (Category 1) | |||||
| PYO97_4 | 1 | 149,561 | 89.77 | ||
| PYO97_10_85… | 13 | 344,749 | 90.848 | ||
| PYO97_11 | 1 | 72,136 | 97.37 | ||
| PYO97_15 | 1 | 44,667 | 92.02 | ||
| PYO97_16 | 1 | 130,932 | 96.86 | ||
| PYO97_29 | 1 | 169,029 | 89.959 | ||
| PYO97 near-complete draft genomes with high resemblance to reference phage, but no counterpart in PYO2014. (Category 2) | |||||
| PYO97_7 | 7 | 166,126 | 88.66 | ||
| PYO97_8 | 1 | 38,419 | 79.568 | ||
| PYO97_22.3.14 | 3 | 109,428 | 95.33 | ||
| PYO97_24 | 1 | 44,541 | 92.726 | ||
| PYO97_32 | 3 | 47,235 | 86.967 | unclassified | |
| PYO97 partial draft genome with low/medium resemblance to reference phage and counterpart in PYO2014. (Category 3) | |||||
| PYO97_27.21 | 2 | 37,487 | 22.104 | ||
| PYO97 partial draft genomes with no resemblance to reference phage and no counterpart in PYO2014. (Category 4) | |||||
| PYO97_1 | 1 | 11,445 | 10.99 | ||
| PYO97_5 | 3 | 29,155 | 68.72 | ||
| PYO97_9 | 1 | 10,727 | 19.779 | ||
| PYO97_10_3… | 5 | 343,801 | 28.408 | ||
| PYO97_13 | 1 | 37,843 | 9.777 | ||
| PYO97_17.30 | 7 | 90,209 | 58.832 | ||
| PYO97_20 | 1 | 90,712 | 19.54 | ||
| PYO97_25 | 1 | 25,293 | 41.01 | ||
| PYO97_26 | 5 | 171,908 | Cronobacter phage S13 (NC_028773.1) | 45.28 | |
| PYO97_28 | 5 | 30,952 | 21.43 | ||
| PYO97_31 * | 3 | 69,885 | 75.359 | ||
| PYO97 collapsed bins. (Category 5) | |||||
| PYO97_12 | 5 | 55,452 | 69.288 | ||
| PYO97_23 | 5 | 73,434 | 74.273 | ||
| PYO97 special cases, including bins classified as non-phages. (Category 6) | |||||
| PYO97_2 | 1 | 11,313 | uncultured Mediterranean phage uvMED-GF-C25-MedDCM-OCT-S33-C258 (AP014078.1) | 0.704 | unknown |
| PYO97_6 | 8 | 23,397 | uncultured Mediterranean phage uvMED-CGF-C14B-MedDCM-OCT-S36-C258 (AP013800.1) | 1.426 | unknown |
| PYO97_18 | 1 | 11,354 | 0.984 | unknown | |
| PYO97_19 | 10 | 284,533 | 18.88 | ||
| PYO97_33 | 3 | 10,088 | uncultured Mediterranean phage uvMED-CGF-C23-MedDCM-OCT-S24-C232 (AP013582.1) | 1.131 | unknown |
* PYO97_31 is 20 kbp shorter than the reference, therefore it was placed in this category, despite the high ANI of the reference genome. # Means “number of”. The 5th column reports the ANI of the reference genome towards the bin.
Overview of the phage draft genomes and bins of the cocktail PYO2014, indicating the most likely taxonomic group they belong to. PYO2014_27_42… corresponds to PYO2014_27_42.21.133.90.116.20.14.109.73.
| Bin Name | # Contigs | Size (bp) | Closest Relative in the Database | ANI (%) | Most Likely Taxonomic Group |
|---|---|---|---|---|---|
| PYO2014 | |||||
| PYO2014 near-complete draft genomes with high resemblance to reference phage and counterpart in PYO97. (Category 1) | |||||
| PYO2014_1 | 1 | 42,721 | 70.16 | ||
| PYO2014_12 | 1 | 47,209 | 97.91 | ||
| PYO2014_27_42… | 9 | 138,228 | 81.548 | ||
| PYO2014_30 | 1 | 138,269 | 99.36 | ||
| PYO2014 near-complete draft genomes with high resemblance to reference phage, but no counterpart in PYO97. (Category 2) | |||||
| PYO2014_2 | 1 | 76,529 | 77.91 | ||
| PYO2014_4 | 1 | 282,352 | 94.53 | ||
| PYO2014_8 | 1 | 36,807 | 74.45 | ||
| PYO2014_17 | 1 | 88,099 | 94.91 | ||
| PYO2014_18 | 1 | 147,760 | 91.5 | ||
| PYO2014_23 | 1 | 38,847 | 82.27 | ||
| PYO2014_26.13 | 2 | 65,818 | 90.705 | ||
| PYO2014_27_16.5 | 2 | 139,828 | 90.812 | ||
| PYO2014 partial draft genome with low/medium resemblance to reference phage and counterpart in PYO97. (Category 3) | |||||
| PYO2014_28 | 1 | 33,115 | 16.79 | ||
| PYO2014 partial draft genomes with no resemblance to reference phage and no counterpart in PYO97. (Category 4) | |||||
| PYO2014_7 | 1 | 103,078 | 14.32 | ||
| PYO2014_9 | 1 | 37,468 | 15.18 | ||
| PYO2014_19 | 1 | 43,272 | 45.35 | ||
| PYO2014_10 | 1 | 10,736 | 3.06 | ||
| PYO2014_11 | 1 | 13,190 | 29.52 | ||
| PYO2014_14 | 1 | 17,615 | 4.35 | ||
| PYO2014_20 | 4 | 16,677 | 20.55 | ||
| PYO2014_31 | 50 | 227,129 | 54.417 | ||
| PYO2014 collapsed bins. (Category 5) | |||||
| PYO2014_3.16.29 | 3 | 54,712 | 64.733 | ||
| PYO2014_5 | 25 | 193,706 | 90.11 | ||
| PYO2014_21 | 22 | 180,343 | 88.2 | ||
| PYO2014_25 | 3 | 78,290 | 76.539 | ||
| PYO2014 special cases, including bins classified as non-phages. (Category 6) | |||||
| PYO2014_6 | 26 | 75,778 | Uncultured phage WW-nAnB strain 2 (NC_026612.1) | 1.91 | unknown |
| PYO2014_15 | 1 | 20,152 | uncultured Mediterranean phage uvMED-CGF-C24-MedDCM-OCT-S28-C185 (AP013656.1) | 0.69 | unknown |
| PYO2014_22 | 5 | 57,117 | 1.78 | ||
| PYO2014_24 | 26 | 89,259 | 42.11 | ||
# Means “number of”. The 5th column reports the ANI of the reference genome towards the bin.
Figure 4All the bins from PYO97 and PYO2014 are plotted according to the Log to base 10 of the coverage by PYO97’s reads, x-axis and PYO2014’s reads, y-axis. Bins from PYO97 are depicted in blue, whilst bins from PYO2014 are red. Circle shaped data points represent pairing bins between the two samples, i.e., bins for which MetaPhinder found a matching counterpart in the other sample with a ANI > 70%; see Table 6 and Material and Methods, Estimate similarities between PYO97 and PYO2014.
Global mutation distances between samples.
| Sample | PYO97 | PYO2000 | PYO2014 |
|---|---|---|---|
| PYO97 | 0 | ||
| PYO2000 | 0.113 ± 0.0006 | 0 | |
| PYO2014 | 0.132 ± 0.0008 | 0.138 ± 0.0009 | 0 |