| Literature DB >> 26703713 |
Henrike Zschach1, Katrine G Joensen2, Barbara Lindhard3, Ole Lund4, Marina Goderdzishvili5, Irina Chkonia6, Guliko Jgenti7, Nino Kvatadze8, Zemphira Alavidze9, Elizabeth M Kutter10, Henrik Hasman11, Mette V Larsen12.
Abstract
Phage therapy, a practice widespread in Eastern Europe, has untapped potential in the combat against antibiotic-resistant bacterial infections. However, technology transfer to Western medicine is proving challenging. Bioinformatics analysis could help to facilitate this endeavor. In the present study, the Intesti phage cocktail, a key commercial product of the Eliava Institute, Georgia, has been tested on a selection of bacterial strains, sequenced as a metagenomic sample, de novo assembled and analyzed by bioinformatics methods. Furthermore, eight bacterial host strains were infected with the cocktail and the resulting lysates sequenced and compared to the unamplified cocktail. The analysis identified 23 major phage clusters in different abundances in the cocktail, among those clusters related to the ICTV genera T4likevirus, T5likevirus, T7likevirus, Chilikevirus and Twortlikevirus, as well as a cluster that was quite distant to the database sequences and a novel Proteus phage cluster. Examination of the depth of coverage showed the clusters to have different abundances within the cocktail. The cocktail was found to be composed primarily of Myoviridae (35%) and Siphoviridae (32%), with Podoviridae being a minority (15%). No undesirable genes were found.Entities:
Keywords: Eliava Intestiphage; metagenomics; phage therapy; whole genome sequence analysis
Mesh:
Substances:
Year: 2015 PMID: 26703713 PMCID: PMC4690881 DOI: 10.3390/v7122958
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
List of the strains used to specifically amplify phages from the Intesti cocktail and the number of reads obtained in their sequencing. All strains were tested for susceptibility to the cocktail prior to selection.
| Host Bacterial Strain | Number of Reads |
|---|---|
| 358,914 | |
| 134,966 | |
| 184,790 | |
| 265,772 | |
| 64,852 | |
| 133,980 | |
| 225,664 | |
| 401,722 |
Figure 1Schematic illustrating the contig grouping process. In a first step, a BLAST search against the non-redundant nucleotide collection is performed for all contigs. Afterwards, a hit profile is generated for each contig by applying a cutoff of 20% on the query coverage and 1 × 10−10 on the E-value to the raw BLAST results. During the second step contigs are sorted by size and the largest contig is automatically assigned to group 1. The third step consists of comparing the second-largest contig to all existing groups using the scoring system described in the text and either assigning the contig to the group with the lowest distance score or opening a new group if the lowest score is greater than 0.9. It is repeated until all contigs have been assigned (though some contigs may be the only member of their respective group).
Overview of selected characteristics of the phage clusters identified in the Intesti sample. If known, the family, subfamily and genus of the closest database reference as specified by the ICTV are given. In some cases, the closest reference phage has not been incorporated into the phage taxonomy yet but other references have. For those, both the closest reference and the closest reference within the taxonomy scheme are given. The genus “rv5-like virus” has been proposed by several authors [36,37], but is not confirmed in the current (2014) ICTV release. Remark that Bacteriophage G1 is annotated as a Staphylococcus phage.
| Phage Cluster | Cluster Size in bp | Reference Accession | Average Coverage of Phage Cluster | Average Percent Identity | Reference Phage Description Line | Phage Family | Subfamily | Genus | Size RatioCluster/Reference |
|---|---|---|---|---|---|---|---|---|---|
| D1 | 142,025 | KC012913.1 | 99.97 | 99.80 | 1.01 | ||||
| AY954969.1 | 97.98 | 99.74 | 1.02 | ||||||
| D2 | 76,960 | JX415536.1 | 87.89 | 87.60 | 1.00 | ||||
| D3 | 87,828 | KC862301.1 | 98.97 | 96.16 | 1.00 | ||||
| D4 | 69,023 | KF562340.1 | 87.20 | 94.02 | 0.96 | ||||
| D5 | 150,530 | FR775895.2 | 92.41 | 98.16 | 1.01 | ||||
| D6 | 81,563 | AB609718.1 | 35.55 | 77.46 | 0.57 | ||||
| D7 | 58,193 | KJ094032.2 | 77.23 | 88.35 | - | 1.06 | |||
| D8 | 50,277 | HM035024.1 | 98.16 | 90.67 | - | 0.99 | |||
| D9 | 39,912 | EU734172.1 | 88.25 | 93.45 | 1.02 | ||||
| D10 | 145,982 | KJ190158.1 | 93.95 | 93.00 | 1.05 | ||||
| DQ832317.1 | 93.72 | 92.62 | - | 1.06 | |||||
| D11 | 61,791 | JX094499.1 | 96.33 | 92.95 | 1.04 | ||||
| KC139512.1 | 95.15 | 93.86 | - | 1.04 | |||||
| D12 | 60,451 | KJ010489.1 | 54.57 | 87.35 | 1.42 | ||||
| D13 | 188,630 | GU070616.1 | 88.67 | 94.90 | 1.29 | ||||
| D14 | 133,015 | JX128259.1 | 94.55 | 96.24 | 0.80 | ||||
| DQ904452.1 | 93.42 | 96.00 | 0.80 | ||||||
| D15 | 43,967 | GQ468526.1 | 87.06 | 91.27 | 1.12 | ||||
| FJ194439.1 | 87.13 | 90.61 | 1.11 | ||||||
| D16 | 46,882 | KM233151.1 | 93.68 | 91.47 | 1.06 | ||||
| JX865427.2 | 91.64 | 91.03 | 1.08 | ||||||
| D17 | 41,098 | AY370674.1 | 88.68 | 94.28 | 0.93 | ||||
| D18 | 41,016 | HE775250.1 | 94.95 | 91.57 | 0.97 | ||||
| JX202565.1 | 92.76 | 91.41 | 0.96 | ||||||
| F1 | 13,855 | HG518155.1 | 99.97 | 99.02 | 0.30 | ||||
| AM910650.1 | 91.92 | 97.11 | 0.30 | ||||||
| F2 | 11,476 | EU877232.1 | 99.94 | 91.42 | 0.13 | ||||
| F3 | 5706 | HQ665011.1 | 83.42 | 86.09 | 0.05 | ||||
| AY543070.1 | 82.09 | 87.59 | 0.05 | ||||||
| F4 | 2624 | EF437941.1 | 98.59 | 97.76 | 0.02 | ||||
| Proteus phage | 104,213 | - | - | - | - | - |
Figure 2Comparison of the weighted mean of the depth of coverage between clusters in the Intesti sample. The weighted standard deviation is depicted as error bars. Note that cluster D2 is composed of only one contig and the standard deviation is therefore not applicable. It can be seen that the depth varies greatly between clusters, reflecting the different abundances of the represented phage types in the cocktail.
Depth of coverage ratio of host-amplified samples to the Intesti sample. Combinations with a ratio greater than 1.0 are indicated by green background coloring. Those are thought to be the infecting clusters, as they are more abundant in the host-amplified sample than in the original one. In the last line is shown a phage cluster, which has not even been considered in the initial contig grouping of the Intesti sample because of its small size of only 1346bp and low depth of coverage of only 2×. It has, however, been greatly amplified on P. aeruginosa strain PAO1.Results regarding the amplification on Salmonella were inconclusive and therefore removed (see text).
| Cluster | |||||||
|---|---|---|---|---|---|---|---|
| D1 | 0.03 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 |
| D2 | 0.02 | 0.00 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 |
| D3 | 0.30 | 0.00 | 0.00 | 22.29 | 0.10 | 0.00 | 0.00 |
| D4 | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| D5 | 0.11 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| D6 | 0.06 | 0.00 | 0.02 | 0.00 | 0.01 | 0.01 | 0.00 |
| D7 | 0.05 | 2.57 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 |
| D8 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| D9 | 0.04 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.00 |
| D10 | 0.08 | 0.00 | 0.06 | 0.00 | 0.02 | 0.00 | 0.00 |
| D11 | 0.13 | 0.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| D12 | 0.04 | 0.00 | 0.01 | 0.00 | 0.02 | 0.00 | 0.00 |
| D13 | 0.05 | 0.05 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 |
| D14 | 4.74 | 0.00 | 0.00 | 0.00 | 2.82 | 2.06 | 0.00 |
| D15 | 0.04 | 0.00 | 0.00 | 0.00 | 4.97 | 9.84 | 0.00 |
| D16 | 0.04 | 0.00 | 0.00 | 0.00 | 0.02 | 0.02 | 0.00 |
| D17 | 47.17 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 10.01 |
| D18 | 0.37 | - | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| F1 | 0.00 | 0.00 | 1.47 | 0.00 | 0.00 | 0.00 | 0.00 |
| F2 | 0.00 | - | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| F3 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| F4 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 0.04 | 0.00 | 0.02 | 0.00 | 0.00 | 0.01 | 0.12 | |
| * | 0.00 | 0.00 | 1044.20 | 0.00 | 0.00 | 0.00 | 0.00 |
Note: The cluster marked by an asterisk (*) exists in the Intesti sample but has not been named due to its small size and low depth (see table header).
List of genes potentially relevant for efficacy, found by text mining annotation results. The annotation column details whether the gene was found in the annotation provided by RAST, by BLAST or both. If only one is named the other method either did not predict the gene or annotated it differently. Top BLAST hit, query coverage as given by BLAST and percent identity as given by BLAST are only filled out if applicable. Most genes which were picked up for their RAST annotation still have a BLAST hit description line, query coverage and percent identity values because that gene was also called by GeneMarkS. In any case, the last two columns apply to the BLAST hit, but not necessarily to the hit in the RAST databases. The acridine resistance gene evidenced in D14 was not called by GeneMarkS. If the gene was picked up for its BLAST annotation column 2 and 5 are identical.
| Text Mining Term | Description Line | Part of Cluster | Annotation by | Top BLAST Hit Description Line | Query Coverage | Percent Positives |
|---|---|---|---|---|---|---|
| “virulence” | Phage virulence-associated protein | D1 | RAST | ORF002 (Staphylococcus phage G1) | 100% | 100% |
| Phage virulence-associated protein | D6 | RAST | putative adsorption associated tail protein (Enterococcus phage phiEF24C) | 100% | 95% | |
| “resistance” | Acridine resistance | D14 | RAST | - | - | - |
| Acriflavin resistance protein | D3 | RAST | hypothetical protein PAK_P500103 (Pseudomonas phage PAK_P5) | 100% | 100% | |
| Tellurium resistance protein TerD | D5 | RAST | Phi92_gp172 (Enterobacteria phage phi92) | 100% | 100% | |
| Tellurium resistance protein TerD | D5 | RAST | Phi92_gp173 (Enterobacteria phage phi92) | 100% | 100% | |
| Tellurite resistance protein | D5 | RAST | Phi92_gp178 (Enterobacteria phage phi92) | 100% | 100% | |
| “methyltransferase” or “methylase” | DNA methylase | D7 | RAST/BLAST | See “Description line” | 100% | 99% |
| DNA N-6-adenine-methyltransferase | D8 | RAST/BLAST | See “Description line” | 94% | 90% | |
| putative site specific DNA methylase | D8 | BLAST | See “Description line“ | 100% | 99% | |
| DNA methyltransferase | D13 | RAST/BLAST | See “Description line” | 100% | 99% | |
| putative DNA N-6-adenine methyltransferase | D10 | RAST/BLAST | See “Description line” | 100% | 99% | |
| Dam methylase | D8 | BLAST | See “Description line” | 100% | 100% | |
| putative DNA adenine methylase | D11 | BLAST | See “Description line” | 100% | 100% | |
| putative DNA methyltransferase | unassigned | BLAST | See “Description line” | 100% | 100% | |
| DNA adenine methyltransferase | D14 | BLAST | See “Description line” | 100% | 99% | |
| putative DNA adenine methylase | D11 | RAST/BLAST | See “Description line” | 100% | 97% | |
| dCMPhydroxymethylase | D14 | RAST/BLAST | See “Description line” | 100% | 100% | |
| putative adenine methyltransferase | D10 | RAST/BLAST | See “Description line” | 100% | 98% | |
| DNA-cytosine methyltransferase | D5 | RAST | Phi92_gp043 (Enterobacteria phage phi92) | 100% | 99% | |
| Adenine-specific methyltransferase | D5 | RAST | Phi92_gp155 (Enterobacteria phage phi92) | 100% | 99% | |
| “integrase” | Phage integrase | D2 | RAST/BLAST | putative integrase (Escherichia phage KBNP1711) | 100% | 98% |
| Phage integrase | D4 | RAST/BLAST | integrase (Enterobacter phage IME11) | 100% | 99% |
Figure 3Rarefaction curve of the Intesti sequencing sample. The curve appears to flatten out as the percentage of reads used increases, indicating that the total assembly size is converging. This means that the most common phages are well represented in the sequencing reads. Phages that are in low abundance may not be adequately covered though.
Fraction of the strains found to be susceptible for each pathogen tested. Observe that this is only a small-scale experiment. All strains are part of an in-house collection.
| Pathogen | Susceptible Strains |
|---|---|
| 10/10 | |
| 5/10 | |
| 5/5 | |
| 5/7 | |
| 2/6 | |
| 3/5 | |
| 2/5 |