| Literature DB >> 34904625 |
Andrzej Zielezinski1, Sebastian Deorowicz2, Adam Gudyś2.
Abstract
SUMMARY: PHIST (Phage-Host Interaction Search Tool) predicts prokaryotic hosts of viruses based on exact matches between viral and host genomes. It improves host prediction accuracy at species level over current alignment-based tools (on average by 3 percentage points) as well as alignment-free and CRISPR-based tools (by 14-20 percentage points). PHIST is also two orders of magnitude faster than alignment-based tools making it suitable for metagenomics studies.Entities:
Year: 2021 PMID: 34904625 PMCID: PMC8826084 DOI: 10.1093/bioinformatics/btab837
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.(A) Host prediction accuracy for 2288 viral genomes against 62 493 candidate hosts, binned by host taxonomic level. (B) Host prediction accuracy at the species level across five highest scoring prokaryotic species for each virus sequence (the prediction was scored as correct if the correct host species was among the first five predicted hosts). (C) Unique and shared correct virus–host predictions among the tools. The bar chart indicates the intersection size of viruses with correct host prediction at the species level. Connected dots on the bottom panel indicate which pair of tools is considered for each intersection. (D) Runtime and memory usage for 2288 viruses and 62 493 prokaryotes. Time and memory measurements of Phirbo also include BLASTN, and SpacePHARER includes PILER-CR analysis. (E) Prediction accuracies for contigs subsampled at various lengths from the 2288 virus genomes. The solid line shows the mean prediction accuracy at host taxonomic level and the grey shade indicates the 95% confidence interval. The horizontal dashed line marks the prediction accuracy using the full-length viral genomes. (F and G) Proportions of congruent predictions for viral contigs between PHIST and those in Paez-Espino et al. and Nayfach et al., respectively. (H) Viral assignments to human gut bacteria and archaea. Distribution of host classes reported by Nayfach et al. (upper bar chart) and a set of PHIST predictions on viruses that were not assigned to hosts by Nayfach et al. (lower bar chart)