| Literature DB >> 30890181 |
Tingting Zheng1, Jun Li2,3, Yueqiong Ni4, Kang Kang4, Maria-Anna Misiakou5, Lejla Imamovic5, Billy K C Chow6, Anne A Rode7,8, Peter Bytzer7,8, Morten Sommer9, Gianni Panagiotou10,11,12.
Abstract
BACKGROUND: Viruses are important components of microbial communities modulating community structure and function; however, only a couple of tools are currently available for phage identification and analysis from metagenomic sequencing data. Here we employed the random forest algorithm to develop VirMiner, a web-based phage contig prediction tool especially sensitive for high-abundances phage contigs, trained and validated by paired metagenomic and phagenomic sequencing data from the human gut flora.Entities:
Keywords: Antibiotics; Metagenome; Phage; Phage-host interaction
Mesh:
Substances:
Year: 2019 PMID: 30890181 PMCID: PMC6425642 DOI: 10.1186/s40168-019-0657-y
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Fig. 1The workflow of VirMiner. The high-quality reads are assembled into contigs using IDBA_UD [33]. HMM model from GeneMark [34] was used for gene prediction. Functional profiles are generated by searching against different databases including KO, Pfam, and viral protein families defined by Paez-Espino et al. [24], POG 2012, and uPOGs. The R package randomForest was employed to identify phage contigs. The taxonomy affiliations of identified phage contigs are identified using the RDP classifier. Phage-host interaction prediction was performed using the CRISPR-spacer based method
Statistics for different POG databases
| uPOGs | pVOGs | POG 2012 | |
|---|---|---|---|
| Genomes | 4078 | 2993 | 1027 |
| Proteins | 357,460 | 295,653 | 97,731 |
| POGs | 16,710 | 9518 | 4542 |
Comparison of functionality for VirMiner (developed here), VirFinder [12], VirSorter [26], and iVirus [56]
| Tools | Input raw reads | Functional annotation | Phage contig identification | Inter-group comparison | Phage-host relationship prediction | Taxonomy analysis |
|---|---|---|---|---|---|---|
| VirMiner | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| VirSorter | ✖ | ✔ | ✔ | ✖ | ✖ | ✖ |
| VirFinder | ✖ | ✖ | ✔ | ✔ | ✖ | ✖ |
| iVirus | ✔ | ✔ | ✔ | ✖ | ✖ | ✔ |
Fig. 2Performance comparison of VirMiner, VirSorter, and VirFinder for long contigs (> 5 kb). The predictive performance was measured based on reads ratio of correctly predicted contigs (a) and the number of correctly predicted contigs (b). The predictive performance of VirFinder was evaluated with five different cutoffs including four commonly used cutoffs (FDR < 0.01, 0.05, 0.1, and 0.15) and the value at which VirFinder can achieve the same false positive rate (FPR) as VirSorter (FDR < 0.0220714)
Fig. 3True positive phage contigs identified by VirMiner, VirSorter, and VirFinder for contigs > 1 kb, 3 kb, and 5 kb. Box plots showing abundances of true positive phage contigs identified by the three tools (a). Only true positive phage contigs with relative abundance > 0.001 were selected for visualization. Abundance of phage contigs referred to the ratio of reads represented by contigs over total reads in the sample. Venn diagram showing the overlap of true positive phage contigs identified by the three tools, for contigs > 1 kb, 3 kb, and 5 kb, respectively (b)
Fig. 4Box plot showing genus-level microbial diversity of phages based on the independent cohort of humans treated with antibiotics. The microbial diversity was measured by diversity indices including the Shannon index (a), Simpson index (b), and Pielou eveness index (c). Data are shown based on three conditions before treatment, at the end of treatment and at 90 days after the end of treatment. The alpha diversity of before-treatment (“exposed 0”) phage communities was significantly higher compared to group exposed to antibiotics for 7 days (“exposed 7”) (Wilcoxon test; Shannon index, p < 0.044)
Fig. 5The phage-host interaction network produced from the independent cohort of humans treated with antibiotics at phage species level. The phage-host network was produced from antibiotics-exposed subjects that were divided into three groups (“exposed 0,” “exposed 7,” and “exposed 90”) (a) and control subjects that also had three groups (“control 0,” “control 7,” and “control 90”) (b). Phage (red) and bacteria (gray) are connected if the bacteria species were predicted as the host of phages. The edge color indicates the individual’s id where the phage-host interaction was observed