| Literature DB >> 28724991 |
Biruhalem Taye1,2,3, Candida Vaz4, Vivek Tanavde4,5, Vladimir A Kuznetsov4,6, Frank Eisenhaber4,7,6, Richard J Sugrue8, Sebastian Maurer-Stroh4,7,9.
Abstract
Several available online tools provide network growing functions where an algorithm utilizing different data sources suggests additional genes/proteins that should connect an input gene set into functionally meaningful networks. Using the well-studied system of influenza host interactions, we compare the network growing function of two free tools GeneMANIA and STRING and the commercial IPA for their performance of recovering known influenza A virus host factors previously identified from siRNA screens. The result showed that given small (~30 genes) or medium (~150 genes) input sets all three network growing tools detect significantly more known host factors than random human genes with STRING overall performing strongest. Extending the networks with all the three tools significantly improved the detection of GO biological processes of known host factors compared to not growing networks. Interestingly, the rate of identification of true host factors using computational network growing is equal or better to doing another experimental siRNA screening study which could also be true and applied to other biological pathways/processes.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28724991 PMCID: PMC5517527 DOI: 10.1038/s41598-017-06020-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The trend of network growing tools in detecting known IHFs. (a) Number of detected known IHFs upon growing networks with the small set (28 IHF) seeds and 153 known IHFs as positive set. (b) Number of detected known IHFs upon growing networks with the small set (28 IHF) seeds and 1463 known IHFs as positive set. (c) Number of detected known IHFs upon growing networks with medium set (153 IHFs) seeds and 1463 known IHFs as positive set. (d) Percentage of detected known IHFs upon growing networks with the small set (28 IHF) seeds and 153 known IHFs as positive set. (e) Percentage of detected known IHFs upon growing networks with the small set (28 IHF) seeds and 1463 known IHFs as positive set. (f) Percentage of detected known IHFs upon growing networks with medium set (153 IHFs) seeds and 1463 known IHFs as positive set.
Figure 2Detection rate of network growing tools after randomization of IHFs and random human proteins. (a) Comparison of the detection performance of the network growing tools after growing 100 genes using either small set (30 genes) IHFs or random human proteins as seeds and 1463 IHFs as positive sets. P-values are the result of a paired Student-t test analysis. (b) Comparison of the detection performance of the network growing tools after growing 100 genes using either medium set (146 genes) IHFs or random human proteins as seeds and 1463 IHFs as positive sets. P-values are the result of a paired Student-t test analysis. (c) Detection performance comparison of the three network growing tools (d) The detection rate of STRING and GeneMANIA after combination and interchange of the 1st 50 grown genes.
Figure 3Performance comparison of the three network growing tools in terms of KEGG pathways and GO BP. (a) Rate of overlapping KEGG pathways before and after network growing relative to pathways of the 153 and 1463 known IHFs. (b) Rate of overlapping GO BP of the grown genes relative to GO BP of the 153 and 1463 known IHFs. (c) Pairwise analysis of KEGG pathways from the grown genes in the network growing tools and KEGG pathways of the positive sets (1463 IHFs) (d) Pairwise analysis of GO BP from the grown genes in the network growing tools and GO BP of the positive sets (1463 IHFs).
Figure 4Comparison of siRNA experimental studies and network growing tools in detecting known IHFs.