| Literature DB >> 24931987 |
Mengfei Cao1, Christopher M Pietras1, Xian Feng1, Kathryn J Doroschak1, Thomas Schaffner1, Jisoo Park1, Hao Zhang1, Lenore J Cowen1, Benjamin J Hescott1.
Abstract
MOTIVATION: It has long been hypothesized that incorporating models of network noise as well as edge directions and known pathway information into the representation of protein-protein interaction (PPI) networks might improve their utility for functional inference. However, a simple way to do this has not been obvious. We find that diffusion state distance (DSD), our recent diffusion-based metric for measuring dissimilarity in PPI networks, has natural extensions that incorporate confidence, directions and can even express coherent pathways by calculating DSD on an augmented graph.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24931987 PMCID: PMC4058952 DOI: 10.1093/bioinformatics/btu263
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.An example of constructing auxiliary graphs for calculating different DSDs (with our BioGRID confidence scores). (a) The original PPI network and two KEGG pathways; (b) the weight graph with PPI confidence score as edge weights; (c) the directed graph with KEGG PPIs added; and (d) the augmented graph by incorporating KEGG pathways as weighted paths
Summary of protein MIPS function prediction performance for the STRING integrative network Gstr using DSD, cDSD/caDSD and capDSD compared to the original methods in 10 runs of 2-fold cross-validation (as a percentage)
| MIPS 1 | MIPS 2 | MIPS 3 | ||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | F1 score | Accuracy | F1 score | Accuracy | F1 | |||
| Majority Vote (MV) | 65.71 ± 0.36 | 49.50 ± 0.25 | 53.95 ± 0.47 | 37.96 ± 0.19 | 46.17 ± 0.50 | 33.75 ± 0.33 | ||
| MV with original DSD | 64.93 ± 0.56 | 48.55 ± 0.42 | 50.99 ± 0.35 | 36.10 ± 0.24 | 44.47 ± 0.35 | 31.85 ± 0.22 | ||
| MV with cDSD/caDSD | 69.38 ± 0.71 | 51.54 ± 0.36 | 58.01 ± 0.50 | 40.41 ± 0.32 | 51.48 ± 0.46 | 36.86 ± 0.32 | ||
| MV with capDSD | 70.25 ± 0.47 | 52.22 ± 0.39 | 61.22 ± 0.57 | 42.52 ± 0.29 | 55.54 ± 0.44 | 39.36 ± 0.21 | ||
| Weighted MV (WMV) with original DSD | 65.25 ± 0.45 | 49.15 ± 0.44 | 52.19 ± 0.42 | 37.10 ± 0.29 | 45.64 ± 0.41 | 33.00 ± 0.16 | ||
| WMV with cDSD/caDSD | 69.67 ± 0.56 | 52.20 ± 0.37 | 59.41 ± 0.42 | 41.62 ± 0.26 | 53.21 ± 0.37 | 38.29 ± 0.28 | ||
| Multi-way Cut (GMC) | 63.48 ± 0.56 | 43.03 ± 0.20 | 52.66 ± 0.54 | 31.67 ± 0.18 | 43.37 ± 0.60 | 26.20 ± 0.19 | ||
| GMC with original DSD | 63.29 ± 0.68 | 42.80 ± 0.23 | 52.34 ± 0.56 | 31.60 ± 0.21 | 43.59 ± 0.33 | 26.39 ± 0.18 | ||
| GMC with cDSD/caDSD | 65.18 ± 0.38 | 43.39 ± 0.16 | 53.59 ± 0.47 | 31.89 ± 0.18 | 44.46 ± 0.36 | 26.50 ± 0.17 | ||
| GMC with capDSD | 65.21 ± 0.46 | 43.31 ± 0.15 | 51.09 ± 0.37 | 30.74 ± 0.20 | 40.73 ± 0.40 | 25.49 ± 0.21 | ||
| Functional Flow (FF) | 39.91 ± 0.77 | 31.61 ± 0.25 | 22.26 ± 0.53 | 17.25 ± 0.21 | 18.48 ± 0.49 | 14.26 ± 0.09 | ||
| FF with original DSD | 47.44 ± 0.42 | 36.46 ± 0.18 | 29.46 ± 0.30 | 21.06 ± 0.25 | 23.08 ± 0.21 | 16.68 ± 0.16 | ||
| FF with cDSD/caDSD | 51.70 ± 0.43 | 38.57 ± 0.21 | 34.67 ± 0.27 | 24.03 ± 0.19 | 28.32 ± 0.35 | 19.39 ± 0.20 | ||
| FF with capDSD | 53.00 ± 0.37 | 39.73 ± 0.19 | 37.93 ± 0.50 | 26.56 ± 0.18 | 31.18 ± 0.36 | 21.59 ± 0.20 | ||
Note: Weighted majority vote with capDSD (in bold) gives the best results over all three levels of the MIPS hierarchy.
Confidence score assignment for PPIs when either only low-throughput or only high-throughput experiments are present
| No. of experiments | Low-throughput | High-throughput |
|---|---|---|
| 0 | 0 | 0 |
| 1 | 0.80 | 0.25 |
| 2 | 0.90 | 0.50 |
| 3 | 0.95 | 0.75 |
| ≥4 | 0.95 | 0.85 |
Summary of protein MIPS function prediction performance for the physical PPI network using DSD, cDSD, caDSD and capDSD compared to the original methods in 10 runs of 2-fold cross-validation (as a percentage)
| MIPS 1 | MIPS 2 | MIPS 3 | ||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | F1 score | Accuracy | F1 score | Accuracy | F1 | |||
| Majority Vote (MV) | 50.08 ± 0.72 | 41.45 ± 0.40 | 40.69 ± 0.49 | 30.85 ± 0.33 | 38.03 ± 0.37 | 29.50 ± 0.14 | ||
| MV with original DSD | 62.96 ± 0.45 | 47.40 ± 0.28 | 49.41 ± 0.65 | 35.71 ± 0.33 | 43.87 ± 0.47 | 32.33 ± 0.18 | ||
| MV with cDSD | 66.16 ± 0.56 | 49.10 ± 0.24 | 53.08 ± 0.54 | 38.12 ± 0.16 | 47.73 ± 0.56 | 35.13 ± 0.33 | ||
| MV with caDSD (directed edges) | 67.61 ± 0.56 | 50.37 ± 0.22 | 59.11 ± 0.67 | 41.58 ± 0.19 | 52.14 ± 0.55 | 38.09 ± 0.16 | ||
| MV with caDSD (no directed edges) | 67.61 ± 0.42 | 50.36 ± 0.24 | 59.11 ± 0.57 | 41.57 ± 0.25 | 52.13 ± 0.56 | 38.07 ± 0.21 | ||
| MV with capDSD | 67.60 ± 0.37 | 50.28 ± 0.27 | 59.46 ± 0.57 | 41.58 ± 0.22 | 52.97 ± 0.59 | 38.19±0.23 | ||
| Weighted MV (WMV) with original DSD | 63.40 ± 0.51 | 48.29 ± 0.25 | 50.69 ± 0.82 | 36.74 ± 0.36 | 45.20 ± 0.58 | 33.72 ± 0.27 | ||
| WMV with cDSD | 67.07 ± 0.45 | 50.12 ± 0.35 | 54.82 ± 0.56 | 39.53 ± 0.18 | 49.56 ± 0.49 | 36.71 ± 0.32 | ||
| WMV with caDSD (directed edges) | 68.69 ± 0.40 | 51.48 ± 0.29 | 60.96 ± 0.51 | 43.13 ± 0.23 | 54.51 ± 0.51 | 39.91 ± 0.28 | ||
| WMV with caDSD (no directed edges) | 68.68 ± 0.41 | 51.48 ± 0.25 | 60.96 ± 0.53 | 43.13 ± 0.22 | 54.51 ± 0.46 | 39.90 ± 0.32 | ||
| Multi-way Cut (GMC) | 55.31 ± 0.41 | 42.18 ± 0.29 | 42.02 ± 0.43 | 28.21 ± 0.36 | 36.69 ± 0.50 | 24.98 ± 0.21 | ||
| GMC with original DSD | 58.36 ± 0.32 | 42.51 ± 0.19 | 44.63 ± 0.32 | 29.51 ± 0.27 | 38.20 ± 0.40 | 25.49 ± 0.22 | ||
| GMC with cDSD | 61.11 ± 0.37 | 42.85 ± 0.23 | 47.11 ± 0.35 | 30.52 ± 0.25 | 40.83 ± 0.61 | 26.66 ± 0.22 | ||
| GMC with caDSD (directed edges) | 62.71 ± 0.30 | 43.46 ± 0.24 | 52.59 ± 0.25 | 32.47 ± 0.30 | 44.29 ± 0.63 | 28.46 ± 0.19 | ||
| GMC with caDSD (no directed edges) | 62.76 ± 0.31 | 43.45 ± 0.25 | 52.61 ± 0.25 | 32.50 ± 0.30 | 44.31 ± 0.63 | 28.46 ± 0.19 | ||
| GMC with capDSD | 62.44 ± 0.31 | 43.43 ± 0.17 | 52.30 ± 0.46 | 32.48 ± 0.31 | 44.18 ± 0.59 | 28.34 ± 0.32 | ||
| Functional Flow (FF) | 50.48 ± 0.48 | 37.17 ± 0.25 | 32.57 ± 0.48 | 22.64 ± 0.32 | 25.29 ± 0.39 | 18.27 ± 0.14 | ||
| FF with original DSD | 53.58 ± 0.36 | 40.75 ± 0.11 | 38.20 ± 0.65 | 26.71 ± 0.29 | 30.70 ± 0.45 | 22.29 ± 0.28 | ||
| FF with cDSD | 57.78 ± 0.49 | 42.82 ± 0.27 | 42.17 ± 0.58 | 29.29 ± 0.38 | 35.68 ± 0.48 | 25.72 ± 0.17 | ||
| FF with caDSD (directed edges) | 60.09 ± 0.55 | 44.81 ± 0.24 | 49.73 ± 0.41 | 33.89 ± 0.32 | 40.82 ± 0.60 | 28.94 ± 0.27 | ||
| FF with caDSD (no directed edges) | 60.18 ± 0.47 | 44.80 ± 0.20 | 49.67 ± 0.51 | 33.89 ± 0.28 | 40.82 ± 0.51 | 28.97 ± 0.23 | ||
| FF with capDSD | 58.98 ± 0.53 | 43.80 ± 0.27 | 49.32 ± 0.61 | 33.32 ± 0.29 | 41.04 ± 0.33 | 28.83 ± 0.33 | ||
Note: Weighted majority vote with capDSD (in bold) gives the best results over all three levels of the MIPS hierarchy.