| Literature DB >> 21106130 |
Min Wu1, Xiaoli Li, Hon Nian Chua, Chee-Keong Kwoh, See-Kiong Ng.
Abstract
BACKGROUND: Protein-protein interactions (PPIs) play important roles in various cellular processes. However, the low quality of current PPI data detected from high-throughput screening techniques has diminished the potential usefulness of the data. We need to develop a method to address the high data noise and incompleteness of PPI data, namely, to filter out inaccurate protein interactions (false positives) and predict putative protein interactions (false negatives).Entities:
Mesh:
Substances:
Year: 2010 PMID: 21106130 PMCID: PMC2957691 DOI: 10.1186/1471-2105-11-S7-S8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
AUC of each method using original raw scores and Bin-Confidences, respectively.
| Methods/Data | DIP data | BioGrid data | |||
|---|---|---|---|---|---|
| Original Raw Scores | Bin-Confidence | Original Raw Scores | Bin-Confidence | ||
| FS-weight | 0.741 | 0.768 | 0.741 | 0.745 | |
| PathRatio | 0.702 | 0.710 | - | - | |
| IRAP | 0.686 | 0.723 | 0.580 | 0.594 | |
| Gene-expression Correlation | 0.549 | 0.560 | 0.580 | 0.566 | |
| Interacting Domains | 0.547 | 0.544 | 0.561 | 0.562 | |
| Sequence Similarity | 0.576 | 0.569 | 0.523 | 0.529 | |
| InterSVM | 0.776 | 0.804 | 0.749 | 0.768 | |
| InterBC | 0.710 | 0.787 | 0.600 | 0.750 | |
Figure 1AUC for each method as the parameter . Figure 1 shows the AUC for each method as the parameter μ varies on DIP data.
Figure 2AUC for each method as the parameter . Figure 2 shows the AUC for each method as the parameter μ varies on BioGrid data.
Figure 3The average functional similarity of top-ranked interactions in DIP data. Each interaction may have functional similarity score which is the semantic similarity of GO terms annotating these two interacting proteins. Figure 3 shows the average functional similarity of top-ranked interactions in DIP data.
Figure 4The average functional similarity of top-ranked interactions in BioGrid data. Figure 4 shows the average functional similarity of top-ranked interactions in BioGrid data.
Figure 5The average functional similarity of top-ranked false negative candidates generated from DIP data. In DIP data, protein pairs with at least 2 common neighbors were selected as false negative candidates, resulting in 33482 such candidates. Figure 5 shows the average functional similarity of the top-ranked false negative candidates.
The comparison between Patil's method and ours.
| Methods/Data | DIP data | BioGrid data | ||
|---|---|---|---|---|
| InterSVM | 0.852 | 0.865 | 0.797 | 0.818 |
| InterBC | 0.743 | 0.841 | 0.776 | 0.798 |