| Literature DB >> 18269702 |
Jean-Philippe Vert1, Jian Qiu, William S Noble.
Abstract
BACKGROUND: Much recent work in bioinformatics has focused on the inference of various types of biological networks, representing gene regulation, metabolic processes, protein-protein interactions, etc. A common setting involves inferring network edges in a supervised fashion from a set of high-confidence edges, possibly characterized by multiple, heterogeneous data sets (protein sequence, gene expression, etc.).Entities:
Mesh:
Substances:
Year: 2007 PMID: 18269702 PMCID: PMC2230501 DOI: 10.1186/1471-2105-8-S10-S8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Performance on reconstruction of the yeast metabolic networks.
| MLPK | TPPK | MLPK + TPPK | Direct | ||||
| Data | Accuracy | AUC | Accuracy | AUC | Accuracy | AUC | AUC |
| Expression | 77.9 ± 1.2 | 84.8 ± 1.2 | 77.4 ± 0.9 | 84.1 ± 0.4 | 78.2 ± 0.9 | 84.9 ± 1.3 | 51.9 ± 1.6 |
| Localization | 63.8 ± 2.2 | 67.5 ± 3.0 | 62.4 ± 1.0 | 65.6 ± 0.8 | 64.4 ± 0.9 | 66.3 ± 1.0 | 55.1 ± 1.4 |
| Phylogenetic profile | 79.5 ± 0.9 | 84.3 ± 0.9 | 77.7 ± 1.6 | 83.6 ± 1.7 | 80.7 ± 0.8 | 85.4 ± 1.1 | 60.7 ± 1.4 |
| Yeast two-hybrid | 75.9 ± 1.2 | 82.5 ± 1.4 | 59.4 ± 1.0 | 65.4 ± 1.7 | 76.7 ± 0.8 | 83.0 ± 0.4 | 51.6 ± 1.4 |
| Sum | 83.9 ± 0.7 | 91.6 ± 0.5 | 84.0 ± 0.7 | 91.2 ± 0.4 | 83.9 ± 0.9 | 91.5 ± 0.6 | 60.6 ± 1.3 |
| Pairwise sum | 81.4 ± 0.5 | 89.0 ± 0.4 | 80.7 ± 1.1 | 88.6 ± 0.6 | 81.6 ± 0.7 | 89.2 ± 0.8 | - |
The table lists, for each type of data, the accuracy and area under the ROC curve obtained by each pairwise kernel. Values in the tables are means and standard errors in a 3 × 5 cv experiment. TPPK is the tensor product pairwise kernel, and MLPK is the metric learning pairwise kernel. The column MLPK + TTPK shows the results when an SVM is trained with the sum of the MLPK and TPPK pairwise kernels. The row Sum shows the results when the kernel between the genes is the sum of the expression, localization, phylogenetic profile and yeast two-hybrid kernels. The line Pairwise sum shows the results obtained with the SVM when the pairwise kernel used is the sum of pairwise kernels derived from the expression, localization, phylogenetic profile and yeast two-hybrid kernels, respectively. The Direct column shows the result of the direct method, where gene pairs are ranked according to their distance as defined by each kernel to predict edges.
Performance on reconstruction of the yeast co-complex networks.
| MLPK | TPPK | MLPK + TPPK | Direct | ||||
| Data | Accuracy | AUC | Accuracy | AUC | Accuracy | AUC | AUC |
| Localization | 76.2 ± 1.0 | 76.9 ± 2.0 | 79.5 ± 1.8 | 82.9 ± 1.7 | 80.6 ± 0.7 | 83.0 ± 1.2 | 73.9 ± 1.4 |
| Chip-chip | 82.2 ± 1.1 | 89.7 ± 0.8 | 63.8 ± 1.2 | 68.0 ± 1.1 | 84.4 ± 1.2 | 90.8 ± 1.2 | 58.4 ± 1.5 |
| Pfam | 92.1 ± 0.9 | 98.0 ± 0.5 | 86.1 ± 1.0 | 91.8 ± 0.9 | 93.8 ± 0.3 | 98.5 ± 0.1 | 67.3 ± 1.2 |
| PSI-BLAST | 89.0 ± 0.9 | 97.0 ± 0.1 | 88.3 ± 1.0 | 93.5 ± 0.9 | 93.1 ± 0.6 | 97.9 ± 0.2 | 67.8 ± 1.2 |
| Sum | 93.6 ± 0.3 | 98.7 ± 0.2 | 94.1 ± 0.6 | 98.0 ± 0.3 | 95.8 ± 0.3 | 99.1 ± 0.3 | 79.9 ± 0.8 |
| Pairwise sum | 93.3 ± 0.8 | 98.2 ± 0.4 | 90.5 ± 0.9 | 96.3 ± 0.7 | 95.2 ± 0.3 | 98.9 ± 0.2 | - |
The table lists, with the notation conventions explained in Figure 1, the results of the different methods on the reconstruction of the yeast co-complex networks.