| Literature DB >> 23514582 |
Tobias Hamp1, Rebecca Kassner, Stefan Seemayer, Esmeralda Vicedo, Christian Schaefer, Dominik Achten, Florian Auer, Ariane Boehm, Tatjana Braun, Maximilian Hecht, Mark Heron, Peter Hönigschmid, Thomas A Hopf, Stefanie Kaufmann, Michael Kiening, Denis Krompass, Cedric Landerer, Yannick Mahlich, Manfred Roos, Burkhard Rost.
Abstract
BACKGROUND: Any method that de novo predicts protein function should do better than random. More challenging, it also ought to outperform simple homology-based inference.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23514582 PMCID: PMC3584931 DOI: 10.1186/1471-2105-14-S3-S7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1A functional annotation and its prediction. This Figure shows one annotation of a sample protein A and its prediction. Each node in a graph corresponds to one GO term and the edges to relationships such as "is a" or "part of". The edges always point to the root node (either "MFO" or "BPO"), which by itself is not informative and discarded in every evaluation. For clearity, the left subgraph only shows the experimental annotation of A. This means, all GO terms have either been experimentally verified or inferred from the same. The red circles indicate the leaf terms, i.e. the nodes which are not a parent of any other term. In the right subgraph, we see the experimental annotation again, but overlaid with predicted terms (green) and their reliabilities. This time, the leaf terms correspond to the predicted GO annotation, instead of the actual annotation.
Comparison of student methods.
| GO term counts | E-Values | GO term counts; percentage positives | |
| 1 | 2 | 2 | |
| maximum child | maximum child | maximum child; cumulative | |
| No | Yes | No | |
In this table, we have summarized the key differences between student methods. Input features include: the number of times a GO term appeared in the annotations of homologous proteins; the E-Values of the homologous proteins; and the percentage of 'positive' columns in their alignment matrices. Some groups used more than one way to score a GO term or differed during the propagation of a prediction by assigning a node the maximum value of its children or their sum. StudentB normalized the final score of a GO term to improve comparability among proteins.
Figure 5Results of evaluations before and after CAFA. Here, we show the results of all methods for each ontology and measure. Baseline classifiers share the same color (cyan), just like methods corresponding to the same design, but different parameter values (blue). Curves derived from the CAFA organizers are solid and bold, otherwise thin and dotted. As the area between recall 0.0 - 0.2 and precision 0.45 - 0.55 is extremely crowded in the BPO threshold measure plot, we provide an enlarged view with the inlet. In the BPO leaf threshold measure plot, Priors' is at the origin (0.0, 0.0).
Ranking of methods with respect to the maximum F1 score of the threshold measure curves.
| 0.29 | 13 | 0.47 | 4 | |
| 0.21 | - | 0.34 | - | |
| 0.27 | 15 | 0.41 | 12 | |
| 0.32 | 8 | 0.40 | 13 | |
| 0.15 | - | 0.20 | - | |
| 0.28 | 14 | 0.36 | - | |
| 0.37 | 1 | 0.49 | 1 | |
| 0.20 | - | 0.29 | - | |
| 0.33 | 8 | 0.43 | 10 | |
| 0.36 | 3 | 0.45 | 7 | |
| 0.34 | 6 | 0.48 | 3 | |
| 0.36 | 3 | 0.48 | 3 | |
This table shows the maximum F1 score (Fmax) of each threshold measure curve in Figure 5 and its rank in the list of competing methods which was shown at CAFA. This list actually consists of 36 predictors, but only the scores and ranks of the top 15 performers have been released. Classifiers which are actually part of this list are kept in bold. Ranks of other methods are hypothetical, either because calculated after CAFA or because discarded by the CAFA organizers. They considered only one method per participating group and we chose method A. Results for StudentB were compiled with the bug (Methods).
Ranking methods by maximal F1 score for various measures.
| 8 | 8 | 11 | 7 | 6 | 11 | |
| 10 | 10 | 10 | 10 | 10 | 6 | |
| 9 | 9 | 9 | 6 | 9 | 10 | |
| 6 | 6 | 8 | 2 | 3 | 9 | |
| 5 | 5 | 5 | 8 | 7 | 5 | |
| 3 | 4 | 4 | 5 | 5 | 2 | |
| 11 | 11 | 7 | 11 | 11 | 7 | |
| 2 | 2 | 1 | 3 | 4 | 1 | |
| 7 | 7 | 6 | 9 | 8 | 8 | |
| 4 | 3 | 3 | 4 | 2 | 4 | |
| 1 | 1 | 2 | 1 | 1 | 3 | |
We calculated the maximum F1 score (Eqn. 1) for each method and curve presented in Figure 5 and ranked the methods accordingly. The number in each cell is the rank of the method in the respective category. As we evaluated 11 different methods, ranks range from 1 (best) to 11 (worst). Results for StudentB were compiled with the bug (Methods).