| Literature DB >> 20507917 |
Sebastian Briesemeister1, Jörg Rahnenführer, Oliver Kohlbacher.
Abstract
Predicting subcellular localization has become a valuable alternative to time-consuming experimental methods. Major drawbacks of many of these predictors is their lack of interpretability and the fact that they do not provide an estimate of the confidence of an individual prediction. We present YLoc, an interpretable web server for predicting subcellular localization. YLoc uses natural language to explain why a prediction was made and which biological property of the protein was mainly responsible for it. In addition, YLoc estimates the reliability of its own predictions. YLoc can, thus, assist in understanding protein localization and in location engineering of proteins. The YLoc web server is available online at www.multiloc.org/YLoc.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20507917 PMCID: PMC2896088 DOI: 10.1093/nar/gkq477
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Performance of the YLoc and other state-of-the-art predictors on the BaCelLo IDS (27) (B) and Höglund IDS (20) (H) concerning F1 and ACC (in brackets)
| Data set | YLoc- LowRes | YLoc- HighRes | YLoc+ | MultiLoc2- LowRes | MultiLoc2- HighRes | BaCelLo | LOCTree | WoLF PSORT | Euk-mPloc |
|---|---|---|---|---|---|---|---|---|---|
| B Animals | 0.75 (0.79) | 0.69 (0.74) | 0.67 (0.58) | 0.76 (0.73) | 0.71 (0.68) | 0.66 (0.64) | 0.58 (0.62) | 0.67 (0.70) | 0.54 (0.61) |
| B Fungi | 0.61 (0.56) | 0.51 (0.56) | 0.51 (0.48) | 0.61 (0.60) | 0.58 (0.53) | 0.60 (0.57) | 0.43 (0.47) | 0.51 (0.50) | 0.56 (0.60) |
| B Plants | 0.58 (0.71) | 0.54 (0.58) | 0.49 (0.58) | 0.64 (0.76) | 0.54 (0.62) | 0.56 (0.69) | 0.58 (0.70) | 0.46 (0.57) | 0.37 (0.46) |
| H Animals | − (−) | 0.34 (0.56) | 0.37 (0.53) | − (−) | 0.41 (0.57) | − (−) | − (−) | 0.18 (0.36) | 0.24 (0.27) |
Performance of YLoc on the BaCelLo animal IDS (27) for different minimum confidence scores
| Predictor | Measure | 0.0 | 0.2 | 0.4 | 0.6 | 0.8 | 0.9 |
|---|---|---|---|---|---|---|---|
| YLoc-LowRes | 0.75 | 0.76 | 0.78 | 0.80 | 0.84 | 0.95 | |
| ACC | 0.79 | 0.79 | 0.81 | 0.86 | 0.91 | 0.93 | |
| No. of instances | 576 | 467 | 395 | 299 | 189 | 118 | |
| YLoc-HighRes | 0.69 | 0.74 | 0.76 | 0.76 | 0.77 | 0.77 | |
| ACC | 0.74 | 0.78 | 0.80 | 0.82 | 0.83 | 0.84 | |
| No. of instances | 576 | 507 | 470 | 428 | 391 | 354 | |
| YLoc+ | 0.67 | 0.69 | 0.72 | 0.77 | 0.76 | 0.81 | |
| ACC | 0.58 | 0.60 | 0.62 | 0.65 | 0.65 | 0.69 | |
| No. of instances | 576 | 494 | 423 | 324 | 219 | 142 |
Figure 1.The attribute table of the YLoc web service lists all attributes in order of their influence on the prediction outcome. All attributes are expressed in biological terms. The +(+) or −(−) indicates whether that attribute value is (very) typical or (very) untypical for a location.
Figure 2.The detailed attribute page of the feature ‘secretory pathway sorting signal’ in YLoc-LowRes (animal version). The distribution of proteins from the cy, mi, nu and SP over the different attribute intervals is shown.