| Literature DB >> 16953874 |
Davide Baú1, Alberto J M Martin, Catherine Mooney, Alessandro Vullo, Ian Walsh, Gianluca Pollastri.
Abstract
BACKGROUND: We describe Distill, a suite of servers for the prediction of protein structural features: secondary structure; relative solvent accessibility; contact density; backbone structural motifs; residue contact maps at 6, 8 and 12 Angstrom; coarse protein topology. The servers are based on large-scale ensembles of recursive neural networks and trained on large, up-to-date, non-redundant subsets of the Protein Data Bank. Together with structural feature predictions, Distill includes a server for prediction of Calpha traces for short proteins (up to 200 amino acids).Entities:
Mesh:
Substances:
Year: 2006 PMID: 16953874 PMCID: PMC1574355 DOI: 10.1186/1471-2105-7-402
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 2Distill's flowchart. Information flows from the top of the chart to the bottom.
Figure 1Distill's single-query interface. The multi-query interface is identical, except that the query name box is missing (names are extracted directly from the FASTA format).
Figure 3XXStout PNG example output. The 12Å contact map predicted for the example given in the "output format" section.
XXStout performance.
| separation | ≥ 6 | ≥ 12 | ≥ 24 |
| 8Å | 46.4% (5.9%) | 35.4% (5.7%) | 19.8% (4.6%) |
| 12Å | 89.9% (2.3%) | 62.5% (2.0%) | 49.9% (2.2%) |
Top protein length/5 contacts classification performance as: precision%(recall%)
XXStout performance.
| separation | ≥ 6 | ≥ 12 | ≥ 24 |
| 8Å | 36.6% (11.8%) | 27.0% (11.0%) | 15.7% (9.3%) |
| 12Å | 85.5% (5.5%) | 55.6% (4.6%) | 43.8% (4.9%) |
Top protein length/2 contacts classification performance as: precision%(recall%)
Figure 4Expected performances of 3Distill. 3D reconstruction performances measured as average TM scores against the correct structure. Tested on the S258 set (see text for details). Maximum sequence similarity allowed to homologues in the PDB: 95%.