Jianlin Cheng, Jilong Li, Zheng Wang, Jesse Eickholt, Xin Deng.
Abstract
BACKGROUND: As genome sequencing becomes routine in biomedical research, the total number of protein sequences is increasing exponentially, recently reaching over 108 million. However, only a tiny portion of these proteins (i.e. ~75,000 or < 0.07%) have tertiary structures solved by experimental techniques. The gap between protein sequence and structure continues to widen rapidly because the throughput of genome sequencing techniques is much higher than that of protein structure determination techniques. Computational software tools for predicting protein structure and structural features from protein sequences are crucial for making use of this vast repository of protein resources.
Year: 2012 PMID: 22545707 PMCID: PMC3495398 DOI: 10.1186/1471-2105-13-65
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. The organization of the MULTICOM toolbox.
The accuracy of secondary structure (SS) and relative solvent accessibility (SA) prediction on 119 CASP8 targets and 100 CASP9 targets
| | | | | |
|---|---|---|---|---|
| CASP8 | 83.30% | 77.50% | 77.73% | 75.94% |
| CASP9 | 80.78% | 74.56% | 76.60% | 74.20% |
Figure 2. The plot of sensitivity and specificity (y axis) against different probability thresholds for classifying residues as disordered on CASP8 targets.
Figure 3. The plot of sensitivity and specificity (y axis) against different probability thresholds for classifying residues as disordered on CASP9 targets.
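The threshold sweep behind the two disorder plots can be illustrated with a minimal sketch. It assumes each residue has a predicted disorder probability and a binary label (1 = disordered, 0 = ordered); the function name `sensitivity_specificity` and the toy data are hypothetical and are not taken from the MULTICOM tools or the CASP evaluations.

```python
from typing import List, Tuple


def sensitivity_specificity(
    probs: List[float], labels: List[int], threshold: float
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) when residues with
    probability >= threshold are classified as disordered.

    labels: 1 = disordered residue, 0 = ordered residue.
    """
    tp = fp = tn = fn = 0
    for p, y in zip(probs, labels):
        predicted_disordered = p >= threshold
        if predicted_disordered and y == 1:
            tp += 1  # disordered residue correctly flagged
        elif predicted_disordered and y == 0:
            fp += 1  # ordered residue wrongly flagged
        elif not predicted_disordered and y == 0:
            tn += 1  # ordered residue correctly left unflagged
        else:
            fn += 1  # disordered residue missed
    sens = tp / (tp + fn) if (tp + fn) else 0.0
    spec = tn / (tn + fp) if (tn + fp) else 0.0
    return sens, spec


# Hypothetical toy data, not taken from the CASP8 or CASP9 evaluations.
toy_probs = [0.9, 0.8, 0.4, 0.3, 0.2, 0.1]
toy_labels = [1, 1, 1, 0, 0, 0]

for t in (0.25, 0.5, 0.75):
    sens, spec = sensitivity_specificity(toy_probs, toy_labels, t)
    print(f"threshold={t:.2f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

Raising the threshold flags fewer residues as disordered, so sensitivity can only fall or stay level while specificity can only rise or stay level, which is the trade-off such plots display.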
Accuracy for NNcon and SVMcon contact predictions on all CASP9 targets
| | | | | | |
|---|---|---|---|---|---|
| SVMcon | .35 | .32 | .27 | .24 | .14 |
| NNcon | .36 | .31 | .21 | .18 | .11 |
The average GDT-TS and TM scores of top-one and best-of-five models of MULTICOM predictors on 107 CASP9 targets
| | | | | |
|---|---|---|---|---|
| MULTICOM (human) | 63.14 | 70.53 | 64.41 | 71.85 |
| MULTICOM (server) | 59.28 | 66.76 | 62.02 | 69.29 |
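For readers unfamiliar with the GDT-TS score reported above, the following is a simplified sketch of its standard definition: the average, over distance cutoffs of 1, 2, 4, and 8 Å, of the percentage of residues whose Cα atom lies within the cutoff of its native position. It assumes the per-residue distances come from a single fixed superposition, whereas the full measure searches for the best superposition at each cutoff. The function name and the toy distances are hypothetical and are not from the paper.

```python
from typing import Sequence


def gdt_ts_fixed_superposition(distances: Sequence[float]) -> float:
    """Simplified GDT-TS on a 0-100 scale.

    distances: per-residue C-alpha distances (in angstroms) between the
    model and the native structure after ONE fixed superposition.
    Assumption: the real score maximises each term over superpositions,
    so this sketch gives a lower bound on it.
    """
    n = len(distances)
    if n == 0:
        raise ValueError("at least one residue distance is required")
    cutoffs = (1.0, 2.0, 4.0, 8.0)
    # Fraction of residues within each cutoff.
    fractions = [sum(d <= c for d in distances) / n for c in cutoffs]
    return 100.0 * sum(fractions) / len(cutoffs)


# Hypothetical toy distances for a five-residue model.
toy_distances = [0.5, 1.5, 3.0, 6.0, 10.0]
print(f"GDT-TS (fixed superposition): {gdt_ts_fixed_superposition(toy_distances):.2f}")
```

A model whose residues all lie within 1 Å of the native positions scores 100 under this sketch, and one with every residue more than 8 Å away scores 0.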
Figure 4. Superimpositions of predicted models (blue) and native structures (orange) of four CASP9 targets. (A) T0520, TM-Score = 85, (B) T0527, TM-Score = 74, (C) T0634, TM-Score = 88, (D) T0641, TM-Score = 91.
Figure 5. The MULTICOM toolbox web site.
The availability and running environment of the MULTICOM tools
| | | | | | |
|---|---|---|---|---|---|
| PSpro2.0 | Yes | Yes | Yes | Linux, Browser | PDF, HTML |
| PreDisorder1.1 | Yes | Yes | Yes | Linux, Browser | PDF, HTML |
| DoBo | | | Yes | Browser | PDF, HTML |
| NNcon | Yes | | Yes | Linux, Browser | PDF, HTML |
| SVMcon | Yes | | Yes | Linux, Browser | PDF, HTML |
| DIpro2.0 | Yes | Yes | | Linux | PDF, HTML |
| BETApro1.0 | Yes | Yes | Yes | Linux, Browser | PDF, HTML |
| MULTICOM | | | Yes | Browser | PDF, HTML |
| APOLLO | Yes | Yes | Yes | Linux, Browser | PDF, HTML |
| MUpro1.0 | Yes | Yes | Yes | Linux, Browser | PDF, HTML |
| SeqRate | Yes | | Yes | Linux, Browser | PDF, HTML |
| MSACompro1.2.0 | Yes | | | Linux | PDF, HTML |
| HMMEditor | Yes | | Yes | Linux, Browser, Unix, Windows | PDF, HTML |