| Literature DB >> 27259540 |
Debswapna Bhattacharya1, Renzhi Cao1, Jianlin Cheng2.
Abstract
MOTIVATION: Recent experimental studies have suggested that proteins fold via stepwise assembly of structural units named 'foldons' through the process of sequential stabilization. Alongside, latest developments on computational side based on probabilistic modeling have shown promising direction to perform de novo protein conformational sampling from continuous space. However, existing computational approaches for de novo protein structure prediction often randomly sample protein conformational space as opposed to experimentally suggested stepwise sampling.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27259540 PMCID: PMC5018369 DOI: 10.1093/bioinformatics/btw316
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Parameterization and modeling of united-residue polypeptide conformational space. (a) United-residue polypeptide chain parameterized using virtual lengths and virtual angle pairs for backbone and side chain. (b) Conditional dependency graph of UniCon IOHMM. Circular nodes represent stochastic variables and arrows in the graph specify the conditional independence relationships among variables
Fig. 2.Visualization of stepwise sampling protocol. Foldon units are identified from the SSEs with a size restriction of at least 20 residues. Each foldon unit is then sequentially synthesized and assembled from N to C-terminus via probabilistic sampling conditioned on previously formed conformation
Average energy of 100 decoys and TM-scores of lowest-energy decoy for six small proteins
| Native | UniCon3D | B03D | |||
|---|---|---|---|---|---|
| Protein, PDB code | Energy | Energy | TM-score | Energy | TM-score |
| Protein A, 1FC2 | −459.03 | −419.99 | 0.66 | −403.61 | 0.63 |
| Homeodomain, 1ENH | −550.08 | −473.70 | 0.73 | −446.61 | 0.67 |
| Protein G, 2GB1 | −688.10 | −479.72 | 0.50 | −453.39 | 0.51 |
| Cro repressor, 2CRO | −794.04 | −454.45 | 0.51 | −432.18 | 0.44 |
| Protein L7/L12, 1CTF | −899.18 | −447.23 | 0.41 | −430.11 | 0.43 |
| Calbidin, 4ICB | −951.04 | −419.99 | 0.62 | −403.61 | 0.54 |
| Mean | −723.58 | −449.18 | 0.57 | −428.25 | 0.53 |
Comparison between UniCon3D and top-performing servers based on average TM-scores and Cα-rmsds of top-ranked model in 30 CASP11 FM domains
| Group name | TM-score ( | Cα-rmsd ( |
|---|---|---|
| QUARK | 0.29 ( | 14.83 ( |
| Zhang-Server | 0.28 ( | 15.39 ( |
| UniCon3D | 0.25 (–) | 15.82 (–) |
| RBO_Aleph | 0.25 ( | 16.67 ( |
| nns | 0.23 ( | 17.41 ( |
| BAKER-ROSETTASERVER | 0.23 ( | 17.71 ( |
* P-value of a one-sample t-test of TM-score difference to UniCon3D TM-scores.
** P-value of a one-sample t-test of Cα-RMSD difference to UniCon3D Cα-rmsds.
a The groups are sorted by descending TM-scores then by ascending Cα-RMSD.
b Not a participating group in CASP11 experiment.
Average TM-scores and Cα-rmsds of top-ranked decoys based on different MQAPs in 30 CASP11 FM domains*
| MQAP | TM-score | Cα-rmsd |
|---|---|---|
| UniCon3D | 0.25 | 15.82 |
| APOLLO | 0.24 | 15.88 |
| Qprob | 0.24 | 16.34 |
| ProQ2 | 0.23 | 16.28 |
| MUFOLD-CL | 0.23 | 16.63 |
* MQAPs are sorted by descending TM-scores then by ascending Cα-rmsd.
Comparison between UniCon3D and top-performing servers based on avarage TM-scores and Cα-rmsds of top-ranked model in 15 CASP10 FM domains
| Group name | TM-score ( | Cα-rmsd ( |
|---|---|---|
| Zhang-Server | 0.25 ( | 16.14 ( |
| QUARK | 0.25 ( | 16.42 ( |
| UniCon3D | 0.22 (–) | 17.45 (–) |
| PMS | 0.22 ( | 18.04 ( |
| MUFold_CRF | 0.21 ( | 18.70 ( |
| BAKER-ROSETTASERVER | 0.21 ( | 22.09 ( |
* P-value of a one-sample t-test of TM-score difference to UniCon3D TM-scores.
** P-value of a one-sample t-test of Cα-RMSD difference to UniCon3D Cα-rmsds.
a The groups are sorted by descending TM-scores then by ascending Cα-RMSD.
b Not a participating group in CASP10 experiment.
Comparison of average TM-scores and Cα-rmsds between UniCon3D, B03D, GDFuzz3D and FT-COMAR in CASP10 TBM targets. Best CASP10 results are also provided as a reference
| All 45 targets present in the dataset | 10 targets with reasonable contact maps | |||
|---|---|---|---|---|
| Method | TM-score | Cα-rmsd | TM-score | Cα-rmsd |
| UniCon3D | 0.45 | 10.73 | 0.54 | 5.66 |
| B03D | 0.42 | 11.86 | 0.50 | 7.26 |
| GDFuzz3D | 0.41 | 10.75 | 0.51 | 6.15 |
| FT-COMAR | 0.31 | 14.81 | 0.39 | 11.02 |
| Best CASP10 | 0.82 | 3.61 | 0.88 | 1.93 |
aPerformance of GDFuzz3D and FT-COMAR adopted from the published work of GDFuzz3D.
b Highest TM-score among all submitted server models during CASP10.