| Literature DB >> 31622328 |
Wei Zheng1,2, Qiqige Wuyun2,3, Yang Li1, S M Mortuza1, Chengxin Zhang1, Robin Pearce1, Jishou Ruan2,4, Yang Zhang1,5.
Abstract
Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural frameworks of other known proteins, remains the most accurate method for protein structure prediction. Due to the difficulty in recognizing distant-homology templates, however, the accuracy of TBM decreases rapidly when the evolutionary relationship between the query and template vanishes. In this study, we propose a new method, CEthreader, which first predicts residue-residue contacts by coupling evolutionary precision matrices with deep residual convolutional neural-networks. The predicted contact maps are then integrated with sequence profile alignments to recognize structural templates from the PDB. The method was tested on two independent benchmark sets consisting collectively of 1,153 non-homologous protein targets, where CEthreader detected 176% or 36% more correct templates with a TM-score >0.5 than the best state-of-the-art profile- or contact-based threading methods, respectively, for the Hard targets that lacked homologous templates. Moreover, CEthreader was able to identify 114% or 20% more correct templates with the same Fold as the query, after excluding structures from the same SCOPe Superfamily, than the best profile- or contact-based threading methods. Detailed analyses show that the major advantage of CEthreader lies in the efficient coupling of contact maps with profile alignments, which helps recognize global fold of protein structures when the homologous relationship between the query and template is weak. These results demonstrate an efficient new strategy to combine ab initio contact map prediction with profile alignments to significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31622328 PMCID: PMC6818797 DOI: 10.1371/journal.pcbi.1007411
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Performance of different threading methods on the 211 Hard targets from Benchmark Set-I.
P-values were calculated between the CEthreader alignment TM-scores and other methods’ TM-scores using one-sided Wilcoxon signed-rank tests. Coverage is equal to the number of aligned residues divided by the length of the query sequence. N represents the number of targets with an identified template whose TM-score was >0.5.
| Methods | TM-score | RMSD (Å) | Coverage | ||
|---|---|---|---|---|---|
| CEthreader | 0.453 | - | 9.53 | 0.875 | 80 |
| map_align | 0.414 | 4.39E-06 | 11.48 | 0.896 | 59 |
| EigenThreader | 0.413 | 1.31E-09 | 10.15 | 0.850 | 49 |
| HHsearch | 0.313 | 2.13E-24 | 10.92 | 0.654 | 29 |
| MUSTER | 0.304 | 5.02E-28 | 13.94 | 0.869 | 23 |
| PPA | 0.284 | 7.75E-30 | 14.78 | 0.832 | 19 |
| PROSPECT2 | 0.261 | 2.52E-35 | 16.63 | 0.915 | 10 |
| SAM-T99 | 0.208 | 1.04E-34 | 11.17 | 0.528 | 10 |
| FFAS | 0.189 | 1.25E-35 | 14.39 | 0.674 | 7 |
Threading results obtained by different methods for the 539 proteins from Benchmark Set-II.
P-values were calculated between the CEthreader alignment TM-scores and other methods’ TM-scores using one-sided Wilcoxon signed-rank tests; N represents the number of targets with a first identified template whose TM-score was >0.5; the success rate is equal to the fraction of targets whose Folds were correctly recognized, i.e., the first identified template had the same Fold as the query.
| Methods | TM-score | Success rate | |||
|---|---|---|---|---|---|
| All | CEthreader | 0.483 | - | 323 | 59.9% |
| map_align | 0.464 | 4.99E-05 | 261 | 48.4% | |
| EigenThreader | 0.450 | 1.31E-21 | 271 | 50.3% | |
| HHsearch | 0.301 | 2.84E-79 | 151 | 28.0% | |
| MUSTER | 0.285 | 6.57E-85 | 99 | 18.4% | |
| Easy | CEthreader | 0.493 | - | 228 | 60.2% |
| map_align | 0.478 | 5.62E-03 | 195 | 51.5% | |
| EigenThreader | 0.462 | 6.43E-16 | 199 | 52.5% | |
| HHsearch | 0.326 | 1.18E-55 | 119 | 31.4% | |
| MUSTER | 0.304 | 2.68E-61 | 80 | 21.1% | |
| Hard | CEthreader | 0.459 | - | 95 | 59.4% |
| map_align | 0.431 | 7.65E-04 | 66 | 41.3% | |
| EigenThreader | 0.422 | 2.26E-07 | 72 | 45.0% | |
| HHsearch | 0.242 | 1.65E-25 | 32 | 20.0% | |
| MUSTER | 0.241 | 8.37E-26 | 19 | 11.9% |