| Literature DB >> 29856860 |
Juan A Morales-Cordovilla1, Victoria Sanchez1, Martin Ratajczak2.
Abstract
The query-template alignment of proteins is one of the most critical steps of template-based modeling methods used to predict the 3D structure of a query protein. This alignment can be interpreted as a temporal classification or structured prediction task and first order Conditional Random Fields have been proposed for protein alignment and proven to be rather successful. Some other popular structured prediction problems, such as speech or image classification, have gained from the use of higher order Conditional Random Fields due to the well known higher order correlations that exist between their labels and features. In this paper, we propose and describe the use of higher order Conditional Random Fields for query-template protein alignment. The experiments carried out on different public datasets validate our proposal, especially on distantly-related protein pairs which are the most difficult to align.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29856860 PMCID: PMC5983487 DOI: 10.1371/journal.pone.0197912
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Alignment matrices and scoring functions Ψ of a 1st and 2nd-order CRF alignment for the label transitions I → M and M → I → M, respectively, at the alignment matrix position (3, 2).
Ψ depends on log-factors outputs, Neural Networks (NNs) in our case, which in turn depend on the query and template feature vectors qt.
Fig 2Loss function and alignment accuracies of the 1st and 2nd-order CRFs on the training set at every epoch.
Reference dependent alignment accuracy (%) for different methods on the test sets.
| Method | POSLINDAHL (Fa., Su., Fo.) | SALIGN | PROSUP |
|---|---|---|---|
| 2nd-CRF-Align | 67.60 | 64.68 | |
| CNFPred | 50.36 ( | ||
| 1st-CRF-Align | 47.11 (68, 43, 31) | 63.42 | 59.35 |
| HHAlign (loc) | 44.70 (72, 46, 17) | 68.58 | 64.26 |
| HHAlign (glob) | 38.26 (70, 37, 7) | 67.15 | 61.19 |
| CONTRAlign | 42.54 (64, 38, 26) | 56.37 | 54.37 |
| NWAlign | 37.34 (56, 32, 24) | 46.48 | 45.96 |
Reference independent alignment accuracy for different methods on the test sets (measured by cumulative TM-Score of the query 3D model).
| Method | POSLINDAHL (Fa., Su., Fo.) | SALIGN | PROSUP |
|---|---|---|---|
| 2nd-CRF-Align | 129 | 72.2 | |
| CNFPred | 1012 (964, | ||
| 1st-CRF-Align | 962 (933, 841, 1113) | 123 | 66.3 |
| HHAlign (loc) | 885 (948, 865, 841) | 129 | 69.2 |
| HHAlign (glob) | 780 (930, 775, 634) | 124 | 65.3 |
| CONTRAlign | 894 (901, 772, 1009) | 111 | 62.5 |
| NWAlign | 875 (857, 741, 1027) | 103 | 59.2 |
| DeepAlign (oracle) | 1052 (1078, 1216, 1774) | 153 | 86.8 |