
COMTOP: Protein Residue-Residue Contact Prediction through Mixed Integer Linear Optimization.

Md Selim Reza1,2, Huiling Zhang1,2, Md Tofazzal Hossain1,2, Langxi Jin3, Shengzhong Feng2, Yanjie Wei1,2.   

Abstract

Protein contact prediction helps reconstruct the tertiary structure that largely determines a protein's function; contact prediction from sequence is therefore an important problem. Despite exciting recent progress, the prediction accuracy of many existing methods remains low. In this paper, we present a new mixed integer linear programming (MILP)-based consensus method: a Consensus scheme based On a Mixed integer linear opTimization method for prOtein contact Prediction (COMTOP). The MILP-based consensus method combines the strengths of seven selected protein contact prediction methods (CCMpred, EVfold, DeepCov, NNcon, PconsC4, plmDCA, and PSICOV) by optimizing the number of correctly predicted contacts, thereby achieving better prediction accuracy. The proposed hybrid protein residue-residue contact prediction scheme was tested on four independent test sets. For 239 highly non-redundant proteins, the method showed prediction accuracies of 59.68%, 70.79%, 78.86%, 89.04%, 94.51%, and 97.35% for top-5L, top-3L, top-2L, top-L, top-L/2, and top-L/5 contacts, respectively. On the CASP13 and CASP14 test sets, the proposed method obtained accuracies of 75.91% and 77.49% for top-L/5 predictions, respectively. COMTOP was further tested on 57 non-redundant α-helical transmembrane proteins and achieved prediction accuracies of 64.34% and 73.91% for top-L/2 and top-L/5 predictions, respectively. For all test datasets, the accuracy improvement of COMTOP over the seven individual methods grew as the number of predicted contacts increased: the improvement was much larger for large numbers of predicted contacts (such as top-5L and top-3L) than for small numbers (such as top-L/2 and top-L/5). The results and analysis demonstrate that COMTOP can significantly improve on the performance of the individual methods and is more robust across different types of test sets. COMTOP also showed better or comparable predictions when compared with state-of-the-art predictors.
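
The top-kL accuracies quoted in the abstract (top-5L down to top-L/5, where L is the sequence length) can be illustrated with a minimal Python sketch. It is not the authors' evaluation code: the function name `top_k_precision`, the score dictionary layout, and the minimum sequence separation of 6 residues are all assumptions made here for illustration, since the abstract does not specify them.

```python
from typing import Dict, Set, Tuple

Pair = Tuple[int, int]  # residue index pair (i, j) with i < j


def top_k_precision(
    scores: Dict[Pair, float],
    true_contacts: Set[Pair],
    seq_len: int,
    factor: float,
    min_sep: int = 6,  # assumed cutoff; not stated in the abstract
) -> float:
    """Fraction of the top (factor * L) scored pairs that are true contacts.

    factor = 5 corresponds to top-5L, factor = 0.2 to top-L/5, and so on.
    """
    n_top = max(1, int(seq_len * factor))
    # Keep only pairs separated by at least min_sep positions in sequence.
    candidates = [
        (pair, score)
        for pair, score in scores.items()
        if abs(pair[1] - pair[0]) >= min_sep
    ]
    # Rank by predicted score, highest first, and keep the top n_top pairs.
    ranked = sorted(candidates, key=lambda item: item[1], reverse=True)[:n_top]
    if not ranked:
        return 0.0
    hits = sum(1 for pair, _ in ranked if pair in true_contacts)
    return hits / len(ranked)


# Toy data for a hypothetical protein of length 10.
toy_scores = {
    (0, 8): 0.9, (1, 9): 0.8, (2, 9): 0.7, (0, 7): 0.6,
    (3, 9): 0.5, (0, 6): 0.4, (0, 1): 0.99,  # (0, 1) is too close in sequence
}
toy_true = {(0, 8), (2, 9), (0, 6)}

print(top_k_precision(toy_scores, toy_true, seq_len=10, factor=0.5))  # top-L/2
print(top_k_precision(toy_scores, toy_true, seq_len=10, factor=0.2))  # top-L/5
```

The sketch divides by the number of pairs actually ranked rather than by the nominal cutoff, so a predictor that returns fewer than `factor * L` eligible pairs is not penalized for the shortfall; an evaluation that divides by the nominal cutoff would give lower values in that case.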

Keywords:  contact prediction; machine learning; mixed integer linear programming; protein residue–residue contact; protein sequence

Year:  2021        PMID: 34209399     DOI: 10.3390/membranes11070503

Source DB:  PubMed          Journal:  Membranes (Basel)        ISSN: 2077-0375


