Literature DB >> 32976564

Fold recognition by scoring protein maps using the congruence coefficient.

Pietro Di Lena1, Pierre Baldi2,3.   

Abstract

MOTIVATION: Protein fold recognition is a key step for template-based modeling approaches to protein structure prediction. Although closely related folds can be easily identified by sequence homology search in sequence databases, fold recognition is notoriously more difficult when it involves the identification of distantly related homologs. Recent progress in residue-residue contact and distance prediction opens up the possibility of improving fold recognition by using structural information contained in predicted distance and contact maps.
RESULTS: Here we propose to use the congruence coefficient as a metric of similarity between maps. We prove that this metric has several interesting mathematical properties which allow one to compute in polynomial time its exact mean and variance over all possible (exponentially many) alignments between two symmetric matrices, and assess the statistical significance of similarity between aligned maps. We perform fold recognition tests by recovering predicted target contact/distance maps from the two most recent Critical Assessment of Structure Prediction editions and over 27 000 non-homologous structural templates from the ECOD database. On this large benchmark, we compare fold recognition performances of different alignment tools with their own similarity scores against those obtained using the congruence coefficient. We show that the congruence coefficient overall improves fold recognition over other methods, proving its effectiveness as a general similarity metric for protein map comparison.
AVAILABILITY AND IMPLEMENTATION: The congruence coefficient software CCpro is available as part of the SCRATCH suite at: http://scratch.proteomics.ics.uci.edu/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 32976564      PMCID: PMC8088323          DOI: 10.1093/bioinformatics/btaa833

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  Fast overlapping of protein contact maps by alignment of eigenvectors.

Authors:  Pietro Di Lena; Piero Fariselli; Luciano Margara; Marco Vassura; Rita Casadio
Journal:  Bioinformatics       Date:  2010-07-07       Impact factor: 6.937

2.  Deep-learning contact-map guided protein structure prediction in CASP13.

Authors:  Wei Zheng; Yang Li; Chengxin Zhang; Robin Pearce; S M Mortuza; Yang Zhang
Journal:  Proteins       Date:  2019-08-14

3.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path.

Authors:  I N Shindyalov; P E Bourne
Journal:  Protein Eng       Date:  1998-09

4.  Deep architectures for protein contact map prediction.

Authors:  Pietro Di Lena; Ken Nagata; Pierre Baldi
Journal:  Bioinformatics       Date:  2012-07-30       Impact factor: 6.937

Review 5.  Advances in protein structure prediction and design.

Authors:  Brian Kuhlman; Philip Bradley
Journal:  Nat Rev Mol Cell Biol       Date:  2019-08-15       Impact factor: 94.444

6.  Protein structure determination using metagenome sequence data.

Authors:  Sergey Ovchinnikov; Hahnbeom Park; Neha Varghese; Po-Ssu Huang; Georgios A Pavlopoulos; David E Kim; Hetunandan Kamisetty; Nikos C Kyrpides; David Baker
Journal:  Science       Date:  2017-01-20       Impact factor: 47.728

7.  Accelerated Profile HMM Searches.

Authors:  Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2011-10-20       Impact factor: 4.475

8.  MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.

Authors:  David T Jones; Tanya Singh; Tomasz Kosciolek; Stuart Tetchner
Journal:  Bioinformatics       Date:  2014-11-26       Impact factor: 6.937

9.  Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.

Authors:  Jie Hou; Tianqi Wu; Renzhi Cao; Jianlin Cheng
Journal:  Proteins       Date:  2019-04-25

10.  HH-suite3 for fast remote homology detection and deep protein annotation.

Authors:  Martin Steinegger; Markus Meier; Milot Mirdita; Harald Vöhringer; Stephan J Haunsberger; Johannes Söding
Journal:  BMC Bioinformatics       Date:  2019-09-14       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.