
PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments.

David T Jones, Daniel W A Buchan, Domenico Cozzetto, Massimiliano Pontil.

Abstract

MOTIVATION: The accurate prediction of residue-residue contacts, critical for maintaining the native fold of a protein, remains an open problem in the field of structural bioinformatics. Interest in this long-standing problem has increased recently with algorithmic improvements and the rapid growth in the sizes of sequence families. Progress could have major impacts in both structure and function prediction to name but two benefits. Sequence-based contact predictions are usually made by identifying correlated mutations within multiple sequence alignments (MSAs), most commonly through the information-theoretic approach of calculating mutual information between pairs of sites in proteins. These predictions are often inaccurate because the true covariation signal in the MSA is often masked by biases from many ancillary indirect-coupling or phylogenetic effects. Here we present a novel method, PSICOV, which introduces the use of sparse inverse covariance estimation to the problem of protein contact prediction. Our method builds on work which had previously demonstrated corrections for phylogenetic and entropic correlation noise and allows accurate discrimination of direct from indirectly coupled mutation correlations in the MSA.
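As an illustration of the mutual-information baseline mentioned above (not of PSICOV itself), the following minimal sketch computes the mutual information between two columns of a toy alignment. The function name `column_mutual_information`, the toy sequences, and the choice to treat gaps as an ordinary symbol are assumptions made for this example; it applies none of the phylogenetic or entropic corrections the abstract refers to.

```python
from collections import Counter
from math import log2


def column_mutual_information(msa, i, j):
    """Mutual information (in bits) between columns i and j of an alignment.

    `msa` is a list of equal-length aligned sequences. Every character,
    including a gap, is counted as an ordinary symbol (an assumption of
    this sketch, not something taken from the paper).
    """
    n = len(msa)
    pair_counts = Counter((seq[i], seq[j]) for seq in msa)
    i_counts = Counter(seq[i] for seq in msa)
    j_counts = Counter(seq[j] for seq in msa)

    mi = 0.0
    for (a, b), count in pair_counts.items():
        p_ab = count / n
        # p_ab / (p_a * p_b), written with raw counts
        ratio = p_ab * n * n / (i_counts[a] * j_counts[b])
        mi += p_ab * log2(ratio)
    return mi


# Hypothetical toy alignment: columns 0 and 1 covary perfectly,
# while columns 0 and 2 vary independently of each other.
toy_msa = ["ACD", "ACE", "GTD", "GTE"]
mi_01 = column_mutual_information(toy_msa, 0, 1)
mi_02 = column_mutual_information(toy_msa, 0, 2)
```

A high score between two columns does not by itself distinguish a direct contact from an indirect coupling, which is the limitation the abstract says sparse inverse covariance estimation is meant to address.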
RESULTS: PSICOV displays a mean precision substantially better than the best performing normalized mutual information approach and Bayesian networks. For 118 out of 150 targets, the L/5 (i.e. top-L/5 predictions for a protein of length L) precision for long-range contacts (sequence separation >23) was ≥ 0.5, which represents an improvement sufficient to be of significant benefit in protein structure prediction or model quality assessment.
AVAILABILITY: The PSICOV source code can be downloaded from http://bioinf.cs.ucl.ac.uk/downloads/PSICOV.
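The L/5 long-range precision quoted in the RESULTS can be sketched as follows. This is a hedged illustration, not the authors' evaluation script: the function name `top_l5_long_range_precision`, the dictionary input format, and the decision to divide by the number of predictions actually selected are all assumptions. The default `min_separation=24` encodes "sequence separation >23".

```python
def top_l5_long_range_precision(scores, true_contacts, length, min_separation=24):
    """Fraction of the top-L/5 long-range predictions that are true contacts.

    scores        -- dict mapping (i, j) residue index pairs, i < j, to a score
    true_contacts -- set of (i, j) pairs, i < j, that are in contact
    length        -- protein length L
    min_separation -- smallest |i - j| counted as long-range
                      (24 corresponds to a sequence separation > 23)
    """
    long_range = [
        (pair, score)
        for pair, score in scores.items()
        if abs(pair[1] - pair[0]) >= min_separation
    ]
    # Highest-scoring predictions first
    long_range.sort(key=lambda item: item[1], reverse=True)
    top = long_range[: max(1, length // 5)]
    if not top:
        return 0.0
    hits = sum(1 for pair, _ in top if pair in true_contacts)
    # Assumption: divide by the number of predictions actually selected
    return hits / len(top)


# Hypothetical toy case: L = 10, so the top 2 predictions are evaluated.
# min_separation is lowered to 3 only so that the toy protein has
# long-range pairs at all.
toy_scores = {(0, 9): 0.9, (1, 8): 0.8, (2, 7): 0.7, (3, 4): 0.99}
toy_contacts = {(0, 9), (2, 7), (3, 4)}
precision = top_l5_long_range_precision(
    toy_scores, toy_contacts, length=10, min_separation=3
)
```

In the toy case the short-range pair (3, 4) is excluded despite having the highest score, and one of the two remaining top predictions is a true contact.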

Year:  2011        PMID: 22101153     DOI: 10.1093/bioinformatics/btr638

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  321 in total

1.  Protein topology from predicted residue contacts.

Authors:  William R Taylor; David T Jones; Michael I Sadowski
Journal:  Protein Sci       Date:  2011-12-21       Impact factor: 6.725

2.  Amino acid coevolution induces an evolutionary Stokes shift.

Authors:  David D Pollock; Grant Thiltgen; Richard A Goldstein
Journal:  Proc Natl Acad Sci U S A       Date:  2012-04-30       Impact factor: 11.205

3.  Accurate de novo structure prediction of large transmembrane protein domains using fragment-assembly and correlated mutation analysis.

Authors:  Timothy Nugent; David T Jones
Journal:  Proc Natl Acad Sci U S A       Date:  2012-05-29       Impact factor: 11.205

4.  Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.

Authors:  Wenxuan Zhang; Jianyi Yang; Baoji He; Sara Elizabeth Walker; Hongjiu Zhang; Brandon Govindarajoo; Jouko Virtanen; Zhidong Xue; Hong-Bin Shen; Yang Zhang
Journal:  Proteins       Date:  2015-09-23

5.  From residue coevolution to protein conformational ensembles and functional dynamics.

Authors:  Ludovico Sutto; Simone Marsili; Alfonso Valencia; Francesco Luigi Gervasio
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-20       Impact factor: 11.205

6.  Protein contact prediction by integrating joint evolutionary coupling analysis and supervised learning.

Authors:  Jianzhu Ma; Sheng Wang; Zhiyong Wang; Jinbo Xu
Journal:  Bioinformatics       Date:  2015-08-14       Impact factor: 6.937

Review 7.  Constraint methods that accelerate free-energy simulations of biomolecules.

Authors:  Alberto Perez; Justin L MacCallum; Evangelos A Coutsias; Ken A Dill
Journal:  J Chem Phys       Date:  2015-12-28       Impact factor: 3.488

8.  Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins.

Authors:  Jing Yang; Bao-Ji He; Richard Jang; Yang Zhang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2015-08-07       Impact factor: 6.937

9.  Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.

Authors:  Susann Vorberg; Stefan Seemayer; Johannes Söding
Journal:  PLoS Comput Biol       Date:  2018-11-05       Impact factor: 4.475

10.  Predicting functionally informative mutations in Escherichia coli BamA using evolutionary covariance analysis.

Authors:  Robert S Dwyer; Dante P Ricci; Lucy J Colwell; Thomas J Silhavy; Ned S Wingreen
Journal:  Genetics       Date:  2013-08-09       Impact factor: 4.562
