Literature DB >> 18274650

A study of residue correlation within protein sequences and its application to sequence classification.

Chris Hemmerich1, Sun Kim.   

Abstract

We investigate methods of estimating residue correlation within protein sequences. We begin by using mutual information (MI) of adjacent residues, and improve our methodology by defining the mutual information vector (MIV) to estimate long range correlations between nonadjacent residues. We also consider correlation based on residue hydropathy rather than protein-specific interactions. Finally, in experiments of family classification tests, the modeling power of MIV was shown to be significantly better than the classic MI method, reaching the level where proteins can be classified without alignment information.

Year:  2007        PMID: 18274650      PMCID: PMC3171355          DOI: 10.1155/2007/87356

Source DB:  PubMed          Journal:  EURASIP J Bioinform Syst Biol        ISSN: 1687-4145


  8 in total

1.  Positional dependence, cliques, and predictive motifs in the bHLH protein domain.

Authors:  W R Atchley; W Terhalle; A Dress
Journal:  J Mol Evol       Date:  1999-05       Impact factor: 2.395

2.  Species independence of mutual information in coding and noncoding DNA.

Authors:  I Grosse; H Herzel; S V Buldyrev; H E Stanley
Journal:  Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics       Date:  2000-05

3.  Information content of protein sequences.

Authors:  O Weiss; M A Jiménez-Montaño; H Herzel
Journal:  J Theor Biol       Date:  2000-10-07       Impact factor: 2.691

4.  Information-theoretic dissection of pairwise contact potentials.

Authors:  Melissa S Cline; Kevin Karplus; Richard H Lathrop; Temple F Smith; Robert G Rogers; David Haussler
Journal:  Proteins       Date:  2002-10-01

5.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

6.  Using information theory to search for co-evolving residues in proteins.

Authors:  L C Martin; G B Gloor; S D Dunn; L M Wahl
Journal:  Bioinformatics       Date:  2005-09-13       Impact factor: 6.937

7.  Correlations in protein sequences and property codes.

Authors:  O Weiss; H Herzel
Journal:  J Theor Biol       Date:  1998-02-21       Impact factor: 2.691

8.  The Pfam protein families database.

Authors:  Alex Bateman; Lachlan Coin; Richard Durbin; Robert D Finn; Volker Hollich; Sam Griffiths-Jones; Ajay Khanna; Mhairi Marshall; Simon Moxon; Erik L L Sonnhammer; David J Studholme; Corin Yeats; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

  8 in total
  2 in total

1.  Use of mutual information arrays to predict coevolving sites in the full length HIV gp120 protein for subtypes B and C.

Authors:  Bo Wei; Na Han; Hai-zhou Liu; Anthony Rayner; Simon Rayner
Journal:  Virol Sin       Date:  2011-04-07       Impact factor: 4.327

2.  Computing highly correlated positions using mutual information and graph theory for G protein-coupled receptors.

Authors:  Sarosh N Fatakia; Stefano Costanzi; Carson C Chow
Journal:  PLoS One       Date:  2009-03-05       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.