Literature DB >> 10725404

Separation of phylogenetic and functional associations in biological sequences by using the parametric bootstrap.

K R Wollenberg1, W R Atchley.   

Abstract

Quantitative analyses of biological sequences generally proceed under the assumption that individual DNA or protein sequence elements vary independently. However, this assumption is not biologically realistic because sequence elements often vary in a concerted manner resulting from common ancestry and structural or functional constraints. We calculated intersite associations among aligned protein sequences by using mutual information. To discriminate associations resulting from common ancestry from those resulting from structural or functional constraints, we used a parametric bootstrap algorithm to construct replicate data sets. These data are expected to have intersite associations resulting solely from phylogeny. By comparing the distribution of our association statistic for the replicate data against that calculated for empirical data, we were able to assign a probability that two sites covaried resulting from structural or functional constraint rather than phylogeny. We tested our method by using an alignment of 237 basic helix-loop-helix (bHLH) protein domains. Comparison of our results against a solved three-dimensional structure confirmed the identification of several sites important to function and structure of the bHLH domain. This analytical procedure has broad utility as a first step in the identification of sites that are important to biological macromolecular structure and function when a solved structure is unavailable.

Entities:  

Mesh:

Year:  2000        PMID: 10725404      PMCID: PMC16231          DOI: 10.1073/pnas.97.7.3288

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  19 in total

1.  The rapid generation of mutation data matrices from protein sequences.

Authors:  D T Jones; W R Taylor; J M Thornton
Journal:  Comput Appl Biosci       Date:  1992-06

2.  Statistical tests of models of DNA substitution.

Authors:  N Goldman
Journal:  J Mol Evol       Date:  1993-02       Impact factor: 2.395

3.  Constructing amino acid residue substitution classes maximally indicative of local protein structure.

Authors:  M J Thompson; R A Goldstein
Journal:  Proteins       Date:  1996-05

4.  Modeling residue usage in aligned protein sequences via maximum likelihood.

Authors:  W J Bruno
Journal:  Mol Biol Evol       Date:  1996-12       Impact factor: 16.240

5.  Correlated mutations and residue contacts in proteins.

Authors:  U Göbel; C Sander; R Schneider; A Valencia
Journal:  Proteins       Date:  1994-04

6.  Compensating changes in protein multiple sequence alignments.

Authors:  W R Taylor; K Hatrick
Journal:  Protein Eng       Date:  1994-03

7.  Crystal structure of transcription factor E47: E-box recognition by a basic region helix-loop-helix dimer.

Authors:  T Ellenberger; D Fass; M Arnaud; S C Harrison
Journal:  Genes Dev       Date:  1994-04-15       Impact factor: 11.361

8.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

9.  Crystal structure of MyoD bHLH domain-DNA complex: perspectives on DNA recognition and implications for transcriptional activation.

Authors:  P C Ma; M A Rould; H Weintraub; C O Pabo
Journal:  Cell       Date:  1994-05-06       Impact factor: 41.582

10.  Structure and function of the b/HLH/Z domain of USF.

Authors:  A R Ferré-D'Amaré; P Pognonec; R G Roeder; S K Burley
Journal:  EMBO J       Date:  1994-01-01       Impact factor: 11.598

View more
  45 in total

1.  Amino acids determining enzyme-substrate specificity in prokaryotic and eukaryotic protein kinases.

Authors:  Lewyn Li; Eugene I Shakhnovich; Leonid A Mirny
Journal:  Proc Natl Acad Sci U S A       Date:  2003-04-04       Impact factor: 11.205

2.  Finding important sites in protein sequences.

Authors:  Peter J Bickel; Katherina J Kechris; Philip C Spector; Gary J Wedemayer; Alexander N Glazer
Journal:  Proc Natl Acad Sci U S A       Date:  2002-11-04       Impact factor: 11.205

3.  Inferring functional constraints and divergence in protein families using 3D mapping of phylogenetic information.

Authors:  Christian Blouin; Yan Boucher; Andrew J Roger
Journal:  Nucleic Acids Res       Date:  2003-01-15       Impact factor: 16.971

4.  Phylogenetic analysis and classification of the fungal bHLH domain.

Authors:  Joshua K Sailsbery; William R Atchley; Ralph A Dean
Journal:  Mol Biol Evol       Date:  2011-11-22       Impact factor: 16.240

5.  Direct-coupling analysis of residue coevolution captures native contacts across many protein families.

Authors:  Faruck Morcos; Andrea Pagnani; Bryan Lunt; Arianna Bertolino; Debora S Marks; Chris Sander; Riccardo Zecchina; José N Onuchic; Terence Hwa; Martin Weigt
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-21       Impact factor: 11.205

6.  Context dependence and coevolution among amino acid residues in proteins.

Authors:  Zhengyuan O Wang; David D Pollock
Journal:  Methods Enzymol       Date:  2005       Impact factor: 1.600

7.  Solving the protein sequence metric problem.

Authors:  William R Atchley; Jieping Zhao; Andrew D Fernandes; Tanja Drüke
Journal:  Proc Natl Acad Sci U S A       Date:  2005-04-25       Impact factor: 11.205

8.  Coevolutionary patterns in cytochrome c oxidase subunit I depend on structural and functional context.

Authors:  Zhengyuan O Wang; David D Pollock
Journal:  J Mol Evol       Date:  2007-11       Impact factor: 2.395

9.  Correction for phylogeny, small number of observations and data redundancy improves the identification of coevolving amino acid pairs using mutual information.

Authors:  Cristina Marino Buslje; Javier Santos; Jose Maria Delfino; Morten Nielsen
Journal:  Bioinformatics       Date:  2009-03-10       Impact factor: 6.937

10.  Detecting coevolution without phylogenetic trees? Tree-ignorant metrics of coevolution perform as well as tree-aware metrics.

Authors:  J Gregory Caporaso; Sandra Smit; Brett C Easton; Lawrence Hunter; Gavin A Huttley; Rob Knight
Journal:  BMC Evol Biol       Date:  2008-12-03       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.