Literature DB >> 24988248

Protein domain hierarchy Gibbs sampling strategies.

Andrew F Neuwald.   

Abstract

Hierarchically-arranged multiple sequence alignment profiles are useful for modeling protein domains that have functionally diverged into evolutionarily-related subgroups. Currently such alignment hierarchies are largely constructed through manual curation, as for the NCBI Conserved Domain Database (CDD). Recently, however, I developed a Gibbs sampler that uses an approach termed statistical evolutionary dynamics analysis to accomplish this task automatically while, at the same time, identifying sequence determinants of protein function. Here I describe the statistical model and sampling strategies underlying this sampler. When implemented and applied to simulated protein sequences (which conform to the underlying statistical model precisely), these sampling strategies efficiently converge on the hierarchy used to generate the sequences. However, for real protein sequences the sampler finds alternative, nearly-optimal hierarchies for many domains, indicating a significant degree of ambiguity. I illustrate how both the nature of such ambiguities and the most robust ("consensus") features of a hierarchy may be determined from an ensemble of independently generated hierarchies for the same domain. Such consensus hierarchies can provide reliably stable models of protein domain functional divergence.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24988248     DOI: 10.1515/sagmb-2014-0008

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  8 in total

1.  Episodic evolution of coadapted sets of amino acid sites in mitochondrial proteins.

Authors:  Alexey D Neverov; Anfisa V Popova; Gennady G Fedonin; Evgeny A Cheremukhin; Galya V Klink; Georgii A Bazykin
Journal:  PLoS Genet       Date:  2021-01-25       Impact factor: 5.917

2.  SPARC: Structural properties associated with residue constraints.

Authors:  Andrew F Neuwald; Hui Yang; B Tracy Nixon
Journal:  Comput Struct Biotechnol J       Date:  2022-04-07       Impact factor: 6.155

3.  Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations.

Authors:  Andrew F Neuwald; Stephen F Altschul
Journal:  PLoS Comput Biol       Date:  2016-12-21       Impact factor: 4.475

4.  Deep Analysis of Residue Constraints (DARC): identifying determinants of protein functional specificity.

Authors:  Farzaneh Tondnevis; Elizabeth E Dudenhausen; Andrew M Miller; Robert McKenna; Stephen F Altschul; Linda B Bloom; Andrew F Neuwald
Journal:  Sci Rep       Date:  2020-02-03       Impact factor: 4.379

5.  Bayesian Top-Down Protein Sequence Alignment with Inferred Position-Specific Gap Penalties.

Authors:  Andrew F Neuwald; Stephen F Altschul
Journal:  PLoS Comput Biol       Date:  2016-05-18       Impact factor: 4.475

6.  Inferring joint sequence-structural determinants of protein functional specificity.

Authors:  Andrew F Neuwald; L Aravind; Stephen F Altschul
Journal:  Elife       Date:  2018-01-16       Impact factor: 8.140

7.  A survey of TIR domain sequence and structure divergence.

Authors:  Vladimir Y Toshchakov; Andrew F Neuwald
Journal:  Immunogenetics       Date:  2020-01-30       Impact factor: 2.846

8.  Identifying Function Determining Residues in Neuroimmune Semaphorin 4A.

Authors:  Svetlana P Chapoval; Mariah Lee; Aaron Lemmer; Oluwaseyi Ajayi; Xiulan Qi; Andrew F Neuwald; Achsah D Keegan
Journal:  Int J Mol Sci       Date:  2022-03-11       Impact factor: 6.208

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.