Literature DB >> 28771374

Initial Cluster Analysis.

Stephen F Altschul1, Andrew F Neuwald2.   

Abstract

We study a simple abstract problem motivated by a variety of applications in protein sequence analysis. Consider a string of 0s and 1s of length L, and containing D 1s. If we believe that some or all of the 1s may be clustered near the start of the sequence, which subset is the most significantly so clustered, and how significant is this clustering? We approach this question using the minimum description length principle and illustrate its application by analyzing residues that distinguish translational initiation and elongation factor guanosine triphosphatases (GTPases) from other P-loop GTPases. Within a structure of yeast elongation factor 1[Formula: see text], these residues form a significant cluster centered on a region implicated in guanine nucleotide exchange. Various biomedical questions may be cast as the abstract problem considered here.

Entities:  

Keywords:  Jeffreys' priors; Minimum Description Length principle; cluster analysis

Mesh:

Substances:

Year:  2017        PMID: 28771374      PMCID: PMC5806593          DOI: 10.1089/cmb.2017.0050

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  13 in total

1.  Classification and evolution of P-loop GTPases and related ATPases.

Authors:  Detlef D Leipe; Yuri I Wolf; Eugene V Koonin; L Aravind
Journal:  J Mol Biol       Date:  2002-03-15       Impact factor: 5.469

2.  Surveying the manifold divergence of an entire protein class for statistical clues to underlying biochemical mechanisms.

Authors:  Andrew F Neuwald
Journal:  Stat Appl Genet Mol Biol       Date:  2011-08-04

3.  An invariant form for the prior probability in estimation problems.

Authors:  H JEFFREYS
Journal:  Proc R Soc Lond A Math Phys Sci       Date:  1946

4.  Prediction of protein functional residues from sequence by probability density estimation.

Authors:  J D Fischer; C E Mayer; J Söding
Journal:  Bioinformatics       Date:  2008-01-02       Impact factor: 6.937

5.  A Bayesian sampler for optimization of protein domain hierarchies.

Authors:  Andrew F Neuwald
Journal:  J Comput Biol       Date:  2014-02-04       Impact factor: 1.479

6.  An evolutionary trace method defines binding surfaces common to protein families.

Authors:  O Lichtarge; H R Bourne; F E Cohen
Journal:  J Mol Biol       Date:  1996-03-29       Impact factor: 5.469

7.  Crystal structures of nucleotide exchange intermediates in the eEF1A-eEF1Balpha complex.

Authors:  G R Andersen; L Valente; L Pedersen; T G Kinzy; J Nyborg
Journal:  Nat Struct Biol       Date:  2001-06

8.  Characterizations of diverse residue clusters in protein three-dimensional structures.

Authors:  S Karlin; Z Y Zhu
Journal:  Proc Natl Acad Sci U S A       Date:  1996-08-06       Impact factor: 11.205

9.  CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure.

Authors:  Jan-Oliver Janda; Markus Busch; Fabian Kück; Mikhail Porfenenko; Rainer Merkl
Journal:  BMC Bioinformatics       Date:  2012-04-05       Impact factor: 3.169

10.  Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations.

Authors:  Andrew F Neuwald; Stephen F Altschul
Journal:  PLoS Comput Biol       Date:  2016-12-21       Impact factor: 4.475

View more
  4 in total

1.  SPARC: Structural properties associated with residue constraints.

Authors:  Andrew F Neuwald; Hui Yang; B Tracy Nixon
Journal:  Comput Struct Biotechnol J       Date:  2022-04-07       Impact factor: 6.155

2.  Inferring joint sequence-structural determinants of protein functional specificity.

Authors:  Andrew F Neuwald; L Aravind; Stephen F Altschul
Journal:  Elife       Date:  2018-01-16       Impact factor: 8.140

3.  Statistical investigations of protein residue direct couplings.

Authors:  Andrew F Neuwald; Stephen F Altschul
Journal:  PLoS Comput Biol       Date:  2018-12-31       Impact factor: 4.475

4.  Identifying Function Determining Residues in Neuroimmune Semaphorin 4A.

Authors:  Svetlana P Chapoval; Mariah Lee; Aaron Lemmer; Oluwaseyi Ajayi; Xiulan Qi; Andrew F Neuwald; Achsah D Keegan
Journal:  Int J Mol Sci       Date:  2022-03-11       Impact factor: 6.208

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.