Evaluating, comparing, and interpreting protein domain hierarchies.

Andrew F Neuwald.

Abstract

Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in "contrast alignments," and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies.

Year:  2014        PMID: 24559108      PMCID: PMC3962652          DOI: 10.1089/cmb.2013.0098

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  A Bayesian sampler for optimization of protein domain hierarchies.

Authors:  Andrew F Neuwald
Journal:  J Comput Biol       Date:  2014-02-04       Impact factor: 1.479

2.  Tracing the origin and evolution of pseudokinases across the tree of life.

Authors:  Annie Kwon; Steven Scott; Rahil Taujale; Wayland Yeung; Krys J Kochut; Patrick A Eyers; Natarajan Kannan
Journal:  Sci Signal       Date:  2019-04-23       Impact factor: 8.192

3.  Hydrophobic Core Variations Provide a Structural Framework for Tyrosine Kinase Evolution and Functional Specialization.

Authors:  Smita Mohanty; Krishnadev Oruganty; Annie Kwon; Dominic P Byrne; Samantha Ferries; Zheng Ruan; Laura E Hanold; Samiksha Katiyar; Eileen J Kennedy; Patrick A Eyers; Natarajan Kannan
Journal:  PLoS Genet       Date:  2016-02-29       Impact factor: 5.917

4.  Inference of Functionally-Relevant N-acetyltransferase Residues Based on Statistical Correlations.

Authors:  Andrew F Neuwald; Stephen F Altschul
Journal:  PLoS Comput Biol       Date:  2016-12-21       Impact factor: 4.475

5.  Lipid-targeting pleckstrin homology domain turns its autoinhibitory face toward the TEC kinases.

Authors:  Neha Amatya; Thomas E Wales; Annie Kwon; Wayland Yeung; Raji E Joseph; D Bruce Fulton; Natarajan Kannan; John R Engen; Amy H Andreotti
Journal:  Proc Natl Acad Sci U S A       Date:  2019-10-07       Impact factor: 11.205
