Literature DB >> 19773335

HHsvm: fast and accurate classification of profile-profile matches identified by HHsearch.

Mensur Dlakić1.   

Abstract

MOTIVATION: Recently developed profile-profile methods rival structural comparisons in their ability to detect homology between distantly related proteins. Despite this tremendous progress, many genuine relationships between protein families cannot be recognized as comparisons of their profiles result in scores that are statistically insignificant.
RESULTS: Using known evolutionary relationships among protein superfamilies in SCOP database, support vector machines were trained on four sets of discriminatory features derived from the output of HHsearch. Upon validation, it was shown that the automatic classification of all profile-profile matches was superior to fixed threshold-based annotation in terms of sensitivity and specificity. The effectiveness of this approach was demonstrated by annotating several domains of unknown function from the Pfam database. AVAILABILITY: Programs and scripts implementing the methods described in this manuscript are freely available from http://hhsvm.dlakiclab.org/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Substances:

Year:  2009        PMID: 19773335      PMCID: PMC2913662          DOI: 10.1093/bioinformatics/btp555

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  27 in total

1.  SVM-HUSTLE--an iterative semi-supervised machine learning approach for pairwise protein remote homology detection.

Authors:  Anuj R Shah; Christopher S Oehmen; Bobbie-Jo Webb-Robertson
Journal:  Bioinformatics       Date:  2008-02-01       Impact factor: 6.937

2.  Methods of remote homology detection can be combined to increase coverage by 10% in the midnight zone.

Authors:  Adam James Reid; Corin Yeats; Christine Anne Orengo
Journal:  Bioinformatics       Date:  2007-08-20       Impact factor: 6.937

3.  Prediction of protein functional residues from sequence by probability density estimation.

Authors:  J D Fischer; C E Mayer; J Söding
Journal:  Bioinformatics       Date:  2008-01-02       Impact factor: 6.937

4.  Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods.

Authors:  J Park; K Karplus; C Barrett; R Hughey; D Haussler; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1998-12-11       Impact factor: 5.469

5.  Sequence logos: a new way to display consensus sequences.

Authors:  T D Schneider; R M Stephens
Journal:  Nucleic Acids Res       Date:  1990-10-25       Impact factor: 16.971

6.  Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships.

Authors:  S E Brenner; C Chothia; T J Hubbard
Journal:  Proc Natl Acad Sci U S A       Date:  1998-05-26       Impact factor: 11.205

Review 7.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

8.  SURVEY AND SUMMARY: holliday junction resolvases and related nucleases: identification of new families, phyletic distribution and evolutionary trajectories.

Authors:  L Aravind; K S Makarova; E V Koonin
Journal:  Nucleic Acids Res       Date:  2000-09-15       Impact factor: 16.971

9.  Data growth and its impact on the SCOP database: new developments.

Authors:  Antonina Andreeva; Dave Howorth; John-Marc Chandonia; Steven E Brenner; Tim J P Hubbard; Cyrus Chothia; Alexey G Murzin
Journal:  Nucleic Acids Res       Date:  2007-11-13       Impact factor: 16.971

10.  The Pfam protein families database.

Authors:  Robert D Finn; John Tate; Jaina Mistry; Penny C Coggill; Stephen John Sammut; Hans-Rudolf Hotz; Goran Ceric; Kristoffer Forslund; Sean R Eddy; Erik L L Sonnhammer; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2007-11-26       Impact factor: 16.971

View more
  6 in total

1.  Structural studies of E73 from a hyperthermophilic archaeal virus identify the "RH3" domain, an elaborated ribbon-helix-helix motif involved in DNA recognition.

Authors:  Casey Schlenker; Anupam Goel; Brian P Tripet; Smita Menon; Taylor Willi; Mensur Dlakić; Mark J Young; C Martin Lawrence; Valérie Copié
Journal:  Biochemistry       Date:  2012-03-22       Impact factor: 3.162

2.  Proteomic analysis of Sulfolobus solfataricus during Sulfolobus Turreted Icosahedral Virus infection.

Authors:  Walid S Maaty; Kyla Selvig; Stephanie Ryder; Pavel Tarlykov; Jonathan K Hilmer; Joshua Heinemann; Joseph Steffens; Jamie C Snyder; Alice C Ortmann; Navid Movahed; Kevin Spicka; Lakshindra Chetia; Paul A Grieco; Edward A Dratz; Trevor Douglas; Mark J Young; Brian Bothner
Journal:  J Proteome Res       Date:  2012-01-24       Impact factor: 4.466

3.  Identification of new homologs of PD-(D/E)XK nucleases by support vector machines trained on data derived from profile-profile alignments.

Authors:  Mindaugas Laganeckas; Mindaugas Margelevicius; Ceslovas Venclovas
Journal:  Nucleic Acids Res       Date:  2010-10-20       Impact factor: 16.971

4.  Identification of divergent protein domains by combining HMM-HMM comparisons and co-occurrence detection.

Authors:  Amel Ghouila; Isabelle Florent; Fatma Zahra Guerfali; Nicolas Terrapon; Dhafer Laouini; Sadok Ben Yahia; Olivier Gascuel; Laurent Bréhélin
Journal:  PLoS One       Date:  2014-06-05       Impact factor: 3.240

5.  An ancient evolutionary connection between Ribonuclease A and EndoU families.

Authors:  Arcady Mushegian; Irina Sorokina; Alexey Eroshkin; Mensur Dlakić
Journal:  RNA       Date:  2020-04-13       Impact factor: 4.942

6.  Cyclic Tetra-Adenylate (cA4) Recognition by Csa3; Implications for an Integrated Class 1 CRISPR-Cas Immune Response in Saccharolobus solfataricus.

Authors:  Alexander A Charbonneau; Debra M Eckert; Colin C Gauvin; Nathanael G Lintner; C Martin Lawrence
Journal:  Biomolecules       Date:  2021-12-09
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.