Literature DB >> 15367730

Comparative homology agreement search: an effective combination of homology-search methods.

Intikhab Alam1, Andreas Dress, Marc Rehmsmeier, Georg Fuellen.   

Abstract

Many methods have been developed to search for homologous members of a protein family in databases, and the reliability of results and conclusions may be compromised if only one method is used, neglecting the others. Here we introduce a general scheme for combining such methods. Based on this scheme, we implemented a tool called comparative homology agreement search (chase) that integrates different search strategies to obtain a combined "E value." Our results show that a consensus method integrating distinct strategies easily outperforms any of its component algorithms. More specifically, an evaluation based on the Structural Classification of Proteins database reveals that, on average, a coverage of 47% can be obtained in searches for distantly related homologues (i.e., members of the same superfamily but not the same family, which is a very difficult task), accepting only 10 false positives, whereas the individual methods obtain a coverage of 28-38%.

Mesh:

Substances:

Year:  2004        PMID: 15367730      PMCID: PMC518839          DOI: 10.1073/pnas.0405612101

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  23 in total

1.  A comparative analysis of computational motif-detection methods.

Authors:  J Hudak; M A Mcclure
Journal:  Pac Symp Biocomput       Date:  1999

2.  The InterPro database, an integrated documentation resource for protein families, domains and functional sites.

Authors:  R Apweiler; T K Attwood; A Bairoch; A Bateman; E Birney; M Biswas; P Bucher; L Cerutti; F Corpet; M D Croning; R Durbin; L Falquet; W Fleischmann; J Gouzy; H Hermjakob; N Hulo; I Jonassen; D Kahn; A Kanapin; Y Karavidopoulou; R Lopez; B Marx; N J Mulder; T M Oinn; M Pagni; F Servant; C J Sigrist; E M Zdobnov
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

3.  The MetaFam Server: a comprehensive protein family resource.

Authors:  K A Silverstein; E Shoop; J E Johnson; A Kilian; J L Freeman; T M Kunau; I A Awad; M Mayer; E F Retzel
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

4.  Phylogenetic information improves homology detection.

Authors:  M Rehmsmeier; M Vingron
Journal:  Proteins       Date:  2001-12-01

5.  The PROSITE database, its status in 2002.

Authors:  Laurent Falquet; Marco Pagni; Philipp Bucher; Nicolas Hulo; Christian J A Sigrist; Kay Hofmann; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

6.  Pcons: a neural-network-based consensus predictor that improves fold recognition.

Authors:  J Lundström; L Rychlewski; J Bujnicki; A Elofsson
Journal:  Protein Sci       Date:  2001-11       Impact factor: 6.725

7.  Phase4: automatic evaluation of database search methods.

Authors:  Marc Rehmsmeier
Journal:  Brief Bioinform       Date:  2002-12       Impact factor: 11.622

8.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

9.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

10.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

View more
  7 in total

1.  Comparative genome analysis across a kingdom of eukaryotic organisms: specialization and diversification in the fungi.

Authors:  Michael J Cornell; Intikhab Alam; Darren M Soanes; Han Min Wong; Cornelia Hedeler; Norman W Paton; Magnus Rattray; Simon J Hubbard; Nicholas J Talbot; Stephen G Oliver
Journal:  Genome Res       Date:  2007-11-05       Impact factor: 9.043

Review 2.  Homology and phylogeny and their automated inference.

Authors:  Georg Fuellen
Journal:  Naturwissenschaften       Date:  2008-02-21

3.  Learning biomarkers of pluripotent stem cells in mouse.

Authors:  Lena Scheubert; Rainer Schmidt; Dirk Repsilber; Mitja Lustrek; Georg Fuellen
Journal:  DNA Res       Date:  2011-07-26       Impact factor: 4.458

4.  Biodefense Oriented Genomic-Based Pathogen Classification Systems: Challenges and Opportunities.

Authors:  Willy A Valdivia-Granda
Journal:  J Bioterror Biodef       Date:  2012-03-16

5.  Improved performance of sequence search approaches in remote homology detection.

Authors:  Adwait Govind Joshi; Upadhyayula Surya Raghavender; Ramanathan Sowdhamini
Journal:  F1000Res       Date:  2013-03-22

6.  Analysis of triglyceride synthesis unveils a green algal soluble diacylglycerol acyltransferase and provides clues to potential enzymatic components of the chloroplast pathway.

Authors:  Carolina Bagnato; María B Prados; Gisela R Franchini; Natalia Scaglia; Silvia E Miranda; María V Beligni
Journal:  BMC Genomics       Date:  2017-03-09       Impact factor: 3.969

7.  TransportTP: a two-phase classification approach for membrane transporter prediction and characterization.

Authors:  Haiquan Li; Vagner A Benedito; Michael K Udvardi; Patrick Xuechun Zhao
Journal:  BMC Bioinformatics       Date:  2009-12-14       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.