R M MacCallum, L A Kelley, M J Sternberg. Biomolecular Modelling Laboratory, Imperial Cancer Research Fund, London, UK. R.MacCallum@icrf.icnet.uk
Abstract
MOTIVATION: Sequence database search methods often identify putative sub-threshold hits of known function or structure for a given query sequence. It is widespread practice to filter these hits by hand using knowledge of function and other factors; to the expert, some hits may appear more sensible than others. SAWTED (Structure Assignment With Text Description) is an automated solution to this post-filtering problem which will be applicable to large-scale genome assignments. RESULTS: A standard document comparison algorithm is applied to text descriptions extracted from SWISS-PROT annotations. The added value of SAWTED in combination with PSI-BLAST has been shown with a benchmark of difficult remote homologues taken from the SCOP structure database. AVAILABILITY: A SAWTED PSI-BLAST Web server is available to perform sensitive searches against the protein structure database (http://www.bmm.icnet.uk/servers/sawted). CONTACT: R.MacCallum@icrf.icnet.uk
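The idea of comparing text descriptions to re-rank sub-threshold hits can be illustrated with a small sketch. The abstract says only that "a standard document comparison algorithm" is used; the code below assumes a bag-of-words TF-IDF weighting with cosine similarity, which is one common choice and not necessarily the scheme SAWTED implements. The function names (`tokenize`, `tfidf_vectors`, `cosine`, `rank_hits`) and the example descriptions are hypothetical and are not taken from SWISS-PROT.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document.

    Assumption: smoothed inverse document frequency; the weighting used
    by SAWTED is not specified in the abstract.
    """
    counts_per_doc = [Counter(tokenize(d)) for d in docs]
    n_docs = len(docs)
    doc_freq = Counter()
    for counts in counts_per_doc:
        doc_freq.update(counts.keys())
    idf = {t: math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0 for t in doc_freq}
    return [{t: c * idf[t] for t, c in counts.items()} for counts in counts_per_doc]


def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_hits(query_text, hit_texts):
    """Rank hits by text similarity to the query description, best first.

    hit_texts maps a hit identifier to its text description.
    """
    vectors = tfidf_vectors([query_text] + list(hit_texts.values()))
    query_vec = vectors[0]
    scores = {hit_id: cosine(query_vec, vec)
              for hit_id, vec in zip(hit_texts, vectors[1:])}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical descriptions, for illustration only.
query = "serine threonine protein kinase ATP binding"
hits = {
    "hit_a": "protein kinase domain ATP binding phosphorylation",
    "hit_b": "ribosomal protein large subunit",
}
for hit_id, score in rank_hits(query, hits):
    print(hit_id, round(score, 3))
```

In a post-filtering setting, a score of this kind would be combined with the sequence search result (for example, a PSI-BLAST E-value) rather than used alone; how the two are combined is not described in the abstract and is left out of the sketch.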