Literature DB >> 10842732

Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL.

S A Teichmann1, C Chothia, G M Church, J Park.   

Abstract

MOTIVATION: For large-scale structural assignment to sequences, as in computational structural genomics, a fast yet sensitive sequence search procedure is essential. A new approach using intermediate sequences was tested as a shortcut to iterative multiple sequence search methods such as PSI-BLAST.
RESULTS: A library containing potential intermediate sequences for proteins of known structure (PDB-ISL) was constructed. The sequences in the library were collected from a large sequence database using the sequences of the domains of proteins of known structure as the query sequences and the program PSI-BLAST. Sequences of proteins of unknown structure can be matched to distantly related proteins of known structure by using pairwise sequence comparison methods to find homologues in PDB-ISL. Searches of PDB-ISL were calibrated, and the number of correct matches found at a given error rate was the same as that found by PSI-BLAST. The advantage of this library is that it uses pairwise sequence comparison methods, such as FASTA or BLAST2, and can, therefore, be searched easily and, in many cases, much more quickly than an iterative multiple sequence comparison method. The procedure is roughly 20 times faster than PSI-BLAST for small genomes and several hundred times for large genomes. AVAILABILITY: Sequences can be submitted to the PDB-ISL servers at http://stash.mrc-lmb.cam.ac.uk/PDB_ISL/ or http://cyrah.ebi.ac.uk:1111/Serv/PDB_ISL/ and can be downloaded from ftp://ftp.ebi.ac.uk/pub/contrib/jong/PDB_+ ++ISL/ CONTACT: sat@mrc-lmb.cam.ac.uk and jong@ebi.ac.uk

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10842732     DOI: 10.1093/bioinformatics/16.2.117

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  13 in total

1.  SCOP: a structural classification of proteins database.

Authors:  L Lo Conte; B Ailey; T J Hubbard; S E Brenner; A G Murzin; C Chothia
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments.

Authors:  I Friedberg; T Kaplan; H Margalit
Journal:  Protein Sci       Date:  2000-11       Impact factor: 6.725

3.  PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

Authors:  J Qian; B Stenger; C A Wilson; J Lin; R Jansen; S A Teichmann; J Park; W G Krebs; H Yu; V Alexandrov; N Echols; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-04-15       Impact factor: 16.971

4.  Identification of related proteins with weak sequence identity using secondary structure information.

Authors:  C Geourjon; C Combet; C Blanchet; G Deléage
Journal:  Protein Sci       Date:  2001-04       Impact factor: 6.725

5.  The CATH extended protein-family database: providing structural annotations for genome sequences.

Authors:  Frances M G Pearl; David Lee; James E Bray; Daniel W A Buchan; Adrian J Shepherd; Christine A Orengo
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

6.  Detection of homologous proteins by an intermediate sequence search.

Authors:  Bino John; Andrej Sali
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

7.  Alternative approach to protein structure prediction based on sequential similarity of physical properties.

Authors:  Yi He; S Rackovsky; Yanping Yin; Harold A Scheraga
Journal:  Proc Natl Acad Sci U S A       Date:  2015-04-06       Impact factor: 11.205

8.  Profiles of Natural and Designed Protein-Like Sequences Effectively Bridge Protein Sequence Gaps: Implications in Distant Homology Detection.

Authors:  Gayatri Kumar; Narayanaswamy Srinivasan; Sankaran Sandhya
Journal:  Methods Mol Biol       Date:  2022

9.  Improved detection of remote homologues using cascade PSI-BLAST: influence of neighbouring protein families on sequence coverage.

Authors:  Swati Kaushik; Eshita Mutt; Ajithavalli Chellappan; Sandhya Sankaran; Narayanaswamy Srinivasan; Ramanathan Sowdhamini
Journal:  PLoS One       Date:  2013-02-20       Impact factor: 3.240

10.  A protein domain interaction interface database: InterPare.

Authors:  Sungsam Gong; Changbum Park; Hansol Choi; Junsu Ko; Insoo Jang; Jungsul Lee; Dan M Bolser; Donghoon Oh; Deok-Soo Kim; Jong Bhak
Journal:  BMC Bioinformatics       Date:  2005-08-25       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.