Guoli Wang, Roland L Dunbrack.
Abstract
PISCES is a database server that produces lists of sequences from the Protein Data Bank (PDB) according to entry- and chain-specific criteria and mutual sequence identity. Our goal in culling the PDB is to provide the longest possible list of the highest-resolution structures that satisfy the sequence identity and structural quality cut-offs. The new PISCES server uses a combination of PSI-BLAST and structure-based alignments to determine sequence identities. Structure alignment produces more complete alignments, and therefore more accurate sequence identities, than PSI-BLAST. PISCES now allows a user to cull the PDB by entry in addition to the standard culling by individual chain. In this mode, a list contains only entries in which no chain has a sequence identity above the cut-off to any chain in any other entry in the list. PISCES also provides fully annotated sequences, including gene name and species. The server allows a user to cull an input list of entries or chains, so that other criteria, such as function, can be used. Results from a search on the RCSB's re-engineered PDB site can be entered into the PISCES server with a single click, combining the powerful search capabilities of the PDB with PISCES's utilities for sequence culling. The server's data are updated weekly. The server is available at http://dunbrack.fccc.edu/pisces.
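The difference between culling by chain and culling by entry can be illustrated with a minimal Python sketch. It assumes a simple greedy pass in order of resolution, which is an illustrative choice and not necessarily the procedure the server runs. The `Chain` type, the `identity` callback and both function names are hypothetical; in practice the pairwise identities would come from the PSI-BLAST or structure-based alignments.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chain:
    entry: str         # PDB entry ID (hypothetical example: "1AAA")
    chain: str         # chain ID within the entry
    resolution: float  # in angstroms; lower is better


def cull_by_chain(chains, identity, cutoff):
    """Greedy sketch: visit chains from best to worst resolution and keep a
    chain only if its identity to every already-kept chain is <= cutoff."""
    kept = []
    for c in sorted(chains, key=lambda c: (c.resolution, c.entry, c.chain)):
        if all(identity(c, k) <= cutoff for k in kept):
            kept.append(c)
    return kept


def cull_by_entry(chains, identity, cutoff):
    """Entry-level variant: keep an entry only if none of its chains exceeds
    the cutoff against any chain of an already-kept entry. Chains within the
    same entry are not compared with each other."""
    by_entry = {}
    for c in chains:
        by_entry.setdefault(c.entry, []).append(c)

    # Rank entries by their best-resolved chain (assumption for this sketch).
    order = sorted(by_entry, key=lambda e: (min(c.resolution for c in by_entry[e]), e))

    kept_entries = []
    for e in order:
        clash = any(
            identity(a, b) > cutoff
            for a in by_entry[e]
            for k in kept_entries
            for b in by_entry[k]
        )
        if not clash:
            kept_entries.append(e)
    return kept_entries
```

Called with a list of `Chain` objects, a function returning the percentage identity of two chains, and a cut-off such as 30, `cull_by_chain` returns the retained chains and `cull_by_entry` returns the retained entry IDs. Note that the by-entry list may keep an entry containing several identical chains, since only chains in different entries are compared.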
Year: 2005 PMID: 15980589 PMCID: PMC1160163 DOI: 10.1093/nar/gki402
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1. Scatterplots of sequence identities (top figures) and alignment lengths (bottom figures) for alignment pairs in common between PSI-BLAST and BLAST (left two figures) and CE and PSI-BLAST (right two figures). On the top two plots, a box is shown that contains points that would be used to eliminate sequences at 30% sequence identity using the method on the x-axis but not by the method on the y-axis.
Lengths of lists obtained using different sequence alignment methods
| Sequence identity cut-off (%) | BLAST | PSI-BLAST | REPRDB | CE |
|---|---|---|---|---|
| 15 | 2092 | 1982 | 17 | 2041 |
| 20 | 2099 | 2268 | 75 | 2343 |
| 25 | 2184 | 2782 | 1492 | 2839 |
| 30 | 2562 | 3349 | 3677 | 3400 |
| 40 | 3863 | 4293 | 4570 | 4305 |
| 50 | 4878 | 5002 | 5200 | 5003 |
| 60 | 5408 | 5482 | 5695 | 5480 |
| 70 | 5845 | 5873 | 6102 | 5872 |
| 80 | 6194 | 6219 | 6566 | 6218 |
| 90 | 6686 | 6708 | 7462 | 6707 |
Criteria for inclusion in the lists: resolution ≤3.0 Å; including Cα chains; excluding non-X-ray entries.
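As a rough illustration, the inclusion criteria above can be written as a filter applied before culling. This is a sketch under stated assumptions: the `EntryInfo` record, its field names and the `"X-RAY"` method label are hypothetical, and "including Cα chains" is read here as "Cα-only chains are not excluded".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EntryInfo:
    pdb_id: str
    method: str                   # experimental method label (hypothetical values)
    resolution: Optional[float]   # in angstroms; None when not applicable
    ca_only: bool                 # True if only C-alpha coordinates are present


def passes_table_criteria(e: EntryInfo, max_resolution: float = 3.0) -> bool:
    """Return True for X-ray entries with resolution <= max_resolution.
    C-alpha-only chains are deliberately not filtered out."""
    if e.method != "X-RAY":
        return False
    return e.resolution is not None and e.resolution <= max_resolution
```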