| Literature DB >> 25300483 |
Jérémy Desaphy1, Guillaume Bret1, Didier Rognan1, Esther Kellenberger2.
Abstract
The sc-PDB database (available at http://bioinfo-pharma.u-strasbg.fr/scPDB/) is a comprehensive and up-to-date selection of ligandable binding sites of the Protein Data Bank. Sites are defined from complexes between a protein and a pharmacological ligand. The database provides the all-atom description of the protein, its ligand, their binding site and their binding mode. Currently, the sc-PDB archive registers 9283 binding sites from 3678 unique proteins and 5608 unique ligands. The sc-PDB database was publicly launched in 2004 with the aim of providing structure files suitable for computational approaches to drug design, such as docking. During the last 10 years we have improved and standardized the processes for (i) identifying binding sites, (ii) correcting structures, (iii) annotating protein function and ligand properties and (iv) characterizing their binding mode. This paper presents the latest enhancements in the database, specifically pertaining to the representation of molecular interaction and to the similarity between ligand/protein binding patterns. The new website puts emphasis in pictorial analysis of data.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25300483 PMCID: PMC4384012 DOI: 10.1093/nar/gku928
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.The general flow chart from PDB to sc-PDB.
Downloadable content of sc-PDB
| filename | Number of entries in file | Data description | Data type |
|---|---|---|---|
| protein.mol2 | one or list of PDB ID matching search criteria | All-atom description of sc-PDB protein(s) | Atomic data |
| ligand.mol2 | one or list of PDB ID matching search criteria | All-atom description of sc-PDB ligand | Atomic data |
| site.mol2 | one or list of PDB ID matching search criteria | All-atom description of sc-PDB ligand binding site | Atomic data |
| cavity6.mol2 | one or list of PDB ID matching search criteria | The cavity is the negative image of the binding site, described by regularly spaced points colored according to pharmacophoric properties of the site atoms | Atomic data |
| ints_M.mol2 | one or list of PDB ID matching search criteria | Non-bonded interactions between sc-PDB ligand and its binding site | Atomic data |
| Each interaction is characterized by three points, placed respectively on the protein atom in interaction, the ligand atom in interaction and at the center of the segment defined by these two points | |||
| ifp.txt | one or list of PDB ID matching search criteria | Non-bonded interactions between sc-PDB ligand and its binding site | Binary string |
| For each residue in site is marked the presence or absence of interaction with ligand (hydrophobic contact, aromatic bond, H-bond, ionic bond, metal-ion bond) | |||
| C- | Clusters of binding sites | The archive classifies all the sc-PDB entries of a UNIPROT protein. It is organized into directories, one for each cluster of sites. Each cluster contains protein.mol2, site.mol2 and ligand.mol2 files of all individual PDB entries, which have been 3D-aligned to the site at cluster center | Atomic data |
| Alignment.tar.gz | Pair of PDB ID, 3D-aligned for optimizing site similarity (1) or binding mode similarity (2) | The archive describes the protein, site, ligand and cavity (1) or non-bonded interaction (2) files for the reference entry (original coordinates) and the compared entry (fitted coordinates) | Atomic data |
| scPDB_results | list of entries matching search criteria | Annotation and 2D structure of ligands in csv, xlsx or sdf formats | Text |
| list of entries similar to query site (1) or query binding mode (2) | Similarity scores | Text |
All charts and pictures of sc-PDB website are downloadable in png format. The complete database is downloadable as a compressed archive from the database homepage.
Figure 2.Crystallographic water molecules in sc-PDB binding sites.
Figure 3.Search the sc-PDB for similar binding modes. Screenshots display the distribution of values for a given query binding mode (top left), the ranked list of similar entries (bottom left) and the 3D alignment of a selected hit with the query complex (top right). The closer view (bottom right) better shows aligned interaction points. The 3D structure of the query is colored in yellow (PDB ID: 1hop, HET: GCP), the selected hit in green (PDB ID: 1rj9, HET: GCP). Interaction pseudo-atoms are colored by interaction type (green, hydrophobic; blue, H-bond with ligand acceptor; red, H-bond with ligand donor; brown, metal chelation).