| Literature DB >> 24173824 |
Shuchismita Dutta1, Dimitris Dimitropoulos, Zukang Feng, Irina Persikova, Sanchayita Sen, Chenghua Shao, John Westbrook, Jasmine Young, Marina A Zhuravleva, Gerard J Kleywegt, Helen M Berman.
Abstract
With the accumulation of a large number and variety of molecules in the Protein Data Bank (PDB) comes the need on occasion to review and improve their representation. The Worldwide PDB (wwPDB) partners have periodically updated various aspects of structural data representation to improve the integrity and consistency of the archive. The remediation effort described here was focused on improving the representation of peptide-like inhibitor and antibiotic molecules so that they can be easily identified and analyzed. Peptide-like inhibitors or antibiotics were identified in over 1000 PDB entries, systematically reviewed and represented either as peptides with polymer sequence or as single components. For the majority of the single-component molecules, their peptide-like composition was captured in a new representation, called the subcomponent sequence. A novel concept called "group" was developed for representing complex peptide-like antibiotics and inhibitors that are composed of multiple polymer and nonpolymer components. In addition, a reference dictionary was developed with detailed information about these peptide-like molecules to aid in their annotation, identification and analysis. Based on the experience gained in this remediation, guidelines, procedures, and tools were developed to annotate new depositions containing peptide-like inhibitors and antibiotics accurately and consistently.Entities:
Keywords: Protein Data Bank; peptide-like antibiotic; peptide-like inhibitor
Mesh:
Substances:
Year: 2014 PMID: 24173824 PMCID: PMC3992913 DOI: 10.1002/bip.22434
Source DB: PubMed Journal: Biopolymers ISSN: 0006-3525 Impact factor: 2.505
Figure 1The chemical structures and sequences of (A) thiostrepton (PDB entry 1e9w)5 and (B) gramicidin S (PDB entry 1tk2).8 The chemical diagrams show cyclizations and modifications that lead to formation of the final antibiotic molecule. Red lines indicate the boundaries of the chemical components in the polymer, while the numbers indicate the correspondence with the gene or nonribosomal product.
Figure 2Chemical structure of three trifluoroacetyl-dipeptide-anilide inhibitors of elastase.9 In each case, the inhibitor's chemical component name is followed by its subcomponents (following the colon). (A) Chemical structure of inhibitor 0Z1 from PDB entry 1ela; (B) inhibitor 0Z4 from PDB entry 1elb; and (C) inhibitor 0Z3 from PDB entry 1elc.
Figure 3Representation of grouped peptide-like molecules. (A) Example of a peptide-like antibiotic that is a derivative of teicoplanin (from PDB entry 3vfj).10 The molecule shown here has lost a single chlorine atom during the experiment and is chemically different from naturally occurring teicoplanin. Both chemical structure and components list show that teicoplanin has a peptide core (shown in black), decorated with three saccharides (circled in blue, green, and orange) and a fatty acid (shown as R in red). The chemical components in the peptide core are numbered from 1 to 7. Bonds highlighted in green denote the peptide linkages between residues in the peptide core, while the purple bonds mark the covalent linkages between side-chains of the peptide residues. (B) Schematic representation of residues in the 22-mer “minigramicidin” (PDB entry 1kqe)11 showing two copies of the terminal 11-mer domains of gramicidin A, covalently linked in a head-to-head fashion. The linker between the two molecules is succinic acid (SIN).
Figure 4Covalent binding of the peptide-like inhibitor ACE-ASP-GLU-VAL-ASJ to its target, caspase 3, in PDB entry 4dcp.12 While the carbonyl carbon (marked with an arrow) of the C-terminal residue in the unbound inhibitor (of type ASA) is sp2, its hybridization state in the bound hemithioacetal product is sp3 and the residue is of type ASJ.
Overview of Representation and Annotation of Polymeric, Single Component and Grouped Peptide-Like Molecules in the PDB
| Molecule Properties | Peptide with Polymer Sequence | Single Component with Subcomponents | Group (Peptide Core with Additional Polymer and/or Nonpolymer Components) |
|---|---|---|---|
| Name | Polymer name is listed along with names of other polymers in the PDB entry | Component name is listed along with other ligands, ions or components in the PDB entry | Group name is listed for the complete molecule – including all polymer and nonpolymer components |
| Source | Source organism is included for naturally derived polymers | Not applicable (n/a) as most molecules are designed | Source organism is included for the polymeric portion(s) of the grouped molecule |
| Composition and linkage | • Polymer sequence is listed along with other polymers in the PDB entry. | • Apparent “sequence” is described in subcomponent sequence. | • All constituents of the molecule group are defined in PDB entry |
| • Standard peptide linkages between component residues are implied | • Explicit linkages between subcomponents are listed in the corresponding PRD files. | • Sequence of polymeric components is described just as any other polymer | |
| • All nonstandard linkages are explicitly described in the PDB entry and PRD file. | • All linkages between atoms are listed in the CCD | • Linkages between polymeric and nonpolymeric constituents explicitly defined in PDB and PRD entries. | |
| Reference | Sequence database reference for polymer is included, where available | n/a | Sequence database reference for polymeric components is included, where available |
| Structure | Regular regions of 3D structure (such as helices/sheets) are described, where appropriate | n/a | Regular regions of 3D structure (such as helices/sheets) are described for polymeric components, where appropriate |
| Binding environment | Residues interacting with or surrounding the polymer are highlighted | Residues interacting with or surrounding the component are highlighted | Residues interacting with or surrounding the grouped molecule are highlighted |
| Function | Overall function of the polymer is described | Overall function of the component is described | Overall function of the grouped molecule is described |
Overview of Information Content of PRD and FAM Entries in the BIRD Resource
| Property | PRD File | FAM File |
|---|---|---|
| Identifier | PRD_######, e.g., PRD_000001 | FAM_######, e.g., FAM_000001 |
| Name | Molecule name | Family name |
| Description | • Molecular formula and weight | • List of member PRD identifiers |
| • Specific function (class) | • Literature references | |
| • Structural details (type) | ||
| Chemical details | • List of polymer and nonpolymer entities comprising the molecule | • Synonyms for PRD family members collected from various resources including the PDB and primary citations |
| • Polymer or subcomponent sequence• Description of how all components in the molecule are linked | • Reference database IDs and links to various resources that provide information about the family members | |
| ○ Intra-entity linkages between components in polymer entities | • Family specific annotations | |
| ○ Inter-entity linkages between polymer segments and nonpolymer components (for molecules with grouped representation) | ○ IDs and information about related small molecule crystal structures in the CSD | |
| ○ Corresponding literature references | ||
| Biological details | Name of organism producing the molecule (for naturally produced molecules) and source of this information (e.g. from a database, author or literature) | Annotations about structural, functional and mechanistic details for family members, including pharmacological action (where appropriate) |
Figure 5Chemical structures of four members of the vancomycin family of glycopeptide antibiotics. All these molecules have the same peptide core, and the different decorations in each of the molecules are circled using different colors. (A) Vancomycin aglycon has no sugars linked to it (PDB entry 1ghg).15 (B) Desvancosaminyl vancomycin is an intermediate in the vancomycin-biosynthesis pathway. It has only one saccharide linked to the peptide core (PDB entry 1rrv).16 (C) Vancomycin has a disaccharide decorating the peptide core (PDB entry 1aa5).17 (D) Chloroorienticin A has a disaccharide and a monosaccharide decorating the core (PDB entry 1gac).18
Figure 6A flow-chart showing the logic used for deciding the representation of peptide-like inhibitor and antibiotic molecules in the PDB.