| Literature DB >> 21933373 |
Evan E Bolton1, Jie Chen, Sunghwan Kim, Lianyi Han, Siqian He, Wenyao Shi, Vahan Simonyan, Yan Sun, Paul A Thiessen, Jiyao Wang, Bo Yu, Jian Zhang, Stephen H Bryant.
Abstract
BACKGROUND: PubChem is an open repository for small molecules and their experimental biological activity. PubChem integrates and provides search, retrieval, visualization, analysis, and programmatic access tools in an effort to maximize the utility of contributed information. There are many diverse chemical structures with similar biological efficacies against targets available in PubChem that are difficult to interrelate using traditional 2-D similarity methods. A new layer called PubChem3D is added to PubChem to assist in this analysis. DESCRIPTION: PubChem generates a 3-D conformer model description for 92.3% of all records in the PubChem Compound database (when considering the parent compound of salts). Each of these conformer models is sampled to remove redundancy, guaranteeing a minimum (non-hydrogen atom pair-wise) RMSD between conformers. A diverse conformer ordering gives a maximal description of the conformational diversity of a molecule when only a subset of available conformers is used. A pre-computed search per compound record gives immediate access to a set of 3-D similar compounds (called "Similar Conformers") in PubChem and their respective superpositions. Systematic augmentation of PubChem resources to include a 3-D layer provides users with new capabilities to search, subset, visualize, analyze, and download data.A series of retrospective studies help to demonstrate important connections between chemical structures and their biological function that are not obvious using 2-D similarity but are readily apparent by 3-D similarity.Entities:
Year: 2011 PMID: 21933373 PMCID: PMC3269824 DOI: 10.1186/1758-2946-3-32
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Figure 1PubChem Compound database 3-D coverage. As one can see, 89.6% of all records have a 3-D conformer model. If one includes the parent compound of salts, this coverage can be considered to be 92.3%. Of the cases not having a 3-D conformer model, the majority are due to the flexibility of the chemical structure being too great to be suitable for conformer generation.
PubChem3D properties and descriptors
| For each | For each | |
|---|---|---|
| MMFF94 partial charges | X | |
| Pharmacophore features | X | |
| Conformer model sampling RMSD | X | |
| Diverse conformer ordering | X | |
| Conformer identifier | X | |
| MMFF94s Energy (minus coulombic terms) | X | |
| Conformer volume | X | |
| Steric shape moments | X | |
| Shape self-overlap volume | X | |
| Feature self-overlap volume | X | |
| Shape fingerprint | X |
Compound- and conformer-specific properties available for PubChem Compound records.
PubChem3D Entrez indexes
| Index name | Description and example query |
|---|---|
| ConformerCount3D | Conformer count ( |
| ConformerModelRmsd3D | Conformer sampling RMSD in Å ( |
| EffectiveRotorCount3D | Effective rotor count ( |
| FeatureCount3D | Total count of individual features ( |
| FeatureAcceptorCount3D | Hydrogen-bond acceptor count ( |
| FeatureAnionCount3D | Anion count at pH 7 ( |
| FeatureCationCount3D | Cation count at pH 7 ( |
| FeatureDonorCount3D | Hydrogen-bond donor count ( |
| FeatureHydrophobeCount3D | Hydrophobe count ( |
| FeatureRingCount3D | Ring count ( |
| Volume3D | Analytic volume of the first diverse conformer (default conformer) for each compound ( |
| XStericQuadrupole3D | Qx (describes length) of the first diverse conformer (default conformer) for each compound ( |
| YStericQuadrupole3D | Qy (describes width) of the first diverse conformer (default conformer) for each compound ( |
| ZStericQuadrupole3D | Qz (describes height) of the first diverse conformer (default conformer) for each compound ( |
Searchable compound- and conformer-specific fields available for PubChem Compound records.
Figure 2Summary page enhancements. A snapshot of the PubChem Compound summary page of dopamine (CID 681). Clicking on the "3D" tab on the right side of the page shows the 3-D structure of the molecule. Clicking the "Compound information" in the "Table of Contents" box directs users to 2-D neighbors ("Similar Compounds") and 3-D neighbors ("Similar Conformers").
Figure 3Visualization of a 3-D structure conformer. Clicking on the 3-D image on the PubChem Compound summary page (left) shows links to the web-based 3-D viewer (top right) and the Pc3D desktop helper application (bottom right).
Figure 4Visualization of 3-D structure conformer superpositions. Superpositions between compound conformers are accessible from various PubChem3D-aware applications. The PubChem Compound summary page (top left) allows the "Similar Conformers" neighboring relationship to be visualized. The PubChem3D web-based viewer (bottom left) allows arbitrary superpositions to be generated. The PubChem Structure Clustering tool (bottom right) enables all pair-wise superpositions to be examined.
Figure 53-D similarity relationship finds additional biological annotation. Comparison of the 2-D "Similar Compound" and 3-D "Similar Conformer" neighboring relationships using dopamine to demonstrate how both neighboring relationship complement each other when locating related chemical structures with unique biological annotation.
Figure 6Relating biologically active compounds by means of PubChem3D. Chemical probes ML088 (CID 704205) and ML087 (CID 25199559) from PubChem BioAssay 1548 against tissue non-specific alkaline phosphatase (TNAP, GI:116734717) are not similar by 2-D similarity but are by 3-D similarity.
Figure 7Similarity score matrix for selected histamine H1 receptor antagonist anti-inflammatory drugs. The lower triangle of the score matrix corresponds to the 2-D similarity computed using the PubChem fingerprint. The upper triangle corresponds to the 3-D similarity ST/CT scores. The matrix elements in red text indicate a 2-D similarity ≥ 0.75 or 3-D similarity with ST ≥ 0.74 and ComboT ≥ 1.0. The first ten diverse conformers per molecule were superimposed using shape-based optimization and the single conformer-pair per compound-pair with the largest ComboT retained.
Figure 83-D superposition of selected histamine H1 receptor antagonist anti-inflammatory drugs. Although there is little 2-D similarity, using the PubChem fingerprint, substantial 3-D similarity is found between various structurally diverse anti-inflammatory drugs.