| Literature DB >> 30090808 |
Ashutosh Kumar1, Kam Y J Zhang1.
Abstract
Molecular similarity is a key concept in drug discovery. It is based on the assumption that structurally similar molecules frequently have similar properties. Assessment of similarity between small molecules has been highly effective in the discovery and development of various drugs. Especially, two-dimensional (2D) similarity approaches have been quite popular due to their simplicity, accuracy and efficiency. Recently, the focus has been shifted toward the development of methods involving the representation and comparison of three-dimensional (3D) conformation of small molecules. Among the 3D similarity methods, evaluation of shape similarity is now gaining attention for its application not only in virtual screening but also in molecular target prediction, drug repurposing and scaffold hopping. A wide range of methods have been developed to describe molecular shape and to determine the shape similarity between small molecules. The most widely used methods include atom distance-based methods, surface-based approaches such as spherical harmonics and 3D Zernike descriptors, atom-centered Gaussian overlay based representations. Several of these methods demonstrated excellent virtual screening performance not only retrospectively but also prospectively. In addition to methods assessing the similarity between small molecules, shape similarity approaches have been developed to compare shapes of protein structures and binding pockets. Additionally, shape comparisons between atomic models and 3D density maps allowed the fitting of atomic models into cryo-electron microscopy maps. This review aims to summarize the methodological advances in shape similarity assessment highlighting advantages, disadvantages and their application in drug discovery.Entities:
Keywords: 3D Zernike descriptors; drug discovery; gaussian overlay; molecular similarity; shape similarity; spherical harmonics; virtual screening
Year: 2018 PMID: 30090808 PMCID: PMC6068280 DOI: 10.3389/fchem.2018.00315
Source DB: PubMed Journal: Front Chem ISSN: 2296-2646 Impact factor: 5.221
Figure 1A schematic overview of similarity calculation between a query and database molecules.
Atomic distance based shape comparison methods.
| USR | Extremely fast shape comparison method. Webserver can screen about 55 million conformers in 1 s. Different functional groups and enantiomers not recognized. | A ligand-based virtual screening webserver, USR-VS is available at | Ballester and Richards, |
| USR+MACCS | Functional group information added to USR. Enantiomers not recognized. | Available on request | Cannon et al., |
| CSR and USR:OptIso | Chiral shape recognition. Optical isomerism descriptors added to USR. | Developed by University of Oxford, UK. May be available from Oxford Drug Design company ( | Armstrong et al., |
| Electroshape | Chiral shape recognition, include descriptor for charge and lipophilicity. | Developed by University of Oxford, UK. May be available from Oxford Drug Design company ( | Armstrong et al., |
| UFSRAT | Pharmacophoric constraints by including atom-type information. | Developed by University of Edinburgh. Server available at | Shave, |
| USRCAT | Included CREDO atom-type information. | A python implementation of the method using RDKit toolkits is available from | Schreyer and Blundell, |
| ACPC | Method uses autocorrelation of partial charges. High throughput virtual screening possible. Cannot distinguish a molecule from its enantiomer. | Developed by Laboratory for Structural Bioinformatics, Centre for Biosystems Dynamics Research, RIKEN and is available from | Berenger et al., |
Figure 2(A) An overview of USR shape representation. In USR approach, the shape of a molecule is described by the distribution of atomic distance to four reference points. (B) An example of shape similarity calculation between two small molecules utilizing the USR approach.
An overview of commonly used Gaussian overlay based shape comparison methods.
| ROCS | Fast Gaussian overlay based shape comparison. Widely used shape based virtual screening tool. GPU version also available. | Developed by OpenEye Scientific Software ( | Rush et al., |
| PAPER | Accelerates large scale virtual screening experiments. Parallel implementation on NVIDIA GPUs. | Developed by Stanford University. Open source. Available from SimTK at | Haque and Pande, |
| MolShaCS | Uses Gaussian description of shape and charge. Hodgkin like similarity metric. Molecules are considered rigid. | Developed by University of Sao Paolo, Brazil. Open source tool available at | Vaz de Lima and Nascimento, |
| SHAFTS | It combines shape similarity with pharmacophoric features. Employs a hybrid similarity metric combining shape and chemical similarity. Suitable for large scale virtual screening. | Developed by Shanghai Key Laboratory of New Drug Design, East China University of Science & Technology, Shanghai, China. Available for download from | Liu et al., |
| Phase Shape | Uses atom triplets to generate initial alignments which are refined by Gaussian overlay. | Developed by Schrodinger. ( | Sastry et al., |
| ShaEP | Generate consensus shape pattern based on structural features of known ligands. | Developed by Abo Akademi University, Finland. Free for Academics. Available from the Abo Akademi University at | Vainio et al., |
| SimG | Uses downhill simplex method to evaluate shape and chemical similarity between two molecules. Comparison of ligand and binding pocket shape or chemical similarity is also possible. | Developed by Shanghai Key Laboratory of New Drug Design, East China University of Science & Technology, Shanghai, China. Available for download from | Cai et al., |
| SABRE | Uses consensus shapes to generate initial alignments which are later refined by rigid-body rotations and translations. | Academic license is available on request | Hamza et al., |
| WEGA | Uses a weighted Gaussian function to improve the accuracy of first order approximation. A GPU implementation (gWEGA) is also available for large scale virtual screenings. | Developed by Research Center for Drug Discovery, Sun Yat-sen University, China. Academic license is available on request at | Yan et al., |
Figure 3An overview of the shape similarity calculation by ROCS program.
An overview and availability of a few surface-based shape comparison methods.
| SURFCOMP | Molecular surface is divided into patches and corresponding patches are identified using geometrically invariant descriptors and physicochemical properties. | Available on request. | Hofbauer et al., |
| ParaFit | Performs 3D superposition and surface property comparison. Electronic surface properties are calculated using ParaSurf program. Spherical harmonics expansion coefficients of molecular surface are used. | Developed by CEPOS | Mavridis et al., |
| SHeMS | Uses spherical harmonics description of shape. Weights of spherical harmonics expansion coefficients are optimized using a genetic algorithm. | Developed by Shanghai Key Laboratory of New Drug Design, East China University of Science & Technology, Shanghai, China. Obtained by contacting Prof. Honglin Li at | Cai et al., |
| HPCC | Combined spherical harmonics shape comparison with pharmacophoric features. Tanimoto similarity coefficients for shape and chemical similarity are added to evaluate similarity between two molecules. | Developed by Harmonic Pharma. May be available from | Karaboga et al., |
| 3DZD | Uses 3D Zernike descriptors which are extension of spherical harmonics. Rotation translation invariant. | Developed by Kihara Bioinformatics laboratory at Purdue University, USA. Several implementations of 3DZD are available either as standalone program or web-server at | Sael et al., |
Figure 4Application of 3D Zernike descriptors in (A) protein protein similarity (B) small molecule similarity (C) protein ligand complementarity and (D) comparison of cryoEM maps.