Literature DB >> 30698657

Calculation of accurate interatomic contact surface areas for the quantitative analysis of non-bonded molecular interactions.

Judemir Ribeiro1, Carlos Ríos-Vera1, Francisco Melo1, Andreas Schüller1.   

Abstract

SUMMARY: Intra- and intermolecular contact surfaces are routinely calculated for a large array of applications in bioinformatics but are typically approximated from differential solvent accessible surface area calculations and not calculated directly. These approximations do not properly take the effects of neighboring atoms into account and tend to deviate considerably from the true contact surface. We implemented an extension of the original Shrake-Rupley algorithm to accurately estimate interatomic contact surface areas of molecular structures and complexes. Our extended algorithm is able to calculate the contact area of an atom to all nearby atoms by directly calculating overlapping surface patches, taking into account the possible shielding effects of neighboring atoms. Here, we present a versatile software tool and web server for the calculation of contact surface areas, as well as buried surface areas and solvent accessible surface areas (SASA) for different types of biomolecules, such as proteins, nucleic acids and small organic molecules. Detailed results are provided in tab-separated values format for analysis and Protein Databank files for visualization. Direct contact surface area calculation resulted in improved accuracy in a benchmark with a non-redundant set of 245 protein-DNA complexes. SASA-based approximations underestimated protein-DNA contact surfaces on average by 40%. This software tool may be useful for surface-based intra- and intermolecular interaction analyses and scoring function development.
AVAILABILITY AND IMPLEMENTATION: A web server, stand-alone binaries for Linux, MacOS and Windows and C++ source code are freely available from http://schuellerlab.org/dr_sasa/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30698657      PMCID: PMC6748739          DOI: 10.1093/bioinformatics/btz062

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Molecular interactions are a fundamental principle of all biological processes. Non-bonded interactions between the binding partners of a molecular complex may be modeled by considering the complementarity of the shape and physicochemical properties of the binding interface. A common way to measure shape complementarity is the estimation of the buried surface area of a molecular complex. Surface area-based methods have been employed successfully in various areas of bioinformatics including the estimation of the binding free energy (Genheden and Ryde, 2015), scoring functions for molecular docking and drug design (Sobolev ), protein structure modeling (Sobolev and Edelman, 1995), statistical potentials (McConkey ) and prediction of non-covalent contacts (Sobolev ). Interatomic contact surface areas (CSA) may be estimated approximately by calculating differences in solvent accessible surface areas (SASA) of artificially rearranged molecular objects (Ribeiro ; Sagendorf ). However, these approximations are not very accurate and produced an average relative difference of −40% (see below). Here, we present a new software tool dubbed dr_sasa for the direct calculation of interatomic CSA by a modified Shrake-Rupley algorithm (Shrake and Rupley, 1973). Our extended algorithm is able to calculate the contact area of an atom to all nearby atoms by directly calculating overlapping surface patches, taking into account the possible shielding effects of neighboring atoms. To our knowledge, a free, open-source software tool and web server for the calculation of interatomic contact surfaces between atoms of different types of biomolecules (proteins, nucleic acids and small molecules) is currently not available. For convenience, dr_sasa is also able to calculate buried surface areas (BSA) and solvent accessible surface areas (SASA) and accepts files in either Protein Databank (.pdb) or Tripos Mol2 (.mol2) formats.

2 Implementation

To calculate interatomic CSA we extended the Shrake-Rupley algorithm. The original algorithm estimates SASA by considering a sphere of equidistant points for each atom, where the radius corresponds to the vdW radius of an atom plus the radius of a water molecule (1.4 Å by default). Points located inside the volume of other point clouds are excluded and SASA is estimated by counting the remaining points followed by multiplication with the surface area they represent. In our modification, instead of excluding points, which are inside the point clouds of other atoms, we store the identity of these interacting atoms. It should be noted that the surface points of an atom may be included in the volumes of multiple other atoms. For all buried surface points of an atom we therefore find all unique groups of interacting atoms. The final contact surface between two atoms is calculated by adding up the surface areas corresponding to all interacting groups of the first atom, which contain the second atom. In order to avoid overestimation of the contact surface by counting the surface area shared by a group of atoms several times, the algorithm divides this surface area by the number of atoms in the group. In Figure 1a the contact surface of atom A with atom B is calculated as the sum of surface 1 and half of surface 2, since surface 2 is shared between atoms B and C (c.f. Supplementary Section 1 for more details and pseudo code). Surface area calculations depend on the employed atomic vdW radii. dr_sasa uses two sets of vdW radii definitions by default: (i) for Protein Databank files the definitions as published by Chothia, 1975 are employed, which are equivalent to the radii of the popular software tool NACCESS (Hubbard and Thornton, 1993); (ii) for files provided in the Tripos Mol2 format vdW radii according to Tsai are utilized, based on a more fine-grained atom typing scheme and similar to the molecular modeling software UCSF Chimera (Pettersen ).
Fig. 1.

dr_sasa algorithm and calculation examples. a) 2D diagram of the calculation of contact surface areas. Letters denote atoms, circles denote the vdW surface + 1.4 Å and numbers denote contact surfaces of atom ‘A’. b) Surface representation of the binding site of factor Xa bound to rivaroxaban, shown as sticks (PDB ID 2w26), colored according to CSA (dr_sasa mode 4; blue to red: low to high Å2). The figure was generated with PyMOL. c) Surface-based contact map of factor Xa and rivaroxaban

dr_sasa algorithm and calculation examples. a) 2D diagram of the calculation of contact surface areas. Letters denote atoms, circles denote the vdW surface + 1.4 Å and numbers denote contact surfaces of atom ‘A’. b) Surface representation of the binding site of factor Xa bound to rivaroxaban, shown as sticks (PDB ID 2w26), colored according to CSA (dr_sasa mode 4; blue to red: low to high Å2). The figure was generated with PyMOL. c) Surface-based contact map of factor Xa and rivaroxaban In a benchmark with a non-redundant set of 245 protein–DNA complexes derived from the Protein–DNA Interface Database (PDIdb) (Norambuena and Melo, 2010) we determined that direct calculation of CSA by dr_sasa is more accurate and that SASA-based approximations (Ribeiro ; Sagendorf ) underestimated protein–DNA contact surfaces on average by 40% (Supplementary Section 2.1). We further validated SASA calculations with a non-redundant set of 290 protein-ligand complexes derived from PDBbind (Liu ). We compared dr_sasa against NACCESS, FreeSASA (Mitternacht, 2016) and MSMS (Sanner ), and obtained a low relative difference <1% (Supplementary Section 2.3). The reader is referred to the Supporting Information for extensive benchmark results and further analyses. dr_sasa supports four modes of operation. The fallback mode of operation (mode 0) calculates only the SASA of an input structure and outputs a text file with tabulated data and a PDB file, where SASA is stored in the B-factor column. In the contact surface area (CSA) mode of operation (mode 1) the program calculates a complete matrix of all surface-based contacts per atom and per residue and saves them as tab-separated values files. The sum over columns in these matrices is equivalent to the overall BSA of an atom or residue. Theses sums are also saved in the B-factor column of a separate PDB output file for visualization (Fig. 1b). In addition, contact map images may be generated from these matrices with a separate Python script (Fig. 1c). The CSA-mode requires the selection of chains to be considered as separate objects, or in case the input file contains different types of biomolecules (protein and nucleic acids, protein and ligands, or any combination of these) the program can automatically identify the separate objects and calculate their interactions. Modes of operation 2 and 3 calculate intramolecular surface-based contact maps per residue and per atom. The last mode of operation (mode 4) calculates intermolecular CSA without requiring that the contact surfaces are solvent accessible. This is especially useful for internal and deep ligand binding cavities (Fig. 1b). dr_sasa is suitable for batch processing. SASA calculations for 10 000 PDB snapshots of a molecular dynamics trajectory of a protein–DNA complex (1830 atoms) took 36 min. on a 16-thread x86 notebook computer (AMD Ryzen 7 1700 @ 3.2 GHz), equivalent to 0.2 s per structure. CSA calculations on the same dataset took 214 min. (1.3 s per structure). In conclusion, we present a freely available, versatile command-line program and web server for the accurate calculation of interatomic contact surfaces by a modified Shrake-Rupley algorithm, which may be useful for the quantitative analysis of non-bonded molecular interactions. Click here for additional data file.
  15 in total

1.  The packing density in proteins: standard radii and volumes.

Authors:  J Tsai; R Taylor; C Chothia; M Gerstein
Journal:  J Mol Biol       Date:  1999-07-02       Impact factor: 5.469

2.  Automated analysis of interatomic contacts in proteins.

Authors:  V Sobolev; A Sorokine; J Prilusky; E E Abola; M Edelman
Journal:  Bioinformatics       Date:  1999-04       Impact factor: 6.937

3.  Discrimination of native protein structures using atom-atom contact scoring.

Authors:  Brendan J McConkey; Vladimir Sobolev; Marvin Edelman
Journal:  Proc Natl Acad Sci U S A       Date:  2003-03-11       Impact factor: 11.205

4.  UCSF Chimera--a visualization system for exploratory research and analysis.

Authors:  Eric F Pettersen; Thomas D Goddard; Conrad C Huang; Gregory S Couch; Daniel M Greenblatt; Elaine C Meng; Thomas E Ferrin
Journal:  J Comput Chem       Date:  2004-10       Impact factor: 3.376

5.  Structural invariants in protein folding.

Authors:  C Chothia
Journal:  Nature       Date:  1975-03-27       Impact factor: 49.962

6.  Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions.

Authors:  Zhihai Liu; Minyi Su; Li Han; Jie Liu; Qifan Yang; Yan Li; Renxiao Wang
Journal:  Acc Chem Res       Date:  2017-02-09       Impact factor: 22.384

7.  The Protein-DNA Interface database.

Authors:  Tomás Norambuena; Francisco Melo
Journal:  BMC Bioinformatics       Date:  2010-05-18       Impact factor: 3.169

8.  PDIviz: analysis and visualization of protein-DNA binding interfaces.

Authors:  Judemir Ribeiro; Francisco Melo; Andreas Schüller
Journal:  Bioinformatics       Date:  2015-04-16       Impact factor: 6.937

Review 9.  The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities.

Authors:  Samuel Genheden; Ulf Ryde
Journal:  Expert Opin Drug Discov       Date:  2015-04-02       Impact factor: 6.098

10.  FreeSASA: An open source C library for solvent accessible surface area calculations.

Authors:  Simon Mitternacht
Journal:  F1000Res       Date:  2016-02-18
View more
  12 in total

1.  Deep Learning and Structure-Based Virtual Screening for Drug Discovery against NEK7: A Novel Target for the Treatment of Cancer.

Authors:  Mubashir Aziz; Syeda Abida Ejaz; Seema Zargar; Naveed Akhtar; Abdullahi Tunde Aborode; Tanveer A Wani; Gaber El-Saber Batiha; Farhan Siddique; Mohammed Alqarni; Ashraf Akintayo Akintola
Journal:  Molecules       Date:  2022-06-25       Impact factor: 4.927

Review 2.  A Quest for a Universal Plasma-Derived Antivenom Against All Elapid Neurotoxic Snake Venoms.

Authors:  Kavi Ratanabanangkoon
Journal:  Front Immunol       Date:  2021-04-23       Impact factor: 7.561

3.  Structural basis of P[II] rotavirus evolution and host ranges under selection of histo-blood group antigens.

Authors:  Shenyuan Xu; Kristen Rose McGinnis; Yang Liu; Pengwei Huang; Ming Tan; Michael Robert Stuckert; Riley Erin Burnside; Elsa Grace Jacob; Shuisong Ni; Xi Jiang; Michael A Kennedy
Journal:  Proc Natl Acad Sci U S A       Date:  2021-09-07       Impact factor: 11.205

4.  MAPIYA contact map server for identification and visualization of molecular interactions in proteins and biological complexes.

Authors:  Aleksandra E Badaczewska-Dawid; Chandran Nithin; Karol Wroblewski; Mateusz Kurcinski; Sebastian Kmiecik
Journal:  Nucleic Acids Res       Date:  2022-05-07       Impact factor: 19.160

5.  Drug Discovery Targeting the Disorder-To-Order Transition Regions through the Conformational Diversity Mimicking and Statistical Analysis.

Authors:  Insung Na; Sungwoo Choi; Seung Han Son; Vladimir N Uversky; Chul Geun Kim
Journal:  Int J Mol Sci       Date:  2020-07-24       Impact factor: 5.923

6.  Rational Design of Resveratrol O-methyltransferase for the Production of Pinostilbene.

Authors:  Daniela P Herrera; Andrea M Chánique; Ascensión Martínez-Márquez; Roque Bru-Martínez; Robert Kourist; Loreto P Parra; Andreas Schüller
Journal:  Int J Mol Sci       Date:  2021-04-21       Impact factor: 5.923

7.  Inactive-to-Active Transition of Human Thymidine Kinase 1 Revealed by Molecular Dynamics Simulations.

Authors:  Samanta Makurat; Zoe Cournia; Janusz Rak
Journal:  J Chem Inf Model       Date:  2021-12-17       Impact factor: 4.956

8.  Design and simulation of the liposomal model by using a coarse-grained molecular dynamics approach towards drug delivery goals.

Authors:  Jalil Parchekani; Abdollah Allahverdi; Majid Taghdir; Hossein Naderi-Manesh
Journal:  Sci Rep       Date:  2022-02-11       Impact factor: 4.379

9.  The Influence of Varying Fluorination Patterns on the Thermodynamics and Kinetics of Benzenesulfonamide Binding to Human Carbonic Anhydrase II.

Authors:  Steffen Glöckner; Khang Ngo; Björn Wagner; Andreas Heine; Gerhard Klebe
Journal:  Biomolecules       Date:  2020-03-27

10.  Cryo-EM structure of native human uromodulin, a zona pellucida module polymer.

Authors:  Alena Stsiapanava; Chenrui Xu; Martina Brunati; Sara Zamora-Caballero; Céline Schaeffer; Marcel Bokhove; Ling Han; Hans Hebert; Marta Carroni; Shigeki Yasumasu; Luca Rampoldi; Bin Wu; Luca Jovine
Journal:  EMBO J       Date:  2020-11-16       Impact factor: 14.012

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.