Literature DB >> 23609539

Adepth: New Representation and its implications for atomic depths of macromolecules.

Abstract

We applied the signed distance function (SDF) for representing the depths of atoms in a macromolecule. The calculations of SDF values were performed on grid points in a rectangular box that accommodates the macromolecule. The depth for an atom inside the molecule was then obtained as a result of tri-linear interpolation of SDF values at the nearest grid points surrounding the atom. For testing the performance of present program Adepth, we have constructed an artificial molecule whose atomic depths are known as the gold standard for accuracy assessments. On average, our results showed that Adepth reached an accuracy of 1.6% at 0.5 Å of grid spacing, whereas the current reference server DEPTH reached 7.5%. The Adepth program provides both depth and height representations; it is capable of computing iso-surfaces for atomic depths and presenting graphical view of macromolecular shape at some distance away from the surface. Web interface is available at http://biodev.cea.fr/adepth.

Entities: Chemical Disease Gene Species

Mesh：

Substances：

Year: 2013 PMID： 23609539 PMCID： PMC3692060 DOI： 10.1093/nar/gkt299

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

The atomic depth has been found extremely useful in studying hydrogen/deuterium exchange rates (1), the characterization of protein domain size and identification of potential targets for phosphorylation (2), quantifications of driving force for a burial residue (3), protein fold recognition (4) and predictions of small-molecule binding cavities in proteins (5,6). As a closely related quantity, residue depth was used to study mutational impacts on protein stability (7) and the relationship of depth measure with protein sequences (8,9). In the late 70’, macromolecular shape was described by a map of solvent exclusion, a method that could claim the first computation of atomic depths (10). Since then several methods have been devised to compute atomic depths in proteins. The first one used a pre-equilibrated solvent-bath in which the macromolecule of interest is immerged. Water molecules that bump with macromolecular atoms are removed from the bath, and the depth is assigned as a value of the distance between any atom and the nearest water molecule on the surface (7). Several other methods require computations of ASA to determine which atoms are buried beforehand (11). The depth of a buried atom is then measured as the distance between the atom and its closest solvent-accessible atom (2) or the closest vertex of ASA surface (8,9). Alternatively, one may attribute the depth index to each atom in the macromolecule by measuring the solvent-exposed volume of a virtual sphere with a varying radius located at the atomic centre (12). In this work, we present an unsupervised program, Adepth, which computes the atomic depth without requiring previous knowledge of ASA or preparations of a solvent bath. Another advantage of Adepth relies in its integrity on identification of surface atoms and quantification of atomic depths in one run. Validation of the program was performed on a well-designed molecular structure with known depth information (Figure 1). Applications of Adepth includes definition of protein skins with a user-defined thickness, calculations of atom–atom overlaps in macromolecular assemblies (14) and a graphical visualization of depth using iso-surfaces in VMD (13). Another useful application of Adepth concerns the assessment of tip-convolution effects in atomic force microscopy (AFM) imaging (15). In the range of grid spacing 0.5–1 Å, Adepth has been shown to be more accurate than available servers that were tested in the present study.

Figure 1.

Representation of the 3D structure of the standard molecule drawn by the vdw method in VMD (13). Each atom was scaled down to 20% of its original size. The Cartesian co-ordinates of atoms in the molecule can be expressed as (Rcosφsinθ, Rsinφsinθ, Rcosθ), where R is the radius of sphere, 0 ≤ θ ≤ π and 0 ≤ φ < 2π; totally there are 25 sampling points along the θ coordinate and 50 for the φ coordinate. Atomic depth is colour-coded as follows: cyan = 0 Å, green = 3 Å, orange = 6 Å, yellow = 9 Å and red = 12 Å.

IMPLEMENTATION

In the earlier work (16), the signed distance function (SDF) field of a protein allowed us to identify solvent accessible atoms and determine how deep a residue is inside the protein. In summary, we placed the protein in a rectangular grid of which width, length and height were determined by the size of the protein. The grid was uniformly discretized in three dimensions at user-defined grid spacing (Δx, Δy and Δz). For each atom of the protein, the atomic van der Waals (vdw) radius was given based on the atomic type defined in the Charmm22 force field (17), as shown in Table 1. By default, the radius of solvent or probe was set to 1.5 Å. For representing the solvent-accessible surface of the molecule, an extended radius, the sum of vdw and solvent radii, was assigned to each heavy atom. Following level set methods (18), we first computed the SDF values at interfacial grid points (level 0) using information of where are limits of atomic and solvent force fields. For a grid point (i j k) at next level, the SDF value, D, was obtained by solving the quadratic equation (19) where

Table 1.

Vdw radii used for atoms in Adepth

Atomic type	Å
H (polar)	0.225
H (non-polar)	1.320
C	2.175
N	1.850
O	1.800
S	2.200
P	2.150

Vdw radii used for atoms in Adepth , , The computations were performed level by level until no un-visited grid point was left. The atomic depth was then obtained as tri-linear interpolation of SDFs at the nearest grid points surrounding the atomic centre. We denoted this strategy as Adepth. The performance of Adepth depends on parameters, such as grid spacing, probe radius and atomic vdw radii, used. The finer grid spacing the more accurate results, we found that grid spacing of 0.5 Å is a good compromise between computational cost and accuracy (Figure 2A and B). Accounting for atomic size effects on atom depths, we scaled the SDF magnitude by the atomic extended radius. In the output of PDB format, the atomic depths and scaled ones are presented in B-factor and occupancy columns, respectively. In the web server, an option allows the user to obtain the output of DX format so that one can display 3D-field data in the iso-surface form using VMD (13). Moreover, the user can also request an extruded slice of a macromolecule with pre-defined thickness as the surface skin of the macromolecule (14).

Figure 2.

(A) Relationship of computational cost and molecular size. The CPU time was plotted against the number of atoms (Natom) of the standard molecule for Δr = 0.5, 0.6, 0.8 and 1.0 Å. The four sets of data values were fitted with linear regression, resulting in a slope of 0.2, 0.1, 0.07 and 0.05 msec/atom for Δr = 0.5, 0.6, 0.8 and 1.0 Å, respectively. (B) Validation of Adepth and DEPTH on atomic depth calculations for the standard molecule using a probe radius of 1.5 Å. In the plot, the horizontal dashed lines represent the distances of spherical layers from the surface. Data points with open symbols represent the values of averaged atomic depth versus Δr from Adepth, whereas that with filled symbols located along the vertical axis are referred to the DEPTH server. Bars around the open symbols represent the standard deviation; when it is <1%, bars are not drawn.

VALIDATION

As there is no analytical solution to quantification of atomic depths for the 3D structure of a protein or a nucleic acid, a novel method is usually validated by comparing with other available methods. A qualitative comparison between various methods on different macromolecular architectures is presented in the Supplementary Figure S1: a nucleic acid [1BNA (20)], a fibrous protein collagen [1CGD (21)], a large human coagulation factor VIII [3CDZ (22)] and the 30S ribosome subunit [2WDK (23)]. However, the quantitative assessment of these methods remains elusive. A new approach for evaluating the performance of depth computation is presented. A pseudo-molecule was created as a standard structure that consists of one-atom (Cα) residues, namely, UNK (UNK means of unknown type; atomic co-ordinates of the gold-standard molecule in the PDB format are available in the web server). The 3D structure of the standard molecule is formed of four equally spaced (3 Å) concentric spheres as shown in Figure 1. For the present purpose, how dense the residues packed is not the matter, but how much the space filled up is concerned. Totally, there are 1201 residues (or atoms) in each spherical layer, and the outermost sphere has a radius of 12 Å. As described in ‘Implementation’ section, probe radius and atomic radii are factors greatly influencing the results of atomic depth. We have performed calculations on the standard molecule with several values of grid spacing Δr (0.5–2.0 Å) such that Δr = Δx = Δy = Δz, as well as with different probe radii, 1.4–1.6 Å. Among available web servers, only DEPTH server (6) could take in the standard molecule and go through the entire calculation process, whereas others such as the DPX servers (24) aborted the run because of failure of the associated surface program, whereas the SADIC (25) and PocketDepth servers (5) did not return any results. It should be noted that the inter-sphere distance is measured according to positions of atomic centres, whereas the depth of an atom depends on the distance between the atomic centre and the probe centre. Consequently, to make a direct comparison of depth computations and inter-sphere distances for the standard molecule, one has to subtract the extended radius of an atom from the depth value of that atom. To be explicit, the atomic depth was reduced by 2.175 plus 1.5 Å from Adepth and by 1.53 plus 1.5 Å from the DEPTH server (the value of 1.53 Å was obtained from the carbon-atom type Amber force field, the same force field used to equilibrate the solvent-bath in the DEPTH server). The results of Adepth and DEPTH are plotted in Figure 2B where the data of DEPTH were calculated using default parameters in the website. Adepth results for the probe radius of 1.4 and 1.6 Å are not shown, as their values are close to that of 1.5 Å. Each data point in the plot represents an averaged depth over all atoms in a spherical layer. Despite deviations from true depths become evident at large grid spacing, the order of depth ranking from the Adepth results remains unchanged (Figure 2B). The computational accuracy of atomic depth is defined as a ratio of measured depth to the true depth. The measured depth was obtained as data values of averaged depth shown in Figure 2B. The dependence of computational accuracy on grid spacing is presented in Table 2. Comparing the depth accuracy, we found that globally Adepth is more accurate than the DEPTH server at small to medium grid spacing (∼1 Å). Nevertheless, for the deepest layer, Adepth can obtain better results that DEPTH at fine grid spacing, i.e. 0.1 Å, which demands high CPU cost and huge memory load.

Table 2.

Comparison of computational accuracy between DEPTH and Adepth

Layer	True depth	DEPTH (%)	Δr = 0.5 (%)	Δr = 1.0 (%)	Δr = 2.0 (%)
1	3	17.3	2.6	1.1	8.5
2	6	7.4	1.5	1.0	6.8
3	9	5.1	0.2	2.4	9.1
4	12	0.3	2.2	6.7	13.9

aThe innermost sphere of the standard molecule is labelled 4 (red in Figure 1), and the layer enumeration is decreased by 1 for next outwards sphere; computations with different grid spacing were performed only by Adepth; true depth and Δr are in the unit of Å.

Comparison of computational accuracy between DEPTH and Adepth aThe innermost sphere of the standard molecule is labelled 4 (red in Figure 1), and the layer enumeration is decreased by 1 for next outwards sphere; computations with different grid spacing were performed only by Adepth; true depth and Δr are in the unit of Å.

DISCUSSION

The relationship between protein interior and atomic depth has been extensively discussed (2,7,25). New implications of atomic depths calculated by the Adepth server will be illustrated later in the text. Once atomic depths for a macromolecule are calculated, a built-in option in the Adepth server allows the user to obtain a molecular slice based on values of atomic depths. The corresponding outputs are called ‘extruded surfaces’ that are useful in molecular docking and assembly. It is usually done through an exhaustive computation of inter-atomic distance for detecting atomic overlaps in a complex. In practice, this is not necessary because the desired information is to find bumping atoms at the interface (14). Guided by an extruded surface of a molecular complex, one may reduce the number of atoms in distance calculations by ∼4–10 times while maintaining an accurate contour for the complex molecule (Figure 3A).

Figure 3.

Implications of new representation for atomic depths. (A) Cartoon representation of the extruded crystal structure of a Fab domain of antibody protein using a probe radius of 1.5 Å, Δr = 1 Å, and depth selection threshold <3.5 Å. Of 3291 atoms, 805 are found within the given threshold. (B) Iso-surface representation for an expanded surface of an antibody molecule. The crystal structure of the antibody is represented by 1IGT (26) with vdw spheres and coloured atomic types: cyan for carbon, red for oxygen, blue for nitrogen and yellow for sulphur atoms. The atomic depths of 1IGT and the associated SDF values at grid points were computed using a probe radius of 1.5 Å at Δr = 1 Å. The iso-surface in orange wireframe was located 50 Å away from the antibody surface. (C) Clipped core structure of the serine protease domain of FVIIa (27). The orange wireframe represents the iso-surface of the SDF value at a threshold of 7.5 Å using VMD. The main-chain conformation is displayed in a ribbon of red-to-white-to-blue spectrum, where red and blue are for buried and solvent-exposed regions, respectively. The deepest atom in FVIIa is 13.5 Å inwards from the protein surface. The mutated glycines with almost no solvent-accessible surface areas are represented by coloured CPK spheres. The residue depth was calculated as 10.0 Å for Gly343 and 4.8 Å for Gly391; thus, only Gly343 is enclosed in the orange wireframe. Another novel feature of Adepth allows the user to visualize 3D iso-surface mapping of atomic depths. As an SDF value carries either sign, it is now possible to accurately display an expanded volume of a given macromolecule. This is convenient for examining nearby surfaces of a macromolecule while still tracking the distance from the surface atoms. Analogous results could be obtained by traditional computations of ASA; yet, it is not plausible for most surface programs because of the need of a huge probe for generating such a remote surface at 50 Å away as shown in Figure 3B. This feature is useful in AFM for visualizing putative tip-convolution effects. As known, the tip effects lead to dilations of object boundary in AFM images (15,28). With a large radius of tip apex, the tip convolution effect greatly smoothed the macromolecular shape. When an expanded surface of a macromolecule can be displayed, it is easier to grasp an idea of what to expect from AFM imaging on a single molecule (Figure 3B). As tip convolution effect mostly occurred as a 2D artefact in an AFM image, one may scrutinize the strict 3D representation provided by Adepth in certain direction for viewing expected convoluted surfaces for the macromolecule. The final feature of Adepth is the use of 3D iso-potential surface for constructing a graphic representation of protein cores. Atomic depth is particularly well adapted to the identification of protein core residues as illustrated in the ProCoCoa method (25). As previously suggested, the residue depth is an important parameter for interpreting mutational data in proteins (7) and correlates well with residue hydrophobicity (2,3). Iso-surfaces provide an intuitive interpretation of atomic depth. For instance, graphical representation of atomic depth was helpful in illustrating the puzzling clinical data regarding the impact of a single-nucleotide polymorphism on activities of activated coagulation factor VII (FVIIa). Two apparently similar missense mutations were found in unrelated patients having a gene defect in FVII: Gly343Ser and Gly391Ser; however, the phenotype of Gly343Ser was severe, whereas that of Gly391Ser was minor (29). Using the iso-surface representation of the ternary complex model (27), it can be observed that although both Gly343 and Gly391 are completely buried (Figure 3C), Gly343 is located deeper in the hydrophobic core than Gly391. Thus, the apparent paradox could be interpreted in term of depth rather than solvent-burial. Although this observation remains phenomenological, it conveys the already known idea that a deep mutation as a greater impact on protein structure than a solvent-exposed mutation (7).

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Figure 1.

FUNDING

Commissariat à l’Energie Atomique et aux Energies Alternatives; French ANR agency [ANR-07-PCVI-0002-01 to J.-L.P.]. Funding for open access charge: Institutional funds. Conflict of interest statement. None declared.

26 in total

1. Atom depth as a descriptor of the protein interior.

Authors: Alessandro Pintar; Oliviero Carugo; Sándor Pongor
Journal: Biophys J Date: 2003-04 Impact factor: 4.033

2. Quantifying the effect of burial of amino acid residues on protein stability.

Authors: Hongyi Zhou; Yaoqi Zhou
Journal: Proteins Date: 2004-02-01

3. DPX: for the analysis of the protein core.

Authors: Alessandro Pintar; Oliviero Carugo; Sándor Pongor
Journal: Bioinformatics Date: 2003-01-22 Impact factor: 6.937

4. PocketDepth: a new depth based algorithm for identification of ligand binding sites in proteins.

Authors: Yeturu Kalidas; Nagasuma Chandra
Journal: J Struct Biol Date: 2007-09-15 Impact factor: 2.867

5. Macromolecular shape and surface maps by solvent exclusion.

Authors: J Greer; B L Bush
Journal: Proc Natl Acad Sci U S A Date: 1978-01 Impact factor: 11.205

6. Hydration structure of a collagen peptide.

Authors: J Bella; B Brodsky; H M Berman
Journal: Structure Date: 1995-09-15 Impact factor: 5.006

7. Residual factor VII activity and different hemorrhagic phenotypes in CRM(+) factor VII deficiencies (Gly331Ser and Gly283Ser).

Authors: Mirko Pinotti; Daniela Etro; Debora Bindini; Maria Luisa Papa; Giuseppina Rodorigo; Angiola Rocino; Guglielmo Mariani; Nicola Ciavarella; Francesco Bernardi; Marko Pinotti; Dániela Etro; Guglieuto Mariani
Journal: Blood Date: 2002-02-15 Impact factor: 22.113

8. Structure of a B-DNA dodecamer: conformation and dynamics.

Authors: H R Drew; R M Wing; T Takano; C Broka; S Tanaka; K Itakura; R E Dickerson
Journal: Proc Natl Acad Sci U S A Date: 1981-04 Impact factor: 11.205

9. DEPTH: a web server to compute depth and predict small-molecule binding cavities in proteins.

Authors: Kuan Pern Tan; Raghavan Varadarajan; M S Madhusudhan
Journal: Nucleic Acids Res Date: 2011-05-16 Impact factor: 16.971

10. Prodepth: predict residue depth by support vector regression approach from protein sequences only.

Authors: Jiangning Song; Hao Tan; Khalid Mahmood; Ruby H P Law; Ashley M Buckle; Geoffrey I Webb; Tatsuya Akutsu; James C Whisstock
Journal: PLoS One Date: 2009-09-17 Impact factor: 3.240

2 in total

1. Safety, Stability and Pharmacokinetic Properties of (super)Factor Va, a Novel Engineered Coagulation Factor V for Treatment of Severe Bleeding.

Authors: Andrew J Gale; Vikas Bhat; Jean-Luc Pellequer; John H Griffin; Laurent O Mosnier; Annette Von Drygalski
Journal: Pharm Res Date: 2016-03-09 Impact factor: 4.200

Review 2. Measuring the shapes of macromolecules - and why it matters.

Authors: Jie Li; Paul Mach; Patrice Koehl
Journal: Comput Struct Biotechnol J Date: 2013-12-09 Impact factor: 7.271

2 in total