Literature DB >> 29860391

CASTp 3.0: computed atlas of surface topography of proteins.

Wei Tian1, Chang Chen1, Xue Lei1, Jieling Zhao2, Jie Liang1.   

Abstract

Geometric and topological properties of protein structures, including surface pockets, interior cavities and cross channels, are of fundamental importance for proteins to carry out their functions. Computed Atlas of Surface Topography of proteins (CASTp) is a web server that provides online services for locating, delineating and measuring these geometric and topological properties of protein structures. It has been widely used since its inception in 2003. In this article, we present the latest version of the web server, CASTp 3.0. CASTp 3.0 continues to provide reliable and comprehensive identifications and quantifications of protein topography. In addition, it now provides: (i) imprints of the negative volumes of pockets, cavities and channels, (ii) topographic features of biological assemblies in the Protein Data Bank, (iii) improved visualization of protein structures and pockets, and (iv) more intuitive structural and annotated information, including information of secondary structure, functional sites, variant sites and other annotations of protein residues. The CASTp 3.0 web server is freely accessible at http://sts.bioe.uic.edu/castp/.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29860391      PMCID: PMC6031066          DOI: 10.1093/nar/gky473

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Protein structures are complex and are sculpted with numerous surface pockets, internal cavities and cross channels. These topographic features provide structural basis and micro-environments for proteins to carry out their functions such as ligand binding, DNA interaction and enzymatic activity. Identification and quantification of these topographic features of proteins are therefore of fundamental importance for understanding the structure–function relationship of proteins (1), in engineering proteins for desired properties (2) and in developing therapeutics against protein targets (3).

MATERIALS AND METHODS

The Computed Atlas of Surface Topography of proteins (CASTp) server uses the alpha shape method (4) developed in computational geometry to identify topographic features, to measure area and volume and to compute imprint (5–8). The alpha shape method has also been applied in other studies of cavities and channels in protein structures (9,10). The secondary structures are calculated using DSSP (11). Residue annotations of proteins are obtained from UniProt database (12) and mapped to PDB structures with residue-level information from the SIFTS database (13). The biological assemblies are extracted from the .mmicf files of the PDB database. Only the assemblies with biological significance and designated by the authors of the PDB structures (http://mmcif.wwpdb.org) are processed and listed on the CASTp server.

THE CASTp SERVER

The CASTp server aims to provide comprehensive and detailed quantitative characterization of topographic features of proteins (14,15). Since its release 15 years ago, the CASTp server has ∼45 000 visits and fulfills ∼33 000 calculation requests annually. It has been proven to be a useful tool for a wide range of studies, including investigations of signaling receptors (1), discoveries of cancer therapeutics (16), understanding of mechanism of drug actions (17), studies of immune disorder diseases (18), analysis of protein–nanoparticle interactions (19), inference of protein functions (20) and development of high-throughput computational tools (21,22). To provide additional useful information and to deliver improved user experience, we introduce here an updated server called CASTp 3.0. All important features of the previous versions of the server are retained, including detecting and characterizing cavities, pockets and channels of protein structures (Figure 1). In addition, we have substantially extended its functions by providing pre-computed topographic features of biological assemblies in the PDB database, as well as imprints of negative volumes of these topographic features. Furthermore, the user interface has been redesigned, making it more intuitive and informative.
Figure 1.

The surface of the ATP binding pocket of CDK2 (PDB ID: 2xmy), computed by CASTp 3.0.

The surface of the ATP binding pocket of CDK2 (PDB ID: 2xmy), computed by CASTp 3.0.

New features of CASTp 3.0

Pre-computed results for biological assemblies

The atomic coordinates of a PDB ID in the Protein Data Bank describe an asymmetric unit, which is the minimum structure that can produce the unit cell of the crystal through duplication and crystal symmetry operations. For many PDB IDs, the asymmetric unit differs from the biological assembly, which is the unit that has either been shown or is thought to be the functional form of the protein. This difference can be significant, and can make the computation results of a PDB entry biologically irrelevant. For example, PDB ID: 2iwv of the outer membrane porin OmpG contains four units as deposited. Its direct computation leads to the detection of a giant artificial pocket (Figure 2A), which obscures the actual functional channel (Figure 2B). To uncover topographic features that are biologically most relevant, the new CASTp server pre-computes topographic features for the biological assemblies of PDB IDs. Users now are able to navigate effortlessly between results of the asymmetric unit and results of biological assemblies of a PDB ID.
Figure 2.

The case when the asymmetric unit differs from the biological assembly. (A) Asymmetric unit in the PDB ID: 2iwv and the artificial giant pocket. The giant pocket is formed by the four channels of the four biological units and the space among them with unclear biological relevance. (B) Biological structure of 2iwv (biological assembly 1) and its channel detected by the CASTp server.

The case when the asymmetric unit differs from the biological assembly. (A) Asymmetric unit in the PDB ID: 2iwv and the artificial giant pocket. The giant pocket is formed by the four channels of the four biological units and the space among them with unclear biological relevance. (B) Biological structure of 2iwv (biological assembly 1) and its channel detected by the CASTp server.

Imprints of negative volumes of topographic features

In the previous versions of CASTp servers, geometric and topological features, such as pockets, cavities and channels, were shown only through the representation of surface atoms participating in their formation (e.g. the atom surface in Figure 1 and the stick model shown in Figure 3). The new CASTp server has added imprints of the negative volumes (e.g. the red bulbs in Figures 2 and 3) as the default visualization option. The negative volume is the space encompassed by the atoms that form these geometric and topological features, which can give users direct and intuitive understanding of these important structural features of proteins (5,6).
Figure 3.

The main user interface of the CASTp server. (A) The pocket panel. (B) The sequence and annotation panels.

The main user interface of the CASTp server. (A) The pocket panel. (B) The sequence and annotation panels.

Improved user interface

The visualization techniques employed in previous versions of CASTp servers over 12-year ago are out-of-date and are incompatible with the modern browsers. The new CASTp server now uses 3Dmol.js (23) for structural visualization, which allows users to view, to interact with protein structures, and to examine computation results in modern web browsers such as Chrome and Firefox. Users can choose the representation style of the atoms that form each topographic feature. The imprint of these features can also be shown with user-selected colors (Figure 3A). An intuitive sequence panel is also presented to users with secondary structures color-coded, where residues in user-selected pockets are highlighted. Residues annotated from UniProt (12) are also labeled. Both information on topographic feature-forming atoms and UniProt annotations are conveniently displayed when the cursor hovers over the relevant residues. The annotations are also summarized in the annotation panel (Figure 3B). In addition, both the sequence panel and the annotation panel are linked to the structure viewer, so users can conveniently click on one residue on the sequence map to have the structure viewer zoomed into that specific residue of interest. Furthermore, a floating structure viewer has been added (Figure 3B). When a user inspects information in the sequence panel or the annotation panel, the structure viewer will automatically follow the web page scrolling of the user, which saves the user from unnecessary and unproductive efforts in scrolling up and down.

INPUT AND OUTPUT

Input

The CASTp server takes protein structures in the PDB format and a probe radius as input for topographic computation. Through the intuitive interface, users can either search for pre-computed results using a four-letter PDB ID, or submit their own protein structures to request customized computation. For pre-computed results, a default probe radius of 1.4 Å is used, which is the standard value for computing solvent accessible surface area. For customized computation request, users can specify any probe radius desired.

Output

The CASTp server identifies all surface pockets, interior cavities and cross channels in a protein structure and provides detailed delineation of all atoms participating in their formation. It also measures their exact volumes and areas, as well as sizes of the mouth openings if exist. These metrics are calculated analytically, using both the solvent accessible surface model (Richards’ surface) (24) and the molecular surface model (Connolly’s surface) (25). In addition, the CASTp server also provides imprints of topographic features. These results can be directly downloaded from CASTp server, which can be visualized using either the UCSF Chimera (26) or our PyMOL plugin, CASTpyMOL.

Case study

Figure 3 illustrates the adenosine triphosphate (ATP) binding pocket on the cyclin-dependent kinase 2 (CDK2, PDB ID: 2xmy) that is automatically identified by the CASTp server. CDK2 is an important enzyme that regulates cell cycle with other CDKs. Many functionally important residues are located in this pocket, including three residues Lys33, Asp86 and Asp145 in the ATP binding site, one residue Asp127 in the catalytic site, two residues Asn132 and Asp145 in Mg binding sites and one residue Lys89 in the binding site that interacts with CDK7, another important cell-cycle-controlling enzyme. Due to the importance of this pocket, it has been widely studied as a target site for antitumor agents (16). In the CASTp server, users can choose different representations of pockets. For example, the pocket can be displayed as surface (Figure 1). In Figure 3, the imprint of the pocket is shown in red, and the atoms forming the pocket are shown as sticks. Volume and area measures, and the list of all atoms participating in the pocket formation are also given (Figure 3A). Users can view the pocket from the sequence panel. Important annotations can also be found easily in either the sequence panel or the annotation panel (Figure 3B). Additional examples illustrating how CASTp computations have been used can be found in the literature (20–21,27–32).

DISCUSSION

This paper describes significant improvement of the CASTp server, including: (i) the addition of imprints of negative volumes as a visualization option; (ii) the inclusion of pre-computed results for biological assemblies, rather than reporting results for the asymmetric unit alone; (iii) an improved user interface, which contains an updated and adaptive structure viewer, a panel for sequence information and a more intuitive panel for UniProt annotations. We believe these updates with additional information and improved user experience will continue to facilitate studies of protein structures and functions.
  27 in total

1.  CASTp: Computed Atlas of Surface Topography of proteins.

Authors:  T Andrew Binkowski; Shapor Naghibzadeh; Jie Liang
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Anatomy of protein pockets and cavities: measurement of binding site geometry and implications for ligand design.

Authors:  J Liang; H Edelsbrunner; C Woodward
Journal:  Protein Sci       Date:  1998-09       Impact factor: 6.725

3.  Structural basis of influenza virus fusion inhibition by the antiviral drug Arbidol.

Authors:  Rameshwar U Kadam; Ian A Wilson
Journal:  Proc Natl Acad Sci U S A       Date:  2016-12-21       Impact factor: 11.205

4.  Solvent-accessible surfaces of proteins and nucleic acids.

Authors:  M L Connolly
Journal:  Science       Date:  1983-08-19       Impact factor: 47.728

5.  Accuracy of functional surfaces on comparatively modeled protein structures.

Authors:  Jieling Zhao; Joe Dundas; Sema Kachalo; Zheng Ouyang; Jie Liang
Journal:  J Struct Funct Genomics       Date:  2011-05-04

6.  Structure-function analysis identifies highly sensitive strigolactone receptors in Striga.

Authors:  Shigeo Toh; Duncan Holbrook-Smith; Peter J Stogios; Olena Onopriyenko; Shelley Lumba; Yuichiro Tsuchiya; Alexei Savchenko; Peter McCourt
Journal:  Science       Date:  2015-10-09       Impact factor: 47.728

7.  Discovery and characterization of 2-anilino-4- (thiazol-5-yl)pyrimidine transcriptional CDK inhibitors as anticancer agents.

Authors:  Shudong Wang; Gary Griffiths; Carol A Midgley; Anna L Barnett; Michael Cooper; Joanna Grabarek; Laura Ingram; Wayne Jackson; George Kontopidis; Steven J McClue; Campbell McInnes; Janice McLachlan; Christopher Meades; Mokdad Mezna; Iain Stuart; Mark P Thomas; Daniella I Zheleva; David P Lane; Robert C Jackson; David M Glover; David G Blake; Peter M Fischer
Journal:  Chem Biol       Date:  2010-10-29

8.  Structure and function of lysosomal phospholipase A2 and lecithin:cholesterol acyltransferase.

Authors:  Alisa Glukhova; Vania Hinkovska-Galcheva; Robert Kelly; Akira Abe; James A Shayman; John J G Tesmer
Journal:  Nat Commun       Date:  2015-03-02       Impact factor: 14.919

9.  UniProt: the universal protein knowledgebase.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

10.  SIFTS: Structure Integration with Function, Taxonomy and Sequences resource.

Authors:  Sameer Velankar; José M Dana; Julius Jacobsen; Glen van Ginkel; Paul J Gane; Jie Luo; Thomas J Oldfield; Claire O'Donovan; Maria-Jesus Martin; Gerard J Kleywegt
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

View more
  276 in total

1.  Coarse-grain simulations on NMR conformational ensembles highlight functional residues in proteins.

Authors:  Sophie Sacquin-Mora
Journal:  J R Soc Interface       Date:  2019-07-10       Impact factor: 4.118

2.  Structure and redox properties of the diheme electron carrier cytochrome c4 from Pseudomonas aeruginosa.

Authors:  Jessica M Carpenter; Fangfang Zhong; Michael J Ragusa; Ricardo O Louro; Deborah A Hogan; Ekaterina V Pletneva
Journal:  J Inorg Biochem       Date:  2019-10-22       Impact factor: 4.155

Review 3.  Structures of SF3b1 reveal a dynamic Achilles heel of spliceosome assembly: Implications for cancer-associated abnormalities and drug discovery.

Authors:  Debanjana Maji; Alan Grossfield; Clara L Kielkopf
Journal:  Biochim Biophys Acta Gene Regul Mech       Date:  2019-11-09       Impact factor: 4.490

4.  Flavonoids enhance rod opsin stability, folding, and self-association by directly binding to ligand-free opsin and modulating its conformation.

Authors:  Joseph T Ortega; Tanu Parmar; Beata Jastrzebska
Journal:  J Biol Chem       Date:  2019-04-03       Impact factor: 5.157

5.  A fatty acid-binding protein of Streptococcus pneumoniae facilitates the acquisition of host polyunsaturated fatty acids.

Authors:  Jessica M Gullett; Maxime G Cuypers; Matthew W Frank; Stephen W White; Charles O Rock
Journal:  J Biol Chem       Date:  2019-09-17       Impact factor: 5.157

6.  Solution NMR structure of CGL2373, a polyketide cyclase-like protein from Corynebacterium glutamicum.

Authors:  Cong Cai; Yao Nie; Yixuan Gong; Shuangli Li; Theresa A Ramelot; Michael A Kennedy; Xiali Yue; Jiang Zhu; Maili Liu; Yunhuang Yang
Journal:  Proteins       Date:  2019-07-17

7.  The structural bases for agonist diversity in an Arabidopsis thaliana glutamate receptor-like channel.

Authors:  Andrea Alfieri; Fabrizio G Doccula; Riccardo Pederzoli; Matteo Grenzi; Maria Cristina Bonza; Laura Luoni; Alessia Candeo; Neli Romano Armada; Alberto Barbiroli; Gianluca Valentini; Thomas R Schneider; Andrea Bassi; Martino Bolognesi; Marco Nardini; Alex Costa
Journal:  Proc Natl Acad Sci U S A       Date:  2019-12-23       Impact factor: 11.205

8.  Non-cryogenic structure of a chloride pump provides crucial clues to temperature-dependent channel transport efficiency.

Authors:  Ji-Hye Yun; Xuanxuan Li; Jae-Hyun Park; Yang Wang; Mio Ohki; Zeyu Jin; Wonbin Lee; Sam-Yong Park; Hao Hu; Chufeng Li; Nadia Zatsepin; Mark S Hunter; Raymond G Sierra; Jake Koralek; Chun Hong Yoon; Hyun-Soo Cho; Uwe Weierstall; Leihan Tang; Haiguang Liu; Weontae Lee
Journal:  J Biol Chem       Date:  2018-11-19       Impact factor: 5.157

9.  Hydroxylation of Steroids by a Microbial Substrate-Promiscuous P450 Cytochrome (CYP105D7): Key Arginine Residues for Rational Design.

Authors:  Bingbing Ma; Qianwen Wang; Haruo Ikeda; Chunfang Zhang; Lian-Hua Xu
Journal:  Appl Environ Microbiol       Date:  2019-11-14       Impact factor: 4.792

10.  Glutamate optimizes enzymatic activity under high hydrostatic pressure in Desulfovibrio species: effects on the ubiquitous thioredoxin system.

Authors:  H Gaussier; M Nouailler; E Champaud; E B Garcin; C Sebban-Kreuzer; O Bornet; M Garel; C Tamburini; L Pieulle; A Dolla; N Pradel
Journal:  Extremophiles       Date:  2021-07-01       Impact factor: 2.395

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.