Literature DB >> 27423893

Online interactive analysis of protein structure ensembles with Bio3D-web.

Lars Skjærven1, Shashank Jariwala2, Xin-Qiu Yao2, Barry J Grant2.   

Abstract

Bio3D-web is an online application for analyzing the sequence, structure and conformational heterogeneity of protein families. Major functionality is provided for identifying protein structure sets for analysis, their alignment and refined structure superposition, sequence and structure conservation analysis, mapping and clustering of conformations and the quantitative comparison of their predicted structural dynamics. AVAILABILITY: Bio3D-web is based on the Bio3D and Shiny R packages. All major browsers are supported and full source code is available under a GPL2 license from http://thegrantlab.org/bio3d-web CONTACT: bjgrant@umich.edu or lars.skjarven@uib.no.
© The Author 2016. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27423893      PMCID: PMC5181562          DOI: 10.1093/bioinformatics/btw482

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Many structures are now available for homologous proteins determined under different crystallization conditions and oligomerization states. Detailed comparison of these structures can inform on structural dynamic mechanisms critical for protein function including ligand binding, enzymatic catalysis, allosteric regulation and bi-molecular recognition. A wide range of bioinformatics tools and online servers enable researchers to explore and analyze individual biomolecular structures. Notable examples include molecular visualization, pairwise structural alignment and biophysics based tools including various normal mode analysis servers (Eyal ; Suhre and Sanejouand, 2004; Tiwari ). However, existing tools for detailed quantitative analysis of the sequence, structure and dynamics of large heterogeneous protein families often require significant computational expertise and typically remain accessible only to expert users with relevant programming skills. For example, the Bio3D package requires R (Grant ), ProDy requires python and Maven requires Matlab knowledge (Bakan ; Zimmermann ). A recent contribution to making such analysis more widely available online is the PDBFlex database (Hrabe ). PDBFlex catalogues the structural variation of the same protein (PDB structures sharing 95% or more sequence identity). However, more in-depth interactive analysis of user defined structure sets across species and diverse protein families is not currently available. To this end we have developed Bio3D-web—a web application that implements a complete workflow for user customized investigation of protein sequence-structure-dynamic relationships. Bio3D-web provides unparalleled online functionality including inter-conformer relationship mapping with principal component analysis (PCA), and quantitative comparison of predicted internal dynamics across protein families via new ensemble normal mode analysis (eNMA). Together with conventional sequence and structure analysis methods these approaches allow researchers to map the structural dynamic properties of proteins for which PDB structures are available. Bio3D-web requires no programming knowledge and thus decreases the entry barrier to performing advanced comparative sequence, structure and dynamics analysis. Bio3D-web is powered by the previously described Bio3D R package for structural bioinformatics (Grant ; Skjærven ). In particular, Bio3D approaches for identification of related protein structures, multiple alignment, rigid-core identification, optimal superposition, PCA and eNMA form the basis of the application (Supplementary Fig. S1). Structure and sequence annotations are derived from the RCSB PDB (Berman, 2000) and PFAM databases (Finn ). Bio3D-web employs Shiny’s reactive programming and web application framework to provide its interactive online interface (Shiny, http://shiny.rstudio.com).

2 Example application

Providing the PDB ID of a single Rho GTPase (1FTN) in the SEARCH tab identifies 1059 sequence similar structures from which the top 214 are automatically selected as the most related structures for further analysis. Users can optionally (de)select additional structures for further analysis. In this example we proceed with these structures that span RhoA/B/C, Cdc42 and Rac1/2/3 PDB entries. In the ALIGN tab the selected structures are subject to multiple alignment, similarity and conservation analysis. Characterization of superimposed structures is available from the FIT tab. This includes rigid core identification, conservation analysis and RMSD based clustering. The PCA tab more clearly displays the relationship between all structures in terms of the principal displacements of their major variable regions (Fig. 1A). In particular, the interactive low-dimensional ‘conformer plot’, displaying structures projected onto user defined PCs, can be colored by sequence, RMSD and PCA based clustering results (Fig. 1B). This plot can also be clicked on to identify individual structures along with their annotations such as ligand bound species. For available Rho related structures three major conformational clusters are clearly apparent (black, green and red in Fig. 1B). Inspection of these structures in the linked annotation table reveals that the vast majority of cluster members correspond to GDP-bound, GTP-analogue bound and nucleotide free structures respectively. The residue-wise contributions to each PC are also displayed interactively highlighting a number of potentially key structural regions (Fig. 1C).
Fig. 1.

Bio3D-web analysis of Rho GTPases. (A) Visualization of the first principal component (PC) characterizing the major conformational variations. (B) Structures projected onto their two first PCs. Each point (or structure) is colored according to user specified criteria, in this case PCA-based clustering results. (C) Contribution of each residue to the first PC. (D) Ensemble normal mode analysis reveals the enhanced local dynamics of nucleotide free states (red) relative to the GTP- and GDP-bound (green and black) structures. Positions with significantly distinct flexibilities between states are indicated with light blue rectangles (P < 0.05) (Color version of this figure is available at Bioinformatics online.)

eNMA in the next tab indicates that these same regions display significantly distinct dynamics upon nucleotide exchange (Fig. 1D). Critically, the NMA here is performed on all selected structures in a way that facilitates the interpretation of their structural similarity and dissimilarity trends (Yao ). This allows a user to explore dynamic trends in all crystalized states in relation to each other without the conventional caveat of potentially over-interpreting the differences between extreme cases or a single artifactual structure. Collectively these results indicate the existence of three major Rho superfamily conformational states that differ by a collective displacement of two nucleotide-binding site regions, which in turn display significantly distinct flexibilities upon nucleotide binding. In the case of Rho GTPases these regions are known to be key for nucleotide dependent modulation of protein-protein interactions (Wittinghofer and Vetter, 2011). Further example application of Bio3D-web to a range of distinct protein families is presented as interactive demos online (http://thegrantlab.org/bio3d-web). Bio3D-web analysis of Rho GTPases. (A) Visualization of the first principal component (PC) characterizing the major conformational variations. (B) Structures projected onto their two first PCs. Each point (or structure) is colored according to user specified criteria, in this case PCA-based clustering results. (C) Contribution of each residue to the first PC. (D) Ensemble normal mode analysis reveals the enhanced local dynamics of nucleotide free states (red) relative to the GTP- and GDP-bound (green and black) structures. Positions with significantly distinct flexibilities between states are indicated with light blue rectangles (P < 0.05) (Color version of this figure is available at Bioinformatics online.)

3 Conclusion

Bio3D-web is an online application for analyzing user defined heterogeneous biomolecular structure data. The design of Bio3D-web emphasizes simplicity over exhaustive inclusion of the many analysis methods available in the full Bio3D package. This effectively reduces the required technical expertise and thus facilitates advanced structural bioinformatics analysis for a broader range of researchers. In many cases it is envisaged that researchers will use Bio3D-web to understand general trends in their protein family of interest, which may then inform more specialized analyses. Bio3D-web is therefore designed to quickly explore biomolecular structure datasets and to act as a hypothesis-generating tool with sharable summary reports that capture all users defined analysis choices and optionally enable collaborators to visit previous analysis sessions.
  12 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  ElNemo: a normal mode web server for protein movement analysis and the generation of templates for molecular replacement.

Authors:  Karsten Suhre; Yves-Henri Sanejouand
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

3.  Rapid Characterization of Allosteric Networks with Ensemble Normal Mode Analysis.

Authors:  Xin-Qiu Yao; Lars Skjærven; Barry J Grant
Journal:  J Phys Chem B       Date:  2016-04-20       Impact factor: 2.991

Review 4.  Structure-function relationships of the G domain, a canonical switch motif.

Authors:  Alfred Wittinghofer; Ingrid R Vetter
Journal:  Annu Rev Biochem       Date:  2011       Impact factor: 23.643

5.  ProDy: protein dynamics inferred from theory and experiments.

Authors:  Ahmet Bakan; Lidio M Meireles; Ivet Bahar
Journal:  Bioinformatics       Date:  2011-04-05       Impact factor: 6.937

6.  MAVENs: motion analysis and visualization of elastic networks and structural ensembles.

Authors:  Michael T Zimmermann; Andrzej Kloczkowski; Robert L Jernigan
Journal:  BMC Bioinformatics       Date:  2011-06-28       Impact factor: 3.307

7.  WEBnm@ v2.0: Web server and services for comparing protein flexibility.

Authors:  Sandhya P Tiwari; Edvin Fuglebakk; Siv M Hollup; Lars Skjærven; Tristan Cragnolini; Svenn H Grindhaug; Kidane M Tekle; Nathalie Reuter
Journal:  BMC Bioinformatics       Date:  2014-12-30       Impact factor: 3.169

8.  Pfam: the protein families database.

Authors:  Robert D Finn; Alex Bateman; Jody Clements; Penelope Coggill; Ruth Y Eberhardt; Sean R Eddy; Andreas Heger; Kirstie Hetherington; Liisa Holm; Jaina Mistry; Erik L L Sonnhammer; John Tate; Marco Punta
Journal:  Nucleic Acids Res       Date:  2013-11-27       Impact factor: 16.971

9.  Integrating protein structural dynamics and evolutionary analysis with Bio3D.

Authors:  Lars Skjærven; Xin-Qiu Yao; Guido Scarabelli; Barry J Grant
Journal:  BMC Bioinformatics       Date:  2014-12-10       Impact factor: 3.169

10.  PDBFlex: exploring flexibility in protein structures.

Authors:  Thomas Hrabe; Zhanwen Li; Mayya Sedova; Piotr Rotkiewicz; Lukasz Jaroszewski; Adam Godzik
Journal:  Nucleic Acids Res       Date:  2015-11-28       Impact factor: 16.971

View more
  13 in total

1.  Visualizing correlated motion with HDBSCAN clustering.

Authors:  Ryan L Melvin; Jiajie Xiao; Ryan C Godwin; Kenneth S Berenhaut; Freddie R Salsbury
Journal:  Protein Sci       Date:  2017-09-06       Impact factor: 6.725

2.  Most Probable Druggable Pockets in Mutant p53-Arg175His Clusters Extracted from Gaussian Accelerated Molecular Dynamics Simulations.

Authors:  Morad Mustafa; Mohammed Gharaibeh
Journal:  Protein J       Date:  2022-01-31       Impact factor: 2.371

3.  CDOCKER and λ-dynamics for prospective prediction in D₃R Grand Challenge 2.

Authors:  Xinqiang Ding; Ryan L Hayes; Jonah Z Vilseck; Murchtricia K Charles; Charles L Brooks
Journal:  J Comput Aided Mol Des       Date:  2017-09-07       Impact factor: 3.686

4.  Network approach of the conformational change of c-Src, a tyrosine kinase, by molecular dynamics simulation.

Authors:  Hyun Jung Yoon; Sungmin Lee; Sun Joo Park; Sangwook Wu
Journal:  Sci Rep       Date:  2018-04-04       Impact factor: 4.379

5.  Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web.

Authors:  Shashank Jariwala; Lars Skjærven; Xin-Qiu Yao; Barry J Grant
Journal:  J Vis Exp       Date:  2017-07-16       Impact factor: 1.355

6.  Evaluating the effect of mutations and ligand binding on transthyretin homotetramer dynamics.

Authors:  Tadeo E Saldaño; Giuseppe Zanotti; Gustavo Parisi; Sebastian Fernandez-Alberti
Journal:  PLoS One       Date:  2017-07-13       Impact factor: 3.240

7.  Molecular Docking and Dynamic Simulation of AZD3293 and Solanezumab Effects Against BACE1 to Treat Alzheimer's Disease.

Authors:  Mubashir Hassan; Saba Shahzadi; Sung Y Seo; Hany Alashwal; Nazar Zaki; Ahmed A Moustafa
Journal:  Front Comput Neurosci       Date:  2018-06-01       Impact factor: 2.380

8.  Investigating the pathogenic SNPs in BLM helicase and their biological consequences by computational approach.

Authors:  Faisal A Alzahrani; Firoz Ahmed; Monika Sharma; Mohd Rehan; Maryam Mahfuz; Mohammed N Baeshen; Yousef Hawsawi; Ahmed Almatrafi; Suliman Abdallah Alsagaby; Mohammad Azhar Kamal; Mohiuddin Khan Warsi; Hani Choudhry; Mohammad Sarwar Jamal
Journal:  Sci Rep       Date:  2020-07-23       Impact factor: 4.379

9.  ProSNEx: a web-based application for exploration and analysis of protein structures using network formalism.

Authors:  Rasim Murat Aydınkal; Onur Serçinoğlu; Pemra Ozbek
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

10.  In-Silico analysis reveals lower transcription efficiency of C241T variant of SARS-CoV-2 with host replication factors MADP1 and hnRNP-1.

Authors:  Armi Chaudhari; Minal Chaudhari; Sapna Mahera; Zuber Saiyed; Neelam M Nathani; Shantanu Shukla; Dhaval Patel; Chirag Patel; Madhvi Joshi; Chaitanya G Joshi
Journal:  Inform Med Unlocked       Date:  2021-07-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.