MolArt: a molecular structure annotation and visualization tool.

David Hoksza, Piotr Gawron, Marek Ostaszewski, Reinhard Schneider

Abstract

Summary: MolArt fills the gap between sequence and structure visualization by providing a lightweight, interactive environment for exploring sequence annotations in the context of available experimental or predicted protein structures. Given a UniProt ID, MolArt downloads and displays sequence annotations, the sequence-structure mapping and relevant structures. The sequence and structure views are interlinked, so sequence annotations can be color-overlaid on the mapped structures, which aids understanding and interpretation of the available molecular data.

Availability and implementation: MolArt is released under the Apache 2 license and is available at https://github.com/davidhoksza/MolArt. The project web page https://davidhoksza.github.io/MolArt/ features examples and applications of the tool.

Year:  2018        PMID: 29931246      PMCID: PMC6247942          DOI: 10.1093/bioinformatics/bty489

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

The number of available protein structures on the one hand, and the amount of sequence-related information (sequence annotations) on the other, grow constantly. This creates an opportunity for integrated visual analytics approaches, in which sequence features are combined with structure visualization for a better understanding of complex molecular data. Such approaches are possible thanks to integration efforts that offer easy access to various sources of sequence, structure and annotation information through application programming interfaces, or APIs (Nightingale et al.). Tools using those APIs can access a wide range of resources and enable advanced interpretation through a rich, browser-based display of the combined data. Moreover, recent advances in web technologies have made it easy to integrate such tools into users’ own web sites in the form of plugins, or to develop derived tools. One example is ProtVista (Watkins et al.), UniProt’s component for graphical representation of protein sequence features. Other tools, like LiteMol (Sehnal et al.) or NGL viewer (Rose and Hildebrand, 2015), are employed for structure visualization by PDBe and RCSB PDB. The next intuitive step in the integration efforts is to combine sequence and structure visualization in a common environment. Such an effort has not been made yet, with the exception of the recently introduced web server 3DBIONOTES (Segura et al.), which, however, is a stand-alone solution and not a reusable component.
Here, we introduce MolArt, a new JavaScript tool and library for visualization of sequence-related annotations over available experimental or predicted structures. MolArt is built on the ProtVista plugin for sequence and annotation visualization, and uses the LiteMol plugin for structure display. Both tools provide JavaScript-based interfaces for manipulating data and handling various types of events.
MolArt delivers an integrated environment with the sequence and structure visualization capabilities of both tools and uses public APIs that provide the sequence-structure mapping. It is implemented as a library which can be easily used in a web page or become a part of a third-party tool.

2 Data retrieval and visualization

MolArt is purely a client-side application, so all data to be visualized, namely sequence annotations, the sequence-structure mapping and the corresponding structures, are downloaded on the fly. To do so, MolArt (i) utilizes ProtVista to obtain sequence annotations, either default or user provided, (ii) retrieves the sequence via the UniProt website REST API, (iii) obtains the sequence-structure mapping from SIFTS (Velankar et al.) via the PDBe REST API or from the Swiss Model Repository (SMR) API (Bienert et al.) and (iv) downloads and displays the structures via LiteMol.
As for the sequence-structure mapping, MolArt first checks PDBe for available experimental structures for the given UniProt ID. If no structures are available, SMR is queried for available predicted models. If no model is available either, MolArt falls back to a sequence-only view, and its functionality is then identical to that of ProtVista, or rather of its modified version (see the MolArt repository for the changes). The obtained sequence-structure mapping comprises not only the list of structures, but also the mapping of amino acid positions, because the structures do not necessarily cover the whole length of the sequence. With the mapping of positions, a structure can be matched to the sequence and visualized as an annotation track in ProtVista; all structure annotation tracks are then assembled into the first annotation group and are visible to the user.
The obtained data are visualized using MolArt’s responsive display, which features two resizable panels linking sequence data (ProtVista, Fig. 1, left) and structure data (LiteMol, Fig. 1, right). Selecting a structure in ProtVista’s structure annotation category instructs LiteMol to download and display the corresponding structure. The mapped part of the structure is highlighted by a surface visualization with an adjustable transparency level.
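The lookup order described above (experimental structures first, then predicted models, then a sequence-only view) and the position mapping can be sketched as follows. This is a minimal illustration, not MolArt's actual code: the names `MappedSegment`, `resolve_structures` and `seq_to_struct` are hypothetical, the two fetcher callables stand in for the PDBe and SMR queries, and the mapping assumes that residue numbering is contiguous within a segment.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass(frozen=True)
class MappedSegment:
    """One contiguous stretch of the sequence covered by a structure chain.

    Hypothetical record type; positions are 1-based and inclusive.
    """
    structure_id: str
    chain: str
    seq_start: int     # first covered sequence position
    seq_end: int       # last covered sequence position
    struct_start: int  # structure residue number matching seq_start


# A fetcher takes a UniProt ID and returns the segments it found (possibly none).
Fetcher = Callable[[str], List[MappedSegment]]


def resolve_structures(
    uniprot_id: str,
    fetch_experimental: Fetcher,
    fetch_predicted: Fetcher,
) -> Tuple[str, List[MappedSegment]]:
    """Try experimental structures, then predicted models, then give up."""
    experimental = fetch_experimental(uniprot_id)
    if experimental:
        return "experimental", experimental
    predicted = fetch_predicted(uniprot_id)
    if predicted:
        return "predicted", predicted
    # Nothing found: the caller shows the sequence view only.
    return "sequence-only", []


def seq_to_struct(segment: MappedSegment, seq_pos: int) -> Optional[int]:
    """Map a sequence position to a structure residue number.

    Returns None when the structure does not cover that position.
    Assumes a constant offset within the segment (no numbering gaps).
    """
    if segment.seq_start <= seq_pos <= segment.seq_end:
        return segment.struct_start + (seq_pos - segment.seq_start)
    return None
```

Passing the two queries in as callables keeps the fallback logic independent of any particular endpoint, so it can be exercised with stub data.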
The sequence and structure panels are interlinked, so hovering over the sequence highlights the respective amino acid in the structure and vice versa. Similarly, clicking on a sequence annotation colors the corresponding part of the active mapped structure. Moreover, all annotations in a track or category can be overlaid together, enabling, for example, visualization of all post-translational modifications or binding regions. Variation data can be overlaid the same way as standard annotations, but one can also select only mutations of a given type (e.g. mutations to given amino acids or loss-of-function mutations) or a histogram of variations. This makes it possible to see frequently mutated positions directly on the structure.
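The two overlay operations described here, projecting a sequence annotation onto the mapped part of a structure and counting variants per position, can be sketched as below. This is an illustrative sketch only: `overlay_range` and `variation_histogram` are hypothetical helper names, variants are represented as simple `(position, substituted amino acid)` pairs, and the mapping is assumed to be a constant offset between sequence and structure numbering.

```python
from collections import Counter
from typing import Dict, Iterable, Optional, Tuple

Range = Tuple[int, int]  # (first, last), 1-based and inclusive


def overlay_range(feature: Range, covered: Range, offset: int) -> Optional[Range]:
    """Clip a sequence feature to the covered region and shift it into
    structure numbering.

    `covered` is the part of the sequence present in the structure and
    `offset` is (structure residue number - sequence position).
    Returns None when the feature lies entirely outside the structure.
    """
    start = max(feature[0], covered[0])
    end = min(feature[1], covered[1])
    if start > end:
        return None
    return (start + offset, end + offset)


def variation_histogram(
    variants: Iterable[Tuple[int, str]],
    mutated_to: Optional[str] = None,
) -> Dict[int, int]:
    """Count variants per sequence position.

    When `mutated_to` is given, only variants substituting to that
    amino acid (one-letter code) are counted.
    """
    counts = Counter(
        pos for pos, alt in variants
        if mutated_to is None or alt == mutated_to
    )
    return dict(counts)
```

The per-position counts could then drive a color scale on the structure, with higher counts marking frequently mutated residues.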
Fig. 1. MolArt displays in the left panel the molecule’s sequence, relevant annotations (including variation data) and the list of available structures (either experimental or predicted) for the given molecule. The right panel shows the selected 3D structure, over which any of the sequence annotations can be color-overlaid. The example displays alpha-synuclein (UniProt ID P37840), a protein which accumulates in the brain cells of Parkinson's disease patients. One of the corresponding structures in PDB (ID 2n0a) shows the fibril structure of the protein. Overlaying the individual disease-related mutations shows that they occur at positions which ensure the stability of the structure, so their disruption leads with high probability to adverse effects.

Although the web environment is the appropriate choice for integrating a wide range of data sources and exploring them interactively, more advanced structure analysis might require a specialized environment. In that case, MolArt can export all the annotations and their mapping into a single Python file to be later imported by PyMOL.
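To illustrate what such an export could look like, the sketch below turns a list of mapped annotations into the text of a PyMOL Python script. The layout of the generated file is an assumption made for illustration and is not MolArt's actual export format; `build_pymol_script` and the annotation tuple are hypothetical. The generated script uses only the standard PyMOL calls `cmd.fetch`, `cmd.select` and `cmd.color`.

```python
from typing import Iterable, Tuple

# (selection name, chain, first residue, last residue, PyMOL color name).
# Selection names must not contain spaces.
Annotation = Tuple[str, str, int, int, str]


def build_pymol_script(pdb_id: str, annotations: Iterable[Annotation]) -> str:
    """Return the text of a Python script that, when run inside PyMOL,
    loads the structure and creates one colored named selection per
    annotation."""
    lines = [
        "from pymol import cmd",
        "",
        f"cmd.fetch({pdb_id!r})",
    ]
    for name, chain, first, last, color in annotations:
        # PyMOL selection algebra: residues first..last of the given chain.
        selection = f"chain {chain} and resi {first}-{last}"
        lines.append(f"cmd.select({name!r}, {selection!r})")
        lines.append(f"cmd.color({color!r}, {name!r})")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Illustrative values only, not taken from a real structure.
    print(build_pymol_script("1abc", [("binding_site", "A", 10, 20, "red")]))
```

Writing the returned string to a `.py` file and running it from within PyMOL would recreate the annotations as named selections that can be shown, hidden or recolored independently.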

3 Summary

MolArt fills the gap between sequence and structure visualization by providing an integrated and interactive web experience in which sequence annotations can be readily overlaid on the available protein structures. The tool provides a way to explore both sequence and structure features, enabling life scientists to benefit from the wealth of molecular data existing in various databases and, hopefully, leading to more streamlined generation of biological hypotheses. MolArt’s code, including all its dependencies, is bundled into a single JavaScript file, making it easy to embed into any web site. The project web page shows examples of its usage, including an application for querying UniProt and the integration of MolArt in the MINERVA framework (Gawron et al.), notably providing gene-structure mapping for the Parkinson’s disease map (Fujita et al.), see pdmap.uni.lu. MolArt is provided as open source with the source code and documentation available at https://github.com/davidhoksza/MolArt.

Conflict of Interest: none declared.

1.  LiteMol suite: interactive web-based visualization of large-scale macromolecular structure data.

Authors:  David Sehnal; Mandar Deshpande; Radka Svobodová Vařeková; Saqib Mir; Karel Berka; Adam Midlik; Lukáš Pravda; Sameer Velankar; Jaroslav Koča
Journal:  Nat Methods       Date:  2017-11-30       Impact factor: 28.547

2.  Integrating pathways of Parkinson's disease in a molecular interaction map.

Authors:  Kazuhiro A Fujita; Marek Ostaszewski; Yukiko Matsuoka; Samik Ghosh; Enrico Glaab; Christophe Trefois; Isaac Crespo; Thanneer M Perumal; Wiktor Jurkowski; Paul M A Antony; Nico Diederich; Manuel Buttini; Akihiko Kodama; Venkata P Satagopam; Serge Eifes; Antonio Del Sol; Reinhard Schneider; Hiroaki Kitano; Rudi Balling
Journal:  Mol Neurobiol       Date:  2013-07-07       Impact factor: 5.590

3.  NGL Viewer: a web application for molecular visualization.

Authors:  Alexander S Rose; Peter W Hildebrand
Journal:  Nucleic Acids Res       Date:  2015-04-29       Impact factor: 16.971

4.  The SWISS-MODEL Repository-new features and functionality.

Authors:  Stefan Bienert; Andrew Waterhouse; Tjaart A P de Beer; Gerardo Tauriello; Gabriel Studer; Lorenza Bordoli; Torsten Schwede
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

5.  The Proteins API: accessing key integrated protein and genome information.

Authors:  Andrew Nightingale; Ricardo Antunes; Emanuele Alpi; Borisas Bursteinas; Leonardo Gonzales; Wudong Liu; Jie Luo; Guoying Qi; Edd Turner; Maria Martin
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

6.  ProtVista: visualization of protein sequence annotations.

Authors:  Xavier Watkins; Leyla J Garcia; Sangya Pundir; Maria J Martin
Journal:  Bioinformatics       Date:  2017-07-01       Impact factor: 6.937

7.  SIFTS: Structure Integration with Function, Taxonomy and Sequences resource.

Authors:  Sameer Velankar; José M Dana; Julius Jacobsen; Glen van Ginkel; Paul J Gane; Jie Luo; Thomas J Oldfield; Claire O'Donovan; Maria-Jesus Martin; Gerard J Kleywegt
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

8.  MINERVA-a platform for visualization and curation of molecular interaction networks.

Authors:  Piotr Gawron; Marek Ostaszewski; Venkata Satagopam; Stephan Gebel; Alexander Mazein; Michal Kuzma; Simone Zorzan; Fintan McGee; Benoît Otjacques; Rudi Balling; Reinhard Schneider
Journal:  NPJ Syst Biol Appl       Date:  2016-09-22

9.  3DBIONOTES v2.0: a web server for the automatic annotation of macromolecular structures.

Authors:  Joan Segura; Ruben Sanchez-Garcia; Marta Martinez; Jesus Cuenca-Alba; Daniel Tabas-Madrid; C O S Sorzano; J M Carazo
Journal:  Bioinformatics       Date:  2017-11-15       Impact factor: 6.937
