Literature DB >> 30768147

3DBIONOTES v3.0: crossing molecular and structural biology data with genomic variations.

Joan Segura1, Ruben Sanchez-Garcia1, C O S Sorzano1, J M Carazo1.   

Abstract

MOTIVATION: Many diseases are associated to single nucleotide polymorphisms that affect critical regions of proteins as binding sites or post translational modifications. Therefore, analysing genomic variants with structural and molecular biology data is a powerful framework in order to elucidate the potential causes of such diseases.
RESULTS: A new version of our web framework 3DBIONOTES is presented. This version offers new tools to analyse and visualize protein annotations and genomic variants, including a contingency analysis of variants and amino acid features by means of a Fisher exact test, the integration of a gene annotation viewer to highlight protein features on gene sequences and a protein-protein interaction viewer to display protein annotations at network level.
AVAILABILITY AND IMPLEMENTATION: The web server is available at https://3dbionotes.cnb.csic.es. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. CONTACT: Spanish National Institute for Bioinformatics (INB ELIXIR-ES) and Biocomputing Unit, National Centre of Biotechnology (CSIC)/Instruct Image Processing Centre, C/ Darwin nº 3, Campus of Cantoblanco, 28049 Madrid, Spain.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Year:  2019        PMID: 30768147      PMCID: PMC6748749          DOI: 10.1093/bioinformatics/btz118

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Next-generation sequencing has flooded many databases with biomedical data where single-nucleotide variations are associated with phenotypes or diseases (Zerbino ). This information comprises collections of variant–disease pairs that can be used to infer which genomic variations might be involved in a particular disease. However, changes on the biochemical or structural features of the affected amino acids (if applicable) can be more informative in order to understand the causes of diseases. For that reason, some of the existent resources compiling variant–disease knowledge also annotates protein residues with biochemical features displaying what properties could be affected (Dingerdissen ). In this work, we present a new version of 3DBIONOTES (Segura ; Tabas-Madrid ) where different analysis tools and viewers have been integrated to find how genomic variants may affect the different protein residues. 3DBIONOTES is a web framework that integrates biological annotations and structural information of proteins from multiple sources (see Supplementary Section S1). In this version the application computes Fisher’s exact test in order to find what biochemical or structural features are statistically affected by the variants associated to a particular disease. Moreover, a new panel displays those annotated regions where the co-occurrence between protein features and variants are statistically enriched. In addition, a gene annotation viewer has been fully integrated to display protein features at gene level. Also, a protein–protein interaction (PPI) viewer has been included in such a way that the different annotations can be displayed at network level. Finally, and as an additional tool, a new type of query, request by set of proteins, has been implemented to explore and analyse PPI networks. Moreover, custom annotations, including variants, can be submitted and analysed with the biological features integrated in the application.

2 New features

2.1 Gene annotation viewer

In this version of 3DBIONOTES, a gene annotation viewer has been fully integrated (see Supplementary Section S2). This panel displays gene information from ENSEMBL database (Zerbino ); the collected information includes: introns, exons, codifying regions and genomic variants. Moreover, ENSEMBL gene sequences are aligned with UniProt (UniProt Consortium, 2018) and PDB (Burley ) amino acids in such a way that protein annotated regions can be highlighted on gene sequences and vice versa (see Supplementary Section S2).

2.2 Genomic variants contingency analysis

In order to find which protein regions are statistically affected by the genomic variants associated to a particular disease, Fisher’s exact test between the different annotations and the variants associated to the different diseases is computed. The main objective is to find co-occurrence of protein residues between the different structural or biochemical annotations and the variants associated to diseases. For example, most cancer related variants of the KRAS human protein map on its nucleotide binding region (see Section 3.1 and Supplementary Section S3).

2.3 Exploring PPI networks

A new type of query to request information for a set of proteins is now available. Moreover, a panel to visualize PPI networks using a graph-based representation has been integrated. This panel displays the physical binding between proteins when the information for a multimeric entry is requested or the PPIs that have been experimentally observed when a given set of proteins is submitted. In the first case, the contacts are computed using a distance threshold of 6 Å between heavy atoms. For the second case, PPI data is collected from Interactome3D (Mosca ). Moreover, the network panel can display annotations at network level using a similar approach as dSysMap (Mosca ) (see Supplementary Section S4).

2.4 Submitting custom annotations

The application supports the submission of custom data in such a way that users can analyse their own genomic variants or other annotations and compare them with 3DBIONOTES integrated data. The submitted information is fully integrated and the different visualization and analysis tools can be used to display and process the external data.

3 Use cases

3.1 Analysis on the human KRAS genomic variants

In this example we have analysed the genomic variants associated to KRAS human protein (UniProt accession P01116). KRAS protein is a GTPase that acts as a signalling switch in many transduction pathways including cell proliferation. The active state of KRAS occurs when the protein is bound to GTP. In this state, the protein recruits and activates other growth factors and cell signalling receptors. Upon GTP hydrolysis and conversion to GDP, KRAS is inactivated. KRAS mutations are known to be involved in different diseases such as multiple cancer types or neurofibromatosis (Simanshu, et al., 2017). We used 3DBIONOTES to analyse the co-occurrence of KRAS variants associated to diseases with the different biochemical annotations. The main reason was to check whether those variants occur in particular regions of KRAS or randomly distributed. Supplementary Figure S4 and Supplementary Tables S1 and S2 display the analysis panel of 3DBIONOTES and clearly show that many of those variants occur in the ‘Nucleotide Binding Site’ annotated regions. KRAS acts as on/off switch in many processes and its active or inactive form depends on the interaction with GTP or GDP, respectively. Then, mutations affecting KRAS GTP/GDP-binding sites may affect its activation and therefore, many cell regulatory processes.

3.2 GNB1 neurodevelopmental disability

This example illustrates how 3DBIONOTES can be used to analyse external variants. We have collected the variants of the G protein subunit beta (GNB1) associated to neurodevelopmental delay, hypotonia and seizures available in the work of Petrovski (see Supplementary Table S3). GNB1 protein modulates transmembrane signalling pathways controlled by G protein-coupled receptors. We have requested the PPI network information for the GNB1 protein (UniProt accession P62873) and attached the collected variants to 3DBIONOTES. When the variants are mapped to the PPI network, many of them appear affecting the binding sites between GNB1 and other G proteins. Moreover, the contingency analysis identified that the co-occurrence between many of the GNB1-biding sites and the submitted variants was statistically significant (see Supplementary Fig. S8). Consequently, mutations of the GNB1-binding sites may affect the interaction with other G proteins and, thus, some of the cell signalling pathways involving G proteins.

Funding

This work was supported by Ministerio de Economía, Industria y Competitividad, Gobierno de España [grant No. BIO2016-76400-R(AEI/FEDER, UE)]; Comunidad de Madrid [grant No. S2017/BMD-3817]; Instituto de Salud Carlos III [grant No. PT13/0001/0009; INB Grant PT17/0009/0010 - ISCIII-SGEFI/ERDF]; Horizon 2020 [grant No. Elixir – EXCELERATE INFRADEV-3-2015, Proposal 676559] and iNEXT [INFRAIA-1-2014-2015, Proposal 653706]; Ministerio de Ciencia, Innovación y Universidades, Gobierno de España [Juan de la Cierva-E-28-2018-0015407 to J.S.]; and Ministerio de Educación, Cultura y Deporte [FPU-2015/264 to R.S.-G.]. Conflict of Interest: none declared. Click here for additional data file.
  10 in total

1.  3DBIONOTES: A unified, enriched and interactive view of macromolecular information.

Authors:  D Tabas-Madrid; J Segura; R Sanchez-Garcia; J Cuenca-Alba; C O S Sorzano; J M Carazo
Journal:  J Struct Biol       Date:  2016-02-10       Impact factor: 2.867

2.  Germline De Novo Mutations in GNB1 Cause Severe Neurodevelopmental Disability, Hypotonia, and Seizures.

Authors:  Slavé Petrovski; Sébastien Küry; Candace T Myers; Kwame Anyane-Yeboa; Benjamin Cogné; Martin Bialer; Fan Xia; Parisa Hemati; James Riviello; Michele Mehaffey; Thomas Besnard; Emily Becraft; Alexandrea Wadley; Anya Revah Politi; Sophie Colombo; Xiaolin Zhu; Zhong Ren; Ian Andrews; Tracy Dudding-Byth; Amy L Schneider; Geoffrey Wallace; Aaron B I Rosen; Susan Schelley; Gregory M Enns; Pierre Corre; Joline Dalton; Sandra Mercier; Xénia Latypova; Sébastien Schmitt; Edwin Guzman; Christine Moore; Louise Bier; Erin L Heinzen; Peter Karachunski; Natasha Shur; Theresa Grebe; Alice Basinger; Joanne M Nguyen; Stéphane Bézieau; Klaas Wierenga; Jonathan A Bernstein; Ingrid E Scheffer; Jill A Rosenfeld; Heather C Mefford; Bertrand Isidor; David B Goldstein
Journal:  Am J Hum Genet       Date:  2016-04-21       Impact factor: 11.025

3.  dSysMap: exploring the edgetic role of disease mutations.

Authors:  Roberto Mosca; Jofre Tenorio-Laranga; Roger Olivella; Victor Alcalde; Arnaud Céol; Montserrat Soler-López; Patrick Aloy
Journal:  Nat Methods       Date:  2015-03       Impact factor: 28.547

4.  Interactome3D: adding structural details to protein networks.

Authors:  Roberto Mosca; Arnaud Céol; Patrick Aloy
Journal:  Nat Methods       Date:  2012-12-16       Impact factor: 28.547

Review 5.  RAS Proteins and Their Regulators in Human Disease.

Authors:  Dhirendra K Simanshu; Dwight V Nissley; Frank McCormick
Journal:  Cell       Date:  2017-06-29       Impact factor: 41.582

6.  BioMuta and BioXpress: mutation and expression knowledgebases for cancer biomarker discovery.

Authors:  Hayley M Dingerdissen; John Torcivia-Rodriguez; Yu Hu; Ting-Chia Chang; Raja Mazumder; Robel Kahsay
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

7.  UniProt: a worldwide hub of protein knowledge.

Authors: 
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

8.  Ensembl 2018.

Authors:  Daniel R Zerbino; Premanand Achuthan; Wasiu Akanni; M Ridwan Amode; Daniel Barrell; Jyothish Bhai; Konstantinos Billis; Carla Cummins; Astrid Gall; Carlos García Girón; Laurent Gil; Leo Gordon; Leanne Haggerty; Erin Haskell; Thibaut Hourlier; Osagie G Izuogu; Sophie H Janacek; Thomas Juettemann; Jimmy Kiang To; Matthew R Laird; Ilias Lavidas; Zhicheng Liu; Jane E Loveland; Thomas Maurel; William McLaren; Benjamin Moore; Jonathan Mudge; Daniel N Murphy; Victoria Newman; Michael Nuhn; Denye Ogeh; Chuang Kee Ong; Anne Parker; Mateus Patricio; Harpreet Singh Riat; Helen Schuilenburg; Dan Sheppard; Helen Sparrow; Kieron Taylor; Anja Thormann; Alessandro Vullo; Brandon Walts; Amonida Zadissa; Adam Frankish; Sarah E Hunt; Myrto Kostadima; Nicholas Langridge; Fergal J Martin; Matthieu Muffato; Emily Perry; Magali Ruffier; Dan M Staines; Stephen J Trevanion; Bronwen L Aken; Fiona Cunningham; Andrew Yates; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

9.  3DBIONOTES v2.0: a web server for the automatic annotation of macromolecular structures.

Authors:  Joan Segura; Ruben Sanchez-Garcia; Marta Martinez; Jesus Cuenca-Alba; Daniel Tabas-Madrid; C O S Sorzano; J M Carazo
Journal:  Bioinformatics       Date:  2017-11-15       Impact factor: 6.937

10.  RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy.

Authors:  Stephen K Burley; Helen M Berman; Charmi Bhikadiya; Chunxiao Bi; Li Chen; Luigi Di Costanzo; Cole Christie; Ken Dalenberg; Jose M Duarte; Shuchismita Dutta; Zukang Feng; Sutapa Ghosh; David S Goodsell; Rachel K Green; Vladimir Guranovic; Dmytro Guzenko; Brian P Hudson; Tara Kalro; Yuhe Liang; Robert Lowe; Harry Namkoong; Ezra Peisach; Irina Periskova; Andreas Prlic; Chris Randle; Alexander Rose; Peter Rose; Raul Sala; Monica Sekharan; Chenghua Shao; Lihua Tan; Yi-Ping Tao; Yana Valasatava; Maria Voigt; John Westbrook; Jesse Woo; Huanwang Yang; Jasmine Young; Marina Zhuravleva; Christine Zardecki
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

  10 in total
  6 in total

1.  d-StructMAn: Containerized structural annotation on the scale from genetic variants to whole proteomes.

Authors:  Alexander Gress; Sanjay K Srikakulam; Sebastian Keller; Vasily Ramensky; Olga V Kalinina
Journal:  Gigascience       Date:  2022-09-20       Impact factor: 7.658

2.  Characterization of mutations modulating enhanced transmissibility of SARS-CoV-2 B.1.617+ (Delta) variant using In Silico tools.

Authors:  Nariman Shahhosseini
Journal:  Gene Rep       Date:  2022-06-14

3.  3DBionotes COVID-19 Edition.

Authors:  Jose Ramon Macias; Ruben Sanchez-Garcia; Pablo Conesa; Erney Ramirez-Aportela; Marta Martinez Gonzalez; Carlos Wert-Carvajal; Alberto M Parra-Perez; Joan Segura Mora; Sam Horrell; Andrea Thorn; Carlos O S Sorzano; Jose Maria Carazo
Journal:  Bioinformatics       Date:  2021-05-20       Impact factor: 6.931

4.  High-performance macromolecular data delivery and visualization for the web.

Authors:  David Sehnal; Radka Svobodová; Karel Berka; Alexander S Rose; Stephen K Burley; Sameer Velankar; Jaroslav Koča
Journal:  Acta Crystallogr D Struct Biol       Date:  2020-11-26       Impact factor: 7.652

5.  PDBe-KB: collaboratively defining the biological context of structural data.

Authors: 
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

Review 6.  An immersive journey to the molecular structure of SARS-CoV-2: Virtual reality in COVID-19.

Authors:  Martín Calvelo; Ángel Piñeiro; Rebeca Garcia-Fandino
Journal:  Comput Struct Biotechnol J       Date:  2020-09-18       Impact factor: 7.271

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.