Literature DB >> 22064856

NucleaRDB: information system for nuclear receptors.

Bas Vroling¹, David Thorne, Philip McDermott, Henk-Jan Joosten, Teresa K Attwood, Steve Pettifer, Gert Vriend.

Abstract

The NucleaRDB is a Molecular Class-Specific Information System that collects, combines, validates and disseminates large amounts of heterogeneous data on nuclear hormone receptors. It contains both experimental and computationally derived data. The data and knowledge present in the NucleaRDB can be accessed using a number of different interactive and programmatic methods and query systems. A nuclear hormone receptor-specific PDF reader interface is available that can integrate the contents of the NucleaRDB with full-text scientific articles. The NucleaRDB is freely available at http://www.receptors.org/nucleardb.

Entities: Chemical Disease Mutation Species

Mesh：

Substances：

Year: 2011 PMID： 22064856 PMCID： PMC3245090 DOI： 10.1093/nar/gkr960

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

Nuclear receptors (NRs) are ligand-inducible transcription factors that regulate processes, such as homeostasis, differentiation, embryonic development and organ physiology. A total of 49 human NRs have been identified (1). Their ligands are lipophilic compounds such as steroids, thyroid hormone, vitamin D3 and retinoids (2). The endogenous ligands are not yet known for 30% of the NRs (3). As nuclear receptors are involved in almost all aspects of human physiology and are implicated in many important diseases including cancer, diabetes and osteoporosis, understanding of these receptors has major implications for human biology and for the development of new drug treatments. Nuclear receptors are targets for pharmaceutical industries with similar importance (4), as the G protein-coupled receptors (GPCRs), ion channels and kinases. Due to the increasing amounts of experimental and computational data buried in numerous databases and scientific articles, the task of extracting, combining and validating this data is becoming an increasingly large hurdle for the individual scientist. Databases that revolve around a single protein family can help researchers in using all data needed for their research, while relieving them of the onerous tasks related to the retrieval of many data from different sources (5). The NucleaRDB is a data source that holds many different data types (Table 1) in a well organized and easily accessible form (6). The data are validated, internally consistent and updated regularly. The NucleaRDB provides access to the data via various interfaces, which depending on the users’ needs, are suited either for automated access or interactive usage.

Table 1.

Contents of the NucleaRDB

Proteins	3764
Families	123
Mutations	1543
Protein structures	613
Structure models	3764
Residues	2 012 651
Species	339

DATA CONTENTS

Primary data

The NucleaRDB contains three different primary data types: sequences, structures and mutations. Sequences and structures were updated as described previously (7). Mutation data was obtained from the Nuclear Receptor Mutation Database (8) and fully integrated in the NucleaRDB. In addition, a large body of mutations was extracted from literature by the software package MuteXt (9).

Computational data

A large and diverse collection of computationally generated data are present in the NucleaRDB. Multiple sequence alignments (MSAs) form the heart of the system and allow users to easily transfer information between different proteins. MSAs are available for all families and subfamilies, and can be viewed using JalView (10) or can be directly downloaded in a number of formats. MSAs were created as described previously (7). Correlated mutation analyses (CMA) can be used to identify groups of residues that mutate in tandem. Residues that show correlated mutation behavior are likely to be functionally related, and networks of those correlating residues indicate functional units (11). Correlation scores are available for all (sub-)families. The entropy and variability for a position in a MSA can be an indicator of the evolutionary pressures exerted at that position (12). Entropy and variability scores are available in tabular form and via an interactive page displaying an integrated view via plots, tables and structure models. In addition to the already large amount of structural information that is present in the NucleaRDB, homology models based on multiple template structures have been built for all NRs. All structure models were built using YASARA (13) and are available for download or can be viewed directly using Jmol (14).

INFORMATION RETRIEVAL

All data in the NucleaRDB web interface are extensively connected, allowing for easy navigation between different data types. The main way of accessing the NucleaRDB’s contents is via the hierarchical family tree. For each family, users can access the individual receptors, multiple sequence alignments (and all derived data and analyses such as correlation scores and protein distance networks), mutations, structures and models (Figure 1). All pages contain links to all related data and information. Extensive search facilities are available, allowing the search for proteins, sequences, structures, families and mutations using various search criteria and filters. A BLAST service is available that allows users to run their own sequences against the NucleaRDB.

Figure 1.

Screenshot of the NucleaRDB family page. The family tree is shown on the left with the thyroid hormone family expanded. On the right-hand side, the data for the selected family is shown.

All data types and search facilities are accessible from the web pages as well as from the web service endpoints, allowing users to write workflows or in-house software that uses the NucleaRDB.

Annotating scientific literature

Utopia Documents (15,16) is a new PDF reader that offers unique opportunities to place information and knowledge in the context of scientific literature. We have integrated the NucleaRDB with the Utopia Documents PDF reader in such a way as to present to scientists, in a non-intrusive way, all NR-relevant data and information discussed in an article at hand. Annotations are provided for proteins, residues and mutations mentioned in the PDF. For each of these concepts the annotations contain carefully selected information, as well as pointers to relevant web pages and related scientific literature. An example is shown in Figure 2. The PDF reader presents the scientist, in a non-intrusive way, all relevant data and information related to the topics discussed in the article. This alleviates the troubles associated with navigating the many links between existing data and information available from the many articles in this field. The scientist neither struggles to get access to information related to topics within an article, nor is swamped by unnecessary information that still needs disambiguation; only data and information relevant to the topic of the article is made available.

Figure 2.

An impression of the Utopia Documents PDF reader interface to the NucleaRDB data. On the left-hand side a part of a scientific paper (17) is shown that is annotated by the NucleaRDB. Annotations are available for all the highlighted words. On the right-hand side an example of such an annotation (the mutation R274A) is displayed.

Screenshot of the NucleaRDB family page. The family tree is shown on the left with the thyroid hormone family expanded. On the right-hand side, the data for the selected family is shown. An impression of the Utopia Documents PDF reader interface to the NucleaRDB data. On the left-hand side a part of a scientific paper (17) is shown that is annotated by the NucleaRDB. Annotations are available for all the highlighted words. On the right-hand side an example of such an annotation (the mutation R274A) is displayed. Contents of the NucleaRDB

IMPLEMENTATION

The data in the NucleaRDB is stored in a PostgreSQL (www.postgresql.org) relational database. The web service interface is developed with the Apache CXF (cxf.apache.org) web services framework. We offer both Simple Object Access Protocol and Representational state transfer endpoints. The web interface is built using the Apache Wicket (wicket.apache.org) web application framework. The database is accessed via a Hibernate (www.hibernate.org) object-relational mapping layer. The server is running within Sun’s Glassfish (www.glassfish.org) application server.

CONCLUSION

The NucleaRDB provides researchers with a single point of access for nuclear receptor-related data. Not only does the NucleaRDB hold a large amount of information, it also provides a broad scope of tools and dissemination facilities, relieving scientist of many of the tasks that come with collecting, validating and integrating many diverse data.

FUNDING

BioRange program of the Netherlands Bioinformatics Centre (NBIC); BSIK grant through the Netherlands Genomics Initiative (NGI); EMBRACE project that is funded by the European Commission within its FP6 Programme, under the thematic area ‘Life sciences, genomics and biotechnology for health’ (contract number LHSG-CT-2004-512092); and TIPharma. Funding for open access charge: RUNMC. Conflict of interest statement. None declared.

17 in total

1. Collecting and harvesting biological data: the GPCRDB and NucleaRDB information systems.

Authors: F Horn; G Vriend; F E Cohen
Journal: Nucleic Acids Res Date: 2001-01-01 Impact factor: 16.971

Review 2. Orphan nuclear receptors: shifting endocrinology into reverse.

Authors: S A Kliewer; J M Lehmann; T M Willson
Journal: Science Date: 1999-04-30 Impact factor: 47.728

3. NRMD: Nuclear Receptor Mutation Database.

Authors: Joost J J Van Durme; Emmanuel Bettler; Simon Folkertsma; Florence Horn; Gert Vriend
Journal: Nucleic Acids Res Date: 2003-01-01 Impact factor: 16.971

4. Correlated mutation analyses on very large sequence families.

Authors: L Oliveira; A C M Paiva; G Vriend
Journal: Chembiochem Date: 2002-10-04 Impact factor: 3.164

5. Automated extraction of mutation data from the literature: application of MuteXt to G protein-coupled receptors and nuclear hormone receptors.

Authors: Florence Horn; Anthony L Lau; Fred E Cohen
Journal: Bioinformatics Date: 2004-01-22 Impact factor: 6.937

6. A two-entropies analysis to identify functional positions in the transmembrane region of class A G protein-coupled receptors.

Authors: Kai Ye; Eric-Wubbo M Lameijer; Margot W Beukers; Adriaan P Ijzerman
Journal: Proteins Date: 2006-06-01

7. How many nuclear hormone receptors are there in the human genome?

Authors: M Robinson-Rechavi; A S Carpentier; M Duffraisse; V Laudet
Journal: Trends Genet Date: 2001-10 Impact factor: 11.639

8. Jalview Version 2--a multiple sequence alignment editor and analysis workbench.

Authors: Andrew M Waterhouse; James B Procter; David M A Martin; Michèle Clamp; Geoffrey J Barton
Journal: Bioinformatics Date: 2009-01-16 Impact factor: 6.937

9. GPCRDB: information system for G protein-coupled receptors.

Authors: Bas Vroling; Marijn Sanders; Coos Baakman; Annika Borrmann; Stefan Verhoeven; Jan Klomp; Laerte Oliveira; Jacob de Vlieg; Gert Vriend
Journal: Nucleic Acids Res Date: 2010-11-02 Impact factor: 16.971

Review 10. The nuclear receptor superfamily: the second decade.

Authors: D J Mangelsdorf; C Thummel; M Beato; P Herrlich; G Schütz; K Umesono; B Blumberg; P Kastner; M Mark; P Chambon; R M Evans
Journal: Cell Date: 1995-12-15 Impact factor: 41.582

7 in total

1. NURBS: a database of experimental and predicted nuclear receptor binding sites of mouse.

Authors: Yaping Fang; Hui-Xin Liu; Ning Zhang; Grace L Guo; Yu-Jui Yvonne Wan; Jianwen Fang
Journal: Bioinformatics Date: 2012-11-29 Impact factor: 6.937

2. Prioritizing Potentially Druggable Mutations with dGene: An Annotation Tool for Cancer Genome Sequencing Data.

Authors: Runjun D Kumar; Li-Wei Chang; Matthew J Ellis; Ron Bose
Journal: PLoS One Date: 2013-06-27 Impact factor: 3.240

3. Importins involved in the nuclear transportation of steroid hormone receptors: In silico and in vitro data.

Authors: Konstantina Kalyvianaki; Athanasios A Panagiotopoulos; Maria Patentalaki; Elias Castanas; Marilena Kampa
Journal: Front Endocrinol (Lausanne) Date: 2022-09-06 Impact factor: 6.055

4. SARConnect: A Tool to Interrogate the Connectivity Between Proteins, Chemical Structures and Activity Data.

Authors: Mats Eriksson; Ingemar Nilsson; Thierry Kogej; Christopher Southan; Martin Johansson; Christian Tyrchan; Sorel Muresan; Niklas Blomberg; Marcus Bjäreland
Journal: Mol Inform Date: 2012-08-07 Impact factor: 3.353

5. Nuclear Receptor Signaling Atlas: Opening Access to the Biology of Nuclear Receptor Signaling Pathways.

Authors: Lauren B Becnel; Yolanda F Darlington; Scott A Ochsner; Jeremy R Easton-Marks; Christopher M Watkins; Apollo McOwiti; Wasula H Kankanamge; Michael W Wise; Michael DeHart; Ronald N Margolis; Neil J McKenna
Journal: PLoS One Date: 2015-09-01 Impact factor: 3.240

6. Accurate prediction of nuclear receptors with conjoint triad feature.

Authors: Hongchu Wang; Xuehai Hu
Journal: BMC Bioinformatics Date: 2015-12-03 Impact factor: 3.169

7. ONRLDB--manually curated database of experimentally validated ligands for orphan nuclear receptors: insights into new drug discovery.

Authors: Ravikanth Nanduri; Isha Bhutani; Arun Kumar Somavarapu; Sahil Mahajan; Raman Parkesh; Pawan Gupta
Journal: Database (Oxford) Date: 2015-12-04 Impact factor: 3.451

7 in total