Literature DB >> 23434633

The Rat Genome Database 2013--data, tools and users.

Stanley J F Laulederkind1, G Thomas Hayman, Shur-Jen Wang, Jennifer R Smith, Timothy F Lowry, Rajni Nigam, Victoria Petri, Jeff de Pons, Melinda R Dwinell, Mary Shimoyama, Diane H Munzenmaier, Elizabeth A Worthey, Howard J Jacob.   

Abstract

The Rat Genome Database (RGD) was started >10 years ago to provide a core genomic resource for rat researchers. Currently, RGD combines genetic, genomic, pathway, phenotype and strain information with a focus on disease. RGD users are provided with access to structured and curated data from the molecular level through the organismal level. Those users access RGD from all over the world. End users are not only rat researchers but also researchers working with mouse and human data. Translational research is supported by RGD's comparative genetics/genomics data in disease portals, in GBrowse, in VCMap and on gene report pages. The impact of RGD also goes beyond the traditional biomedical researcher, as the influence of RGD reaches bioinformaticians, tool developers and curators. Import of RGD data into other publicly available databases expands the influence of RGD to a larger set of end users than those who avail themselves of the RGD website. The value of RGD continues to grow as more types of data and more tools are added, while reaching more types of end users.

Entities:  

Keywords:  database; disease; genome; human; rat

Mesh:

Year:  2013        PMID: 23434633      PMCID: PMC3713714          DOI: 10.1093/bib/bbt007

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


INTRODUCTION

The Rat Genome Database (RGD, http://rgd.mcw.edu) is the model organism database for the laboratory rat, Rattus norvegicus. The rat is a widely studied animal model for physiology, pharmacology, toxicology, nutrition, behavior, immunology and disease [1, 2]. The primary goal of RGD is to provide support for researchers using the rat as a model organism to understand human physiology and disease. To provide data for translational research, RGD manually curates disease and molecular pathway literature for rat, mouse and human and mammalian phenotype literature for rat and human. Imported data for gene ontology, drug–gene interactions, pathways and mammalian phenotype fill out the database to give broad coverage of relevant information across all three species. Behind the data, RGD represents an infrastructure of software tools and ontology development that supports both the display and analysis of the data.

WHAT IS THE VALUE OF HAVING DATA AVAILABLE IN RGD?

The inherent value of information in a database is the convenience of having accumulated data in an easy-to-access location and format. The realized value to the database user is the time savings because the data is collected and displayed by a trained dedicated team of curators and developers. Information in the database is presented in an organized manner with controlled vocabularies and ontologies. The result is the ability of the database user to access condensed, filtered data sets that would take far more time to accumulate and assimilate if curators and automated pipelines had not collected them into the database [3]. Having rat, mouse and human data in the same place makes it easy to compare disease phenotypes, genes and functional annotations across the three species. A secondary, but important, access to RGD data is the display of RGD-generated annotations on other publicly available websites. Major databases that store and display RGD annotations are the Gene Ontology Consortium [4], NCBI Gene [5], Ensembl [6], UniProt-GOA [7] and UCSC Genome Bioinformatics [8]. Sharing data with other databases exposes more researchers to RGD annotations and offers alternate routes to the RGD website as noted earlier in the text. Conversely, RGD imports data from and displays links to an expanding array of other databases, such as the Comparative Toxicogenomics Database (CTD) [9], the Kyoto Encyclopedia of Genes and Genomes (KEGG) [9] and the Pathway Interaction Database (PID) [10]. Beyond the convenience of data presentation and access, RGD provides software tools for data analysis. Efficient data search, ontology browser and genome browser tools [11, 12] are provided to find the information that the database user seeks. Analysis tools (Gene Annotator, RatMine, SNPlotyper and VCMap) [13 14] allow manipulation of data sets gathered at RGD to make qualitative and quantitative assessments.

WHO USES THE RAT GENOME DATABASE AND FOR WHAT PURPOSES?

For many years, RGD has been and continues to be the key repository for rat gene, quantitative trait locus (QTL) and strain data. As a measure of what type of researchers use these data, where the researchers are located and how they use the database, we have used a Google Analytics analysis of the RGD website and a manual analysis of publications in PubMed Central that cite RGD. Demographics, website use behavior, website traffic sources and context of citations have all been used to evaluate the use of RGD. RGD users reside all over the globe, with the largest number being in the United States (∼45%), followed by Europe (∼25%) and Asia (∼25%). Use increases at an annual rate of ∼7%. Half of all visits to RGD originate in a general internet search engine like Google. In all, 15–20% of visits consist of users who come directly to the RGD website by way of bookmarks or manual URL entry. The majority of the remaining visits to RGD come via referring links at the NCBI (National Center for Biotechnology Information) [15], HGNC (HUGO Gene Nomenclature Committee) [16] and Ensembl [6] websites. One might assume that researchers using rat as an experimental model would be the primary users of RGD. Based on articles in PubMed Central that cite ‘Rat Genome Database’, this is true for ∼40% of the returned titles. It is also true that 40% of the genetics articles involve studies with some combination of rat, mouse and human genes [17-19]. Those articles with human and rat and/or mouse data are examples of basic translational research. The authors reporting rat and mouse data provide insights and discoveries that may be applied to human research or clinical practice. Likewise, articles with human data may provide information that provokes new ideas or approaches in the study of disease or physiology in animal models. The genetic rat data at RGD allows comparison with both mouse and human without leaving the database. Every rat gene report page at RGD has comparative genomics data for the corresponding mouse and human orthologs (Figure 1). At a glance the database user can see the base pair coordinates or cytogenetic positions on the specific chromosomes that contain the gene and orthologs in question. Similarly, the mouse and human gene report pages at RGD have comparative genomics data for the corresponding two orthologs.
Figure 1:

Comparative map data. Genomic map data for prostaglandin synthase 1 in rat, human and mouse. The assemblies shown in bold font are the current reference assemblies. Alternate assemblies (e.g. previous assemblies and Celera assemblies) and alternate map data (cytogenetic and genetic maps) are also shown for each species.

Comparative map data. Genomic map data for prostaglandin synthase 1 in rat, human and mouse. The assemblies shown in bold font are the current reference assemblies. Alternate assemblies (e.g. previous assemblies and Celera assemblies) and alternate map data (cytogenetic and genetic maps) are also shown for each species. The genome tool GBrowse [12, 20–22] affords another opportunity to analyze and compare the genetics/genomics of rat, mouse and human orthologs. RGD has three interconnected GBrowse tools specifically for rat, mouse and human genome assemblies. The rat version of GBrowse can show subsets of rat genes, QTLs and single-nucleotide polymorphisms (SNPs). Rat GBrowse also has human and mouse synteny blocks to allow quick comparison of homologous segments between rat, mouse and human chromosomes (Figure 2). The user can select one of the synteny tracks and switch the viewer to mouse or human GBrowse via a pop-up window and then view the other two species as synteny blocks.
Figure 2:

Rat GBrowse. The rat Ptgs1 gene and two other genes are shown on chromosome 3 with the mouse and human synteny blocks from chromosomes 2 and 9, respectively, which indicates where the human and mouse orthologs of these particular rat genes are located.

Rat GBrowse. The rat Ptgs1 gene and two other genes are shown on chromosome 3 with the mouse and human synteny blocks from chromosomes 2 and 9, respectively, which indicates where the human and mouse orthologs of these particular rat genes are located. A third comparative genomics feature at RGD is VCMap (Figure 3) [14], which is a tool that compares genomic positions of genes simultaneously between different species, including rat, mouse and human. It gives a finer view than GBrowse of syntenic regions between species. The relative locations of numerous genes can be viewed in one window and compared across species.
Figure 3:

VCMap tool. A view of homologous regions of rat, mouse and human chromosomes containing the Ptgs1/PTGS1 gene and its syntenic neighbors. VCMap also contains genetic/genomic maps of other vertebrates (cow, chicken, horse and pig).

VCMap tool. A view of homologous regions of rat, mouse and human chromosomes containing the Ptgs1/PTGS1 gene and its syntenic neighbors. VCMap also contains genetic/genomic maps of other vertebrates (cow, chicken, horse and pig). Traditionally, the laboratory rat has been used as a model system for human disease. That continues to be the case and is the driving force behind the creation of RGD disease portals. Currently, RGD has seven disease portals: Cancer, Cardiovascular Disease, Diabetes, Immune and Inflammatory Disease, Neurological Disease, Obesity/Metabolic Syndrome and Respiratory Disease Portals. The disease curation done at RGD covers disease models in rat and mouse, as well as disease data generated by research with human subjects. Disease–gene annotations are not only made to the species from which the data originated but also to the orthologous genes in the other two species. That allows users to find significant relevant data from the other two species while looking specifically at data in rat, mouse or human. Disease data for rat QTLs, human QTLs and rat strains are also found in the disease portals. The GViewer within the portal home page shows the genomic locations of all genes and QTLs associated with the chosen disease (Figure 4) [23]. The view toggles between rat, mouse and human genomes, with an option to see homologous syntenic views in the other two species.
Figure 4:

The Immune & Inflammatory Disease Portal is the latest of the RGD disease portals. Rat, human and mouse genes, rat and human QTLs and rat strains associated with immune and inflammation-related disease can be accessed through this portal home page. The GViewer shows chromosomal locations of all genes (gray triangles) and QTLs (light gray bars) in the portal. The location of an individual gene (Ptgs1) is shown here by mousing over (arrowhead) the appropriate triangle adjacent to chromosome 3.

The Immune & Inflammatory Disease Portal is the latest of the RGD disease portals. Rat, human and mouse genes, rat and human QTLs and rat strains associated with immune and inflammation-related disease can be accessed through this portal home page. The GViewer shows chromosomal locations of all genes (gray triangles) and QTLs (light gray bars) in the portal. The location of an individual gene (Ptgs1) is shown here by mousing over (arrowhead) the appropriate triangle adjacent to chromosome 3. Appropriate to RGD’s focus on disease data for genes, QTLs and strains, the articles that cite RGD most often are those concerned with disease research (33% of articles in PubMed Central from January 2010 to December 2012, examples: [24-26]). Those users cover a range of disease research from QTLs to strain models to gene targets. They use RGD as a source of SNP, simple sequence length polymorphisms (SSLP), QTL and gene data. An additional important use of RGD data occurs via the FTP site [27]. All of the data found on the RGD website can be downloaded in bulk from the FTP site. The files available for download include descriptive information for genes, QTLs and strains, functional gene annotations, SSLPs, SNPs, ontology term files and more. Bulk downloads of data allow bioinformaticians to make analyses with their own software tools or other tools that are not available at RGD. Similar to widespread use of the rest of the RGD website, users from all over the world download files from the FTP site. Some of the downloaded data are used for public display of RGD data at sites such as NCBI, Ensembl and RIKEN [28]. Other users of the FTP site include many universities, research institutes and pharmaceutical companies in the United States, Europe and Asia. One of the tasks of RGD, in addition to data collection and presentation, is the responsibility for naming rat genes, QTLs and strains in a systematic manner. The fact that, during the last year and a half, references to nomenclature of rat genes, QTLs and strains [29-32] almost equal references to RGD QTL or gene data emphasizes the importance of official nomenclature. The importance of consistency in the naming of genetic elements and other data objects requires a stable source of that nomenclature, and RGD provides that service. The nomenclature of genes is particularly important in translational research because it helps keep track of orthologs between species. Gene nomenclature of rat, mouse and human genes are kept aligned by the coordinated work of the nomenclature committees of rat (RGD), mouse (MGD [33]) and human (HGNC [16]).

SUMMARY

For >10 years, RGD has been, and continues to be, a core genomic resource for rat researchers. During that time, RGD has expanded several elements. It has increased its data coverage [13], developed ontologies [34, 35] and has developed increasingly sophisticated software tools for both end users and curators [11, 36]. RGD provides numerous tools and data for rat, mouse and human to provide support for translational research. FTP files provide access to bulk RGD data for personalized analysis by users. The number of users of the database increases annually, accompanied by a broadening of user identity. The growing value of RGD comes not only from its data but also from its involvement in biological nomenclature, ontology development and software tool development. RGD is a global resource of biomedical data and analysis tools. RGD serves a diverse user base. RGD data support translational research.

Funding

The National Heart, Lung, and Blood Institute on behalf of the National Institutes of Health (HL64541). Funding for open access charge: The National Heart, Lung and Blood Institute on behalf of the National Institutes of Health (HL64541).
  21 in total

Review 1.  Progress and prospects in rat genetics: a community view.

Authors:  Timothy J Aitman; John K Critser; Edwin Cuppen; Anna Dominiczak; Xose M Fernandez-Suarez; Jonathan Flint; Dominique Gauguier; Aron M Geurts; Michael Gould; Peter C Harris; Rikard Holmdahl; Norbert Hubner; Zsuzsanna Izsvák; Howard J Jacob; Takashi Kuramoto; Anne E Kwitek; Anna Marrone; Tomoji Mashimo; Carol Moreno; John Mullins; Linda Mullins; Tomas Olsson; Michal Pravenec; Lela Riley; Kathrin Saar; Tadao Serikawa; James D Shull; Claude Szpirer; Simon N Twigger; Birger Voigt; Kim Worley
Journal:  Nat Genet       Date:  2008-05       Impact factor: 38.330

2.  Using the Generic Genome Browser (GBrowse).

Authors:  Maureen J Donlin
Journal:  Curr Protoc Bioinformatics       Date:  2009-12

3.  Comparative studies of mammalian acid lipases: Evidence for a new gene family in mouse and rat (Lipo).

Authors:  Roger S Holmes; Laura A Cox; John L VandeBerg
Journal:  Comp Biochem Physiol Part D Genomics Proteomics       Date:  2010-06-11       Impact factor: 2.674

4.  Mammalian glutaminase Gls2 gene encodes two functional alternative transcripts by a surrogate promoter usage mechanism.

Authors:  Mercedes Martín-Rufián; Marta Tosina; José A Campos-Sandoval; Elisa Manzanares; Carolina Lobo; J A Segura; Francisco J Alonso; José M Matés; Javier Márquez
Journal:  PLoS One       Date:  2012-06-05       Impact factor: 3.240

5.  Ontology searching and browsing at the Rat Genome Database.

Authors:  Stanley J F Laulederkind; Marek Tutaj; Mary Shimoyama; G Thomas Hayman; Timothy F Lowry; Rajni Nigam; Victoria Petri; Jennifer R Smith; Shur-Jen Wang; Jeff de Pons; Melinda R Dwinell; Howard J Jacob
Journal:  Database (Oxford)       Date:  2012-03-20       Impact factor: 3.451

6.  The Rat Genome Database pathway portal.

Authors:  Victoria Petri; Mary Shimoyama; G Thomas Hayman; Jennifer R Smith; Marek Tutaj; Jeff de Pons; Melinda R Dwinell; Diane H Munzenmaier; Simon N Twigger; Howard J Jacob
Journal:  Database (Oxford)       Date:  2011-04-08       Impact factor: 3.451

7.  RGD: a comparative genomics platform.

Authors:  Mary Shimoyama; Jennifer R Smith; Tom Hayman; Stan Laulederkind; Tim Lowry; Rajni Nigam; Victoria Petri; Shur-Jen Wang; Melinda Dwinell; Howard Jacob
Journal:  Hum Genomics       Date:  2011-01       Impact factor: 4.639

8.  Three ontologies to define phenotype measurement data.

Authors:  Mary Shimoyama; Rajni Nigam; Leslie Sanders McIntosh; Rakesh Nagarajan; Treva Rice; D C Rao; Melinda R Dwinell
Journal:  Front Genet       Date:  2012-05-28       Impact factor: 4.599

9.  Gene and protein nomenclature in public databases.

Authors:  Katrin Fundel; Ralf Zimmer
Journal:  BMC Bioinformatics       Date:  2006-08-09       Impact factor: 3.169

10.  Virus-induced autoimmune diabetes in the LEW.1WR1 rat requires Iddm14 and a genetic locus proximal to the major histocompatibility complex.

Authors:  Elizabeth P Blankenhorn; Laura Cort; Dale L Greiner; Dennis L Guberski; John P Mordes
Journal:  Diabetes       Date:  2009-08-31       Impact factor: 9.461

View more
  42 in total

Review 1.  miRNA Nomenclature: A View Incorporating Genetic Origins, Biosynthetic Pathways, and Sequence Variants.

Authors:  T Desvignes; P Batzel; E Berezikov; K Eilbeck; J T Eppig; M S McAndrews; A Singer; J H Postlethwait
Journal:  Trends Genet       Date:  2015-10-08       Impact factor: 11.639

2.  Islet adaptation to obesity and insulin resistance in WNIN/GR-Ob rats.

Authors:  Himadri Singh; Sireesha Ganneru; Venkata Malakapalli; Maniprabha Chalasani; Giridharan Nappanveettil; Ramesh R Bhonde; Vijayalakshmi Venkatesan
Journal:  Islets       Date:  2014       Impact factor: 2.694

3.  Structural libraries of protein models for multiple species to understand evolution of the renin-angiotensin system.

Authors:  Jeremy W Prokop; Victoria Petri; Mary E Shimoyama; Ingrid K M Watanabe; Dulce E Casarini; Thomas C Leeper; Stephanie M Bilinovich; Howard J Jacob; Robson A S Santos; Almir S Martins; Fabiano C Araujo; Fernando M Reis; Amy Milsted
Journal:  Gen Comp Endocrinol       Date:  2014-09-26       Impact factor: 2.822

4.  Application of quantitative trait locus mapping and transcriptomics to studies of the senescence-accelerated phenotype in rats.

Authors:  Elena E Korbolina; Nikita I Ershov; Leonid O Bryzgalov; Natalia G Kolosova
Journal:  BMC Genomics       Date:  2014-12-19       Impact factor: 3.969

5.  Rich annotation of DNA sequencing variants by leveraging the Ensembl Variant Effect Predictor with plugins.

Authors:  Michael Yourshaw; S Paige Taylor; Aliz R Rao; Martín G Martín; Stanley F Nelson
Journal:  Brief Bioinform       Date:  2014-03-12       Impact factor: 11.622

6.  Ept7 influences estrogen action in the pituitary gland and body weight of rats.

Authors:  Scott G Kurz; Kirsten L Dennison; Nyssa Becker Samanas; Maureen Peters Hickman; Quincy A Eckert; Tiffany L Walker; Andrea S Cupp; James D Shull
Journal:  Mamm Genome       Date:  2014-01-22       Impact factor: 2.957

7.  HOPS-dependent endosomal fusion required for efficient cytosolic delivery of therapeutic peptides and small proteins.

Authors:  Angela Steinauer; Jonathan R LaRochelle; Susan L Knox; Rebecca F Wissner; Samuel Berry; Alanna Schepartz
Journal:  Proc Natl Acad Sci U S A       Date:  2019-01-08       Impact factor: 11.205

8.  Long Evans rat spermatogonial lines are effective germline vectors for transgenic rat production.

Authors:  F Kent Hamra; Christopher T Richie; Brandon K Harvey
Journal:  Transgenic Res       Date:  2017-06-12       Impact factor: 2.788

9.  Development and characterization of a novel rat model of estrogen-induced mammary cancer.

Authors:  Kirsten L Dennison; Nyssa Becker Samanas; Quincy Eckert Harenda; Maureen Peters Hickman; Nicole L Seiler; Lina Ding; James D Shull
Journal:  Endocr Relat Cancer       Date:  2015-04       Impact factor: 5.678

10.  Use of Drosophila as an evaluation method reveals imp as a candidate gene for type 2 diabetes in rat locus Niddm22.

Authors:  Kurenai Kawasaki; Sawaka Yamada; Koki Ogata; Yumiko Saito; Aiko Takahama; Takahisa Yamada; Kozo Matsumoto; Hiroyuki Kose
Journal:  J Diabetes Res       Date:  2015-03-02       Impact factor: 4.011

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.