| Literature DB >> 30357347 |
Sook Jung1, Taein Lee1, Chun-Huai Cheng1, Katheryn Buble1, Ping Zheng1, Jing Yu1, Jodi Humann1, Stephen P Ficklin1, Ksenija Gasic2, Kristin Scott1, Morgan Frank1, Sushan Ru3, Heidi Hough1, Kate Evans4, Cameron Peace1, Mercy Olmstead5, Lisa W DeVetter6, James McFerson4, Michael Coe7, Jill L Wegrzyn8, Margaret E Staton9, Albert G Abbott10, Dorrie Main1.
Abstract
The Genome Database for Rosaceae (GDR, https://www.rosaceae.org) is an integrated web-based community database resource providing access to publicly available genomics, genetics and breeding data and data-mining tools to facilitate basic, translational and applied research in Rosaceae. The volume of data in GDR has increased greatly over the last 5 years. The GDR now houses multiple versions of whole genome assembly and annotation data from 14 species, made available by recent advances in sequencing technology. Annotated and searchable reference transcriptomes, RefTrans, combining peer-reviewed published RNA-Seq as well as EST datasets, are newly available for major crop species. Significantly more quantitative trait loci, genetic maps and markers are available in MapViewer, a new visualization tool that better integrates with other pages in GDR. Pathways can be accessed through the new GDR Cyc Pathways databases, and synteny among the newest genome assemblies from eight species can be viewed through the new synteny browser, SynView. Collated single-nucleotide polymorphism diversity data and phenotypic data from publicly available breeding datasets are integrated with other relevant data. Also, the new Breeding Information Management System allows breeders to upload, manage and analyze their private breeding data within the secure GDR server with an option to release data publicly.Entities:
Mesh:
Year: 2019 PMID: 30357347 PMCID: PMC6324069 DOI: 10.1093/nar/gky1000
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Comparison of number of GDR entries between 2008, 2013 and 2018 by data type
| Number of entries by year | ||||
|---|---|---|---|---|
| Data type | 2008 | 2013 | 2018 | Details |
| Genome | 0 | 5 | 21 | Multiple versions of whole genome assemblies and annotations from seven major crops and 14 species |
| Gene and mRNA | 0 | 236191 genes | 528890 genes and 585574 mRNAs | Genes and mRNAs from whole genome assemblies and parsed from NCBI nucleotide sequences |
| Transcript | 90337 | 200467 | 1065226 RefTrans and 1412519 Unigenes | RefTrans for six genus and unigene V5 for six genus and the rosaceae family |
| Marker | 1700 | 2229311 | 3285775 | Including 127877 for Fragaria, 2613842 for Malus, 470747 for Prunus, 1593 for Pyrus, 71716 for Rosa and 8743 for Rubus |
| Genetic Map | 37 | 84 | 313 | Including 27 for Fragaria, 108 for Malus, 168 for Prunus, 9 for Pyrus, 15 for Rosa and 13 for Rubus |
| QTL | 0 | 1195 | 3799 | QTL and MTL associated with 392 agronomic traits |
| MTL | 27 | 52 | 103 | |
| Species | <100 | 516 | 1967 | Data available for 1967 species, specific species pages with hyperlinks to various data and tools for 13 major species |
| Germplasm | 0 | 8613 | 14411 | Including 1799 for Fragaria, 4635 for Malus, 7367 for Prunus, 465 for Pyrus, 145 for Rosa, and 42 for Rubus |
| Phenotype data | 0 | 578 568 | 878426 | 878426 measurements include 389191 publicly available data and the rest available for the RosBreed project team members |
| Genotype data | 0 | 28 296 | 10806569 | SNP genotype data from 10787946 measurements using 12229 markers for 6 species, SSR genotype data from 18623 measurements using 5044 markers for 10 species |
| Publication | 2447 | 5182 | 7449 | 7499 publications on genomic, genetic and breeding research in Rosaceae |
Figure 1.Synteny Viewer in GDR. (A) Home page of Synteny Viewer allows users to choose a reference genome and a chromosome and multiple genomes for comparison. Users can also choose a synteny block ID. (B) A circular diagram and a table shows the synteny blocks between a chromosome of a reference genome and all chromosomes of another genome being compared. (C) A bar diagram and a table that shows all the genes in a syntenic block. The table displays E-value between the matching genes and the gene names have hyperlinks to the gene detail page. (D) A gene detail page with resource side bar and the hyperlink to JBrowse. (E) JBrowse around the mRNA of interest with tracks such as gene, mRNA, SNP and genes parsed from NCBI nucleotide database.
Figure 2.SNP Genotype Search Page in GDR. (A) Users can search SNP genotype data by dataset name, species, germplasm name, SNP name, genome location and/or gene name. Users can also upload a file with a list of germplasm names. (B) Search result table that shows SNP name, genome location, allele and the genotype data of all the germplasm chosen in the order of SNP location in the genome. The red square highlights the options to download the genotype for all the marker displayed in the result page or the genotype data that are polymorphic in the germplasm set chosen.
Figure 3.BIMS in GDR. (A) ‘Template List’ subsection in ‘Data Import’ section provide downloadable templates for users to enter various breeding data. (B) Search’ section allows users to search and save the list of accessions using any combination of properties and trait cut of values: name, trial, location, cross, parent and trait values. The middle section shows the statistical information on the filtered dataset for the trait chosen and the right section shows the number of accessions filtered so far. (C) A page with the search result table. Users can add more columns in the table using ‘Column options’ and save/download the result table. (D) ‘Data Analysis’ section that allows users to choose two datasets, using the categories or saved accession lists and compare the trait statistics between the two datasets.