| Literature DB >> 26519466 |
Lili Hao, Huiyong Zhang, Zhang Zhang, Songnian Hu, Yu Xue.
Abstract
Rice is the most important staple food for a large part of the world's human population and also a key model organism for plant research. Here, we present Information Commons for Rice (IC4R; http://ic4r.org), a rice knowledgebase featuring adoption of an extensible and sustainable architecture that integrates multiple omics data through community-contributed modules. Each module is developed and maintained by different committed groups, deals with data collection, processing and visualization, and delivers data on-demand via web services. In the current version, IC4R incorporates a variety of rice data through multiple committed modules, including genome-wide expression profiles derived entirely from RNA-Seq data, resequencing-based genomic variations obtained from re-sequencing data of thousands of rice varieties, plant homologous genes covering multiple diverse plant species, post-translational modifications, rice-related literatures and gene annotations contributed by the rice research community. Unlike extant related databases, IC4R is designed for scalability and sustainability and thus also features collaborative integration of rice data and low costs for database update and maintenance. Future directions of IC4R include incorporation of other omics data and association of multiple omics data with agronomically important traits, dedicating to build IC4R into a valuable knowledgebase for both basic and translational researches in rice.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26519466 PMCID: PMC4702825 DOI: 10.1093/nar/gkv1141
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.IC4R committed sub-projects. In the current version, IC4R integrates data from multiple committed databases as well as offers a genome browser for data visualization: (1) Rice Expression Database, a repository of gene expression profiles derived entirely from RNA-Seq data analysis on tissues spanning a range of rice growth stages and covering a variety of biotic and abiotic treatments; (2) Rice Variation Database, a large set of single nucleotide polymorphisms based on re-sequencing of thousands of rice cultivars; (3) Plant Homolog Database, a database composed of more than 14,000 homologous genes covering diverse plant species; (4) Rice Literature Miner, a collection of rice-related publications by associating with specific rice genes; (5) RiceWiki, a wiki-based, publicly editable and open-content platform for community curation of rice genes; (6) Post-translational Modification, provided by three partner databases and (7) Rice Genome Browser, a genome browser for rice data visualization.
Data statistics of IC4R committed databases (as of 22 September 2015)
| Data Contents | Data Statistics |
|---|---|
| RNA-Seq projects | 17 |
| RNA-Seq experiments | 218 |
| Tissues | 8 |
| Gene expression profiles | 10 025 602 |
| Transcript expression profiles | 11 457 912 |
| Cultivars | 5524 |
| SNPs | 8 544 598 |
| Homologous groups | 14 739 |
| Species | 17 |
| PKs and PPs | 1676 |
| E1s, E2s, E3s and DUBs | 1814 |
| Protein phosphorylation sites | 3746 |
| Community-annotated genes | 1005 |
| Articles | 35 717 |
Figure 2.Screenshots of a gene report page, taking gene ‘Os01g0192000’ as an example. (A) Navigation information, providing links to different sections. (B) Expression profiles that are derived entirely from RNA-Seq data. Detailed expression levels under different tissues are provided in the form of box plot. (C) Homologous genes covering 17 plant genomes. To facilitate users to investigate orthologs and paralogs in an interactive manner, phylogenetic tree is plotted by a JavaScript-based web plugin and each node is identified as gene duplication event or speciation event. (D) Variation details. There are multiple SNPs found in this gene and a wealth of information regarding each SNP (for example, ‘OS01SNP04949925’) is provided, including its effects, surrounding genes, allele frequency among 5524 cultivars and the corresponding genotype in each cultivar. (E) Post-translational modification data. (F) Publications associated with this gene. (G) Community-contributed annotations. (H) Genome browser for visualizing different data tracks.