| Literature DB >> 23299411 |
Hiroaki Sakai1, Sung Shin Lee, Tsuyoshi Tanaka, Hisataka Numa, Jungsok Kim, Yoshihiro Kawahara, Hironobu Wakimoto, Ching-chia Yang, Masao Iwamoto, Takashi Abe, Yuko Yamada, Akira Muto, Hachiro Inokuchi, Toshimichi Ikemura, Takashi Matsumoto, Takuji Sasaki, Takeshi Itoh.
Abstract
The Rice Annotation Project Database (RAP-DB, http://rapdb.dna.affrc.go.jp/) has been providing a comprehensive set of gene annotations for the genome sequence of rice, Oryza sativa (japonica group) cv. Nipponbare. Since the first release in 2005, RAP-DB has been updated several times along with the genome assembly updates. Here, we present our newest RAP-DB based on the latest genome assembly, Os-Nipponbare-Reference-IRGSP-1.0 (IRGSP-1.0), which was released in 2011. We detected 37,869 loci by mapping transcript and protein sequences of 150 monocot species. To provide plant researchers with highly reliable and up to date rice gene annotations, we have been incorporating literature-based manually curated data, and 1,626 loci currently incorporate literature-based annotation data, including commonly used gene names or gene symbols. Transcriptional activities are shown at the nucleotide level by mapping RNA-Seq reads derived from 27 samples. We also mapped the Illumina reads of a Japanese leading japonica cultivar, Koshihikari, and a Chinese indica cultivar, Guangluai-4, to the genome and show alignments together with the single nucleotide polymorphisms (SNPs) and gene functional annotations through a newly developed browser, Short-Read Assembly Browser (S-RAB). We have developed two satellite databases, Plant Gene Family Database (PGFD) and Integrative Database of Cereal Gene Phylogeny (IDCGP), which display gene family and homologous gene relationships among diverse plant species. RAP-DB and the satellite databases offer simple and user-friendly web interfaces, enabling plant and genome researchers to access the data easily and facilitating a broad range of plant research topics.Entities:
Mesh:
Year: 2013 PMID: 23299411 PMCID: PMC3583025 DOI: 10.1093/pcp/pcs183
Source DB: PubMed Journal: Plant Cell Physiol ISSN: 0032-0781 Impact factor: 4.927
Statistics of rice genes
| Loci with expression evidence | 37,869 |
| Protein-coding loci | 33,279 |
| Non-protein coding loci | 2,190 |
| | 2,400 |
| Alternative variants | 6,667 |
| 8,121 |
Fig. 1Gene map view. In RNA-Seq tracks, gene expression levels are shown as a histogram generated from the depth of mapped reads. At the bottom, two tracks, ‘Koshihikari’ and ‘Guangluai-4’, are linked to the Short-Read Assembly Browser.
Fig. 2Gene details view. The ‘Details’ tab shows the detailed annotation information including the literature-based manual curation data. The ‘DB references’ tab provides hyperlinks to the external databases including the two satellite databases, Plant Gene Family Database and Integrative Database of Cereal Gene Phylogeny.
Fig. 3Short-Read Assembly Browser (S-RAB). The S-RAB window consists of (i) a Gene information view (upper left); (ii) an Alignment view (bottom); and (iii) a Short-read information view (upper right).
Fig. 4Integrative Database of Cereal Gene Phylogeny (IDCGP). (A) Main window of IDCGP showing the phylogenetic trees, hyperlinks to the downloadable data and buttons for adding sequences and editing multiple sequence alignments. (B) A window for editing multiple sequence alignments.