Literature DB >> 16381971

The Rice Annotation Project Database (RAP-DB): hub for Oryza sativa ssp. japonica genome information.

Hajime Ohyanagi1, Tsuyoshi Tanaka, Hiroaki Sakai, Yasumasa Shigemoto, Kaori Yamaguchi, Takuya Habara, Yasuyuki Fujii, Baltazar A Antonio, Yoshiaki Nagamura, Tadashi Imanishi, Kazuho Ikeo, Takeshi Itoh, Takashi Gojobori, Takuji Sasaki.   

Abstract

With the completion of the rice genome sequencing, a standardized annotation is necessary so that the information from the genome sequence can be fully utilized in understanding the biology of rice and other cereal crops. An annotation jamboree was held in Japan with the aim of annotating and manually curating all the genes in the rice genome. Here we present the Rice Annotation Project Database (RAP-DB), which has been developed to provide access to the annotation data. The RAP-DB has two different types of annotation viewers, BLAST and BLAT search, and other useful features. By connecting the annotations to other rice genomics data, such as full-length cDNAs and Tos17 mutant lines, the RAP-DB serves as a hub for rice genomics. All of the resources can be accessed through http://rapdb.lab.nig.ac.jp/.

Entities:  

Mesh:

Year:  2006        PMID: 16381971      PMCID: PMC1347456          DOI: 10.1093/nar/gkj094

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Rice is considered a model cereal plant because of its small genome size and high degree of chromosomal co-linearity with other major cereal crops such as maize, wheat, barley and sorghum (1,2). The International Rice Genome Sequencing Project (IRGSP), a consortium of publicly funded laboratories from 10 countries, initiated the sequencing of Oryza sativa ssp. japonica cultivar Nipponbare in 1998 using the clone-by-clone sequencing strategy (2). In 2004, the finished-quality sequence of the entire genome was completed and is now available in the public domain (3). The annotation of the sequence is indispensable in understanding the overall structure and function of the rice genome. However, most of the annotations of the rice genome sequences were obtained by automated methods. Although this provides an overview of the composition of the genes that comprise the genome, limitations in prediction programs often result in probable errors and artifacts among predicted genes. Therefore, in concordance with the completion of the rice genome sequence, the Rice Annotation Project (RAP) was organized in 2004 (T. Itoh et al., manuscript in preparation) with the aim of providing standardized and highly accurate annotations of the rice genome. To facilitate efficient management of the results of annotation and to establish a platform for integrating the data with other rice resources, an annotation database called the RAP Database (RAP-DB) was developed. The RAP-DB integrates the IRGSP genome sequence and the RAP annotations with other data on rice researches, and makes them available to the public through HTTP access.

DATABASE CONTENTS

The RAP-DB contains the IRGSP genome sequence (build 3 assembly) (3) and the RAP loci with corresponding locus IDs representing the annotated genes. Each locus has one or more variant transcript(s) as RAP annotated sequence(s). Predicted protein-coding regions were also employed as RAP predicted loci. The TIGR-transcripts derived from the annotations on the TIGR assembly (4) were added to the RAP-DB by mapping them to the IRGSP genome. Each RAP transcript has the following links: Gene Ontology, motif domain information, full-length cDNA information (5) and so on. Among them, full-length cDNAs are anticipated to be invaluable for rice researches (Figure 1) by providing good evidence of physical clones, and facilitating future experimental researches. Hyperlinks to the Tos17-flanking sequence positions on the chromosomes (6) should be quite useful for application in clarifying gene functions (Figure 1). The RAP-DB also contains a repeat-masked version of the IRGSP genome sequence build 3 as the reference genome sequence for the annotations.
Figure 1

Overview of the hub for rice genome information. The RAP-DB contains the high-quality rice genome sequence generated by the IRGSP, curated annotations of identified or predicted genes, and links to other databases (represented by small blue cylinders). Integrative access to all the information (represented by large blue cylinders), such as experimental evidence, is facilitated.

SYSTEM ARCHITECTURE

The RAP-DB was implemented on PC servers with RedHat Enterprise Linux ES Version 3, Apache web server, MySQL Database server and GBrowse (7). Other common utilities for UNIX were appropriately installed on the servers if necessary. In order to implement the G-integra system, a modified version provided from the H-Invitational Database (H-InvDB) (8) was used. All of the RAP-DB resources are stored in the servers and available through HTTP access.

DATA ACCESS

The primary concept of the RAP-DB is to provide simple access for the IRGSP genome sequence and the RAP annotations. Furthermore, the RAP-DB enables integrative access for other rice resources, which will establish a hub for O.sativa ssp. japonica genomics (Figure 1). One of the entry points of the database is search by keywords ().Descriptions and IDs () of the annotations are searched. The other entry points are sequence similarity searches (for details see below).

Annotation browser

All the descriptions of the functional annotations and other related information can be viewed through GBrowse (Figure 2A and B), which provides the main features of the RAP-DB and gives chromosome-oriented access (Figure 2A) for the genome sequence and the annotations. Results of keyword or sequence similarity search are automatically hyperlinked to corresponding annotations stored in GBrowse. GBrowse is a Generic Genome Browser originally developed by Stein et al. (7) whose characteristics are a combination of a relational database and interactive web pages for manipulating and displaying annotations on genomes. An annotation table corresponding to each transcript is also available by clicking on each glyph (Figure 2B). The table is composed of multiple rows that includes Gene Ontology information, motif domain information and so on. Links are provided to other useful databases such as the full-length cDNAs (5) and Tos17 mutant lines (6), and thereby the RAP-DB functions as a hub for rice genome information. Moreover, SVG images are generated, so that the user can edit the graphics of the genomic view.
Figure 2

Flowchart of RAP-DB browsing. Users can search the rice genome annotations by chromosomal position, ID or keyword. Sequence similarity search by RAP-BLAST or RAP-BLAT is also available (see text). (A) A graphical view of the RAP annotated loci and sequences, Tos17-flanking positions, and other tracks illustrated by GBrowse. (B) An annotation table corresponding to the sequence with hyperlinks to other databases. (C) Browsing a precise genomic view by G-integra.

Genome viewer

Genome-scale view of the annotation and comparison of transcripts with those of other species are available through the G-integra system (Figure 2C), which was originally developed as a part of the H-InvDB (8). G-integra is implemented so as to facilitate parallel access for the RAP annotations and numbers of tracks for other species (cDNAs and expressed sequence tags of representative monocots and Arabidopsis thaliana and the like). G-integra and GBrowse are reciprocally hyperlinked and hence the user can easily access both information.

Sequence similarity search

To facilitate access by sequence similarities, two alternative search methods are available (Figure 2). One is BLAT for aligning a given DNA against the genome (9). Hits reported by BLAT are automatically hyperlinked to the corresponding regions in GBrowse. The other is BLAST (10), which is used for searching transcripts and open reading frames. Hits reported by BLAST are automatically hyperlinked to the corresponding annotation tables in GBrowse.

Distributed annotation system (DAS)

Although we wish to use the IRGSP genome and the RAP annotations as the standard references for future rice genomics, it will be of the rice community's benefit to utilize them for third party annotations. Therefore, we made them available through the DAS protocol (11). The URL for the IRGSP genome reference server is .

FUTURE DIRECTION

The annotations of the rice genome sequence will be updated as the genome sequence and cDNA sequences are revised. The latest version of the high-quality rice genome sequence (build 4 assembly) has been released recently (T. Sasaki, personal communication). This assembly will be used to update the manual curation of annotation in conjunction with the Second RAP Meeting (RAP2). It is therefore expected to generate additional loci as well as modifications on previous annotations. In addition, we will increase the links for other valuable databases to provide multiple access to various genome information. The RAP-DB will be a bridge to connect the rice genome informatics and the experimental genomics, and an important hub for rice genomics.
  11 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  The generic genome browser: a building block for a model organism system database.

Authors:  Lincoln D Stein; Christopher Mungall; ShengQiang Shu; Michael Caudy; Marco Mangone; Allen Day; Elizabeth Nickerson; Jason E Stajich; Todd W Harris; Adrian Arva; Suzanna Lewis
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

3.  Target site specificity of the Tos17 retrotransposon shows a preference for insertion within genes and against insertion in retrotransposon-rich regions of the genome.

Authors:  Akio Miyao; Katsuyuki Tanaka; Kazumasa Murata; Hiromichi Sawaki; Shin Takeda; Kiyomi Abe; Yoriko Shinozuka; Katsura Onosato; Hirohiko Hirochika
Journal:  Plant Cell       Date:  2003-08       Impact factor: 11.277

Review 4.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

Review 5.  Cereal genome evolution. Grasses, line up and form a circle.

Authors:  G Moore; K M Devos; Z Wang; M D Gale
Journal:  Curr Biol       Date:  1995-07-01       Impact factor: 10.834

6.  The institute for genomic research Osa1 rice genome annotation database.

Authors:  Qiaoping Yuan; Shu Ouyang; Aihui Wang; Wei Zhu; Rama Maiti; Haining Lin; John Hamilton; Brian Haas; Razvan Sultana; Foo Cheung; Jennifer Wortman; C Robin Buell
Journal:  Plant Physiol       Date:  2005-05       Impact factor: 8.340

7.  The map-based sequence of the rice genome.

Authors: 
Journal:  Nature       Date:  2005-08-11       Impact factor: 49.962

8.  Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.

Authors:  Shoshi Kikuchi; Kouji Satoh; Toshifumi Nagata; Nobuyuki Kawagashira; Koji Doi; Naoki Kishimoto; Junshi Yazaki; Masahiro Ishikawa; Hitomi Yamada; Hisako Ooka; Isamu Hotta; Keiichi Kojima; Takahiro Namiki; Eisuke Ohneda; Wataru Yahagi; Kohji Suzuki; Chao Jie Li; Kenji Ohtsuki; Toru Shishiki; Yasuhiro Otomo; Kazuo Murakami; Yoshiharu Iida; Sumio Sugano; Tatsuto Fujimura; Yutaka Suzuki; Yuki Tsunoda; Takashi Kurosaki; Takeko Kodama; Hiromi Masuda; Michie Kobayashi; Quihong Xie; Min Lu; Ryuya Narikawa; Akio Sugiyama; Kouichi Mizuno; Satoko Yokomizo; Junko Niikura; Rieko Ikeda; Junya Ishibiki; Midori Kawamata; Akemi Yoshimura; Junichirou Miura; Takahiro Kusumegi; Mitsuru Oka; Risa Ryu; Mariko Ueda; Kenichi Matsubara; Jun Kawai; Piero Carninci; Jun Adachi; Katsunori Aizawa; Takahiro Arakawa; Shiro Fukuda; Ayako Hara; Wataru Hashizume; Norihito Hayatsu; Koichi Imotani; Yoshiyuki Ishii; Masayoshi Itoh; Ikuko Kagawa; Shinji Kondo; Hideaki Konno; Ai Miyazaki; Naoki Osato; Yoshimi Ota; Rintaro Saito; Daisuke Sasaki; Kenjiro Sato; Kazuhiro Shibata; Akira Shinagawa; Toshiyuki Shiraki; Masayasu Yoshino; Yoshihide Hayashizaki; Ayako Yasunishi
Journal:  Science       Date:  2003-07-18       Impact factor: 47.728

9.  The distributed annotation system.

Authors:  R D Dowell; R M Jokerst; A Day; S R Eddy; L Stein
Journal:  BMC Bioinformatics       Date:  2001-10-10       Impact factor: 3.169

10.  Integrative annotation of 21,037 human genes validated by full-length cDNA clones.

Authors:  Tadashi Imanishi; Takeshi Itoh; Yutaka Suzuki; Claire O'Donovan; Satoshi Fukuchi; Kanako O Koyanagi; Roberto A Barrero; Takuro Tamura; Yumi Yamaguchi-Kabata; Motohiko Tanino; Kei Yura; Satoru Miyazaki; Kazuho Ikeo; Keiichi Homma; Arek Kasprzyk; Tetsuo Nishikawa; Mika Hirakawa; Jean Thierry-Mieg; Danielle Thierry-Mieg; Jennifer Ashurst; Libin Jia; Mitsuteru Nakao; Michael A Thomas; Nicola Mulder; Youla Karavidopoulou; Lihua Jin; Sangsoo Kim; Tomohiro Yasuda; Boris Lenhard; Eric Eveno; Yoshiyuki Suzuki; Chisato Yamasaki; Jun-ichi Takeda; Craig Gough; Phillip Hilton; Yasuyuki Fujii; Hiroaki Sakai; Susumu Tanaka; Clara Amid; Matthew Bellgard; Maria de Fatima Bonaldo; Hidemasa Bono; Susan K Bromberg; Anthony J Brookes; Elspeth Bruford; Piero Carninci; Claude Chelala; Christine Couillault; Sandro J de Souza; Marie-Anne Debily; Marie-Dominique Devignes; Inna Dubchak; Toshinori Endo; Anne Estreicher; Eduardo Eyras; Kaoru Fukami-Kobayashi; Gopal R Gopinath; Esther Graudens; Yoonsoo Hahn; Michael Han; Ze-Guang Han; Kousuke Hanada; Hideki Hanaoka; Erimi Harada; Katsuyuki Hashimoto; Ursula Hinz; Momoki Hirai; Teruyoshi Hishiki; Ian Hopkinson; Sandrine Imbeaud; Hidetoshi Inoko; Alexander Kanapin; Yayoi Kaneko; Takeya Kasukawa; Janet Kelso; Paul Kersey; Reiko Kikuno; Kouichi Kimura; Bernhard Korn; Vladimir Kuryshev; Izabela Makalowska; Takashi Makino; Shuhei Mano; Regine Mariage-Samson; Jun Mashima; Hideo Matsuda; Hans-Werner Mewes; Shinsei Minoshima; Keiichi Nagai; Hideki Nagasaki; Naoki Nagata; Rajni Nigam; Osamu Ogasawara; Osamu Ohara; Masafumi Ohtsubo; Norihiro Okada; Toshihisa Okido; Satoshi Oota; Motonori Ota; Toshio Ota; Tetsuji Otsuki; Dominique Piatier-Tonneau; Annemarie Poustka; Shuang-Xi Ren; Naruya Saitou; Katsunaga Sakai; Shigetaka Sakamoto; Ryuichi Sakate; Ingo Schupp; Florence Servant; Stephen Sherry; Rie Shiba; Nobuyoshi Shimizu; Mary Shimoyama; Andrew J Simpson; Bento Soares; Charles Steward; Makiko Suwa; Mami Suzuki; Aiko Takahashi; Gen Tamiya; Hiroshi Tanaka; Todd Taylor; Joseph D Terwilliger; Per Unneberg; Vamsi Veeramachaneni; Shinya Watanabe; Laurens Wilming; Norikazu Yasuda; Hyang-Sook Yoo; Marvin Stodolsky; Wojciech Makalowski; Mitiko Go; Kenta Nakai; Toshihisa Takagi; Minoru Kanehisa; Yoshiyuki Sakaki; John Quackenbush; Yasushi Okazaki; Yoshihide Hayashizaki; Winston Hide; Ranajit Chakraborty; Ken Nishikawa; Hideaki Sugawara; Yoshio Tateno; Zhu Chen; Michio Oishi; Peter Tonellato; Rolf Apweiler; Kousaku Okubo; Lukas Wagner; Stefan Wiemann; Robert L Strausberg; Takao Isogai; Charles Auffray; Nobuo Nomura; Takashi Gojobori; Sumio Sugano
Journal:  PLoS Biol       Date:  2004-04-20       Impact factor: 8.029

View more
  85 in total

1.  Deep RNA sequencing at single base-pair resolution reveals high complexity of the rice transcriptome.

Authors:  Guojie Zhang; Guangwu Guo; Xueda Hu; Yong Zhang; Qiye Li; Ruiqiang Li; Ruhong Zhuang; Zhike Lu; Zengquan He; Xiaodong Fang; Li Chen; Wei Tian; Yong Tao; Karsten Kristiansen; Xiuqing Zhang; Songgang Li; Huanming Yang; Jian Wang; Jun Wang
Journal:  Genome Res       Date:  2010-03-19       Impact factor: 9.043

2.  Dramatic amplification of a rice transposable element during recent domestication.

Authors:  Ken Naito; Eunyoung Cho; Guojun Yang; Matthew A Campbell; Kentaro Yano; Yutaka Okumoto; Takatoshi Tanisaka; Susan R Wessler
Journal:  Proc Natl Acad Sci U S A       Date:  2006-11-13       Impact factor: 11.205

3.  Plant Gene and Alternatively Spliced Variant Annotator. A plant genome annotation pipeline for rice gene and alternatively spliced variant identification with cross-species expressed sequence tag conservation from seven plant species.

Authors:  Feng-Chi Chen; Sheng-Shun Wang; Shu-Miaw Chaw; Yao-Ting Huang; Trees-Juen Chuang
Journal:  Plant Physiol       Date:  2007-01-12       Impact factor: 8.340

4.  Isolation and characterization of a male sterility gene homolog BcMS2 from Chinese cabbage-pak-choi that expressing in an anther-specific manner.

Authors:  Ming Jiang; Jiashu Cao
Journal:  Mol Biol Rep       Date:  2007-05-20       Impact factor: 2.316

5.  A community-based annotation framework for linking solanaceae genomes with phenomes.

Authors:  Naama Menda; Robert M Buels; Isaak Tecle; Lukas A Mueller
Journal:  Plant Physiol       Date:  2008-06-06       Impact factor: 8.340

6.  Comparative transcriptional analysis reveals differential gene expression between Sand Daffodil tissues.

Authors:  Bruna De Felice; Francesco Manfellotto; Raffaella D'Alessandro; Olga De Castro; Antonietta Di Maio; Marco Trifuoggi
Journal:  Genetica       Date:  2013-10-20       Impact factor: 1.082

7.  A genome-wide gain-of function analysis of rice genes using the FOX-hunting system.

Authors:  Hidemitsu Nakamura; Makoto Hakata; Kou Amano; Akio Miyao; Naoko Toki; Mariko Kajikawa; Jinhuan Pang; Naokuni Higashi; Shigeko Ando; Seiichi Toki; Miki Fujita; Akiko Enju; Motoaki Seki; Miki Nakazawa; Takanari Ichikawa; Kazuo Shinozaki; Minami Matsui; Yoshiaki Nagamura; Hirohiko Hirochika; Hiroaki Ichikawa
Journal:  Plant Mol Biol       Date:  2007-10-10       Impact factor: 4.076

8.  Characterization of rice tryptophan decarboxylases and their direct involvement in serotonin biosynthesis in transgenic rice.

Authors:  Sei Kang; Kiyoon Kang; Kyungjin Lee; Kyoungwhan Back
Journal:  Planta       Date:  2007-09-01       Impact factor: 4.116

9.  A putative autonomous 20.5 kb-CACTA transposon insertion in an F3'H allele identifies a new CACTA transposon subfamily in Glycine max.

Authors:  Gracia Zabala; Lila Vodkin
Journal:  BMC Plant Biol       Date:  2008-12-02       Impact factor: 4.215

10.  Ectopic overexpression of the transcription factor OsGLK1 induces chloroplast development in non-green rice cells.

Authors:  Hidemitsu Nakamura; Masayuki Muramatsu; Makoto Hakata; Osamu Ueno; Yoshiaki Nagamura; Hirohiko Hirochika; Makoto Takano; Hiroaki Ichikawa
Journal:  Plant Cell Physiol       Date:  2009-10-06       Impact factor: 4.927

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.