Literature DB >> 19036133

RICD: a rice indica cDNA database resource for rice functional genomics.

Tingting Lu¹, Xuehui Huang, Chuanrang Zhu, Tao Huang, Qiang Zhao, Kabing Xie, Lizhong Xiong, Qifa Zhang, Bin Han.

Abstract

BACKGROUND: The Oryza sativa L. indica subspecies is the most widely cultivated rice. During the last few years, we have collected over 20,000 putative full-length cDNAs and over 40,000 ESTs isolated from various cDNA libraries of two indica varieties Guangluai 4 and Minghui 63. A database of the rice indica cDNAs was therefore built to provide a comprehensive web data source for searching and retrieving the indica cDNA clones.
RESULTS: Rice Indica cDNA Database (RICD) is an online MySQL-PHP driven database with a user-friendly web interface. It allows investigators to query the cDNA clones by keyword, genome position, nucleotide or protein sequence, and putative function. It also provides a series of information, including sequences, protein domain annotations, similarity search results, SNPs and InDels information, and hyperlinks to gene annotation in both The Rice Annotation Project Database (RAP-DB) and The TIGR Rice Genome Annotation Resource, expression atlas in RiceGE and variation report in Gramene of each cDNA.
CONCLUSION: The online rice indica cDNA database provides cDNA resource with comprehensive information to researchers for functional analysis of indica subspecies and for comparative genomics. The RICD database is available through our website http://www.ncgr.ac.cn/ricd.

Entities: Chemical Disease Species

Mesh：

Substances：
DNA, Complementary

Year: 2008 PMID： 19036133 PMCID： PMC2605458 DOI： 10.1186/1471-2229-8-118

Source DB: PubMed Journal: BMC Plant Biol ISSN： 1471-2229 Impact factor: 4.215

Background

Rice is one of the most important crops feeding about half of the world's population. Indica and japonica are two major Asian cultivated rice subspecies, which show distinct divergence from sequence variations to phenotypic changes [1-3]. The rice indica varieties are the most widely cultivated in China, India and most Southeast Asia countries, occupying the largest area of rice production in the world. However, most cDNA resources of the publicly available databases were generated from japonica subspecies [4]. Comparison of genomic DNA sequences has showed a large number of variations in genic regions between indica and japonica [2]. So, collection and analysis of indica cDNAs are essential and urgent. Therefore, we collected over 20,000 full-length cDNAs and over 40,000 5' ESTs isolated from various cDNA libraries of two indica varieties Guangluai 4 and Minghui 63 during the last few years (Table 1), as part of the National Rice Functional Genomics Project of China [5-7]. The cultivar Guangluai 4 is a typical indica variety with the largest growing area during 1968 to 1987 in China [8,9], while the cultivar Minghui 63 is the restore line for lots of planted rice hybrids. Systematic analysis and utilization of the indica cDNA clones would be valuable to rice genetics and breeding. Therefore, a dedicated platform is required for search, characterization and retrieval of the indica cDNA clones.

Table 1

Sources of indica cDNA sequences in the RICD database.

	Number of cDNAs	Number of cDNAs matching IRGSP 4*	Number of cDNAs matching TIGR Gene**
Guangluai 4 FL-cDNA	10,081	9,109 (90.4%)	7,680 (76.2%)
Minghui 63 FL-cDNA	12,727	12,317 (96.8%)	8,209 (64.5%)
Guangluai 4 5'EST	21,690	16,504 (76.1%)	8,700 (40.1%)
Minghui 63 5'EST	27,130	21,059 (77.6%)	15,115 (55.7%)

* IRGSP pseudomolecules Build 4.

** TIGR rice gene annotation version 5

Sources of indica cDNA sequences in the RICD database. * IRGSP pseudomolecules Build 4. ** TIGR rice gene annotation version 5 RICD was developed to provide a handy way to access all the available indica cDNA sources and be integrated with comprehensive information, which includes encoded amino acid sequence information, the mapping information, the protein domain information, the results of similarity search, gene function annotation and so on. Several search and retrieval forms were developed to create a comprehensive data warehouse for rice functional genomics and comparative genomics research.

Construction and content

Database architecture

RICD is a relational database, and was developed using MySQL 4.1. The database was implemented on a server running Red Hat Enterprise Linux AS release 4 (×86). The web interface was constructed by using PHP scripts (PHP Version 4.3), and powered by an Apache server (Apache Version 2.0).

cDNA acquisition

The Oryza sativa L. indica Guangluai-4 cDNAs were generated from five types of cDNA libraries [7]. The libraries included: (1) Seedlings grown in water for two weeks with a cycle of 16 hours light and 8 hours dark at 30°C; (2) Panicles harvested from rice grown in paddy fields; (3) Root grown in water for two weeks with a cycle of 16 hours light and 8 hours dark at 30°C; (4) Two-day germinated shoots and roots collected when roots reached 1–2 cm long; (5) Two-week seedlings treated individually with various stresses, that is, high-salinity (100 mM NaCl, treated for 20 min, 3, 12, 24, 48 h, 3 days and recovered for 72 h), dehydration (15% PEG-4000, treated for the same time duration as high-salinity), cold (60°C for 1, 12, 24, 48 h, 3 days and recovered for 72 h), heat (45°C, for the same time duration as cold), or immersion under water (for 1, 12, 24, 48 h, 3, 5 days). The Oryza sativa L. indica Minghui-63 cDNAs were from a normalized whole-life-cycle cDNA library, which was constructed by use of 15 tissues collected from 9 developmental stages [5].

Sequence analysis

The cDNAs and TIGR rice genes were remapped to IRGSP pseudomolecules Build 4 [3,10] using the GMAP program [11]. Over 90% putative full-length cDNAs of Guangluai 4 and Minghui 63 could be matched to rice IRGSP pseudomolecules, and 76.2% of Guangluai 4 cDNAs and 64.5% of Minghui 63 cDNAs could be matched to TIGR gene loci, respectively. In addition, > 76% 5' ESTs of Guangluai 4 and Minghui 63 were matched to IRGSP pseudomolecules, and > 40% of them could be matched to TIGR gene loci. The cDNAs were annotated according to RAP-DB [12] and TIGR systems [13]. Other sequence analysis was performed as described previously [7].

Utility

RICD provides an interactive and user-friendly web interface for searching the cDNA clones and retrieving their sequences along with other detailed information (Figure 1). The panel on the right of the main page lists hyperlinks to the different search pages, online tools and web pages describing various information of the database. The cDNA component of the database can be searched by multiple ways as described below.

Figure 1

Query and display of cDNA clones in Rice . The query parts are in red, while the display parts are in blue.

BLAST search

The BLAST program (version 2.2) is integrated into RICD for sequence search. Either nucleotide sequence (BLASTN) or protein sequence (TBLASTN) can be used as query to search all indica cDNAs or any subset of the database, with an initial cut-off E value as 0.01 [14]. A typical BLAST result page displays cDNAs matching the query, with their clone ID and the sequence alignments.

Keyword search

The cDNA of the database can be searched directly using clone name, NCBI accession number, RAP gene locus ID, or TIGR gene locus ID. The NCBI accession number commonly starts with the two characters "CT". And two separate 'gene locus ID search' sections, for RAP gene locus ID and TIGR gene locus ID, were given. The querying result will show a list of entries that are linked to the individual cDNA clones.

Chromosome position search

Two tools were designed for users to look up cDNAs in the rice genome. One shows in table form, naming "Clone list". You can directly enter the appointed region to get a list of cDNAs in the region, each of which can be further entered individually. A separate web page is provided for listing the putative indica-specific cDNAs, which cannot be mapped into IRGSP pseudomolecules Build 4. You can also choose a browser tool "mapping view" to visualize the graphic display. The main browser display contains two main parts. On the top are the scales for users to specify a particular region in the genome. On the bottom shows a series of cDNAs in the database, and also RAP-DB and TIGR gene loci. Upon clicking one, users can enter into result display page of the cDNA clone.

Function search

To query the cDNA clones, users can also enter a putative function (bZIP transcription factor, receptor protein kinase, and so on) to retrieve the responding cDNA clones (Figure 2). The putative functions rely on the protein domain Pfam descriptions [15] or RAP-DB gene annotations of any individual cDNA annotated.

Figure 2

A example result page of a putative function query. The figure shows the page displaying Pfam search results for query term "bZIP transcription factor".

cDNA report display

For each cDNA clone, the result display page consists of three sections (Figure 3). The first section provides general information about the cDNA: genome position, clone name, accession number, library, RAP gene locus, TIGR gene locus, expression atlas in RiceGE [16] and variation report in Gramene [17]. The second section provides sequence of the cDNA and its ORF information. The last section includes the information of SNPs and InDels between japonica KOME and indica NCGR FL-cDNA (full-length cDNA) pairs and the results of similarity search with japonica and indica genome, KOME FL-cDNA and NCBI non-redundant databases.

Figure 3

A typical result display page of a cDNA. The information of an indica cDNA clone includes the clone ID, GenBank Accession, library, genome position, gene annotation, expression atlas, variation report, cDNA sequence, ORF information. SNPs and InDels information and the results of similarity search will be shown after clicking corresponding buttons.

Conclusion

The initial aim of RICD is to provide a convenient way to search and retrieve incdia cDNA clone for research community. Via integrating with comprehensive information, it tries to grow to be a platform for broad applications to rice genetics, breeding and comparative genomics. Future expansion plans of the database include recruiting more indica cDNA clones, cataloging splice variants and adding gene expression data from indica cDNA microarray. The RICD database will be updated frequently if more information becomes available.

Availability and requirements

The RICD resource can be freely accessed via . The cDNA clones in the database can be requested for research purpose by contacting Tingting Lu ttlu@ncgr.ac.cn

Authors' contributions

* The authors wish it to be known that, in their opinion, the first three authors should be regarded as joint first authors. TL designed the structure and layouts of database, provided the majority of annotations of sequences and partly drafted the manuscript. XH participated the designing of database, provided some annotation results and mostly drafted the manuscript. CZ developed the database and its web interface. TH and QZ provided the technical supports and discussion. KX, LX and QZ provided the batch sequences of Minghui-63 variety. BH conceived of the study, and participated in its design and helped to draft the manuscript. All coauthors read and approved the final manuscript.

14 in total

Review 1. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors: S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal: Nucleic Acids Res Date: 1997-09-01 Impact factor: 16.971

2. GMAP: a genomic mapping and alignment program for mRNA and EST sequences.

Authors: Thomas D Wu; Colin K Watanabe
Journal: Bioinformatics Date: 2005-02-22 Impact factor: 6.937

3. Genome-wide searching of single-nucleotide polymorphisms among eight distantly and closely related rice cultivars (Oryza sativa L.) and a wild accession (Oryza rufipogon Griff.).

Authors: Lisa Monna; Rieko Ohta; Haruka Masuda; Akiko Koike; Yuzo Minobe
Journal: DNA Res Date: 2006-03-29 Impact factor: 4.458

Review 4. Genome-wide intraspecific DNA-sequence variations in rice.

Authors: Bin Han; Yongbiao Xue
Journal: Curr Opin Plant Biol Date: 2003-04 Impact factor: 7.834

5. Isolation and annotation of 10828 putative full length cDNAs from indica rice.

Authors: Kabin Xie; Jianwei Zhang; Yong Xiang; Qi Feng; Bin Han; Zhaohui Chu; Shiping Wang; Qifa Zhang; Lizhong Xiong
Journal: Sci China C Life Sci Date: 2005-10

6. Features of the expressed sequences revealed by a large-scale analysis of ESTs from a normalized cDNA library of the elite indica rice cultivar Minghui 63.

Authors: Jianwei Zhang; Qi Feng; Caoqing Jin; Deyun Qiu; Lida Zhang; Kabin Xie; Dejun Yuan; Bin Han; Qifa Zhang; Shiping Wang
Journal: Plant J Date: 2005-06 Impact factor: 6.417

7. Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.

Authors: Shoshi Kikuchi; Kouji Satoh; Toshifumi Nagata; Nobuyuki Kawagashira; Koji Doi; Naoki Kishimoto; Junshi Yazaki; Masahiro Ishikawa; Hitomi Yamada; Hisako Ooka; Isamu Hotta; Keiichi Kojima; Takahiro Namiki; Eisuke Ohneda; Wataru Yahagi; Kohji Suzuki; Chao Jie Li; Kenji Ohtsuki; Toru Shishiki; Yasuhiro Otomo; Kazuo Murakami; Yoshiharu Iida; Sumio Sugano; Tatsuto Fujimura; Yutaka Suzuki; Yuki Tsunoda; Takashi Kurosaki; Takeko Kodama; Hiromi Masuda; Michie Kobayashi; Quihong Xie; Min Lu; Ryuya Narikawa; Akio Sugiyama; Kouichi Mizuno; Satoko Yokomizo; Junko Niikura; Rieko Ikeda; Junya Ishibiki; Midori Kawamata; Akemi Yoshimura; Junichirou Miura; Takahiro Kusumegi; Mitsuru Oka; Risa Ryu; Mariko Ueda; Kenichi Matsubara; Jun Kawai; Piero Carninci; Jun Adachi; Katsunori Aizawa; Takahiro Arakawa; Shiro Fukuda; Ayako Hara; Wataru Hashizume; Norihito Hayatsu; Koichi Imotani; Yoshiyuki Ishii; Masayoshi Itoh; Ikuko Kagawa; Shinji Kondo; Hideaki Konno; Ai Miyazaki; Naoki Osato; Yoshimi Ota; Rintaro Saito; Daisuke Sasaki; Kenjiro Sato; Kazuhiro Shibata; Akira Shinagawa; Toshiyuki Shiraki; Masayasu Yoshino; Yoshihide Hayashizaki; Ayako Yasunishi
Journal: Science Date: 2003-07-18 Impact factor: 47.728

8. Sequence and analysis of rice chromosome 4.

Authors: Qi Feng; Yujun Zhang; Pei Hao; Shengyue Wang; Gang Fu; Yucheng Huang; Ying Li; Jingjie Zhu; Yilei Liu; Xin Hu; Peixin Jia; Yu Zhang; Qiang Zhao; Kai Ying; Shuliang Yu; Yesheng Tang; Qijun Weng; Lei Zhang; Ying Lu; Jie Mu; Yiqi Lu; Lei S Zhang; Zhen Yu; Danlin Fan; Xiaohui Liu; Tingting Lu; Can Li; Yongrui Wu; Tongguo Sun; Haiyan Lei; Tao Li; Hao Hu; Jianping Guan; Mei Wu; Runquan Zhang; Bo Zhou; Zehua Chen; Ling Chen; Zhaoqing Jin; Rong Wang; Haifeng Yin; Zhen Cai; Shuangxi Ren; Gang Lv; Wenyi Gu; Genfeng Zhu; Yuefeng Tu; Jia Jia; Yi Zhang; Jie Chen; Hui Kang; Xiaoyun Chen; Chunyan Shao; Yun Sun; Qiuping Hu; Xianglin Zhang; Wei Zhang; Lijun Wang; Chunwei Ding; Haihui Sheng; Jingli Gu; Shuting Chen; Lin Ni; Fenghua Zhu; Wei Chen; Lefu Lan; Ying Lai; Zhukuan Cheng; Minghong Gu; Jiming Jiang; Jiayang Li; Guofan Hong; Yongbiao Xue; Bin Han
Journal: Nature Date: 2002-11-21 Impact factor: 49.962

9. The TIGR Rice Genome Annotation Resource: improvements and new features.

Authors: Shu Ouyang; Wei Zhu; John Hamilton; Haining Lin; Matthew Campbell; Kevin Childs; Françoise Thibaud-Nissen; Renae L Malek; Yuandan Lee; Li Zheng; Joshua Orvis; Brian Haas; Jennifer Wortman; C Robin Buell
Journal: Nucleic Acids Res Date: 2006-12-01 Impact factor: 16.971

10. The Rice Annotation Project Database (RAP-DB): 2008 update.

Authors: Tsuyoshi Tanaka; Baltazar A Antonio; Shoshi Kikuchi; Takashi Matsumoto; Yoshiaki Nagamura; Hisataka Numa; Hiroaki Sakai; Jianzhong Wu; Takeshi Itoh; Takuji Sasaki; Ryo Aono; Yasuyuki Fujii; Takuya Habara; Erimi Harada; Masako Kanno; Yoshihiro Kawahara; Hiroaki Kawashima; Hiromi Kubooka; Akihiro Matsuya; Hajime Nakaoka; Naomi Saichi; Ryoko Sanbonmatsu; Yoshiharu Sato; Yuji Shinso; Mami Suzuki; Jun-ichi Takeda; Motohiko Tanino; Fusano Todokoro; Kaori Yamaguchi; Naoyuki Yamamoto; Chisato Yamasaki; Tadashi Imanishi; Toshihisa Okido; Masahito Tada; Kazuho Ikeo; Yoshio Tateno; Takashi Gojobori; Yao-Cheng Lin; Fu-Jin Wei; Yue-ie Hsing; Qiang Zhao; Bin Han; Melissa R Kramer; Richard W McCombie; David Lonsdale; Claire C O'Donovan; Eleanor J Whitfield; Rolf Apweiler; Kanako O Koyanagi; Jitendra P Khurana; Saurabh Raghuvanshi; Nagendra K Singh; Akhilesh K Tyagi; Georg Haberer; Masaki Fujisawa; Satomi Hosokawa; Yukiyo Ito; Hiroshi Ikawa; Michie Shibata; Mayu Yamamoto; Richard M Bruskiewich; Douglas R Hoen; Thomas E Bureau; Nobukazu Namiki; Hajime Ohyanagi; Yasumichi Sakai; Satoshi Nobushima; Katsumi Sakata; Roberto A Barrero; Yutaka Sato; Alexandre Souvorov; Brian Smith-White; Tatiana Tatusova; Suyoung An; Gynheung An; Satoshi OOta; Galina Fuks; Galina Fuks; Joachim Messing; Karen R Christie; Damien Lieberherr; HyeRan Kim; Andrea Zuccolo; Rod A Wing; Kan Nobuta; Pamela J Green; Cheng Lu; Blake C Meyers; Cristian Chaparro; Benoit Piegu; Olivier Panaud; Manuel Echeverria
Journal: Nucleic Acids Res Date: 2007-12-17 Impact factor: 16.971

4 in total

1. Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63.

Authors: Jianwei Zhang; Ling-Ling Chen; Feng Xing; David A Kudrna; Wen Yao; Dario Copetti; Ting Mu; Weiming Li; Jia-Ming Song; Weibo Xie; Seunghee Lee; Jayson Talag; Lin Shao; Yue An; Chun-Liu Zhang; Yidan Ouyang; Shuai Sun; Wen-Biao Jiao; Fang Lv; Bogu Du; Meizhong Luo; Carlos Ernesto Maldonado; Jose Luis Goicoechea; Lizhong Xiong; Changyin Wu; Yongzhong Xing; Dao-Xiu Zhou; Sibin Yu; Yu Zhao; Gongwei Wang; Yeisoo Yu; Yijie Luo; Zhi-Wei Zhou; Beatriz Elena Padilla Hurtado; Ann Danowitz; Rod A Wing; Qifa Zhang
Journal: Proc Natl Acad Sci U S A Date: 2016-08-17 Impact factor: 11.205