Literature DB >> 17916574

Shanghai RAPESEED Database: a resource for functional genomics studies of seed development and fatty acid metabolism of Brassica.

Guo-Zhang Wu1, Qiu-Ming Shi, Ya Niu, Mei-Qing Xing, Hong-Wei Xue.   

Abstract

The Shanghai RAPESEED Database (RAPESEED, http://rapeseed.plantsignal.cn/) was created to provide the solid platform for functional genomics studies of oilseed crops with the emphasis on seed development and fatty acid metabolism. The RAPESEED includes the resource of 8462 unique ESTs, of which 3526 clones are with full length cDNA; the expression profiles of 8095 genes and the Serial Analysis of Gene Expression (SAGE, 23,895 unique tags) and tag-to-gene data during seed development. In addition, a total of approximately 14,700 M3 mutant populations were generated by ethylmethanesulfonate (EMS) mutagenesis and related seed quality information was determined using the Foss NIR System. Further, the TILLING (Targeting Induced Local Lesions IN Genomes) platform was established based on the generated EMS mutant population. The relevant information was collected in RAPESEED database, which can be searched through keywords, nucleotide or protein sequences, or seed quality parameters, and downloaded.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17916574      PMCID: PMC2238842          DOI: 10.1093/nar/gkm780

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Brassica species, including Brassica napus, B. oleracea and B. rapa, are important vegetable and oilseed crops. Seed of B. napus contains abundant fatty acids such as oleic acid and linoleic acid, and serves as one of the main sources of plant oil for human daily life. As the second oilseed crop in the world, compared to soybean, numerous studies have been performed on B. napus with the focus on yield increase and quality improvement through genetic breeding (1–3), and development of genetics and molecular biology tools has significantly contributed to the relevant studies. However, the deficiency of genome sequences and the detailed genomic information counteract the functional genomics studies of rapeseed, especially the genetics approach through T-DNA tagged mutant, of which the gene sequences and expression information are required (4–6). Thus, the increased information of gene sequences and expression profiles will be beneficial for rapeseed functional genomics studies and further bioengineering. Currently, several databases for Brassica species are available. Handa reported a database functions on B. napus mitochondrial genome and the comparative analysis to Arabidopsis (7). The EST and fatty acid metabolism sources of B. napus are available in KEGG database (8). Love et al. described a database (http://hornbill.cspp.latrobe.edu.cn) incorporating the Brassica EST, Gene Ontology (GO) annotation and information of simple sequence repeat molecular markers, of which the EST, microarray (the expression profiles of 7000 uni-genes in vegetative tissues root and leaf), and MarkerQTL information were updated (9–11), providing a useful tool for Brassica research. However, there is still no resource providing the gene expression information and the SAGE data during the reproductive development of B. napus yet. In addition, there is also no database providing bio-sources including full-length cDNA or mutants. Based on constructing cDNA libraries using seed materials at various developmental stages, large-scale sequencing, generation of glass-based cDNA microarray and hybridization, we are able to obtain the dataset of B. napus ESTs and the relevant expression profiles during seed development, which are incorporated into Shanghai RAPESEED (RAPESEED) Database, providing a solid platform for functional genomics studies of oilseed crops. In addition, the Serial Analysis of Gene Expression (SAGE) during seed development and mutant population by ethylmethanesulfonate (EMS) mutagenesis will further facilitate the relevant studies. These will significantly enrich the relevant resources and improve the studies of the area.

SYSTEM ARCHITECTURE AND IMPLEMENTATION

RAPESEED was constructed on the platform of Sun Solaris 9 operating system and TomCat5.5 Web server. The database was implemented using a database management system MySQL5.0.20 (12). RAPESEED has been set up on a World Wide Web server allowing internet access with a web client.

SOURCES IN RAPESEED

The purpose of the RAPESEED is to provide helpful informaitons of EST, gene expression profiles and bioresources (full-length cDNA, TILLING population) to researchers and to promote the functional genomics studies and quality breeding of Brassica crops (Table 1).
Table 1.

Available resource of RAPESEED

Resource
Unique EST8462; relevant expression profiles during seed development
Full-length cDNA3526
Unique SAGE tag23 895
Tag-to-gene502 tags to B. napus
860 tags to Brassica
M3 EMS mutant individuals∼14 700
Available resource of RAPESEED

ESTs, the annotation and Gene Ontology classification

RAPESEED contains 8462 unique ESTs of B. napus, of which 6892 were functionally annotated by sequence similarity to entries within GenBank (using BLAST with a cut-off valve of E-value <10−5, 13). GO classification was performed via mapping to Uniprot to get the GO annotation. In addition, 3526 clones were full-length cDNA among the 8462 unique ESTs.

Gene expression profiles during seed development

Gene expression data for 8095 of 8462 genes during B. napus seed development are studied through cDNA microarray hybridization and available in RAPESEED, which include the expression profiles of relevant genes at 7, 9, 12, 17, 19, 21, 25, 31 day after pollination (DAP), normalized to 3 DAP (1.0).

Serial Analysis of Gene Expression (SAGE) data

A total of 23 895 unique tags from B. napus immature seeds (5 and 9 DAP) were obtained after sequencing and deposited into RAPESEED. Based on the virtual tags derived from GenBank and full-length cDNA from our cDNA library, the ‘tag-to-gene mapping database’ for B. napus and Brassica including B. oleracea and B. rapa were constructed. The whole SAGE dataset can be downloaded from RAPESEED.

EMS mutant population and seed quality measurement

The mutant population mutagenesized with ethylmethanesulfonate (EMS) was generated using B. napus (huyou-15). A population of ∼14 700 M3 individuals was generated and relevant seed qualities were measured and analyzed using the Foss NIR System. The seed quality parameters including the content of glucosinolates, protein, total fat, erucic acid, oleic acid, linoleic acid and eicosatetraenoic acid in seeds (Table 2), providing a useful source for rapeseed quality studies.
Table 2.

The summary of the seed quality measurement of EMS mutagenesized population through NIR analysis

GlucosinolatesProteinLipid contentErucic acidOleic acidLinoleic acidEicosatetraenoic acid
WT (Average)24.02 ± 2.1724.74 ± 0.2539.39 ± 0.427.03 ± 1.2356.28 ± 1.6619.62 ± 0.391.27 ± 0.77
WT (Maximum)31.3725.4742.439.4259.9121.142.97
WT (Minimum)18.9921.7238.263.7152.8618.60−0.49
Criteria>40<10>30<20>44>20<1>60>25<10>5
Numbers110113856193835531251254432117365

Parameters of seed qualities are indicated as the content (%) in seeds. Data are presented as mean ± SD (n = 77 for wild type).

The summary of the seed quality measurement of EMS mutagenesized population through NIR analysis Parameters of seed qualities are indicated as the content (%) in seeds. Data are presented as mean ± SD (n = 77 for wild type). Based on the EMS mutant population, a TILLING (Targeting Induced Local Lesions IN Genomes) platform was established.

DATABASE QUERY AND USER INTERFASE

BLAST and SEARCH

The genes of interest or EST information can be identified through the BLAST program (13). Three different BLAST programs including BLASTn, tBLASTn and tBLASTx were provided, of which the BLAST parameters can be freely chosen by users including E-value or number of alignments. The typical result page contains the collected information of the available genes in RAPESEED, including the clone ID, GenBank Accession, expression profiles during seed development, gene annotation and related E-value and score, full-length information and DNA sequences (Figure 1).
Figure 1.

Typical result page of query. The information of a gene including the clone ID, GenBank Accession, expression profiles during seed development, gene annotation and related E-value and score, full-length information and EST sequences will be shown after search.

Typical result page of query. The information of a gene including the clone ID, GenBank Accession, expression profiles during seed development, gene annotation and related E-value and score, full-length information and EST sequences will be shown after search. In addition, RAPESEED provides a key word SEARCH program to facilitate the gene identification. Clone ID, GenBank Accession or key word can be selected for search. The complex SEARCH of two search items combination is also supported through and/or option.

Resource download and order from RAPESEED

All the dataset including those for whole EST sequences and corresponding annotations against GenBank, SAGE data and related tag-to-gene information, raw data of the microarray hybridization and seed qualities of EMS population (Table 2) can be downloaded. Furthermore, all listed sources of RAPESEED were available, especially the full-length cDNA clones and EMS lines with altered seed qualities, which can be ordered under the accordance of ‘Biological Material Transfer Agreement’.

AVAILABILITY

RAPESEED can be freely accessed at http://rapeseed.plantsignal.cn via the World Wide Web. We have developed a mature data management system, and all newly released information will be announced on the website. Besides ordering the interested clones or lines, users can contact us for any suggestions or questions through internet or send the comments to rapeseed@sibs.ac.cn.

FUTURE DEVELOPMENTS

RAPESEED is a database developed for B. napus functional genomics studies, and we will do our best to continuously optimize the system and supply new records, to make it an integrated resource (Figure 2) and provide a base for Brassica research and breeding. Besides the gene identification and expression profiles, we will focus on the following in the future.
Figure 2.

Structures and development highlights of RAPESEED. The parts in red are the main directions of RAPESEED development.

Structures and development highlights of RAPESEED. The parts in red are the main directions of RAPESEED development.

TILLING population

TILLING is a useful reverse genetic technique that is a powerful, non-transgenic and unbiased targeted mutagenesis method. It can help to identify single point mutations in genes of interest (14), and is suitable for mutation site identification of the EMS lines, which is appropriate and helpful for further crop breeding. Based on the developed TILLING population of M3 EMS lines of B. napus, an amphidiploid (AACC) consisting of the genomes of B. rapa (AA) and B. oleracea (CC), we will further develop a TILLING population (>4000 M2) of diploid (B. rapa) for Brassica breeding studies. In addition, a new B. napus EMS population (M2) has already been developed this year. We will develop pooled DNAs of ∼3000 M2 plants every year and relevant seed qualities will be measured. The screening data and seed qualities information of them will be deposited into RAPESEED annually.

Comparative gene expression profiles of B. napus to B. oleracea and B. rapa

Brassica napus (∼1200 mb) consists of that from B. rapa (700 mb) and B. oleracea (500 mb), and differences between them are obvious in many aspect such as plant morphology, seed oil content and self-productivity. This suggested that the gene expression profiles were changed dramatically during breeding processes, and investigations on these differences will surely provide the information on the regulatory network of the gene expression and the relevant cross-talk. The related information will be available soon.
  10 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  Targeted screening for induced mutations.

Authors:  C M McCallum; L Comai; E A Greene; S Henikoff
Journal:  Nat Biotechnol       Date:  2000-04       Impact factor: 54.908

4.  Inhibition of the glucose-6-phosphate transporter in oilseed rape (Brassica napus L.) plastids by acyl-CoA thioesters reduces fatty acid synthesis.

Authors:  S R Fox; L M Hill; S Rawsthorne; M J Hills
Journal:  Biochem J       Date:  2000-12-01       Impact factor: 3.857

5.  Overexpression of 3-ketoacyl-acyl-carrier protein synthase IIIs in plants reduces the rate of lipid synthesis.

Authors:  K Dehesh; H Tai; P Edwards; J Byrne; J G Jaworski
Journal:  Plant Physiol       Date:  2001-02       Impact factor: 8.340

6.  Genetic analysis of oil content in Brassica napus L. using mixed model of major gene and polygene.

Authors:  Shu-Fen Zhang; Chao-Zhi Ma; Jia-Cheng Zhu; Jian-Ping Wang; Yan-Cheng Wen; Ting-Dong Fu
Journal:  Yi Chuan Xue Bao       Date:  2006-02

7.  The complete nucleotide sequence and RNA editing content of the mitochondrial genome of rapeseed (Brassica napus L.): comparative analysis of the mitochondrial genomes of rapeseed and Arabidopsis thaliana.

Authors:  Hirokazu Handa
Journal:  Nucleic Acids Res       Date:  2003-10-15       Impact factor: 16.971

8.  Brassica ASTRA: an integrated database for Brassica genomic research.

Authors:  Christopher G Love; Andrew J Robinson; Geraldine A C Lim; Clare J Hopkins; Jacqueline Batley; Gary Barker; German C Spangenberg; David Edwards
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

9.  BASC: an integrated bioinformatics system for Brassica research.

Authors:  Timothy A Erwin; Erica G Jewell; Christopher G Love; Geraldine A C Lim; Xi Li; Ross Chapman; Jacqueline Batley; Jason E Stajich; Emmanuel Mongin; Elia Stupka; Bruce Ross; German Spangenberg; David Edwards
Journal:  Nucleic Acids Res       Date:  2006-12-05       Impact factor: 16.971

10.  New computational tools for brassica genome research.

Authors:  Christopher G Love; Jacqueline Batley; Geraldine Lim; Andrew J Robinson; David Savage; Daniel Singh; German C Spangenberg; David Edwards
Journal:  Comp Funct Genomics       Date:  2004
  10 in total
  6 in total

1.  GOAL: a software tool for assessing biological significance of genes groups.

Authors:  Alain B Tchagang; Alexander Gawronski; Hugo Bérubé; Sieu Phan; Fazel Famili; Youlian Pan
Journal:  BMC Bioinformatics       Date:  2010-05-06       Impact factor: 3.169

2.  FR database 1.0: a resource focused on fruit development and ripening.

Authors:  Junyang Yue; Xiaojing Ma; Rongjun Ban; Qianli Huang; Wenjie Wang; Jia Liu; Yongsheng Liu
Journal:  Database (Oxford)       Date:  2015-02-27       Impact factor: 3.451

3.  ocsESTdb: a database of oil crop seed EST sequences for comparative analysis and investigation of a global metabolic network and oil accumulation metabolism.

Authors:  Tao Ke; Jingyin Yu; Caihua Dong; Han Mao; Wei Hua; Shengyi Liu
Journal:  BMC Plant Biol       Date:  2015-01-21       Impact factor: 4.215

Review 4.  Omics: The way forward to enhance abiotic stress tolerance in Brassica napus L.

Authors:  Ali Raza; Ali Razzaq; Sundas Saher Mehmood; Muhammad Azhar Hussain; Su Wei; Huang He; Qamar U Zaman; Zhang Xuekun; Mirza Hasanuzzaman
Journal:  GM Crops Food       Date:  2021-01-02       Impact factor: 3.074

5.  Implementation of two high through-put techniques in a novel application: detecting point mutations in large EMS mutated plant populations.

Authors:  Antoine Lf Gady; Freddy Wk Hermans; Marion Hbj Van de Wal; Eibertus N van Loo; Richard Gf Visser; Christian Wb Bachem
Journal:  Plant Methods       Date:  2009-10-07       Impact factor: 4.993

6.  Identification and expression analysis of WRKY transcription factor genes in canola (Brassica napus L.) in response to fungal pathogens and hormone treatments.

Authors:  Bo Yang; Yuanqing Jiang; Muhammad H Rahman; Michael K Deyholos; Nat N V Kav
Journal:  BMC Plant Biol       Date:  2009-06-03       Impact factor: 4.215

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.