Literature DB >> 29989091

RicyerDB: A Database For Collecting Rice Yield-related Genes with Biological Analysis.

Jing Jiang1, Fei Xing1, Xiangxiang Zeng2, Quan Zou3.   

Abstract

The Rice Yield-related Database (RicyerDB) was created to complement with related research of influence rice (Oryza sativa L.) yield in multiple traits by manually curating the related databases and literature, and genomics and proteomics information that could be useful for comprehensive understanding of the rice biology. RicyerDB provides a more valuable resource in which to efficiently investigate, browse and analyze yield-related genes. The whole data set can be easily queried and downloaded through the webpage. In addition, RicyerDB also constructed a protein-protein interaction network with biological analysis. The combined rice database opens a new path to facilitate researchers achieving information on rice gene in terms of their effects on traits important for rice breeding. The web server is freely available at: http://server.malab.cn/Ricyer/index.html.

Entities:  

Keywords:  gene; protein; rice; trait; yield

Mesh:

Year:  2018        PMID: 29989091      PMCID: PMC6036756          DOI: 10.7150/ijbs.23328

Source DB:  PubMed          Journal:  Int J Biol Sci        ISSN: 1449-2288            Impact factor:   6.580


Introduction

Rice (Oryza sativa L.) is one of the most important food crops worldwide, and more than half of the global population uses it as the main food source 1. In the developing world, rice provides 27% of dietary energy and 20% of dietary protein for people's daily life 2. Moreover, rice has relatively small genome size 3, so it is generally used as a model species in plant biology, especially for studies on monocotyledonous plants. In addition, due to its global importance in food production 4, 5, a number of researches have been published, analyzing the yield associated traits such as grain size 6, 7, grain weight 8, 9, panicle number 10, 11 and so on 12 Besides, until now a huge collection of rice seed carrying useful genes for those traits have been explored by the rice breeders 13. As increasing rice production is crucial for the farmers rely on it for their livelihood, it has been a longstanding issue for the whole world rice researchers for improving the rice yield through rice breeding 14-16. With the rapid advances in high-throughput technologies, it's possible to use bioinformatics measures emerging multi-omics data to explore the major effect factor of the yield for rice 17, 18. In the molecular level emerge genome and proteome data, increasingly being applied outside of pure research towards support the accelerated breeding of rice. However, due to the size and structure of the biological datasets, working with these data is challenging for many field and bench scientists. The main target of this research is to establish an easy and efficient search and retrieval system that would allow rice researchers and breeders to search the trait-related genes quickly. Several databases have been published that contain the proteomic and genomic information about rice. The RAP-DB (Rice Annotation Project Database) database was conceptualized in 2004 upon the completion of genome sequencing by the International Rice Genome Sequencing Project with the aim of providing the scientific community with an accurate and timely annotation of the rice genome sequence. One of the major objectives of this project is to facilitate a comprehensive analysis of the genome structure and function of rice on the basis of the annotation. The CRDC (China Rice Data Center) database was constructed by the China Rice Research Institute of Science and Technology Information Center in 2005. This rice gene database mainly collected rice genes (including QTLs) found at domestic and abroad, including gene name, function, location and reference literature. In addition, there're some online rice genomics databases such as the Whole Rice Genome Automated Annotation Database of TIGR and Rice Information System of Beijing Genomics Institute being focus on integrating facility for data-mining and comparative genomics. The above databases primarily focus on single or multiple omics covering relatively complete information about rice. However, to the best of our knowledge, there is currently no consolidated web tool available for collecting all available rice trait-related genes over public databases. Towards this goal, here, we describe the construction and utility of RicyerDB, which will be of use to the rice researchers in general and rice breeders in particular towards successful planning of their breeding objectives. To meet this challenge the RicyerDB and website was developed. RicyerDB integrates diverse data sources to construct a public platform for browsing and interactive visualizations of yield-related genes. Schematic illustration of the overall information of the database is shown in Figure 1. The search tool enables the user to query a particular gene, and even provides insight into the functions/location of overall genes. The whole data set can be easily queried and downloaded through the webpage. Furthermore, the database can perform fast visualization of genome annotation and protein-protein interaction network, as well as providing statistical analysis of PPIN (protein-protein interaction network). In addition, RicyerDB also allows researchers to submit new gene information.
Figure 1

Schematic illustration of the overall information of RicyerDB.

Materials and Methods

The RicyerDB database integrates diverse publicly available resources to construct a public platform for browsing and interactive visualizations of yield-related genes. We have collected experimental thermodynamic data from PubMed literature and integrated with two public databases RAP-DB and CRDC. The first release of RicyerDB contained more than 400 manually curated gene information entries with literature confirmed, among which 76 come from two databases, the rest come from PubMed. As terminology differs among databases and literature, making cross-comparisons is difficult, therefore data curation from literature requires human extraction and selection. The standard named genes can be retrieval by reference to the NCBI (National Center for Biotechnology Information) Gene. RicyerDB supplemented yield-related genes with protein sequences and chromosome loci information, which obtained from Uniprot and NCBI Gene, respectively. Uniprot (Universal Protein Resource) is one of the most influentially housing protein information databases. It provides high quality protein sequences and the corresponding ID are freely accessible to the scientific community. Gene is a comprehensive public database, which is maintained and distributed by NCBI (National Center for Biotechnology Information), and contains the gene information about chromosome position as well as gene alias. To explore the biological function of yield-related genes, information of gene annotation derived from the Gene Ontology Consortium. The Gene Ontology database is a major bioinformatics tool of our evolving knowledge of how genes encode biological functions at the molecular, cellular and tissue levels 19-21. RicyerDB database provides a functional annotation analysis of yield-related genes, and assigns an importance score for each functional annotation. Conversely, for each gene along with annotation information also possesses an importance score. Meanwhile, a global view of the genes association requires knowledge of interactions between the expressed proteins 22. For each protein-protein interaction stored in STRING (Search Tool for Recurring Instances of Neighboring Genes), a score is provided. The score (i.e., the 'edge weight' in the network) represents confidence score, and is scaled between zero and one. It indicates the estimated likelihood that a given interaction is biologically meaningful, specific and reproducible, given the supporting evidence. These above online information resources are shown in Figure 2.
Figure 2

The on-line public source databases of RicyerDB.

Results

Implementation

The RicyerDB server consists of two major components: the client web interface and the server backend. The former was implemented using jQuery, Bootstrap, CSS and Html. For the latter, a Java Servlet, for the service connector, responds to the server request. The RicyerDB has been tested in the Google Chrome, Firefox and Internet Explorer web browsers.

Interface

The RicyerDB interface is divided into several sections. Meanwhile, at the bottom of this homepage, existing three friendly links point to the corresponding databases.

Search

A capability to create a valid search query is the key to successful usage of any database. RicyerDB provides an interface for convenient retrieval of all rice trait-related genes and corresponding information in the 'Search' page. With the input of key word in the quick searching box, the search engine will return the brief details of search results as a table. Moreover, users can also search genes through the advanced search. There are two options “smart search” and “regex” in the advanced search part, which can be checked according to the users' need. The queried result table contains gene names, protein sequences, and the supporting literature evidence, and so on. When the user clicks the small triangle on the head of each table column, the results in the table will be resorted in ascending/descending order.

Browse

A global overview of all rice yield-related genes from different perspectives can be acquired by browsing the database. In 'Browse' section, users can access RicyerDB in three different paths: 'browse by trait', 'browse by cause' and 'browse by location'. In each path, all genes are classified into several entries. The rings distribution of the three browsing paths is shown in Figure 3.
Figure 3

The rings distribution of rice yield-related genes. The three rings from the inside to the outside correspond to chromosome, cause and trait, respectively.

JBrowse

JBrowse implements a genome annotation tool that can be used to display an arbitrary set of features on expressed protein, and shows the position of the protein in corresponding chromosome. JBrowse provides multiple configurable levels of zoom, and two scroll speeds. Once an interesting region of the genome is in view, the user can make finer adjustments by scrolling and zooming with the navigation bar, which appears in the upper side of the area.

Interaction

To further explore the relationship between different proteins, 'Interaction' was provided to visualize as a network 23, which nodes present the proteins and edges pre-sent the interactions between proteins. The combined score of each interaction is mapped to the edge thickness. The further analysis results of the network comprised topological and statistical features were also shown in the page.

Submit

It is inevitable that the collections of RicyerDB may not cover all yield-associated genes. So we provide the submission interface to make sure that researchers can submit new genes that are not documented. In the 'Submit' page, RicyerDB invites users to upload novel gene symbol whether through experiment validation associated with rice yield or not. The request to leave the email is convenient for us to further contact you. In most cases, the authors are contacted for missing or ambiguous information and an extensive literature search is performed to complement data. If a user needs a more complex analysis, the website allows downloading the entire database data. In 'Home' page, the whole data are saved in ZIP formats, users can get them by clicking the 'Download' button. Except these, RicyerDB also provides a section to facilitate new users quickly access to use, its instruction is in the 'Help' page, including figures and narrative memoranda of it.

Discussion and Future Prospects

Bioinformatics is a rapidly growing field of research that is being driven by the requirement to manage and interrogate the vast quantities of data being generated by 'omics' technologies. Decades of research on rice has generated several known multi-omics resources, such as genome, proteome and transcriptome 23-35, with a sole aim to understand every aspect of rice biology. Rice is the most important crop consumed all over the world. Several rice genes databases including the annotation as well as mutant information of the rice have been previously constructed, such as JCVI 36, RAD 37, RGKbase 38 and RMD 39. Although, these databases have exhaustive data on rice, they do not precisely catalogue and integrate rice production increase this specific demand. To complement with this absence, we developed the RicyerDB by integrating genome and proteome data. To our knowledge, this is the first database comprehensively focusing on the rice production. We hope this resource will provide effective information and be convenient for the researchers as well as farmers exploit potentials of rice as a major crop to feed the world. A limitation of this database is that it integrates the genome and proteome, which do not cover all omics information of the genes. Future developments of RicyerDB include regular updates, improving data quantity and quality, and incorporating new types of data, such as epigenome, phenome and other omics data 40, 41. In addition, RicyerDB will collect more rice online database resources and yield-related literature. Our database will be updated periodically in future according to this additional information. In the subsequent modules, an effective prediction algorithm was added to predict the new genes for rice yield based on our database. We also call for worldwide collaborations and look forward to comments and suggestions from researchers and breeders, aiming to build RicyerDB into a more comprehensive knowledgebase of rice production.
  39 in total

1.  What it will take to feed 5.0 billion rice consumers in 2030.

Authors:  Gurdev S Khush
Journal:  Plant Mol Biol       Date:  2005-09       Impact factor: 4.076

2.  Integrating Multiple Heterogeneous Networks for Novel LncRNA-Disease Association Inference.

Authors:  Jingpu Zhang; Zuping Zhang; Zhigang Chen; Lei Deng
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2017-05-04       Impact factor: 3.710

3.  Genomic architecture of heterosis for yield traits in rice.

Authors:  Xuehui Huang; Shihua Yang; Junyi Gong; Qiang Zhao; Qi Feng; Qilin Zhan; Yan Zhao; Wenjun Li; Benyi Cheng; Junhui Xia; Neng Chen; Tao Huang; Lei Zhang; Danlin Fan; Jiaying Chen; Congcong Zhou; Yiqi Lu; Qijun Weng; Bin Han
Journal:  Nature       Date:  2016-09-07       Impact factor: 49.962

4.  Construction of a male sterility system for hybrid rice breeding and seed production using a nuclear male sterility gene.

Authors:  Zhenyi Chang; Zhufeng Chen; Na Wang; Gang Xie; Jiawei Lu; Wei Yan; Junli Zhou; Xiaoyan Tang; Xing Wang Deng
Journal:  Proc Natl Acad Sci U S A       Date:  2016-11-18       Impact factor: 11.205

5.  Towards establishment of a rice stress response interactome.

Authors:  Young-Su Seo; Mawsheng Chern; Laura E Bartley; Muho Han; Ki-Hong Jung; Insuk Lee; Harkamal Walia; Todd Richter; Xia Xu; Peijian Cao; Wei Bai; Rajeshwari Ramanan; Fawn Amonpant; Loganathan Arul; Patrick E Canlas; Randy Ruan; Chang-Jin Park; Xuewei Chen; Sohyun Hwang; Jong-Seong Jeon; Pamela C Ronald
Journal:  PLoS Genet       Date:  2011-04-14       Impact factor: 5.917

6.  A computational interactome and functional annotation for the human proteome.

Authors:  José Ignacio Garzón; Lei Deng; Diana Murray; Sagi Shapira; Donald Petrey; Barry Honig
Journal:  Elife       Date:  2016-10-22       Impact factor: 8.140

7.  iRNA-PseColl: Identifying the Occurrence Sites of Different RNA Modifications by Incorporating Collective Effects of Nucleotides into PseKNC.

Authors:  Pengmian Feng; Hui Ding; Hui Yang; Wei Chen; Hao Lin; Kuo-Chen Chou
Journal:  Mol Ther Nucleic Acids       Date:  2017-03-29

8.  The Rice Genome Knowledgebase (RGKbase): an annotation database for rice comparative genomics and evolutionary biology.

Authors:  Dapeng Wang; Yan Xia; Xinna Li; Lixia Hou; Jun Yu
Journal:  Nucleic Acids Res       Date:  2012-11-28       Impact factor: 16.971

Review 9.  Advances in breeding for high grain Zinc in Rice.

Authors:  B P Mallikarjuna Swamy; Mohammad Akhlasur Rahman; Mary Ann Inabangan-Asilo; Amery Amparado; Christine Manito; Prabhjit Chadha-Mohanty; Russell Reinke; Inez H Slamet-Loedin
Journal:  Rice (N Y)       Date:  2016-09-26       Impact factor: 4.783

10.  RNALocate: a resource for RNA subcellular localizations.

Authors:  Ting Zhang; Puwen Tan; Liqiang Wang; Nana Jin; Yana Li; Lin Zhang; Huan Yang; Zhenyu Hu; Lining Zhang; Chunyu Hu; Chunhua Li; Kun Qian; Changjian Zhang; Yan Huang; Kongning Li; Hao Lin; Dong Wang
Journal:  Nucleic Acids Res       Date:  2016-08-19       Impact factor: 16.971

View more
  6 in total

1.  Yield-associated putative gene regulatory networks in Oryza sativa L. subsp. indica and their association with high-yielding genotypes.

Authors:  Aparna Eragam; Vishnu Shukla; Vijaya Sudhakararao Kola; P Latha; Srividhya Akkareddy; Madhavi L Kommana; Eswarayya Ramireddy; Lakshminarayana R Vemireddy
Journal:  Mol Biol Rep       Date:  2022-05-25       Impact factor: 2.742

2.  Special issue on Computational Resources and Methods in Biological Sciences.

Authors:  Hao Lin; Shaoliang Peng; Jian Huang
Journal:  Int J Biol Sci       Date:  2018-07-01       Impact factor: 6.580

3.  Identification and Analysis of Rice Yield-Related Candidate Genes by Walking on the Functional Network.

Authors:  Jing Jiang; Fei Xing; Chunyu Wang; Xiangxiang Zeng
Journal:  Front Plant Sci       Date:  2018-11-20       Impact factor: 5.753

4.  Nitrogen Use Efficiency Phenotype and Associated Genes: Roles of Germination, Flowering, Root/Shoot Length and Biomass.

Authors:  Narendra Sharma; Vimlendu Bhushan Sinha; N Arun Prem Kumar; Desiraju Subrahmanyam; C N Neeraja; Surekha Kuchi; Ashwani Jha; Rajender Parsad; Vetury Sitaramam; Nandula Raghuram
Journal:  Front Plant Sci       Date:  2021-01-20       Impact factor: 5.753

Review 5.  Physiological and Multi-Omics Approaches for Explaining Drought Stress Tolerance and Supporting Sustainable Production of Rice.

Authors:  Sajad Majeed Zargar; Rakeeb Ahmad Mir; Leonard Barnabas Ebinezer; Antonio Masi; Ammarah Hami; Madhiya Manzoor; Romesh K Salgotra; Najeebul Rehman Sofi; Roohi Mushtaq; Jai Singh Rohila; Randeep Rakwal
Journal:  Front Plant Sci       Date:  2022-01-27       Impact factor: 5.753

Review 6.  Prediction Methods of Herbal Compounds in Chinese Medicinal Herbs.

Authors:  Ke Han; Lei Zhang; Miao Wang; Rui Zhang; Chunyu Wang; Chengzhi Zhang
Journal:  Molecules       Date:  2018-09-10       Impact factor: 4.411

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.