Literature DB >> 26519466

Information Commons for Rice (IC4R).

Lili Hao, Huiyong Zhang, Zhang Zhang, Songnian Hu, Yu Xue.   

Abstract

Rice is the most important staple food for a large part of the world's human population and also a key model organism for plant research. Here, we present Information Commons for Rice (IC4R; http://ic4r.org), a rice knowledgebase featuring adoption of an extensible and sustainable architecture that integrates multiple omics data through community-contributed modules. Each module is developed and maintained by different committed groups, deals with data collection, processing and visualization, and delivers data on-demand via web services. In the current version, IC4R incorporates a variety of rice data through multiple committed modules, including genome-wide expression profiles derived entirely from RNA-Seq data, resequencing-based genomic variations obtained from re-sequencing data of thousands of rice varieties, plant homologous genes covering multiple diverse plant species, post-translational modifications, rice-related literatures and gene annotations contributed by the rice research community. Unlike extant related databases, IC4R is designed for scalability and sustainability and thus also features collaborative integration of rice data and low costs for database update and maintenance. Future directions of IC4R include incorporation of other omics data and association of multiple omics data with agronomically important traits, dedicating to build IC4R into a valuable knowledgebase for both basic and translational researches in rice.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26519466      PMCID: PMC4702825          DOI: 10.1093/nar/gkv1141

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Rice (Oryza sativa) is not only an important model organism for monocot plants and cereals but also the most widely consumed staple food for a large part of the world's human population. The rapid development of high-throughput sequencing technologies leads to big omics data in rice, enabling in-depth investigations of the mechanisms that control important agronomical traits, the diversity of germplasm resources and the process of rice domestication (1–3). With the increasing volume of various types of omics data, one of the fundamentally crucial issues is integration of numerous, voluminous and heterogeneous rice omics data and construction of an integrated web resource for rice data repository, accessing and visualization. Over the past years, several databases have been developed to manage various rice data (4–14), and among them, RAP-DB (Rice Annotation Project Database by National Institute of Agrobiological Sciences) (4,5) and MSU-RGAP (Michigan State University-Rice Genome Annotation Project, which was initially developed by The Institute for Genomic Research) (6) are representative examples. RAP-DB and MSU-RGAP are two parallel gene annotation systems based on identical reference genome of japonica cultivar Nipponbare, independently generate gene models, and host sequence data and associated annotations for the rice genome (15). However, RAP-DB and MSU-RGAP do not include resequencing-based genome-wide variations, contain a limited number of gene expression profiles and incorporate very few plant organisms for rice homology analysis. Unlike RAP-DB and MSU-RGAP that are designed specialized for rice, Gramene (8) is a resource for comparative functional genomics in crops and plant models but does not incorporate adequate rice genomic variations or high-resolution expression profiles, either. In the meanwhile, there are still several databases with particular focuses on specialized data types or areas, such as RiceXPro (9) and ROAD (13), two repositories of gene expression profiles derived wholly from microarray data, RiceVarMap (10), a database of rice genomic variations identified from 1479 rice varieties, and DRTF (11), a database of rice transcription factors, etc. An integrated database incorporating multiple omics data is of great significance for the rice research community. Although existing rice-related databases have made valuable attempts, none of them achieves a substantial impact on fully incorporating heterogeneous and voluminous omics data into a web resource. To make matters worse, the exponentially growing volume of rice omics data generated by next-generation sequencing technologies, poses further challenges on database management and maintenance. As a result, it becomes increasingly daunting for these databases to incorporate big omics data and to have regular and frequent updates (e.g. MSU-RGAP is last updated on 6 February 2013). Intrinsically, this is due to lack of an efficient means to effectively integrate data from different sources and to sustainably keep updates at relatively low costs. Considering the deluge of rice data, therefore, it would be desirable for the rice research community to build an extensible and sustainable architecture that integrates data through community-contributed modules, which are developed and released by different committed groups and thus may reside all over the world. Each module may correspond to a specific data type, deals with data collection, processing and visualization and delivers data on-demand via web application programming interfaces (API). Such module-based architecture accordingly achieves integration of big omics data from different resources and facilitates database update and maintenance at far lower costs. Toward this end, here we present Information Commons for Rice (IC4R; http://ic4r.org), a rice knowledgebase based on an extensible and sustainable architecture that achieves data integration through community-contributed modules. IC4R dedicates to provide a reference genome with standardized and accurate gene annotations based on huge amounts of omics data and large quantities of rice-related literatures. Unlike extant related databases, IC4R is designed for scalability and sustainability, integrating data from remote resources through web APIs and thus featuring collaborative integration of rice data from multiple committed modules and low costs for database update and maintenance. In the current version, IC4R is focused primarily on integrating expression profiles, genomic variations, plant homologs, post-translational modifications, literatures as well as community-contributed annotations.

IMPLEMENTATION

IC4R is built based on the Nipponbare reference genome Os-Nipponbare-Reference-IRGSP-1.0 (15) and its sequence data and associated annotations are initially seeded with a combined dataset from RAP-DB (4,5) and Uniprot (16). To fully characterize rice genes, a variety of ontologies, including gene ontology, trait ontology, plant ontology and environmental ontology, are extracted from Oryzabase (12) and MCDRP (17). In addition, information of the representative quantitative trait loci (QTLs) is obtained from Q-TARO (18). The current version of IC4R has several committed sub-projects (Figure 1), including Rice Expression Database, Rice Variation Database, Plant Homolog Database, Rice Literature Miner, RiceWiki (a wiki-based database for community-curation of rice genes), Post-translational Modification (PTM) and Rice Genome Browser, which have been developed and maintained by different working groups (http://ic4r.org/working_groups). Rice Expression Database, Rice Variation Database and Rice Literature Miner are implemented using J2EE (Java Platform, Enterprise Edition), MySQL (http://www.mysql.org; one of the most popular relational database management systems) and Apache Tomcat Server (http://tomcat.apache.org; an open source software implementation of Java Servlet and Java Server Pages). They are built on MVC (Model View Controller), a popular architecture in software engineering that divides software applications into three interconnected parts: data storage and modeling (independent from user interfaces) as Model, data retrieval and presentation as View, and data processing and update as Controller. Plant Homolog Database is constructed with RoR (Ruby on Rails, http://rubyonrails.org; version 4.1), a full stack web application development framework that provides an agile way to build robust web system in a relative short time. Its underlying data is directly deposited into PostgreSQL (http://www.postgresql.org; a commonly employed enterprise relational SQL database) using a fully automatic command-line script, which can efficiently load data within a few hours. RiceWiki (19) is developed based on MediaWiki (a free and open source wiki engine; version 1.18.4), with the aim to harness community intelligence for collaborative and collective curation of rice genes. Rice Genome Browser is built based on JBrowse (20) (a portable, JavaScript-based genome browser) and is capable of visualizing a variety of rice omics data in an interactive manner. In addition, IC4R works in close collaboration with three PTM-related databases, viz., EKPD (21), UUCD (22) and dbPPT (7), and fetches the PTM data via web APIs.
Figure 1.

IC4R committed sub-projects. In the current version, IC4R integrates data from multiple committed databases as well as offers a genome browser for data visualization: (1) Rice Expression Database, a repository of gene expression profiles derived entirely from RNA-Seq data analysis on tissues spanning a range of rice growth stages and covering a variety of biotic and abiotic treatments; (2) Rice Variation Database, a large set of single nucleotide polymorphisms based on re-sequencing of thousands of rice cultivars; (3) Plant Homolog Database, a database composed of more than 14,000 homologous genes covering diverse plant species; (4) Rice Literature Miner, a collection of rice-related publications by associating with specific rice genes; (5) RiceWiki, a wiki-based, publicly editable and open-content platform for community curation of rice genes; (6) Post-translational Modification, provided by three partner databases and (7) Rice Genome Browser, a genome browser for rice data visualization.

IC4R committed sub-projects. In the current version, IC4R integrates data from multiple committed databases as well as offers a genome browser for data visualization: (1) Rice Expression Database, a repository of gene expression profiles derived entirely from RNA-Seq data analysis on tissues spanning a range of rice growth stages and covering a variety of biotic and abiotic treatments; (2) Rice Variation Database, a large set of single nucleotide polymorphisms based on re-sequencing of thousands of rice cultivars; (3) Plant Homolog Database, a database composed of more than 14,000 homologous genes covering diverse plant species; (4) Rice Literature Miner, a collection of rice-related publications by associating with specific rice genes; (5) RiceWiki, a wiki-based, publicly editable and open-content platform for community curation of rice genes; (6) Post-translational Modification, provided by three partner databases and (7) Rice Genome Browser, a genome browser for rice data visualization. To provide friendly and interactive web pages, browser-based interfaces are coded in HTML5, CSS3, AJAX (Asynchronous JavaScript and XML, a collection of web development technologies for creating highly interactive web applications) and JQuery (a cross-platform and feature-rich JavaScript library; http://jquery.com, version 2.1.4), enabling data transfer between server and browser asynchronously without reloading the current web page. Equipped with HighCharts (a tool to provide interactive charts and diagrams for web and mobile projects; http://www.highcharts.com, version 4.1.7) and D3 (Data-Driven Documents, a JavaScript library for producing dynamic, interactive data visualizations; http://d3js.org, version 3.5.6), IC4R provides interactive and dynamic visualizations for different types of rice data in web browsers. In addition, for visualization of multiple sequence alignments (MSA), IC4R employs BioJS (23) to interactively view the aligned sequences. For scalability and sustainability, IC4R relies on community-contributed modules and integrates various types of data from committed sub-projects via web APIs. Therefore, there are multiple open web APIs that enable data retrieval in a programmatic manner. The server side JSON-formatted response can be transmitted through HTTP protocol and then can be easily parsed with any standard JSON parser. In addition to APIs, IC4R provides a more convenient and effective way to embed another document within any current HTML document using the iframe tag. The detailed information on IC4R APIs as well as embed tag can be found at http://ic4r.org/support/api.

DATABASE CONTENTS AND FEATURES

Designed for scalability and sustainability, IC4R integrates data from committed databases that are built from scratch by different working groups. In the current implementation, IC4R hosts multiple committed sub-projects that correspond to a variety of omics data (Table 1), including genome-wide expression profiles derived wholly from RNA-Seq data, genomic variations obtained from re-sequencing data of thousands of rice varieties, plant homologous genes covering multiple diverse plant species, post-translational modifications, rice-related literatures that are associated with specific genes, and gene annotations contributed by the rice research community. In addition, to enable users to navigate rice data in an intuitive and graphical visualization, an interactive and dynamic genome browser powered by JBrowse is deployed in IC4R (http://visual.ic4r.org), which accordingly contains the corresponding data tracks as well as reference sequence, gene/transcript structure, etc. Consequently, each gene in IC4R that corresponds to a specific web page contains the following sections: gene summary, genomic context, transcripts, expression, homology, variation, post-translational modification, literature, community annotation, genome browser, ontology, QTL and additional links (Figure 2). In each section, a hyperlink to its source database, if available, is provided, which can direct users to refer to more detailed information.
Table 1.

Data statistics of IC4R committed databases (as of 22 September 2015)

Data ContentsData Statistics
Expression
RNA-Seq projects17
RNA-Seq experiments218
Tissues8
Gene expression profiles10 025 602
Transcript expression profiles11 457 912
Variation
Cultivars5524
SNPs8 544 598
Homology
Homologous groups14 739
Species17
Post-translational modification
PKs and PPs1676
E1s, E2s, E3s and DUBs1814
Protein phosphorylation sites3746
Wiki
Community-annotated genes1005
Literature
Articles35 717
Figure 2.

Screenshots of a gene report page, taking gene ‘Os01g0192000’ as an example. (A) Navigation information, providing links to different sections. (B) Expression profiles that are derived entirely from RNA-Seq data. Detailed expression levels under different tissues are provided in the form of box plot. (C) Homologous genes covering 17 plant genomes. To facilitate users to investigate orthologs and paralogs in an interactive manner, phylogenetic tree is plotted by a JavaScript-based web plugin and each node is identified as gene duplication event or speciation event. (D) Variation details. There are multiple SNPs found in this gene and a wealth of information regarding each SNP (for example, ‘OS01SNP04949925’) is provided, including its effects, surrounding genes, allele frequency among 5524 cultivars and the corresponding genotype in each cultivar. (E) Post-translational modification data. (F) Publications associated with this gene. (G) Community-contributed annotations. (H) Genome browser for visualizing different data tracks.

Screenshots of a gene report page, taking gene ‘Os01g0192000’ as an example. (A) Navigation information, providing links to different sections. (B) Expression profiles that are derived entirely from RNA-Seq data. Detailed expression levels under different tissues are provided in the form of box plot. (C) Homologous genes covering 17 plant genomes. To facilitate users to investigate orthologs and paralogs in an interactive manner, phylogenetic tree is plotted by a JavaScript-based web plugin and each node is identified as gene duplication event or speciation event. (D) Variation details. There are multiple SNPs found in this gene and a wealth of information regarding each SNP (for example, ‘OS01SNP04949925’) is provided, including its effects, surrounding genes, allele frequency among 5524 cultivars and the corresponding genotype in each cultivar. (E) Post-translational modification data. (F) Publications associated with this gene. (G) Community-contributed annotations. (H) Genome browser for visualizing different data tracks.

Expression

Rice Expression Database (RED; http://expression.ic4r.org) features integration of genome-wide expression profiles derived entirely from RNA-Seq data analysis on rice tissues spanning a wide range of growth stages and covering a huge variety of biotic and abiotic treatments. Different from existing relevant databases that are based on microarray data, RED is based completely on high-resolution RNA-Seq data that are publicly available in NCBI SRA (24,25) (http://www.ncbi.nlm.nih.gov/sra/). In order to provide high-quality gene expression profiles, strict criteria were used in data processing. Sequencing reads are required to be longer than or equal to 50bp. Raw RNA-Seq data was first converted into fastq format using SRA Toolkit (v 2.4.2) and NGS QC Toolkit (v2.3.3) (26) was then adopted for quality control (e.g. base quality > 20). A sample was excluded from further analysis if it contains low-quality reads accounting for over 30% of total reads. Sequencing reads were mapped to the rice reference genome (IRGSP-1.0) using Tophat (v 2.0.13) (27), but only samples with >70% reads mapped to the genome were used for gene expression profiling. Afterwards, FPKM (fragments per kilobase of exon per million fragments mapped) was estimated to represent expression levels of genes and transcripts with the help of Cufflinks (v 2.2.1) (28,29). As a result, a total of 17 high-quality RNA-Seq projects (30–32) were selected, including 218 experiments, covering 8 tissues and yielding 10 025 602 gene expression profiles (Table 1). In addition, RED features friendly web interfaces for querying and visualizing gene expression profiles. For a given gene, its expression profiles under all available tissues and treatments are not only retrieved and displayed in a tabulated form but also visualized in a box plot, which greatly facilitates investigation on gene expression pattern. For multiple genes (or transcripts) of interest, RED provides interactive functionalities of ‘Line chart’ and ‘Heatmap chart’ to dynamically depict their expression patterns across a variety of tissues and treatments. For convenience, all charts generated in RED are downloadable with four different image formats (PNG, JPEG, PDF and SVG) available. Furthermore, RED provides easy-to-use web interfaces to query expression profiles for multiple specific genes or a given region. The queried results can be further narrowed down by setting a customized cut-off for gene expression level as well as by specifying tissue type, development stage, sequencing platform, experiment, project name, etc. Detailed information on all collected projects as well as their corresponding experiments (including experiment metadata, mapping quality, etc.) used in RED can be found at http://expression.ic4r.org/project. Considering that users may download gene expression profiles for their own analysis, RED also provides links to all processed results for different SRA projects (http://expression.ic4r.org/download).

Variation

Genomic variations are of great usefulness in studying favourable traits (e.g. drought/disease resistance) for plant breeding. Rice Variation Database (RVD; http://variation.ic4r.org) is built based on large-scale re-sequencing of a high diversity of rice cultivars (1–3,10,14,33,34) (Table 1), including 1955 from published literatures (1–3,10,33), 3024 from the 3K Rice Genome Project (14,34) and 545 from our collaborators (unpublished data). RVD adopts stringent criteria for SNP calling. All re-sequencing data was first aligned to the japonica Nipponbare reference genome with Bowtie2 (35). As one of most important types of genomic variations, single nucleotide polymorphisms (SNP) were then identified with SAMtools and BCFtools (36). Only SNPs with minimum occurrence ≥50 and missing rate <0.8 were considered. As a consequence, a collection of 8 544 598 SNPs was derived from 5524 rice cultivars (Table 1). Other types of genomic variations (e.g. insertion/deletion) will be integrated into RVD in the near future. By contrast with existing related databases (e.g. RiceVarMap (10) that is based on 1479 rice cultivars, SNP-Seek (14) that is derived from the 3K Rice Genome Project), RVD features a more comprehensive assembly of 5524 rice cultivars and consequently houses a total of 8 544 598 SNPs. By specifying a gene ID as well as its downstream and upstream distances, RVD is able to retrieve all related SNP information, including SNP ID, chromosome position, reference genotype, alleles, effect, etc. Users can also input a specific region to obtain all SNPs locating in this region. When querying a specific SNP, RVD offers detailed information about its genome location, effect, neighbouring genes, reference base, alleles and provides a pie chart to visualize the frequency of alleles across all available cultivars. Hyperlinks to genome browser are also provided in the SNP page. Moreover, RVD is capable of comparing polymorphic loci between cultivars and users can just specify the chromosome region and cultivars of interest.

Homology

To better understand the evolution of rice genes, Plant Homolog Database (PHD; http://homolog.ic4r.org) is built from scratch for identification of homologous genes among diverse plant species. According to genome assembly quality and annotation completeness, there are a total of 17 plant species (obtained from Ensembl (37); ftp://ftp.ensemblgenomes.org/pub/plants/release-24/) incorporated in PHD. PHD combines phylogenetic and heuristic best-match approaches (38,39) for homology identification. Briefly, OrthoMCL (40) was used to build homologous groups and homologous alignments were done by MAFFT (41) and trimmed by trimAl (42). Phylogenetic tree was constructed by PhyML (43) and amino acid replacement model was optimized by ProtTest (44). Finally, GSDI (45) and RIO (46) were used for ortholog and paralog identification, inferring gene speciation and duplication events on a gene tree. As a result, PHD houses a multitude of 14 739 plant homologous groups (Table 1). Unlike extant similar databases (38,47–50) that incorporate few information of homologs focusing on the genus of Oryza, PHD is a database specialized for plant homologs, covering 10 Oryza genomes as well as 7 important model organisms and crops (including Arabidopsis, tomato, maize, sorghum, etc.). It features user-friendly web interfaces for simplifying the query and retrieval of information of interest and offers highly dynamic visualizations for multiple sequence alignment, phylogenetic tree, and duplication/speciation events. When specifying a gene or homolog group, PHD is able to retrieve its homologs and provide visualizations for its phylogenetic tree, species distribution, nonsynonymous and synonymous substitution rates and aligned sequences. Particularly, in phylogenetic gene tree, each node is identified as gene duplication event or speciation event and accordingly represented by different colors, facilitating users to distinguish orthologs from paralogs in a more straightforward way. Detailed information about 17 plant species used in PHD can be found at http://homolog.ic4r.org/species/index.

Literature

Rice Literature Miner (RLM; http://literature.ic4r.org) is a literature database developed specific for rice. Literatures are fetched from PubMed (http://www.ncbi.nlm.nih.gov/pubmed/) via NCBI E-utilities. As a result, a total of 35 717 rice-related publications were obtained and the associated genes for each publication were then identified (Table 1). Specifically, for any given publication, RLM is able to provide all associated genes reported in this publication. Meanwhile, when inputting a gene ID, gene symbol or author name, RLM can also retrieve a list of related publications and offer all relevant information including publication title, author(s), journal, year of publication, abstract, etc. RLM links genes with literatures and thus facilitates users to find publications for genes of interest, which ultimately can increase the productivity and accuracy of curation and improve the credibility of community-curated contents.

Community annotation

The exponential growth of rice omics data demands more and more people getting involved in gene annotation, namely, community annotation. Therefore, we launched RiceWiki (http://wiki.ic4r.org) in 2012 (officially released in August 2013), a wiki-based database that aims to exploit the full potential of community intelligence in collective and collaborative curation of rice genes (19). To increase community participations and contributions, AuthorReward, a MediaWiki extension that quantifies users’ contributions in community curation and provides explicit authorship according to their quantified contributions (51), was installed in RiceWiki. To date, RiceWiki contains >86 000 gene-specific wiki pages incorporating two cultivated rice subspecies (japonica and indica) and has more than 600 registered community-curators, leading to a total of 1003 rice genes that have been community-annotated (Table 1; http://wiki.ic4r.org/index.php/Special:CuratedGenes).

Post-translational modification

Various post-translational modifications, such as phosphorylation and ubiquitination, are critical for plants in regulating a diversity of biological processes including cellular metabolism, signal transduction and responses to environmental stress. Three databases, viz., EKPD (http://ekpd.biocuckoo.org) (21), UUCD (http://uucd.biocuckoo.org) (22) and dbPPT (http://dbppt.biocuckoo.org) (7), have been developed for collecting and annotating the post-translational modification data. Specifically, EKPD features classification of eukaryotic protein kinases (PKs) and protein phosphatases (PPs) into a hierarchical structure with three levels, namely, group, family and individual PK/PP (21). UUCD is a family-based database collecting and classifying ubiquitin-activating enzymes (E1s), ubiquitin-conjugating enzymes (E2s), ubiquitin protein ligases (E3s), and deubiquitination enzymes (DUBs) into different families (22). dbPPT is a database of protein phosphorylation in plants (7). To collect post-translational modification data for rice, IC4R builds close collaborations with these three databases and integrates relevant data through web APIs (Table 1).

DISCUSSION AND FUTURE DIRECTIONS

IC4R is dedicated to comprehensive integration of rice omics data. Unlike existing related databases, IC4R features adoption of an extensible and sustainable architecture that is based on community-contributed modules for rice data integration. Such module-based architecture, albeit relatively new, is promising to effectively and efficiently integrate big omics data as the cost for database update and maintenance under this architecture is significantly reduced (52). The current implementation of IC4R integrates data of expression, variation, homology, literature, community annotation and post-translational modification. As a consequence, IC4R bears the potential to serve as a one-stop knowledgebase to make big data accessible to the rice research community and function as a valuable resource not only for plant researchers in molecular biological studies but also for breeders in rice production and improvement. Future developments of IC4R include regular updates of existing committed sub-projects to improve data quantity and quality and establishment of new committed sub-projects to incorporate new types of data, such as epigenomic, phenomic and other omics data (some of them are already in progress; http://ic4r.org/working_groups). The ultimate goal of IC4R is to associate rice omics data with agronomically important phenotypic data, which will greatly help researchers and breeders to elucidate molecular mechanisms underlying these important agronomic traits. In addition, IC4R will also develop and integrate a variety of tools for functional annotation, co-expression network, genomic variation analysis, and literature mining as well as more interactive visualizations for various omics data. We also call for worldwide collaborations and look forward to comments and suggestions from plant researchers and breeders, aiming to build IC4R into a more comprehensive knowledgebase covering all aspects of rice knowledge.
  50 in total

1.  Transcriptome and methylome interactions in rice hybrids.

Authors:  Ramakrishna K Chodavarapu; Suhua Feng; Bo Ding; Stacey A Simon; David Lopez; Yulin Jia; Guo-Liang Wang; Blake C Meyers; Steven E Jacobsen; Matteo Pellegrini
Journal:  Proc Natl Acad Sci U S A       Date:  2012-07-09       Impact factor: 11.205

2.  Genome-wide association studies of 14 agronomic traits in rice landraces.

Authors:  Xuehui Huang; Xinghua Wei; Tao Sang; Qiang Zhao; Qi Feng; Yan Zhao; Canyang Li; Chuanrang Zhu; Tingting Lu; Zhiwu Zhang; Meng Li; Danlin Fan; Yunli Guo; Ahong Wang; Lu Wang; Liuwei Deng; Wenjun Li; Yiqi Lu; Qijun Weng; Kunyan Liu; Tao Huang; Taoying Zhou; Yufeng Jing; Wei Li; Zhang Lin; Edward S Buckler; Qian Qian; Qi-Fa Zhang; Jiayang Li; Bin Han
Journal:  Nat Genet       Date:  2010-10-24       Impact factor: 38.330

3.  Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes.

Authors:  Xun Xu; Xin Liu; Song Ge; Jeffrey D Jensen; Fengyi Hu; Xin Li; Yang Dong; Ryan N Gutenkunst; Lin Fang; Lei Huang; Jingxiang Li; Weiming He; Guojie Zhang; Xiaoming Zheng; Fumin Zhang; Yingrui Li; Chang Yu; Karsten Kristiansen; Xiuqing Zhang; Jian Wang; Mark Wright; Susan McCouch; Rasmus Nielsen; Jun Wang; Wen Wang
Journal:  Nat Biotechnol       Date:  2011-12-11       Impact factor: 54.908

4.  dbPPT: a comprehensive database of protein phosphorylation in plants.

Authors:  Han Cheng; Wankun Deng; Yongbo Wang; Jian Ren; Zexian Liu; Yu Xue
Journal:  Database (Oxford)       Date:  2014-12-22       Impact factor: 3.451

5.  Araport: the Arabidopsis information portal.

Authors:  Vivek Krishnakumar; Matthew R Hanlon; Sergio Contrino; Erik S Ferlanti; Svetlana Karamycheva; Maria Kim; Benjamin D Rosen; Chia-Yi Cheng; Walter Moreira; Stephen A Mock; Joseph Stubbs; Julie M Sullivan; Konstantinos Krampis; Jason R Miller; Gos Micklem; Matthew Vaughn; Christopher D Town
Journal:  Nucleic Acids Res       Date:  2014-11-20       Impact factor: 16.971

6.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.

Authors:  Salvador Capella-Gutiérrez; José M Silla-Martínez; Toni Gabaldón
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

7.  InParanoid 7: new algorithms and tools for eukaryotic orthology analysis.

Authors:  Gabriel Ostlund; Thomas Schmitt; Kristoffer Forslund; Tina Köstler; David N Messina; Sanjit Roopra; Oliver Frings; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2009-11-05       Impact factor: 16.971

8.  UUCD: a family-based database of ubiquitin and ubiquitin-like conjugation.

Authors:  Tianshun Gao; Zexian Liu; Yongbo Wang; Han Cheng; Qing Yang; Anyuan Guo; Jian Ren; Yu Xue
Journal:  Nucleic Acids Res       Date:  2012-11-20       Impact factor: 16.971

9.  TreeFam: 2008 Update.

Authors:  Jue Ruan; Heng Li; Zhongzhong Chen; Avril Coghlan; Lachlan James M Coin; Yiran Guo; Jean-Karim Hériché; Yafeng Hu; Karsten Kristiansen; Ruiqiang Li; Tao Liu; Alan Moses; Junjie Qin; Søren Vang; Albert J Vilella; Abel Ureta-Vidal; Lars Bolund; Jun Wang; Richard Durbin
Journal:  Nucleic Acids Res       Date:  2007-12-01       Impact factor: 16.971

10.  A map of rice genome variation reveals the origin of cultivated rice.

Authors:  Xuehui Huang; Nori Kurata; Xinghua Wei; Zi-Xuan Wang; Ahong Wang; Qiang Zhao; Yan Zhao; Kunyan Liu; Hengyun Lu; Wenjun Li; Yunli Guo; Yiqi Lu; Congcong Zhou; Danlin Fan; Qijun Weng; Chuanrang Zhu; Tao Huang; Lei Zhang; Yongchun Wang; Lei Feng; Hiroyasu Furuumi; Takahiko Kubo; Toshie Miyabayashi; Xiaoping Yuan; Qun Xu; Guojun Dong; Qilin Zhan; Canyang Li; Asao Fujiyama; Atsushi Toyoda; Tingting Lu; Qi Feng; Qian Qian; Jiayang Li; Bin Han
Journal:  Nature       Date:  2012-10-03       Impact factor: 49.962

View more
  21 in total

1.  Protein kinase A-mediated phosphorylation of the Broad-Complex transcription factor in silkworm suppresses its transcriptional activity.

Authors:  Wenliang Qian; Xiaoxu Gang; Tianlei Zhang; Ling Wei; Xinxin Yang; Zheng Li; Yan Yang; Liang Song; Peng Wang; Jian Peng; Daojun Cheng; Qingyou Xia
Journal:  J Biol Chem       Date:  2017-06-05       Impact factor: 5.157

2.  Database Resources of the National Genomics Data Center in 2020.

Authors: 
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

3.  Database Resources of the BIG Data Center in 2018.

Authors: 
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

4.  CottonFGD: an integrated functional genomics database for cotton.

Authors:  Tao Zhu; Chengzhen Liang; Zhigang Meng; Guoqing Sun; Zhaoghong Meng; Sandui Guo; Rui Zhang
Journal:  BMC Plant Biol       Date:  2017-06-08       Impact factor: 4.215

5.  Rice SNP-seek database update: new SNPs, indels, and queries.

Authors:  Locedie Mansueto; Roven Rommel Fuentes; Frances Nikki Borja; Jeffery Detras; Juan Miguel Abriol-Santos; Dmytro Chebotarov; Millicent Sanciangco; Kevin Palis; Dario Copetti; Alexandre Poliakov; Inna Dubchak; Victor Solovyev; Rod A Wing; Ruaraidh Sackville Hamilton; Ramil Mauleon; Kenneth L McNally; Nickolai Alexandrov
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

6.  The BIG Data Center: from deposition to integration to translation.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

7.  eGenPub, a text mining system for extending computationally mapped bibliography for UniProt Knowledgebase by capturing centrality.

Authors:  Ruoyao Ding; Emmanuel Boutet; Damien Lieberherr; Michel Schneider; Michael Tognolli; Cathy H Wu; K Vijay-Shanker; Cecilia N Arighi
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

8.  SnpHub: an easy-to-set-up web server framework for exploring large-scale genomic variation data in the post-genomic era with applications in wheat.

Authors:  Wenxi Wang; Zihao Wang; Xintong Li; Zhongfu Ni; Zhaorong Hu; Mingming Xin; Huiru Peng; Yingyin Yao; Qixin Sun; Weilong Guo
Journal:  Gigascience       Date:  2020-06-01       Impact factor: 6.524

Review 9.  A comprehensive review of web-based resources of non-coding RNAs for plant science research.

Authors:  Peiran Liao; Shipeng Li; Xiuming Cui; Yun Zheng
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

10.  The 2016 database issue of Nucleic Acids Research and an updated molecular biology database collection.

Authors:  Daniel J Rigden; Xosé M Fernández-Suárez; Michael Y Galperin
Journal:  Nucleic Acids Res       Date:  2016-01-04       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.