| Literature DB >> 26223238 |
Ki-Hong Jung1, Peijian Cao, Rita Sharma, Rashmi Jain, Pamela C Ronald.
Abstract
The completion of whole genome sequence of rice (Oryza sativa) has significantly accelerated functional genomics studies. Prior to the release of the sequence, only a few genes were assigned a function each year. Since sequencing was completed in 2005, the rate has exponentially increased. As of 2014, 1,021 genes have been described and added to the collection at The Overview of functionally characterized Genes in Rice online database (OGRO). Despite this progress, that number is still very low compared with the total number of genes estimated in the rice genome. One limitation to progress is the presence of functional redundancy among members of the same rice gene family, which covers 51.6 % of all non-transposable element-encoding genes. There remain a significant portion or rice genes that are not functionally redundant, as reflected in the recovery of loss-of-function mutants. To more accurately analyze functional redundancy in the rice genome, we have developed a phylogenomics databases for six large gene families in rice, including those for glycosyltransferases, glycoside hydrolases, kinases, transcription factors, transporters, and cytochrome P450 monooxygenases. In this review, we introduce key features and applications of these databases. We expect that they will serve as a very useful guide in the post-genomics era of research.Entities:
Year: 2015 PMID: 26223238 PMCID: PMC4519439 DOI: 10.1186/s12284-015-0060-7
Source DB: PubMed Journal: Rice (N Y) ISSN: 1939-8425 Impact factor: 4.783
Fig. 1Snapshots from homepage for rice phylogenomics databases and Tree viewer sites for six phylogenomics databases
Summary of the types of data integrated in phylogenomics databases
| Data type | Providing information or data | Reference |
|---|---|---|
| Sequence | locus IDs from MSU-RGAP and The Rice Annotation Project Database (RAP-DB; | (IRGSP |
| Sequence Quality | TE-relatedness, existence of EST/cDNA, and Program to Assemble Spliced Alignments (PASA) status | |
| Orthologs in Plants | orthologs from 12 plant species, (i.e., | (Berglund et al. 2008) |
| Topology | Transmembrane Domain (TM), N-terminal Myristoylation Site (Myrist), N-terminal Signal Peptide (SignalP), Chloroplast Transit Peptide (ChloroP), and predicted Subcellular Localization | |
| Mutants | mutant lines and corresponding flanking sequence tags from eight institutes | (Chandran and Jung |
| Interactome Data | experimentally validated network of protein–protein interactions based on Yeast Two-Hybrid (Y2H) and Tandem Affinity Purification (TAP) methods | (Ding et al. |
| Digital Northern Data | normalized frequency of ESTs in selected tissues/organs | (Dardick et al. |
| MPSS mRNA Data | meta-expression data from 70 libraries | (Nakano et al. |
| MPSS Small RNA Data | meta-expression data from six libraries | (Nakano et al. |
| Microarray Data | meta-expression data from the six microarray platforms including Affymetrix, Agilent22K, Agilent44K, BGI/YALE60K, NSF20K, and NSF45K (http://ricephylogenomics.ucdavis.edu/description.shtml) | (Cao et al. |
Fig. 2Snapshots from homepage for rice kinase database showing phylogenomics data for cyclin dependent kinase (CDK) subfamily using Tree viewer tool. At Rice kinase database (RKD) homepage (a), we clicked Tree viewer option and moved to RKD Tree viewer website (b). This website consists of two parts: Kinase subfamilies and integrating data sections. After then, we selected CDK CMGC_1) from kinase subfamilies, MSU/TIGR V7 Model from sequence information, Zea mays and Arabidopsis thaliana from orthologs in plants, transmembrane domain from topology, NIAS Tos17 KO line from mutants, and show average and heatmap of Affymetrix GSE4438 dataseries (c). The scale bar for single channel array data from 7 to 13 log2 transformed intensity value was applied. Yellow color indicates the high level of expression and blue one indicates the low level
Fig. 3Case study to evaluate the functionality of phylogenomics tool: Phylogenomics analysis of aquaporin family in rice. Anatomical expression data were integrated into phylogenetic tree context of aquaporin family. Anatomical meta-expression data were generated from ROAD with 34 rice aquaporin genes. Their protein classifications revealed 20 NIPs (green boxes), 12 PIPs (pale-yellow boxes), 10 tonoplast intrinsic proteins (pink boxes), and 8 small basic intrinsic proteins (light-blue boxes). For heatmaps, blue color denotes low expression, black medium level expression and yellow indicates high expression. Numbers indicate estimated evolutionary distances