| Literature DB >> 36235479 |
Muhammad-Redha Abdullah-Zawawi1,2, Nisha Govender2, Sarahani Harun2, Nor Azlan Nor Muhammad2, Zamri Zainal2,3, Zeti-Azura Mohamed-Hussein2,3.
Abstract
In higher plants, the complexity of a system and the components within and among species are rapidly dissected by omics technologies. Multi-omics datasets are integrated to infer and enable a comprehensive understanding of the life processes of organisms of interest. Further, growing open-source datasets coupled with the emergence of high-performance computing and development of computational tools for biological sciences have assisted in silico functional prediction of unknown genes, proteins and metabolites, otherwise known as uncharacterized. The systems biology approach includes data collection and filtration, system modelling, experimentation and the establishment of new hypotheses for experimental validation. Informatics technologies add meaningful sense to the output generated by complex bioinformatics algorithms, which are now freely available in a user-friendly graphical user interface. These resources accentuate gene function prediction at a relatively minimal cost and effort. Herein, we present a comprehensive view of relevant approaches available for system-level gene function prediction in the plant kingdom. Together, the most recent applications and sought-after principles for gene mining are discussed to benefit the plant research community. A realistic tabulation of plant genomic resources is included for a less laborious and accurate candidate gene discovery in basic plant research and improvement strategies.Entities:
Keywords: computational approaches; functional genomics; metabolomics; plant breeding; systems biology; transcriptomics
Year: 2022 PMID: 36235479 PMCID: PMC9573505 DOI: 10.3390/plants11192614
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Scholarly omics-related articles published under the plant sciences category from 2012 to 2022. The literature search using Web of Science (https://www.webofknowledge.com) search engine was accessed on 18 September 2022 with Boolean ‘or’ and the following keywords: genomic, genome, transcriptomic, transcriptome, proteomic, proteome, metabolomic and metabolome.
Plant omics databases, as accessed on 24 August 2022.
| Omics Type | Database | Organism | URL | References |
|---|---|---|---|---|
| Genomics | Plant Genome Database (PlantGDB) | Plants |
| [ |
| Plant Genome DataBase Japan (PGDBj) | Plants |
| [ | |
| National Center for Biotechnology Information (NCBI) | Various |
| [ | |
| Ensembl Plants | Plants |
| [ | |
| Phytozome | Plants |
| [ | |
| PLAZA | Plants |
| [ | |
| Plant Genome and Systems Biology (PGSB PlantsDB) | Plants |
| [ | |
| Chloroplast Genome Database (ChloroplastDB) | Plants |
| [ | |
| The Solanaceae Genomics Resource (Spud DB) | Potato |
| [ | |
| Melon Genome Database (Melonomics) | Melon |
| [ | |
| Maize Genetics and Genomics Database (MaizeGDB) | Maize |
| [ | |
| Rice Annotation Project Database (RAP-DB) | Rice |
| [ | |
| Rice Genome Annotation Project (RGAP) | Rice |
| [ | |
| GrainGenes | Wheat, Barley, rye, oat |
| [ | |
| SoyBase | Soy |
| [ | |
| Genome Database for Rosaceae (GDR) | Rosaceae plants |
| [ | |
| Brassica Database (BRAD) | Brassica plants |
| [ | |
| Transcriptomics | Gene Expression Omnibus (GEO) | Various |
| [ |
| AgriSeqDB | Plants |
| [ | |
| The Bio-Analytic Resource for Plant Biology (BAR) | Plants |
| [ | |
| and | The Arabidopsis Information Resource (TAIR) | Arabidopsis |
| [ |
| Transcriptome Variation Analysis (TRAVA) | Arabidopsis |
| [ | |
| The Rice Expression Profile Database (RiceXPro) | Rice |
| [ | |
| Transcriptome Encycloperdia of Rice (TENOR) | Rice |
| [ | |
| Barley Gene Expression Database (Bex-db) | Barley |
| [ | |
| Plant Stress RNA-seq Nexus (PSRN) | Plants |
| [ | |
| Plant microRNA database (PMRD) | Plants |
| [ | |
| Interactomics | STRING | Various |
| [ |
| Database of Interacting Proteins (DIP) | Various |
| [ | |
| Protein–Protein Interaction Database for Maize (PPIM) | Maize |
| [ | |
| IntAct | Various |
| [ | |
| Oryza sativa Protein–Protein Interaction Network (PRIN) | Rice |
| [ | |
| Biomolecular Interaction Network Database (BIND) | Various |
| [ | |
| The Biological General Repository for Interaction Datasets (BioGRID) | Various |
| [ | |
| Arabidopsis thaliana Protein Interaction Network (AtPIN) | Arabidopsis |
| [ | |
| PlaPPISite | Plants |
| [ | |
| 3D interacting domains (3did) | Various |
| [ | |
| Molecular INTeraction database (MINT) | Various |
| [ | |
| ATTED-II | Plants |
| [ | |
| CressExpress | Arabidopsis |
| [ | |
| Arabidopsis Network (AraNet) | Arabidopsis |
| [ | |
| Co-expressed Biological Processes (CoP) | Plants |
| [ | |
| EXPath | Plants |
| [ | |
| Plant Omics Data Center (PODC) | Plants |
| [ | |
| Plant Netwrok (PlaNet) | Plants |
| [ | |
| OryzaExpress | Rice |
| [ | |
| PlantExpress | Rice, Arabidopsis |
| [ | |
| Rice Functionally Related Gene Expression Network Database (RiceFREND) | Rice |
| [ | |
| Grape |
| [ | ||
| GeneMania | Various |
| [ | |
| A Comprehensive Systems-Biology Database (CSB.DB) | Various |
| [ | |
| RapaNet | Brassica |
| [ | |
| Rice Expression Database (RED) | Rice |
| [ | |
| PhytoNet | Various |
| [ | |
| CoNekT | Plants |
| [ | |
| CoCoCoNet | Plants |
| [ |
Figure 2Type of omics datasets. Omics datasets can be divided into two categories: modules and interactions. Module data indicate molecular sequences of biological systems; DNA (genome) transcribed into mRNA (transcriptome), later translated into proteins (proteome), and lastly synthesized into metabolite (metabolome). Interaction data, known as interactomes, represent the relationships of module data generated from respective platforms. The omics resources of the omics technologies and network interactions can be downloaded from the databases.
Figure 3Guilt-by-association techniques for candidate gene discovery. (A) Preliminary selection of candidate gene using the biological network, in which the co-functional information of candidate gene with known gene (i.e., metabolic or other functional genes) can be extracted from gene correlation and coregulation/regulatory network (steps 1–2). (B) Co-functional information can be inferred by gene context analyses (steps 3–7). Candidate genes will be observed based on enriched in a similar function (step 3), clustered in a monophyletic group (step 4), shared similar distributions of motifs (step 5) and exon/intron structure (step 6), and lastly consist similar CREs (step 7). (C) The ranking of the high confidence candidate gene will be observed based on the predicted co-function similarity.