| Literature DB >> 35807577 |
Rabiatul-Adawiah Zainal-Abidin1, Sarahani Harun2, Vinothienii Vengatharajuloo2, Amin-Asyraf Tamizi1,3, Nurul Hidayah Samsulrizal3.
Abstract
Transcriptomics has significantly grown as a functional genomics tool for understanding the expression of biological systems. The generated transcriptomics data can be utilised to produce a gene co-expression network that is one of the essential downstream omics data analyses. To date, several gene co-expression network databases that store correlation values, expression profiles, gene names and gene descriptions have been developed. Although these resources remain scattered across the Internet, such databases complement each other and support efficient growth in the functional genomics area. This review presents the features and the most recent gene co-expression network databases in crops and summarises the present status of the tools that are widely used for constructing the gene co-expression network. The highlights of gene co-expression network databases and the tools presented here will pave the way for a robust interpretation of biologically relevant information. With this effort, the researcher would be able to explore and utilise gene co-expression network databases for crops improvement.Entities:
Keywords: bioinformatics tool; crop; database; gene co-expression network; transcriptomics
Year: 2022 PMID: 35807577 PMCID: PMC9269215 DOI: 10.3390/plants11131625
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Summary of the gene co-expression network analysis pipeline. A co-expression network study is usually initiated by a biological question that would affect the experimental design of the RNA-seq and microarray experiments. The gene expression data can also be retrieved from transcriptome databases, i.e., SRA, GEO Profile and ArrayExpress. First, normalisation will be performed on the input transcriptome datasets. The generated data matrix comprises columns containing different samples and rows corresponding to genes. Next, the correlation analysis using the Pearson’s Correlation Coefficient (PCC) will be performed to calculate the degree of similarity between the gene expression profiles. Finally, the undirected GCN construction will calculate the whole gene pairs in the data matrix. The selected threshold value calculated by PCC to infer significantly co-expressed genes is >0.9 or <−0.9, highlighted in grey.
Figure 2The application of GBA in identifying potential genes. First, a correlation analysis will be calculated to determine the co-expressed genes. Then, the generated GCN will be used in the clustering analysis using clustering tools, such as, MCODE, to extract the densely connected regions (yellow nodes). The GBA approach can elucidate the potential genes (red nodes) with the co-expressed known genes (blue nodes). The blue nodes are known to be involved in glucosinolate biosynthesis, which can be used to infer the red nodes as potential genes in glucosinolate biosynthesis.
List of co-expression tools for a gene co-expression network analysis in the crops.
| Types | Co-Expression Network Tool | Descriptions | References |
|---|---|---|---|
| Web-based | CORNET 2.0 | An integrating tool for plant co-expression network | [ |
| A comparative co-expression network construction and visualisation | [ | ||
| PlaNet | A tool for comparative co-expression network analyses | [ | |
| RECoN | A co-expression tool to identify co-expressed genes in abiotic stress response | [ | |
| webCemiTool | A web-based tool to identify co-expression modules in a given co-expression network | [ | |
| CoExp | A web tool for the exploitation of co-expression networks | [ | |
| Command-line based tool & require installation | WGCNA | An R package for performing weighted correlation network analysis | [ |
| petal | An R package for co-expression network modelling | [ | |
| LSTrAP | A pipeline to construct co-expression networks from RNA-seq data | [ | |
| COGENT | An R package to construct a gene co-expression network without the need for annotation or external validation data. | [ | |
| GWENA | An R package developed to extend the analysis of gene co-expression network | [ | |
| Juxtapose | A tool to compare gene co-expression networks (GCNs) | [ |
Summary of gene co-expression network-related databases in publicly available crops.
| Plant Species | Databases | Descriptions | Statistical Methods | References |
|---|---|---|---|---|
|
| Rice Expression database | A repository of gene expression profiles and co-expression network. | PCC | [ |
| RiceFrend | A gene co-expression database in rice based on an extensive collection of microarray data derived from various tissues/organs at different stages of growth and development under natural field conditions. | PCC | [ | |
|
| MCENet | A database for maize co-expression networks. | PCC and Mutual Rank | [ |
| Maize gene co-expression network database | A gene co-expression network database for maize. | PCC, KCC, SCC and Mutual Information | [ | |
|
| Sorghum Functional Genomics Database (SorghumFDB) | A sorghum database to predict gene function. | PCC and Mutual Rank | [ |
|
| VTCdb: ViTis Co-expression DataBase | A database for co-expressed genes in grapes. | PCC, SCC, Highest Reciprocal and Mutual Rank | [ |
|
| Co-expressed pathways database for tomato | A database for co-expressed genes in tomatoes. | PCC, ORA (p-value), GSEA (p-value, percentile-scores) | [ |
|
| BambooNET | A database of co-expression networks with functional modules for bamboo. | PCC and Mutual Rank | [ |
|
| AppleMDO | A multi-dimensional omics database for apple co-expression networks and chromatin states. | PCC and Mutual Rank | [ |
|
| TeaCoN | A database of gene co-expression network for tea plants. | PCC | [ |
|
| BrassicaEDB | A database of gene co-expression network and expression profiles for Brassica crops. | PCC and weight value | [ |
| Multiple crop species | PLANEX | A plant gene co-expression database obtained from GEO NCBI. | PCC, Gene enrichment analysis (Cohen’s Kappa) | [ |
| ATTED-II | A plant co-expression database. | PCC, SCC and Mutual Rank | [ | |
| PlantNexus | A gene co-expression network database for barley and sorghum. | [ | ||
| CoNekT-P | An online platform that allows users to browse co-expression networks and perform comparative GCN analysis across different crop species (rice, maize, tomato) and others plant species. | HRR and HCCA | [ | |
| CoCoCoNet | A comparative gene co-expression network portal for a diverse range of species including plants, humans and animals. | SCC | [ |
Figure 3The co-expression network of CHS and CHI genes in rice (A) and maize (C). (A) The nodes with bold lines indicate the query genes for CHS (LOC4350636) and CHI (LOC4351321, LOC4334588, LOC4349607) in rice. (C) In maize, the query genes for CHS (LOC100282642, LOC100274415) and CHI (LOC100284018) are also shown with bold lines. (A,C) The co-expression network consists of genes classified based on subcellular location, calculated by TargetP. The flavonoid biosynthesis genes were selected based on KEGG in the ATTED-II database, as shown by red nodes in rice (B) and maize (D). (D) The highlighted gene, LOC100273383 was found to be co-expressed with known flavonoid biosynthetic genes (CHS and CHI).