| Literature DB >> 30555503 |
Keiichi Mochida1,2,3,4, Satoru Koda5, Komaki Inoue1, Ryuei Nishii6.
Abstract
Statistical and machine learning (ML)-based methods have recently advanced in construction of gene regulatory network (GRNs) based on high-throughput biological datasets. GRNs underlie almost all cellular phenomena; hence, comprehensive GRN maps are essential tools to elucidate gene function, thereby facilitating the identification and prioritization of candidate genes for functional analysis. High-throughput gene expression datasets have yielded various statistical and ML-based algorithms to infer causal relationship between genes and decipher GRNs. This review summarizes the recent advancements in the computational inference of GRNs, based on large-scale transcriptome sequencing datasets of model plants and crops. We highlight strategies to select contextual genes for GRN inference, and statistical and ML-based methods for inferring GRNs based on transcriptome datasets from plants. Furthermore, we discuss the challenges and opportunities for the elucidation of GRNs based on large-scale datasets obtained from emerging transcriptomic applications, such as from population-scale, single-cell level, and life-course transcriptome analyses.Entities:
Keywords: gene regulatory network; machine learning; sparse modeling; time series analysis; transcriptome
Year: 2018 PMID: 30555503 PMCID: PMC6281826 DOI: 10.3389/fpls.2018.01770
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Examples of statistics-based and machine learning-based algorithms used for GRN inference in plants and other species.
| Data type | Algorithm | Organism | Reference |
|---|---|---|---|
| Time series | Fused LASSO regression | ||
| ARX model and GroupSCAD | |||
| Inferelator | |||
| GENIST | |||
| Non-time series | CLR | ||
| PRC | |||
| Various biological and experimental conditions | MinReg | ||
| Various tissues | GENIE3 | ||
| Time series (spatial and temporal) | DFG | ||
Examples of combined approaches for GRN inference in plants and other species.
| Data type | Algorithm | Organisms | Reference |
|---|---|---|---|
| Time series | Inferelator, TIGRESS, and GENIE3 | ||
| ARACNE, GENIE3, TIGRESS, Partial correlation, and CLR | |||
| Time series (development) | GRACE (Random forest and ensembles of Markov Random Fields) | ||