| Literature DB >> 29474376 |
Qi Zhang1, Yao Yu2, Jun Zhang3, Hua Liang4.
Abstract
With the development of biotechnology, high-throughput studies on protein-protein, protein-gene, and gene-gene interactions become possible and attract remarkable attention. To explore the interactions in dynamic gene regulatory networks, we propose a single-index ordinary differential equation (ODE) model and develop a variable selection procedure. We employ the smoothly clipped absolute deviation penalty (SCAD) penalized function for variable selection. We analyze a yeast cell cycle gene expression data set to illustrate the usefulness of the single-index ODE model. In real data analysis, we group genes into functional modules using the smoothing spline clustering approach. We estimate state functions and their first derivatives for functional modules using penalized spline-based nonparametric mixed-effects models and the spline method. We substitute the estimates into the single-index ODE models, and then use the penalized profile least-squares procedure to identify network structures among the models. The results indicate that our model fits the data better than linear ODE models and our variable selection procedure identifies the interactions that may be missed by linear ODE models but confirmed in biological studies. In addition, Monte Carlo simulation studies are used to evaluate and compare the methods.Entities:
Mesh:
Year: 2018 PMID: 29474376 PMCID: PMC5825071 DOI: 10.1371/journal.pone.0192833
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The scatterplot of gene expressions against time (the same color for each individual gene in a module) and the population mean curve (solid line) of 12 modules for the time course yeast cell data set.
The inward and outward regulations in the module-based regulatory network and RSS based on the linear ODE (L-ODE) and the single-index ODE (Si-ODE).
| Module | Selected function annotation and associated p-values (in parentheses) | Outward influence modules | Inward influence modules | RSS | |||
|---|---|---|---|---|---|---|---|
| L-ODE | Si-ODE | L-ODE | Si-ODE | L-ODE | Si-ODE | ||
| module1 (12) | DNA replication (0.013), regulation of RNA metabolic process (0.014), meiosis (0.021) | 3, 5, 7, 8 | 1, 3, 7, 8, 9, 12 | 2, 6, 9, 12 | 1, 2, 3, 4, 5, 8, 9, 10, 11 | 3.07E-03 | 1.41E-04 |
| module2 (30) | cellular carbohydrate biosynthetic process (0.006), | 1, 7, 9 | 1, 2, 7, 8, 9, 12 | 7 | 2, 4, 5, 8, 9, 10, 12 | 5.20E-04 | 3.38E-05 |
| module3 (15) | protein—DNA complex assembly (< 0.001), DNA packaging (< 0.001) | 3, 7, 8, 10 | 1, 7, 8, 12 | 1, 3, 4, 5, 6, 7, 8, 12 | 1, 4, 5, 6, 7, 8, 10, 11, 12 | 3.37E-03 | 3.58E-03 |
| module4 (32) | DNA metabolic process (< 0.001), DNA replication (< 0.001), DNA repair (< 0.001), cell-division cycle (0.025) | 3, 5, 7, 8 | 1, 2, 3, 7, 8, 9, 11, 12 | NA | 6, 8, 9, 11, 12 | 4.52E-03 | 1.05E-03 |
| module5 (16) | interphase of mitotic cell cycle (< 0.001), DNA replication initiation (< 0.001) | 3, 5, 7, 8 | 1, 2, 3, 5, 8, 9, 11, 12 | 1, 4, 5, 6, 7, 8, 9, 10, 11, 12 | 5, 7, 10, 11 | 5.28E-03 | 3.45E-04 |
| module6 (38) | lipoprotein biosynthetic process and metabolic process (0.004), regulation of DNA metabolic process (0.005), chromosome organization (< 0.001) | 1, 3, 5, 7 | 3, 4, 6, 7, 8, 9, 12 | NA | 6, 9 | 1.10E-03 | 2.88E-05 |
| module7 (20) | nuclear division (< 0.001), cell-division (< 0.001), mitosis (< 0.001) | 2, 3, 5, 8 | 3, 5, 7, 8, 9, 10, 12 | 1, 2, 3, 4, 5, 6, 8, 9, 10, 11 | 1, 2, 3, 4, 6, 7, 8, 10 | 2.10E-03 | 2.67E-04 |
| module8 (9) | cell cycle (0.007), regulation of cell cycle (0.025) | 3, 5, 7, 8 | 1, 2, 3, 4, 7, 8, 9, 11, 12 | 1, 3, 4, 5, 7, 8, 9, 10, 11 | 1, 2, 3, 4, 5, 6, 7, 8, 9, 11 | 1.01E-02 | 7.90E-04 |
| module9 (35) | Glycosylation (< 0.001), mitotic cell cycle (< 0.001), nuclear division (< 0.001) | 1, 5, 7, 8 | 1, 2, 4, 6, 8, 11 | 2 | 1, 2, 4, 5, 6, 7, 8, 11 | 5.20E-04 | 1.34E-05 |
| module10 (14) | regulation of cell cycle (< 0.001), regulation of cell cycle process (0.001) | 5, 7, 8, 12 | 1, 2, 3, 5, 7, 11, 12 | 3, 11 | 7, 11 | 6.55E-03 | 6.82 E-06 |
| module11 (53) | cell cycle (0.043), nuclear migration along microtubule (0.012) | 5, 7, 8, 10 | 1, 3, 4, 5, 8, 9, 10 | NA | 4, 5, 8, 9, 10 | 8.03E-05 | 8.70E-06 |
| module12 (23) | mitotic recombination (< 0.001), DNA metabolic process (< 0.001) | 1, 3, 5 | 2, 3, 4 | 10 | 1, 2, 3, 4, 5, 6, 7, 8, 10 | 2.27E-03 | 1.41E-04 |
Fig 2The GRN identified by the linear ODE models for the time course yeast cell data set.
Each node represents a module and the arrows presents the direction of influence.
Fig 3The GRN identified by the single-index ODE models for the time course yeast cell data set.
Each node represents a module and the arrows presents the direction of influence.
The simulation results for the SCAD method for scenarios with different sample sizes based on 100 replications.
The simulation results for the SCAD method for scenarios with different sample sizes based on 100 replications. Correctly fitted (C); underfitted (U); overfitted(O).
| ODE | C | U | O | MSE | MSE | ARE(%) | ARE | |
|---|---|---|---|---|---|---|---|---|
| 1 | 100 | 0 | 0 | < 0.001 | < 0.001 | 0.005 | 0.004 | |
| < 0.001 | < 0.001 | 0.005 | 0.004 | |||||
| 2 | 96 | 0 | 3 | 0.012 | < 0.001 | 4.012 | 0.013 | |
| 0.028 | < 0.001 | 4.005 | 0.006 | |||||
| 3 | 97 | 1 | 2 | 0.014 | < 0.001 | 2.271 | 0.006 | |
| 0.007 | < 0.001 | 2.366 | 0.013 | |||||
| 4 | 99 | 0 | 1 | 0.004 | < 0.001 | 1.025 | 0.024 | |
| 0.006 | < 0.001 | 1.014 | 0.013 | |||||
| 5 | 65 | 6 | 24 | 0.295 | 0.255 | 35.181 | 32.363 | |
| 0.064 | 0.057 | 32.669 | 30.219 | |||||
| 6 | 93 | 2 | 3 | 0.053 | 0.005 | 6.443 | 1.105 | |
| 0.014 | 0.003 | 6.811 | 1.538 | |||||
| 7 | 100 | 0 | 0 | < 0.001 | < 0.001 | 0.004 | 0.003 | |
| < 0.001 | < 0.001 | 0.016 | 0.014 | |||||
| 1 | 100 | 0 | 0 | < 0.001 | < 0.001 | 0.002 | 0.001 | |
| < 0.001 | < 0.001 | 0.002 | 0.001 | |||||
| 2 | 96 | 0 | 4 | 0.012 | < 0.001 | 4.003 | 0.003 | |
| 0.028 | < 0.001 | 4.001 | 0.001 | |||||
| 3 | 97 | 1 | 2 | 0.014 | < 0.001 | 2.264 | 0.002 | |
| 0.007 | < 0.001 | 2.49 | 0.003 | |||||
| 4 | 100 | 0 | 0 | < 0.001 | < 0.001 | 0.007 | 0.007 | |
| < 0.001 | < 0.001 | 0.004 | 0.004 | |||||
| 5 | 77 | 5 | 13 | 0.166 | 0.14 | 21.242 | 18.046 | |
| 0.055 | 0.035 | 23.601 | 18.503 | |||||
| 6 | 96 | 1 | 2 | 0.024 | < 0.001 | 3.167 | 0.001 | |
| 0.006 | < 0.001 | 3.19 | 0.003 | |||||
| 7 | 100 | 0 | 0 | < 0.001 | < 0.001 | 0.001 | 0.001 | |
| < 0.001 | < 0.001 | 0.004 | 0.004 | |||||
| 1 | 100 | 0 | 0 | < 0.001 | < 0.001 | 0.001 | 0.001 | |
| < 0.001 | < 0.001 | 0.001 | 0.001 | |||||
| 2 | 96 | 0 | 4 | 0.012 | < 0.001 | 4.002 | 0.002 | |
| 0.028 | < 0.001 | 4.001 | 0.001 | |||||
| 3 | 97 | 1 | 2 | 0.014 | < 0.001 | 2.278 | 0.001 | |
| 0.007 | < 0.001 | 2.402 | 0.002 | |||||
| 4 | 100 | 0 | 0 | < 0.001 | < 0.001 | 0.005 | 0.005 | |
| < 0.001 | < 0.001 | 0.003 | 0.003 | |||||
| 5 | 78 | 5 | 10 | 0.15 | 0.122 | 19.267 | 15.852 | |
| 0.039 | 0.032 | 20.493 | 17.214 | |||||
| 6 | 96 | 0 | 4 | 0.032 | < 0.001 | 4 | < 0.001 | |
| 0.008 | < 0.001 | 4.002 | 0.002 | |||||
| 7 | 100 | 0 | 0 | < 0.001 | < 0.001 | 0.001 | < 0.001 | |
| < 0.001 | < 0.001 | 0.002 | 0.002 | |||||
Fig 4The constructed gene regulatory networks for simulation studies with N = 180 and 100 iterations.
Solid lines: the true connections, numbers present: the times correctly identified using our procedure in 100 iteration, dots line: incorrectly identified connections.
Fig 5The constructed gene regulatory networks for simulation studies with N = 288 and 100 iterations.
The legend is the same as in Fig 4.
Fig 6The constructed gene regulatory networks for simulation studies with N = 360 and 100 iterations.
The legend is the same as in Fig 4.