| Literature DB >> 27295368 |
Sebastián Moschen1,2, Janet Higgins3, Julio A Di Rienzo4, Ruth A Heinz1,2, Norma Paniego1,2, Paula Fernandez5,6,7.
Abstract
BACKGROUND: In recent years, high throughput technologies have led to an increase of datasets from omics disciplines allowing the understanding of the complex regulatory networks associated with biological processes. Leaf senescence is a complex mechanism controlled by multiple genetic and environmental variables, which has a strong impact on crop yield. Transcription factors (TFs) are key proteins in the regulation of gene expression, regulating different signaling pathways; their function is crucial for triggering and/or regulating different aspects of the leaf senescence process. The study of TF interactions and their integration with metabolic profiles under different developmental conditions, especially for a non-model organism such as sunflower, will open new insights into the details of gene regulation of leaf senescence.Entities:
Keywords: BioSignature Discoverer; Data integration; Leaf senescence; Metabolomic; Sunflower; Transcriptomic; WGCNA
Mesh:
Substances:
Year: 2016 PMID: 27295368 PMCID: PMC4905614 DOI: 10.1186/s12859-016-1045-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Networks exported from WGCNA visualized in Cytoscape. Network were constructed using 75 transcription factors (a) and 51 metabolites (b) and were visualized in Cytoscape [46] with an edge weight higher than 0.3. The nodes represent genes and metabolites and the edges represent connections between them. The node size and color is related to the number of connections, large red nodes represent highly connected hub genes and metabolites, small green nodes represent gene with few connections. Strong connections are visualized as wider lines
Fig. 2Integrated network of hubs metabolites and transcription factors. Pearson correlation analysis of 24 selected hubs metabolites and the list of 82 TFs statistically significant during senescence. Correlations with p-value < 0.0001 were selected and visualized in Cytoscape [46] by degree (node size and color) and edge weight (edge size and color)
List of biosignatures detected in each dataset
| Metabolites | Transcription factors | Integrated list |
|---|---|---|
| Alanine | HeAn_C_10939 | Alanine |
| HeAn_C_11118 | HeAn_S_15792 | |
| HeAn_C_11953 | HeAn_C_10939 | |
| HeAn_C_12899 | HeAn_S_38523 | |
| HeAn_C_8526 | HeAn_C_9238 | |
| HeAn_C_9238 | HeAn_S_32569 | |
| HeAn_S_15792 | HeAn_C_11118 | |
| HeAn_S_32569 | ||
| HeAn_S_36498 | ||
| HeAn_S_38523 |
Each signature is composed of a single biomarker, and all signatures are expected to have the same predictive performance (see Table 2)
Predicted performances of the selected biosignatures
| Metric | In Sample | Out Sample | 95 % Confidence Interval | |
|---|---|---|---|---|
| Metabolites | R2 | 0.997 | 0.840 | [ 0.335, 0.972 ] |
| Mean Absolute Error | 0.228 | 2.503 | [ 0.783, 4.060 ] | |
| Mean Squared Error | 0.2104 | 12.180 | [ 0.964, 24.916 ] | |
| Transcription factors | R2 | 0.983 | 0.825 | [ 0.565, 0.939 ] |
| Mean Absolute Error | 0.897 | 2.658 | [ 1.009, 4.950 ] | |
| Mean Squared Error | 1.239 | 13.315 | [ 3.743, 29.804 ] | |
| Integrated list | R2 | 0.9965 | 0.909 | [ 0.721, 0.986 ] |
| Mean Absolute Error | 0.465 | 1.779 | [ 0.487, 3.919 ] | |
| Mean Squared Error | 0.2639 | 6.927 | [ 0.469, 18.832 ] |
Performances are reported in terms of the determination coefficient R2, Mean Absolute Error (MAE) and Mean Squared Error (MSE, see text for more details on these metrics). The in-sample performances quantify the fitness of the predictive models on the training data, while the out-of-sample values estimate the expected performance on new data. Confidence intervals are calculated using a bootstrap approach
Candidate biomarkers detected by the two complementary methods, WGCNA and BioSignature Discoverer
| WGCNA | Biosignatures | ||||
|---|---|---|---|---|---|
| Sunflower ID | Arabidopsis ID | TF Family | Sunflower ID | Arabidopsis ID | TF Family |
| HeAn_C_10816 | AT1G52890 | NAC | HeAn_S_15792 | AT5G26210 | Alfin-like |
| HeAn_S_17894 | AT5G55580 | mTERF |
|
|
|
| HeAn_C_3510 | AT4G27410 | NAC | HeAn_S_38523 | AT3G16770 | AP2 |
| HeAn_S_30904 | AT4G17810 | C2H2 | HeAn_C_9238 | AT2G39770 | GRAS |
| HeAn_C_3774 | AT4G02590 | bHLH |
|
|
|
| HeAn_C_11953 | AT2G45650 | MADS |
|
|
|
| HeAn_S_36498 | AT2G30400 | OFP | Alanine | ||
|
|
|
| |||
| HeAn_C_1262 | AT1G64860 | Sigma70-like | |||
| HeAn_C_8818 | AT2G28810 | C2C2-Dof | |||
|
|
|
| |||
| HeAn_C_12238 | AT1G27320 | Orphans | |||
|
|
|
| |||
| Succinic_acid | |||||
| Malonic_acid | |||||
| Raffinose | |||||
| Xylose | |||||
| Talose | |||||
| Inositol_myo | |||||
| Glucoheptose | |||||
| Malic_acid | |||||
| Galactinol | |||||
| Sorbose | |||||
Biomarkers from WGCNA were selected based on their degree (higher than 20) and biomarkers from BioSignature Discoverer correspond to the analysis of the integrated list. Biomarkers detected independently in the two methods are in bold