| Literature DB >> 25435802 |
Andrés Quintero1, Jorge Ramírez2, Luis Guillermo Leal3, Liliana López-Kleine4.
Abstract
Relationships between genes are best represented using networks constructed from information of different types, with metabolic information being the most valuable and widely used for genetic network reconstruction. Other types of information are usually also available, and it would be desirable to systematically include them in algorithms for network reconstruction. Here, we present an algorithm to construct a global metabolic network that uses all available enzymatic and metabolic information about the organism. We construct a global enzymatic network (GEN) with a total of 4226 nodes (EC numbers) and 42723 edges representing all known metabolic reactions. As an example we use microarray data for Arabidopsis thaliana and combine it with the metabolic network constructing a final gene interaction network for this organism with 8212 nodes (genes) and 4606,901 edges. All scripts are available to be used for any organism for which genomic data is available.Entities:
Keywords: EC number; Gene network reconstruction; Global enzymatic network; KEGG; Perl.
Year: 2014 PMID: 25435802 PMCID: PMC4245699 DOI: 10.2174/1389202915666140807004909
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Description of all the scripts used for the GEN, OEN and WOEN reconstruction. ^scripts written in italics are for users that want to use KEGG ftp database downloaded data; *input and output files names written in quotations marks for easy differentiation with running text, names used can change depending of the user preferences.
| Script | Script^ | Description | Input Files* | Output Files* |
|---|---|---|---|---|
| GEN Construction | ||||
| 1.2SGEN | 1-2_Fetch_EC_met.pl | Downloads EC numbers and reactions data and prints a simplified reaction | NA | "reactions_slim" |
| - | Creates a equivalence list of complete names and codes for metabolites | Compound information file from KEGG ftp | "metabolite_name_code" | |
| - | Prints a simplified reaction | "metabolite_name_code" | "reactions_slim" | |
| 3SGEN | 3_filter_reactions_slim.pl | Filter n most common metabolites | "reactions_slim" | "reactions_slim.filter_n" |
| 4SGEN | 4_network_constuction.pl | Prints the GEN as a list of node pairs | "reactions_slim.filter_n" | "network.reactions_slim.filter_n";"eclist" |
| OEN Construction | ||||
| 1SOEN | 1_Fetch_genes.pl | Downloads all GSE and BLAST them against genome of interest | "eclist" | "Blasted_genes.list" |
| 2SOEN | 2_DataBase_network.pl | Assigns edges within genes comparing associated enzymatic activities in GEN | "Blasted_genes.list";"network.reactions_slim.filter_n" | "Gen-Gen.list" |
| - | 3_Gen-gen_adjacency_matrix.pl | Prints the OEN as an adjacency matrix | "Gen-Gen.list" | "Gen-Gen_adjacency.matrix" |
| WOEN Construction | ||||
| 1SWOEN | 4_Weight_adjacency_matrix.pl | Weights the OEN using the GESM | "Gen-Gen_adjacency.matrix"; a GESM valid file | "Gen-Gen_similarity.matrix" |
Topological variables measured in the global networks.
| Variable | GEN | OEN | WOEN |
|---|---|---|---|
| Nodes | 4,226 | 9,829 | 8,212 |
| Edges | 42,753 | 6,444,453 | 4,606,901 |
| Clustering coefficient | 0.52 | 0.48 | 0.49 |
| Average path length | 4.22 | 2.03 | 2.02 |
| Centralization | 0.05 | 0.01 | 0.01 |
WOEN edges validated for a selection of IRGs.
| IRG | Prediction | References |
|---|---|---|
| [24, 26] | ||
| [26, 27] | ||
| [27] | ||
| [30] | ||
| [29] | ||
| [29] | ||
| [28] | ||
| [28] | ||
| [28] | ||
| [32] | ||
| [31] | ||
| [28] | ||
| [34] |