| Literature DB >> 27096930 |
Nurcan Tuncbag1, Sara J C Gosline1, Amanda Kedaigle1, Anthony R Soltis1, Anthony Gitter1, Ernest Fraenkel1.
Abstract
High-throughput, 'omic' methods provide sensitive measures of biological responses to perturbations. However, inherent biases in high-throughput assays make it difficult to interpret experiments in which more than one type of data is collected. In this work, we introduce Omics Integrator, a software package that takes a variety of 'omic' data as input and identifies putative underlying molecular pathways. The approach applies advanced network optimization algorithms to a network of thousands of molecular interactions to find high-confidence, interpretable subnetworks that best explain the data. These subnetworks connect changes observed in gene expression, protein abundance or other global assays to proteins that may not have been measured in the screens due to inherent bias or noise in measurement. This approach reveals unannotated molecular pathways that would not be detectable by searching pathway databases. Omics Integrator also provides an elegant framework to incorporate not only positive data, but also negative evidence. Incorporating negative evidence allows Omics Integrator to avoid unexpressed genes and avoid being biased toward highly-studied hub proteins, except when they are strongly implicated by the data. The software is comprised of two individual tools, Garnet and Forest, that can be run together or independently to allow a user to perform advanced integration of multiple types of high-throughput data as well as create condition-specific subnetworks of protein interactions that best connect the observed changes in various datasets. It is available at http://fraenkel.mit.edu/omicsintegrator and on GitHub at https://github.com/fraenkel-lab/OmicsIntegrator.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27096930 PMCID: PMC4838263 DOI: 10.1371/journal.pcbi.1004879
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 5Anticipated results: In silico EGFR knock-out experiment in network modeling.
Blue nodes represent ‘Steiner nodes’ that were not measured as changing in the original experiment but are identified through network reconstruction; yellow nodes represent ‘terminal nodes’ that are the phosphoproteomic hits. The original network and the network with EGFR knock-out have been merged to clearly show the common and different nodes and edges in the two conditions. Common edges in two conditions are black lines, edges only present in EGFR knock-out condition are red dotted lines and edges only present in the wild-type condition are blue dashed lines. Cell surface receptors are arrow-shaped. The parameters are μ = 0.002, ω = 2, β = 150, and D = 10.
Description of the parameters used in the Forest.py script.
| -p PRIZEFILE, --prize=PRIZEFILE | Path to the text file containing the prizes | Path to a tab-delimited plain text file with lines “ProteinName PrizeValue” |
| -e EDGEFILE, --edge=EDGEFILE | Path to the text file containing the interactome edges | Path to a tab-delimited plain text file with 3 or 4 columns: “ProteinA ProteinB Weight(between 0 and 1) Directionality(U or D, optional) |
| -c CONFFILE, --conf=CONFFILE | Path to the text file containing the parameters. Default = “./conf.txt” | Path to a tab-delimited plain text file with lines “ParameterName = ParameterValue”. Must contain values for w, b, D. Optional parameters mu, garnetBeta, g may also be included |
| -d DUMMYMODE, --dummyMode=DUMMYMODE | Tells the program which nodes in the interactomes to connect to the dummy root node. Default = “terminals” | Either a file name (containing a list of nodes), “terminals” (connect to all prize nodes), “others” (connect to nodes with no prize), or “all” (connect to all nodes) |
| --garnet=GARNETOUTPUT | Tells the program that it will also use the Garnet output for network modeling. The prizes will be scaled by the garnetBeta parameter you provide in the conf file, default 0.01 | Full path + filename of the Garnet output file |
| --musquared | Flag to add negative prizes to hub nodes proportional to their degree2, rather than degree. Use to penalize hub nodes more intensely. Must specify a positive mu in conf file. | |
| --msgpath=MSGPATH | Path to the msgsteiner executable, including the executable name. Default = “./msgsteiner” | Path where the msgsteiner executable is located |
| --outpath=OUTPUTPATH | Path to the directory which will hold the output file. Default = this directory | Path |
| --outlabel=OUTPUTLABEL | A string to put at the beginning of the names of the files output by this program. Default = “result” | String |
| --cyto30 | Use this flag if you want the output files to be compatible with Cytoscape v3.0 (this is the default) | |
| --cyto28 | Use this flag if you want the output files to be compatible with Cytoscape v2.8 | |
| --noisyEdges=NOISENUM | Specifies how many times you would like to add noise to the given edge values and re-run the algorithm. Results of these runs will be merged together and written in files with the word “_noisyEdges_” added to their names. Default = 0 | Integer |
| --shuffledPrizes=SHUFFLENUM | Specifies how many times you would like to shuffle the given prizes around the terminals and re-run the algorithm. Results of these runs will be merged together and written in files with the word “_shuffledPrizes_” added to their names. Default = 0 | Integer |
| --randomTerminals=TERMNUM | Specifies how many times you would like to apply the given prizes to random nodes in the interactome (with a similar degree distribution) and re-run the algorithm. Results of these runs will be merged together and written in files with the word “_randomTerminals_” added to their names. Default = 0 | Integer |
| --knockout=KNOCKOUT | Specifies protein(s) you would like to “knock out” of the interactome to simulate a knock-out experiment | The name(s) of the protein(s), i.e. TP53 or TP53 EGFR |
| -s SEED, --seed=SEED | A seed for the pseudo-random number generators. If you want to reproduce exact results, supply the same seed. Default = None | Integer |