MOTIVATION: The interpretation of high-throughput datasets has remained one of the central challenges of computational biology over the past decade. Furthermore, as the amount of biological knowledge increases, it becomes more and more difficult to integrate this large body of knowledge in a meaningful manner. In this article, we propose a particular solution to both of these challenges. METHODS: We integrate available biological knowledge by constructing a network of molecular interactions of a specific kind: causal interactions. The resulting causal graph can be queried to suggest molecular hypotheses that explain the variations observed in a high-throughput gene expression experiment. We show that a simple scoring function can discriminate between a large number of competing molecular hypotheses about the upstream cause of the changes observed in a gene expression profile. We then develop an analytical method for computing the statistical significance of each score. This analytical method also helps assess the effects of random or adversarial noise on the predictive power of our model. RESULTS: Our results show that the causal graph we constructed from known biological literature is extremely robust to random noise and to missing or spurious information. We demonstrate the power of our causal reasoning model on two specific examples, one from a cancer dataset and the other from a cardiac hypertrophy experiment. We conclude that causal reasoning models provide a valuable addition to the biologist's toolkit for the interpretation of gene expression data. AVAILABILITY AND IMPLEMENTATION: R source code for the method is available upon request.
MOTIVATION: The interpretation of high-throughput datasets has remained one of the central challenges of computational biology over the past decade. Furthermore, as the amount of biological knowledge increases, it becomes more and more difficult to integrate this large body of knowledge in a meaningful manner. In this article, we propose a particular solution to both of these challenges. METHODS: We integrate available biological knowledge by constructing a network of molecular interactions of a specific kind: causal interactions. The resulting causal graph can be queried to suggest molecular hypotheses that explain the variations observed in a high-throughput gene expression experiment. We show that a simple scoring function can discriminate between a large number of competing molecular hypotheses about the upstream cause of the changes observed in a gene expression profile. We then develop an analytical method for computing the statistical significance of each score. This analytical method also helps assess the effects of random or adversarial noise on the predictive power of our model. RESULTS: Our results show that the causal graph we constructed from known biological literature is extremely robust to random noise and to missing or spurious information. We demonstrate the power of our causal reasoning model on two specific examples, one from a cancer dataset and the other from a cardiac hypertrophy experiment. We conclude that causal reasoning models provide a valuable addition to the biologist's toolkit for the interpretation of gene expression data. AVAILABILITY AND IMPLEMENTATION: R source code for the method is available upon request.
Authors: Kourosh Zarringhalam; Ahmed Enayetallah; Alex Gutteridge; Ben Sidders; Daniel Ziemek Journal: Bioinformatics Date: 2013-09-26 Impact factor: 6.937
Authors: Michael Maes; Gabriel Nowak; Javier R Caso; Juan Carlos Leza; Cai Song; Marta Kubera; Hans Klein; Piotr Galecki; Cristiano Noto; Enrico Glaab; Rudi Balling; Michael Berk Journal: Mol Neurobiol Date: 2015-05-02 Impact factor: 5.590
Authors: Meeghan A O'Connor; Petra Koza-Taylor; Sarah N Campion; Lauren M Aleksunes; Xinsheng Gu; Ahmed E Enayetallah; Michael P Lawton; José E Manautou Journal: Toxicol Appl Pharmacol Date: 2013-10-11 Impact factor: 4.219
Authors: Ben Sidders; Christoph Brockel; Alex Gutteridge; Lee Harland; Peter Gildsig Jansen; Robert McEwen; David Michalovich; Henrik Seidel; Bertram Weiss; Bryn Williams-Jones; Mathew Woodwark Journal: Nat Rev Drug Discov Date: 2014-02 Impact factor: 84.694
Authors: Parul Choudhary; Alex Gutteridge; Emma Impey; R Ian Storer; Robert M Owen; Paul J Whiting; Magda Bictash; Caroline L Benn Journal: Stem Cells Transl Med Date: 2016-04-25 Impact factor: 6.940
Authors: Hong Zhao; Guangxu Jin; Kemi Cui; Ding Ren; Timothy Liu; Peikai Chen; Solomon Wong; Fuhai Li; Yubo Fan; Angel Rodriguez; Jenny Chang; Stephen T C Wong Journal: Cancer Res Date: 2013-10-04 Impact factor: 12.701