| Literature DB >> 30013303 |
Daniele Ramazzotti1, Alex Graudenzi2, Giulio Caravagna3, Marco Antoniotti2.
Abstract
Several diseases related to cell proliferation are characterized by the accumulation of somatic DNA changes, with respect to wild-type conditions. Cancer and HIV are 2 common examples of such diseases, where the mutational load in the cancerous/viral population increases over time. In these cases, selective pressures are often observed along with competition, co-operation, and parasitism among distinct cellular clones. Recently, we presented a mathematical framework to model these phenomena, based on a combination of Bayesian inference and Suppes' theory of probabilistic causation, depicted in graphical structures dubbed Suppes-Bayes Causal Networks (SBCNs). The SBCNs are generative probabilistic graphical models that recapitulate the potential ordering of accumulation of such DNA changes during the progression of the disease. Such models can be inferred from data by exploiting likelihood-based model selection strategies with regularization. In this article, we discuss the theoretical foundations of our approach and we investigate in depth the influence on the model selection task of (1) the poset based on Suppes' theory and (2) different regularization strategies. Furthermore, we provide an example of application of our framework to HIV genetic data highlighting the valuable insights provided by the inferred SBCN.Entities:
Keywords: Bayesian graphical models; cumulative; phenomena; probabilistic causality
Year: 2018 PMID: 30013303 PMCID: PMC6043942 DOI: 10.1177/1176934318785167
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Definitions for CMPN, DMPN, and XMPN.
|
| (11) |
|
| (12) |
|
| (13) |
Figure 1.Performance of the inference on simulated data sets of 100 samples and networks of 15 nodes in terms of accurancy for the 6 considered topological structures. The y-axis refers to the performance, whereas the x-axis represents the different noise levels.
Figure 3.Performance of the inference on simulated data sets of 100 samples and networks of 15 nodes in terms of specificity for the 6 considered topological structures. The y-axis refers to the performance while the x-axis represents the different noise levels.
Figure 4.Mutations detected in the genome for 179 patients with HIV under ritonavir (top) and 1035 under indinavir (bottom). Each black rectangle denotes the presence of a mutation in the gene annotated to the right of the plot; percentages correspond to marginal probabilities.
Figure 5.HIV progression of patients under ritonavir or indinavir (Figure 4) described as a Bayesian Network or as a Suppes-Bayes Causal Network. Edges are annotated with nonparametric bootstrap scores.