| Literature DB >> 31299056 |
Tomasz Jetka1, Karol Nienałtowski1, Tomasz Winarski1, Sławomir Błoński1, Michał Komorowski1.
Abstract
Mathematical methods of information theory appear to provide a useful language to describe how stimuli are encoded in activities of signaling effectors. Exploring the information-theoretic perspective, however, remains conceptually, experimentally and computationally challenging. Specifically, existing computational tools enable efficient analysis of relatively simple systems, usually with one input and output only. Moreover, their robust and readily applicable implementations are missing. Here, we propose a novel algorithm, SLEMI-statistical learning based estimation of mutual information, to analyze signaling systems with high-dimensional outputs and a large number of input values. Our approach is efficient in terms of computational time as well as sample size needed for accurate estimation. Analysis of the NF-κB single-cell signaling responses to TNF-α reveals that NF-κB signaling dynamics improves discrimination of high concentrations of TNF-α with a relatively modest impact on discrimination of low concentrations. Provided R-package allows the approach to be used by computational biologists with only elementary knowledge of information theory.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31299056 PMCID: PMC6655862 DOI: 10.1371/journal.pcbi.1007132
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Probabilities of correct discrimination between two inputs, x and x.
The input distribution, P(X) = (1/2, 1/2), visualized in (A), and the conditional output probabilities P(Y|X), presented in (B), can be translated, via the Bayes formula, into conditional input distributions, P(X|Y), visualized in (C). The conditional input distribution, P(X|Y), serves to calculate the probability of correct discrimination of the observation y, as shown in (D). Precisely, for any fixed output, y, vertical line in (B), the conditional input probability, in (C), P(X|Y = y), quantifies how likely it is that y was generated by each of the inputs. The probability of correct discrimination of the observation, y is given as the maximum of P(x|Y = y) and P(x|Y = y). Completely overlapping conditional output probabilities P(Y|X), left column, yield random discrimination as opposed to non-overlapping distributions yielding perfect discrimination, right column. The use of the conditional input distribution, P(X|Y), enables quantification of intermediate scenarios, middle column.
Fig 2Information-theoretic analysis of the NF-κB responses to TNF-α stimulation.
(A) Temporally resolved responses of individual cells to selected concentrations of TNF-α. The panel corresponds to Fig. IV in S1 Text. (B) The information capacity as a function of time for time-point responses. (C) As in (B) but for time-series responses. (D) Probabilities of the correct pairwise discrimination between TNF-α concentrations for time-point responses at 21 minutes. The color filled fraction of the circle marks the probability of correct discrimination. (E) The same as in (D) but for time-series responses. (F) Differences between probabilities in (D) and (E). Modeling details: Uncertainties of estimates (grey ribbons in B and C) were obtained by bootstrapping 80% of data (repeated 100 times). Probabilities in (D) and (E) present mean of 50 bootstrap re-sampling.
Fig 3Test scenario 1.
(A) A violin plot representation of the conditional output distribution Y|x for 11 considered inputs. (B) Information capacity estimates as the function of the sample size N. Blue and red lines correspond to SLEMI and KNN estimates, respectively. The bold black line marks the true value of the capacity. For the KNN estimation, k = 10 was assumed. (C) Information capacity estimates of the KNN method as a function of k compared with the true value (bold black line). The error-bars in B and C show the standard deviation of capacity estimates from 40 repeated samplings. N = 1000 was assumed. (D) Computation time of SLEMI and KNN method as the function of the sample size N. (E) Computation time of SLEMI (blue) and the KNN method as the function of the number of considered input values. Input values were subsequently added starting with x1 and x2, only, and ending up with all 11 considered input values. The times reported in panels (D) and (E) correspond to computations performed by a single core on a workstation with Intel Xeon E5-1650 3.50 GHz processor and 32 GB RAM.