| Literature DB >> 17212835 |
Jongrae Kim1, Declan G Bates, Ian Postlethwaite, Pat Heslop-Harrison, Kwang-Hyun Cho.
Abstract
BACKGROUND: We consider the problem of identifying the dynamic interactions in biochemical networks from noisy experimental data. Typically, approaches for solving this problem make use of an estimation algorithm such as the well-known linear Least-Squares (LS) estimation technique. We demonstrate that when time-series measurements are corrupted by white noise and/or drift noise, more accurate and reliable identification of network interactions can be achieved by employing an estimation algorithm known as Constrained Total Least Squares (CTLS). The Total Least Squares (TLS) technique is a generalised least squares method to solve an overdetermined set of equations whose coefficients are noisy. The CTLS is a natural extension of TLS to the case where the noise components of the coefficients are correlated, as is usually the case with time-series measurements of concentrations and expression profiles in gene networks.Entities:
Mesh:
Year: 2007 PMID: 17212835 PMCID: PMC1793997 DOI: 10.1186/1471-2105-8-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The four-gene network interactions. The arrows indicate activating regulatory relationships and the bars indicate inhibiting regulatory relationships. Each messenger RNA is functionally inhibited and/or activated by the expression of other mRNA's. The expression rates are described by the Hill-type equations and given by (1).
Figure 2The four-gene network measurements corrupted by white noise. Measurements of 30 noisy data points for each mRNA concentration are shown. The solid line represents the true simulated value and the crosses denote the measurements corrupted by white noise. The measurements are taken starting at 72.01 h, 0.01 h after the perturbation is applied.
The four-gene network example: white noise
| Samplings per Experiment | Algorithms | ||||||
| Mean | STD | Mean | STD | Mean | STD | ||
| 3 | LS | 94.36 | 36.54 | 0.95 | 0.20 | 368.06 | 123.08 |
| TLS | 94.36 | 36.54 | 0.95 | 0.20 | 368.06 | 123.08 | |
| CTLS | 94.36 | 36.54 | 0.95 | 0.20 | 368.06 | 123.08 | |
| 6 | LS | 16.35 | 5.11 | 0.59 | 0.14 | 71.10 | 17.84 |
| TLS | 196.04 | 2239.78 | 0.74 | 0.20 | 1778.75 | 24252.19 | |
| CTLS | 14.96 | 5.63 | 0.63 | 0.16 | 64.29 | 21.34 | |
| 9 | LS | 7.87 | 2.42 | 0.46 | 0.09 | 35.73 | 9.03 |
| TLS | 11.96 | 9.68 | 0.54 | 0.13 | 67.47 | 118.27 | |
| CTLS | 6.61 | 2.74 | 0.47 | 0.10 | 31.57 | 12.05 | |
| 12 | LS | 5.19 | 1.64 | 0.40 | 0.06 | 24.98 | 6.47 |
| TLS | 6.20 | 2.34 | 0.45 | 0.09 | 32.42 | 15.33 | |
| CTLS | 3.79 | 1.48 | 0.40 | 0.06 | 19.59 | 6.74 | |
| 21 | LS | 3.74 | 1.06 | 0.38 | 0.02 | 18.12 | 4.39 |
| TLS | 3.71 | 1.36 | 0.40 | 0.05 | 20.40 | 8.51 | |
| CTLS | 2.20 | 0.68 | 0.38 | 0.02 | 11.29 | 2.93 | |
| 30 | LS | 3.70 | 0.87 | 0.41 | 0.06 | 17.21 | 3.62 |
| TLS | 3.45 | 1.20 | 0.44 | 0.07 | 18.75 | 7.30 | |
| CTLS | 2.31 | 0.56 | 0.49 | 0.03 | 10.10 | 1.96 | |
| 60 | LS | 3.75 | 0.66 | 0.50 | 0.01 | 17.05 | 2.59 |
| TLS | 3.59 | 1.05 | 0.52 | 0.05 | 16.25 | 4.74 | |
| CTLS | 2.51 | 0.52 | 0.50 | 0.01 | 10.76 | 1.45 | |
The table shows the error comparisons in terms of the mean and the standard deviation (STD) for different numbers of data points for each method based on 1000 Monte-Carlo Simulations. εis the sum of two terms, i.e (l/N1) Σ |α| and (l/N2) Σ |β| where αand βare the relative magnitude errors in the non-zero and zero elements of the true Jacobian, respectively, and N1 and N2 are the number of non-zero and zero elements in the true Jacobian, respectively. εis given by (1/n2) Σ |sign () - sign (f)|, i.e. the average sign differences, where and fare the (i-th row, j-th column) elements of the estimated and the true Jacobian, respectively. εis the Frobenius norm of the difference between the estimated and the true Jacobian, i.e. || - F||.
The four-gene network example: 12 data points, white noise and drift noise
| Strength of drift noise ( | Algorithms | ||||||
| Mean | STD | Mean | STD | Mean | STD | ||
| 2.0 | LS | 9.18 | 3.63 | 0.47 | 0.10 | 41.38 | 14.19 |
| TLS | 29.25 | 178.08 | 0.57 | 0.15 | 237.51 | 2995.95 | |
| CTLS | 8.37 | 3.95 | 0.51 | 0.12 | 40.28 | 20.33 | |
| 1.0 | LS | 6.31 | 2.07 | 0.42 | 0.07 | 29.24 | 8.25 |
| TLS | 8.21 | 5.02 | 0.48 | 0.11 | 41.76 | 28.25 | |
| CTLS | 5.01 | 2.00 | 0.43 | 0.09 | 24.67 | 9.62 | |
| 0.1 | LS | 5.14 | 1.59 | 0.40 | 0.06 | 25.02 | 6.61 |
| TLS | 6.21 | 2.38 | 0.45 | 0.09 | 32.87 | 15.91 | |
| CTLS | 3.79 | 1.40 | 0.40 | 0.05 | 19.71 | 6.89 | |
| 0.05 | LS | 5.18 | 1.66 | 0.40 | 0.06 | 25.16 | 6.57 |
| TLS | 6.20 | 2.39 | 0.45 | 0.09 | 32.29 | 15.30 | |
| CTLS | 3.79 | 1.46 | 0.40 | 0.06 | 19.56 | 6.80 | |
The table shows the error comparisons in terms of the mean and the standard deviation (STD) for different strengths of drift noise for each method based on 1000 Monte-Carlo simulations. The number of measurements per experiment is fixed at 12. All conditions are the same as in Table 1 with only the drift noise being added. εis the sum of two tems, i.e (1/N1) Σ |α| and (1/N2) Σ |β|, where βand βare the relative magnitude errors in the non-zero and zero elements of the true Jacobian, respectively, and N1 and N2 are the number of non-zero and zero elements in the true Jacobian, respectively. εis given by (1/n2) Σ |sign () - sign(f)|, i.e. the average sign differences, where and fare the (i-th row, j-th column) elements of the estimated and the true Jacobian, respectively. εis the Frobenius norm of the difference between the estimated and the true Jacobian, i.e. || - F||.
The four-gene network example: 21 data points, white noise and drift noise
| Strength of drift noise ( | Algorithms | ||||||
| Mean | STD | Mean | STD | Mean | STD | ||
| 2.0 | LS | 6.32 | 2.59 | 0.43 | 0.08 | 28.32 | 10.61 |
| TLS | 19.57 | 179.54 | 0.50 | 0.12 | 111.66 | 949.78 | |
| CTLS | 5.81 | 3.10 | 0.47 | 0.10 | 28.62 | 17.56 | |
| 1.0 | LS | 4.46 | 1.51 | 0.39 | 0.05 | 21.21 | 6.61 |
| TLS | 4.83 | 2.15 | 0.43 | 0.08 | 26.42 | 14.98 | |
| CTLS | 3.17 | 1.24 | 0.41 | 0.06 | 16.08 | 7.20 | |
| 0.1 | LS | 3.68 | 1.05 | 0.38 | 0.02 | 17.93 | 4.34 |
| TLS | 3.58 | 1.27 | 0.40 | 0.05 | 19.90 | 8.33 | |
| CTLS | 2.18 | 0.68 | 0.38 | 0.02 | 11.16 | 2.91 | |
| 0.05 | LS | 3.68 | 1.04 | 0.38 | 0.02 | 17.92 | 4.10 |
| TLS | 3.63 | 1.35 | 0.40 | 0.06 | 19.88 | 8.16 | |
| CTLS | 2.18 | 0.69 | 0.38 | 0.02 | 11.14 | 2.93 | |
The table shows the error comparisons in terms of the mean and the standard deviation (STD) for different strengths of drift noise for each method based on 1000 Monte-Carlo simulations. The number of measurements per experiment is fixed at 21. All conditions are the same as the ones for Table 1 with only the drift noise being added. εis the sum of two tems, i.e (1/N1) Σ |α| and (l/N2) Σ |β|, where βand βare the relative magnitude errors in the non-zero and zero elements of the true Jacobian, respectively, and N1 and N2 are the number of non-zero and zero elements in the true Jacobian, respectively. εis given by (1/n2) Σ |sign() - sign(f)|, i.e. the average sign differences, where and fare the (i-th row, j-th column) element of the estimated and the true Jacobian, respectively. εis the Frobenius norm of the difference between the estimated and the true Jacobian, i.e. || - F||.
Figure 3The measurements of the perturbed p53 and . An example of the measurements for the perturbed p53 and mdm2 gene expression levels are shown. The data are generated from the model suggested in [14]. The true perturbed gene expression levels are shown in lines, and the corresponding measurements are the cross for p53 and the circle for mdm2, respectively.
p53 and mdm2 mRNA expression model: white noise with neglected kinetics
| Samplings per Experiment | Algorithms | ||||||
| Mean | STD | Mean | STD | Mean | STD | ||
| 4 | LS | 1.34 | 0.34 | 1.01 | 0.14 | 0.04 | 0.01 |
| TLS | 1.34 | 0.34 | 1.01 | 0.14 | 0.04 | 0.01 | |
| CTLS | 1.34 | 0.34 | 1.01 | 0.14 | 0.04 | 0.01 | |
| 8 | LS | 0.95 | 0.27 | 0.50 | 0.02 | 0.03 | 0.01 |
| TLS | 25.03 | 196.73 | 1.01 | 0.15 | 1.06 | 8.42 | |
| CTLS | 0.44 | 0.12 | 0.50 | 0.00 | 0.02 | 0.00 | |
| 12 | LS | 0.61 | 0.08 | 0.50 | 0.00 | 0.02 | 0.00 |
| TLS | 47.53 | 241.86 | 0.92 | 0.31 | 2.49 | 13.07 | |
| CTLS | 0.49 | 0.06 | 0.50 | 0.02 | 0.02 | 0.00 | |
| 16 | LS | 0.44 | 0.06 | 0.50 | 0.00 | 0.02 | 0.00 |
| TLS | 50.96 | 833.28 | 1.02 | 0.20 | 3.11 | 50.06 | |
| CTLS | 0.49 | 0.05 | 0.50 | 0.02 | 0.02 | 0.00 | |
The table shows the error comparisons in terms of the mean and the standard deviation (STD) for different number of data for each method based on 1000 Monte-Carlo Simulations. The measurements are taken every 2 hours and the states converge to steady states around the 16-th sample. εis the sum of two tems, i.e (1/N1) Σ |α| and (1/N2) Σ |β|, where αand βare the relative magnitude errors in the non-zero and zero elements of the true Jacobian, respectively, and N1 and N2 are the number of non-zero and zero elements in the true Jacobian, respectively. εis given by (1/n2) Σ |sign () - sign (f)|, i.e. the average sign differences, where and fare the (i-th row, j-th column) elements of the estimated and the true Jacobian, respectively. εis the Frobenius norm of the difference between the estimated and the true Jacobian, i.e. || - F||.
Figure 4The measurements of the prominent network in the innate immune system. The measurements of Rel, ATF3, I16, and I112 are taken from [15]. The actual data in [15] are measured at 0, 60, 120, 240, 360 minutes. To make the measurements equally spaced in time, the data at 180 and 300 minutes are interpolated.