| Literature DB >> 17125514 |
Francesca Demichelis1, Paolo Magni, Paolo Piergiorgi, Mark A Rubin, Riccardo Bellazzi.
Abstract
BACKGROUND: Uncertainty often affects molecular biology experiments and data for different reasons. Heterogeneity of gene or protein expression within the same tumor tissue is an example of biological uncertainty which should be taken into account when molecular markers are used in decision making. Tissue Microarray (TMA) experiments allow for large scale profiling of tissue biopsies, investigating protein patterns characterizing specific disease states. TMA studies deal with multiple sampling of the same patient, and therefore with multiple measurements of same protein target, to account for possible biological heterogeneity. The aim of this paper is to provide and validate a classification model taking into consideration the uncertainty associated with measuring replicate samples.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17125514 PMCID: PMC1698579 DOI: 10.1186/1471-2105-7-514
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Structure of a hierarchical model. The replicates j of the generic subject i are normally distributed around a mean value μwith a within sample variance σ2, i.e. x~N(μ, σ2). The mean values μare normally distributed around a "population" mean value M with between sample variance τ2, i.e. μ~N(M, τ2).
Results on simulated data for 1 feature with different level of within sample heterogeneity in the different classes.
| HierNB Classifier | StNB Classifier | ||||
| Exp | Acc | Brier | Acc | Brier | |
| 1 | 15 | 0.620 [0.604 0.636] | 0.449 | 0.559 [0.539 0.580] | 0.490 |
| 2 | 30 | 0.762 [0.740 0.790] | 0.307 | 0.560 [0.540 0.580] | 0.490 |
| 3 | 45 | 0.830 [0.813 0.849] | 0.228 | 0.554 [0.537 0.570] | 0.490 |
| 4 | 60 | 0.878 [0.866 0.888] | 0.176 | 0.556 [0.531 0.582] | 0.490 |
| 5 | 75 | 0.899 [0.883 0.914] | 0.147 | 0.560 [0.534 0.586] | 0.490 |
Exp = experiment number, Acc = Accuracy, Brier = Brier Score. In brackets the 95% confidence intervals for the estimate of the accuracy.
Results on simulated data: experiments were done using increasing number of features.
| HierNB Classifier | StNB Classifier | ||||
| Exp | N Feat. | Acc | Brier | Acc | Brier |
| 1 | 1 | 0.925 [0.917, 0.933] | 0.112 | 0.874 [0.864, 0.884] | 0.184 |
| 2 | 2 | 0.966 [0.960, 0.971] | 0.052 | 0.921 [0.912, 0.929] | 0.118 |
| 3 | 3 | 0.987 [0.983, 0.990] | 0.020 | 0.946 [0.938, 0.952] | 0.082 |
| 4 | 10 | 0.998 [0.997, 0.999] | 0.002 | 0.985 [0.981, 0.989] | 0.023 |
Exp = number of experiment, N. Feat = Number of features, Acc = Accuracy, Brier = Brier Score. In brackets the 95% confidence intervals for the estimate of the accuracy.
Figure 2Posterior probabilities of the three feature simulated experiment. Histograms of posterior probabilities of the three feature experiment (Exp.3, Table 2) on a simulated dataset. Panels A and B show the results obtained with the HierNB classifier for class 1 and 2 respectively; panels C and D show results obtained with the StNB classifier. In the upper right corner of each panel the frequency of the bin corresponding to the highest posterior probability range is reported.
TMA data description: model parameters of localized (class 1) and metastatic prostate cancer tumors (class 2) for two proteins.
| M | ||||||
| Class | 1 | 2 | 1 | 2 | 1 | 2 |
| AMACR | 155.2 | 148.8 | 201.2 | 208.8 | 49.1 | 85.7 |
| EZH2Int | 146.2 | 141.6 | 86.7 | 107.9 | 135.7 | 53.9 |
M = class mean; τ2 = class variance; σ2 = averaged within sample variance.
Results on TMA dataset for the two proteins.
| 0.65 [0.62–0.68] | 0.71 [0.66–0.74] | 0.60 [0.56–0.62] | 0.69 [0.689–0.693] | 0.41 [0.39–0.42] | |
| 0.58 [0.54–0.61] | 0.58 [0.54–0.62] | 0.57 [0.53–0.61] | 0.62 [0.617–0.622] | 0.47 [0.46–0.48] |
Acc = Accuracy, Spec = specificity, Sens = Sensitivity, Brier = Brier Score. In brackets the 95% confidence intervals for the estimates, AUC = area under the ROC.
Figure 3Posterior probabilities of prostate cancer cases. Histograms of the posterior probabilities of prostate cancer cases. Panels A and B show the results evaluated with the HierNB classifier for class 1 (localized tumors) and 2 (aggressive tumors) respectively; panels C and D show results evaluated with the StNB classifier.