| Literature DB >> 20055994 |
Nysia I George1, Joanne R Lupton, Nancy D Turner, Robert S Chapkin, Laurie A Davidson, Naisyin Wang.
Abstract
BACKGROUND: Developing and evaluating new technology that enables researchers to recover gene-expression levels of colonic cells from fecal samples could be key to a non-invasive screening tool for early detection of colon cancer. The current study, to the best of our knowledge, is the first to investigate and report the reproducibility of fecal microarray data. Using the intraclass correlation coefficient (ICC) as a measure of reproducibility and the preliminary analysis of fecal and mucosal data, we assessed the reliability of mixture density estimation and the reproducibility of fecal microarray data. Using Monte Carlo-based methods, we explored whether ICC values should be modeled as a beta-mixture or transformed first and fitted with a normal-mixture. We used outcomes from bootstrapped goodness-of-fit tests to determine which approach is less sensitive toward potential violation of distributional assumptions.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20055994 PMCID: PMC2827371 DOI: 10.1186/1471-2105-11-13
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Histogram of ICC values. The density of the fitted two-component beta-mixture to the (a) fecal data and (b) mucosal data is superimposed.
Figure 2Histogram of PT-ICC values. The density of the fitted two-component normal-mixture to the (a) fecal data and (b) mucosal data is superimposed.
Figure 3Density estimates of the probit transformed ICC values for (a) fecal data and (b) mucosal data. The solid, dashed, and dotted lines correspond to the kernel-based, the normal-mixture based, and the beta-mixture based density estimates.
Summary Statistics of Simulation Scenario #1
| Truth | 0.700 | 0.328 | 0.446 | -1.771 | 3.330 | |
| Mean | 0.725 | 0.302 | 0.440 | -1.951 | 3.321 | |
| Bias | 0.025 | -0.026 | -0.006 | -0.180 | -0.009 | |
| Std Dev | 0.018 | 0.023 | 0.028 | 0.152 | 0.283 | |
| MSE | 0.031 | 0.035 | 0.029 | 0.235 | 0.283 | |
| Truth | 0.800 | -0.033 | 0.391 | -2.090 | 2.722 | |
| Mean | 0.816 | -0.049 | 0.398 | -2.254 | 2.823 | |
| Bias | 0.016 | -0.016 | 0.007 | -0.164 | 0.101 | |
| Std Dev | 0.015 | 0.022 | 0.022 | 0.157 | 0.272 | |
| RMSE | 0.022 | 0.027 | 0.023 | 0.227 | 0.290 | |
Summary statistics of simulation scenario #1. Monte Carlo mean, bias, standard deviation, and square-root MSE (RMSE) of upper mixture proportion μ, upper mixture mean μand variance , and lower mixture mean μand variance from scenario #1.
Summary Statistics of Simulation Scenario #2
| Truth | 0.700 | 0.328 | 0.446 | -1.771 | 3.330 | |
| Mean | 0.453 | 0.282 | 0.521 | -1.995 | 3.409 | |
| Bias | -0.247 | -0.046 | 0.075 | -0.224 | 0.079 | |
| Std Dev | 0.010 | 0.036 | 0.032 | 0.050 | 0.138 | |
| RMSE | 0.247 | 0.059 | 0.082 | 0.229 | 0.159 | |
| Truth | 0.800 | -0.033 | 0.391 | -2.090 | 2.722 | |
| Mean | 0.527 | -0.149 | 0.387 | -1.691 | 2.546 | |
| Bias | -0.273 | -0.116 | -0.004 | 0.399 | -0.176 | |
| Std Dev | 0.011 | 0.031 | 0.023 | 0.049 | 0.111 | |
| RMSE | 0.273 | 0.120 | 0.023 | 0.402 | 0.208 | |
Summary statistics of simulation scenario #2. Monte Carlo mean, bias, standard deviation, and square-root MSE (RMSE) of upper mixture proportion μ, upper mixture mean μand variance , and lower mixture mean μand variance from scenario #2.
for fecal (mucosal) data using 5, 8, and 12 bins
| True | |||
|---|---|---|---|
| 5 | Beta | 0.12 (0.08) | 0.98 (0.01) |
| Normal | 0.13 (0.09) | 0.36 (0.01) | |
| 8 | Beta | 0.00 (0.01) | 1.00 (1.00) |
| Normal | 0.00 (0.01) | 0.04 (0.02) | |
| 12 | Beta | 0.02 (0.01) | 1.00 (1.00) |
| Normal | 0.02 (0.00) | 0.03 (0.01) | |
Figure 4Density estimates of the probit transformed ICC values for the matched subset for (a) fecal data and (b) mucosal data. The solid, dashed, and dotted lines correspond to the kernel-based, the normal-mixture based, and the beta-mixture based density estimates.