| Literature DB >> 21743835 |
Karl G Kugler1, Werner O Hackl, Laurin Aj Mueller, Heidi Fiegl, Armin Graber, Ruth M Pfeiffer.
Abstract
BACKGROUND: Using serum, plasma or tumor tissue specimens from biobanks for biomarker discovery studies is attractive as samples are often readily available. However, storage over longer periods of time can alter concentrations of proteins in those specimens. We therefore assessed the bias in estimates of association from case-control studies conducted using banked specimens when maker levels changed over time for single markers and also for multiple correlated markers in simulations. Data from a small laboratory experiment using serum samples guided the choices of simulation parameters for various functions of changes of biomarkers over time.Entities:
Year: 2011 PMID: 21743835 PMCID: PMC3131186 DOI: 10.1186/2043-9113-1-9
Source DB: PubMed Journal: J Clin Bioinforma ISSN: 2043-9113
Marker Concentration Changes
| Date of sample collection | Concentration measured | % change | |
|---|---|---|---|
| at sample collection | Sept 2009 | ||
| CA 15-3 | |||
| Nov 1997 | 166 | 187 | 12.65 |
| Oct 1998 | 29 | 33 | 13.79 |
| Apr 1995 | 10 | 12 | 20.00 |
| Feb 2001 | 21 | 19 | -9.52 |
| Apr 2001 | 23 | 24 | 4.35 |
| Feb 1999 | 33 | 34 | 3.03 |
| Sep 2000 | 26 | 33 | 26.92 |
| Sep 2000 | 24 | 33 | 37.50 |
| Sep 2000 | 15 | 17 | 13.33 |
| Sep 2000 | 12 | 16 | 33.33 |
| Nov 1999 | 884 | 986 | 11.54 |
| CA125 | |||
| Feb 1999 | 83 | 96 | 15.66 |
| Feb 1999 | < LOD† | < LOD | |
| Feb 1999 | < LOD | < LOD | |
| Feb 1999 | 51 | 69 | 35.29 |
| Feb 1999 | < LOD | < LOD | |
| Sep 2000 | 77 | 73 | -5.19 |
| Sep 2000 | 33 | 32 | -3.03 |
| Sep 1998 | 106 | 105 | -0.94 |
| Oct 1998 | 1273 | 2026 | 59.15 |
† LOD = limit of detection
Concentrations of two markers, CA 15-3 and CA125, measured at the time of freezing and then again after a long term storage. Measurements with concentrations below the limit of detection were excluded from further analysis.
Figure 1Choices of . Three functions model an increase in marker levels of 15% at t = 10, and three function model a decrease of 15% at t = 10.
Univariate Marker Results
| t = 0 | ||||||||||||
| 0.309 | 0.309 | 0.309 | 0.309 | 0.308 | 0.308 | 0.308 | 0.308 | 0.307 | 0.307 | 0.308 | 0.308 | |
| se.emp | 0.005 | 0.005 | 0.005 | 0.005 | 0.005 | 0.005 | 0.003 | 0.003 | 0.003 | 0.003 | 0.003 | 0.003 |
| rel.bias | 0.029 | 0.029 | 0.029 | 0.03 | 0.028 | 0.028 | 0.026 | 0.026 | 0.024 | 0.024 | 0.026 | 0.026 |
| rel.bias.sd | 0.566 | 0.566 | 0.568 | 0.571 | 0.568 | 0.563 | 0.343 | 0.342 | 0.343 | 0.341 | 0.342 | 0.34 |
| t = 5 | ||||||||||||
| 0.288 | 0.305 | 0.272 | 0.334 | 0.312 | 0.356 | 0.287 | 0.304 | 0.271 | 0.331 | 0.312 | 0.355 | |
| se.emp | 0.005 | 0.005 | 0.005 | 0.006 | 0.005 | 0.006 | 0.003 | 0.003 | 0.003 | 0.003 | 0.003 | 0.004 |
| rel.bias | -0.041 | 0.015 | -0.092 | 0.112 | 0.042 | 0.186 | -0.044 | 0.013 | -0.096 | 0.105 | 0.039 | 0.184 |
| rel.bias.sd | 0.527 | 0.559 | 0.5 | 0.617 | 0.576 | 0.65 | 0.319 | 0.337 | 0.302 | 0.368 | 0.346 | 0.393 |
| t = 10 | ||||||||||||
| 0.269 | 0.269 | 0.269 | 0.362 | 0.361 | 0.361 | 0.268 | 0.268 | 0.268 | 0.36 | 0.361 | 0.361 | |
| se.emp | 0.005 | 0.005 | 0.005 | 0.006 | 0.006 | 0.006 | 0.003 | 0.003 | 0.003 | 0.004 | 0.004 | 0.004 |
| rel.bias | -0.103 | -0.103 | -0.103 | 0.208 | 0.204 | 0.204 | -0.106 | -0.106 | -0.107 | 0.199 | 0.202 | 0.202 |
| rel.bias.sd | 0.493 | 0.493 | 0.495 | 0.671 | 0.667 | 0.66 | 0.298 | 0.297 | 0.298 | 0.4 | 0.401 | 0.399 |
Mean values of the maximum likelihood estimates of β = 0.3 after t = 0, 5, and 10 years for the various degradation functions, with empirical (se.emp) standard error and the relative bias . Simulations were performed with μ = -3, and sample sizes n = 75 and n = 200. Function b1 corresponds to a linear change, b2 exponential change and b3 logarithmic change in marker levels over time.
Multivariate Marker Results: A Single Marker is associated with Outcome
| t = 0 | ||||
| 0.305 | 0.302 | 0.303 | 0.298 | |
| se.emp | 0.091 | 0.064 | 0.128 | 0.093 |
| rel.bias | 0.018 | 0.005 | 0.009 | -0.005 |
| rel.bias.sd | 0.304 | 0.213 | 0.426 | 0.309 |
| power† | 0.522 | 0.92 | 0.541 | 0.908 |
| t = 5 | ||||
| 0.285 | 0.281 | 0.282 | 0.278 | |
| se.emp | 0.085 | 0.059 | 0.119 | 0.086 |
| rel.bias | -0.052 | -0.064 | -0.058 | -0.072 |
| rel.bias.sd | 0.282 | 0.198 | 0.398 | 0.287 |
| power | 0.527 | 0.926 | 0.546 | 0.908 |
| t = 10 | ||||
| 0.266 | 0.263 | 0.264 | 0.261 | |
| se.emp | 0.08 | 0.055 | 0.112 | 0.08 |
| rel.bias | -0.114 | -0.124 | -0.121 | -0.13 |
| rel.bias.sd | 0.266 | 0.185 | 0.372 | 0.268 |
| power | 0.532 | 0.929 | 0.55 | 0.91 |
Results for simulations based on a multivariate setting with 10 markers, where only X1 is associated with disease outcome with true β = 0.3, and μ = -3. Levels of X1 increases 1.5% per year. Simulations were performed with sample sizes n = 250 and n = 500. † The power is calculated as the number of rejected null hypotheses over all simulations.
Multivariate Marker Results: Three Markers are associated with Outcome
| true | 0.3 | 0.2 | 0.2 |
| perc.change | 0.150 | 0.20 | 0.10 |
| 1 | 2 | 3 | |
| t = 0 | |||
| 0.3 | 0.202 | 0.2 | |
| se.emp | 0.131 | 0.13 | 0.13 |
| rel.bias | -0.001 | 0.012 | 0.002 |
| rel.bias.sd | 0.435 | 0.652 | 0.648 |
| power† | 0.996 | ||
| t = 5 | |||
| 0.279 | 0.199 | 0.184 | |
| se.emp | 0.122 | 0.126 | 0.118 |
| rel.bias | -0.068 | -0.003 | -0.078 |
| rel.bias.sd | 0.405 | 0.630 | 0.591 |
| power | 0.995 | ||
| t = 10 | |||
| 0.261 | 0.169 | 0.182 | |
| se.emp | 0.113 | 0.108 | 0.117 |
| rel.bias | -0.131 | -0.155 | -0.090 |
| rel.bias.sd | 0.376 | 0.538 | 0.584 |
| power | 0.995 | ||
Results for simulations based on a multivariate setting 10 with correlated markers, with 250 cases and 250 controls, μ = -3, and ρ = 0.5. The first three markers X1, X2, and X3 are associated with outcome. † The power is calculated as the number of rejected null hypotheses over all simulations. Function b1 corresponds to a linear change, b2 exponential change and b3 logarithmic change in marker levels over time.