| Literature DB >> 22533627 |
Per Liv1, Svend Erik Mathiassen, Susanne Wulff Svendsen.
Abstract
BACKGROUND: Information on exposure variability, expressed as exposure variance components, is of vital use in occupational epidemiology, including informed risk control and efficient study design. While accurate and precise estimates of the variance components are desirable in such cases, very little research has been devoted to understanding the performance of data sampling strategies designed specifically to determine the size and structure of exposure variability. The aim of this study was to investigate the accuracy and precision of estimators of between-subjects, between-days and within-day variance components obtained by sampling strategies differing with respect to number of subjects, total sampling time per subject, number of days per subject and the size of individual sampling periods.Entities:
Mesh:
Year: 2012 PMID: 22533627 PMCID: PMC3377541 DOI: 10.1186/1471-2288-12-58
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Upper arm postures during an illustrative full shift; minute-by-minute values of average elevation, percentage time above 90°, and percentage time below 15° (top to bottom). The vertical line at minute #337 separates original data (solid curve) from simulated data (dashed curve) added to achieve a full 480 minute shift.
Parameters characterizing the investigated sampling strategies
| Number of subjects, | 10,20 |
| Sampling time per subject, | 60, 120, 240, 480 |
| Number of days, | 2, 4 |
| Block size, | 1, 15, 60, 240 |
| Dispersion of blocks within days | random, fixed interval |
Estimates of mean exposure values, variance components and autocorrelation parameters in the parent data set
| 29.2° | 4.7 | 32.5 | |
| 22.2 (°)2 | 3.0 | 152.8 | |
| 9.2 (°)2 | 4.0 | 65.6 | |
| 234.9 (°)2 | 164.7 | 616.1 | |
| 0.55 | 0.52 | 0.51 | |
| 0.37 | 0.34 | 0.33 | |
| 0.29 | 0.26 | 0.26 | |
| 0.23 | 0.22 | 0.22 | |
| 0.19 | 0.17 | 0.18 | |
| 0.09 | 0.08 | 0.09 |
μ, mean value; σσ, σ variances between subjects, between days and within days; ρ(h), sample autocorrelation at lag h.
Bias [90% prediction intervals] of variance component estimates forfor a subset of the investigated sampling strategies
| 3a. Between-subjects variance, | ||||||||||
| 0.0 | −0.1 | −0.1 | −0.1 | −0.1 | −0.2 | −0.2 | ||||
| [−3.0, 5.3] | [−3.0, 4.1] | [−2.8, 3.0] | [−3.0, 3.6] | [−2.6, 2.7] | [−2.9, 2.7] | [−2.0, 1.9] | ||||
| | 0.0 | −0.1 | 0.0 | −0.3 | −0.1 | −0.2 | 0.0 | −0.3 | ||
| [−3.0, 5.0] | [−3.0, 3.8] | [−3.0, 4.2] | [−2.9, 2.5] | [−3.0, 3.4] | [−2.4, 2.5] | [−2.7, 2.7] | [−2.1, 1.7] | |||
| | 0.8 | 0.4 | −0.1 | −0.2 | 0.2 | 0.0 | −0.2 | −0.2 | ||
| [−3.0, 11.1] | [−3.0, 9.7] | [−3.0, 4.8] | [−3.0, 4.1] | [−3.0, 7.6] | [−3.0, 6.3] | [−3.0, 3.1] | [−2.6, 2.6] | |||
| | 0.5 | 0.3 | −0.5 | −0.6 | 0.1 | 0.0 | −0.6 | −0.7 | ||
| [−3.0, 9.9] | [−3.0, 9.4] | [−3.0, 3.3] | [−3.0, 2.8] | [−3.0, 6.6] | [−3.0, 6.2] | [−3.0, 2.1] | [−2.8, 1.6] | |||
| | 1.6 | | 0.0 | −0.1 | 0.7 | | −0.2 | −0.2 | ||
| [−3.0, 17.3] | | [−3.0, 5.8] | [−3.0, 5.5] | [−3.0, 11.5] | | [−3.0, 3.7] | [−3.0, 3.6] | |||
| | | | −0.3 | 0.1 | | | −0.4 | −0.1 | ||
| [−3.0, 5.0] | [−3.0, 5.7] | | | [−3.0, 3.2] | [−3.0, 3.6] | |||||
| 3b. Between-days variance, | ||||||||||
| −0.2 | −0.1 | −0.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | |||
| [−4.0, 6.1] | [−4.0, 5.3] | [−3.4, 4.8] | [−2.5, 3.2] | [−3.5, 4.6] | [−3.3, 3.8] | [−2.6, 3.3] | [−1.9, 2.3] | |||
| | −0.8 | −1.1 | −0.3 | −0.3 | −0.7 | −1.1 | −0.2 | −0.3 | ||
| [−4.0, 5.3] | [−4.0, 3.7] | [−3.4, 4.7] | [−2.8, 2.9] | [−4.0, 3.6] | [−4.0, 2.4] | [−2.8, 3.4] | [−2.2, 1.9] | |||
| | 8.2 | 20.2 | 0.9 | 3.5 | 8.7 | 20.3 | 1.0 | 3.6 | ||
| [−1.8, 24.5] | [4.0, 44.6] | [−3.0, 6.7] | [−0.7, 9.0] | [0.5, 20.5] | [7.5, 37.5] | [−2.1, 5.1] | [0.5, 7.5] | |||
| | 7.1 | 19.2 | 0.1 | 2.6 | 7.3 | 19.4 | 0.3 | 2.7 | ||
| [−1.9, 20.9] | [3.9, 41.9] | [−3.3, 5.0] | [−1.0, 7.4] | [0.1, 16.9] | [7.4, 35.1] | [−2.6, 4.0] | [−0.1, 6.2] | |||
| | 17.2 | | 2.0 | 7.3 | 17.6 | | 2.1 | 7.4 | ||
| [−0.5, 51.0] | | [−2.7, 9.0] | [0.7, 17.0] | [2.7, 42.0] | | [−1.6, 7.2] | [2.4, 14] | |||
| | | | 1.3 | 7.7 | | | 1.5 | 7.9 | ||
| [−2.8, 7.2] | [1.0, 17.9] | | | [−1.8, 6.0] | [2.6, 14.7] | |||||
| 3c. Within-day variance, | ||||||||||
| 0.1 | −0.1 | −0.9 | 0.2 | 0.5 | 0.2 | −0.3 | −0.1 | |||
| [−50.5, 54.8] | [−46.7, 50.8] | [−42.9, 42.3] | [−37.3, 38.9] | [−35.6, 39.1] | [−33.1, 34.9] | [−29.8, 31.4] | [−27.2, 27.1] | |||
| | 0.5 | −0.2 | 1.1 | 0.5 | 0.6 | 0.4 | 1.4 | 0.3 | ||
| [−48.9, 54.1] | [−44.3, 47.7] | [−42.0, 46.5] | [−35.9, 38.4] | [−34.3, 37.3] | [−32.2, 33.4] | [−29.9, 33.6] | [−26.2, 25.9] | |||
| | −8.8 | −19.0 | 0.7 | −2.1 | −6.9 | −18.7 | 0.6 | −1.9 | ||
| [−72.9, 61.4] | [−74.3, 43.3] | [−43.7, 47.3] | [−44.3, 42.9] | [−53.8, 43.5] | [−60.0, 26.1] | [−31.6, 34.4] | [−31.5, 29.5] | |||
| | −8.5 | −16.7 | −2.4 | −6.2 | −9.2 | −17.6 | −1.5 | −5.9 | ||
| [−70.0, 60.7] | [−73.8, 46.3] | [−44.4, 43.5] | [−44.6, 35.2] | [−55.3, 39.9] | [−59.3, 26.5] | [−31.0, 29.2] | [−33.8, 23.6] | |||
| | −13.0 | | 1.2 | −3.2 | −13.3 | | 1.0 | −2.7 | ||
| [−80.6, 65.7] | | [−45.6, 53.2] | [−47.5, 45.8] | [−61.0, 39.5] | | [−34.0, 36.9] | [−34.9, 32.5] | |||
| −1.1 | 3.1 | | | −0.5 | 4.1 | |||||
| [−47.2, 47.9] | [−42.1, 52.8] | [−32.0, 34.0] | [−29.0, 39.7] | |||||||
a, b, c: between-subjects, between-days, and within-day variance.
n, number of subjects; t, total sampling time per subject (minutes); n, number of days per subject; t, size of sampling blocks (minutes); r, random sampling; f, fixed interval sampling.
Bias and prediction intervals are presented relative to the “true” variance components of the parent data set, i.e. σ = 3.0, σ = 4.0 and σ = 164.7 (cf. Table 2). Thus, the result in tells that for the sampling strategy (n, t, n, t) = (10, 120, 4, 1) with random distribution of blocks, σ was downward biased by 0.2, i.e. its average estimated value was 2.8, and the 90% prediction interval, presented as [−3.0, 4.1], reached from 3.0-3.0 to 3.0 + 4.1, i.e. from 0.0 to 7.1.
Figure 2Estimated variance component mean values (error bars: 90% prediction intervals) resulting from simulations of six different sampling strategies assessing percentage time above 90°. All six strategies used random sampling from n = 10 subjects, approached for n = 2 days each. Green circles, blue squares and red triangles show within-day, between-days and between-subjects variance, respectively. Unfilled and filled symbols show strategies with t = 120 minutes and t = 240 minutes, respectively. Red dashed, blue solid and green dotted lines represent the “true” within-day, between-days and between-subjects variance, respectively, according to the parent data set