| Literature DB >> 30208858 |
Carly A Bobak1, Paul J Barr2, A James O'Malley3,4.
Abstract
BACKGROUND: Intraclass correlation coefficients (ICC) are recommended for the assessment of the reliability of measurement scales. However, the ICC is subject to a variety of statistical assumptions such as normality and stable variance, which are rarely considered in health applications.Entities:
Keywords: Bayesian analysis; Hierarchical regression; ICC; Observer OPTION5; Reliability; Shared decision making; Variance function modelling
Mesh:
Year: 2018 PMID: 30208858 PMCID: PMC6134634 DOI: 10.1186/s12874-018-0550-6
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Encounters from the three randomized studies which compared the impact of PDAs to standard care
| Study | PDA (n) | Usual care (n) | Total |
|---|---|---|---|
| 1 | 101 | 100 | 201 |
| 2 | 37 | 35 | 72 |
| 3 | 13 | 25 | 38 |
| Total | 151 | 160 | 311 |
Fig. 1Comparison of Observer OPTION5 scores between raters. The individual rater score is shown on the y-axis and the mean OPTION5 score is shown on the x-axis
Fig. 2Actual difference of Observer OPTION5 between raters over the mean OPTION5 score. While the average difference is slightly less than 10, this difference varies greatly across the mean score, demonstrating non-constant variance
Fig. 3Empirical variance of scores Compares the mean variance, binomial variance, and the observed variance (using a smoothing spline with 10 degrees of freedom) of Observer OPTION5 score. Highlights the heteroscedasticity of the variance as a function of the mean
Fig. 4A comparison of the posterior distribution of the key parameters underlying the ICC between the within-encounter variance and the between-encounter (but within study) variance across the three studies
Fig. 5Posterior distributions of the ICCs for each study, and the difference in the ICC for each pair of studies
The differences in the ICC between studies in terms of the posterior mean, median, the 2.5 and 97.5 quantiles, and the posterior probability that the difference exceeds 0
| Paired difference | 2.50% | Median | Mean | 97.50% | |
|---|---|---|---|---|---|
| Study 1 - Study 2 | 0.166 | 0.472 | 0.473 | 0.764 | 0.995 |
| Study 1 - Study 3 | -0.155 | 0.170 | 0.171 | 0.508 | 0.835 |
| Study 2 - Study 3 | -0.659 | -0.306 | -0.302 | 0.078 | 0.056 |
Full model results from Bayesian Framework*
| Term | Posterior summary | ||
|---|---|---|---|
| Median | 2.5% | 97.5% | |
| 0.145 | -0.087 | 0.490 | |
| -0.061 | -0.073 | -0.051 | |
| 0.239 | 0.214 | 0.270 | |
| ( | 0.054 | 0.044 | 0.070 |
| ( | 0.117 | 0.084 | 0.168 |
| ( | 0.056 | 0.037 | 0.090 |
| 0.043 | 0.024 | 0.097 | |
| 0.011 | 0.004 | 0.034 | |
| 0.023 | 0.009 | 0.078 | |
|
| 0.029 | 0.003 | 0.717 |
| ICC[Study 1] | 0.821 | 0.655 | 0.985 |
| ICCStudy 2] | 0.295 | 0.119 | 0.628 |
| ICC[Study 3] | 0.644 | 0.359 | 0.919 |
*Here, the ICC for each study refers to the conventional within-study ICC (see Eq. 8) averaged over a population of encounters (Eq. 9)
Results for homogeneous variance using a Bernoulli variance function to capture heteroscedastic variance
| Term | Posterior summary | ||
|---|---|---|---|
| Median | 2.5% | 97.5% | |
| 0.317 | 0.203 | 0.449 | |
| -0.088 | -0.102 | -0.074 | |
| 0.250 | 0.217 | 0.281 | |
| ( | 0.041 | 0.035 | 0.048 |
|
| 0.015 | 0.011 | 0.019 |
|
| 0.004 | 0.001 | 0.092 |
| ICC | 0.609 | 0.520 | 0.745 |
| ICCb ∗ | 0.681 | 0.568 | 0.935 |
*ICCb denotes the ICC for the case when encounters are pooled across studies (see Eq. 10) whereas ICC is the conventional within-study ICC (see 8). In both cases the ICC is averaged over a population of encounters, as in 9
Results for homogeneous variance using a constant variance function
| Term | Posterior summary | ||
|---|---|---|---|
| Median | 2.5% | 97.5% | |
| 0.319 | 0.206 | 0.451 | |
| -0.097 | -0.111 | -0.083 | |
| 0.278 | 0.248 | 0.309 | |
| ( | 0.008 | 0.007 | 0.009 |
|
| 0.014 | 0.011 | 0.017 |
|
| 0.004 | 0.001 | 0.091 |
| ICC | 0.640 | 0.568 | 0.702 |
| ICCb ∗ | 0.706 | 0.614 | 0.930 |
*ICCb denotes the ICC for the case when encounters are pooled across studies (see Eq. 10) whereas ICC is the conventional within-study ICC (see 8). In both cases the ICC is averaged over a population of encounters, as in 9
Fig. 6Direct analysis of ICC as a function of level of agreement Relationship of ICC to the true amount of shared decision making (SDM) in an encounter and heterogeneity of reliability of measurements across studies. The ICC is higher at the ends of the scale than at the center where the variability under the binomial variance function of rater scores on the same encounter is greatest and the difference in the reliability of measurements across the studies is substantial