| Literature DB >> 25417040 |
Alexei C Ionan, Mei-Yin C Polley, Lisa M McShane, Kevin K Dobbin1.
Abstract
BACKGROUND: The intraclass correlation coefficient (ICC) is widely used in biomedical research to assess the reproducibility of measurements between raters, labs, technicians, or devices. For example, in an inter-rater reliability study, a high ICC value means that noise variability (between-raters and within-raters) is small relative to variability from patient to patient. A confidence interval or Bayesian credible interval for the ICC is a commonly reported summary. Such intervals can be constructed employing either frequentist or Bayesian methodologies.Entities:
Mesh:
Year: 2014 PMID: 25417040 PMCID: PMC4258044 DOI: 10.1186/1471-2288-14-121
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Analysis of variance
| Source | DFa | Sum of squares | MSb | EMSc |
|---|---|---|---|---|
| Patient |
|
|
|
|
| Lab/rater |
|
|
|
|
| Error |
|
|
|
|
aDF is degrees of freedom; bMS is observed mean squares; cEMS is expected means squares.
Notation: y is the average over l and s for fixed b.
Normal simulation table
| b0= 48, l0= 3, r0= 1 | b0= 96, l0= 6, r0= 1 | ||||
|---|---|---|---|---|---|
| ICCw | Method | Coverage | Average width (SEM) | Coverage | Average width (SEM) |
| 0.99 | GPQ | 0.949 | 0.755 (0.0014) | 0.947 | 0.523 (0.0009) |
| 0.99 | MLS | 0.950 | 0.758 (0.0014) | 0.948 | 0.525 (0.0009) |
| 0.99 | Bayes | 0.858 | 0.825 (0.0012) | 0.930 | 0.570 (0.0009) |
| 0.90 | GPQ | 0.943 | 0.685 (0.0014) | 0.948 | 0.448 (0.0010) |
| 0.90 | MLS | 0.946 | 0.690 (0.0014) | 0.949 | 0.450 (0.0010) |
| 0.90 | Bayes | 0.858 | 0.788 (0.0010) | 0.943 | 0.497 (0.0010) |
| 0.80 | GPQ | 0.955 | 0.595 (0.0014) | 0.943 | 0.331 (0.0009) |
| 0.80 | MLS | 0.957 | 0.602 (0.0014) | 0.946 | 0.334 (0.0009) |
| 0.80 | Bayes | 0.848 | 0.749 (0.0008) | 0.956 | 0.378 (0.0010) |
| 0.71 | GPQ | 0.959 | 0.373 (0.0011) | 0.954 | 0.156 (0.0002) |
| 0.71 | MLS | 0.968 | 0.377 (0.0012) | 0.957 | 0.156 (0.0002) |
| 0.71 | Bayes | 0.933 | 0.678 (0.0009) | 0.964 | 0.169 (0.0003) |
ICCb = 0.70 setting. Highlighted are coverages below 90%. The means of the ICCb point estimates when b0 = 48, l0 = 3 were 0.74, 0.72, 0.70 and 0.69, with standard deviations 0.17, 0.14, 0.09 and 0.06 as the values of the ICCw decreased from 0.99 to 0.71. When b0 = 96, l0 = 6 the means of the ICCb estimates were 0.72, 0.71, 0.70 and 0.70 with standard deviations 0.12, 0.10, 0.06, and 0.04 as the ICCw decreased from 0.99 to 0.71.
Simulation study with uniform and gamma models
| Uniform model | Gamma model | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| b0= 48, l0= 3, r0= 1 | b0= 96, l0= 6, r0= 1 | Low skew | High skew | ||||||
|
| Method | Cov. | Wid. | Cov. | Wid. | Cov. | Wid. | Cov. | Wid. |
| 0.99 | GPQ | 0.976 | 0.768 | 0.986 | 0.544 | 0.938 | 0.749 | 0.918 | 0.731 |
| MLS | 0.977 | 0.771 | 0.986 | 0.546 | 0.941 | 0.752 | 0.922 | 0.734 | |
| Bayes | 0.873 | 0.823 | 0.985 | 0.593 | 0.856 | 0.825 | 0.849 | 0.823 | |
| 0.90 | GPQ | 0.977 | 0.705 | 0.985 | 0.464 | 0.935 | 0.684 | 0.919 | 0.670 |
| MLS | 0.979 | 0.710 | 0.986 | 0.467 | 0.937 | 0.689 | 0.926 | 0.675 | |
| Bayes | 0.879 | 0.790 | 0.986 | 0.516 | 0.849 | 0.788 | 0.843 | 0.767 | |
| 0.80 | GPQ | 0.980 | 0.614 | 0.990 | 0.344 | 0.931 | 0.600 | 0.901 | 0.586 |
| MLS | 0.981 | 0.621 | 0.990 | 0.347 | 0.936 | 0.607 | 0.908 | 0.593 | |
| Bayes | 0.883 | 0.753 | 0.992 | 0.394 | 0.932 | 0.752 | 0.805 | 0.745 | |
| 0.71 | GPQ | 0.987 | 0.379 | 0.994 | 0.156 | 0.903 | 0.422 | 0.866 | 0.421 |
| MLS | 0.992 | 0.384 | 0.996 | 0.156 | 0.918 | 0.429 | 0.884 | 0.428 | |
| Bayes | 0.958 | 0.686 | 0.996 | 0.169 | 0.859 | 0.699 | 0.832 | 0.695 | |
Comparison of MLS, GPQ and Bayes method performance on uniform and gamma data. Nominal 95% confidence intervals for ICCb. Coverages and average widths calculated from 10,000 simulations. In each case, ICCb = 0.70. Study designs have 48 biological replicates and 3 labs for a total of 144 observations, and 96 biological replicates and 6 labs for a total of 576 observations. Means and standard deviations of the point estimates of the ICC for each setting are presented in a Additional file 2: Excel file.
Simulation settings
| Model name |
|
|
|
|---|---|---|---|
| Normal | Normal | Normal | Normal |
| Uniform | Uniform | Uniform | Uniform |
| Mixture normal | Mixture normal | Mixture normal | Mixture normal |
| Gamma low-skew | Gamma | Gamma | Gamma |
| Gamma high-skew | Gamma | Gamma | Gamma |
For each row of the table, we examined b 0 ∈ {48, 96}, l 0 ∈ {3, 6}, r 0 = 1 and ICC ∈ {0.70, 0.90}.
ANOVA table from the Barzman et al.[15] study
| Source | DF | SS | MS |
|---|---|---|---|
| Raters | 9 | 39.73 | 4.414 |
| Children | 23 | 2,195.35 | 95.450 |
| Error | 207 | 162.69 | 0.786 |
DC lung study results
| ICCbrange | Frequency of features | Interval method | Mean pseudo-coverage | Mean width | Median width | SD widths |
|---|---|---|---|---|---|---|
| 0.72-1 | 5,571 | GCI | 96.5% | 0.439 | 0.446 | 0.131 |
| NIB | 96.7% | 0.520 | 0.530 | 0.154 | ||
| 0.52- < 0.72 | 5,571 | GCI | 97.2% | 0.594 | 0.607 | 0.061 |
| NIB | 94.8% | 0.664 | 0.675 | 0.078 | ||
| 0.23- < 0.52 | 5,571 | GCI | 98.2% | 0.594 | 0.616 | 0.068 |
| NIB | 96.3% | 0.591 | 0.619 | 0.109 | ||
| 0- < 0.23 | 5,571 | GCI | 99.2% | 0.405 | 0.422 | 0.142 |
| NIB | 85.9% | 0.346 | 0.340 | 0.128 |
Summary of 22,283 confidence intervals, one for each feature, broken down by ICCb quartiles. GCI is generalized confidence interval method, and NIB is Bayesian method with noninformative prior distribution. Pseudo-coverage is the proportion of times the full data ICCb was contained in the interval.
Figure 1Decision tree for confidence intervals for the ICC. *Note that 8 is an estimate based on the simulations of this paper but may not be appropriate in all applications.