| Literature DB >> 32020851 |
Chikara Honda1,2, Tetsuji Ohyama3.
Abstract
BACKGROUND: Cohen's κ coefficient is often used as an index to measure the agreement of inter-rater determinations. However, κ varies greatly depending on the marginal distribution of the target population and overestimates the probability of agreement occurring by chance. To overcome these limitations, an alternative and more stable agreement coefficient was proposed, referred to as Gwet's AC1. When it is desired to combine results from multiple agreement studies, such as in a meta-analysis, or to perform stratified analysis with subject covariates that affect agreement, it is of interest to compare several agreement coefficients and present a common agreement index. A homogeneity test of κ was developed; however, there are no reports on homogeneity tests for AC1 or on an estimator of common AC1. In this article, a homogeneity score test for AC1 is therefore derived, in the case of two raters with binary outcomes from K independent strata and its performance is investigated. An estimation of the common AC1 between strata and its confidence intervals is also discussed.Entities:
Keywords: Common AC1; Consistency evaluation; Gwet’s AC1; Homogeneity test; Inter-rater agreement; Stratified study
Year: 2020 PMID: 32020851 PMCID: PMC7001312 DOI: 10.1186/s12874-019-0887-5
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Data layout
| Category | Ratings | Frequency | Probability |
|---|---|---|---|
| 1 | (+, +) | ||
| 2 | (+, −) or (−, +) | ||
| 3 | (−, −) | ||
| Total | 1 |
Empirical type I error rates of homogeneity tests for γ1 = γ2 = γ0 based on 10,000 simulations (K = 2 balanced sample size)
| Balanced | Unbalanced | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SCORE | GOF | π1 | π2 | SCORE | GOF | |||||
| 20 | 0.1 | 0.5 | 0.045 | 0.067 | 20 | 0.1 | 0.5 | 0.35 | 0.049 | 0.097 |
| 0.3 | 0.046 | 0.067 | 0.3 | 0.049 | 0.096 | |||||
| 0.5 | 0.048 | 0.062 | 0.5 | 0.050 | 0.083 | |||||
| 0.7 | 0.033 | 0.041 | 0.7 | 0.037 | 0.049 | |||||
| 0.9 | 0.002 | 0.003 | 0.9 | 0.003 | 0.005 | |||||
| 0.1 | 0.35 | 0.052 | 0.121 | 0.1 | 0.65 | 0.35 | 0.050 | 0.120 | ||
| 0.3 | 0.054 | 0.126 | 0.3 | 0.051 | 0.120 | |||||
| 0.5 | 0.052 | 0.103 | 0.5 | 0.051 | 0.101 | |||||
| 0.7 | 0.039 | 0.064 | 0.7 | 0.039 | 0.065 | |||||
| 0.9 | 0.004 | 0.006 | 0.9 | 0.004 | 0.007 | |||||
| 0.7 | 0.2 | 0.047 | 0.132 | 0.7 | 0.5 | 0.2 | 0.038 | 0.090 | ||
| 0.9 | 0.008 | 0.029 | 0.9 | 0.005 | 0.013 | |||||
| 50 | 0.1 | 0.5 | 0.050 | 0.058 | 50 | 0.1 | 0.5 | 0.35 | 0.048 | 0.117 |
| 0.3 | 0.047 | 0.054 | 0.3 | 0.051 | 0.087 | |||||
| 0.5 | 0.050 | 0.054 | 0.5 | 0.049 | 0.072 | |||||
| 0.7 | 0.051 | 0.053 | 0.7 | 0.050 | 0.060 | |||||
| 0.9 | 0.026 | 0.027 | 0.9 | 0.024 | 0.027 | |||||
| 0.1 | 0.35 | 0.051 | 0.172 | 0.1 | 0.65 | 0.35 | 0.051 | 0.168 | ||
| 0.3 | 0.051 | 0.126 | 0.3 | 0.049 | 0.117 | |||||
| 0.5 | 0.052 | 0.092 | 0.5 | 0.051 | 0.092 | |||||
| 0.7 | 0.052 | 0.072 | 0.7 | 0.052 | 0.071 | |||||
| 0.9 | 0.028 | 0.033 | 0.9 | 0.028 | 0.033 | |||||
| 0.7 | 0.2 | 0.053 | 0.162 | 0.7 | 0.5 | 0.2 | 0.051 | 0.104 | ||
| 0.9 | 0.037 | 0.061 | 0.9 | 0.032 | 0.042 | |||||
| 80 | 0.1 | 0.5 | 0.047 | 0.052 | 80 | 0.1 | 0.5 | 0.35 | 0.051 | 0.120 |
| 0.3 | 0.047 | 0.051 | 0.3 | 0.053 | 0.094 | |||||
| 0.5 | 0.054 | 0.057 | 0.5 | 0.051 | 0.072 | |||||
| 0.7 | 0.050 | 0.052 | 0.7 | 0.051 | 0.061 | |||||
| 0.9 | 0.037 | 0.039 | 0.9 | 0.047 | 0.050 | |||||
| 0.1 | 0.35 | 0.052 | 0.173 | 0.1 | 0.65 | 0.35 | 0.052 | 0.172 | ||
| 0.3 | 0.054 | 0.123 | 0.3 | 0.054 | 0.124 | |||||
| 0.5 | 0.053 | 0.089 | 0.5 | 0.055 | 0.090 | |||||
| 0.7 | 0.051 | 0.069 | 0.7 | 0.051 | 0.069 | |||||
| 0.9 | 0.044 | 0.051 | 0.9 | 0.045 | 0.052 | |||||
| 0.7 | 0.2 | 0.052 | 0.152 | 0.7 | 0.5 | 0.2 | 0.050 | 0.103 | ||
| 0.9 | 0.051 | 0.073 | 0.9 | 0.048 | 0.059 | |||||
Empirical power of homogeneity tests based on 10,000 simulations (K = 2 balanced sample size)
| Balanced | Unbalanced | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SCORE | GOF | SCORE | GOF | |||||||||
| 20 | 0.1 | 0.5 | 0.5 | 0.243 | 0.290 | 20 | 0.1 | 0.5 | 0.5 | 0.35 | 0.234 | 0.304 |
| 0.3 | 0.6 | 0.173 | 0.202 | 0.3 | 0.6 | 0.168 | 0.224 | |||||
| 0.3 | 0.7 | 0.294 | 0.323 | 0.3 | 0.7 | 0.293 | 0.351 | |||||
| 0.5 | 0.8 | 0.212 | 0.232 | 0.5 | 0.8 | 0.216 | 0.258 | |||||
| 0.5 | 0.9 | 0.372 | 0.396 | 0.5 | 0.9 | 0.389 | 0.430 | |||||
| 0.1 | 0.5 | 0.35 | 0.245 | 0.357 | 0.1 | 0.5 | 0.65 | 0.35 | 0.243 | 0.355 | ||
| 0.3 | 0.6 | 0.185 | 0.279 | 0.3 | 0.6 | 0.171 | 0.266 | |||||
| 0.3 | 0.7 | 0.313 | 0.408 | 0.3 | 0.7 | 0.296 | 0.398 | |||||
| 0.5 | 0.8 | 0.227 | 0.295 | 0.5 | 0.8 | 0.221 | 0.291 | |||||
| 0.5 | 0.9 | 0.396 | 0.483 | 0.5 | 0.9 | 0.394 | 0.475 | |||||
| 0.5 | 0.8 | 0.2 | 0.270 | 0.411 | 0.5 | 0.8 | 0.5 | 0.2 | 0.230 | 0.278 | ||
| 0.5 | 0.9 | 0.482 | 0.616 | 0.5 | 0.9 | 0.435 | 0.473 | |||||
| 50 | 0.1 | 0.5 | 0.5 | 0.538 | 0.562 | 50 | 0.1 | 0.5 | 0.5 | 0.35 | 0.525 | 0.595 |
| 0.3 | 0.6 | 0.377 | 0.396 | 0.3 | 0.6 | 0.369 | 0.428 | |||||
| 0.3 | 0.7 | 0.635 | 0.651 | 0.3 | 0.7 | 0.630 | 0.673 | |||||
| 0.5 | 0.8 | 0.512 | 0.525 | 0.5 | 0.8 | 0.517 | 0.548 | |||||
| 0.5 | 0.9 | 0.835 | 0.841 | 0.5 | 0.9 | 0.843 | 0.855 | |||||
| 0.1 | 0.5 | 0.35 | 0.509 | 0.652 | 0.1 | 0.5 | 0.65 | 0.35 | 0.503 | 0.648 | ||
| 0.3 | 0.6 | 0.379 | 0.485 | 0.3 | 0.6 | 0.364 | 0.470 | |||||
| 0.3 | 0.7 | 0.633 | 0.718 | 0.3 | 0.7 | 0.618 | 0.711 | |||||
| 0.5 | 0.8 | 0.518 | 0.585 | 0.5 | 0.8 | 0.516 | 0.581 | |||||
| 0.5 | 0.9 | 0.844 | 0.877 | 0.5 | 0.9 | 0.838 | 0.874 | |||||
| 0.5 | 0.8 | 0.2 | 0.552 | 0.711 | 0.5 | 0.8 | 0.5 | 0.2 | 0.531 | 0.598 | ||
| 0.5 | 0.9 | 0.878 | 0.915 | 0.5 | 0.9 | 0.861 | 0.863 | |||||
| 80 | 0.1 | 0.5 | 0.5 | 0.757 | 0.768 | 80 | 0.1 | 0.5 | 0.5 | 0.35 | 0.732 | 0.786 |
| 0.3 | 0.6 | 0.568 | 0.578 | 0.3 | 0.6 | 0.545 | 0.596 | |||||
| 0.3 | 0.7 | 0.841 | 0.847 | 0.3 | 0.7 | 0.833 | 0.858 | |||||
| 0.5 | 0.8 | 0.716 | 0.722 | 0.5 | 0.8 | 0.717 | 0.741 | |||||
| 0.5 | 0.9 | 0.967 | 0.968 | 0.5 | 0.9 | 0.966 | 0.970 | |||||
| 0.1 | 0.5 | 0.35 | 0.707 | 0.819 | 0.1 | 0.5 | 0.65 | 0.35 | 0.707 | 0.816 | ||
| 0.3 | 0.6 | 0.538 | 0.645 | 0.3 | 0.6 | 0.541 | 0.644 | |||||
| 0.3 | 0.7 | 0.826 | 0.879 | 0.3 | 0.7 | 0.822 | 0.884 | |||||
| 0.5 | 0.8 | 0.717 | 0.767 | 0.5 | 0.8 | 0.715 | 0.764 | |||||
| 0.5 | 0.9 | 0.963 | 0.973 | 0.5 | 0.9 | 0.965 | 0.974 | |||||
| 0.5 | 0.8 | 0.2 | 0.746 | 0.872 | 0.5 | 0.8 | 0.5 | 0.2 | 0.734 | 0.787 | ||
| 0.5 | 0.9 | 0.976 | 0.982 | 0.5 | 0.9 | 0.974 | 0.970 | |||||
Bias and mean square error of the maximum likelihood estimator for the common AC1 based on 10,000 simulations (K = 2 balanced sample size)
| Balanced | Unbalanced | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Bias | MSE | Bias | MSE | |||||||
| 20 | 0.1 | 0.5 | 0.026 | 0.025 | 20 | 0.1 | 0.5 | 0.35 | 0.019 | 0.027 |
| 0.3 | 0.023 | 0.023 | 0.3 | 0.018 | 0.024 | |||||
| 0.5 | 0.017 | 0.018 | 0.5 | 0.013 | 0.019 | |||||
| 0.7 | 0.009 | 0.012 | 0.7 | 0.007 | 0.012 | |||||
| 0.9 | −0.011 | 0.003 | 0.9 | −0.011 | 0.003 | |||||
| 0.1 | 0.35 | 0.009 | 0.029 | 0.1 | 0.65 | 0.35 | 0.010 | 0.028 | ||
| 0.3 | 0.007 | 0.025 | 0.3 | 0.011 | 0.025 | |||||
| 0.5 | 0.007 | 0.019 | 0.5 | 0.008 | 0.019 | |||||
| 0.7 | 0.004 | 0.012 | 0.7 | 0.004 | 0.012 | |||||
| 0.9 | −0.011 | 0.003 | 0.9 | −0.011 | 0.003 | |||||
| 0.7 | 0.2 | −0.007 | 0.012 | 0.7 | 0.5 | 0.2 | 0.001 | 0.012 | ||
| 0.9 | −0.010 | 0.003 | 0.9 | −0.010 | 0.003 | |||||
| 80 | 0.1 | 0.5 | 0.007 | 0.006 | 80 | 0.1 | 0.5 | 0.35 | 0.006 | 0.007 |
| 0.3 | 0.006 | 0.006 | 0.3 | 0.005 | 0.006 | |||||
| 0.5 | 0.005 | 0.005 | 0.5 | 0.004 | 0.005 | |||||
| 0.7 | 0.003 | 0.003 | 0.7 | 0.003 | 0.003 | |||||
| 0.9 | 0.002 | 0.001 | 0.9 | 0.001 | 0.001 | |||||
| 0.1 | 0.35 | 0.004 | 0.007 | 0.1 | 0.65 | 0.35 | 0.003 | 0.008 | ||
| 0.3 | 0.002 | 0.006 | 0.3 | 0.001 | 0.006 | |||||
| 0.5 | 0.001 | 0.005 | 0.5 | 0.002 | 0.005 | |||||
| 0.7 | 0.001 | 0.003 | 0.7 | 0.001 | 0.003 | |||||
| 0.9 | 0.000 | 0.001 | 0.9 | 0.000 | 0.001 | |||||
| 0.7 | 0.2 | −0.001 | 0.003 | 0.7 | 0.5 | 0.2 | 0.001 | 0.003 | ||
| 0.9 | −0.001 | 0.001 | 0.9 | 0.001 | 0.001 | |||||
Coverage rates of common γ 95% confidence intervals of the three proposed methods based on 10,000 simulations
| Balanced | Unbalanced | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| π1 = π2 | SA | FZ | PV | π1 | π2 | SA | FZ | PV | ||||
| 20 | 0.1 | 0.5 | 0.939 | 0.959 | 0.958 | 20 | 0.1 | 0.5 | 0.35 | 0.939 | 0.958 | 0.958 |
| 0.3 | 0.936 | 0.958 | 0.950 | 0.3 | 0.933 | 0.958 | 0.955 | |||||
| 0.5 | 0.931 | 0.962 | 0.962 | 0.5 | 0.927 | 0.959 | 0.960 | |||||
| 0.7 | 0.924 | 0.976 | 0.963 | 0.7 | 0.917 | 0.971 | 0.961 | |||||
| 0.9 | 0.998 | 0.963 | 0.955 | 0.9 | 0.997 | 0.966 | 0.955 | |||||
| 0.1 | 0.35 | 0.935 | 0.953 | 0.956 | 0.1 | 0.65 | 0.35 | 0.938 | 0.955 | 0.957 | ||
| 0.3 | 0.934 | 0.955 | 0.953 | 0.3 | 0.934 | 0.955 | 0.956 | |||||
| 0.5 | 0.926 | 0.956 | 0.955 | 0.5 | 0.927 | 0.959 | 0.955 | |||||
| 0.7 | 0.918 | 0.965 | 0.958 | 0.7 | 0.917 | 0.967 | 0.960 | |||||
| 0.9 | 0.996 | 0.967 | 0.947 | 0.9 | 0.996 | 0.969 | 0.950 | |||||
| 0.7 | 0.2 | 0.929 | 0.959 | 0.950 | 0.7 | 0.5 | 0.2 | 0.931 | 0.965 | 0.956 | ||
| 0.9 | 0.970 | 0.966 | 0.911 | 0.9 | 0.993 | 0.970 | 0.943 | |||||
| 50 | 0.1 | 0.5 | 0.949 | 0.953 | 0.952 | 50 | 0.1 | 0.5 | 0.35 | 0.946 | 0.952 | 0.953 |
| 0.3 | 0.945 | 0.955 | 0.954 | 0.3 | 0.943 | 0.952 | 0.951 | |||||
| 0.5 | 0.945 | 0.954 | 0.953 | 0.5 | 0.941 | 0.952 | 0.952 | |||||
| 0.7 | 0.936 | 0.961 | 0.953 | 0.7 | 0.936 | 0.956 | 0.953 | |||||
| 0.9 | 0.920 | 0.971 | 0.971 | 0.9 | 0.923 | 0.968 | 0.965 | |||||
| 0.1 | 0.35 | 0.942 | 0.948 | 0.952 | 0.1 | 0.65 | 0.35 | 0.946 | 0.954 | 0.954 | ||
| 0.3 | 0.942 | 0.950 | 0.950 | 0.3 | 0.943 | 0.953 | 0.952 | |||||
| 0.5 | 0.940 | 0.951 | 0.951 | 0.5 | 0.941 | 0.954 | 0.953 | |||||
| 0.7 | 0.937 | 0.954 | 0.952 | 0.7 | 0.936 | 0.956 | 0.953 | |||||
| 0.9 | 0.926 | 0.966 | 0.960 | 0.9 | 0.925 | 0.968 | 0.960 | |||||
| 0.7 | 0.2 | 0.938 | 0.949 | 0.948 | 0.7 | 0.5 | 0.2 | 0.937 | 0.954 | 0.952 | ||
| 0.9 | 0.927 | 0.965 | 0.954 | 0.9 | 0.928 | 0.969 | 0.960 | |||||
| 80 | 0.1 | 0.5 | 0.945 | 0.952 | 0.952 | 80 | 0.1 | 0.5 | 0.35 | 0.946 | 0.951 | 0.951 |
| 0.3 | 0.947 | 0.952 | 0.952 | 0.3 | 0.946 | 0.952 | 0.952 | |||||
| 0.5 | 0.943 | 0.953 | 0.952 | 0.5 | 0.942 | 0.950 | 0.950 | |||||
| 0.7 | 0.944 | 0.950 | 0.949 | 0.7 | 0.943 | 0.955 | 0.953 | |||||
| 0.9 | 0.901 | 0.962 | 0.956 | 0.9 | 0.922 | 0.963 | 0.957 | |||||
| 0.1 | 0.35 | 0.946 | 0.951 | 0.951 | 0.1 | 0.65 | 0.35 | 0.943 | 0.947 | 0.946 | ||
| 0.3 | 0.944 | 0.949 | 0.949 | 0.3 | 0.945 | 0.950 | 0.950 | |||||
| 0.5 | 0.944 | 0.950 | 0.950 | 0.5 | 0.944 | 0.953 | 0.952 | |||||
| 0.7 | 0.941 | 0.950 | 0.949 | 0.7 | 0.939 | 0.953 | 0.952 | |||||
| 0.9 | 0.931 | 0.960 | 0.953 | 0.9 | 0.927 | 0.963 | 0.954 | |||||
| 0.7 | 0.2 | 0.944 | 0.950 | 0.949 | 0.7 | 0.5 | 0.2 | 0.942 | 0.955 | 0.954 | ||
| 0.9 | 0.929 | 0.956 | 0.951 | 0.9 | 0.927 | 0.961 | 0.955 | |||||
SA, FZ, and PV refer to 95% confidence intervals for common AC1 using the simple asymptotic, Fisher’s Z transformation, and profile variance methods, respectively
Agreement between ophthalmologist and reading center classifying superior nasal retinal breaks stratified by PVR grade
| PVR grade | ||||
|---|---|---|---|---|
| C3 | D1 | D2 | D3 | |
| Both ( | 1 | 6 | 5 | 3 |
| One ( | 9 | 8 | 11 | 9 |
| Neither ( | 65 | 46 | 54 | 33 |
| Total ( | 75 | 60 | 70 | 45 |
| 0.073 | 0.167 | 0.150 | 0.167 | |
| 0.880 | 0.867 | 0.843 | 0.800 | |
| 0.117 | 0.520 | 0.384 | 0.280 | |
| AC1 (MLE) | 0.861 | 0.815 | 0.789 | 0.723 |