| Literature DB >> 24837970 |
Guogen Shan1, Gregory E Wilding2.
Abstract
Asymptotic and exact conditional approaches have often been used for testing agreement between two raters with binary outcomes. The exact conditional approach is guaranteed to respect the test size as compared to the traditionally used asymptotic approach based on the standardized Cohen's kappa coefficient. An alternative to the conditional approach is an unconditional strategy which relaxes the restriction of fixed marginal totals as in the conditional approach. Three exact unconditional hypothesis testing procedures are considered in this article: an approach based on maximization, an approach based on the conditional p-value and maximization, and an approach based on estimation and maximization. We compared these testing procedures based on the commonly used Cohen's kappa with regards to test size and power. We recommend the following two exact approaches for use in practice due to power advantages: the approach based on conditional p-value and maximization and the approach based on estimation and maximization.Entities:
Mesh:
Year: 2014 PMID: 24837970 PMCID: PMC4023979 DOI: 10.1371/journal.pone.0097386
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
contingency table for the agreement test.
| Clinician B | Total | |||
| Yes | No | |||
| Clinician A | Yes |
|
|
|
| No |
|
|
| |
| Total |
|
| N = 60 | |
Strength of agreement using the kappa coefficient.
| Poor: |
|
| Slight: |
|
| Fair: |
|
| Moderate: |
|
| Substantial: |
|
| Almost perfect: |
|
Figure 1Type I rate error plots for the asymptotic, C, M, C+M, and E+M approach with N = 30.
Actual type I error rates at .
| Testing procedure | |||||
| N | Asymptotic | C | M | C+M | E+M |
| 20 | 0.0833 | 0.0188 | 0.0445 | 0.0462 | 0.0499 |
| 30 | 0.0837 | 0.0228 | 0.0461 | 0.0486 | 0.0474 |
| 50 | 0.1001 | 0.0295 | 0.0420 | 0.0482 | 0.0498 |
| 80 | 0.0901 | 0.0314 | 0.0436 | 0.0499 | 0.0499 |
| 100 | 0.0925 | 0.0326 | 0.0467 | 0.0499 | 0.0499 |
Figure 2Power comparison between the four exact testing procedures for N = 20, 30, 50, 80, and 100 from row 1 to row 5, respectively.
P-values for the example assessing cervical spine stiffness.
| Testing procedure | ||||
| Asymptotic | C | M | C+M | E+M |
| 0.0051 | 0.0561 | 0.0511 | 0.0324 | 0.0205 |