| Literature DB >> 34069942 |
Mauro Giammarino1, Silvana Mattiello2, Monica Battini2, Piero Quatto3, Luca Maria Battaglini4, Ana C L Vieira5, George Stilwell6, Manuela Renna7.
Abstract
This study focuses on the problem of assessing inter-observer reliability (IOR) in the case of dichotomous categorical animal-based welfare indicators and the presence of two observers. Based on observations obtained from Animal Welfare Indicators (AWIN) project surveys conducted on nine dairy goat farms, and using udder asymmetry as an indicator, we compared the performance of the most popular agreement indexes available in the literature: Scott's π, Cohen's k, kPABAK, Holsti's H, Krippendorff's α, Hubert's Γ, Janson and Vegelius' J, Bangdiwala's B, Andrés and Marzo's ∆, and Gwet's γ(AC1). Confidence intervals were calculated using closed formulas of variance estimates for π, k, kPABAK, H, α, Γ, J, ∆, and γ(AC1), while the bootstrap and exact bootstrap methods were used for all the indexes. All the indexes and closed formulas of variance estimates were calculated using Microsoft Excel. The bootstrap method was performed with R software, while the exact bootstrap method was performed with SAS software. k, π, and α exhibited a paradoxical behavior, showing unacceptably low values even in the presence of very high concordance rates. B and γ(AC1) showed values very close to the concordance rate, independently of its value. Both bootstrap and exact bootstrap methods turned out to be simpler compared to the implementation of closed variance formulas and provided effective confidence intervals for all the considered indexes. The best approach for measuring IOR in these cases is the use of B or γ(AC1), with bootstrap or exact bootstrap methods for confidence interval calculation.Entities:
Keywords: agreement index; animal-based measure; dichotomous categorical indicator; inter-rater reliability
Year: 2021 PMID: 34069942 PMCID: PMC8157558 DOI: 10.3390/ani11051445
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 2.752
Values of the agreement indexes for the AWIN animal-based welfare indicator “udder asymmetry” for the nine selected dairy goat farms, sorted by increasing concordance rate ().
| Agreement Index 1 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Farm |
|
|
|
|
|
|
|
|
|
|
|
|
| E-IT1 | 75 | 0.15 | 0.16 | 0.23 | 0.51 | 75 | 0.15 | 0.25 | 0.25 | 0.70 | 0.52 | 0.65 |
| I-IT1 | 77 | 0.24 | 0.24 | 0.24 | 0.54 | 77 | 0.24 | 0.28 | 0.30 | 0.71 | 0.54 | 0.68 |
| I-IT2 | 88 | 0.27 | 0.27 | 0.43 | 0.77 | 88 | 0.28 | 0.58 | 0.58 | 0.87 | 0.79 | 0.86 |
| I-IT3 | 92 | 0.55 | 0.55 | 0.55 | 0.84 | 92 | 0.56 | 0.69 | 0.70 | 0.90 | 0.84 | 0.90 |
| I-IT4 | 95 | 0.64 | 0.64 | 1.00 | 0.89 | 95 | 0.64 | 0.79 | 0.79 | 0.94 | 0.95 | 0.94 |
| I-IT5 | 95 | −0.02 | 0.00 | 0.00 | 0.90 | 95 | −0.01 | 0.80 | 0.81 | 0.95 | 0.95 | 0.95 |
| I-IT6 | 97 | 0.78 | 0.78 | 1.00 | 0.93 | 97 | 0.78 | 0.87 | 0.87 | 0.96 | 0.97 | 0.96 |
| I-IT7 | 97 | −0.02 | 0.00 | 0.00 | 0.93 | 97 | 0.00 | 0.87 | 0.87 | 0.97 | 0.97 | 0.96 |
| I-PT1 | 100 | 1.00 | 1.00 | 1.00 | 1.00 | 100 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
Abbreviations: E, extensive; I, intensive; IT, Italian; PT, Portuguese. 1 [19]; [5]; [5]; [29]; [30]; [31]; [32]; [33]; [24]; [21]. 2 Concordance rate (, %), calculated as: . 3 The related indexes ( index [20], G index [27], and S index [28]) gave the same results.
Figure 1Boxplot of the agreement values obtained for each index with the bootstrap method and the exact bootstrap method for all the selected farms (I-IT1, E-IT1, I-IT2, I-IT3, I_IT4, I_IT5, I_IT6, I-IT7, I-PT1). Legend: ∆ = index; B = index; γ = index; Γ = index; π = index; coincided with the related indexes: index, G index and S index; k = index; H = index; grey = bootstrap method; white = exact bootstrap method. The α index is not reported in the figure as it coincided with Cohen’s . The index is not reported in the figure as it coincided with Hubert’s .