| Literature DB >> 29089821 |
Ji Eun Park1, Kyunghwa Han2, Yu Sub Sung1, Mi Sun Chung3, Hyun Jung Koo1, Hee Mang Yoon1, Young Jun Choi1, Seung Soo Lee1, Kyung Won Kim1, Youngbin Shin1, Suah An1, Hyo-Min Cho4, Seong Ho Park1.
Abstract
OBJECTIVE: To evaluate the frequency and adequacy of statistical analyses in a general radiology journal when reporting a reliability analysis for a diagnostic test.Entities:
Keywords: Agreement; Reliability; Repeatability; Repeatability coefficient; Reproducibility; Software program; Statistical analysis; Statistical method
Mesh:
Year: 2017 PMID: 29089821 PMCID: PMC5639154 DOI: 10.3348/kjr.2017.18.6.888
Source DB: PubMed Journal: Korean J Radiol ISSN: 1229-6929 Impact factor: 3.500
Recommended Statistical Methods for Analysis of Reliability
| Dichotomous or Nominal Data (e.g., Benign vs. Malignant) | Ordinal Data (e.g., Grades I, II, III, and IV) | Continuous Data (e.g., Tumor Volume in mL) |
|---|---|---|
| Kappa | Weighted kappa | Reliability parameters: |
| Proportion of agreement | ICC | ICC |
| CCC | ||
| Agreement parameters: | ||
| Within-subject standard deviation | ||
| Repeatability coefficient and reproducibility coefficient | ||
| Coefficient of variation | ||
| Bland-Altman limits of agreement |
ICC has three different models including one-way random, two-way random, and two-way mixed models, and can use either consistency or absolute agreement assumptions. As ICC value for same set of data may change according to model and assumption used, it is desirable to describe model and assumption, for example, as shown in study by Yoo et al. (86). ICC calculated using one-way random model is appropriate for assessing repeatability (112). CCC or ICC calculated using two-way model, random or mixed according to data and setting (6), are appropriate for analyzing reproducibility. Intraobserver reliability could be regarded as similar to repeatability depending on study setting, whereas interobserver reliability should be regarded as reproducibility. CCC = concordance correlation coefficient, ICC = intraclass correlation coefficient
Selection and Reporting of Statistical Methods to Assess Reliability
| Items | No. of Eligible Articles (Denominator) | Yes (%) | No or Uncertain (%) |
|---|---|---|---|
| Use of recommended statistical methods | |||
| Analysis of dichotomous/nominal data | 5 | 5 (100.0) | 0 (0.0) |
| Analysis of ordinal data | 15 | 7 (46.7) | 8 (53.3) |
| Analysis of continuous data | 21 | 20 (95.2) | 1 (4.8) |
| Reporting of weighting method for weighted kappa | 7 | 2 (28.6) | 5 (71.4) |
| Reporting of model for ICC | 17 | 6 (35.3) | 11 (64.7) |
| Reporting of assumption for ICC | 17 | 5 (29.4) | 12 (70.6) |
| Appropriate use/interpretation of reliability parameters | 13 | 10 (76.9) | 3 (23.1) |
| Correct meaning of reproducibility and repeatability | 15 | 12 (80.0) | 3 (20.0) |
Data are numbers of articles with proportion of eligible articles for each item described as percentage in parentheses.
Fig. 1Display of detailed options associated with statistical tests used for reliability analysis in some user-friendly software programs.
A. Selection of weighting method to calculate weighted kappa with MedCalc Version 17.6 (MedCalc Software BVBA; https://www.medcalc.org). B. Selection of model and assumption to calculate ICC with IBM SPSS Statistics for Windows Version 21 (IBM Corp.). C. Selection of model and assumption to calculate ICC with MedCalc Version 17.6 (MedCalc Software BVBA). This software program does not distinguish between random and fixed effects models. ICC = intraclass correlation coefficient