| Literature DB >> 32041541 |
Toni Lange1,2, Christian Kopkow3,4, Jörg Lützner5, Klaus-Peter Günther5, Sascha Gravius6, Hanns-Peter Scharf6, Johannes Stöve7, Richard Wagner8, Jochen Schmitt3.
Abstract
BACKGROUND: Consensus-orientated Delphi studies are increasingly used in various areas of medical research using a variety of different rating scales and criteria for reaching consensus. We explored the influence of using three different rating scales and different consensus criteria on the results for reaching consensus and assessed the test-retest reliability of these scales within a study aimed at identification of global treatment goals for total knee arthroplasty (TKA).Entities:
Keywords: Consensus; Delphi; Outcomes; Rating scales; Reliability; Treatment goals
Mesh:
Year: 2020 PMID: 32041541 PMCID: PMC7011537 DOI: 10.1186/s12874-020-0912-8
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Flow chart
Fig. 2Rating scales
Patient characteristics
| Characteristics | |||
|---|---|---|---|
| 1st round | 2nd round | Drop out | |
| Sex | |||
| Male | 31 (31%) | 26 (30%) | 5 (38%) |
| Female | 69 (69%) | 61 (70%) | 8 (62%) |
| No answer | 0 (0%) | 0 (0%) | 0 (0%) |
| Care of relatives | |||
| Yes | 9 (9%) | 9 (10%) | 0 (0%) |
| No | 87 (87%) | 74 (85%) | 13 (100%) |
| No answer | 4 (5%) | 4 (5%) | 0 (0%) |
| Employment status, current | |||
| Employed | 24 (24%) | 22 (25%) | 2 (15%) |
| Unemployed | 76 (76%) | 65 (75%) | 11 (85%) |
| No answer | 0 (0%) | 0 (0%) | 0 (0%) |
| Age | |||
| Mean [sd] | 68.3 [± 9.9] | 68.0 [± 10] | 68.3 [± 9.3] |
| Range | 42–85 | 42–85 | 48–82 |
Consensus across different rating scales
| Treatment goal: top category | Consensus on specific treatment goals | |||||
|---|---|---|---|---|---|---|
| 1st survey | 2nd survey | |||||
| three-point scale | five-point scale* | nine-point scale* | three-point scale | five-point scale* | nine-point scale* | |
| preventing secondary impairments | 82.8 | 69.0 | 90.8 | 85.1 | 70.1 | 89.7 |
| duration of hospitalization | 52.9 | 36.8 | 64.4 | 48.3 | 39.1 | 62.1 |
| stability | 94.3 | 79.3 | 97.7 | 96.6 | 81.6 | 96.6 |
| pain | 89.7 | 79.3 | 95.4 | 95.4 | 79.3 | 94.3 |
| implant survival | 79.3 | 75.9 | 89.7 | 86.2 | 73.6 | 83.9 |
| quality of life | 87.4 | 73.6 | 97.7 | 89.7 | 74.7 | 90.8 |
| range of motion (ROM) | 92.0 | 80.5 | 96.6 | 93.1 | 79.3 | 96.6 |
| activity of daily life | 86.2 | 78.2 | 94.3 | 89.7 | 77.0 | 88.5 |
| malalignment | 66.7 | 47.1 | 72.4 | 63.2 | 46.0 | 72.4 |
| strength | 79.3 | 60.9 | 93.1 | 80.5 | 65.5 | 92.0 |
| employability | 54.0 | 35.6 | 64.4 | 56.3 | 47.1 | 65.5 |
| walking stairs | 86.2 | 74.7 | 92.0 | 89.7 | 77.0 | 89.7 |
| walking distance | 93.1 | 75.9 | 96.6 | 92.0 | 75.9 | 94.3 |
| no side effects | 78.2 | 67.8 | 83.9 | 74.7 | 71.3 | 82.8 |
| participation in social life | 82.8 | 65.5 | 89.7 | 81.6 | 60.9 | 87.4 |
| sex life | 20.7 | 17.2 | 31.0 | 19.5 | 20.7 | 29.9 |
| physical function | 83.9 | 64.4 | 90.8 | 85.1 | 70.1 | 93.1 |
| physical activity | 71.3 | 59.8 | 88.5 | 70.1 | 70.1 | 80.5 |
| global health status | 90.8 | 73.6 | 94.3 | 89.7 | 79.3 | 92.0 |
| Consensus threshold | Number of treatment goals that reached the threshold | |||||
| 1st survey | 2nd survey | |||||
| three-point scale (%) | five-point scale* (%) | nine-point scale* (%) | three-point scale (%) | five-point scale* (%) | nine-point scale* (%) | |
| > 60% | 16 (84.2) | 14 (73.7) | 18 (94.7) | 16 (84.2) | 15 (78.9) | 18 (94.7) |
| > 70% | 15 (78.9) | 9 (47.4) | 16 (84.2) | 15 (78.9) | 13 (68.4) | 16 (84.2) |
| > 75% | 14 (73.7) | 6 (31.6) | 15 (78.9) | 13 (68.4) | 7 (36.8) | 15 (78.9) |
| > 80% | 11 (57.9) | 1 (5.3) | 15 (78.9) | 13 (68.4) | 1 (5.3) | 15 (78.9) |
| > 90% | 4 (21.0) | 0 (0) | 11 (57.9) | 4 (21.0) | 0 (0) | 8 (42.1) |
The first part of the table shows the percentage of ratings as “main goal” of the 87 participants per survey across each treatment goal provided. The second part of the table shows the number and percentage of treatment goals that reached a certain level of consensus. In case of the three-point scale, the “main goal” is the top category. Five-point*/nine-point* scales: the top category of the five-point* scale, is “very important”, and for the nine-point* scale categories “9”, “8” and “7”
Inter-individual comparison of rating scales
| Statistic | three-point scale | five-point scale | nine-point scale | Sensitivity analysis | |
|---|---|---|---|---|---|
| five-point scalea | nine-point scalea | ||||
| Overallb | |||||
| Changes in 2nd survey (in %) | 12.48 | 24.73 | 32.26 | 20.96 | 8.57 |
| Class imbalancea 1st survey (in %) | 79.16 | 64.93 | 63.62 | 64.93 | 88.25 |
| Test-retest agreement (in %) | 87.52 | 75.27 | 67.74 | 79.04 | 91.43 |
| Weighted kappa [95% CI] | 0.63 [0.62; 0.64] | 0.47 [0.07; 0.86] | 0.78 [0.78; 0.78] | 0.54 [0.50; 0.58] | 0.58 [0.55; 0.62] |
| Mean [range] over the 19 proposed treatment goals | |||||
| Changes in 2nd survey (in %) | 12.60 [2.41; 25.61] | 24.75 [16.05; 38.82] | 32.43 [17.07; 55.13] | 20.96 [16.05; 28.24] | 8.69 [0.00; 24.00] |
| Class imbalancec 1st survey (in %) | 0.80 [49.38; 95.35] | 66.05 [35.71; 83.13] | 63.46 [21.25; 81.18] | 68.65 [45.78; 83.13] | 88.19 [37.50; 100.00] |
| Test-retest agreement (in %) | 87.40 [74.39; 97.59] | 75.25 [61.18; 83.95] | 67.57 [44.87; 82.93] | 79.04 [71.76; 83.95] | 91.31 [76.00; 100.00] |
| Weighted kappa | 0.55 [0.18; 0.87] | 0.44 [0.29; 0.62] | 0.61 [0.17; 0.81] | 0.49 [0.35; 0.67] | 0.40 [0.00; 0.80] |
aRating scale mapped onto three categories
bOverall refers total ratings of all participants of all treatment goals, e.g., the number of participants times 19 goals times ratings of the respective scale five-point/nine-point scale
cClass imbalance is highlighted by the percentage of the most frequently used rating category (e.g. in the first survey, the rating categories main goal/secondary goal/no goal scored 79%/11%/10% across all participants’ ratings of all goals, hence, the imbalance is 79%)
Fig. 3Test-retest agreement: comparison across rating scales:
Pirateplot of the percentage agreement and the weighted kappa distributions between survey 1 and survey 2. Each data point represents the value for a single treatment goal. The beans represents the smoothed density, the boxes the 95% confidence interval with the middle line as the average. Five-point/nine-point scale*: rating scale mapped onto three categories: - “very important” (five-point scale) and “9, 8, 7” (nine-point scale) as “main goal” - “a little important”, “somewhat important” (five-point scale) and “5, 6, 7” (nine-point scale) as “secondary goal” -“I do not expect this” and “this does not apply to me” (five-point scale) and “3, 2, 1” (nine-point scale) as “no goal”