| Literature DB >> 30658613 |
Ivar Magne Austevoll1,2,3, Rolf Gjestad4, Margreth Grotle5,6, Tore Solberg7,8, Jens Ivar Brox7,9, Erland Hermansen10,11,12, Frode Rekeland10, Kari Indrekvam10,11, Kjersti Storheim5, Christian Hellum7,13.
Abstract
BACKGROUND: Assessment of outcomes for spinal surgeries is challenging, and an ideal measurement that reflects all aspects of importance for the patients does not exist. Oswestry Disability Index (ODI), EuroQol (EQ-5D) and Numeric Rating Scales (NRS) for leg pain and for back pain are commonly used patients reported outcome measurements (PROMs). Reporting the proportion of individuals with an outcome of clinical importance is recommended. Knowledge of the ability of PROMs to identify clearly improved patients is essential. The purpose of this study was to search cut-off criteria for PROMs that best reflect an improvement considered by the patients to be of clinical importance.Entities:
Keywords: Back pain; Leg pain; Lumbar degenerative spondylolisthesis (LDS); Lumbar spinal stenosis (LSS); Minimal clinically important difference (MCID); Oswestry disability index (ODI); Patient reported outcome measures (PROMs); Success criteria
Mesh:
Year: 2019 PMID: 30658613 PMCID: PMC6339296 DOI: 10.1186/s12891-018-2386-y
Source DB: PubMed Journal: BMC Musculoskelet Disord ISSN: 1471-2474 Impact factor: 2.362
Fig. 1Flow chart for patients registered with spinal stenosis in NORSpine in the period 2007–2013
Patient demographics and surgical data for patients operated for spinal stenosis and for degenerative spondylolisthesis
| Spinal stenosis | Degenerative spondylolisthesis | |||
|---|---|---|---|---|
| N | N | |||
| Age; Yr ± SD | 3858 | 66 ± 11 | 617 | 67 ± 10 |
| Female, no (%) | 3859 | 1919 (50%) | 617 | 444 (72%) |
| ASA level (1–4); Mean ± SD | 3759 | 2.0 ± 0.6 | 608 | 2.0 ± 0.5 |
| ASA level 1, no (%) | 681 (18%) | 82 (13%) | ||
| ASA level 2, no (%) | 2349 (61%) | 429 (71%) | ||
| ASA level 3, no (%) | 753 (19%) | 97 (16%) | ||
| ASA level 4, no (%) | 12(0.3%) | 0 | ||
| Body Mass Index; Mean (SD) | 3547 | 27 ± 4 | 560 | 27.0 ± 5 |
| Smokers, no (%) | 3808 | 877 (23%) | 609 | 115 (19%) |
| Laminectomy, no (%) | 3859 | 1024 (27%) | 617 | 239 (39%) |
| Midline preserving decompression, no (%) | 3859 | 2835 (73%) | 617 | 378 (61%) |
| Fusion, no (%) | 3859 | 214 (6%) | 617 | 297 (48%) |
| ODI; Mean (SD) | 3837 | 40 ± 15 | 617 | 41 ± 15 |
| EQ-5D; Mean (SD) | 3535 | 0.37 ± 0.32 | 564 | 0.34 ± 0.32 |
| NRS leg pain; Mean (SD) | 3559 | 6.6 ± 2.2 | 569 | 6.7 ± 2.2 |
| NRS back pain; Mean (SD) | 3597 | 6.4 ± 2.2 | 573 | 6.9 ± 2.1 |
N number of patient with data for the evaluated parameter
Follow-up scores and the change scores for PROMs according to the GPE-scale
| Spinal stenosis | Degenerative spondylolisthesis | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | (%) | 1 year | Spear man’s rho | Change score Mean (SD) | Spearman’s rho | N | (%) | 1 year Follow- up Mean (SD) | Spearman’s rho | Change score Mean (SD) | Spear- man’s rho | ||||||
| All | 3060 | 23 | (18) | 0.77 * | 16 | (18) | 0.66* | 509 | 22 | (18) | 0.78* | 19 | (17) | 0.64* | |||
| O | Compl.recovered | 599 | (20%) | 4 | (9) | 32 | (16) | 117 | (23%) | 4 | (7) | 33 | (15) | ||||
| D | Much improved | 1176 | (38%) | 17 | (12) | 21 | (15) | 213 | (42%) | 17 | (13) | 23 | (14) | ||||
| I | Slightly improved | 658 | (21%) | 32 | (12) | 9 | (13) | 105 | (21%) | 36 | (13) | 9 | (12) | ||||
| Unchanged | 283 | (9%) | 38 | (13) | 0 | (10) | 33 | (6%) | 38 | (14) | 5 | (13) | |||||
| Slightly worse | 181 | (6%) | 42 | (13) | 0 | (12) | 21 | (4%) | 41 | (13) | 3 | (13) | |||||
| Much worse | 117 | (4%) | 49 | (12) | −3 | (12) | 11 | (2%) | 51 | (11) | −8 | (13) | |||||
| Worse than ever | 46 | (2%) | 59 | (15) | −11 | (12) | 9 | (2%) | 57 | (17) | −7 | (15) | |||||
| Missing | 799 | 108 | |||||||||||||||
| All | 2464 | 0.64 | (0.31) | 0.73* | 0.25 | 0.36 | 0.50* | 419 | 0.67 | (0.30) | 0.78* | 0.32 | (0.34) | 0.48* | |||
| E | Compl.recovered | 463 | (19%) | 0.92 | (0.15) | 0.47 | (0.32) | 97 | (23%) | 0.93 | (0.16) | 0.51 | (0.30) | ||||
| Q | Much improved | 945 | (38%) | 0.74 | (0.17) | 0.34 | (0.32) | 175 | (42%) | 0.75 | (0.16) | 0.37 | (0.32) | ||||
| - | Slightly improved | 543 | (22%) | 0.55 | (0.26) | 0.19 | (0.33) | 89 | (21%) | 0.46 | (0.29) | 0.18 | (0.31) | ||||
| 5 | Unchanged | 230 | (9%) | 0.41 | (0.31) | 0.03 | (0.29) | 26 | (6%) | 0.40 | (0.30) | 0.08 | (0.33) | ||||
| D | Slightly worse | 148 | (6%) | 0.33 | (0.32) | 0.00 | (0.32) | 17 | (4%) | 0.36 | (0.30) | 0.13 | (0.29) | ||||
| Much worse | 100 | (4%) | 0.15 | (0.23) | 0.15 | (0.32) | 8 | (2%) | 0.30 | (0.34) | 0.02 | (0.08) | |||||
| Worse than ever | 35 | (1%) | 0.04 | (0.22) | 0.24 | (0.37) | 7 | (2%) | 0.08 | (0.24) | 0.03 | (0.11) | |||||
| Missing | 1395 | 198 | |||||||||||||||
| L | All | 2988 | 3.5 | (3.0) | 0.72* | 3.1 | (3.3) | 0.63* | 493 | 3.2 | (2.9) | 0.68* | 3.5 | (3.2) | 0.58* | ||
| E | Compl.Recovered | 580 | 19% | 0.6 | (1.5) | 5.9 | (2.5) | 112 | (23%) | 0.6 | (2.2) | 6.0 | (2.5) | ||||
| G | Much improved | 1159 | 39% | 2.5 | (2.2) | 4.0 | (2.7) | 208 | (42%) | 2.6 | (2.2) | 4.0 | (2.7) | ||||
| Slightly improved | 640 | 21% | 4.9 | (2.2) | 1.8 | (2.6) | 102 | (20%) | 4.8 | (2.4) | 1.8 | (2.6) | |||||
| P | Unchanged | 275 | 9% | 6.3 | (2.1) | 0.1 | (2.3) | 33 | (7%) | 6.1 | (4.7) | 0.4 | (2.2) | ||||
| A | Slightly worse | 176 | 6% | 6.4 | (2.1) | 0.7 | (2.6) | 18 | (4%) | 5.2 | (3.0) | 1.0 | (2.9) | ||||
| I | Much worse | 114 | 4% | 7.5 | (2.1) | −0.5 | (2.6) | 11 | (2%) | 6.6 | (2.3) | 0.4 | (3.8) | ||||
| N | Worse than ever | 44 | 1% | 7.7 | (2.1) | −0.4 | (2.9) | 9 | (2%) | 7.8 | (1.9) | 0.0 | (2.1) | ||||
| Missing | 871 | 124 | |||||||||||||||
| B | All | 3033 | 3.8 | (2.8) | 0.76* | 3.3 | (2.9) | 0.62* | 507 | 3.6 | (2.8) | 0.78* | 3.3 | (2.9) | 0.64* | ||
| A | Compl. recovered | 592 | 20% | 0.6 | (1.4) | 5.4 | (2.5) | 117 | (23%) | 0.7 | (2.0) | 5.8 | (2.5) | ||||
| C | Much improved | 1171 | 38% | 3.0 | (2.0) | 3.2 | (2.5) | 214 | (42%) | 3.0 | (2.0) | 3.7 | (2.5) | ||||
| K | Slightly improved | 648 | 21% | 5.2 | (1.9) | 1.4 | (2.3) | 105 | (21%) | 5.7 | (1.7) | 1.6 | (1.8) | ||||
| Unchanged | 278 | 9% | 6.5 | (2.0) | 0.5 | (2.0) | 32 | (6%) | 6.0 | (2.0) | 1.4 | (2.1) | |||||
| P | Slightly worse | 182 | 6% | 6.7 | (1.8) | 0.1 | (2.0) | 20 | (4%) | 6.7 | (1.6) | 0.3 | (1.6) | ||||
| A | Much worse | 116 | 4% | 7.4 | (2.1) | −0.1 | (2.2) | 11 | (2%) | 7.3 | (2.19 | −0.2 | (1.3) | ||||
| I | Worse than ever | 46 | 2% | 8.3 | (1.9) | −0.8 | (2.3) | 8 | (2%) | 8.5 | (1.3) | −0.4 | (1.4) | ||||
| N | Missing | 826 | 110 | ||||||||||||||
Mean 1 year follow-up scores and mean change scores from baseline to follow-up for ODI, EQ-5D, NRS leg pain, and NRS back pain [positive values indicate decreased disability (ODI), improved health-related quality of life (EQ-5D), and reduced pain (NRS)]. Results are given for all patients, and for patients stratified according to the Global Perceived Effect (GPE) scale. The association between the outcome measurements and the GPE responses are given by Spearman’s rank correlation coefficients (Spearman’s rho)
*p<0.005
Fig. 2Receiver Operating Characteristic curves for ODI. Legend: The closer the curve is in the upper left corner, the higher accuracy for determining whether a patients is cured (‘completely recovered’ or ‘much improved’) or not. 2a. Spinal stenosis; 2b. Degenerative spondylolisthesis
Fig. 3Receiver Operating Characteristic curves for EQ-5D. Legend: The closer the curve is in the upper left corner, the higher accuracy for determining whether a patients is cured (‘completely recovered’ or ‘much improved’) or not. 3a. Spinal stenosis; 3b. Degenerative spondylolisthesis
Fig. 4Receiver Operating Characteristic curves for NRS leg pain. Legend: The closer the curve is in the upper left corner, the higher accuracy for determining whether a patients is cured (‘completely recovered’ or ‘much improved’) or not. 4a. Spinal stenosis; 4b. Degenerative spondylolisthesis
Fig. 5Receiver Operating Characteristic curves for NRS back pain. Legend: The closer the curve is in the upper left corner, the higher accuracy for determining whether a patients is cured (‘completely recovered’ or ‘much improved’) or not. 5a. Spinal stenosis; 5b. Degenerative spondylolisthesis
ROC analyses for determining AUC (95% CI) and for estimating cut-off values for ‘success
| ODI | EQ-5D | NRS | Leg pain | NRS | Back pain | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | (95% CI) | Max corr. Class | Cut-off | AUC | (95% CI) | Max corr. Class | Cut-off | AUC | Max corr. Class. | Cut-off | AUC | Max corr. | Cut-off | |||
| Spinal stenosis | ||||||||||||||||
| Follow-up score | 0.90 | (0.89–0.91) | 82% | ≤24 | 0.87 | (0.85–0.88) | 78% | 0.692 | 0.87 | (0.86–0.89) | 81% | ≤4 | 0.89 | (0.87–0.90) | 82% | ≤4 |
| Change score | 0.86 | (0.84–0.87) | 78% | ≥13 | 0.76 | (0.74–0.78) | 73% | 0.105 | 0.83 | (0.82–0.85) | 77% | ≥3 | 0.82 | (0.81–0.84) | 75% | ≥2 |
| Percentage change (%) | 0.91 | (0.90–0.92) | 83% | ≥30 | 0.86 | (0.85–0.88) | 81% | ≥40 | 0.86 | (0.84–0.87) | 79% | ≥33 | ||||
| Degenerative | ||||||||||||||||
| Follow-up score | 0.92 | (0.89–0.94) | 85% | ≤24 | 0.92 | (0.89–0.94) | 84% | ≥0.692 | 0.86 | (0.82–0.89) | 79% | ≤3 | 0.90 | (0.88–0.93) | 83% | ≤4 |
| Change score | 0.86 | (0.82–0.89) | 80% | ≥13 | 0.76 | (0.76–0.81) | 76% | ≥0.105 | 0.81 | (0.77–0.91) | 76% | ≥3 | 0.84 | (0.81–0.88) | 79% | ≥3 |
| Percentage | 0.92 | (0.90–0.94) | 85% | ≥30 | 0.84 | (0.80–0.87) | 78% | ≥40 | 0.88 | (0.85–0.91) | 80% | ≥33 | ||||
The area under the curve (AUC) with 95% confidence interval (CI) describes a candidate score’s ability to classify patients who replied ‘completely recovered’ or ‘much improved’ on the GPE scale into ‘success’ and those replied ‘slightly improved’, ‘unchanged’, ‘slightly worse’, ‘much worse’, and ‘worse than ever’ into ‘non-success’ at 12 month follow-up. The larger the AUC, the better the accuracy of the score [range from 0.5 (no ability) to 1.0 (perfect ability)]. A cut-off corresponds to the threshold that gave rise to the maximum percentage of patients correctly classified (max corr. Class.) into ‘success’ and ‘non-success’. Results are given for ODI, EQ-5D, NRS leg pain, and NRS back pain for spinal stenosis and for degenerative spondylolisthesis. Because the EQ-5D questionnaire values ranged from −0.6 to 1.0 on a categorical scale, it was not mathematically possible to evaluate the percent change score
Sensitivity and specificity for estimated cut-off values. Correct classification rate in different PROM baseline groups
| Spinal stenosis | Degenerative spondylolisthesis | |||||||
|---|---|---|---|---|---|---|---|---|
| Estimated cut-off | Correct | Sensitivity | Specificity | Estimated cut-off | Correct | Sensitivity | Specificity | |
| ODI follow-up score | ≤24 |
|
| ≤24 |
|
| ||
| Low baseline | 80% | 87% | ||||||
| Medium | 85% | 85% | ||||||
| High baseline | 80% | 84% | ||||||
| ODI change score | ≥13 |
|
| ≥13 |
|
| ||
| Low baseline | 72% | 77% | ||||||
| Medium | 84% | 86% | ||||||
| High baseline | 78% | 75% | ||||||
| ODI percentage change | ≥30 |
|
| ≥30 |
|
| ||
| Low baseline | 83% | 88% | ||||||
| Medium | 85% | 85% | ||||||
| High baseline | 80% | 82% | ||||||
| EQ-5D follow-up score | ≥0.692 |
|
| ≥0.692 |
|
| ||
| Low baseline | 75% | 81% | ||||||
| Medium | 79% | 80% | ||||||
| High baseline | 80% | 82% | ||||||
| EQ-5D change score | ≥0.105 |
|
| ≥0.105 |
|
| ||
| Low baseline | 73% | 74% | ||||||
| Medium | 75% | 80% | ||||||
| High baseline | 72% | 71% | ||||||
| Leg pain follow-up score | ≤3 |
|
| ≤3 |
|
| ||
| Low baseline | 82% | 81% | ||||||
| Medium | 82% | 76% | ||||||
| High baseline | 81% | 79% | ||||||
| Leg pain change score | ≥3 |
|
| ≥3 |
|
| ||
| Low baseline | 69% | 70% | ||||||
| Medium | 82% | 76% | ||||||
| High baseline | 78% | 80% | ||||||
| Leg pain percentage change | ≥40 |
|
| ≥40 |
|
| ||
| Low baseline | 79% | 75% | ||||||
| Medium | 81% | 76% | ||||||
| High baseline | 81% | 81% | ||||||
| Back pain follow-up score | ≤4 |
|
| ≤4 |
|
| ||
| Low baseline | 81% | 82% | ||||||
| Medium | 83% | 80% | ||||||
| High baseline | 82% | 87% | ||||||
| Back pain change score | ≥2 |
|
| ≥3 |
|
| ||
| Low baseline | 72% | 67% | ||||||
| Medium | 83% | 81% | ||||||
| High baseline | 71% | 83% | ||||||
| Back pain percentage change | ≥33% |
|
| ≥33% |
|
| ||
| Low baseline | 76% | 78% | ||||||
| Medium | 83% | 80% | ||||||
| High baseline | 80% | 85% | ||||||
The sensitivity describes the probability of correctly classifying an individual replying ‘completely recovered’ or ‘much improved’ (GPE) as a ‘success’ when assessed by the estimated cut-offs for the PROMs. The specificity describes the probability for detecting a ‘non-success’ patient (one with a lower response at the GPE scale)
For each estimated cut-off values the percentage of correctly classified patients (correct classification) into ‘success’ and ‘non-success’ according to the anchor are given separately for patients with low (ODI; 0–32, EQ-5D; −0.59-0.1, NRS leg and back pain; 0–5), medium (ODI; 32–46, EQ-5D; 0.1–0.6, NRS leg and back pain; 6–7), and high (ODI; 46 to 100, EQ-5D; 0.6–1.0, NRS leg and back pain; 8–10) baseline scores