Literature DB >> 22792458

A hierarchy of patient-reported outcomes for meta-analysis of knee osteoarthritis trials: empirical evidence from a survey of high impact journals.

Carsten Juhl1, Hans Lund, Ewa M Roos, Weiya Zhang, Robin Christensen.   

Abstract

Objectives. To develop a prioritised list based on responsiveness for extracting patient-reported outcomes (PROs) measuring pain and disability for performing meta-analyses in knee osteoarthritis (OA). Methods. A systematic search was conducted in 20 highest impact factor general and rheumatology journals chosen a priori. Eligible studies were randomised controlled trials, using two or more PROs measuring pain and/or disability. Results. A literature search identified 402 publications and 38 trials were included, resulting in 54 randomised comparisons. Thirty-five trials had sufficient data on pain and 15 trials on disability. The WOMAC "pain" and "function" subscales were the most responsive composite scores. The following list was developed. Pain: (1) WOMAC "pain" subscale, (2) pain during activity (VAS), (3) pain during walking (VAS), (4) general knee pain (VAS), (5) pain at rest (VAS), (6) other composite pain scales, and (7) other single item measures. Disability: (1) WOMAC "function" subscale, (2) SF-36 "physical function" subscale, (3) SF-36 (Physical composite score), and (4) Other composite disability scores. Conclusions. As choosing the PRO most favourable for the intervention from individual trials can lead to biased estimates, using a prioritised list as developed in this study is recommended to reduce risk of biased selection of PROs in meta-analyses.

Entities:  

Year:  2012        PMID: 22792458      PMCID: PMC3389647          DOI: 10.1155/2012/136245

Source DB:  PubMed          Journal:  Arthritis        ISSN: 2090-1992


1. Introduction

Measures for patient-reported outcomes (PROs) are used in most osteoarthritis (OA) trials. Extracting and combining these data is an essential part of any meta-analysis of such trials. At the 3rd Outcome Measures in Rheumatology (OMERACT) Conference in 1996, consensus was reached on three domains—pain, disability, and patient global assessment—as measures that should be reported in future OA trials [1]. This recommendation only specified which constructs to measure, but not which specific instruments to include [2]. In knee OA, pain is typically measured by either the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC, “pain” subscale) or by different pain scores using a visual analogue scale (VAS). Patient-reported disability is often measured by WOMAC “function” subscale or Short Form 36 (SF-36 “physical functioning” (PF) subscale), but a variety of different PRO measures have been used [3]. Often several PROs are used in trials for estimating change in pain and disability. Simply extracting data on the primary outcome may not be an option in a meta-analysis, as this is often not reported and may not be available for both pain and disability. If the choice of PROs is based on which outcome measure reaches statistical significance, the corresponding meta-analytic estimates are likely to be biased [4, 5]. Ideally, the most responsive outcome measure is the best choice for extraction and inclusion in a meta-analysis, provided that it is a valid outcome measure and included in the majority of the trials. Whether the outcome measure is truly valid (i.e., it measures the intended change) is a separate consideration based on face, content, construct, and criterion validity [6]. Good validity is a prerequisite for high responsiveness [7]. Responsiveness is quantified as the standardised mean difference (SMD), which is calculated as the difference in the mean change between the intervention and control groups divided by the pooled standard deviation (combining the different groups in any particular trial). The PRO with the largest SMD is considered to be the most responsive [8]. The choice of the most responsive among the potential outcome measures is consistent with the OMERACT filter of truth, discrimination, and feasibility [9]. Combining instruments with different responsiveness can increase heterogeneity in a meta-analysis [10]. Alternatively, various outcomes can be transformed to the most frequently used outcome, using the transformation coefficients from regression analyses [7, 10]. In meta-analyses of OA, it is recommended that a hierarchy of PROs should be determined prior to data extraction [11]. In 2006, Jüni et al. suggested a prioritised list for extracting data on pain and disability in patients with knee OA, but the methodology behind the list was not reported [11]. Likewise in a meta-analysis of aquatic exercise for OA, a nonsystematic approach was used to develop an operational hierarchy for OA meta-analyses [12].

1.1. Objectives

The aim of this study was to develop a prioritised list for extracting PROs on pain and disability for meta-analyses, based on an investigation of the responsiveness of PRO measures in knee OA, and restricted to trials with an anticipated low risk of selective outcome reporting.

2. Methods

2.1. Eligibility Criteria

Trials were considered eligible in the current study if they were designed as randomised controlled trials (RCTs) or quasi-randomised trials investigating any type of intervention for patients with knee OA [13]. Trials had to include at least one group with a specified intervention and a control group. In accordance with international consensus regarding the core set of outcome measures for phase III clinical trials in OA [9], the eligible RCTs had to include assessment of at least self-reported pain and/or self-reported disability. Only trials measuring one or both of these constructs with at least two different outcome measures were eligible.

2.2. Information Sources

A systematic literature search was conducted in MEDLINE in December 2009 by searching PubMed in order to identify RCTs of interventions aimed at reducing pain and disability in patients with knee OA. In order to identify RCTs with a low risk of selective outcome reporting, the search was restricted to RCTs published later than the year of 2002. The revised CONSORT Statement was published in 2001, and the reporting of RCTs was expected to improve from 2002 [14].

2.3. Literature Search

The search terms for identifying knee OA were “osteoarthritis, knee” [MeSH] OR (“osteoarthritis” [TIAB] AND “knee” [TIAB]). The search was restricted to the ten highest impact factor general and internal medicine journals (previous 5 years) as well as the ten highest impact factor rheumatology journals (previous 5 years) (determined by Journal Citation Reports, http://isiwebofknowledge.com. 6 November 2009): The New England Journal of The Medicine; Journal of the American Medical Association; The Lancet; Annals of Internal Medicine; British Medical Journal; PLoS Medicine; Annual Review of Medicine; Archives of Internal Medicine; Canadian Medical Association Journal; Medicine, Arthritis and Rheumatism; Annals of Rheumatic Diseases; Nature Clinical Practice Rheumatology; Arthritis Research & Therapy; Seminars in Arthritis and Rheumatism; Osteoarthritis and Cartilage, Rheumatology (Oxford); Current Opinion in Rheumatology; Journal of Rheumatology; and Best Practice and Research in Clinical Rheumatology. Targeting journals with a high impact factor has previously been suggested as a good strategy for identifying journal articles with high methodological quality [15].

2.4. Study Selection

Two members of the study team (C. Juhl, H. Lund) independently scrutinised titles and abstracts of all identified publications. The full text of any article was obtained if it was judged eligible by at least one of the reviewers. The two reviewers then evaluated eligibility based on the full text of all the retrieved papers, and consensus on inclusion was reached by discussion.

2.5. Data Collections Process and Data Items

Study identification (author, year) and outcome measures were extracted using a customised data extraction form. For each outcome measure, the number of participants in the intervention and the control groups and the mean change and standard deviation (SD) were extracted in order to calculate the SMD. When SD was not available in an explicit format, it was estimated from the standard error (SE), confidence interval, the P value, the interquartile range, or other methods as recommended by the Cochrane Collaboration [16].

2.6. Risk of Bias in Individual Studies

Selective outcome reporting has been defined as choosing a subset of the original outcomes on the basis of the results [1, 2]. Two members of the study team (C. Juhl, H. Lund) assessed the risk of selective outcome reporting, indexed according to whether the trials had been classified as “adequate,” “unclear,” or “inadequate” in accordance with the Cochrane Handbook for Systematic Reviews of Interventions 5.1.0 [16]. Selective outcome reporting was deemed as follows. Adequate, if a protocol (published or from ClinicalTrials gov or other databases) was available and all PROs were sufficiently reported for extracting data for estimating SMD. Unclear, if a protocol was not available (published or from ClinicalTrials gov or other databases). Inadequate, if some or all PROs were insufficiently reported for extracting data for estimating SMD (evaluated by checking the protocol from earlier publications, trial registers, or described in the published trial).

2.7. Summary Measures

The effect size was calculated as the SMD to allow comparison of the various PROs. The SMD was estimated as the difference in mean change between the intervention and control groups divided by the pooled SD. The pooled SD was estimated from SDpooled 2 = (SD 2 × [N − 1] + SD 2 × [N − 1])/(N + N − 2), where N and SD represent the number of patients and the SD in the intervention group, respectively. The SMD was used to represent the responsiveness; the higher the SMD, the more responsive the measure. This approach is only valid to rank PROs within a particular study, as different studies obviously measure different therapeutic interventions. Data from an intention to treat (ITT) analysis was preferred for calculating the SMD. When several intervention groups were compared with a control group, the number of control patients was divided equally into the appropriate number of groups when estimating the SMD.

2.8. Synthesis of Results

The responsiveness estimated as SMD of the PROs in each of the included trials (or subgroups when more interventions were compared with the control) was ranked according to responsiveness for pain and disability separately. The PRO used to measure the effect of an intervention in any individual trial with the highest responsiveness was ranked 1, the second most responsive was ranked 2, and so on. The mean rank was then used to estimate the responsiveness across the trials, and a low mean rank (close to 1) indicated that this PRO was often the most responsive PRO used. The PROs used in at least 5 trials were then listed according to the lowest mean rank of the SMD. However, composite item scales with established validity would rate higher than single item scales.

2.9. Risk of Bias across Studies

A sensitivity analysis was performed evaluating the impact of different systematic approaches to data extraction of PROs in a meta-analysis. The pooled mean across trials using the developed list from the current study was compared with lists based on (1) the most favourable outcome from each of the individual trials, (2) the most frequently used PROs, and (3) the most responsive of the PROs. Inconsistency in means between trials was evaluated using the I 2 index [17]. A random effect model was used for pooling the trials. All analyses were performed at the study level using the meta-analysis software “Comprehensive Meta Analysis” Version 2, Biostat. Inc., and were based on published data only.

2.10. Additional Analysis

In order to assess the robustness of the developed list, subgroup analyses stratifying the available trials according to risk of selective outcome bias were applied, and a list based on trials with no risk of selective outcome bias was compared with the list based on all included trials. Stratified analyses were performed based on whether the included trials were published in the general/internal medicine journals or in the rheumatology journals. Secondly, sensitivity analyses were performed stratifying trials according to the intervention; injection in the knee joint, oral medication and other interventions (tai chi, lateral wedge shoes, etc.). Outcomes used at least 5 times were ranked based on the mean rank and the lists from these subgroups were compared to the list based of all included trials.

3. Results

3.1. Study Selection

Through the search strategy, 402 publications were identified, as presented in the flowchart (Figure 1). Titles and abstracts of the publications were checked independently by two reviewers (C. Juhl, H. Lund). One hundred and eighty three trials were identified as potentially eligible by at least one of the reviewers and subsequently examined independently in full text by two reviewers (C. Juhl, H. Lund). Consensus was reached by discussion and resulted in 38 trials that fulfilled the eligibility criteria and were included in the analysis [18-55] (Table 1).
Figure 1

Flowchart: identification of studies.

Table 1

Characteristics of included studies.

AuthorJournalComparisonNumber of randomised patientsConstructSelectiveoutcome reportingOutcomeNumber of patientsStandard mean differences (SMD) measured as Cohens' d (SE)Rank of responsiveness measured as SMD
Altman et al. (Flexx) 2009Semin Arthritis Rheum.Sodium hyaluronate versus saline588PainUnclearWOMAC291/2950.143 (0.083)2
VAS (walking)285/2950.259 (0.083)1
WOMAC291/2950.194 (0.083)1
Disability SF-36 (PCS)291/2950.191 (0.083)2
SF-36 (PF)291/2950.170 (0.083)3
Baltzer et al. 2009Osteoarthritis and CartilageAutologous conditioned serum (ACS) versus saline241PainUnclearWOMAC134/1070.703 (0.166)2
VAS (activity)134/1070.937 (0.169)1
Disability WOMAC134/1070.699 (0.166)1
SF-8 (PCS)134/1070.682 (0.166)2
Hyaluronan versus saline242PainUnclearWOMAC135/1070.050 (0.162)1
VAS (activity)135/1070.035 (0.162)2
Disability WOMAC135/1070.078 (0.162)2
SF-8 (PCS)135/1070.146 (0.162)1
Barthel et al. 2009Semin Arthritis  Rheum.Diclofenac versus vehicle492PainUnclearWOMAC254/2380.234 (0.091)2
VAS (activity)254/2380.281 (0.091)1
Bennell et al. 2005Ann. Rheum. Dis.Multimodal physiotherapy programme versus placebo140PainUnclearWOMAC73/670.122 (0.169)3
VAS (activity)73/670.223 (0.170)1
SF-36 (BP)73/670.071 (0.169)4
VAS (knee)73/670.216 (0.170)2
Disability WOMAC73/670.085 (0.169)2
SF-36 (PF)73/670.236 (0.170)1
Berman et al. 2004Ann. Intern. Med.Acupuncture versus sham  acupuncture381DisabilityUnclearWOMAC142/1410.201 (0.146)1
SF-36 (PCS)142/1410.134 (0.146)2
Education versus sham acupuncture380DisabilityUnclearWOMAC141/108 −0.244 (0.154)2
SF-36 (PCS)141/108−0.255 (0.154)1
Burch et al. 2008Osteoarthritis and CartilagePattern stimulation versus control (TENS)116PainUnclearWOMAC52/530.698 (0.201)1
VAS (global)52/530.231 (0.195)2
Case et al. 2003Ann. Intern. Med.Diclofena versus placebo53PainUnclearWOMAC25/280.446 (0.338)1
Lequesne25/280.190 (0.334)2
Acetaminophen versus placebo57PainUnclearWOMAC29/280.096 (0.326)2
Lequesne29/280.217 (0.326)1
Clegg et al. 2006N. Engl. J. Med.Glucosamine + chondroitin versus placebo630PainInadequateWOMAC317/3130.127 (0.126)2
HAQ317/3130.146 (0.126)1
Disability WOMAC317/3130.137 (0.126)1
HAQ317/3130.104 (0.126)2
Glucosamine versus placebo630PainInadequateWOMAC317/313−0.028 (0.126)1
HAQ317/313−0.021 (0.126)2
Disability WOMAC317/313−0.013 (0.126)2
HAQ317/3130.056 (0.126)1
Chondroitin versus placebo631PainInadequateWOMAC318/313−0.020 (0.126)2
HAQ318/313−0.046 (0.126)1
Disability WOMAC318/3130.023 (0.126)2
HAQ318/3130.029 (0.126)1
Celecoxib versus placebo631PainInadequateWOMAC318/3130.132 (0.126)1
HAQ318/3130.130 (0.126)2
Disability WOMAC318/3130.179 (0.126)1
HAQ318/3130.114 (0.126)2
Forestier et al. 2009Ann. Rheum. Dis.Spa therapy versus control451PainAdequateVAS (knee)193/1860.310 (0.103)1
SF-36 (BP)194/1850.190 (0.103)2
Disability WOMAC179/1720.366 (0.108)1
SF-36 (PCS)190/1770.153 (0.105)2
SF-36 (PF)195/1860.070 (0.103)3
Geba et al. 2002JAMARofecoxib 25 mg/d versus acetaminophen189PainInadequateVAS (walking)95/940.467 (0.208)1
VAS (night)95/940.394 (0.207)3
VAS (rest)95/940.411 (0.206)2
Rofecixib 12.5 mg/d versus acetaminophen190PainInadequateVAS (walking)96/940.191 (0.206)1
VAS (night)96/940.069 (0.206)3
VAS (rest)96/940.135 (0.206)2
Celecoxib versus  Acetaminophen191PainInadequateVAS (walking)97/940.233 (0.206)1
VAS (night)97/94−0.043 (0.206)3
VAS (rest)97/940.074 (0.206)2
Gibofsky et al. 2003Arthritis, Rheum.Rofecoxib versus placebo285PainUnclearWOMAC189/960.489 (0.163)1
VAS (walking)189/960.366 (0.162)2
Celecoxib versus placebo286PainUnclearWOMAC190/960.514 (0.163)1
VAS (walking)190/960.451 (0.163)2
Hinman et al. 2003BMJTappening versus placebo58PainUnclearWOMAC29/290.160 (0.318)4
NRS (activity)29/290.802 (0.329)1
NRS (walking)29/290.795 (0.329)2
SF-36 (BP)29/290.308 (0.320)3
Disability WOMAC29/291.019 (0.340)1
SF-36 (PF)29/290.247 (0.323)2
Control tape versus placebo58PainUnclearWOMAC29/290.649 (0.329)1
NRS (activity)29/290.376 (0.329)4
NRS (walking)29/290.649 (0.329)1
SF-36 (BP)29/290.431 (0.325)3
Disability WOMAC29/290.640 (0.329)1
SF-36 (PF)29/290.230 (0.323)2
Hughes and Carr 2002RheumatologyGlucosamine sulphate versus placebo80PainUnclearWOMAC39/39−0.066 (0.227)3
VAS (activity)39/39−0.020 (0.226)5
VAS (knee)39/39−0.032 (0.226)4
VAS (rest)39/390.323 (0.228)1
McGill (sense)39/390.221 (0.227)2
Kirkley et al. 2008N. Engl. J. Med.Arthroscopic surgery versus placebo188PainAdequateWOMAC88/800.316 (0.155)1
ASES88/80−0.073 (0.155)2
Disability WOMAC88/800.259 (0.155)1
ASES88/800.209 (0.155)2
SF-36 (PCS)88/80−0.009 (0.155)3
Kitay et al. 2009Osteoarthritis and CartilageVibrations, CPM, and heat versus sham device71PainAdequateWOMAC24/320.984 (0.284)1
VAS (knee)24/320.984 (0.284)1
Mazières et al.2007Ann. Rheum. Dis.Chondroitin sulphate versus placebo307PainUnclearVAS (activity)139/1400.260 (0.120)1
VAS (rest)139/1400.092 (0.120)2
McCarthy et al. 2004RheumatologyClass-based versus home-based exercise214PainUnclearWOMAC111/1030.321 (0.138)2
VAS (walking)111/1030.836 (0.143)1
Moseley et al. 2002N. Engl. J. Med.Arthroscopic debridement versus placebo119PainUnclearKnee specific
Pain scale53/550.010 (0.232)3
AIMS (pain)53/55−0.072 (0.224)2
SF-36 (BP)52/550.114 (0.194)1
Arthroscopic lavage versus placebo121PainUnclearKnee specific
Pain scale55/550.110 (0.236)2
AIMS (pain)56/55−0.200 (0.223)1
SF-36 (BP)57/550.090 (0.189)3
Neustadt et al. 2005J. Rheumatol.Hyalerone four times a week versus control251PainUnclearWOMAC115/1140.098 (0.162)2
VAS (activity)104/1000.158 (0.172)1
Hyalerone three times a week versus control242PainUnclearWOMAC107/114−0.028 (0.164)2
VAS (activity)90/1000.030 (0.176)1
Niethard et al.2005J. Rheumatol.Diclofena versus placebo238PainUnclearWOMAC117/1200.363 (0.131)1
VAS (activity)117/1190.360 (0.131)2
Nuñez et al.2006Osteoarthritis and CartilageEducation programme versus control100PainUnclearWOMAC43/370.980 (0.237)1
SF-36 (BP)43/370.605 (0.229)2
Disability WOMAC43/370.742 (0.232)1
SF-36 (PF)43/370.576 (0.229)2
J. Petrella and M. Petrella2006Arch. Intern. Med.Massage versus control74PainAdequateWOMAC34/340.948 (0.256)1
VAS (knee)34/340.872 (0.254)2
Petrella et al. 2006J. Rheumatol.Hyaleron versuscontrol106PainInadequateWOMAC53/530.092 (0.194)1
VAS (activity)53/53−0.040 (0.194)2
VAS (walking)53/530.000 (0.194)3
DisabilityUnclearWOMAC53/53−0.147 (0.195)2
SF-36 (PF)53/530.084 (0.194)1
Petrella 2002Arch. Intern. Med.Hyalerone + placebo versus control53PainUnclearWOMAC25/280.190 (0.384)4
VAS (activity)25/280.890 (0.398)1
VAS (rest)25/28−0.539 (0.389)3
VAS (walking)25/280.623 (0.391)2
Hyalerone + NSAID versuscontrol57PainUnclearWOMAC29/280.238 (0.377)4
VAS (activity)29/280.501 (0.381)2
VAS (rest)29/280.383 (0.379)3
NSAID + saline versus control54PainUnclearVAS (walking)29/281.268 (0.403)1
WOMAC26/280.336 (0.384)3
VAS (activity)26/281.622 (0.428)1
VAS (rest)26/280.173 (0.382)4
VAS (walking)26/281.154 (0.406)2
Pham et al. 2004Ann. Rheum. Dis.Hyalerone versus control206PainUnclearVAS (knee)131/85−0.035 (0.177)2
Painful days131/85−0.078 (0.177)1
Diacerein versus control170PainUnclearVAS (knee)85/85−0.023 (0.188)2
Painful days85/85−0.027 (0.188)1
Ravaud et al. 2009BMJStandardised consultation versus usual care327DisabilityUnclearWOMAC146/1810.260 (0.112)2
SF-12 (PCS)129/1470.361 (0.122)1
Raynauld et al. 2003Arthritis Rheum.Triamcinolone versus saline (control)68PainUnclearWOMAC33/330.307 (0.248)2
VAS (knee)33/330.672 (0.253)1
Raynauld et al. 2002Osteoarthritis and CartilageHylan G-F 20 versus Control255DisabilityUnclearWOMAC124/1070.668 (0.136)1
SF-36 (PCS)124/1070.553 (0.134)2
Rodrigues et al. 2008Arthritis Rheum.Medial-wedge insole versus neutral insole30PainAdequateVAS (activity)16/141.337 (0.405)1
VAS (rest)16/140.899 (0.384)2
VAS (night)16/140.802 (0.380)3
Rooks et al.2006Arthritis RheumPreoperative exercise versus control45PainUnclearWOMAC14/150.220 (0.373)2
SF-36 (BP)14/15−0.322 (0.374)1
Disability WOMAC14/150.034 (0.372)2
SF-36 (PF)14/15−0.391 (0.375)1
Sengupta et al. 2008Arthritis Res. Ther.5-Loxin 250 mg versus control50PainUnclearWOMAC23/231.292 (0.393)2
VAS (knee)23/232.463 (0.467)1
5-Loxin 100 mg versus control50PainUnclearWOMAC24/231.039 (0.379)1
VAS (knee)24/231.918 (0.425)2
Song et al. 2009Ann. Rheum. Dis.Icatibant 2000 μg versus control29PainUnclearVAS (activity)14/150.492 (0.459)1
VAS (rest)14/150.391 (0.456)2
Icatibant 500 μg versus control27PainUnclearVAS (activity)12/150.411 (0.470)1
VAS (rest)12/150.391 (0.470)2
Tannenbaum et al. 2004Ann. Rheum. Dis.Lumiracoxib 400 mg versus placebo734PainUnclearWOMAC491/2430.211 (0.132)2
VAS (knee)491/2430.307 (0.132)1
Lumiracoxib 200 mg versus placebo730PainUnclearWOMAC487/2430.189 (0.132)2
VAS (knee)487/2430.236 (0.132)1
Celecoxib versus placebo724PainUnclearWOMAC481/2430.184 (0.132)2
VAS (knee)481/2430.217 (0.132)1
Trnavský et al. 2004J. Rheumatol.Ibuprofen versus placebo50PainUnclearVAS (walking)25/251.739 (0.332)1
VAS (rest)25/250.932 (0.298)2
VAS (knee)25/250.786 (0.294)3
Vas et al. 2004BMJAcupuncture versus control97PainUnclearWOMAC48/491.109 (0.218)2
VAS (knee)48/491.249 (0.222)1
Wang et al. 2009Arthritis Rheum.Tai chi versus control40PainAdequateWOMAC20/200.456 (0.320)1
VAS (knee)20/200.015 (0.316)2
40DisabilityAdequateWOMAC20/200.356 (0.319)2
SF-36 (PCS)20/200.820 (0.329)1
Witt et al. 2005LancetAcupuncture versus minimal acupuncture226PainUnclearWOMAC149/750.150 (0.142)1
SES149/750.062 (0.142)2
226DisabilityUnclearWOMAC149/750.249 (0.142)1
SF-36 (PCS)149/750.215 (0.142)3
PDI149/750.245 (0.142)2
Wittenberg et al. 2006Arthritis Rheum.Lumiracoxib versus placebo219PainUnclearWOMAC144/750.406 (0.185)2
VAS (activity)144/750.645 (0.186)1
Celecoxib versus placebo220PainUnclearWOMAC145/750.396 (0.184)2
VAS (activity)145/750.525 (0.185)1

WOMAC (Western Ontario and McMaster Universities Arthritis Index), VAS (visual analogue scale), NRS (numerical rating scale), AIMS (Arthritis Impact Measurement Scale), ASES (Arthritis Self-Efficacy Scale), HAQ (Health Assessment Questionnaire), PDI (Pain Disability Index), SES (Schmerzempfindungsskala) (Pain Experience Scale), SF-36 (Short Form 36), BP (bodily pain), PF (physical function), PCS (Physical Composite Scale).

3.2. Study Characteristics

Pain was evaluated with more than one PRO in 35 trials [18–21, 23–42, 45–55], and disability was evaluated in 15 trials [18, 19, 21, 22, 25, 26, 29, 31, 38, 40, 43, 44, 46, 53, 54]. More than one intervention group was compared in 14 trials measuring pain [19, 24, 25, 27–29, 34–36, 41, 42, 48–50, 55] and in 4 trials measuring disability [22, 25, 29, 40]. Different specific questions were asked when VAS scores were used for measuring pain. The VAS scores were classified as pain during activity (e.g., stepping, daily activities, etc.), pain during walking, pain at rest, pain at night, and general knee pain. WOMAC was used both in a VAS version using a 100 mm scale and in a Likert version using a 1–5 scale, and these were treated as different outcomes in the initial analysis. Pain during activity and pain during walking were measured by either a VAS score or a numeric rating scale (NRS) score, and these PROs were analysed separately. Then eighteen different PROs were used for measuring pain and seven PROs for measuring disability in the included trials.

3.3. Risk of Bias within Studies

Only 12 out of the 38 included trials were registered in ClinicalTrial.gov. Six of these trials were classified as “adequate” and six as “inadequate”. Two trials reported PROs not declared in the protocol, one did not report all PROs from the published protocol, one did not report all time points, and two protocols did not report PROs at all in the published protocol (Table 1).

3.4. Results of Individual Studies

The most frequently used PROs for measuring pain were the WOMACpain” subscale (in either the Likert scale or VAS format) used in 27 of the included trials and for disability the most frequently used PRO was the WOMAC “function” subscale (in either the Likert scale or VAS format) used in all 15 trials (Table 2).
Table 2

Frequency of the used outcomes listed according to their frequency in the included trials.

Pain outcome (scale)Number of trials k  =  35 (%)Number of substudies k  =  54 (%)Mean rank in responsiveness (range)
WOMAC pain (Likert scale)15 (43%)20 (37%)1.8 (1–4)
Global knee pain (VAS)13 (37%)17 (31%)1.7 (1–4)
WOMAC pain (100 mm scale)12 (34%)21 (39%)1.9 (1–4)
Pain during activity (VAS)12 (34%)18 (33%)1.4 (1–5)
Pain during walking (VAS)7 (20%)12 (22%)1.5 (1–3)
Pain at rest (VAS)7 (20%)12 (22%)2.3 (1–4)
SF-36 (bodily pain (BP) subscale)6 (17%)8 (15%)2.3 (1–4)
Pain at night (VAS)2 (6%)4 (7%)3.0 (3)
HAQ (pain subscale)1 (3%)4 (7%)1.5 (1-2)
Lequesne algofunctional index (pain subscale)1 (3%)2 (4%)1.5 (1-2)
Pain during activity (NRS)1 (3%)2 (4%)2.5 (1–4)
Pain during walking (NRS)1 (3%)2 (4%)2.5 (2-3)
Number of painful days (days)1 (3%)2 (4%)1.0 (1)
AIMS (pain subscale)1 (3%)2 (4%)1.5 (1-2)
Knee-Specific Pain Scale (KSPS)1 (3%)2 (4%)2.5 (2-3)
McGill Pain Questionnaire (pain intensity)1 (3%)1 (2%)2.0 (2)
ASES (pain subscale)1 (3%)1 (2%)2.0 (2)
SES (Schmerzempfindungsskala)1 (3%)1 (2%)2.0 (2)

Disability outcome (scale)Number of trials k  =  15 (%)Number of substudies k  =  21 (%)Mean rank in responsiveness (range)

Physical composite score (PFC) (based on SF-36, SF-12, or SF-8)9 (60%)11 (52%)1.8 (1–3)
WOMAC function (100 mm scale)8 (53%)13 (62%)1.5 (1-2)
WOMAC function (Likert scale)7 (47%)8 (38%)1.5 (1-2)
SF-36 (physical function PF subscale)7 (47%)8 (38%)1.8 (1-2)
HAQ (disability subscale)1 (7%)4 (19%)1.5 (1-2)
PDI (pain disability index)1 (7%)1 (5%)2.0 (2)
ASES (disability subscale)1 (7%)1 (5%)2.0 (2)

WOMAC (Western Ontario and McMaster Universities Arthritis Index), VAS (visual analogue scale), NRS (Numerical Rating Scale), AIMS (Arthritis Impact Measurement Scale), ASES (Arthritis Self Efficacy Scale), HAQ (Health Assessment Questionnaire), PDI (Pain Disability Index), SES (Schmerzempfindungsskala) (Pain Experience Scale), SF-36 (Short Form 36), SF-12 (Short Form 12), SF-8 (Short Form 8).

The most responsive PRO measure for pain was “pain during activity” using a VAS with a mean rank of 1.4. It was used when comparing the effect of an intervention in 12 trials (comparing 18 interventions with controls), and it was the most responsive outcome in 14 of these 18 substudies. The most responsive composite score for pain was the WOMACpain” subscale using a Likert scale with a mean rank of 1.8. It was the most responsive outcome in 7 out of 20 substudies and the most responsive composite score in 19 out of 20 substudies. The most responsive PRO on disability was WOMAC “function” subscale using a 100 mm scale with a mean rank of 1.4. It was the most responsive outcome in 7 out of 13 substudies.

3.5. Synthesis of Results

According to the Food and Drug Administration (FDA), a single item may be reasonable for concepts such as pain severity, if it is a reliable and valid measure, but not for general concepts such as disability [56]. Different single item VAS scores were actually measuring different items, as they were based on different questions. A list based on the responsiveness of the PROs was constructed, placing the most responsive of the PROs as first choice for extracting data for meta-analyses, and PROs based on a single item being downgraded as they were derived from different instruments, whose validity and reliability had not been established. The final prioritised list for preferred PROs to extract when performing meta-analyses is presented in Table 3.
Table 3

A prioritized list of patient-reported outcomes for extracting data in meta-analysis.

Pain
outcome (scale)
(1) WOMAC pain subscale (Likert/100 mm)
(2) Pain during activity (VAS)
(3) Pain during walking (VAS)
(4) Global knee pain (VAS)
(5) Pain at rest (VAS)
(6) SF-36 (bodily pain (BP) subscale)
(7) HAQ (pain subscale), Lequesne algofunctional index (pain subscale), AIMS (pain subscale), Knee-Specific Pain Scale (KSPS), McGill Pain Questionnaire (pain intensity), ASES (pain subscale), SES (Schmerzempfindungsskala)
(8) Pain at night (VAS), pain during activity (NRS), pain on walking (NRS), number of painful days (days)

Disability
outcome (scale)

(1) WOMAC subscale function (Likert/100 mm)
(2) SF-36 (subscale physical function (PF))
(3) Physical composite score (PCS) based on SF-36, SF-12, or SF-8
(4) HAQ (disability subscale), PDI (pain disability index), ASES (disability subscale)

WOMAC (Western Ontario and McMaster Universities Arthritis Index), VAS (visual analogue scale), NRS (Numerical Rating Scale), AIMS (Arthritis impact measurement scale), ASES (Arthritis Self Efficacy Scale), HAQ (health assessment questionnaire), PDI (Pain Disability Index), SES (Schmerzempfindungsskala) (Pain Experience Scale), SF-36 (Short Form 36), SF-12 (Short Form 12), SF-8 (Short Form 8).

3.6. Risk of Bias across Studies

The likelihood of overestimating the pooled effect in meta-analyses by choosing the most favourable PROs for the intervention in each trial is illustrated in Figure 2. Compared with the developed list, consistently choosing the most favourable PROs overestimated the effect size for pain with SMD = 0.14 (95% CI 0.02, 0.26) and for disability with SMD = 0.09 (95% CI –0.03, 0.21). The differences between (1) the developed list, (2) the most frequently used PROs, and (3) the most responsive of the PROs were smaller. The list developed in this study seems to be a conservative but robust approach for extracting PROs when planning a meta-analysis.
Figure 2

Impact on different approaches to extracting data for meta-analysis.

3.7. Additional Analysis

The subgroup analysis stratifying the available trials according to risk of selective outcome bias was applied, and a list based on trials with a low risk of selective outcome reporting was developed (i.e., trials with a published protocol and evaluated as “adequate” in the risk of bias assessment). All 6 trials with a low risk of selective outcome reporting had measured pain, but only 3 had measured disability. Even though some outcomes were missing in the list based on these 6 trials with low risk of selective outcome reporting bias compared with the list based on all the included 38 trials, the differences between the two lists were very small (data not shown). When stratifying the included trials according to the interventions; injection in the knee joint, oral medication and other interventions (tai chi, lateral wedge shoes etc.) no differences in the responsiveness of the most frequently used outcomes (a least 5 times) were found between the “injection subgroup”, the “other interventions subgroup” and all the included trials. In the “oral medication subgroup” the five most responsive outcomes (used a least 5 times) were “Pain walking”, “Pain global”, WOMAC (100 mm scale), “Pain at rest” and WOMAC (likert scale). When compared this to the list of outcomes from all the included trials “Pain activity” and SF-36 is missing as they were not used as frequently in the “oral medication subgroup” and the rank of the last three later were WOMAC (likert scale), WOMAC (100 mm scale) and “Pain at rest” but the responsiveness for these three outcomes was more or less the same. No differences in sensitivity analysis based on intervention were seen in the disability list. Other sensitivity analyses based on whether the included trials were published in the general/internal medicine journals or in the rheumatology journals showed no differences between lists based on mean rank in these subgroups and the list of all trials neither for the analysis of disability nor for pain.

4. Discussion

4.1. Summary of Evidence

Biased selection of PROs in meta-analyses (e.g., choosing the most favourable PROs for the intervention from individual trials) can overestimate the effect compared with a systematic approach. As anticipated, choosing the “most favourable” outcome from each individual trial was more positive than systematic approaches based on either (1) how frequently the PROs were used, (2) the average responsiveness of the PROs, or (3) the developed list. Using a prioritised list as developed in this study is recommended to reduce the risk of biased selection of PROs in meta-analyses. When comparing the list developed in this study with the hierarchy for extracting pain measurement scales published by Jüni et al. [11], the main differences are that Jüni et al. included global health outcome (as the total WOMAC score, patient's global health and physician's global health), contrasting with the developed list in this study, which only included specific pain measurement scales. The list used by Bartels et al. [12] was based on consensus among the authors and reports the WOMAC subscales as first choice in both pain and disability, similar to the list developed in this study.

4.2. Limitations

This study has some limitations. First of all, even though 38 trials were included, and 35 trials had sufficient data on pain, only 15 trials had sufficient data on disability. The literature search was performed in the ten highest impact factor general and internal medicine journals as well as the ten highest impact factor rheumatology journals in order to identify trials with a low risk of selective reporting bias. Some of the highly ranked journals were primarily review journals. Even though these journals only published a few randomised controlled trials, some of the eligible trials were published in one of these journals. Only six trials were classified as “adequate” for having a low risk of selective reporting bias, and some frequently used PROs were missing in the list based on these trials. However, as the differences between the list based on these trials with a low risk of selective outcome reporting bias and the list based on all the included 38 trials were very small, the developed list seems to be trustworthy. Furthermore, stratified analyses comparing lists based on the subgroups of trials and the list from all included trials showed only small discrepancies for the subgroup of “oral medication” using pain as outcome. No differences were seen in other subgroup analyses. As the subgroup analyses based on whether the trials were published in the general/internal medicine journals or in the rheumatology journals showed no differences between the list from the subgroup and the list from all included trials, it seems unlikely that including more trials will change the list of the most frequently used outcome. So, even though including more journals in the literature search could be preferable, based on these subgroup analyses, it is unlikely that including more trials would change the list developed in this study. Secondly, combining the SMDs across different populations and interventions can cause heterogeneity, as characteristics of the patient populations (age, sex, BMI, etc.) and interventions have an impact on the effect size. A large SMD can firstly be due to a large difference in mean change between the intervention group and control group, or secondly, to a small standard deviation due to small variability in the included patient group. Combining SMDs across trials could then cause bias if trials with a homogeneous patient group (small SD) were combined with trials with a heterogeneous patient group (large SD). These differences are reduced when using the rank of the SMD instead, when comparing across trials. In developing the prioritised list for extracting PROs in this study, the estimated SMDs were then only used for ranking the PROs in each trial. When the variability in a trial was small due to a homogeneous patient group, the variability in all PROs in the trial was then expected to be equally small with the rank of the SMD still being an acceptable measure of the relationship of the responsiveness between the PROs in individual trials. Thirdly, as a large number of different PROs were used and most of the included trials only compared two PROs on either pain or disability, it was not possible to make direct comparisons of the responsiveness of PROs. Therefore, the indirect method of ranking the PROs in the individual trials according to responsiveness was used. This method of ranking the PROs reduces the impact of the differences in populations and interventions between the included trials, as it only used the SMD for ranking the responsiveness of the PROs. As the mean rank of the PROs could change if more trials were included, especially the PROs only used in a few trials, the systematically developed list in this study was based on the most frequently used PROs (used in at least 5 trials).

4.3. Strengths

A strength of this study was that a comprehensive systematic literature search in high-quality journals was performed, and a systematic approach was used for developing the prioritised list. Furthermore, the impact of using the developed list for data extraction in meta-analyses was analysed. Finally, the prioritised list developed in this study was compared with a list based on the six trials with a low risk of selective outcome reporting bias, and the differences were small. To summarise, the WOMAC subscales for both pain and disability should be the first choice for extracting PROs in meta-analyses for patients with knee OA. Different single item VAS scores were actually measuring different items (e.g., VAS during activity covers “pain during daily activity,” “pain following stepping activities,” or “pain during worst activity”), so, even though different single item VAS scores for measuring pain were more responsive than the WOMAC, WOMAC is preferred for meta-analysis. The Knee Injury and Osteoarthritis Outcome Score (KOOS) “ADL” subscale is equivalent to the WOMAC “function” subscale, and the WOMACpain” score is contained in the KOOS “pain” score. As the corresponding KOOS subscales and WOMAC subscales have shown equal responsiveness [57], the KOOS subscales could be included in the developed list. Based on these preliminary results and considerations, we recommend the list presented in Table 3, when extracting PROs on pain and disability for meta-analysis.

5. Conclusions

As choosing the most favorable patient-reported outcomes (PROs) from individual trials can overestimate the effect compared with a systematic approach, using a prioritised list as presented in this study is recommended to reduce reviewers' likelihood of biased selection of PROs in meta-analyses. The impact of the prioritised list should be tested in published meta-analyses investigating the effect of interventions on pain and disability for patients with knee osteoarthritis. When a larger number of trials are registered and classified as “adequate” for having a low risk of selective reporting outcome bias, this study should be repeated in order to check the prioritised list based on the responsiveness of PROs.
  54 in total

Review 1.  Osteoarthritis: rational approach to treating the individual.

Authors:  Peter Jüni; Stephan Reichenbach; Paul Dieppe
Journal:  Best Pract Res Clin Rheumatol       Date:  2006-08       Impact factor: 4.098

2.  Effect of preoperative exercise on measures of functional status in men and women undergoing total hip and knee arthroplasty.

Authors:  Daniel S Rooks; Jie Huang; Benjamin E Bierbaum; Sarah A Bolus; James Rubano; Christine E Connolly; Sandra Alpert; Maura D Iversen; Jeffrey N Katz
Journal:  Arthritis Rheum       Date:  2006-10-15

3.  Acupuncture in patients with osteoarthritis of the knee: a randomised trial.

Authors:  C Witt; B Brinkhaus; S Jena; K Linde; A Streng; S Wagenpfeil; J Hummelsberger; H U Walther; D Melchart; S N Willich
Journal:  Lancet       Date:  2005 Jul 9-15       Impact factor: 79.321

4.  A randomized, double-blind, placebo-controlled trial of glucosamine sulphate as an analgesic in osteoarthritis of the knee.

Authors:  R Hughes; A Carr
Journal:  Rheumatology (Oxford)       Date:  2002-03       Impact factor: 7.580

5.  Effectiveness and safety of repeat courses of hylan G-F 20 in patients with knee osteoarthritis.

Authors:  J P Raynauld; C H Goldsmith; N Bellamy; G W Torrance; R Polisson; D Belovich; D Pericak; P Tugwell
Journal:  Osteoarthritis Cartilage       Date:  2005-02       Impact factor: 6.576

6.  Glucosamine, chondroitin sulfate, and the two in combination for painful knee osteoarthritis.

Authors:  Daniel O Clegg; Domenic J Reda; Crystal L Harris; Marguerite A Klein; James R O'Dell; Michele M Hooper; John D Bradley; Clifton O Bingham; Michael H Weisman; Christopher G Jackson; Nancy E Lane; John J Cush; Larry W Moreland; H Ralph Schumacher; Chester V Oddis; Frederick Wolfe; Jerry A Molitor; David E Yocum; Thomas J Schnitzer; Daniel E Furst; Allen D Sawitzke; Helen Shi; Kenneth D Brandt; Roland W Moskowitz; H James Williams
Journal:  N Engl J Med       Date:  2006-02-23       Impact factor: 91.245

7.  Contrast-enhanced ultrasound in monitoring the efficacy of a bradykinin receptor 2 antagonist in painful knee osteoarthritis compared with MRI.

Authors:  I H Song; C E Althoff; K G Hermann; A K Scheel; T Knetsch; G R Burmester; M Backhaus
Journal:  Ann Rheum Dis       Date:  2008-03-28       Impact factor: 19.103

8.  Comparing the efficacy of cyclooxygenase 2-specific inhibitors in treating osteoarthritis: appropriate trial design considerations and results of a randomized, placebo-controlled trial.

Authors:  Allan Gibofsky; Gary W Williams; Frank McKenna; John G Fort
Journal:  Arthritis Rheum       Date:  2003-11

9.  ARTIST (osteoarthritis intervention standardized) study of standardised consultation versus usual care for patients with osteoarthritis of the knee in primary care in France: pragmatic randomised controlled trial.

Authors:  P Ravaud; R-M Flipo; I Boutron; C Roy; A Mahmoudi; B Giraudeau; T Pham
Journal:  BMJ       Date:  2009-02-23

10.  A double blind, randomized, placebo controlled study of the efficacy and safety of 5-Loxin for treatment of osteoarthritis of the knee.

Authors:  Krishanu Sengupta; Krishnaraju V Alluri; Andey Rama Satish; Simanchala Mishra; Trimurtulu Golakoti; Kadainti Vs Sarma; Dipak Dey; Siba P Raychaudhuri
Journal:  Arthritis Res Ther       Date:  2008-07-30       Impact factor: 5.156

View more
  33 in total

1.  Do clinicians understand the size of treatment effects? A randomized survey across 8 countries.

Authors:  Bradley C Johnston; Pablo Alonso-Coello; Jan O Friedrich; Reem A Mustafa; Kari A O Tikkinen; Ignacio Neumann; Per O Vandvik; Elie A Akl; Bruno R da Costa; Neill K Adhikari; Gemma Mas Dalmau; Elise Kosunen; Jukka Mustonen; Mark W Crawford; Lehana Thabane; Gordon H Guyatt
Journal:  CMAJ       Date:  2015-10-26       Impact factor: 8.262

2.  Paracetamol versus placebo for knee and hip osteoarthritis.

Authors:  Amanda O Leopoldino; Gustavo C Machado; Paulo H Ferreira; Marina B Pinheiro; Richard Day; Andrew J McLachlan; David J Hunter; Manuela L Ferreira
Journal:  Cochrane Database Syst Rev       Date:  2019-02-25

3.  Rapid Progression of Knee Pain and Osteoarthritis Biomarkers Greatest for Patients with Combined Obesity and Depression: Data from the Osteoarthritis Initiative.

Authors:  Cale A Jacobs; Ana-Maria Vranceanu; Katherine L Thompson; Christian Lattermann
Journal:  Cartilage       Date:  2018-06-01       Impact factor: 4.634

4.  Adaptation of the osteoarthritis-specific quality of life scale (the OAQoL) for use in Germany, Hungary, Italy, Spain and Turkey.

Authors:  Jeanette Wilburn; Stephen P McKenna; Şehim Kutlay; Tamas Bender; Jürgen Braun; Concepcion Castillo-Gallego; Marta Favero; Pal Geher; Uta Kiltz; Emilio Martin-Mola; Roberta Ramonda; Matthew Rouse; Alan Tennant; Ayşe A Küçükdeveci
Journal:  Rheumatol Int       Date:  2017-02-15       Impact factor: 2.631

Review 5.  Ayurvedic interventions for osteoarthritis: a systematic review and meta-analysis.

Authors:  Christian S Kessler; Lea Pinders; Andreas Michalsen; Holger Cramer
Journal:  Rheumatol Int       Date:  2014-07-26       Impact factor: 2.631

6.  Responsiveness of SF-36 Health Survey and Patient Generated Index in people with chronic knee pain commenced on oral analgesia: analysis of data from a randomised controlled clinical trial.

Authors:  Alex Papou; Salma Hussain; Daniel McWilliams; Weiya Zhang; Michael Doherty
Journal:  Qual Life Res       Date:  2016-12-19       Impact factor: 4.147

Review 7.  Arthroscopic surgery for degenerative knee disease (osteoarthritis including degenerative meniscal tears).

Authors:  Denise O'Connor; Renea V Johnston; Romina Brignardello-Petersen; Rudolf W Poolman; Sheila Cyril; Per O Vandvik; Rachelle Buchbinder
Journal:  Cochrane Database Syst Rev       Date:  2022-03-03

8.  Efficacy and safety of intra-articular therapies in rheumatic and musculoskeletal diseases: an overview of systematic reviews.

Authors:  Sebastián Cruz Rodriguez-García; Raul Castellanos-Moreira; Jacqueline Uson; Esperanza Naredo; Terence W O'Neill; Michael Doherty; Mikael Boesen; Hemant Pandit; Ingrid Möller Parera; Valentina Vardanyan; Lene Terslev; Will Uwe Kampen; Maria Antonieta D'Agostino; Francis Berenbaum; Elena Nikiphorou; Irene Pitsillidou; Jenny de la Torre-Aboki; Loreto Carmona
Journal:  RMD Open       Date:  2021-06

9.  A Comparison of Treatment Effects for Nonsurgical Therapies and the Minimum Clinically Important Difference in Knee Osteoarthritis: A Systematic Review.

Authors:  Andrew Concoff; Jeffrey Rosen; Freddie Fu; Mohit Bhandari; Kevin Boyer; Jon Karlsson; Thomas A Einhorn; Emil Schemitsch
Journal:  JBJS Rev       Date:  2019-08

10.  Tourniquet use for knee replacement surgery.

Authors:  Imran Ahmed; Amit Chawla; Martin Underwood; Andrew J Price; Andrew Metcalfe; Charles Hutchinson; Jane Warwick; Kate Seers; Helen Parsons; Peter Dh Wall
Journal:  Cochrane Database Syst Rev       Date:  2020-12-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.