Literature DB >> 28114993

An equivalence study: Are patient-completed and telephone interview equivalent modes of administration for the EuroQol survey?

R Chatterji1,2, J M Naylor3,4, I A Harris3,4,5, E Armstrong3,5, E Davidson6, R Ekmejian3, J Descallar3,7.   

Abstract

BACKGROUND: To determine if the EuroQol Health Related Quality of Life survey produces equivalent results when administered by phone interview or patient-completed forms.
METHODS: People awaiting hip or knee arthroplasty at a major metropolitan hospital participated. They were randomly assigned to receive the EuroQol Health Related Quality of Life survey via telephone, followed by a patient completed form 1 week later, or vice versa. Equivalence was determined using two one-sided tests (TOST) based on minimal clinically-important differences for the visual analogue scale (VAS) and the summary Utility Index. Cohen's Kappa scores were computed to determine agreement for the individual EuroQoL Likert scale items.
RESULTS: Seventy-six from 90 (84%) participants completed the survey twice. Based on limits set at ±7 and ±0.11 for the VAS and Utility Index, respectively, equivalence was established between the two methods of administration for both the VAS (mean difference 0.05 [90% CI -3.76-3.67]) and the Utility Index (mean difference 0.06 [90% CI 0.02-0.11]). Varying levels of agreement, ranging from slight to substantial (κ = 0.17-0.67), were demonstrated for the individual health domains. The order of telephone and patient-completed survey administration had no significant effect on results.
CONCLUSIONS: Equivalent results are obtained between telephone and patient-completed administration for the VAS and Utility Index of the EuroQol Survey in people with advanced hip or knee osteoarthritis. The limits of agreement for the individual health domains vary which prevents the accurate interpretation of real change in these items across modes.

Entities:  

Keywords:  Equivalence; Health related quality of life; Hip arthroplasty; Knee arthroplasty; Mode of administration; Questionnaire; Telephone

Mesh:

Year:  2017        PMID: 28114993      PMCID: PMC5259885          DOI: 10.1186/s12955-017-0596-x

Source DB:  PubMed          Journal:  Health Qual Life Outcomes        ISSN: 1477-7525            Impact factor:   3.186


Background

Health-related quality of life (HRQoL) is a key consideration in hip and knee arthroplasty. Pre-surgically, low HRQoL is one of the key drivers for hip or knee arthroplasty [1]. Post-surgically, hip and knee arthroplasties improve HRQoL substantially with large effect sizes evident in both the long and short term [2]. Together, these factors justify the use of HRQoL as an assessment tool in determining suitability for surgery and evaluating the success of surgery. HRQoL is typically measured through a structured questionnaire. The Scientific Advisory Committee of the Medical Outcomes Trust has stated that these surveys should be evaluated according to eight psychometric criteria [3]: Conceptual and measurement model Reliability Validity Responsiveness Interpretability Respondent and administrative burden Modes of administration (MOA) Cultural and language adaptations There are several HRQoL surveys currently being used to monitor HRQoL in hip and knee arthroplasty cohorts including the Short Form-36 and the Western Ontario and McMaster University Osteoarthritis Index [4]. Relevant to several arthroplasty clinical outcome registries (e.g. Arthoplasty Clinical Outcomes Registry, National; United Kingdom Patient Reported Outcome Measures; Swedish Hip Registry) is the EuroQol visual analogue scale (VAS) for joint pain and the associated Utility Index based on a 5-dimension health survey [5-7]. The EuroQol survey measures global patient-perceived health-related quality of life out of 100 on a 20 cm visual analogue scale (VAS). In addition, it measures HRQoL across five domains: mobility; personal care; usual activities; pain or discomfort; and anxiety or depression. In the 5 L version of the EuroQol Survey, each of the five domains is graded on a five-point scale ranging from no problems (1 point), slight (2 points), moderate (3 points), severe (4 points) or extreme/unable to perform (5 points). The resultant health state, represented by a five-digit profile, is then expressed as a Utility Index based on value sets derived for individual countries [8]. It is one of the most widely used HRQoL surveys internationally with translations available in 119 languages. The current literature includes limited psychometric evaluation of the EuroQol survey while mode of administration has received little attention overall. Self-administered and interview-administered EuroQol questionnaires have been established to yield comparable results [9, 10]. Telephone and interview-administered questionnaires were similarly comparable [11]. Telephone administration has also been associated with better follow up and patient-reported scores as compared to mailed surveys [12, 13]. Notably, these studies were in cohorts of acute hospital admission, AIDS, rheumatoid arthritis, heart failure and cataract patients but not hip or knee arthroplasty patients; a group in which the EuroQol survey is commonly used. Furthermore, the telephone and self-completed modes of the EuroQol survey have not been directly compared. The aim of this study was to determine whether the EuroQol survey provides equivalent responses for the VAS and Utility Index for the 5 EuroQol HRQoL domains given by telephone interview compared to paper-based form completion by the patient in an osteoarthritis cohort awaiting arthroplasty surgery.

Methods

Design and participants

Recruitment was conducted at a metropolitan teaching hospital (Nepean Hospital) in Sydney, Australia. The hospital is considered a high-volume arthroplasty service provider, performing an annual load of 290 hip and knee arthroplasty procedures at the time of the study. Patients 18 years of age or older and proficient in English were selected from the waiting list for primary knee or hip arthroplasty and invited to participate between May and August 2014. Exclusion criteria were patients scheduled to undergo arthroplasty less than a week after the earliest possible initial survey, or having joint replacement due to a fracture. Signed consent was obtained from all patients agreeing to participate. Recruitment was undertaken by physiotherapists experienced in the administration of the EuroQol survey. The study was approved by the institution’s human research ethics committee.

Survey completion protocol

After agreeing to participate, each patient was administered the EuroQol survey via telephone and via a written, patient-completed form 1 week apart. The telephone interviews were scripted in a manner which replicated the original survey text and was performed by two trained personnel. As the visual analogue scale was not feasible over the telephone, a standardised verbal instruction was used to obtain a score of 0–100 and best replicate the effect of the scale. In contrast, collecting patient-completed responses via telephone did not refer to the survey text and was kept neutral by asking for their answer to the respective question number. The 1-week interval between surveys was chosen as it balanced minimising the amount of time for the patient’s health status to change with allowing sufficient time that they would not be able to simply recall their previous answers [14]. To control for the effects of order of administration, patients were randomly allocated to completing the telephone or written survey first. A computer-generated sequence was provided by a researcher not involved in participant recruitment and allocation concealment was achieved by using consecutively numbered sealed envelopes, each containing the allocated order, which were opened individually upon each successful recruitment. Those in the patient-completed first group filled in the survey unassisted at the clinic on the day of enrolment. A convenient time to call 1 week later was then arranged for the follow-up telephone survey. Those in the telephone first group received a telephone survey within days of providing consent to participate. The EuroQol survey was mailed 4–5 days after the phone interview so patients would receive it approximately 1 week after the initial survey, along with a telephone call for data-collection. Confirmation that they had hand-completed the survey was obtained prior to data-collection. Participants who had not completed their survey were asked to complete it on the spot. This method of survey return was chosen as it was intended to increase data completion by not relying on patients to post the completed paper surveys which has been associated with significant loss to follow-up [15].

Statistical analysis

Ideally, the limits of variation in scores between the two modes of administration should be less than what is perceived to be the minimum important difference (MID) for changes in the VAS scale and the Utility Index, or equivalent to test-retest variation between the same MOA. In the absence of data indicating the MID for these indices for patients with osteoarthritis, we used data from a previously published study identifying the MID in a cohort of cancer patients [16]. A sample size of 58 people would be sufficient to find equivalence within ±7 for the VAS score, assuming a standard deviation of 20, correlation of 0.6 in responses between modes of administration, a significance threshold of 0.05 and 80% power. A sample of 29 participants would be sufficient to find equivalence within ± 0.11 for the Utility Index, assuming a standard deviation of 0.22, correlation of 0.6 in responses between modes of administration, a significance threshold of 0.05 and 80% power. Allowing for a loss to follow-up of 25%, the minimum sample required was 80. The EuroQol questionnaire includes both categorical and continuous data. Continuous data from the VAS and summary Utility Index were subject to the Two One-Sided Tests (TOST) for equivalence. Equivalence bounds for the VAS and Utility Index were set at ±7 and ±0.11, respectively, in accordance with the EuroQol MIDs as stated above [16]. Significance for equivalence was set at p < 0.05. As the utility value set for Australia is yet to be determined, the United Kingdom value set was used for this study. Use of the UK values may affect the average calculated index values, but not affect the extent the two modes of administration are equivalent [17]. Bland-Altman plots of the data were also produced to illustrate the 95% limits of agreement (LOA) for both the Utility Index and VAS. These plots illustrate the range over which 95% of the paired data (scores from one method versus the other) vary in absolute terms [18]. Agreement between the categorical data was tested using Cohen’s kappa coefficient. The weighted Kappa scores used in the following analysis further distinguish between agreements or disagreements of varying gravity. This is achieved by weighting them differently to incorporate ratio-scaled degrees of agreement or disagreement [19]. Kappa scores range from −1 to 1, with higher scores indicating greater agreement. The typical kappa cut-offs are as follows [20]: ≤0: less than chance agreement 0.01–0.20: slight agreement 0.21–.040: fair agreement 0.41–0.60: moderate agreement 0.61–0.80: substantial agreement 0.81–1.00: almost perfect agreement To determine the success of the randomisation process, we conducted chi-squared and two sample t-tests of patient characteristics between the groups subject to different orders of administration. A two sample t-test was used to determine if differences between modes was associated with order of administration. The data analysis was generated using SAS Enterprise Guide Software, Version 6.1 of the SAS System for Windows (Cary, NC). Participants awaiting knee or hip arthroplasty were analysed together as the condition of interest was severe osteoarthritis and not the specific joint.

Results

Ninety-three participants were screened for enrolment, three declined to participate and 15.6% (n = 14; 7 from each group) were lost to follow up, resulting in 76 complete datasets. There were no significant differences in patient characteristics between the telephone first and patient-completed first groups based on gender, age, joint type and side (Table 1). Follow-up interval (mean = 8 days, median = 7 days) and rates of loss to follow up were also not significantly different between the two orders of administration.
Table 1

Characteristics, mean telephone scores (T) and mean patient-completed scores (P) of the telephone-first group, patient-completed-first group and cumulatively

Telephone FirstPatient-Completed firstTotal P-value
Gender
 Female20 (57%)21 (43%)490.15
 Male10 (37%)17 (63%)27
Age (Years)
 Mean6868680.98
 Median696768
Joint
 Knee26 (53%)23 (47%)490.63
 Hip12 (44%)15 (56%)27
Side
 Left20 (48%)22 (52%)420.70
 Right15 (50%)15 (50%)30
 Bilateral3 (75%)1 (25%)4
Follow-Up Interval
 Days
  Mean (SD)7.6 (1.3)8.2 (1.6)7.9 (1.5)0.08
  Median (IQR)7 (2)7 (3)7 (1)
 Mean EuroQol Scores
  MobilityT 2.912.992.96
  MobilityP 2.893.453.17
  Personal CareT 1.711.951.87
  Personal CareP 1.802.352.08
  Usual ActivitiesT 2.742.952.83
  Usual ActivitiesP 2.743.172.97
  Pain & DiscomfortT 3.173.303.25
  Pain & DiscomfortP 3.143.743.46
  Anxiety & DepressionT 1.571.691.67
  Anxiety & DepressionP 1.912.051.99
  VAS ScaleT 63.8558.1860.29
  VAS ScaleP 59.9460.9460.34
Characteristics, mean telephone scores (T) and mean patient-completed scores (P) of the telephone-first group, patient-completed-first group and cumulatively EQ-VAS scores were found to be equivalent between MOA (TOST p = 0.0013) within equivalence bounds of ±7. The VAS score was an average of 0.05 points lower (90% CI −3.76–3.67) when the survey was administered via telephone. Order of administration had no significant effect on differences in VAS scores between patient-completed and telephone survey modes (p = 0.20). Utility Index was found to be equivalent between MOA (TOST p = 0.035) within equivalence bounds of ±0.11. This was an average of 0.06 points higher (90% CI 0.02–0.11) when the survey was administered via telephone. Order of administration had no significant effect on differences in Utility Index between patient-completed and telephone survey modes (p = 0.20). The 95% limits of agreement for the Utility Index and visual analogue scale were relatively wide as shown in the Bland-Altman plot below (Fig. 1). This variation was consistent across the range of scores observed for both indices.
Fig. 1

Bland Altman plots demonstrating the limits of agreement for utility and VAS scores. The mean difference and 2 standard deviations from the mean are respectively indicated by the blue and grey lines

Bland Altman plots demonstrating the limits of agreement for utility and VAS scores. The mean difference and 2 standard deviations from the mean are respectively indicated by the blue and grey lines Weighted Cohen’s kappa coefficients reflected variable levels of agreement for the categorical data obtained via the two methods of administration. The personal care item exhibited substantial agreement (κ = 0.67) while the items for mobility and anxiety exhibited moderate agreement (κ = 0.45). Less agreement between the two MOA was found for the remaining items - usual activities (κ = 0.38) and pain (κ = 0.17).

Discussion

Our study demonstrated equivalence between the telephone and patient-completed modes of administration for both the VAS and Utility Index of the EuroQol survey. This indicates that the telephone and patient-completed administrations can be used interchangeably if necessary. However, agreement between the individual domain scores varied between slight agreement and substantial agreement; indicating that interpreting changes in the individual questions across time if different modes are used is not recommended. Interestingly, the between mode agreements observed here for the VAS and the Utility Index are similar to the week-to-week (test-retest) agreements of the VAS and Utility Index when completed by people with osteoarthritis using the same mode (patient-completed). We observed, testing people awaiting arthroplasty twice across a 1-week period, that the 95% LOAs for the VAS and Utility Index to be ± 29.4 and 0.4 respectively (unpublished data). Thus, it appears that the differences observed when changing between modes of the EuroQol are of similar magnitude to the differences observed when testing stability of the same method of administration over a short time span. Establishing equivalence between modes permits survey administration through different modes without adjustment for the mode of administration. Furthermore, data collected by different but equivalent modes in different cohorts could be pooled for analysis. This broadens the scope for systematic reviews to compare studies that may have used different data collection methods. The strengths and limitations of the study are acknowledged. We used a well-defined, homogenous sample of sufficient size to detect a difference as small as the reported MIDs for the VAS and Utility Index and the order of administration was randomised. The use of MID values based on those obtained using a cancer cohort may not be applicable to our cohort but MID values obtained using an orthopaedic cohort were unavailable at the time of the study.

Conclusion

In conclusion, this study found that the summary Utility Index and VAS scales of the EuroQol survey were equivalent when obtained either by telephone or via a patient-administered mode. Despite equivalence in these components, greater variation is seen in the individual HRQoL items; indicating there could be no meaningful and accurate interpretation of change in the individual questions across time if different modes are used.
  15 in total

Review 1.  Measuring agreement in method comparison studies.

Authors:  J M Bland; D G Altman
Journal:  Stat Methods Med Res       Date:  1999-06       Impact factor: 3.021

Review 2.  Assessing health status and quality-of-life instruments: attributes and review criteria.

Authors:  Neil Aaronson; Jordi Alonso; Audrey Burnam; Kathleen N Lohr; Donald L Patrick; Edward Perrin; Ruth E Stein
Journal:  Qual Life Res       Date:  2002-05       Impact factor: 4.147

Review 3.  The kappa statistic in reliability studies: use, interpretation, and sample size requirements.

Authors:  Julius Sim; Chris C Wright
Journal:  Phys Ther       Date:  2005-03

Review 4.  Mode of questionnaire administration can have serious effects on data quality.

Authors:  Ann Bowling
Journal:  J Public Health (Oxf)       Date:  2005-05-03       Impact factor: 2.341

5.  Quality criteria were proposed for measurement properties of health status questionnaires.

Authors:  Caroline B Terwee; Sandra D M Bot; Michael R de Boer; Daniëlle A W M van der Windt; Dirk L Knol; Joost Dekker; Lex M Bouter; Henrica C W de Vet
Journal:  J Clin Epidemiol       Date:  2006-08-24       Impact factor: 6.437

Review 6.  Review of utility values for economic modeling in type 2 diabetes.

Authors:  Amélie Beaudet; John Clegg; Per-Olof Thuresson; Adam Lloyd; Phil McEwan
Journal:  Value Health       Date:  2014-05-17       Impact factor: 5.725

Review 7.  Health-related quality of life in total hip and total knee arthroplasty. A qualitative and systematic review of the literature.

Authors:  Olivier Ethgen; Olivier Bruyère; Florent Richy; Charles Dardennes; Jean-Yves Reginster
Journal:  J Bone Joint Surg Am       Date:  2004-05       Impact factor: 5.284

8.  Disability and quality of life of patients with knee or hip osteoarthritis in the primary care setting and factors associated with general practitioners' indication for prosthetic replacement within 1 year.

Authors:  I Boutron; F Rannou; M Jardinaud-Lopez; G Meric; M Revel; S Poiraudeau
Journal:  Osteoarthritis Cartilage       Date:  2008-02-13       Impact factor: 6.576

9.  Effects of mode and order of administration on generic health-related quality of life scores.

Authors:  Ron D Hays; Seongeun Kim; Karen L Spritzer; Robert M Kaplan; Steve Tally; David Feeny; Honghu Liu; Dennis G Fryback
Journal:  Value Health       Date:  2009-05-15       Impact factor: 5.725

10.  Interviewer versus self-administered health-related quality of life questionnaires - does it matter?

Authors:  Milo A Puhan; Alka Ahuja; Mark L Van Natta; Lori E Ackatz; Curtis Meinert
Journal:  Health Qual Life Outcomes       Date:  2011-05-10       Impact factor: 3.186

View more
  12 in total

1.  Responsiveness of the EuroQoL 5-dimension (EQ-5D) in adolescent idiopathic scoliosis.

Authors:  Prudence Wing Hang Cheung; Carlos King Ho Wong; Sin Ting Lau; Jason Pui Yin Cheung
Journal:  Eur Spine J       Date:  2017-10-09       Impact factor: 3.134

2.  Associations between pre-surgical daily opioid use and short-term outcomes following knee or hip arthroplasty: a prospective, exploratory cohort study.

Authors:  Justine M Naylor; Natalie Pavlovic; Melissa Farrugia; Shaniya Ogul; Danella Hackett; Anthony Wan; Sam Adie; Bernadette Brady; Leeanne Gray; Rachael Wright; Michelle Nazar; Wei Xuan
Journal:  BMC Musculoskelet Disord       Date:  2020-06-22       Impact factor: 2.362

3.  Is retrospective assessment of health-related quality of life valid?

Authors:  Andrew Lawson; Aidan C Tan; Justine Naylor; Ian A Harris
Journal:  BMC Musculoskelet Disord       Date:  2020-06-30       Impact factor: 2.362

4.  The effectiveness of inpatient rehabilitation after uncomplicated total hip arthroplasty: a propensity score matched cohort.

Authors:  Justine M Naylor; Andrew Hart; Rajat Mittal; Ian A Harris; Wei Xuan
Journal:  BMC Musculoskelet Disord       Date:  2018-07-18       Impact factor: 2.362

5.  Population norms for the EQ-5D-3L: a cross-country analysis of population surveys for 20 countries.

Authors:  M F Janssen; A Szende; J Cabases; J M Ramos-Goñi; G Vilagut; H H König
Journal:  Eur J Health Econ       Date:  2018-02-14

6.  An online intervention for improving stroke survivors' health-related quality of life: study protocol for a randomised controlled trial.

Authors:  Ashleigh Guillaumier; Sam McCrabb; Neil J Spratt; Michael Pollack; Amanda L Baker; Parker Magin; Alyna Turner; Christopher Oldmeadow; Clare Collins; Robin Callister; Chris Levi; Andrew Searles; Simon Deeming; Olivia Wynne; Alexandra M J Denham; Brigid Clancy; Billie Bonevski
Journal:  Trials       Date:  2019-08-09       Impact factor: 2.279

7.  Design and rationale of a routine clinical care pathway and prospective cohort study in older patients needing intensive treatment.

Authors:  Yara van Holstein; Floor J van Deudekom; Stella Trompet; Iris Postmus; Anna Uit den Boogaard; Marjan J T van der Elst; Nienke A de Glas; Diana van Heemst; Geert Labots; Mariëtte Altena; Marije Slingerland; Gerrit Jan Liefers; Frederiek van den Bos; Jessica M van der Bol; Gerard J Blauw; Johanneke E A Portielje; Simon P Mooijaart
Journal:  BMC Geriatr       Date:  2021-01-07       Impact factor: 3.921

8.  Economic Burden and Impact on Quality of Life of Herpes Zoster in Spanish Adults Aged 50 Years or Older: A Prospective Cohort Study.

Authors:  Javier Díez-Domingo; Desmond Curran; Maria Del Rosario Cambronero; Juan-Antonio Garcia-Martinez; Sean Matthews
Journal:  Adv Ther       Date:  2021-05-19       Impact factor: 3.845

9.  Are responders to patient health surveys representative of those invited to participate? An analysis of the Patient-Reported Outcome Measures Pilot from the Australian Orthopaedic Association National Joint Replacement Registry.

Authors:  Ian A Harris; Kara Cashman; Michelle Lorimer; Yi Peng; Ilana Ackerman; Emma Heath; Stephen E Graves
Journal:  PLoS One       Date:  2021-07-02       Impact factor: 3.240

10.  Implementing early mobilisation after knee or hip arthroplasty to reduce length of stay: a quality improvement study with embedded qualitative component.

Authors:  Happy Chua; Bernadette Brady; Melissa Farrugia; Natalie Pavlovic; Shaniya Ogul; Danella Hackett; Dimyana Farag; Anthony Wan; Sam Adie; Leeanne Gray; Michelle Nazar; Wei Xuan; Richard M Walker; Ian A Harris; Justine M Naylor
Journal:  BMC Musculoskelet Disord       Date:  2020-11-20       Impact factor: 2.362

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.