Literature DB >> 32398083

How is the minimal clinically important difference established in health-related quality of life instruments? Review of anchors and methods.

Yosra Mouelhi1, Elisabeth Jouve2, Christel Castelli3,4, Stéphanie Gentile5,6.   

Abstract

BACKGROUND: The aim of this systematic review is to describe the different types of anchors and statistical methods used in estimating the Minimal Clinically Important Difference (MCID) for Health-Related Quality of Life (HRQoL) instruments.
METHODS: PubMed and Google scholar were searched for English and French language studies published from 2010 to 2018 using selected keywords. We included original articles (reviews, meta-analysis, commentaries and research letters were not considered) that described anchors and statistical methods used to estimate the MCID in HRQoL instruments.
RESULTS: Forty-seven papers satisfied the inclusion criteria. The MCID was estimated for 6 generic and 18 disease-specific instruments. Most studies in our review used anchor-based methods (n = 41), either alone or in combination with distribution-based methods. The most common applied anchors were non-clinical, from the viewpoint of patients. Different statistical methods for anchor-based methods were applied and the Change Difference (CD) was the most used one. Most distributional methods included 0.2 standard deviations (SD), 0.3 SD, 0.5 SD and 1 standard error of measurement (SEM). MCID values were very variable depending on methods applied, and also on clinical context of the study.
CONCLUSION: Multiple anchors and methods were applied in the included studies, which lead to different estimations of MCID. Using several methods enables to assess the robustness of the results. This corresponds to a sensitivity analysis of the methods. Close collaboration between statisticians and clinicians is recommended to integrate an agreement regarding the appropriate method to determine MCID for a specific context.

Entities:  

Keywords:  Anchors-based methods; Distribution-based methods; Health-related-quality of life; Minimal clinically important difference

Year:  2020        PMID: 32398083      PMCID: PMC7218583          DOI: 10.1186/s12955-020-01344-w

Source DB:  PubMed          Journal:  Health Qual Life Outcomes        ISSN: 1477-7525            Impact factor:   3.186


Introduction

Health-Related Quality of Life (HRQoL), a multidimensional construct that assesses several domains (e.g., physical, emotional, social), is an important Patient-Reported Outcome (PRO) in clinical trials as well as in routine clinical practice, and such information is also used by health policy makers in health care resource allocation and reimbursement decisions [1-3]. A PRO is defined as “any report coming directly from patients about how they function or feel in relation to a health condition and its therapy” [4]. Interpretation of changes in HRQoL scores of Patient-Reported Outcomes (PROs) is a challenge to the meaningful application of PRO measures in patient-centered care and policy [5]. Numerous clinical trials have established the importance of HRQoL in various diseases, and it is increasingly popular to evaluate generic and disease-specific HRQoL in clinical trials as a measure of patients’ subjective state of health [5, 6]. To be clinically useful, HRQoL instruments must demonstrate psychometric properties such as validity, reliability and responsiveness to change [7, 8]. Responsiveness to change is important for instruments designed to measure change over time. However, the statistical significance of a change in HRQoL scores does not necessarily imply that it is also clinically relevant [9-12]. Indeed, health policy makers need to present clinically meaningful results, to determine if the treatment is beneficial or harmful to their patients and also to know how to interpret and implement those results in their evidence-based method for clinical decision making [13]. Interpretation of clinical outcomes therefore should not be based solely on the presence or absence of statistically significant differences [14]. This highlights the need to define the minimal change in score considered relevant by patients and physicians, called ‘the Minimal Clinically Important Difference (MCID)’. The MCID was first defined by Jaeschke [15] as ‘the smallest difference in score in the domain of interest which patients perceive as beneficial and which would mandate, in the absence of troublesome side effects and excessive cost, a change in the patient’s management’. MCID values are therefore important in interpreting the clinical relevance of observed changes, at both the individual and group levels. From the patient’s viewpoint, a meaningful change in HRQoL may be one that reflects a reduction in symptoms or improvement in function, however, a meaningful change for the physician may be one that indicates a change in the treatment or in the prognosis of the disease [16, 17]. Several methods have been developed, but no clear consensus exists regarding which methods are most suitable. An extensive review of available methods was published by Wells and colleagues and classified them into nine different methods [18]. Another review proposed three categories of methods for defining the MCID: distribution-based, opinion-based (relying upon experts) and anchor-based methods [19]. On one hand, anchor-based methods examine the relationship between a HRQoL measure with another measure of clinical change: the anchor [20]. Anchors can be derived from clinical outcomes (laboratory values, psychological measures, and clinical rating performance measures) or Patient-Reported Outcomes (PRO) (global health transition scale, patient’s self-reported evaluation of change) [20]. On the other hand, distribution-based methods use statistical properties of the distribution of outcome scores, particularly how the scores differ between patients. The distribution methods may use methods based on Standard Error of Measurement (SEM), Standard Deviation (SD), Effect Size (ES), Standardized Response Mean (SRM), Minimal Detectable Change (MDC), or Reliable Change Index (RCI) [20, 21]. The Delphi method has also been put forth in the literature. It involves the presentation of a questionnaire or interview to a panel of experts in a specific field for the purpose of obtaining a consensus [22]. The expert panel is provided with information on the results of a trial and are requested to provide their best estimate of the MCID. Their responses are averaged, and this summary is sent back with an invitation to revise their estimates. This process is continued until consensus is achieved [23]. To date, methods to determine MCID can be divided into two well-defined categories: distribution-based and anchor-based methods [20, 24–26]. These two methods are conceptually different. Distribution-based methods are the most used with a meaningful external anchor [20, 24–26]. Revicki et al. [21] recommended the usage of the anchor-based method to produce primary evidence for the MCID of any instrument and the distribution-based method to provide secondary or supportive evidence for that MCID. The interest in estimation of MCID for HRQoL instruments has been increasing in recent years, and several reviews focused on estimates of MCID [20, 27–29]. MCID values have been shown to differ by population and study context as well as choice of anchors. This variability highlights the need to understand how the MCID was statistically established and what kind of anchors have been used, in order to facilitate its application in the Quality of Life field. A systematic review was conducted to describe, from a structural literature search, the different types of anchors and statistical methods used in estimating the MCID for HRQoL instruments, either generic or disease-specific ones.

Materials and methods

Search strategy

A literature review was conducted in accordance with the preferred reporting items for systematic reviews and meta-analyses (PRISMA) [29]. To identify a large number of studies related to MCID, we performed a literature search on PubMed and Google scholar articles from 01 January 2010 to 31 December 2018 using the following request: (“MCID” OR “MID” OR “minimal clinically important difference” OR “minimal important difference” OR “minimal clinically important change” OR “clinically important change” OR “minimal clinical important difference” OR “clinical important difference” OR “meaningful change”) AND (“health related quality of life”). A grey literature review was also performed. We selected English and French language articles displaying an abstract and having included studies which (1) were original articles (i.e. reviews, meta-analysis, commentaries and research letters were not considered), (2) described anchors and statistical methods used to estimate the MCID in HRQoL instruments. We did not select the literature reviews, considered as secondary research articles, but we used the references of these reviews to search for other pertinent articles. Two authors (YM and EJ) independently screened the study based on titles and abstracts. Then, authors (YM and EJ) obtained the selected full texts and read them to determine eligibility, and finally, the references in each of the retained articles were reviewed by YM and EJ for other relevant articles that might have been missed in the initial research.

Data extraction and evaluation

For each included article, we collected data about: - The year of publication, - The study design. Four types were identified: • Prospective; • Retrospective; • Cross-sectional; • Clinical trials. – The sample size (N); – The disease; – The HRQoL instrument: number of instruments used/subscale, generic and/or disease-specific; – MCID estimation method: anchor and/or distribution, number of anchors, kind (subjective or clinical), cutoffs used, statistical methods, distribution criteria; – The MCID value/range of each HRQoL instrument for each study. The methodological quality of the included studies was independently assessed by two authors and disagreements were resolved by discussion. Articles that met eligibility criteria were grouped according to different clinical treatment areas. We then assessed MCID anchors and calculation methods, developed tables to display questionnaire names, calculation methods and type of anchors, MCID values by generic and disease-specific questionnaire.

Results

Selection process and general characteristics of included studies

The literature search identified 695 articles via PubMed, and 119 more articles were added with complementary research. After the selection process, this literature review included 47 articles (Fig. 1).
Fig. 1

PRISMA Flow diagram of the literature search

PRISMA Flow diagram of the literature search Our review provides an assessment of MCID for 6 generic and 18 disease-specific instruments (Table 1). Characteristics of the 47 included articles [30-76] are summarized in Table 2.
Table 1

HRQoL instruments: abbreviations and full names

Instrument-abbreviationInstrument-full name
Generic instruments
 EQ-5DEuropean Quality of Life 5-Dimensions
 SF-36Short Form-36 Health Survey
 SF-12Short Form-12 Health Survey
 SF-6DShort-Form 6-Dimensions
 WHOQOL- 100World Health Organization Quality of Life assessment
 15D15-dimensions Quality of Life
Disease-specific instruments
 EORTC QLQ-C30European Organization for Research and Treatment of Cancer Quality of Life- Questionnaire Core 30
 EORTC QLQ-BM22EORTC QLQ- bone metastases module
 EORTC QLQ-BN20EORTC QLQ- brain module
 EORTC QLQ-C15-PALEORTC QLQ −15 palliative
 EORTC GHSEORTC Global Health Status (GHS)
 UW-QOLQUniversity of Washington Quality of Life Questionnaire
 DLQIDermatology Life Quality Index
 KQoL-26Knee Quality of life 26-item
 VascuQolVascular Quality Of Life
 UCLA-PCIUCLA prostate cancer index
 SIS-16Stroke Impact Scale
 IWQOL-kidsImpact of Weight on Quality of Life-Kids
 QLS /QOLSHeinrichs–carpenter Quality of Life
 PC-QOLParent Cough-Specific Quality of Life
 PedsQLPediatric Quality Of Life Inventory
 ASK Nasal-12

The Anterior Skull Base Nasal

Inventory-12

 PANQOLThe Penn Acoustic Neuroma QOL
 PEmbQoLPulmonary Embolism Quality of Life

HRQoL health-related-quality of life

Table 2

General characteristics of the included studies (N = 47)

Reference (year)DesignPatients, nAge (mean or median)DiseaseHRQOL instrumentMCID estimation method
(n)InstrumentSubscale/dimensionGenericSpecificAnchor-basedDistribution-based
Oncology
 Kvam AK et al. (2010) [30]Prospective23966Multiple myeloma1EORTC QLQ-C30××
 Kvam AK et al. (2011) [31]Prospective23966Multiple myeloma3EORTC QLQ-C30 EQ-5D 15D××××
 Maringwa J et al. (2011) [32]Randomized / controlled clinical trials94152Brain cancer2

QLQ-C30

QLQ-BN20

×××
 Maringwa J et al. (2011) [33]Randomized / controlled clinical trials81257Lung cancer1QLQ-C30×××
 Zeng L et al. (2012) [34]Prospective9363.4Cancer with bone metastases2EORTC QLQ-C30 EORTC QLQ-BM22×××
 Jayadevappa et al. (2012) [35]Prospective60263.3Prostate Cancer2

SF-36

ULCA PCI

××××
 Den Oudsten BL et al. (2013) [36]Prospective

223 = early stag

383 = benign breast problems

52.9

58.6

Early-stage breast cancer1WHOQOL- 100×××
 Hong F et al. (2013) [37]Prospective

Group1 = 191

Group 2 = 436

49

56

Any type of cancer1EORTC QLQ-C30××
 Bedard G et al. (2014) [38]Prospective36957.7Advanced cancer1EORTC QLQ-C30×××
 Binenbaum Y et al. (2014) [39]Prospective101157Oral cavity and Oropharynx Cancer2UW-QOLQ EORTC QLQ-C30××
 Sagberg LM et al. (2014) [40]Prospective16456Intracranial glioma surgery1EQ-5D×××
 Wong E et al. (2015) [41]Prospective9960.6Advanced cancer with brain metastases1EORTC QLQ-BN20×××
 Bedard G et al. (2016) [42]Prospective27665.1Advanced cancer1EORTC QLQ-C15-PAL×××
 Yoshizawa K et al. (2016) [43]Retrospective71066.7Chronic NonCancer Pain1EQ-5D××
 Raman S et al. (2016) [44]Randomized phase III trial20467.5Bone metastases2EORTC QLQ-BM22 EORTC QLQ-C15-PAL×××
 Quinten C et al. (2018) [45]Prospective

741 = surgery

683 = chemotherapy

56.18

52.08

Cancer undergoing chemotherapy or surgery1EORTC Global Health Status (GHS)×××
 Kerezoudis P et al. (2018) [46]Prospective125457.4

Patients with Vestibular

Schwannoma

1PANQOL×××
Rheumatology/Musculoskeletal
 Soer Rt al (2012) [47]Prospective15151.9Low back pain1EQ-5DCategorical VAS××
 Parker SL et al. (2012) [48]Retrospective47NRSymptomatic pseudoarthrosis2SF-12 EQ-5DSF-12 PCS××
 Parker SL et al. (2012) [49]Retrospective5356.3Revision surgery for same-level recurrent lumbar stenosis-associated back and leg pain2SF-12 EQ-5DSF-12 PCS and MCS××
 Parker SL et al. (2013) [50]Prospective6949.3Anterior cervical discectomy and fusion2SF-12 EQ-5DSF-12 PCS and MCS××
 Chuang LH et al. (2013) [51]Clinical trial

Group1 = 121

Group 2 = 218

A = 45.5

GP = 48.1

Suspected internal derangement of the knee1KQoL-26×××
 Díaz-Arribas MJ et al. (2017) [52]Prospective45846.4Low back pain1SF-12PCS and MCS××
 Shi H et al. (2010) [53]Prospective6770.2Revision total hip arthoplasty1SF-36××
 Solberg T et al. (2013) [54]Prospective69246Lumbar disc herniation1EQ-5D××
 Carreon LY et al. (2013) [55]Prospective

722 = primary procedures

333 = revision

60.8

50.9

Primary and revision lumbar fusion surgeries1SF-36PCS××
 Asher AL et al. (2018) [56]Prospective44162Posterior lumbar surgery for grade I degenerative spondylolisthesis1EQ-5D×××
Neurology/Neurovascular
 Kwakkenbos et al. (2013) [57]Prospective211NRSystemic sclerosis2SF-6D EQ-5D××
 Kohn CG et al. (2014) [58]Cross-sectional304456.8Multiple sclerosis1EQ-5D××
 Zhou F et al. (2015) [59]Prospective11357.6Cervical spondylotic myelopathy1SF-36PCS and MCS××
 Fulk GD et al. (2010) [60]Prospective3660.9Stroke1SIS-16××
 Frans FA (2014) [61]Prospective12767Critical Limb Ischemia1VascuQol×××
 Kim SK et al. (2015) [62]Prospective48768.3Stroke2SF-6D EQ-5D××
 Chen P et al. (2016) [63]Prospective6552.8Stroke1EQ-5DEQ-5D-5 L×××
 Yuksel S et al. [2018] [64]Prospective

185 = surgical patients

86 = nonsurgical patients

52.4

44.9

spinal deformity1SF-36PCS and MCS×××
Psychiatry/Gastroenterology/Dermatology
 Le QA et al. (2013) [65]Randomized, controlled trial20037.5Post-traumatic stress disorder1EQ-5D×××
 Thwin SS et al. (2013) [66]Randomized clinical trials35050.7Schizophrenia1QOLS××
 Falissard B et al. (2015) [67]Randomized35138.6Schizophrenia1QLS××
 Stark RG et al. (2010) [68]Cross-sectional50242Bowel inflammatory disease1EQ-5D××
 Basra MK et al. (2015) [69]Prospective19238.7Inflammatory skin disease1DLQI××
Pediatry
 Modi AC et al. (2011) [70]Cross-sectional26315.1Obesity/weight1IWQOL-kids××
 Newcombe PA et al. (2011) [71]Prospective3426.5 monthsPediatric Chronic Cough1PC-QOL×××
 Hilliard ME et al. (2013) [72]Prospective500412.5Diabetes type 1 and 21PedsQL××
Rhinology
 Gravbrot N et al. (2018) [73]Prospective

Patients undergoing transsphenoidal

surgery

1ASK Nasal-12×××
 Hoehle LP et al. (2018) [74]Prospective20354.1Chronic rhinosinusitis (CRS)1EQ-5D

HUV

VAS

×××
Pulmonary
 Akaberi A et al. (2018) [75]Prospective8249.4Pulmonary embolism1PEmbQoL×××
More than one disease
 Alanne S et al. (2015) [76]Prospective49036016 diseases115D××

-: Not used; × used, MCID minimal clinically important difference

HRQoL instruments: abbreviations and full names The Anterior Skull Base Nasal Inventory-12 HRQoL health-related-quality of life General characteristics of the included studies (N = 47) QLQ-C30 QLQ-BN20 SF-36 ULCA PCI 223 = early stag 383 = benign breast problems 52.9 58.6 Group1 = 191 Group 2 = 436 49 56 741 = surgery 683 = chemotherapy 56.18 52.08 Patients with Vestibular Schwannoma Group1 = 121 Group 2 = 218 A = 45.5 GP = 48.1 722 = primary procedures 333 = revision 60.8 50.9 185 = surgical patients 86 = nonsurgical patients 52.4 44.9 Patients undergoing transsphenoidal surgery HUV VAS -: Not used; × used, MCID minimal clinically important difference More than half of the studies were prospective (n = 34, 72.3%), 3 retrospective (6.4%), 3 cross-sectional (6.4%) and 7 were clinical trials (14.9%). Nearly 40% of studies have been conducted in the field of oncology. In addition, 75% of studies estimated the MCID for only one HRQoL instrument while 25% for two or three instruments. Twenty-two (46.8%) studies focused only on a generic HRQoL instrument, 23 (48.9%) studies only on a disease-specific HRQoL instrument, and 2 (4.3%) studies combined both.

Methods of MCID estimation

In this review, 18 (38.3%) of the included studies used only anchor-based methods to estimate the MCID; 6 (12.8%) studies used only distribution-based methods, and 23 (48.9%) combined both to provide more accurate estimates (Table 2).

Anchor-based methods

Type of anchors

Among the 41 studies using anchor-based methods, 36 studies applied non-clinical anchors and only 5 studies applied clinical ones. Anchors adopted in the included studies are presented in Table 3. For each of these anchors, authors predefined different cutoffs that vary depending on the study context.
Table 3

MCID methods estimation: anchors and statistical methods

ReferenceAnchor-basedDistribution-based
n1Anchor (s)ViewpointCutoffs usedStatistical methodsn2Distribution criteria
Kvam AK et al. [30]1Global Rating of Change (GRC: 1–7)PatientImprovement: ‘much better, moderately better and a little better’ Deterioration: ‘a little worse, moderately worse and much worse’CD
Kvam AK et al. [31]1Global Rating of Change (GRC: 1–7)Patient

Improved: ‘much better, moderately better and a little better’

Deteriorated: ‘a little worse, moderately worse and much worse’

AC20.2 SD, 0.5 SD
Maringwa J et al. [32]2

World Health Organization performance status (WHO PS: 0–4)

Mini-mental state examination (MMSE: 1–30)

Clinical

WHO PS: ± 1

MMSE: + 4 or + 5

CD40.2SD, 0.3SD, 0.5SD, SEM
Maringwa J et al. [33]2

World Health Organization performance status (WHO PS:0–4)

Weight change

Clinical

WHO PS: ± 1

Weight gain: < 20%

CD30.2SD, 0.5SD, SEM
Zeng L et al. [34]1Karnofsky Performance Status (KPS: 0–100)Clinical± 10CD40.2 SD, 0.3 SD, 0.5 SD, SEM
Jayadevappa et al. [35]2

Health Transition Item of the SF-36 (HTI: NR)

The patient-reported physical signs/symptoms (NR)

Patient

‘General health’

‘More tired’

Linear regression31SEM, 0.3SD, 0.5SD
Den Oudsten BL et al. [36]1General Health and Overall QoL (− 9 to + 9)Patient

‘Small positive change’: 2 ≤ C ≤ 3

‘Small negative change’: − 3 ≤ C ≤ -2

CD21SEM, 0.5SD
Hong F et al. [37]1

The Subject Significance Questionnaire

(SSQ: − 3 to + 3)

PatientNRLinear regression
Bedard G et al. [38]2

Overall health (1–7)

Overall QoL (1–7)

Patient

Overall health: +  2

Overall QoL: +  2

CD4SEM, 0.2SD, 0.3SD, 0.5SD
Binenbaum Y et al. [39]10.5SD
Sagberg LM et al. [40]1Karnofsky Performance Status (KPS: 0–100)Clinical± 10AC10.5SD
Wong E et al. [41]1Overall QoL (1–7)PatientOverall QoL: 1CD4SEM, 0.2SD, 0.3SD, 0.5SD
Bedard G et al. [42]1Overall QoL (1–7)PatientOverall QoL: +  2CD4SEM, 0.2SD, 0.3SD, 0.5SD
Yoshizawa K et al. [43]1Physician’s global impression of treatment effectiveness (PGI: NR)Physician‘Effective’ vs ‘not effective’ROC
Raman S et al. [44]1Overall QoL (1–7)Patient+  10CD40.2SD, 0.3SD, 0.5SD, SEM
Quinten C et al. [45]3

The 15-item Geriatric Depression Scale (GDS15) (0 to 4)

Visual Analogue Scale (VAS) for Fatigue (0 to 10)

ECOG Performance Status (PS) (0 to 4)

Clinical

Improvement: ‘improved’ vs ‘stabe’

Deterioration: ‘no ‘stabe’ vs ‘worse’

CD10.2SD
Kerezoudis P et al. [46]1Health Transition Item (1–5)Patient‘Somewhat better’ or ‘Somewhat worseCD20.5SD, 1SEM
Soer Rt al [47]2

Pain Disability Index (PDI: 1–10)

Global perceived effect (GPE: 1–7)

PatientPDI: -9 GPE: +4ROC
Parker SL et al. [48]2Health Transition Item (HTI: 1–4) Patient’s satisfaction after the surgeryPatientHTI: ‘Slightly better’ or Markedly better’ Patient’s satisfaction: ‘Yes’ROC, AC, MDC, CD
Parker SL et al. [49]2Health Transition Item of SF-36 (HTI: 1–4) Patient’s satisfaction after the surgeryPatientHTI: ‘Slightly better’ or ‘Markedly better’ Patient’s satisfaction: ‘Yes’ROC, AC, MDC, CD
Parker SL et al. [50]1North America Spine Society (NASS) patient Satisfaction Scale (1–4)Patient‘The treatment met my expectations’ROC, AC, MDC, CD
Chuang LH et al. [51]1Health Transition Item of the SF-36 (HTI: 0–15)Patient‘A little better’ or ‘Somewhat better’ROC21SEM, MDC
Díaz-Arribas MJ et al. [52]1Self-reported health status change between baseline and 12 month-assessment (NR)Patient‘Completely recovered’ or ‘improved’ROC, AC, MDC, CD
Shi H et al. [53]10.5SD
Solberg T et al. [54]1Global Perceived Scale Of Change (1–7)Patient‘Completely recovered’ or ‘much improved’ROC
Carreon LY et al. [55]1MDC
Asher AL et al. [56]1North America Spine Society (NASS) society Satisfaction Scale (1–4)Patient‘Satisfied’ and ‘not satisfied’ groupsAC30.5SD,1SEM,MDC
Kwakkenbos et al. [57]2

Global Rating of Change (GRC) (1–7)

The Health Assessment Questionnaire-Disability Index (HAQ-DI:0–3)

Patient

GRC = 2 ‘somewhat better’ or 4 ‘somewhat worse’

MCID of HAQ-DI: +  0.22

CD
Kohn CG et al. [58]31SEM, 0.5SD, 0.33SD
Zhou F et al. [59]1Health Transition Item of the SF-36 (HTI: 1–4)Patient‘Slightly better’ or Markedly better’ROC, AC, MDC, CD
Fulk GD et al. [60]2Global Rating of Change (GRC: − 7 to 7) scoresPatient + Physician+ 5ROC
Frans FA [61]1The change in Fontaine classification (1–4)PhysicianImprovement: ‘improved’ vs ‘no change’ Deterioration: ‘worse’ vs ‘no change’AC10.5SD
Kim SK et al. [62]2The modified Rankin scale (MRS: 0–5) The Barthel index (BI: 0–20)PatientImprovement: ‘Minimally better’ Deterioration: ‘Minimally worse’CD
Chen P et al. [63]1The perceived recovery score of the Stroke Impact Scale 3.0 (NR)Patient10–15%CD10.5SD
Yuksel S et al. [64]1Global Rating of Change (GRC: −7 to 7) scoresPatientPatients perceiving an improvement as opposed to those who do not (i.e. worse or unchanged)Latent class analysis (LCA)1MDC
Le QA et al. [65]2Clinical Global Impression Improvement (CGI:1–7) The symptom Scale-Interview (PSS-I)PhysicianCGI: 3 or less PSS-I: 23 or less

ROC

Regression analysis

20.2SD, 0.5SD
Thwin SS et al. [66]1

Clinical Global Impressions Improvement

(CGI-I: 1–7)

PhysicianCGI-I: 1Equipercentile method
Falissard B et al. [67]1Clinical Global Impressions of Severity (CGI-S: 1–7)Physician‘Slightly improved’CD
Stark RG et al. [68]1Patient’s perceived improvement after the treatment (NR)Patient

Improvement: ‘better’

Deterioration: ‘worse’

Regression analysis
Basra MK et al. [69]1Global Rating of Change (GRC: − 7 to + 7)PatientSmall change ±2, ±3CD
Modi AC et al. [70]1SEM
Newcombe PA et al. [71]1Verbal category descriptive score (VCD: 0–5)Patient+  1CD3ES, SEM, 0.5SD
Hilliard ME et al. [72]11SEM
Gravbrot N et al. [73]2

The 2-wk postoperative overall nasal functioning item

The 2-wk postoperative Short Form Health Survey 8 bodily pain item

Patient1 unitCD2ES, 0.5SD
Hoehle LP et al. [74]1

A question related to change in general

health-related QOL (1–5)

Patient‘About the same’ compared to ‘A little better’

CD

ROC

10.5 SD
Akaberi A et al. [75]2

General QoL using SF-36 PCS and MCS (0–100)

Dyspnea severity (0–120)

Patient

General QoL = a t least a 4-point

Change

Dyspnea severity = t least a 5-point

change

Repeated-measures mixed-effect models1ES
Alanne S et al. [76]1Subjective five-category global assessment scale (GAS: 1–5)Patient

Improvement: ‘Slightly better’

Deterioration: ‘Slightly worse

ROC

-: Not used, n number of anchors, n number of distribution criteria, MCID minimal clinically important difference, QoL quality of life, AC average change, MDC minimal detectable change, CD change difference, ROC receiver operating curve, ES effect size, SD standard deviation, SEM standard error of measurement

MCID methods estimation: anchors and statistical methods Improved: ‘much better, moderately better and a little better’ Deteriorated: ‘a little worse, moderately worse and much worse’ World Health Organization performance status (WHO PS: 0–4) Mini-mental state examination (MMSE: 1–30) WHO PS: ± 1 MMSE: + 4 or + 5 World Health Organization performance status (WHO PS:0–4) Weight change WHO PS: ± 1 Weight gain: < 20% Health Transition Item of the SF-36 (HTI: NR) The patient-reported physical signs/symptoms (NR) ‘General health’ ‘More tired’ ‘Small positive change’: 2 ≤ C ≤ 3 ‘Small negative change’: − 3 ≤ C ≤ -2 The Subject Significance Questionnaire (SSQ: − 3 to + 3) Overall health (1–7) Overall QoL (1–7) Overall health: +  2 Overall QoL: +  2 The 15-item Geriatric Depression Scale (GDS15) (0 to 4) Visual Analogue Scale (VAS) for Fatigue (0 to 10) ECOG Performance Status (PS) (0 to 4) Improvement: ‘improved’ vs ‘stabe’ Deterioration: ‘no ‘stabe’ vs ‘worse’ Pain Disability Index (PDI: 1–10) Global perceived effect (GPE: 1–7) Global Rating of Change (GRC) (1–7) The Health Assessment Questionnaire-Disability Index (HAQ-DI:0–3) GRC = 2 ‘somewhat better’ or 4 ‘somewhat worse’ MCID of HAQ-DI: +  0.22 ROC Regression analysis Clinical Global Impressions Improvement (CGI-I: 1–7) Improvement: ‘better’ Deterioration: ‘worse’ The 2-wk postoperative overall nasal functioning item The 2-wk postoperative Short Form Health Survey 8 bodily pain item A question related to change in general health-related QOL (1–5) CD ROC General QoL using SF-36 PCS and MCS (0–100) Dyspnea severity (0–120) General QoL = a t least a 4-point Change Dyspnea severity = t least a 5-point change Improvement: ‘Slightly better’ Deterioration: ‘Slightly worse -: Not used, n number of anchors, n number of distribution criteria, MCID minimal clinically important difference, QoL quality of life, AC average change, MDC minimal detectable change, CD change difference, ROC receiver operating curve, ES effect size, SD standard deviation, SEM standard error of measurement Among the 36 studies using non-clinical anchors, 30 of them chose anchors from the viewpoint of patients, 5 from the viewpoint of physicians and 1 from the viewpoint of both. Anchors from patient point of view are based on questions to assess how a patient feels about his or her current health status over time or on Patient-Reported Outcomes (PRO): The Global Rating of Change (GRC) scale (n = 6): used by authors on a 15-point ordinal scale or on a 7-point scale. Global and transition questions: the most common was the Health Transition Item (HTI) of the SF-36 (n = 6). The other questions related to the instruments were differently applied and are described in detail in Table 3. PRO such as Pain Disability Index, the perceived recovery score of the Stroke Impact scale, the Symptom Scale-Interview … Other scales such as the Modified Rankin Scale, the Barthel Index … Five studies used a physician point of view anchor: The dichotomous physician’s global impression of treatment effectiveness (PGI): this question was a discrete choice of “effective” or “not effective” treatment. The Clinical Global Impressions scales: Improvement (CGI-I) or severity (CGI-S). The change in Fontaine classification: rated on a 4-point scale (much improved, improved, unchanged and worse). Four studies used a Performance Status (PS) as clinical anchor: The Karnofsky Performance Scale (KPS). The World Health Organization Performance Status (WHO PS), combined with Mini-Mental State Exam (MMSE) or Weight change.

Statistical methods used for anchors-based methods

Among the 41 studies using anchor-based methods, 36 applied only one statistical method. These methods were Change Difference (CD), Receiver Operating Curve (ROC), Regression analysis (REG), Average Change (AC) and Equipercentile Linking (EL). Furthermore, 5 studies combined many of these methods (Fig. 2, Table 3).
Fig. 2

Review of Statistical methods applied in the included studies

Review of Statistical methods applied in the included studies Mostly, determination of MCID was based on the calculation of a change of HRQoL score between two times, from a baseline (longitudinal study). Among the 36 studies using only one statistical method, the most common were (Fig. 2): The CD: MCID was identified by the difference between the average of HRQoL score change of responder patients (defined by the anchor) and the average score change of non-responder patients. The ROC: created by plotting the sensitivity of the instrument (the true positive rate) against the specificity (the false positive rate). Some studies [43, 51, 54, 60, 76] identified the MCID as the upper corner of the curve, and other studies [45, 65] identified the MCID as the point of the receiver operating characteristics curve in which sensitivity and specificity are maximized (Maximum (Sensibility+Specificity-1), Youden index). The area under the curve (AUC) was always calculated to measure the instrument responsiveness, suggesting AUC values upper than 0.7. The regression analysis of HRQoL score (or change) by anchor as regressor: authors defined the MCID as the coefficient estimate of the anchor. The method of AC: by relating the average of the HRQoL score change observed in patients classified as responders according to the anchor. The EL: the value of change in the HRQoL score that corresponds in percentile rank to the change in the anchor is interpreted as the MCID. All 5 studies combining 4 methods: CD, ROC, AC and MDC methods. The MDC (Minimal Detectable Change) is defined as the upper limit of the 95% confidence interval (CI) of the average change detected in non-responders. Two of these 5 studies [49, 50] chose the MDC as the most appropriate method to identify the MCID, since it was the only method to provide a threshold above the 95% Confidence Interval of the unimproved cohort (greater than the measurement error). The three other studies [46, 52, 59] did not find a difference between the 4 methods to determine the true value of MCID.

Distribution-based methods

Among the 29 studies using distribution-based methods, 13 applied only one method, while most studies (n = 16) combined more than one distributional method (Table 3). The most common were (Fig. 2): Multiples of Standard Deviation were used as MCID: 0.5SD, 0.3SD, 1/3SD, 0.2SD. Most authors (n = 22) used 0.5 Standard Deviation of the HRQoL mean change score between two time points. Frequently 2 or 3 multiples of 0.5SD, 0.3SD and/or 0.2SD (n = 12) were used, only one with 1/3SD. Multiples of SD were related to effect size: 0.2SD (small effect) to 0.5SD (median effect). The Standard Error of Measurement (SEM): calculated by the formula SEM = SD √(1-r) where r is a reliability estimation of HRQoL score (ratio of the true score variance to the observed score variance or internal consistency measure as Cronbach’s alpha). This characteristic of precision was frequently used (n = 16), and associated with multiple SD. The Effect Size (ES): used in one study and represents the standardized HRQol score change. Common statistic is calculated by the ratio of the score change divided by the standard deviation of the score. The Minimal Detectable Change (MDC): used in two studies and calculated as 1.96 × √2 × SEM (for a 95% confidence interval). The MDC represents the smallest change above the measurement error with a confidence interval. Thereby, most studies (n = 16) combined the fractionations including 0.2 SD, 0.3 SD or 0.5 SD and/or 1 SEM in order to provide a range of MCID values (Table 3).

MCID values

As shown in the supplementary file (see Additional file 1), variability in MCID results were observed for each HRQoL instrument, depending on: Pathology: MCID for SF-36 PCS ranged from 4.9 to 5.21 [55] and from 4.09 to 9.62 [59] for Rheumatology and neurology population, respectively. Methodology: even for the same pathology, MCID values were variable. For example, for EQ-5D, MCID values, using anchor and/or distribution-based methods, varied from 0.01 to 0.39 for patients with rheumatology/musculoskeletal disorders [47–50, 54] and from 0.08 to 0.15 for oncology patients [31, 40, 43]. For patients with psychology disorders, MCID ranged from 0.05 to 0.08 using anchor-based method, and 0.04 to 0.1 using distribution-based method [65]. Statistical method: MCID for EORTC QLQ-C30 in oncology patients ranged from − 27 to 17.5 using CD method [30], − 12 to 8 using AC method [31] and − 11.8 to 11.8 using regression analysis [37]. Change direction: some studies calculated MCID irrespective of the change direction or separately for improvement and deterioration without major impact on MCID values and did not find a major impact on MCID values. The WHOQOL-100, for example, was assessed in early-stage breast cancer population [36], MCID for improvement ranged from 0.51 to 1.27 and for decline from − 1.56 to − 0.71.

Discussion

Our systematic review identified 47 studies reporting anchors and statistical methods to estimate MCID for generic and disease-specific HRQoL instruments. This review pointed out that the interest of MCID in HRQoL instruments has been increasing in the recent years and the largest work has been done in the field of oncology disorders. Most studies used anchor-based methods in our review (n = 41), either alone or in combination with distribution-based methods. As discussed by Gatchel and Mayer [77], anchor-based methods are good depending on the choice of the external criteria as well as the methodology used. We observed multiple anchors chosen by authors, and the most common anchors were non-clinical and from the viewpoint of patients in order to assess how a patient feels about his or her current health status over time. These anchors are well-studied and applicable to a wide range of patients [78]. However, patients may be aware that the phase of their disease is deteriorating, thus they will conclude that their HRQoL is similarly deteriorating. Furthermore, the patients’ subjective experiences are related to the way in which people construct their memories. It is hard for people to accurately recall a previous health state; they will rather create an impression of how much they have changed by considering their present state and then retrospectively applying some idea of their change over time. Hermann [79] described the problem of “recall bias” where events intervening between the anchor points influence the recall of the original status, while Schwartz and Sprangers [80] described “response-shift” where a patient’s response is influenced by a changing perception of their context. Clinical anchors were not widely applied in the included studies. Changes in Performance Status (PS), in particular the KPS and the WHO PS, were chosen by authors because of accessibility and interpretability [25]. As they do not provide MCID values per se [81], clinical anchors were applied in our review with distribution criterion. However, we did not find any combination with another subjective anchor. Authors recommended the usage of multiple independent anchors [20]. Anchors must be easily interpretable, widely used and at least moderately correlated with the instrument being explored [8, 32, 33]. According to Cohen’s [82], 0.371 was recommended as a correlation threshold to define an important association. However, anchor-based methods may be vulnerable to recall bias, and as was evident in our review, different anchors may produce widely different estimates for the same HRQoL instrument. Cut-offs for different anchors were differently assigned by authors. Even for the same anchor, many cutoffs were used. There is no agreement on the exact cutoffs for anchors, they are generally assigned for the purposes of research and depending on study context and anchor used [29]. Once the anchor has been chosen, different statistical methods were applied to estimate the MCID. The most established method in our review was the mean change score, also called the Change Difference (CD) method. This latter is defined as the mean change of patients who improved and, therefore, authors can set its cutoffs on the basis of the change score of patients who were shown to have had a small, moderate, or large change. MCID corresponds to the difference between two adjacent levels on the anchor. MCID would depend on the number of levels on the anchor: the larger the number of levels, the smaller the difference between two adjacent levels, and the smaller the MCID [83]. Each of the statistical methods has its specific concepts and produces a MCID value different from the other methods. Some authors pointed out that the largest threshold value is most often generated from the average change method, whereas the smallest threshold from the CD and MDC methods [48-50]. Therefore, the usage of patient point of view’ anchors by most studies may be explained by the lack of satisfying objective scales, which incite the usage of subjective anchors in first place. In addition, perhaps the CD method is simple to apply by many authors, but we cannot affirm that this is the most relevant method. We conclude that there are many faces to the MCID, it is not a simple concept, nor simple to estimate. In addition, distribution-based methods, derived from statistical analysis, were also applied in few studies. In accordance with literature [24-29], most often fractionations in our review include 0.2 SD, 0.3 SD, 0.5 SD and 1 standard error of measurement (SEM). Some studies determined that MCID corresponded closest to the 0.5 SD estimate. The 0.5 SD was the value in which most meaningful changes fall, as previously proven in a study by Norman et al. [84]. Distribution-based methods also produced different values of MCID depending on the distributional criterion. Nevertheless, distribution-based methods do not address the question of clinical importance and ignore the aim of MCID, which is to define the clinical importance distinctly from statistical significance. Authors recommended the usage of these methods when anchor-based calculations are unavailable [20]. In this review, MCID values were defined for Patient-Reported Outcomes (PRO) measuring HRQoL using the two methods: anchor and distribution-based methods. Some studies had been developed to determine MCID for the Patient-Reported Outcomes Measurement Information System (PROMIS) instruments, using the same reported statistical methods. In recent years, the PROMIS Network (www.nihpromis.org), a National Institutes of Health Roadmap Initiative, has advanced PRO measurement by developing item banks for measuring major self-reported health domains affected by chronic illness. Therefore, further studies should be developed to determine the meaningful change in HRQoL for PROMIS. Summing up, we did not observe a single MCID value for any HRQoL instrument in our review. Several factors may influence this variability. On one hand, we found many available methods that produced many MCID values for the same HRQoL instrument. Authors applied, for the same instrument and in the same cohort, four different methods and reported four different MCID values [48–50, 52, 59], which suggest that variation could be explained further than differences in disease severity or disease group since the same cohort of patients was analyzed. On the other hand, even with the same methodology for the same instrument, MCID values vary since the variability may be related to study population, in particular, patient demographics and patient baseline status. Wang et al. [85] stated that MCID scores are context-specific, depending on patient baseline and demographic characteristics. Therefore, factors affecting MCID values are specific to the population being studied and are non-transferable across patient groups, also related to the multiple reported conceptual and methodological differences. Our review exhibits some limitations that deserve mention. First, the search strategy only focused on Pubmed and Google scholar, which might have caused the loss of some papers. However, the inclusion of grey literature is a useful source of relevant information, which ensures a certain standard of quality of the selected papers. In addition, our review was limited to the nine last years, which can also lead to the loss of some papers. To our knowledge, there is no review published recently to define MCID in the QoL field, our objective was therefore to provide researchers the new statistical methods to be applied for further researches. The following question remains to be answered “which is the best method for MCID”? Sloan J (2005) [86] stated that “there are many methods available to ascertaining an MCID, none are perfect, but all are useful”. The MCID can be best estimated using a combination of anchor and distribution measures to triangulate toward a single value. Using several methods enables to assess the robustness of the results. This corresponds to a sensitivity analysis not on the data but on the methods. Anchor-based methods should be used as primary measures with distribution methods as supportive measure.

Conclusion

We conclude that many methods have become available, which lead to different estimations of MCID. MCID should be based on the context of each clinical study. Therefore, in order to stay cautious while interpreting MCID in the field of Quality of Life, close collaboration between statisticians and clinicians may be critical and necessary in order to integrate an agreement regarding the appropriate method to determine MCID. Moreover, as performed for the data, a sensitivity analysis on method, ie performing the analysis with several methods is highly recommended to assess the robustness of the results. Additional file 1.
  83 in total

Review 1.  Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes.

Authors:  Dennis Revicki; Ron D Hays; David Cella; Jeff Sloan
Journal:  J Clin Epidemiol       Date:  2007-08-03       Impact factor: 6.437

2.  A comparison of the measurement properties and estimation of minimal important differences of the EQ-5D and SF-6D utility measures in patients with systemic sclerosis.

Authors:  Linda Kwakkenbos; Jaap Fransen; Madelon C Vonk; Eni S Becker; Maurice Jeurissen; Frank H J van den Hoogen; Cornelia H M van den Ende
Journal:  Clin Exp Rheumatol       Date:  2013-07-22       Impact factor: 4.473

3.  Clinimetrics Corner: The Minimal Clinically Important Change Score (MCID): A Necessary Pretense.

Authors:  Chad E Cook
Journal:  J Man Manip Ther       Date:  2008

4.  The minimal important difference: who's to say what is important?

Authors:  J G Wright
Journal:  J Clin Epidemiol       Date:  1996-11       Impact factor: 6.437

5.  Estimating the minimum important change in the 15D scores.

Authors:  Soili Alanne; Risto P Roine; Pirjo Räsänen; Tarja Vainiola; Harri Sintonen
Journal:  Qual Life Res       Date:  2014-08-22       Impact factor: 4.147

6.  Testing minimal clinically important difference: consensus or conundrum?

Authors:  Robert J Gatchel; Tom G Mayer
Journal:  Spine J       Date:  2010-04       Impact factor: 4.166

7.  Responsiveness and minimal important score differences in quality-of-life questionnaires: a comparison of the EORTC QLQ-C30 cancer-specific questionnaire to the generic utility questionnaires EQ-5D and 15D in patients with multiple myeloma.

Authors:  Ann Kristin Kvam; Peter M Fayers; Finn Wisloff
Journal:  Eur J Haematol       Date:  2011-07-26       Impact factor: 2.997

8.  Minimal Clinically Important Difference in Quality of Life for Patients With Low Back Pain.

Authors:  María J Díaz-Arribas; Mónica Fernández-Serrano; Ana Royuela; Francisco M Kovacs; Tomás Gallego-Izquierdo; Mabel Ramos-Sánchez; Rosa Llorca-Palomera; Pedro Pardo-Hervás; Oscar S Martín-Pariente
Journal:  Spine (Phila Pa 1976)       Date:  2017-12-15       Impact factor: 3.468

9.  Defining the minimal clinically important difference (MCID) of the Heinrichs-carpenter quality of life scale (QLS).

Authors:  Bruno Falissard; Christophe Sapin; Jean-Yves Loze; Wally Landsberg; Karina Hansen
Journal:  Int J Methods Psychiatr Res       Date:  2015-08-04       Impact factor: 4.035

10.  Minimum clinically important difference of the health-related quality of life scales in adult spinal deformity calculated by latent class analysis: is it appropriate to use the same values for surgical and nonsurgical patients?

Authors:  Selcen Yuksel; Selim Ayhan; Vugar Nabiyev; Montse Domingo-Sabat; Alba Vila-Casademunt; Ibrahim Obeid; Francisco Sanchez Perez-Grueso; Emre Acaroglu
Journal:  Spine J       Date:  2018-08-27       Impact factor: 4.166

View more
  26 in total

1.  Effect of Digoxin vs Bisoprolol for Heart Rate Control in Atrial Fibrillation on Patient-Reported Quality of Life: The RATE-AF Randomized Clinical Trial.

Authors:  Dipak Kotecha; Karina V Bunting; Simrat K Gill; Samir Mehta; Mary Stanbury; Jacqueline C Jones; Sandra Haynes; Melanie J Calvert; Jonathan J Deeks; Richard P Steeds; Victoria Y Strauss; Kazem Rahimi; A John Camm; Michael Griffith; Gregory Y H Lip; Jonathan N Townend; Paulus Kirchhof
Journal:  JAMA       Date:  2020-12-22       Impact factor: 56.272

2.  Use, cost-effectiveness, and end user perspectives of a home solar lighting intervention in rural Uganda: a mixed methods, randomized controlled trial.

Authors:  Radhika Sundararajan; Helen D'Couto; Joseph Mugerwa; Mellon Tayebwa; Nicholas Lam; Eli Wallach; Matthew Wiens; Matthew Ponticiello; Debbi Stanistreet; Alexander C Tsai; Jose Vallarino; Joseph G Allen; Daniel Muyanja; Mark G Shrime; Edwin Nuwagira; Peggy S Lai
Journal:  Environ Res Lett       Date:  2021-12-22       Impact factor: 6.793

3.  The minimal important change for the EQ VAS based on the SF-36 health transition item: observations from 25772 spine surgery procedures.

Authors:  Anders Joelson; Fredrik Nerelius; Freyr Gauti Sigmundsson; Jan Karlsson
Journal:  Qual Life Res       Date:  2022-07-11       Impact factor: 3.440

4.  Interpretation of a Quantitative Diagnosis Model of Traditional Chinese Medicine Syndromes Based on Computer Adaptive Testing.

Authors:  Simeng Yao; Zhongyu Huang; Xianhua Liu; Qiaofeng Yan; Jing Tang; Fengbin Liu; Zhengkun Hou
Journal:  Evid Based Complement Alternat Med       Date:  2022-06-30       Impact factor: 2.650

5.  Interpreting Patient-Reported Outcome Scores: Pediatric Inflammatory Bowel Disease as a Use Case.

Authors:  Julia Schuchard; Adam C Carle; Michael D Kappelman; Carole A Tucker; Christopher B Forrest
Journal:  Acad Pediatr       Date:  2022-01-05       Impact factor: 2.993

6.  What Are the Minimally Important Changes of Four Commonly Used Patient-reported Outcome Measures for 36 Hand and Wrist Condition-Treatment Combinations?

Authors:  Lisa Hoogendam; Jaimy Emerentiana Koopman; Yara Eline van Kooij; Reinier Feitz; Caroline Anna Hundepool; Chao Zhou; Harm Pieter Slijper; Ruud Willem Selles; Robbert Maarten Wouters
Journal:  Clin Orthop Relat Res       Date:  2021-12-27       Impact factor: 4.755

7.  Power(ful) myths: misconceptions regarding sample size in quality of life research.

Authors:  Samantha F Anderson
Journal:  Qual Life Res       Date:  2021-10-29       Impact factor: 3.440

8.  Health related quality of life and satisfaction with care of stroke patients in Budapest: A substudy of the EuroHOPE project.

Authors:  Ildikó Szőcs; Balázs Dobi; Judit Lám; Károly Orbán-Kis; Unto Häkkinen; Éva Belicza; Dániel Bereczki; Ildikó Vastagh
Journal:  PLoS One       Date:  2020-10-22       Impact factor: 3.240

9.  Patient-reported outcomes, sociodemographic and clinical factors are associated with 1-year mortality in patients with ischemic heart disease-findings from the DenHeart cohort study.

Authors:  Trine Bernholdt Rasmussen; Britt Borregaard; Pernille Palm; Rikke Elmose Mols; Anne Vinggaard Christensen; Knud Juel; Ola Ekholm; Charlotte Brun Thorup; Lars Thrysoee; Marie Gjengedal; Selina Kikkenborg Berg
Journal:  Qual Life Res       Date:  2021-07-22       Impact factor: 4.147

10.  Estimation of minimally important difference of the EQ-5D-5L utility scores among patients with either hypertension or diabetes or both: a cross-sectional study in Hong Kong.

Authors:  Richard Huan Xu; Eliza Lai-Yi Wong; Annie Wai-Ling Cheung
Journal:  BMJ Open       Date:  2020-11-26       Impact factor: 2.692

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.