| Literature DB >> 35270209 |
Estu Meilani1, Asfarina Zanudin1, Nor Azlin Mohd Nordin1.
Abstract
BACKGROUNDS: Assessing quality of life (QoL) using a well-developed and validated questionnaire is an essential part of a breast cancer-related lymphedema (BCRL) treatment. However, a QoL questionnaire with the best psychometric properties is so far unknown. The aim of this systematic review is to evaluate the psychometric properties of the questionnaires measuring the QoL of patients with BCRL.Entities:
Keywords: breast cancer-related lymphedema; psychometric properties; quality of life; questionnaire
Mesh:
Year: 2022 PMID: 35270209 PMCID: PMC8909332 DOI: 10.3390/ijerph19052519
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
COSMIN definitions of measurement properties.
| Measurement Properties | Definition * |
|---|---|
| Content validity | The degree to which the content of a PROM is an adequate reflection of the construct to be measured |
| Structural validity | The degree to which the scores of a PROM are an adequate reflection of the dimensionality of the construct to be measured |
| Internal consistency | The degree of the interrelatedness among the items |
| Cross-cultural validity | The degree to which the performance of the items on a translated or culturally adapted PROM is an adequate reflection of the original version of the PROM |
| Reliability | The proportion of the total variance in the measurements which is due to “true” differences between patients |
| Measurement error | The systematic and random error of a patient’s score that is not attributed to true changes in the construct to be measured |
| Criterion validity | The degree to which the scores of a PROM are an adequate reflection of a “gold standard” |
| Hypothesis testing for construct validity | The degree to which the scores of a PROM are consistent with the hypothesis (for instance with regard to internal relationships, relationships to scores of other instruments, or differences between relevant groups) based on the assumption that the PROM validly measures the construct to be measured |
| Responsiveness | The degree to which the scores of a PROM to detect change over time in the construct is to be measured |
* Definitions were adapted from COSMIN manual for systematic reviews of PROMs [30]; PROMs = patient-reported outcome measures.
Criteria for good measurement properties.
| Measurement Properties | Rating | Criteria * |
|---|---|---|
| Structural validity | + | |
| ? | ||
| − | Criteria for ‘+’ not met | |
| Internal consistency | + | At least low evidence for sufficient structural validity AND Cronbach’s alpha(s) ≥0.70 for each unidimensional scale or subscale |
| ? | Criteria for “At least low evidence for sufficient structural validity” not met | |
| − | At least low evidence for sufficient structural validity AND Cronbach’s alpha(s) < 0.70 for each unidimensional scale or subscale | |
| Reliability | + | ICC or weighted Kappa ≥ 0.70 |
| ? | ICC or weighted Kappa not reported | |
| − | ICC or weighted Kappa < 0.70 | |
| Measurement error | + | SDC or LoA < MIC |
| ? | MIC not defined | |
| − | SDC or LoA > MIC | |
| Hypothesis testing for construct validity | + | The result is in accordance with the hypothesis |
| ? | No hypothesis defined (by the review team) | |
| − | The result is not in accordance with the hypothesis | |
| Cross-cultural validity | + | No important differences found between group factors (such as age, gender, language) in multiple group factor analysis OR no important DIF for group factors (McFadden’s R2 < 0.02) |
| ? | No multiple group factor analysis OR DIF analysis performed | |
| − | Important differences between group factors were found | |
| Criterion validity | + | Correlation with gold standard ≥ 0.70 OR AUC ≥ 0.70 |
| ? | Not all information for ‘+’ reported | |
| − | Correlation with gold standard < 0.70 OR AUC < 0.70 | |
| Responsiveness | + | The result is in accordance with the hypothesis OR AUC ≥ 0.70 |
| ? | No hypothesis defined (by the review team) | |
| − | The result is not in accordance with the hypothesis OR AUC < 0.70 |
* Criteria adapted from COSMIN manual for systematic reviews of PROMs [30]; “+” = sufficient, “−” = insufficient, “?” = indeterminate, AUC = area under the curve, CFA = confirmatory factor analysis, CFI = comparative fit index, CTT = classical test theory, DIF = differential item functioning, ICC = intraclass correlation coefficient, IRT = item response theory, LoA = limits of agreement, MIC = minimal important change, RMSEA: root mean square error of approximation, SEM = standard error of measurement, SDC = smallest detectable change, SRMR: standardized root mean residuals, TLI = Tucker–Lewis index.
Figure 1PRISMA flowchart on the study selection process.
Characteristics of included studies.
| Author (ref) | Country | PROM | Objective of Study | Sample Size | Age Mean ± SD (Range) Year | Gender (% Female) | Lymphedema Characteristics | ||
|---|---|---|---|---|---|---|---|---|---|
| Type | Duration | Severity | |||||||
| Bakar et al. 2017 [ | Turkey | LYMQoL-Arm A | To translate the English version of LYMQoL to Turkish and to test the reliability and validity of the Turkish version of LYMQoL among patients with BCRL in Turkey | 4 translators | 50.6 ± 12.45 (24–75) | 100% | BCRL | 4.32 ± 3.06 (1–18) years | Not specified |
| Karayurt et al. 2021 [ | Turkey | LYMQoL-Arm A | To adapt Quality of Life Measure for Limb Lymphedema-Arm (LYMQoL-Arm) into Turkish (TR) and test its validity and reliability | 6 translators | 55.69 ± 9.33 (35–79) | 100% | BCRL | 3.28 ± 2.91 (1–13) years | Mild-severe |
| Borman et al. 2018 [ | Turkey | LYMQoL-Arm B | To translate and validate the LYMQoL-Arm for Turkish breast cancer patients with lymphedema | 4 experts for the translation process | 51.8 ± 9.8 (31–82) | 100% | BCRL | 21.1 ± 38.7 (0.2–164) months | Stage 1–3 |
| Degirmenci et al. 2019 [ | Turkey | LLIS ver 1 | To investigate the validity and reliability of the Turkish adaptation of the LLIS in patients with lymphedema | 2 translators | 53.6 ± 11.8 (28–83) | 97.5% for UL group | 70.7% BCRL; 0.94% lymphoma; 25.4% LL lymphedema | Median = 24 (1–396) months for UL | Stage 1–2 for UL |
| Haghighat et al. 2018 [ | Iran | LLIS ver 1 | To validate the Persian version of the LLIS questionnaire | 2 translators | 53.28 ± 10.95 | 100% | Unilateral BCRL | Not specified | Not specified |
| Orhan et al. 2019 [ | Turkey | LLIS ver 2 | To translate and culturally adapt the LLIS ver 2 into Turkish and perform a psychometric evaluation of the Turkish LLIS ver 2 in patients with BCRL | 10 experts for the translation process | 56.5 ± 10.21 | 100% | 69.02% BCRL; 30.9% non-LE | 0–6 mo: 20.5% | Not specified |
| Sharour 2020 [ | Jordan | LLIS ver 2 | To translate and validate an Arabic version of the LLIS | 3 experts for the translation process | 44.1 ± 1.10 | 100% | BCRL | 0–6 mo: 80% | Not specified |
| Devoogdt et al. 2011 [ | Belgium | Lymph-ICF-UL | To investigate the reliability (test-retest, internal consistency, measurement variability) and validity (content and construct) of the newly developed Lymph-ICF in breast cancer patients with lymphedema | 20 patients for phase 1 (generating items) | 61.2 ± 10.0 (objective LE); 56.7 ± 9.3 (subjective LE); 58.3 ± 11.9 (non-LE) | 100% | 66% BCRL; 33.3% non-LE | Objective LE = 41 ± 64 months | Not specified |
| Grarup et al. 2018 [ | Denmark | Lymph-ICF-UL | To translate and culturally adapt the original Dutch version of Lymph-ICF into Danish and examine its content validity and reliability | 4 experts for the translation process | 61 ± 12.4 (validation studies); 61.5 ± 9.7 (cognitive debriefing) | 100% | BCRL | 15.5 ± 58 months for validation studies | Mild to severe |
| de Vrieze et al. 2019 [ | Belgium | Lymph-ICF-UL | To examine the validity and reliability of the Lymph-ICF-UL with NRS in patients with BCRL | 56 patients | 62 ± 10 | 100% | BCRL | 34.5 months | Stage I, IIa, IIb |
| de Vrieze et al. 2020 [ | Belgium | Lymph-ICF-UL | To examine the internal and external responsiveness of the Lymph-ICF-UL in patients with BCRL | 95 patients | 62 ± 10 | 100% | BCRL | 53 ± 42.5 | Stage I, IIa, IIb |
| de Vrieze et al. 2021 [ | Belgium | Lymph-ICF-UL | To perform a cross-cultural validation of the Lymph-ICF-UL French version in patients with BCRL of the arm and/or hand | 3 experts and 3 patients for the translation process | 64 ± 11 | 100% | BCRL | 78 months | Stage I, IIa, IIb |
| Zhao et al. 2022 [ | China | Lymph-ICF-UL | To translate the Lymph-ICF-UL into a Chinese version and subsequently test its reliability and validity among patients with BCRL in a Chinese context | 5 translators | 26–70 | 100% | 63.2% BCRL; 36.7% non-LE | 2–19 months | Stage 0–3 |
| Ridner and Dietrich 2015 [ | USA | LSIDS-A | To develop and examine the psychometric properties (validity and reliability) of LSIDS-A in breast cancer patients experiencing upper limb lymphedema | 128 for preliminary testing | 58.9 ± 11.0 | 100% | BCRL | Not specified | 84.5% had stage II lymphedema |
| Deveci et al. 2021 [ | Turkey | LSIDS-A | To adapt LSIDS-A into Turkish and to test its validity and reliability in patients with BCRL | 6 translators | 55.4 ± 10.2 (20–80) | 100% | BCRL | 48.8 ± 49.5 (1–204) months | Not specified |
| Viehoff and Wittink 2008 [ | Netherland | ULL-27 | To translate the ULL27 into Dutch and to assess its internal consistency and validity for Dutch patients with upper limb lymphedema | 3 translators | 59 ± 11.79 (34–80) | 100% | BCRL | 35.51 ± 45.14 (0.5–276) months | Not specified |
| Vatansever et al. 2020 [ | Turkey | ULL-27 | To perform translation, cultural adaptation, and validation of ULL-27 in Turkish-speaking population of BCRL; To assess QoL of Turkish BCRL patients | 4 translators | 54.96 ± 11.35 | 100% | BCRL | 23.12 ± 30.88 months | Mild to severe |
| Williams et al. 2018 [ | Australia | ULL-QoL | To develop PROM specific to the assessment of HRQoL associated with upper limb lymphedema and assess its psychometric properties | 24 patients for PROM development | 60.3 ± 13.0 (23–86) | 97% | 99% BCRL, 1% Non-Hodgkin’s lymphoma | Not specified | Not specified |
| Klassen et al. 2021 [ | Canada | LYMPH-Q Upper Extremity | To describe the development and psychometric validation of the LYMPH-Q Upper Extremity Module | 15 patients for qualitative interviews | 40–70 | 100% | BCRL | ≤4 yrs: 31% | Mild to severe |
SD = standard deviation, LYMQoL-Arm = Lymphedema Quality of Life Tool-Arm, BCRL = breast cancer-related lymphedema; LLIS 1 = Lymphedema Life Impact Scale version 1, UL = upper limb, LL = lower limb, LE = lymphedema, EORTC QLQ-C30 = European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Core 30, SF-36 = 36-items Short Form Health Survey, LLIS 2 = Lymphedema Life Impact Scale version 2, CVI = chronic venous insufficiency, DVT = deep vein thrombosis, Lymph-ICF-UL = Lymphedema Functioning, Disability, and Health Questionnaire for Upper Limb, Ly-QLI = Lymphedema Quality of Life Inventory, LSIDS-A = Lymphedema Symptom Intensity and Distress Survey-Arm, PROM = patient-reported outcome measure.
Characteristics of included PROMs.
| PROM | Ref | Country (Language in which the PROM was Evaluated) | No of Items | Subscales | Recall Period | Response Option | Scoring | Original Language | Available Translation |
|---|---|---|---|---|---|---|---|---|---|
| LYMQOL-Arm A (Lymphedema Quality of Life Tool-Arm A) | Bakar et al. 2017 [ | Turkey | 21 | 4 domains: function, appearance, symptoms, mood | Not specified | Domains: 4-point Likert scale (1–4); overall QoL: 0–10 scale | Total score of all domains and overall QoL score | English | Turkish |
| Karayurt et al. 2021 [ | Turkey | 21 | 4 subscales: symptom, body image/appearance, function, mood | Not specified | Domains: 4-point Likert scale (1–4); overall QoL: 0–10 scale | Total score of all domains and overall QoL score | English | Turkish | |
| LYMQoL-Arm B (Lymphedema Quality of Life Tool-Arm B) | Borman et al. 2018 [ | Turkey | 28 (adding 7 sub-questions) | 4 domains: function, appearance, symptoms, mood | Not specified | Domains: 4-point Likert scale (1–4); overall QoL: 0–10 scale | Total score of all domains and overall QoL score | English | Turkish |
| LLIS 1 (Lymphedema Life Impact Scale version 1) | Degirmenci et al. 2019 [ | Turkey | 18 | 3 subscales: physical, psychosocial, functional | Not specified | 5-point Likert scale (1–5) | Total score, subscale score | English | Turkish, Persian |
| Haghighat et al. 2018 [ | Iran | 18 | 3 subscales: physical, psychosocial, functional | Not specified | 5-point Likert scale (1–5) | Total score, subscale score | English | Turkish, Persian | |
| LLIS 2 (Lymphedema Life Impact Scale version 2) | Orhan et al. 2019 [ | Turkey | 18 | 3 subscales: physical, psychosocial, functional | Not specified | 5-point Likert scale (0–4) | Total score, subscale score | English | Turkish, Arabic |
| Sharour 2020 [ | Jordan | 18 | 3 subscales: physical, psychosocial, functional | Not specified | 5-point Likert scale (0–4) | Total score, subscale score | English | Turkish, Arabic | |
| Lymph-ICF-UL (Lymphedema Functioning, Disability, and Health Questionnaire for Upper Limb) | Devoogdt et al. 2011 [ | Belgium | 29 | 5 domains: physical, mental, household, mobility, life, and social activities | Complaints during the last 2 weeks | Visual Analog Scale (VAS) 0–100 mm | Total score, domain score | Dutch | English, Danish, French, Chinese |
| Grarup et al. 2018 [ | Denmark | 29 | 5 domains: physical, mental, household, mobility, life, and social activities | Complaints during the last 2 weeks | Visual Analog Scale (VAS) 0–100 mm | Total score, domain score | Dutch | English, Danish, French, Chinese | |
| de Vrieze et al. 2019 [ | Belgium | 29 | 5 domains: physical, mental, household, mobility, life, and social activities | Complaints during the last 2 weeks | 11-point Likert scale (0–10) | Total score, domain score | Dutch | English, Danish, French, Chinese | |
| de Vrieze et al. 2020 [ | Belgium | 29 | 5 domains: physical, mental, household, mobility, life, and social activities | Complaints during the last 2 weeks | 11-point Likert scale (0–10) | Total score, domain score | Dutch | English, Danish, French, Chinese | |
| de Vrieze et al. 2021 [ | Belgium | 29 | 5 domains: physical, mental, household, mobility, life, and social activities | Complaints during the last 2 weeks | 11-point Likert scale (0–10) | Total score, domain score | Dutch | English, Danish, French, Chinese | |
| Zhao et al. 2022 [ | China | 29 | 5 domains: physical, mental, household, mobility, life, and social activities | Complaints during the last 2 weeks | 11-point Likert scale (0–10) | Total score, domain score | Dutch | English, Danish, French, Chinese | |
| LSIDS-A (Lymphedema Symptom Intensity and Distress Survey-Arm) | Ridner and Dietrich 2015 [ | USA | 36 | 7 clusters: soft tissue sensation, neurological sensation, function, biobehavioral, resource, sexuality, activity | Reflective period of 1 week | Yes/no response, if ‘yes’ then 1–10 rating was solicited | Overall score, cluster score, intensity, and distress score | English | Turkish |
| Deveci et al. 2021 [ | Turkey | 36 | 7 clusters: soft tissue sensation, neurological sensation, function, biobehavioral, resource, sexuality, activity | Reflective period of 1 week | Yes/no response, if ‘yes’ then 1–10 rating was solicited | Overall score, cluster score, intensity, and distress score | English | Turkish | |
| ULL27 (Upper Limb Lymphedema 27) | Viehoff and Wittink 2008 [ | Netherlands | 27 | 3 domains: physical, psychological, social | Not specified | 5-point Likert scale | Total score, domain score | French | Dutch, Turkish, English |
| Vatansever et al. 2020 [ | Turkey | 27 | 3 domains: physical, psychological, social | Not specified | 5-point Likert scale | Total score, domain score | French | Dutch, Turkish, English | |
| ULL-QoL (Upper Limb Lymphedema Quality of Life Questionnaire) | Williams et al. 2018 [ | Australia | 14 | 2 dimensions: physical well-being, emotional well-being | Over the previous 2 weeks | 5-point Likert scale | Total score, dimension score | English | None |
| LYMPH-Q Upper Extremity | Klassen et al. 2021 [ | Canada | 68 | 6 scales: appearance function psychological symptoms information arm sleeve | Now (appearance); past week (function, psychological, symptoms); N/A (information); most recent (arm sleeve) | 4 response options for each scale: extremely, moderately, a little, not at all (appearance and function) always, often, sometimes, never (psychological) severe, moderate, mild, none (symptoms) very dissatisfied, somewhat dissatisfied, somewhat satisfied, very satisfied (information and arm sleeve) | Scale score | English | None |
PROM = patient-reported outcome measure, QoL = quality of life.
(a) COSMIN ratings on methodology quality and results per measurement property. (b) COSMIN ratings on methodology quality and results per measurement property (continued). (c) COSMIN ratings on methodology quality and results per measurement property (continued).
| (a) | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| COSMIN Measurement Properties | LYMQoL-Arm A [ | LYMQoL-Arm B [ | LLIS ver 1 [ | ||||||
| Studies (Meth Qual Rating) | Results (Rating) | Summary of Results (Overall Rating) | Studies (Meth Qual Rating) | Results (Rating) | Summary of Results (Overall Rating) | Studies (Meth Qual Rating) | Results (Rating) | Summary of Results (Overall Rating) | |
| V/A/D/I * | +/−/? ** | +/−/±/? ** | V/A/D/I * | +/−/? ** | +/−/±/? ** | V/A/D/I * | +/−/? ** | +/−/±/? ** | |
| Content validity | Bakar 2017 (D) | Relevance: (+) | Content validity: (+) | Borman 2018 (D) | Relevance: (+) | Content validity: (+) | Degirmenci 2019 (D) | Relevance: (+) | Content validity: (+) |
| Karayurt 2021 (D) | Relevance: (+) | Haghighat 2018 (D) | Relevance: (+) | ||||||
| Structural validity | Bakar 2017 (I) | EFA → factor 1 = 0.624–0.912; factor 2 = 0.587–0.876; factor 3 = 0.376–0.866; factor 4 = 0.788–0.861 (+) | 4 factors with acceptable factor loadings (+) | Borman 2018 (I) | CFA → CMIN/df: 1.733, RMSEA: 0.074, GFI: 0.782, IFI: 0.904, CFI: 0.902, TLI: 0.888 (−) | Criteria for model fit were not met (−) | Degirmenci 2019 (I) | EFA → factor 1 = 0.214–0.770; factor 2 = 0.571–0.818; factor 3 = 0.309–0.748 (+) | 3 factors with acceptable factor loadings (+) |
| Karayurt 2021 (A) | CFA → CMIN/df: 1.86, RMSEA: 0.089, SRMR: 0.09, CFI: 0.81, GFI: 0.74, AGFI: 0.68 (−) | Haghighat 2018 (V) | CFA → NFI: 0.856, NNFI: 0.894, CFI: 0.908, MFI: 0.909, RMSEA: 0.087; EFA → factor 1 = 0.621–0.884; factor 2 = 0.651–0.821; factor 3 = 0.443–0.631 (+) | ||||||
| Internal consistency | Bakar 2017 (V) | Cronbach’s α (total) = 0.91; Cronbach’s α (domains) = 0.70–0.94 (+) | Cronbach’s α = 0.70–0.94 (+) | Borman 2018 (V) | Cronbach’s α = 0.85–0.90 (?) | (?) | Degirmenci 2019 (V) | Cronbach’s α (subscales) = 0.771–0.865; Cronbach’s α (total) = 0.916 (+) | Cronbach’s α = 0.771–0.879 (+) |
| Karayurt 2021 (V) | Cronbach’s α (total) = 0.90; Cronbach’s α (domains) = 0.78–0.86 (+) | Haghighat 2018 (V) | Cronbach’s α = 0.853–0.879 (+) | ||||||
| Cross-cultural validity/measurement invariance | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Reliability | Bakar 2017 (A) | Test-retest: ICC (total) = 0.99; ICC (domains) = 0.98–0.99 (+) | Test-retest ICC = 0.92–0.99 (+) | Borman 2018 (V) | Test-retest: ICC (total) = 0.627; ICC (domains) = 0.451–0.714 (−) | (−) | Degirmenci 2019 (V) | Test-retest: ICC (subscales) = 0.963–0.985; ICC (total) = 0.991 (+) | Test-retest ICC = 0.855–0.991 (+) |
| Haghighat 2018 (A) | Test retest: ICC (subscales) = 0.855–0.977; ICC (total) = 0.962 (+) | ||||||||
| Measurement error | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Criterion validity | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Hypothesis testing (for construct validity) | Bakar 2017 (V) | LYMQoL-Arm A and NHP r = 0.539–0.643, | Result in line with 6 hypotheses, but not with 1 hypothesis (+) | Borman 2018 (V) | Convergent validity → LYMQoL-Arm B and EORTC-BR23 (body image, future, systemic complications, breast symptoms, arm symptoms) r = 0.203 to 0.637, | Result in line with 3 hypotheses (+) | Degirmenci 2019 (V) | LLIS 1 and SF-12 rs = −0.453 to −0.703, | Result in line with 5 hypotheses, but not with 1 hypothesis (+) |
| Karayurt 2021 (V) | Known groups validity → the mean scores of LYMQoL-Arm A total (t = −4.628, | Haghighat 2018 (V) | Discriminant validity → patients with LE showed higher impairments in all three subscales compared to those without LE, | ||||||
| Responsiveness | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| ( | |||||||||
|
| |||||||||
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
| |
| Content validity | Orhan 2019 (D) | Relevance: (+) | Content validity: (+) | Devoogdt 2011 (D) | Relevance: (+) | Content validity: (+) | Ridner 2015 (D) | Relevance: (+) | Content validity: (+) |
| Sharour 2020 (D) | Relevance: (+) | Grarup 2018 (A) | Relevance: (+) | Deveci 2021 (A) | Relevance: (+) | ||||
| De Vrieze 2019 (D) | Relevance: (+) | ||||||||
| De Vrieze 2021 (D) | Relevance: (+) | ||||||||
| Zhao 2022 (A) | Relevance: (+) | ||||||||
| Structural validity | Orhan 2019 (A) | EFA → factor 1 = 0.502–0.751; factor 2 = 0.401–0.787; factor 3 = 0.426–0.844 (+) | 3 factors with acceptable factor loadings (+) | Zhao 2022 (A) | EFA → factor 1 = 0.648–0.784; factor 2 = 0.754–0.798; factor 3 = 0.419–0.802; factor 4 = 0.808–0.881; factor 5 = 0.457–0.739 (+) | 5 factors with acceptable factor loadings (+) | Deveci 2021 (A) | CFA → for intensity scale: CMIN/df: 1.52, RMSEA: 0.056, SRMR: 0.19, CFI: 0.91, GFI: 0.83, IFI: 0.91, TLI: 0.90; for distress scale: CMIN/df: 1.55, RMSEA: 0.055, SRMR: 0.27, CFI: 0.90, GFI: 0.84, IFI: 0.90, TLI: 0.893 (+) | Model fit was acceptable (+) |
| Sharour 2020 (D) | EFA → factor 1 = 0.65–0.76; factor 2 = 0.61–0.88; factor 3 = 0.60–0.72 (+) | ||||||||
| Internal consistency | Orhan 2019 (V) | Cronbach’s α (subscales) = 0.76–0.78; Cronbach’s α (total) = 0.89 (+) | Cronbach’s α = 0.76–0.923 (+) | Devoogdt 2011 (V) | Cronbach’s α (domains) = 0.72–0.92; Cronbach’s α (total) = 0.92 (+) | Cronbach’s α = 0.72–0.98 (+) | Ridner 2015 (V) | KR-20 (symptoms occurrence) = 0.88; Cronbach’s α (intensity score) = 0.93; Cronbach’s α (distress score) = 0.94 (+) | KR-20 = 0.83–0.88; Cronbach’s α = 0.68–0.94 (+) |
| Sharour 2020 (V) | Cronbach’s α (subscales) = 0.861–0.901; Cronbach’s α (total) = 0.923 (+) | Grarup 2018 (V) | Cronbach’s α (domains) = 0.92–0.97; Cronbach’s α (total) = 0.98 (+) | Deveci 2021 (V) | KR-20 (symptoms occurrence) = 0.83; Cronbach’s α (intensity score) = 0.76–0.86; Cronbach’s α (distress score) = 0.68–0.86 (+) | ||||
| De Vrieze 2019 (V) | Cronbach’s α (domains) = 0.89–0.98; Cronbach’s α (total) = 0.98 (+) | ||||||||
| De Vrieze 2021 (V) | Cronbach’s α (domains) = 0.77–0.89; Cronbach’s α (total) = 0.95 (+) | ||||||||
| Zhao 2022 (V) | Cronbach’s α (domains) = 0.789–0.910; Cronbach’s α (total) = 0.918 (+) | ||||||||
| Cross-cultural validity/measurement invariance | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Reliability | Orhan 2019 (A) | Test-retest: ICC (subscales) = 0.88–0.93; ICC (total) = 0.91 (+) | Test-retest ICC = 0.88–0.93 (+) | Devoogdt 2011 (I) | Test-retest: ICC (domains) = 0.65–0.91; ICC (total) = 0.93 (+) | Test-retest ICC = 0.65–0.95 (+) | Ridner 2015 (D) | Test-retest: ICC (clusters) = 0.67–0.97; ICC (intensity) = 0.93; ICC (distress) = 0.92 (+) | Test-retest ICC = 0.67–0.93 (+) |
| Grarup 2018 (D) | Test-retest: ICC (domains) = 0.88–0.94; ICC (total) = 0.95 (+) | ||||||||
| De Vrieze 2019 (I) | Test-retest: ICC (domains) = 0.79–0.93; ICC (total) = 0.95 (+) | ||||||||
| De Vrieze 2021 (I) | Test-retest: ICC (domains) = 0.66–0.95; ICC (total) = 0.91 (+) | ||||||||
| Zhao 2022 (V) | Test-retest: ICC (domains) = 0.801–0.834; ICC (total) = 0.828 (+) | ||||||||
| Measurement error | N/A | N/A | N/A | Devoogdt 2011 (I) | Variability → SEM (total) = 4.8; SEM (domains) = 7.0–12.5; Clinically Important Changes → SDC (total) = 13.4; SDC (domains) = 19.4–34.6 (+) | SEM = 4.51–12.6; SDC = 12.5–34.91 (+) | N/A | N/A | N/A |
| Grarup 2018 (D) | Variability → SEM (total) = 4.51; SEM (domains) = 5.69–10.21; Clinically Important Changes → SDC (total) = 12.5; SDC (domains) = 15.8–28.3 (+) | ||||||||
| De Vrieze 2019 (I) | Variability → SEM (total) = 4.89; SEM (domains) = 6.31–12.31; Clinically Important Changes → SDC (total) = 13.56; SDC (domains) = 17.49–34.13 (+) | ||||||||
| De Vrieze 2021 (I) | Variability → SEM (total) = 5.54; SEM (domains) = 6.28–12.6; Clinically Important Changes → SDC (total) = 15.35; SDC (domains) = 17.4–34.91 (+) | ||||||||
| Criterion validity | Orhan 2019 (V) | LLIS 2 (subscales) and LVD r = 0.30–0.36, | Weak correlation with gold measurement standard (LVD) r < 0.40 (−) | N/A | N/A | N/A | N/A | N/A | N/A |
| Hypothesis testing (for construct validity) | Orhan 2019 (V) | Convergent validity → LLIS 2 and LYMQOL (subscales) r = 0.52–0.82, | Result in line with 7 hypotheses (+) | Devoogdt 2011 (V) | Convergent validity → Lymph-ICF-UL and SF-36 (bodily pain, mental health, physical functioning, social functioning) r = −0.33 to −0.70; Divergent validity → Lymph-ICF-UL and SF-36 (role-emotional, mental health, physical functioning, role-physical) r = 0.03 to −0.42; Known-groups validity → the scores on 26 of 29 questions were significantly higher for LE patients compared to non-LE patients, | Result in line with 75 hypotheses, but not with 15 hypotheses (+) | Ridner 2015 (V) | Convergent validity → LSIDS-A and FACT-G rs = −0.20 to −0.53; LSIDS-A and FACT-B+4 rs = −0.41 to −0.50; LSIDS-A and ULL-27 rs = −0.29 to −0.52; LSIDS-A and FASQ rs = 0.25–0.47; LSIDS-A and CES-D rs = 0.29–0.65; LSIDS-A and FACT rs = −0.46 to −0.50; LSIDS-A and POMS-SF rs = 0.07–0.36; Divergent validity → LSIDS-A and MCSDS rs = 0.01 to −0.25 (8+, 6-) | Result in line with 9 hypotheses, but not with 6 hypotheses (−) |
| Sharour 2020 (V) | Convergent validity → LLIS 2 (total) and EORTC QLQ-C30 (functional and symptoms) r = 0.81 to −0.84; LLIS 2 (subscales) and EORTC QLQ-C30 (functional) r = −0.79 to −0.87; LLIS 2 (subscales) and EORTC QLQ-C30 (symptoms) r = 0.73–0.81 (3+) | De Vrieze 2019 (V) | Convergent validity → Lymph-ICF-UL and SF-36 (bodily pain, mental health, physical functioning, social functioning) r = −0.224 to −0.661; Divergent validity → Lymph-ICF-UL and SF-36 (role-emotional, mental health, physical functioning, role-physical) rs = −0.191 to −0.607 (11+, 3-) | Deveci 2021 (V) | Known groups validity → there was a significantly higher mean score in patients with active LE compared to patients with latent LE (1+) | ||||
| De Vrieze 2021 (V) | Convergent validity → Lymph-ICF-UL and SF-36 (bodily pain, mental health, physical functioning, social functioning) rs = −0.156 to −0.704; Divergent validity → Lymph-ICF-UL and SF-36 (role-emotional, mental health, physical functioning, role-physical) rs = −0.144 to −0.499 (9+, 5-) | ||||||||
| Zhao 2022 (V) | Convergent validity → Lymph-ICF-UL and SF-36 (bodily pain, mental health, physical functioning, social functioning) r = −0.371 to −0.563; Lymph-ICF-UL and EORTC-QLQ-C30 r = 0.230 to −0.457; Divergent validity → Lymph-ICF-UL and SF-36 (role-emotional, mental health, physical functioning, role-physical) r = −0.102 to −0.376; Discriminant validity → patients with LE showed more impairments than patients without LE ( | ||||||||
| Responsiveness | N/A | N/A | N/A | De Vrieze 2020 (V) | Internal responsiveness → there were: a significant changes in mean total score between pre- and postintensive treatment ( | Results in line with 5 hypotheses, but not with 1 hypothesis (+) | N/A | N/A | N/A |
| ( | |||||||||
|
| |||||||||
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
| |
| Content validity | Viehoff 2008 (D) | Relevance: (+) | Content validity: (+) | Williams 2018 (D) | Relevance: (+) | Content validity: (+) | Klassen 2021 (D) | Relevance: (+) | Content validity: (+) |
| Vatansever 2020 (D) | Relevance: (+) | ||||||||
| Structural validity | Vatansever 2020 (I) | CFA → RMSEA = 0.074; CFI = 0.97; IFI = 0.97; GFI = 0.96 (+) | Model fit was acceptable (+) | Williams 2018 (A) | EFA → factor 1 = 0.348–0.852; factor 2 = 0.375–0.870 (+) | 2 factors with acceptable factor loadings (+) | Klassen 2021 (A) | Rasch: item fit was within ±2.5 for 27 of the 68 items (−) | Not all model fit was reported (−) |
| Internal consistency | Viehoff 2008 (V) | Cronbach’s α = 0.78–0.92 (?) | Cronbach’s α = 0.75–0.93 (+) | Williams 2018 (V) | Cronbach’s α = 0.87 (+) | (+) | Klassen 2021 (V) | Cronbach’s α (scales) = 0.89–0.97 (?) | (?) |
| Vatansever 2020 (V) | Cronbach’s α (dimensions) = 0.75–0.90; Cronbach’s α (total) = 0.93 (+) | ||||||||
| Cross-cultural validity/measurement invariance | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Reliability | Vatansever 2020 (I) | Test-retest r = 0.40, | r = 0.40, | Williams 2018 (A) | Test-retest ICC (total) = 0.93 (+) | (+) | Klassen 2021 (D) | Test-retest ICC (scales) = 0.92–0.96 (+) | (+) |
| Measurement error | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Criterion validity | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Hypothesis testing (for construct validity) | Viehoff 2008 (V) | Convergent validity → ULL-27 and RAND-36 rs = 0.45–0.69; Discriminant validity → there was a significant difference in total scores and all domain scores between LE and non-LE groups, | Result in line with 11 hypotheses, but not with 14 hypotheses (−) | Williams 2018 (V) | Convergent validity → ULL-QoL and EQ-5D-3L r = −0.44 to −0.59; ULL-QoL (physical well-being) and SF-36 (PCS) r = −0.57; Divergent validity → ULL-QoL and % excess limb volume r = 0.12–0.18; ULL-QoL and SF-36 r = −0.31 to −0.43; ULL-QoL (emotional well-being) and EQ-5D-3L (utility scores) r = −0.50 (7+,1-) | Result in line with 7 hypotheses, but not with 1 hypothesis (+) | Klassen 2021 (V) | The correlation between symptoms, function, appearance, psychological, arm sleeve with each other was higher than with information (r = >0.50); All six scales were associated with increased severity of arm swelling, reporting of arm problem caused by cancer treatments, and wearing of a compression sleeve to reduce or prevent swelling in the past 12 months (3+, 1-) | Result in line with 3 hypotheses, but not with 1 hypothesis (+) |
| Vatansever 2020 (V) | Convergent validity → ULL-27 and EORTC QLQ-C30 (QL2, PF2, RF2, EF, SF, FA, NV, PA, DY, SL, AP) r = −0.221 to −0.546, | ||||||||
| Responsiveness | N/A | N/A | N/A | Williams 2018 (D) | LE transition to better → Mean change (SD of changes scores) = −5.4 (19.0) to −8.9 (17.7); MSRM = 0.30–0.64; LE transition to worse → Mean change (SD of changes scores) = 8.4 (13.8)–15.0 (27.7); MSRM = 0.61–0.83 (2+) | Result in line with 2 hypotheses (+) | N/A | N/A | N/A |
* V = very good, A = adequate, D = doubtful, I = inadequate; ** + = sufficient, - = insufficient, ± = inconsistent, ? = indeterminate; meth qual = methodological quality, LYMQOL-Arm A = Lymphedema Quality of Life Tool-Arm A, LYMQoL-Arm B = Lymphedema Quality of Life Tool-Arm B, LLIS 1 = Lymphedema Life Impact Scale version 1, LLIS 2 = Lymphedema Life Impact Scale version 2, Lymph-ICF-UL = Lymphedema Functioning, Disability, and Health Questionnaire for Upper Limb, LSIDS-A = Lymphedema Symptom Intensity and Distress Survey-Arm, ULL27 = Upper Limb Lymphedema 27, ULL-QoL = Upper Limb Lymphedema Quality of Life Questionnaire, EQ-5D-3L = EuroQol 5D three level version, EQ-VAS = EuroQol visual analogue scale, NHP = Nottingham health profile, EFA = exploratory factor analysis, ICC = intraclass correlation coefficient, CFA = confirmatory factor analysis, SF-36 = Short form 36, EORTC-BR23 = European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Breast Cancer-Specific Version, FACT-B4 = Functional assessment of cancer therapy breast-4, CMIN/df = Satorra-Bentler scaled chi-square/degree of freedom, RMSEA = root mean square error of approximation, SRMR = standardized root mean square residual, GFI = goodness-of-fit index, IFI = incremental fit index, CFI = comparative fit index, TLI = Trucker-Lewis index, LEFS = Lower extremity functional scale, PCA = principal component analysis, UL = upper limb, LL = lower limb, EORTC QLQ-C30 = European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Core 30, DASH = disabilities of arm shoulder and hand, NFI = Bentler-Bonnet normed fit index, NNFI = Bentler-Bonnet non-normed fit index, MFI = McDonald fit index, LVD = limb volume difference, SEM = standard error measurement, SRD = smallest real difference, KMO = Kaiser-Mayer Olkin, ADL = activity daily living, AUC = area under the ROC curve, CI = confidence interval, RP = rehabilitation program, LS = liposuction, FACT-G = Functional assessment of cancer therapy general, FASQ = Functional assessment screening questionnaire, CES-D = Center for epidemiologic studies-depression, POMS-SF = Profile of mood states short form, MCSDS = Marlowe–Crowne social desirability scale, KR-20 = Kuder–Richardson-20, SRM = standardized response mean, GPE = global perceived effect, MCID = minimal clinically important difference, MSRM = modified standardized response mean, N/A = not applicable.
Quality of evidence for measurement properties of PROMs.
| PROM * (ref) | Quality of Evidence Rating (GRADE **) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Content Validity | Structural Validity | Internal Consistency | Cross-Cultural Validity | Reliability | Measurement Error | Criterion Validity | Hypothesis Testing | Responsiveness | |||
|
|
|
| |||||||||
| LYMQOL-Arm A [ | Moderate | Moderate | Low | Moderate | High | N/A | Low | N/A | N/A | High | N/A |
| LYMQOL-Arm B [ | Low | Low | Low | Very Low | High | N/A | Moderate | N/A | N/A | High | N/A |
| LLIS 1 [ | Moderate | Moderate | Moderate | Moderate | Moderate | N/A | Moderate | N/A | N/A | Moderate | N/A |
| LLIS 2 [ | Low | Low | Low | Low | Moderate | N/A | Very Low | N/A | Moderate | High | N/A |
| Lymph-ICF-UL [ | High | High | High | Moderate | High | N/A | High | Low | N/A | High | Moderate |
| LSIDS-A [ | Low | Moderate | Low | Moderate | High | N/A | Very Low | N/A | N/A | High | N/A |
| ULL-27 [ | Low | Low | Low | Very Low | High | N/A | Very Low | N/A | N/A | High | N/A |
| ULL-QoL [ | High | High | Moderate | Moderate | High | N/A | Very Low | N/A | N/A | High | Very Low |
| LYMPH-Q Upper Extremity [ | Moderate | Moderate | Moderate | Moderate | High | N/A | Very Low | N/A | N/A | High | N/A |
PROM * = patient-reported outcome measure; GRADE ** = Grading of Recommendation Assessment, Development, and Evaluation; LYMQOL-Arm A = Lymphedema Quality of Life Tool-Arm A, LYMQoL-Arm B = Lymphedema Quality of Life Tool-Arm B, LLIS 1 = Lymphedema Life Impact Scale version 1, LLIS 2 = Lymphedema Life Impact Scale version 2, Lymph-ICF-UL = Lymphedema Functioning, Disability, and Health Questionnaire for Upper Limb, LSIDS-A = Lymphedema Symptom Intensity and Distress Survey-Arm, ULL27 = Upper Limb Lymphedema 27, ULL-QoL = Upper Limb Lymphedema Quality of Life Questionnaire, N/A = not applicable.