| Literature DB >> 31193216 |
Maren K Olsen1,2, Karen M Stechuchak1, Karen E Steinhauser1,3,4.
Abstract
In a two-arm randomized trial where both arms receive active treatment (i.e., treatments A and B), often the primary goal is to determine which of the treatments, on average, is more effective. A supplementary objective is to understand possible heterogeneity in the treatment effect by identifying multivariable subgroups of patients for whom A is more effective than B and, conversely, patients for whom B is more effective than A, known as a qualitative interaction. This is the objective of the qualitative interaction trees (QUINT) algorithm developed by Dusseldorp et al (Statistics in Medicine, 2014). We apply QUINT to a small randomized trial comparing facilitated relaxation meditation to facilitated life completion and preparation among patients with life-limiting illness (n = 135). We then conduct an internal validation of the QUINT solution using bootstrap resampling and compare it to an external validation with another, similarly conducted small randomized trial. Internal and external validation showed the apparent range in effect sizes was over-estimated, and subgroups identified were not consistent between the two trials. While the qualitative interaction trees algorithm is a promising area of data-driven multivariable subgroup discovery, our analyses illustrate the importance of validating the solution, particularly for trials with smaller numbers of participants.Entities:
Year: 2019 PMID: 31193216 PMCID: PMC6523033 DOI: 10.1016/j.conctc.2019.100372
Source DB: PubMed Journal: Contemp Clin Trials Commun ISSN: 2451-8654
Variables included in QUINT specification for both Original and External Studies.
| Variable Name used in QUINT | Original Study N = 135 | External Study N = 149 | |
|---|---|---|---|
| Age, mean (SD) | Age | 62.5 (13.3) | 68.5 (9.6) |
| Highest level of education is high school or less, n (%) | EDUC_HS_OR_LESS | 60 (44.4) | 67 (45.0) |
| Married, n (%) | Married | 73 (54.1) | 84 (56.4) |
| Non-Hispanic white race, n (%) | White_Race | 69 (51.1) | 78 (52.3) |
| High score on Palliative Performance Scale, n (%) [ | HighPPS | 110 (81.5) | 110 (73.8) |
| Cancer diagnosis | CancerDX | 94 (69.6) | 68 (45.6) |
| Insecure financial situation, n (%) | InsecureFinance | 59 (43.7) | 65 (43.6) |
| Faith or spirituality is “Very important” in life | VeryImpFaith | 100 (74.1) | 98 (65.8) |
| Functional impairment [ | Functional_Impairment | 5.8 (1.8) | 6.5 (1.9) |
| Functional impairment (walk and groom self) [ | Walk_Groom_Impair | 2.3 (0.7) | 2.3 (0.6) |
| Fundamental-functional impairment [ | FundFunc_Impair | 5.5 (1.1) | 5.6 (1.3) |
| Instrumental-functional impairment [ | Instr_Impairment | 9.9 (3.1) | 9.5 (3.0) |
| QUAL-E Preparation [ | Baseline_Preparation | 14.0 (4.0) | 15.1 (3.4) |
| QUAL-E Life completion [ | Baseline_LifeComp | 27.3 (5.1) | 26.2 (5.5) |
| QUAL-E Relationship with health system [ | RelationshipHealthSystem | 19.9 (3.7) | 19.2 (3.7) |
| QUAL-E Quality of life item [ | QualityOfLife | 3.7 (0.8) | 3.6 (0.8) |
| CES-D-10 [ | CESD10 | 8.8 (6.1) | 8.4 (5.6) |
| FACIT-Sp Faith [ | Faith | 11.6 (4.0) | 11.1 (3.7) |
| FACIT-Sp Peace [ | Peace | 11.1 (3.4) | 10.9 (3.2) |
| FACIT-Sp Meaning [ | Meaning | 12.6 (3.0) | 12.0 (3.0) |
| Anxiety12, | Anxiety | 4.3 (4.8) | 5.7 (4.3) |
| Daily Spiritual Experience Scale [ | SpiritualExp | 2.5 (1.1) | 2.7 (1.0) |
| FACT-G Emotional well-being [ | Emotional | 18.5 (4.5) | 19.2 (4.0) |
| FACT-G Physical well-being [ | Physical | 18.6 (6.2) | 19.6 (5.3) |
| FACT-G Functional well-being [ | Functional | 15.8 (6.8) | 14.4 (5.7) |
| FACT-G Social well-being23, | Social | 18.5 (4.8) | 17.7 (4.7) |
QUAL-E = a 31-item validated measure of quality of life at the end of life; CES-D = Centers for Epidemiological Study depression scale; FACIT-Sp = Functional Assessment of Chronic Illness Therapy - Spiritual Well-Being; FACT-G = Functional Assessment of Cancer Therapy –General.
Note, variables described by means (SD) are continuous variables, variables described by n (%) are dichotomous variables. SD = standard deviation.
Original study: Cancer diagnosis vs. congestive heart failure or end-stage renal disease. External study: Cancer diagnosis vs. congestive heart failure, end-stage renal disease, chronic obstructive pulmonary disease, or end-stage liver disease.
Faith or spiritual importance: “Very important” vs. “Somewhat” or “Not at All Important”.
Profile of Mood States Anxiety subscale, one of 6 items not queried in survey and thus omitted from score calculation.
Satisfaction with sex life question omitted from score calculation.
Fig. 1a. Anxiety: Pruned QUINT Solution in Original Study. Vertical axis represents effect size (d). Subgroup 1 (n = 58) had meaning scores ≤ 12.5 and showed greater improvement in anxiety in RM compared to Outlook (d = 0.32). Subgroup 2 (n = 77) had meaning scores > 12.5 and showed greater improvement in anxiety in Outlook compared to RM (d = −0.78). b. Anxiety: Pruned QUINT Solution in External Study. Vertical axis represents effect size (d). The following subgroups showed greater improvement in anxiety in RM compared to Outlook: Subgroup 1 = physical well-being scores ≤23.5, social well-being scores ≤19.1, and meaning scores ≤9.5 (n = 18, d = 0.88); Subgroup 2 = physical well-being scores ≤23.5, social well-being scores ≤19.1, meaning scores >9.5, and were not of white race (n = 26, d = 0.42); Subgroup 4 = physical well-being ≤23.5, social well-being scores >19.1 (n = 41, d = 1.35). The following subgroups showed greater improvement in anxiety in Outlook compared to RM: Subgroup 3 = physical well-being scores ≤23.5, social well-being scores ≤19.1, meaning scores >9.5, and were of white race (n = 25, d = −1.37); Subgroup 6 = physical well-being scores >23.5 and social well-being scores >19.5 (n = 18, d = −1.71). The following subgroup for whom treatment arm did not make a difference: Subgroup 5 = physical well-being scores >23.5, social well-being scores ≤19.5 (n = 21, d = −0.09).
QUINT solution range in effect sizes: Original study, internal validation, and external study.
| Outcome | Apparent Performance | Internal Mean Optimism ( | Optimism- corrected Range (Rangeappcorr) | Estimated population value range in effect sizes (Rangeex) |
|---|---|---|---|---|
| Anxiety | 1.10 | 0.76 | 0.35 | 0.01 |
| Preparation | 2.50 | 1.54 | 0.97 | 0.31 |
The apparent performance is defined as the difference between the largest effect size (d) favoring RM and the largest effect size favoring Outlook. A larger apparent performance indicates greater separation in effects between subgroups. Because our minimum threshold effect size is 0.3, the minimum apparent performance is 0.6.
Internal mean optimism: this quantity is derived via bootstrap samples of the original study and provides an estimate of how “over optimistic” the range in d's in the original study may be.
Optimism-corrected range: this is the difference between the apparent performance and the internal mean optimism. For our analysis, values under 0.6 suggest a lack of generalizability of the original solution (i.e., effect size favoring RM is less than 0.3 and effect size favoring Outlook is less than 0.3).
Estimate of the population value in range of d's: after creating subgroups in the external study equivalent to those identified in the original study, this is the difference between the subgroup with the largest d and the subgroup with the smallest d. A value close to the optimism-corrected range provides evidence of generalizability of the QUINT solution.
Fig. 2a. Preparation: Pruned QUINT Solution in Original Study. Vertical axis represents effect size (d). The following subgroup showed greater improvement in preparation in RM compared to Outlook: Subgroup 3 = Participants with baseline preparation >14.5 and functional impairment scores >6.5 (n = 25, d = 1.94). The following subgroup showed greater improvement in preparation in Outlook compared to RM: Subgroup 1 = participants with baseline preparation ≤14.5 (n = 67, d = −0.56). The following subgroup for whom treatment arm did not make a difference: Subgroup 2 = baseline preparation >14.5 and functional impairment scores ≤6.5, n = 43, d = 0.02). b. Preparation: Pruned QUINT solution in External Study. Vertical axis represents effect size (d). The following subgroup showed greater improvement in preparation in RM compared to Outlook: Subgroup 1 = physical well-being scores ≤24.5 (n = 124, d = 0.41). The following subgroup showed greater improvement in preparation in Outlook compared to RM: Subgroup 2 = physical well-being scores >24.5 (n = 25, d = −0.65).