| Literature DB >> 24359184 |
Bradley C Johnston1, Donald L Patrick, Kristian Thorlund, Jason W Busse, Bruno R da Costa, Holger J Schünemann, Gordon H Guyatt.
Abstract
Systematic reviews and meta-analyses of randomized trials that include patient-reported outcomes (PROs) often provide crucial information for patients, clinicians and policy-makers facing challenging health care decisions. Based on emerging methods, guidance on improving the interpretability of meta-analysis of patient-reported outcomes, typically continuous in nature, is likely to enhance decision-making. The objective of this paper is to summarize approaches to enhancing the interpretability of pooled estimates of PROs in meta-analyses. When differences in PROs between groups are statistically significant, decision-makers must be able to interpret the magnitude of effect. This is challenging when, as is often the case, clinical trial investigators use different measurement instruments for the same construct within and between individual randomized trials. For such cases, in addition to pooling results as a standardized mean difference, we recommend that systematic review authors use other methods to present results such as relative (relative risk, odds ratio) or absolute (risk difference) dichotomized treatment effects, complimented by presentation in either: natural units (e.g. overall depression reduced by 2.4 points when measured on a 50-point Hamilton Rating Scale for Depression); minimal important difference units (e.g. where 1.0 unit represents the smallest difference in depression that patients, on average, perceive as important the depression score was 0.38 (95% CI 0.30 to 0.47) units less than the control group); or a ratio of means (e.g. where the mean in the treatment group is divided by the mean in the control group, the ratio of means is 1.27, representing a 27% relative reduction in the mean depression score).Entities:
Mesh:
Year: 2013 PMID: 24359184 PMCID: PMC3984637 DOI: 10.1186/1477-7525-11-211
Source DB: PubMed Journal: Health Qual Life Outcomes ISSN: 1477-7525 Impact factor: 3.186
Five approaches to presenting pooled PRO variables when primary studies have used different instruments to measure the same construct
| (A) Standard deviation (SD) units (standardized mean difference; effect size) | The pooled mean difference is presented in standard deviation units | (+) Widely used | (-) Interpretation challenging | Consider complimenting other approaches with this; it is not recommended to use this approach independently. |
| (-) Misleading when trial SDs are heterogeneous | ||||
| (B) Natural units | Linear transformation of trial data to most familiar scale | (+) Easier to interpret if scale well-known | (-) Few instruments in clinical practice are easy to interpret | Approaches to conversion to natural units include those based on SD units and re-scaling approaches. We suggest the latter. In rare situations when instrument very familiar to front line clinicians seriously consider this presentation |
| (C) Relative and absolute dichotomized effects | Obtain proportion above threshold in both groups and calculate relative or absolute binary effect measure | (+) Very familiar to clinical audiences | (-) Involve statistical assumptions that may be questionable | If the minimal important difference is known use this strategy in preference to relying on SD units |
| Always seriously consider this option | ||||
| (D) Ratio of means | The ratio between the mean responses in the intervention and control group | (+) May be easily interpretable to clinical audience | (-) Not applicable for change scores | Consider as complementing other approaches, particularly the presentation of relative and absolute effects |
| (+) Fewer questionable assumptions | (-) Interpretation requires knowledge of control group mean | |||
| (E) Minimal important difference units | The pooled mean differences is presented in MID units | (+) May be easily interpretable to clinical audience | (-) Only applicable when minimally important difference is known | Consider as complementing other approaches, particularly the presentation of relative and absolute effects |
Application of summary approaches to paroxetine vs placebo for major depression in adults
| The depression score in the paroxetine groups was on average 0.31 SDs (0.24 to 0.38 lower than in the placebo groups) | --- | 5736 (34) | As a rule of thumb, 0.2 SD represents a small difference, 0.5 moderate, and 0.8 large (Cohen, 1988) | |||
| Major depression measured on Hamilton Rating Scale for Depression, generally scored from 0 to 50, higher scores indicate more severe depression | The mean depression scores with placebo ranged from 3.1 to 11.3 | The mean depression score in the intervention groups was on average 2.47 (1.91 to 3.03) lower | | 5736 (34) | Scores estimated based on an SMD of 0.31 (95% CI 0.24 to 0.38)The minimal important difference on the 0 to 50 depression scale is 7 points. Although the depression score was on average only 2.47 lower, the corresponding NNT is 11 | |
| 50 per 100 patients | 39 per 100 patients | OR=1.64 (95% CI 1.47 to 1.84) | 5736 (34) | This approach uses binomial and equal variance assumptions and baseline risks, and demonstrates that for every 100 patients treated with paroxetine, 11 will achieve important improvement | ||
| Differences in proportion achieving important improvement | ||||||
| 0.11 (95% CI 0.07 to 0.16) in favor of paroxetine | ||||||
| --- | --- | Ratio of means | 5736 (34) | Weighted average of the mean depression score in paroxetine group divided by mean depression score in placebo. RoM method provides similar effect estimates compared with the traditionally used standard deviation unit, with SMDs of 0.2, 0.5, and 0.8, corresponding to increases in RoM of approximately 8%, 22%, and 37%, respectively (Friedrich 2011). | ||
| 1.27 (1.18 to 1.36) | ||||||
| The depression score in the paroxetine groups was on average 0.38 (95% CI 0.30 to 0.47) minimal important difference units less than the control group | --- | 5736 (34) | An effect less than half the minimal important difference suggests a small effect | |||
Note: Investigators measured depression using different instruments, higher scores indicate more severe depression. 1Quality rating from 1 (very low quality) to 4 (high quality); 2Evidence limited by heterogeneity between studies; 3Evidence limited by risk of bias (i.e. missing participant data and potential for selective reporting bias).
For situations in which the event is undesirable, reduction [or increase if intervention harmful] in adverse events with the intervention
| SMD = -0.2 | -0.03 | -0.05 | -0.07 | -0.08 | -0.08 | -0.08 | -0.07 | -0.06 | -0.040 |
| SMD = -0.5 | -0.06 | -0.11 | -0.15 | -0.17 | -0.19 | -0.20 | -0.20 | -0.17 | -0.12 |
| SMD = -0.8 | -0.08 | -0.15 | -0.21 | -0.25 | -0.29 | -0.31 | -0.31 | -0.28 | -0.22 |
| SMD = -1.0 | -0.09 | -0.17 | -0.24 | -0.23 | -0.34 | -0.37 | -0.38 | -0.36 | -0.29 |
For situations in which the event is desirable, increase [or decrease if intervention harmful] in positive responses to the intervention
| SMD = 0.2 | 0.04 | 0.61 | 0.07 | 0.08 | 0.08 | 0.08 | 0.07 | 0.05 | 0.03 |
| SMD = 0.5 | 0.12 | 0.17 | 0.19 | 0.20 | 0.19 | 0.17 | 0.15 | 0.11 | 0.06 |
| SMD = 0.8 | 0.22 | 0.28 | 0.31 | 0.31 | 0.29 | 0.25 | 0.21 | 0.15 | 0.08 |
| SMD = 1.0 | 0.29 | 0.36 | 0.38 | 0.38 | 0.34 | 0.30 | 0.24 | 0.17 | 0.09 |