| Literature DB >> 35387508 |
Gabrielle Simoneau1, Fabio Pellegrini2, Thomas Pa Debray3, Julie Rouette4, Johanna Muñoz3, Robert W Platt5, John Petkau6, Justin Bohn7, Changyu Shen7, Carl de Moor7, Mohammad Ehsanul Karim8.
Abstract
BACKGROUND: With many disease-modifying therapies currently approved for the management of multiple sclerosis, there is a growing need to evaluate the comparative effectiveness and safety of those therapies from real-world data sources. Propensity score methods have recently gained popularity in multiple sclerosis research to generate real-world evidence. Recent evidence suggests, however, that the conduct and reporting of propensity score analyses are often suboptimal in multiple sclerosis studies.Entities:
Keywords: Comparative effectiveness; covariate selection; inverse probability of treatment weighting; observational study; propensity score matching; real-world data
Mesh:
Year: 2022 PMID: 35387508 PMCID: PMC9260471 DOI: 10.1177/13524585221085733
Source DB: PubMed Journal: Mult Scler ISSN: 1352-4585 Impact factor: 5.855
Figure 1.Number of publications by year identified on PubMed with the search query (multiple sclerosis) and (propensity score).
A brief explanation of conditions required for the use of propensity score methods.
| Condition | Description | Example |
|---|---|---|
| No unmeasured confounding | All confounders must be measured in the data source. If two confounders are strongly correlated, it is sufficient to measure only one of them (exchangeability). This condition cannot be verified in practice, but sensitivity analyses can quantify the impact of violations | The number of gadolinium-enhancing lesions may be a confounder in the treatment–outcome relationship of interest, yet the data source for the study (e.g. a claims database) does not capture it. Then, the condition of no unmeasured confounding is violated |
| Positivity | Each patient should a priori be eligible for both treatments. Positivity violations may be deterministic (e.g. the patient with comorbidity is not eligible to receive the treatment) or random (e.g. the patient is eligible, but the sample was too small to capture that aspect). This condition may be verified empirically | In a study comparing the effectiveness of two DMTs, the study period should be restricted to the time period when both drugs were available to patients for positivity to be satisfied |
| Consistency | Also known as “well-defined treatment” or “no multiple versions of treatment.” This condition requires that there be only one version of the treatment. When multiple versions of the treatment exist (e.g. different doses), disease area expert consensus should inform whether considering the different versions as the same treatment is warranted | Patients receiving either a low- or high-dose interferon beta-1 may be grouped into one treatment arm. For consistency to hold, the effect of low- or high-dose interferon beta-1 on the outcome should be the same |
| No interference | This condition requires that a patient’s outcome is not influenced by other patients’ treatment assignments. For example, this condition is violated when the outcome is an infectious disease, where treating an individual may protect others from infection. In MS, this condition likely holds, although available data rarely allow its plausibility to be assessed | If a participant is prescribed a DMT, and his spouse is not (while both are in a study), but they share the medication, then this condition is violated |
DMT: disease-modifying therapy; MS: multiple sclerosis; PS: propensity score; RWD: real-world data.
Summary of recommendations on the reporting of propensity score methods in multiple sclerosis research.
| Manuscript section | Proposed items to report |
|---|---|
| Abstract | 1. Indicate target estimand (e.g. ATE, ATT) |
| Introduction | 1. State research question |
| Methods | Covariate selection for PS model |
| Results | 1. Report sample size at each stage (eligible, included,
analyzed) |
| Discussion | 1. Interpret effect estimate in relation with research question,
choice of PS approach, target population and
estimand |
| Appendix | 1. Provide results from additional analyses (subgroup,
interactions or effect modifications, sensitivity
analyses). |
ATE: average treatment effect; ATT: average treatment effect in the treated; HdPS: high-dimensional propensity score; PS: propensity score; SMD: standardized mean difference.
Figure 2.Mirrored histograms showing the distributions of the estimated propensity scores by hypothetical treatment groups (DMT A vs DMT B). The distributions show good overlap. The data presented in this figure are based on a simulated data set.
DMT: disease-modifying therapy.
Figure 3.Love plot with absolute standardized mean differences between two hypothetical treatment groups (DMT A vs DMT B) for a subset of covariates before and after matching. The vertical dotted line represents the threshold of 0.1 under which balance is considered acceptable. The data presented in this figure are based on a simulated data set.
DMT: disease-modifying therapy, EDSS: Expanded Disability Status Scale, GdE: gadolinium-enhancing, MSFC: multiple sclerosis functional composite.
Brief explanation of most common target of inferences (estimands).
| Estimand | Description | Example clinical question | Example |
|---|---|---|---|
| Average treatment effect (ATE) | Treatment effect in the entire population, that is, in a population of patients with baseline characteristics similar to those of patients who received either the treatment or the control. | “How would the outcome have differed if the entire population had been treated or if instead the entire population had received the control treatment?” | A study compares drug A (the treatment) to drug B (the control) in adults, where patients who receive drug A are, on average, older than those receiving drug B. The ATE estimates the effect of drug A vs drug B in a population of adult patients of any age. |
| Average treatment effect in the treated (ATT) | Treatment effect in the patient population who actually received the treatment, that is, in a population of patients with baseline characteristics similar to those of patients who received the treatment. | “How would the outcome in the treated patients have differed if those patients had instead received the control treatment?” | In the same study, the ATT estimates the effect of drug A vs drug B in a population of patients similar to those who received drug A (the treatment), so older patients. |
| Average treatment effect in the control (ATC) | Treatment effect in the patient population who received the control, that is, in a population of patients with baseline characteristics similar to those of patients who received the control. | “How would the outcome in the control patients have differed if those patients had instead received the treatment?” | In the same study, the ATC estimates the effect of drug A vs drug B in a population of patients similar to those who received drug B (the control), so younger patients. |
| Average treatment effect in the overlap population (ATO) | Treatment effect in the patient population in equipoise between treatments, that is, in a population of patients with baseline characteristics which could appear with high probability in either treatment group | “How would the outcome have differed if patients in equipoise between treatments had been treated or if instead these patients had not been treated?” | In the same study, the ATO estimates the effect of drug A vs drug B in a population of patients with characteristics likely to appear in both treatment groups, so middle-aged patients. |