| Literature DB >> 35962310 |
Angély Loubert1,2, Antoine Regnault3,4, Véronique Sébille4,5, Jean-Benoit Hardouin4,5.
Abstract
BACKGROUND: Meaningfully interpreting patient-reported outcomes (PRO) results from randomized clinical trials requires that the PRO scores obtained in the trial have the same meaning across patients and previous applications of the PRO instrument. Calibration of PRO instruments warrants this property. In the Rasch measurement theory (RMT) framework, calibration is performed by fixing the item parameter estimates when measuring the targeted concept for each individual of the trial. The item parameter estimates used for this purpose are typically obtained from a previous "calibration" study. But imposing this constraint on item parameters, instead of freely estimating them directly in the specific sample of the trial, may hamper the ability to detect a treatment effect. The objective of this simulation study was to explore the potential negative impact of calibration of PRO instruments that were developed using RMT on the comparison of results between treatment groups, using different analysis methods.Entities:
Keywords: Calibration; Clinical trials; Patient-reported outcomes; Rasch measurement theory
Mesh:
Year: 2022 PMID: 35962310 PMCID: PMC9375403 DOI: 10.1186/s12874-022-01680-z
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Fig. 1Illustration of the archetypes of items distribution, for different scenarios. Legend: Vertical dashed lines represent the item response category thresholds (δjl, with each color corresponding to a different item) in different scenarios, and the probability density function curve represents the distribution of the latent trait in the calibration sample (case with a variance = 1). The left part of the figure includes cases where the item locations δj have a low dispersion (range = 0.5) and the δjl have a high dispersion (SD = 2.5). The right part of the figure includes cases where the item locations δj have a high dispersion (range = 2) and the δjl have a low dispersion (SD = 1.5). Each line corresponds to different scenarios regarding the number of item and modalities: A) J = 4 items, M = 3 modalities. B) J = 4 items, M = 5 modalities. C) J = 10 items, M = 5 modalities. Full values for the response category thresholds δjl are provided in supplementary materials (Additional file 1)
Values of simulation parameters
| Characteristics | Parameter | Values |
|---|---|---|
| PRO instrument | Number of items J | 4, 7, 10 |
| Number of response categories M (response options from 0 to M-1) | 3, 5 | |
| Item locations ( | First archetype:
Second archetype:
| |
| Calibration sample | Sample size | 100, 250, 500 |
| Variance | 1, 2 | |
| Mean of latent trait | 0 | |
| Trial sample | Sample size | 50, 100, 200, 500 (per group) |
| Effect size (Standardized mean difference between groups) γ | 0, 0.2, 0.5 | |
| Mean of latent trait | 0, 0.5, 2 | |
| Variance within each group | 1 |
Type-I error, power, position bias and SD of the treatment effect estimates
| J | M | Ntrial | μ | γ = 0 | γ = 0.2 | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Type-I error | Power | Position bias | SD of the estimates | ||||||||||||||||
| Non calibrated | Calibrated | Non calibrated | Calibrated | Non calibrated | Calibrated | Non calibrated | Calibrated | ||||||||||||
|
| t test on |
| t test on |
| t test on |
| t test on |
| t test on |
| t test on |
| t test on |
| t test on | ||||
| 4 | 3 | 200 | 0 | 5.6 | 5.6 | 5.6 | 5.6 | 32.8 | 32.8 | 32.8 | 32.6 | 0.01 | 0.08 | 0.01 | 0.08 | 0.13 | 0.08 | 0.13 | 0.08 |
| 0.5 | 4.4 | 4.4 | 4.6 | 4.4 | 32.8 | 32.8 | 33.2 | 33.0 | 0.00 | 0.08 | 0.00 | 0.08 | 0.13 | 0.08 | 0.13 | 0.08 | |||
| 2 | 5.8 | 5.6 | 5.6 | 5.4 | 29.6 | 29.6 | 29.8 | 29.6 | 0.00 | 0.10 | 0.01 | 0.10 | 0.15 | 0.07 | 0.15 | 0.07 | |||
| 500 | 0 | 6.8 | 6.6 | 6.8 | 6.8 | 68.0 | 68.0 | 68.0 | 68.0 | 0.00 | 0.08 | 0.00 | 0.08 | 0.08 | 0.05 | 0.08 | 0.05 | ||
| 0.5 | 2.6 | 2.6 | 2.6 | 2.6 | 68.4 | 68.4 | 68.6 | 68.4 | 0.00 | 0.08 | 0.00 | 0.08 | 0.08 | 0.05 | 0.08 | 0.05 | |||
| 2 | 4.6 | 4.6 | 4.6 | 4.6 | 58.6 | 58.6 | 58.4 | 58.4 | 0.00 | 0.11 | 0.00 | 0.10 | 0.09 | 0.04 | 0.09 | 0.05 | |||
| 5 | 200 | 0 | 4.6 | 4.6 | 4.8 | 4.6 | 41.2 | 41.2 | 41.2 | 40.6 | 0.00 | 0.05 | 0.00 | 0.05 | 0.12 | 0.09 | 0.12 | 0.09 | |
| 0.5 | 4.6 | 4.6 | 4.6 | 4.4 | 40.8 | 40.8 | 40.6 | 40.4 | 0.00 | 0.05 | 0.00 | 0.05 | 0.12 | 0.09 | 0.12 | 0.09 | |||
| 2 | 5.4 | 5.4 | 5.6 | 5.4 | 32.2 | 32.4 | 32.6 | 32.4 | 0.00 | 0.08 | 0.00 | 0.07 | 0.13 | 0.08 | 0.13 | 0.08 | |||
| 500 | 0 | 5.2 | 5.2 | 5.2 | 5.2 | 76.6 | 76.6 | 76.8 | 76.6 | 0.00 | 0.05 | 0.00 | 0.05 | 0.07 | 0.06 | 0.07 | 0.06 | ||
| 0.5 | 4.6 | 4.6 | 4.6 | 4.6 | 75.6 | 75.6 | 75.8 | 75.8 | 0.00 | 0.05 | 0.00 | 0.05 | 0.08 | 0.06 | 0.08 | 0.06 | |||
| 2 | 4.2 | 4.2 | 4.2 | 4.2 | 69.8 | 69.8 | 69.4 | 69.4 | 0.00 | 0.08 | 0.00 | 0.07 | 0.08 | 0.05 | 0.08 | 0.05 | |||
| 7 | 3 | 200 | 0 | 7.6 | 7.2 | 7.4 | 7.2 | 40.8 | 40.8 | 40.8 | 40.8 | 0.00 | 0.05 | 0.00 | 0.05 | 0.12 | 0.09 | 0.12 | 0.09 |
| 0.5 | 5.4 | 5.2 | 5.6 | 5.2 | 40.0 | 40.0 | 40.2 | 40.0 | 0.00 | 0.06 | 0.00 | 0.06 | 0.12 | 0.08 | 0.12 | 0.08 | |||
| 2 | 5.2 | 5.2 | 5.2 | 5.2 | 36.6 | 36.6 | 36.6 | 36.4 | 0.01 | 0.08 | 0.01 | 0.08 | 0.13 | 0.08 | 0.13 | 0.08 | |||
| 500 | 0 | 5.2 | 5.2 | 5.2 | 5.2 | 76.8 | 76.6 | 77.0 | 76.8 | 0.00 | 0.05 | 0.00 | 0.05 | 0.07 | 0.05 | 0.07 | 0.05 | ||
| 0.5 | 5.2 | 5.2 | 5.2 | 5.2 | 75.0 | 75.0 | 75.0 | 75.0 | 0.00 | 0.05 | 0.00 | 0.05 | 0.08 | 0.06 | 0.08 | 0.06 | |||
| 2 | 3.6 | 3.6 | 3.6 | 3.6 | 67.8 | 67.8 | 67.8 | 67.8 | 0.00 | 0.08 | 0.00 | 0.08 | 0.08 | 0.05 | 0.09 | 0.05 | |||
| 5 | 200 | 0 | 5.2 | 5.2 | 5.2 | 5.2 | 42.6 | 42.6 | 42.8 | 42.6 | 0.00 | 0.03 | 0.00 | 0.03 | 0.11 | 0.09 | 0.11 | 0.09 | |
| 0.5 | 5.2 | 5.0 | 5.2 | 5.0 | 46.0 | 45.6 | 46.0 | 46.0 | 0.00 | 0.03 | 0.00 | 0.03 | 0.11 | 0.10 | 0.11 | 0.10 | |||
| 2 | 6.4 | 6.4 | 6.4 | 6.4 | 41.2 | 40.6 | 41.2 | 40.6 | 0.00 | 0.05 | 0.01 | 0.05 | 0.12 | 0.09 | 0.12 | 0.09 | |||
| 500 | 0 | 6.8 | 6.8 | 6.8 | 6.8 | 80.8 | 80.8 | 80.8 | 80.6 | 0.00 | 0.03 | 0.00 | 0.03 | 0.07 | 0.06 | 0.07 | 0.06 | ||
| 0.5 | 3.2 | 3.0 | 3.2 | 3.2 | 80.8 | 80.8 | 80.6 | 80.6 | 0.00 | 0.03 | 0.00 | 0.03 | 0.07 | 0.06 | 0.07 | 0.06 | |||
| 2 | 5.2 | 5.2 | 5.4 | 5.2 | 73.6 | 73.6 | 73.4 | 73.2 | 0.01 | 0.06 | 0.01 | 0.06 | 0.07 | 0.05 | 0.08 | 0.06 | |||
| 10 | 3 | 200 | 0 | 4.2 | 4.2 | 4.2 | 4.2 | 42.6 | 42.4 | 42.6 | 42.2 | 0.00 | 0.04 | 0.00 | 0.04 | 0.11 | 0.09 | 0.12 | 0.09 |
| 0.5 | 5.2 | 5.2 | 5.2 | 5.2 | 42.0 | 41.6 | 42.2 | 41.6 | 0.00 | 0.04 | 0.00 | 0.04 | 0.11 | 0.09 | 0.11 | 0.09 | |||
| 2 | 4.0 | 3.6 | 4.0 | 3.6 | 39.2 | 39.0 | 39.2 | 39.0 | 0.00 | 0.07 | 0.00 | 0.07 | 0.13 | 0.08 | 0.13 | 0.09 | |||
| 500 | 0 | 4.8 | 4.4 | 4.6 | 4.4 | 79.4 | 79.2 | 79.4 | 79.2 | 0.00 | 0.04 | 0.00 | 0.04 | 0.07 | 0.06 | 0.07 | 0.06 | ||
| 0.5 | 6.2 | 6.2 | 6.2 | 6.2 | 78.8 | 78.8 | 78.8 | 78.8 | 0.00 | 0.04 | 0.00 | 0.04 | 0.07 | 0.06 | 0.07 | 0.06 | |||
| 2 | 8.2 | 8.2 | 8.2 | 8.2 | 76.0 | 75.8 | 75.8 | 75.6 | 0.00 | 0.07 | 0.00 | 0.06 | 0.08 | 0.05 | 0.08 | 0.05 | |||
| 5 | 200 | 0 | 5.0 | 5.0 | 5.2 | 5.0 | 46.0 | 45.8 | 45.4 | 45.4 | 0.00 | 0.02 | 0.00 | 0.02 | 0.10 | 0.09 | 0.10 | 0.09 | |
| 0.5 | 5.6 | 5.6 | 6.0 | 5.6 | 47.2 | 46.8 | 47.2 | 46.4 | 0.00 | 0.02 | 0.00 | 0.02 | 0.11 | 0.09 | 0.11 | 0.10 | |||
| 2 | 4.4 | 4.2 | 4.4 | 4.0 | 42.0 | 41.6 | 42.0 | 41.6 | 0.00 | 0.04 | 0.00 | 0.04 | 0.11 | 0.09 | 0.11 | 0.09 | |||
| 500 | 0 | 5.6 | 5.4 | 5.6 | 5.6 | 84.4 | 84.2 | 84.4 | 84.2 | 0.00 | 0.02 | 0.00 | 0.02 | 0.07 | 0.06 | 0.07 | 0.06 | ||
| 0.5 | 6.2 | 6.2 | 6.2 | 6.2 | 83.8 | 83.8 | 84.2 | 83.8 | 0.00 | 0.02 | 0.00 | 0.02 | 0.07 | 0.06 | 0.07 | 0.06 | |||
| 2 | 6.6 | 6.4 | 6.6 | 6.4 | 81.8 | 81.6 | 81.8 | 81.6 | 0.00 | 0.04 | 0.00 | 0.04 | 0.07 | 0.06 | 0.07 | 0.06 | |||
Legend: results are presented for selected scenarios, with N = 250, distribution of the item parameters = second archetype with SD of 1.5 and range of 2, and variance of the calibration sample = 1
Fig. 2Power using calibrated and non-calibrated approaches, depending on mistargeting of the trial sample μ. Legend: power is presented for instruments with varying number of items J and modalities M. Presented results are for comparison of treatment groups based on , for scenarios with the distribution of the item parameters = second archetype with SD of 1.5 and range of 2, γ = 0.2, N = 500, N = 250, variance of the calibration sample = 1