| Literature DB >> 28728577 |
Joe Alexander1, Roger A Edwards2, Alberto Savoldelli3, Luigi Manca3, Roberto Grugni3, Birol Emir4, Ed Whalen5, Stephen Watt4, Marina Brodsky4, Bruce Parsons4.
Abstract
BACKGROUND: More patient-specific medical care is expected as more is learned about variations in patient responses to medical treatments. Analytical tools enable insights by linking treatment responses from different types of studies, such as randomized controlled trials (RCTs) and observational studies. Given the importance of evidence from both types of studies, our goal was to integrate these types of data into a single predictive platform to help predict response to pregabalin in individual patients with painful diabetic peripheral neuropathy (pDPN).Entities:
Keywords: Autoregressive models; Coarsened exact matching; Covariate bias; Diabetic peripheral neuropathy; Hierarchical cluster analysis; Neuropathic pain; Pregabalin; Sleep interference
Mesh:
Substances:
Year: 2017 PMID: 28728577 PMCID: PMC5520324 DOI: 10.1186/s12874-017-0389-2
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Descriptions of the six clusters
| Cluster ( | ||||||
|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | |
|
| 343 | 306 | 245 | 195 | 237 | 202 |
| Females (%)b | 0.0 | 30.1 | 29.8 | 98.5 | 31.7 | 43.1 |
| Age (years), mean (SD) | 60.4 (8.1) | 61.0 (7.7) | 62.5 (7.4) | 61.2 (7.6) | 60.5 (9.0) | 61.2 (8.5) |
| Age group (years), %b | ||||||
| 0–44 | 2.0 | 1.6 | 0.0 | 0.0 | 2.5 | 1.0 |
| 45–64 | 69.7 | 63.4 | 65.3 | 70.3 | 65.4 | 66.3 |
| 65–74 | 26.0 | 33.0 | 29.8 | 26.7 | 28.7 | 27.2 |
| 75+ | 2.3 | 2.0 | 4.9 | 3.1 | 3.4 | 5.5 |
| BMI (mg/m2)c | ||||||
| Mean (SD) | 28.8 (3.4) | 30.5 (4.4) | 30.0 (4.3) | 30.6 (4.8) | 30.9 (5.0) | 31.3 (5.8) |
| Normal (%) | 6.4 | 3.3 | 6.5 | 6.7 | 3.0 | 6.4 |
| Overweight (%) | 66.8 | 49.4 | 50.2 | 38.0 | 50.2 | 39.1 |
| Obese (%) | 26.8 | 47.4 | 43.3 | 55.4 | 46.8 | 54.5 |
| Baseline painb | ||||||
| Mean (SD) | 6.3 (1.3) | 6.5 (1.4) | 6.3 (1.4) | 6.5 (1.5) | 6.5 (1.3) | 7.0 (1.3) |
| Pain score (%) | ||||||
| 0–3 | 1.5 | 1.0 | 1.6 | 2.1 | 1.3 | 0.5 |
| 4–5 | 27.7 | 21.6 | 30.6 | 23.1 | 20.7 | 15.4 |
| 6–7 | 52.5 | 54.3 | 47.8 | 50.8 | 58.2 | 46.0 |
| 8–10 | 18.4 | 23.2 | 20.0 | 24.1 | 19.8 | 38.1 |
| Baseline sleep interferenceb | ||||||
| Mean (SD) | 5.4 (2.2) | 5.7 (2.2) | 5.4 (2.4) | 5.6 (2.3) | 5.7 (2.1) | 6.7 (2.2) |
| Sleep interference score | ||||||
| 0–3 | 22.2 | 16.3 | 22.0 | 19.5 | 14.4 | 10.9 |
| 4–5 | 26.5 | 30.1 | 26.1 | 25.6 | 30.0 | 15.4 |
| 6–7 | 33.8 | 28.4 | 29.4 | 33.9 | 38.4 | 34.2 |
| 8–10 | 17.5 | 25.2 | 22.5 | 21.0 | 17.3 | 39.6 |
| Cross-correlation between sleep interference and paind | ||||||
| 2 weeks prior based on pain in the current week (lag -2) | 0.70 | 0.70 | ||||
| 1 week prior based on pain in the current week (lag -1) | 0.75 | 0.70 | 0.76 | 0.78 | ||
| Based on pain in the current week (Lag 0) | 0.85 | 0.81 | 0.77 | 0.85 | 0.87 | 0.81 |
| 1 week after based on pain in the current week (lag +1) | 0.71 | 0.71 | 0.73 | 0.79 | 0.72 | |
| 2 weeks after based on Pain in the current week (lag +2) | 0.71 | |||||
| Duration of pDPN (years), %c | ||||||
| 0 to ≤5 | 28.9 | 21.9 | 24.4 | 29.6 | 22.9 | 20.8 |
| > 5 to ≤10 | 18.7 | 23.7 | 20.4 | 18.5 | 32.4 | 21.6 |
| > 10 to ≤15 | 28.9 | 24.4 | 23.9 | 25.9 | 19.1 | 23.2 |
| > 15 to ≤20 | 12.4 | 13.3 | 15.4 | 18.5 | 12.4 | 13.6 |
| > 20 to ≤25 | 3.6 | 5.7 | 6.0 | 3.1 | 3.8 | 4.8 |
| > 25 | 7.5 | 11.1 | 10.0 | 4.3 | 9.5 | 16.0 |
| History of depression (%)c | 0.0 | 0.0 | 0.0 | 0.0 | 18.1 | 100.0 |
| Prior or current therapy (%)c | ||||||
| Pregabalin monotherapy | 100.0 | 59.5 | 0.0 | 100.0 | 58.1 | 40.8 |
| Gabapentin | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| Insulin | 3.2 | 100.0 | 18.0 | 16.9 | 78.1 | 57.4 |
| Full of energy at baseline (%) | ||||||
| Always | 1.2 | 0.7 | 1.5 | 0.6 | 0.0 | 0.0 |
| Mostly | 8.1 | 7.9 | 5.0 | 3.7 | 4.8 | 4.8 |
| Fairly often | 11.1 | 6.5 | 11.0 | 11.7 | 8.6 | 8.6 |
| Sometimes | 32.8 | 30.5 | 19.9 | 28.4 | 30.5 | 30.5 |
| Seldom | 40.1 | 40.5 | 56.7 | 44.4 | 39.1 | 39.1 |
| Never | 6.6 | 13.3 | 6.0 | 10.5 | 16.2 | 16.2 |
| Calm and relaxed at baseline (%) | ||||||
| Always | 2.4 | 2.5 | 2.0 | 0.0 | 2.9 | 3.2 |
| Mostly | 13.6 | 15.1 | 16.4 | 14.2 | 12.4 | 4.8 |
| Fairly often | 18.1 | 14.7 | 13.9 | 17.3 | 16.2 | 7.2 |
| Sometimes | 31.9 | 30.8 | 27.4 | 26.5 | 27.6 | 17.6 |
| Seldom | 32.5 | 30.5 | 37.8 | 35.8 | 32.4 | 50.4 |
| Never | 1.5 | 5.7 | 2.5 | 5.6 | 8.6 | 16.8 |
| Sad and discouraged at baseline (%) | ||||||
| Always | 1.2 | 2.5 | 3.0 | 3.1 | 5.7 | 9.6 |
| Mostly | 15.1 | 14.7 | 13.4 | 16.1 | 12.4 | 42.4 |
| Fairly often | 27.1 | 29.0 | 33.3 | 26.5 | 26.7 | 28.8 |
| Sometimes | 32.5 | 29.1 | 29.9 | 23.5 | 27.6 | 7.2 |
| Seldom | 18.1 | 16.9 | 16.4 | 25.9 | 22.9 | 11.2 |
| Never | 6.0 | 7.2 | 3.5 | 3.7 | 4.8 | 0.8 |
| Responders at 50% threshold at end (%)c | 86.0 | 76.9 | 70.1 | 78.1 | 52.3 | 59.7 |
| Daily treatment dose (mg) | ||||||
| 75 | 9.0 | 14.1 | 15.5 | 10.8 | 16.0 | 13.4 |
| 150 | 79.6 | 74.8 | 65.3 | 72.8 | 56.1 | 61.9 |
| 300 | 5.5 | 5.6 | 12.2 | 9.7 | 14.8 | 16.8 |
| 600 | 0.3 | 1.3 | 2.9 | 3.6 | 11.0 | 3.5 |
| Other (30–500 mg, excluding doses above) | 5.5 | 4.3 | 4.1 | 3.1 | 2.1 | 4.5 |
Abbreviations: SD standard deviation, BMI body mass index, pDPN painful diabetic peripheral neuropathy
aEach cluster analysis was derived from the Ward’s minimum variance technique that grouped patients in such a way that patients in the same group (called a cluster) were more similar to each other than to those in other clusters
bVariables used to create the clusters
cOther variables not used for creating the clusters
dOnly cross-correlations ≥0.70 are shown
CEM results
| Cluster | Patients by cluster, | Global imbalancea | Reduction in global imbalancea After CEM (%) | |||
|---|---|---|---|---|---|---|
| Observational Study Alone | Observational Study + RCT in matched dataset After CEM, | |||||
| Before CEM | After CEM | Before CEM | After CEM | |||
| 1 | 696 | 332 | 343 | 0.72 | 0.68 | 6 |
| 2 | 777 | 279 | 306 | 0.70 | 0.26 | 63 |
| 3 | 542 | 201 | 245 | 0.72 | 0.27 | 63 |
| 4 | 556 | 162 | 195 | 0.84 | 0.33 | 61 |
| 5 | 287 | 105 | 237 | 0.74 | 0.30 | 59 |
| 6 | 301 | 125 | 202 | 0.71 | 0.30 | 58 |
| Total | 3159 | 1204b | 1528 | |||
Abbreviations: CEM coarsened exact matching, RCT randomized controlled trial
aThe degree of imbalance represents level of bias in the covariates’ distributions for a given sample. According to Iacus et al. (2008) [56], ‘the key goal of matching is to prune observations from the data so that the remaining data have better balance between the control and the treated groups’ (e.g., the observational study dataset of each cluster and RCT data). ‘Exactly balanced data [i.e., global imbalance score = 0] means that controlling further for X is unnecessary (since it is unrelated to the treatment variable), and so a simple difference in means on matched data can estimate the causal effect; approximately balanced data requires controlling for X with a model (e.g., the same model that would have been used without matching), but the only inferences necessary are those relatively close to the data, leading to less model dependence and reduced statistical bias than without matching.’ (See: 1) Imbens GW, Rubin DB. Causal inference in statistics, social, and biomedical sciences. Cambridge, UK: Cambridge University Press; 2015 [57]. 2) King G, Lucas C, Nielsen R. The balance-sample size frontier in matching methods for causal inference. Am J Poli Sci. doi:10.1111/ajps.12272 [58]. 3) Stuart EA. Matching methods for causal inference: a review and a look forward. Stat Sci. 2010;25:1–21 [59].) Therefore, in our case, an imbalance of 0 means that the empirical distribution of the covariates of the Observational Study dataset in a given cluster is equivalent to in the RCT data; an imbalance of 1 means that the empirical distribution is completely different
bOne hundred sixty-two of them were excluded from the ARMAX model calibration because they lacked pain and sleep interference data for the full six weeks; hence there were 1042 observational study patients in the calibration dataset for developing the ARMAX models
ARMAX model input variables and regression coefficients by cluster for the calibration dataset
| ARMAX model input variables | Final ARMAX output regression coefficients, by clustera | |||||
| 1 | 2 | 3 | 4 | 5 | 6 | |
| y-intercepts for regression models, not variables | −0.0409 | −0.1447 | −0.0604 | −0.0826 | 0.0789 | −0.2732 |
| Age cohort (×10) | – | – | – | – | 0.0465 | – |
| Gender (×9) | – | – | – | – | −0.0496 | – |
| Pregabalin monotherapy (×8) | – | – | – | – | −0.1107 | – |
| pDPN duration (years) (×11) | 0.0179 | – | – | – | – | – |
| Insulin use (×7) | – | – | – | – | – | 0.0276 |
| Pain score (t-1)b (×1) | 0.7180 | 0.8749 | 0.7865 | 0.8341 | 0.9011 | 0.8919 |
| Pain score (t-2)c (×2) | 0.0436 | 0.0196 | 0.0164 | 0.0451 | 0.0107 | −0.0103 |
| Sleep interference score (t-1)b (×3) | 0.0949 | – | 0.0374 | – | – | – |
| Dose (t)d (×4) | – | −0.0006 | – | – | −0.0011 | – |
| Dose (t-1)b (×5) | −0.0012 | −0.0007 | −0.0009 | −0.0012 | – | −0.0004 |
| Dose (t-2)c (×6) | 0.0015 | 0.0015 | 0.0012 | 0.0017 | 0.0012 | 0.0007 |
| General feeling: full of energy (t-1)b (×13) | – | 0.0250 | 0.0425 | – | – | – |
| General feeling: calm and relaxed (t-1)b (×12) | – | – | – | −0.0109 | – | 0.0655 |
| Model performance measures applied | Performance, by cluster | |||||
| 1 | 2 | 3 | 4 | 5 | 6 | |
| Likelihood ratio | <0.0001 | <0.0001 | <0.0001 | <0.0001 | <0.0001 | <0.0001 |
|
| 0.86 | 0.89 | 0.85 | 0.87 | 0.91 | 0.89 |
| Root mean square error | 0.54 | 0.55 | 0.55 | 0.53 | 0.53 | 0.57 |
| Observed vs. estimated responder level (Student’s | 0.95 | 1.00 | 0.95 | 0.97 | 0.92 | 0.96 |
Abbreviations: ARMAX autoregressive moving average model, pDPN painful diabetic peripheral neuropathy
aThe first number in each column is the regression intercept value. Blank spaces in columns indicate that the associated row variable was not a predictor in the final model for that cluster
b(t-1) indicates one week before prediction
c(t-2) indicates two weeks before prediction
d(t) indicates the same week of the prediction
Given the time series of pain scores, ARMAX is essentially a linear regression model for understanding future values of pain scores in the series. The ARMAX model inputs were assigned unique variable names, x1x13, and are represented in the cluster-specific ARMAX equations below
Equations for the ARMAX models (where ‘y’ is pain score, treated as a continuous variable)
CLUSTER 1: y = −0.0409 + 0.7180 × 1 + 0.0436 × 2 + 0.0949 × 3–0.0012 × 5 + 0.0015 × 6 + 0.0179 × 11
CLUSTER 2: y = −0.1447 + 0.8749 × 1 + 0.0196 × 2–0.0006 × 4–0.0007 × 5 + 0.0015 × 6 + 0.0250 × 13
CLUSTER 3: y = −0.0604 + 0.7865 × 1 + 0.0164 × 2 + 0.0374 × 3–0.0009 × 5 + 0.0012 × 6 + 0.0425 × 13
CLUSTER 4: y = −0.0826 + 0.8341 × 1 + 0.0451 × 2–0.0012 × 5 + 0.0017 × 6–0.0109 × 12
CLUSTER 5: y = 0.0789 + 0.9011 × 1 + 0.0107 × 2–0.0011 × 4 + 0.0012 × 6–0.1107 × 8–0.0496 × 9 + 0.0465 × 10
CLUSTER 6: y = −0.2732 + 0.8919 × 1–0.0103 × 2–0.0004 × 5 + 0.0007 × 6 + 0.0276 × 7 + 0.0655 × 12
eThe ARMAXs estimate pain score, but we also want to be able to identify whether that patient is a responder at different thresholds (e.g., 50% reduction in pain or 30% reduction in pain). Hence, we sought to confirm estimation of responder level based on the ARMAXs for pain score
Fig. 1ARMAX model ROC curves for 50% responder levels the six clustersa. aAttaining the responder level of 50% is the dependent variable for these models in contrast to pain score, which is the dependent variable in the models in Table 3. ROC receiver operating characteristic
ARMAX model predictive capability for pain and responder status in the validation dataset
| Clustersa | Observed vs. estimated, | |||
|---|---|---|---|---|
| Pain level for responders | Responder status achieved | |||
| At 50% threshold | At 30% threshold | At 50% threshold | At 30% threshold | |
| 1 | 0.52 | 0.31 | 0.50 | 0.20 |
| 2 | 0.78 | 0.16 | 0.65 | 0.11 |
| 3 | 0.29 | 0.22 | 0.26 | 0.14 |
| 4 | 0.71 | 0.32 | 0.83 | 0.28 |
| 5 | 0.66 | 0.42 | 0.73 | 0.25 |
| 6 | 0.76 | 0.30 | 0.79 | 0.25 |
Abbreviations: ARMAX autoregressive moving average model, RCT randomized controlled trial
aObservational study dataset of patients not matched with RCT patients