| Literature DB >> 32028898 |
E Stühler1, S Braune2, F Lionetto1, Y Heer1, E Jules1, C Westermann1, A Bergmann3, P van Hövell1.
Abstract
BACKGROUND: Personalized healthcare promises to successfully advance the treatment of heterogeneous neurological disorders such as relapsing remitting multiple sclerosis by addressing the caveats of traditional healthcare. This study presents a framework for personalized prediction of treatment response based on real-world data from the NeuroTransData network.Entities:
Keywords: Bayesian generalized linear model; Clinical decision support; Personalized health record; Personalized medicine; Personalized predictive models; Relapsing remitting multiple sclerosis
Year: 2020 PMID: 32028898 PMCID: PMC7006411 DOI: 10.1186/s12874-020-0906-6
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
List of model predictors, along with code names for shorter reference across the study
| Code name | Description |
|---|---|
| Age | Age at the start of the index therapy |
| Gender | Gender |
| EDSS | EDSS (measured at most 6 months before or 3 months after the start of the therapy cycle, and at least 84 days after a relapse) |
| Index / Index therapy | DMT taken during the therapy cycle |
| Current / Current therapy | DMT taken prior to the start of the therapy cycle |
| Diagnosis distance | Time elapsed between MS diagnosis and start of index therapy |
| Relapse distance | Time elapsed between the last relapse preceding the start of the index therapy and the start of the index therapy |
| Relapses count | Number of relapses in the year prior to the start of the index therapy |
| DMTs count | Number of DMTs taken prior to the start of the index therapy |
| Second-line | Whether a second-line DMT has been taken before the start of the index therapy |
| Current duration | Duration of the current therapy |
| Index duration | Duration of the index therapy |
| Clinical site | Clinical site where the course of MS is observed |
EDSS expanded disability status scale, DMT disease modifying therapy, Second-line DMT to be employed by label of the European Medical Agency if previous DMT failed to achieve sufficient control of disease activity (of the DMTs considered in this work, this applies to Fingolimod and Natalizumab), MS multiple sclerosis.
Default priors assigned to the relapse and CDP models’ parameters
| Model | Intercept | Fixed effects | Dispersion | Standard deviation of random intercepts |
|---|---|---|---|---|
| Relapse | N(0, 10) | N(0, 2.5) | Half-Cauchy(0, 5) | Gamma(1, 1) |
| CDP | N(0, 10) | N(0, 2.5) | – | Gamma(1, 1) |
CDP confirmed disability progression.
Overview of the predictors used for predictive models and nested models
| Non-personalized model | Prognostic model | Predictive model | |
|---|---|---|---|
| Clinical site | x | x | x |
| Index therapy | x | x | x |
| Index duration | x | x | x |
| Age | x | x | |
| Gender | x | x | |
| EDSS | x | x | |
| Second-line | x | x | |
| Current therapy | x | x | |
| Current duration | x | x | |
| Interaction (Current therapy, Current duration) | x | x | |
| Diagnosis distance | x | x | |
| Relapse distance | x | x | |
| Relapses count | x | x | |
| DMTs count | x | x | |
| Interaction (Index therapy, Diagnosis distance) | x | ||
| Interaction (Index therapy, Gender) | x | ||
| Interaction (Index therapy, Relapses count) | x | ||
| Interaction (Index therapy, Second-line) | x |
EDSS expanded disability status scale, DMT disease modifying therapy, Second-line DMT to be employed by label of the European Medical Agency if previous DMT failed to achieve sufficient control of disease activity.
Fig. 1Comparison of predicted therapy. a Patients assigned to the same highest ranked therapy are divided into two groups: those who received indeed this therapy (red) and those who received another therapy (gray). b Weights are calculated for the two groups to mitigate the effect of confounds on the analysis. In particular, the weights are calculated such that each group matches the population statistics. As an example, this results in larger weights for females in the group with a smaller ratio of females compared with the overall occurrence than in the group with a larger ratio. c The weights are included in a survey-weighted GLM, where an indicator variable encodes membership to one of the two groups, and observation time is accounted for. The GLM allows for a propensity-score-based weighting of the clinically relevant outcomes of the two groups. The estimated slope allows then for a comparison of the disease activity between the two groups
Most important predictors in the relapse model
| Ranka | Predictor | Signb | MAD |
|---|---|---|---|
| 1 | Current = TERI: Current duration | – | 4.301 |
| 2 | Intercept | – | 0.553 |
| 3 | Current = TERI | – | 1.406 |
| 4 | Current = FTY: Current duration | + | 1.983 |
| 5 | Index = NA: Diagnosis distance | + | 0.368 |
| 6 | Current duration | + | 1.414 |
| 7 | Index = TERI: Relapses count | – | 0.332 |
| 8 | Index = FTY: Second-line = TRUE | – | 0.335 |
MAD median absolute deviation, TERI Teriflunomide, FTY Fingolimod, NA Natalizumab, Second-line DMT to be employed by label of the European Medical Agency if previous DMT failed to achieve sufficient control of disease activity.
a Ranked according to the magnitude of the median of the corresponding coefficient’s posterior distribution
b A positive sign is associated with a boosting effect on the number of relapses; a negative sign is associated with a lessening effect on the number of relapses
Most important predictors in the CDP model
| Ranka | Predictor | Signb | MAD |
|---|---|---|---|
| 1 | Intercept | – | 0.721 |
| 2 | Index = TERI: Second-line = TRUE | – | 0.977 |
| 3 | Index = NA: Second-line = TRUE | – | 0.837 |
| 4 | Index = NA: Diagnosis distance | + | 0.518 |
| 5 | Index = FTY: Second-line = TRUE | – | 0.487 |
| 6 | Current = IF: Current duration | – | 1.604 |
| 7 | Current = NA: Current duration | – | 2.124 |
| 8 | Current | + | 0.711 |
MAD median absolute deviation, TERI Teriflunomide, FTY Fingolimod, NA Natalizumab, IF Interferon-ß1, Second-line DMT to be employed by label of the European Medical Agency if previous DMT failed to achieve sufficient control of disease activity.
a Ranked according to the magnitude of the median of the corresponding coefficient’s posterior distribution
b A positive sign is associated with a boosting effect on the likelihood of observing a CDP; a negative sign is associated with a lessening effect on the likelihood of observing a CDP
Fig. 2Model coefficients. MADs of the fixed effects’ posterior distributions in the relapse model (a) and in the CDP model (b)
Fig. 3Calibration. Calibration of the relapse model (top) and CDP model (bottom) using equally-populated bins, when considering all DMTs together (left) and when considering each DMT separately (right). The predictions are split into 20 equally-populated bins from zero to four relapses, for the relapse model, and from zero to one, for the CDP model. Different shades of gray highlight a different population size
Performance of predictive, prognostic and non-personalized models based on out-of-sample and in-sample predictions
| Measure | Out-of-sample mean (SE)a | In-sample mean (SE)a | Sample sizeb | Response | Model |
|---|---|---|---|---|---|
| C-Index | 0.5819 (0.0008) | 0.6546 (0.0005) | 307,784 | CDP | predictive |
| C-Index | 0.5625 (0.0007) | 0.6220 (0.0004) | 307,784 | CDP | prognostic |
| C-Index | 0.5467 (0.0006) | 0.5649 (0.0005) | 307,784 | CDP | non-personalized |
| C-Index | 0.6458 (0.0004) | 0.6781 (0.0003) | 505,724 | relapse | predictive |
| C-Index | 0.6482 (0.0003) | 0.6700 (0.0002) | 505,724 | relapse | prognostic |
| C-Index | 0.5531 (0.0003) | 0.5609 (0.0003) | 505,724 | relapse | non-personalized |
| MSE | 0.12497 (0.00005) | 0.11928 (0.00004) | 3119 | CDP | predictive |
| MSE | 0.12486 (0.00004) | 0.12132 (0.00003) | 3119 | CDP | prognostic |
| MSE | 0.12449 (0.00002) | 0.12388 (0.00001) | 3119 | CDP | non-personalized |
| MSE | 0.7554 (0.0008) | 0.7097 (0.0006) | 3119 | relapse | predictive |
| MSE | 0.7312 (0.0005) | 0.7049 (0.0003) | 3119 | relapse | prognostic |
| MSE | 0.7557 (0.0002) | 0.7517 (0.0001) | 3119 | relapse | non-personalized |
| NLL | 1252.6 (0.6) | 1190.9 (0.4) | 3119 | CDP | predictive |
| NLL | 1254.0 (0.5) | 1215.5 (0.2) | 3119 | CDP | prognostic |
| NLL | 1246.8 (0.2) | 1240.3 (0.1) | 3119 | CDP | non-personalized |
| NLL | 2580.8 (0.6) | 2519.9 (0.4) | 3119 | relapse | predictive |
| NLL | 2574.6 (0.5) | 2534.9 (0.3) | 3119 | relapse | prognostic |
| NLL | 2650.1 (0.2) | 2641.9 (0.2) | 3119 | relapse | non-personalized |
SE standard error of the mean, CDP confirmed disability progression, C-Index Harrell’s concordance statistic, MSE mean squared error, NLL negative log-likelihood.
a Estimated by repeating 10-fold cross-validation 40 times
b Refers either to the number of observations (MSE, NLL) or the number of matched pairs (C-Index)
Performance of models trained on the full data set and evaluated on the test set
| Measure | Value | Sample sizea | Response | Model |
|---|---|---|---|---|
| C-Index | 0.554 | 3354 | CDP | predictive |
| C-Index | 0.608 | 5606 | relapse | predictive |
| Average NLL | 0.423 | 314 | CDP | predictive |
| Average NLL | 0.821 | 314 | relapse | predictive |
| MSE | 0.131 | 314 | CDP | predictive |
| MSE | 0.784 | 314 | relapse | predictive |
CDP confirmed disability progression, C-Index Harrell’s concordance statistic, MSE mean squared error, NLL negative log-likelihood.
a Refers either to the number of observations (MSE, average NLL) or the number of matched pairs (C-Index)
Performance of the predictive models based on leave-one-site-out cross-validation
| Measure | Out-of-samplea mean | In-sampleb mean | Sample sizec | Response | Model |
|---|---|---|---|---|---|
| C-Index | 0.579 | 0.652 | 307,784 | CDP | predictive |
| C-Index | 0.646 | 0.675 | 505,724 | Relapse | predictive |
| MSE | 0.125 | 0.119 | 3119 | CDP | predictive |
| MSE | 0.748 | 0.711 | 3119 | Relapse | predictive |
| NLL | 1254.330 | 1192.544 | 3119 | CDP | predictive |
| NLL | 2581.261 | 2523.731 | 3119 | Relapse | predictive |
CDP confirmed disability progression, C-Index Harrell’s concordance statistic, MSE mean squared error, NLL negative log-likelihood.
a Based on predictions for patients from a clinical site not used during training
b Based on predictions for all patients from all clinical sites used for training
c Refers either to the number of observations (MSE, NLL) or the number of matched pairs (C-Index)
Comparison of therapy effectiveness for the relapse model
| DMT* | Slope coefficienta | Sample size when DMT* was taken | Sample size when DMT* was not taken | |
|---|---|---|---|---|
| Dimethylfumarat | ||||
| Fingolimod | 0.0423 | 128 | 112 | 0.860 |
| Natalizumab |
DMT* highest ranked disease modifying therapy.
a Derived from a survey-weighted negative binomial generalized linear model where negative sign indicates lower disease activity
Comparison of therapy effectiveness for the CDP model
| DMT* | Slope coefficienta | Sample size when DMT* was taken | Sample size when DMT* was not taken | |
|---|---|---|---|---|
| Dimethylfumarat | ||||
| Fingolimod | 0.1114 | 135 | 87 | 0.792 |
| Glatirameracetat | −0.5336 | 238 | 45 | 0.350 |
| Natalizumab | −0.4021 | 1101 | 132 | 0.405 |
| Teriflunomide | −0.4317 | 179 | 16 | 0.730 |
DMT* highest ranked disease modifying therapy.
a Derived from a survey-weighted quasi-binomial generalized linear model where negative sign indicates lower disease activity