| Literature DB >> 34950873 |
Valery Fuh-Ngwa1, Yuan Zhou1, Jac C Charlesworth1, Anne-Louise Ponsonby2, Steve Simpson-Yap1,3, Jeannette Lechner-Scott4,5, Bruce V Taylor1.
Abstract
Our inability to reliably predict disease outcomes in multiple sclerosis remains an issue for clinicians and clinical trialists. This study aims to create, from available clinical, genetic and environmental factors; a clinical-environmental-genotypic prognostic index to predict the probability of new relapses and disability worsening. The analyses cohort included prospectively assessed multiple sclerosis cases (N = 253) with 2858 repeated observations measured over 10 years. N = 219 had been diagnosed as relapsing-onset, while N = 34 remained as clinically isolated syndrome by the 10th-year review. Genotype data were available for 199 genetic variants associated with multiple sclerosis risk. Penalized Cox regression models were used to select potential genetic variants and predict risk for relapses and/or worsening of disability. Multivariable Cox regression models with backward elimination were then used to construct clinical-environmental, genetic and clinical-environmental-genotypic prognostic index, respectively. Robust time-course predictions were obtained by Landmarking. To validate our models, Weibull calibration models were used, and the Chi-square statistics, Harrell's C-index and pseudo-R 2 were used to compare models. The predictive performance at diagnosis was evaluated using the Kullback-Leibler and Brier (dynamic) prediction error (reduction) curves. The combined index (clinical-environmental-genotypic) predicted a quadratic time-dynamic disease course in terms of worsening (HR = 2.74, CI: 2.00-3.76; pseudo-R 2=0.64; C-index = 0.76), relapses (HR = 2.16, CI: 1.74-2.68; pseudo-R 2 = 0.91; C-index = 0.85), or both (HR = 3.32, CI: 1.88-5.86; pseudo-R 2 = 0.72; C-index = 0.77). The Kullback-Leibler and Brier curves suggested that for short-term prognosis (≤5 years from diagnosis), the clinical-environmental components of disease were more relevant, whereas the genetic components reduced the prediction errors only in the long-term (≥5 years from diagnosis). The combined components performed slightly better than the individual ones, although their prognostic sensitivities were largely modulated by the clinical-environmental components. We have created a clinical-environmental-genotypic prognostic index using relevant clinical, environmental, and genetic predictors, and obtained robust dynamic predictions for the probability of developing new relapses and worsening of symptoms in multiple sclerosis. Our prognostic index provides reliable information that is relevant for long-term prognostication and may be used as a selection criterion and risk stratification tool for clinical trials. Further work to investigate component interactions is required and to validate the index in independent data sets.Entities:
Keywords: clinical–environmental; dynamic predictions; genetic variants; multiple sclerosis; prognostic index
Year: 2021 PMID: 34950873 PMCID: PMC8691056 DOI: 10.1093/braincomms/fcab288
Source DB: PubMed Journal: Brain Commun ISSN: 2632-1297
Regression coefficients (), standard errors (SE) and P-values (P) for the candidate clinical and environmental predictors included in the clinical–environmental prognostic index (CEPI) when predicting the risk of worsening of disease (WoD), relapses (RRE) and relapse and/or worsening of disease (RWoD). Estimates for clinical predictors not included in the final models are left blank as they did not pass the significance level () to stay in the model
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Clinical variables | Categories |
| SE |
|
| SE |
|
| SE |
|
| Baseline predictors | ||||||||||
|
| 0.01 | 0.01 | 0.07 | −0.02 | 0.01 | <0.01 | <−0.01 | <0.01 | 0.86 | |
|
| Female | −0.06 | 0.09 | 0.55 | 0.02 | 0.11 | 0.88 | −0.10 | 0.08 | 0.22 |
|
|
| 0.09 | 0.13 | 0.52 | −0.04 | 0.12 | 0.71 | −0.03 | 0.09 | 0.71 |
|
| −0.08 | 0.15 | 0.57 | 0.11 | 0.11 | 0.33 | −0.14 | 0.10 | 0.17 | |
|
| 0.02 | 0.16 | 0.92 | 0.20 | 0.11 | 0.07 | 0.11 | 0.10 | 0.27 | |
|
| Reference | Reference | Reference | |||||||
|
| <−0.01 | <0.01 | 0.12 | <−0.01 | <0.01 | 0.83 | <−0.01 | <0.01 | <0.01 | |
|
| Yes | – | – | – | – | – | – | – | – | – |
|
| Yes | – | – | – | – | – | – | – | – | – |
|
| HE | – | – | – | – | – | – | – | – | – |
| SE | −0.15 | 0.10 | 0.13 | −0.12 | 0.07 | 0.06 | ||||
| LSE | Reference | Reference | Reference | |||||||
|
| 0.45 | 0.09 | <0.01 | 0.25 | 0.06 | <0.01 | 0.26 | 0.06 | <0.01 | |
|
| −0.01 | 0.19 | 0.94 | −0.22 | 0.14 | 0.12 | −0.01 | 0.06 | 0.85 | |
| Time-dependent predictors | ||||||||||
|
| Yes | – | – | – | – | – | – | – | – | – |
|
| Yes | – | – | – | 0.25 | 0.11 | 0.02 | – | – | – |
|
| 1.42 | 0.41 | <0.01 | 0.57 | 0.32 | 0.07 | 2.51 | 0.99 | 0.01 | |
|
| 0.42 | 0.08 | <0.01 | 0.29 | 0.07 | <0.01 | 0.67 | 0.05 | <0.01 | |
|
| 0.29 | 0.09 | 0.01 | −0.12 | 0.14 | 0.38 | 0.09 | 0.08 | 0.23 | |
|
| Yes | −0.69 | 0.19 | <0.01 | −3.87 | 0.17 | <0.01 | −0.61 | 0.18 | <0.01 |
|
| 0.28 | 0.09 | <0.01 | 0.44 | 0.13 | <0.01 | 0.60 | 0.07 | <0.01 | |
|
| Yes | – | – | – | – | – | – | – | – | – |
|
| FT | – | – | – | – | – | – | −0.27 | 0.10 | 0.01 |
| DP | −0.25 | 0.17 | 0.15 | – | – | – | −0.30 | 0.13 | 0.02 | |
| PT/WH | – | – | – | – | – | – | – | – | – | |
| UE | Reference | Reference | Reference | |||||||
|
| 0.07 | 0.03 | 0.01 | – | – | – | 0.07 | 0.03 | 0.01 | |
|
| $1500–$2000 | – | – | – | 0.51 | 0.16 | <0.01 | 0.45 | 0.17 | 0.01 |
| $600–$1499 | 0.58 | 0.14 | <0.01 | 0.84 | 0.13 | <0.01 | 0.64 | 0.12 | <0.01 | |
| $1–$599 | 0.38 | 0.15 | 0.01 | 0.52 | 0.15 | <0.01 | 0.46 | 0.12 | <0.01 | |
| $0 | Reference | Reference | Reference | |||||||
|
| −0.24 | 0.02 | <0.01 | −0.99 | 0.05 | <0.01 | −3.59 | 0.21 | <0.01 | |
NB: The actual values for P-values < 0.01 range between .
number of events, number of observations.
DP, disability pension; FT, full time; HADS, hospital anxiety depression score; HE, higher education; LSE, less than secondary education; PT/WH, part-time/work from home; SE, secondary education; UE, unemployed.
NSW, New South Wales; QLD, Queensland; TAS, Tasmania; VIC, Victoria.
Inter-attack intervals = Difference between event stop and event start time.
in sunlight exposure: Difference in hours of sunlight exposure (winter-summer).
FDE, first demyelinating event; 25(OH)D, 25 hydroxy vitamin D levels measured in units of nmol/l.
Figure 1Distributions of the prognostic index. Panel (A) WoD, (B) RRE and (C) RWoD.
Time-fixed supermodel Cox regression on cross-validation based clinical–environmental (CEPI), genetic (GPI) and clinical–environmental–genotype (CEGPI) prognostic indices. The P-values for all parameters were significantly less than
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Prognostic indices included | Parms. (Model) |
| ( |
|
| ( |
|
| ( |
|
| Clinical–Env. ( |
| 0.96 (0.03) | (0.73, 0.57) | 859 | 0.93 (0.03) | (0.85, 0.90) | 1311 | 0.93 (0.05) | (0.76, 0.71) | 1709 |
| Genetic ( |
| 0.86 (0.04) | (0.65, 0.32) | 389 | 0.82 (0.05) | (0.79, 0.73) | 746 | 0.84 (0.04) | (0.69,0.39) | 680 |
|
| 0.86 (0.04) | 0.80 (0.05) | 0.86 (0.02) | |||||||
| Both ( |
| 0.56 (0.07) | (0.76, 0.64) | 1043 | 0.26 (0.06) | (0.85, 0.91) | 1358 | 0.27 (0.06) | (0.77, 0.72) | 1776 |
| Clinical–Env–genotype ( |
| 1.00 (0.03) | (0.76, 0.64) | 1043 | 1.00 (0.03) | (0.85, 0.91) | 1358 | 1.00 (0.03) | (0.77 0.72) | 1776 |
| Calibrated clinical |
| 0.86/0.96 | 0.80/0.93 | 0.86/0.93 | ||||||
| Calibrated genetic |
| 0.56/0.86 | 0.26/0.82 | 0.31/0.85 | ||||||
number of observations.
Cross-validated Harrell’s C-index.
.
NB: In the column denoted ‘Parms’, the actual parameters in the supermodels are given. The results on this table were obtained from the fit of models respectively (see Supplementary methods).
CEPI: Clinical–Env Prognostic Index (clinical + environmental predictors).
GPI: Genetic Prognostic Index (Cumulative effects of single nucleotide polymorphisms markers).
CEGPI: Clinical–Env–Genotypic Prognostic Index = CEPI+GPI (clinical + environmental + genetic).
Time-fixed Supermodels (): Cox regression performed on CEPI only, or GPI only, or combination of CEPI+GPI, without time-varying effects.
Super learner (.
Time-fixed and time-varying supermodel Cox regression on the clinical–environmental (CEPI), genetic (GPI) and clinical–env–genotype (CEGPI) prognostic index, respectively. Shown are the standard errors (SE) and the regression coefficients for the time-fixed effects () and time-varying effects ln(1+t). Non-significant effects have been highlighted
| Prognostic index (PI) |
|
|
|
| Model AIC | |
|---|---|---|---|---|---|---|
| Worsening of disease ( | ||||||
| Clinical–Env. (CEPI) |
| Time-fixed | 0.96 (0.03) | 859 | 13757 | |
| Time-varying | 0.87 (0.06) | 0.17 (0.09) | 862 | 13756 | ||
| Genetic (GPI) |
| Time-fixed | 0.86 (0.04) | 389 | 14227 | |
| Time-varying | 0.53 (0.09) | 0.79 (0.17) | 413 | 14204 | ||
| Clinical–env–genotypic (CEGPI) |
| Time-fixed | 1.00 (0.03) | 1043 | 13572 | |
| Time-varying | 0.92 (0.04) | 0.16 (0.07) | 1050 | 13568 | ||
| Relapses ( | ||||||
| Clinical–Env. (CEPI) |
| Time-fixed | 0.93 (0.03) | 1311 | 7080 | |
| Time-varying | 0.88 (0.03) | 0.17 (0.07) | 1317 | 7076 | ||
| Genetic (GPI) |
| Time-fixed | 0.82 (0.05) | 746 | 7645 | |
| Time-Varying | 0.76 (0.04) | 0.28 (0.11) | 727 | 7637 | ||
| Clinical–env–genotypic (CEGPI) |
| Time-fixed | 1.00 (0.03) | 1358 | 7033 | |
| Time-varying | 0.93 (0.03) | 0.25 (0.07) | 1369 | 7023 | ||
| Relapses and/or worsening of disease ( | ||||||
| Clinical–Env. (CEPI) |
| Time-fixed | 0.93 (0.05) | 1709 | 18162 | |
| Time-varying | 0.97 (0.04) |
| 1712 | 18162 | ||
| Genetic (GPI) |
| Time-fixed | 0.84 (0.04) | 680 | 19192 | |
| Time-Varying | 0.77 (0.05) | 0.24 (0.12) | 684 | 19189 | ||
| Clinical–env–genotypic (CEGPI) |
| Time-fixed | 1.00 (0.03) | 1776 | 18095 | |
| Time-varying | 1.02 (0.04) |
| 1777 | 18097 | ||
Supermodels (): Cox regression performed on CEPI only, or GPI only, or CEPI+GPI.
Super learner (.
number of observations.
NB: The ‘time-fixed’/‘time-varying’ estimates were obtained from a Cox supermodel without/with time-varying effects, respectively. The ‘time-fixed’ estimates are identical to those found in Table 3 above. Adding the time-varying effects improved the performance of the time-vary supermodels over the time-fixed counterparts in terms of model chi-square () and AIC.
Figure 2Time-dependent regression effects of the prognostic index. From left to right is the risk for WoD, RRE and RWoD, respectively.
Figure 3Performance of the supermodels in predicting RWoD at diagnosis. Prognostic errors and error-reduction probabilities based on the utility of the prognostic index.
Figure 4Prospective accuracies of Landmark supermodels. Prospective accuracies of Landmark supermodels in predicting RWoD within the next 5 years from diagnosis.
Figure 5Dynamic probability of having an RWoD event within the next 5 years. Shown based on the Landmark approach.
Figure 6Cross-validated Kaplan–Meier survival estimates based on the effects of the CEGPI. From left to right: WoD, RRE and RWoD, respectively.