| Literature DB >> 28833412 |
David M Hughes1, Riham El Saeiti1,2, Marta García-Fiñana1.
Abstract
Longitudinal discriminant analysis (LoDA) can be used to classify patients into prognostic groups based on their clinical history, which often involves longitudinal measurements of various clinically relevant markers. Patients' longitudinal data is first modelled using multivariate generalised linear mixed models, allowing markers of different types (e.g. continuous, binary, counts) to be modelled simultaneously. We describe three approaches to calculating a patient's posterior group membership probabilities which have been outlined in previous studies, based on the marginal distribution of the longitudinal markers, conditional distribution and distribution of the random effects. Here we compare the three approaches, first using data from the Mayo Primary Biliary Cirrhosis study and then by way of simulation study to explore in which situations each of the three approaches is expected to give the best prediction. We demonstrate situations in which the marginal or random-effects approach perform well, but find that the conditional approach offers little extra information to the random-effects and marginal approaches.Entities:
Keywords: conditional distribution; longitudinal discriminant analysis; marginal distribution; multivariate; random-effects distribution
Mesh:
Year: 2017 PMID: 28833412 PMCID: PMC5873537 DOI: 10.1002/bimj.201700013
Source DB: PubMed Journal: Biom J ISSN: 0323-3847 Impact factor: 2.207
Figure 1Observed longitudinal profiles of albumin (mg/dl), log(bilirubin) (log(mg/dl)), platelet counts and blood vessel malformation for patients who are known to be alive at 5 years (Group 0, solid lines) and who die between 2.5 and 5 years (Group 1, dashed lines); the thick lines show fitted mean over time
Prediction accuracy from leave‐one‐out cross‐validation of random‐effects, marginal and conditional prediction for PBC data
| Random | Marginal | Conditional | |
|---|---|---|---|
| Cut‐off | 0.98 | 0.21 | 0.12 |
| Sensitivity | 0.75 | 0.78 | 0.61 |
| Specificity | 0.78 | 0.81 | 0.67 |
| PCC | 0.77 | 0.81 | 0.66 |
| AUC | 0.81 | 0.85 | 0.63 |
| PPV | 0.46 | 0.51 | 0.32 |
| NPV | 0.92 | 0.94 | 0.94 |
Figure 2Receiver‐operating characteristic curves of the LoDA using the random‐effects (solid), marginal (dotted) and conditional (dot‐dashed) prediction methods for PBC data
Parameter estimates for the PBC data and the modifications used for each simulation scenario
| Group 0 | Group 1 | |||||
|---|---|---|---|---|---|---|
| PBC Data | Scenario 1 | Scenario 2 | PBC Data | Scenario 1 | Scenario 2 | |
|
| ||||||
| E[Albumin:Intercept] | 3.69 | 3.69 | 3.00 | 3.39 | 3.39 | 3.00 |
| E[Albumin:slope] | −6.83 | −6.83 | 0.00 | −1.44 | −1.44 | 0.00 |
| SD[Albumin:Intercept] | 2.73 | 2.64 | 6.50 | 2.64 | 2.64 | 6.50 |
| Corr[Albumin:Intercept,Albumin:slope] | −8.60 | −6.46 | −6.46 | −6.46 | −6.46 | −6.46 |
| Corr[Albumin:Intercept,log(Bilirubin):Intercept] | −2.48 | −1.97 | −1.97 | −1.97 | −1.97 | −1.97 |
| Corr[Albumin:Intercept,log(Bilirubin):slope] | −1.10 | 2.11 | 2.11 | 2.11 | 2.11 | 2.11 |
| Corr[Albumin:Intercept,Platelet:Intercept] | 1.82 | 1.91 | 1.91 | 1.91 | ||
| Corr[Albumin:Intercept,Platelet:slope] | 5.72 | 1.09 | 1.09 | 1.09 | ||
| Corr[Albumin:Intercept,Blood vessel malformation:Intercept] | −2.27 | −3.48 | −3.48 | −3.48 | ||
| SD[Albumin:slope] | 4.30 | 7.76 | 7.76 | 7.76 | 7.76 | 7.76 |
| Corr[Albumin:slope,log(Bilirubin):Intercept] | −2.91 | 1.57 | 1.57 | 1.57 | 1.57 | 1.57 |
| Corr[Albumin:slope,log(Bilirubin):slope] | −6.50 | −2.33 | −2.33 | −2.33 | −2.33 | −2.33 |
| Corr[Albumin:slope,Platelet:Intercept] | 8.89 | −2.57 | −2.57 | −2.57 | ||
| Corr[Albumin:slope,log(Bilirubin):slope] | 2.96 | −2.60 | −2.60 | −2.60 | ||
| Corr[Albumin:slope,Blood vessel malformation:Intercept] | −2.93 | 2.27 | 2.27 | 2.27 | ||
| SD[Albumin:residual] | 3.18 | 3.18 | 3.14 | 3.14 | 3.14 | 1.59 |
|
| ||||||
| E[log(Bilirubin):Intercept] | 2.13 | 2.13 | 1.00 | 1.23 | 1.23 | 1.00 |
| E[log(Bilirubin):slope] | 9.94 | 9.94 | 0.00 | 2.38 | 2.38 | 0.00 |
| SD[log(Bilirubin):Intercept] | 6.88 | 8.45 | 1.12 | 8.45 | 8.45 | 1.12 |
| Corr[log(Bilirubin):Intercept,log(Bilirubin):slope] | 2.32 | −1.75 | −1.75 | −1.75 | −1.75 | −1.75 |
| Corr[log(Bilirubin):Intercept,Platelet:Intercept] | −1.66 | 2.47 | 2.47 | 2.47 | ||
| Corr[log(Bilirubin):Intercept,Platelet:slope] | −2.04 | −1.87 | −1.87 | −1.87 | ||
| Corr[log(Bilirubin):Intercept,Blood vessel malformation:Intercept] | 3.42 | 2.70 | 2.70 | 2.70 | ||
| SD[log(Bilirubin):slope] | 1.12 | 1.49 | 1.49 | 1.49 | 1.49 | 1.49 |
| Corr[log(Bilirubin):slope,Platelet:Intercept] | 1.44 | −1.69 | −1.69 | −1.69 | ||
| Corr[log(Bilirubin):slope,Platelet:slope] | −2.40 | 1.25 | 1.25 | 1.25 | ||
| Corr[log(Bilirubin):slope,Blood vessel malformation:Intercept] | 3.05 | 8.13 | 8.13 | 8.13 | ||
| SD[log(Bilirubin):residual] | 3.38 | 3.38 | 3.95 | 3.96 | 3.96 | 1.69 |
|
| ||||||
| E[Platelet:Intercept] | 5.54 | 5.54 | 5.46 | 5.46 | ||
| E[Platelet:slope] | −4.29 | −4.29 | −1.14 | −1.14 | ||
| SD[Platelet:Intercept] | 3.73 | 3.45 | 3.45 | 3.45 | ||
| Corr[Platelet:Intercept,Platelet:slope] | −4.64 | 6.14 | 6.14 | 6.14 | ||
| Corr[Platelet:Intercept,Blood vessel malformation:Intercept] | −7.41 | −2.48 | −2.48 | −2.48 | ||
| SD[Platelet:slope] | 5.66 | 1.51 | 1.51 | 1.51 | ||
| Corr[Platelet:slope,Blood vessel malformation:Intercept] | −1.68 | −8.03 | −8.03 | −8.03 | ||
|
| ||||||
| E[Blood vessel malformation:Intercept] | −2.54 | −2.54 | −6.81 | −6.81 | ||
| Blood vessel malformation:slope | 1.46 | 1.46 | 4.81 | 4.81 | ||
| SD[Blood vessel malformation:Intercept] | 3.00 | 1.88 | 1.88 | 1.88 | ||
Note. Blank entries occur when the parameter was not used in Scenario 2.
Simulation study Scenario 1: Posterior means, highest posterior density (HPD) intervals, bias, standard deviation (SD), mean square error (MSE) and coverage for the fixed and random effects
| Group 0 | Group 1 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Posterior Mean | 95% HPD Interval | SD | Bias | MSE | Coverage | Posterior Mean | 95% HPD Interval | SD | Bias | MSE | Coverage | |
|
| ||||||||||||
| E[Albumin:Intercept] | 3.69 | (3.64,3.74) | 4.03 | 4.22 | 6.35 | 0.93 | 3.39 | (3.29,3.49) | 4.02 | −3.19 | 2.08 | 0.98 |
| E[Albumin:slope] | −7.03 | (−9.35,−4.68) | 3.47 | −2.02 | 2.15 | 0.88 | −1.42 | (−1.91,−0.93) | 2.76 | 2.65 | 6.85 | 0.94 |
| SD[Albumin:Intercept] | 2.63 | (2.22,3.06) | 1.81 | −6.12 | 4.10 | 0.98 | 2.58 | (1.71,3.46) | 3.80 | −5.75 | 2.77 | 0.89 |
| Corr[Albumin:Intercept,Albumin:slope] | 2.09 | (−5.25,5.89) | 2.45 | 8.55 | 2.28 | 1.00 | 3.96 | (−6.24,7.09) | 1.43 | 1.04 | 3.08 | 0.99 |
| Corr[Albumin:Intercept,log(Bilirubin):Intercept] | −1.75 | (−3.49,0.02) | 6.79 | 2.21 | 6.34 | 0.99 | −1.61 | (−4.97,1.81) | 1.14 | 3.56 | 2.87 | 0.97 |
| Corr[Albumin:Intercept,log(Bilirubin):slope] | 1.73 | (−1.67,5.06) | 1.35 | −3.87 | 2.44 | 0.96 | 5.84 | (−5.94,6.94) | 1.38 | −1.53 | 5.29 | 0.98 |
| Corr[Albumin:Intercept,Platelet:Intercept] | 1.52 | (−0.24,3.27) | 8.84 | −3.88 | 9.83 | 0.93 | 1.12 | (−2.25,4.44) | 1.12 | −7.95 | 3.13 | 0.94 |
| Corr[Albumin:Intercept,Platelet:slope] | 6.59 | (−1.14,2.46) | 8.38 | −4.35 | 9.15 | 0.93 | 4.21 | (−3.00,3.86) | 1.17 | −6.73 | 3.86 | 0.91 |
| Corr[Albumin:Intercept,Blood Vessel Malformations:Intercept] | −3.01 | (−5.22,‐0.76) | 9.16 | 4.69 | 1.36 | 0.95 | −2.58 | (−6.39,1.37) | 1.31 | 9.01 | 4.24 | 0.96 |
| SD[Albumin:slope] | 3.47 | (0.65,7.13) | 6.27 | −4.30 | 2.53 | 0.45 | 3.98 | (0.36,9.15) | 3.65 | −3.79 | 2.46 | 0.57 |
| Corr[Albumin:slope,log(Bilirubin):Intercept] | −2.06 | (−5.58,5.18) | 2.22 | −2.22 | 1.29 | 0.99 | −1.31 | (−6.54,6.34) | 1.23 | −1.47 | 1.74 | 0.99 |
| Corr[Albumin:slope,log(Bilirubin):slope] | −2.65 | (−6.18,5.74) | 2.55 | 2.06 | 6.09 | 0.99 | ‐1.03 | (−7.62,7.46) | 1.14 | 2.23 | 5.82 | 1.00 |
| Corr[Albumin:slope,Platelet:Intercept] | −1.69 | (−6.92,3.87) | 3.54 | 8.78 | 2.56 | 1.00 | −5.97 | (−6.86,5.84) | 1.30 | 1.97 | 5.69 | 0.99 |
| Corr[Albumin:slope,Platelet:slope] | −1.54 | (−6.54,3.73) | 3.06 | 1.07 | 3.29 | 0.99 | −8.70 | (−6.94,5.46) | 1.39 | 1.73 | 5.08 | 0.98 |
| Corr[Albumin:slope,Blood Vessel Malformations:Intercept] | 1.01 | (−4.86,6.62) | 3.09 | −1.26 | 3.14 | 0.98 | 2.11 | (−6.62,6.96) | 1.49 | −2.06 | 6.47 | 0.99 |
|
| ||||||||||||
| E[log(Bilirubin):Intercept] | 1.67 | (−1.06,1.39) | 4.04 | −4.57 | 3.71 | 0.96 | 1.24 | (0.99,1.49) | 8.51 | 9.60 | 1.62 | 0.96 |
| E[log(Bilirubin):slope] | 1.00 | (0.69,1.32) | 1.27 | 8.22 | 3.41 | 0.91 | 2.31 | (1.65,2.99) | 3.44 | −6.91 | 1.33 | 0.94 |
| SD[log(Bilirubin):Intercept] | 8.42 | (7.52,9.35) | 3.03 | −3.32 | 2.37 | 0.97 | 8.38 | (0.66,1.03) | 6.08 | −7.28 | 7.98 | 0.96 |
| Corr[log(Bilirubin):Intercept,log(Bilirubin):slope] | −1.33 | (−4.25,1.78) | 1.11 | 4.12 | 2.26 | 0.93 | −4.34 | (−6.45,5.95) | 1.31 | 1.31 | 4.00 | 0.99 |
| Corr[log(Bilirubin):Intercept,Platelet:Intercept] | 2.35 | (0.97,3.70) | 4.56 | −1.25 | 4.23 | 0.98 | 2.12 | (−0.60,4.78) | 8.66 | −3.48 | 1.67 | 0.96 |
| Corr[log(Bilirubin):Intercept,Platelet:slope] | −1.86 | (−3.28,−0.42) | 4.66 | 9.56 | 5.38 | 0.93 | −1.49 | (−4.27,1.34) | 9.22 | 3.77 | 2.06 | 0.94 |
| Corr[log(Bilirubin):Intercept,Blood Vessel Malformations:Intercept] | 2.73 | (0.82,4.60) | 6.34 | 2.29 | 8.31 | 0.96 | 2.48 | (−0.88,5.74) | 1.12 | −2.19 | 2.70 | 0.95 |
| SD[log(Bilirubin):slope] | 1.20 | (0.64,1.72) | 4.52 | −2.93 | 2.17 | 0.76 | 7.03 | (0.05,1.55) | 4.76 | −7.90 | 8.64 | 0.50 |
| Corr[log(Bilirubin):slope,Platelet:Intercept] | −1.75 | (−4.77,1.28) | 1.12 | −5.37 | 1.87 | 0.94 | −6.26 | (−6.68,5.62) | 1.30 | 1.07 | 3.67 | 0.99 |
| Corr[log(Bilirubin):slope,Platelet:slope] | 1.10 | (−1.88,4.07) | 1.01 | −1.48 | 2.24 | 0.94 | 6.57 | (−5.44,6.61) | 1.26 | −5.91 | 2.35 | 1.00 |
| Corr[log(Bilirubin):slope,Blood Vessel Malformations:Intercept] | 1.92 | (−3.55,3.97) | 1.35 | 1.11 | 3.06 | 0.95 | 1.62 | (−6.44,6.77) | 1.41 | 8.09 | 2.87 | 0.99 |
|
| ||||||||||||
| E[Platelet:Intercept] | 5.54 | (5.49,5.59) | 1.58 | 2.20 | 5.03 | 0.96 | 5.46 | (5.36,5.55) | 3.12 | −4.86 | 1.70 | 0.98 |
| E[Platelet:slope] | −4.29 | (−6.45,−2.14) | 7.02 | 4.70 | 1.25 | 0.94 | −1.14 | (−1.59,‐0.70) | 1.42 | −3.02 | 4.33 | 0.95 |
| SD[Platelet:Intercept] | 3.49 | (3.15,3.85) | 1.13 | 4.27 | 3.38 | 0.96 | 3.49 | (2.81,4.23) | 2.34 | 4.34 | 1.39 | 0.94 |
| Corr[Platelet:Intercept,Platelet:slope] | 6.66 | (−0.75,2.08) | 4.65 | 5.20 | 5.77 | 0.93 | 6.77 | (−2.11,3.45) | 9.28 | 6.25 | 1.59 | 0.97 |
| Corr[Platelet:Intercept,Blood vessel malformations:Intercept] | −2.57 | (−4.40,−0.71) | 6.11 | −9.29 | 9.27 | 0.94 | −2.23 | (−5.42,1.05) | 1.04 | 2.47 | 2.60 | 0.97 |
| SD[Platelet:slope] | 1.52 | (1.37,1.69) | 5.20 | 9.68 | 6.37 | 0.95 | 1.55 | (1.23,1.90) | 1.09 | 3.74 | 2.92 | 0.97 |
| Corr[Platelet:slope,Blood vessel malformations:Intercept] | −7.64 | (−2.72,1.20) | 6.45 | 3.96 | 9.90 | 0.96 | −7.61 | (−4.16,2.68) | 1.13 | 4.24 | 2.39 | 0.97 |
|
| ||||||||||||
| E[Blood vessel malformations:Intercept] | ‐2.55 | (−3.12,−2.00) | 1.93 | −7.35 | 1.19 | 0.89 | −6.74 | (−1.48,0.11) | 2.57 | 6.45 | 1.82 | 0.92 |
| Blood vessel malformations:Slope | 1.37 | (−1.03,3.78) | 7.98 | −8.53 | 1.84 | 0.88 | 4.63 | (0.67,8.66) | 1.35 | −1.77 | 3.96 | 0.95 |
| SD[Blood vessel malformations:Intercept] | 1.89 | (1.39,2.40) | 1.84 | 4.76 | 1.02 | 0.89 | 1.82 | (1.02,2.69) | 3.42 | −5.73 | 2.71 | 0.91 |
Note. These measurements are the average of 100 simulations.
Scenario 1 prediction accuracy from leave‐one‐out cross‐validation of random‐effects, marginal and conditional prediction; the reported values are the averages over the 100 simulated data sets
| Random | Marginal | Conditional | |
|---|---|---|---|
| Cut‐off | 0.81 | 0.19 | 0.12 |
| Sensitivity | 0.93 | 0.85 | 0.70 |
| Specificity | 0.71 | 0.85 | 0.73 |
| PCC | 0.75 | 0.85 | 0.73 |
| AUC | 0.84 | 0.91 | 0.76 |
| PPV | 0.57 | 0.59 | 0.40 |
| NPV | 0.98 | 0.96 | 0.96 |
Figure 3Receiver operating characteristic curves of the LoDA using the random‐effects (solid), marginal (dotted) and conditional (dot‐dashed) prediction methods for Scenario 1
Simulation study Scenario 2: Posterior means, highest posterior density (HPD) intervals, bias, standard deviation (SD), mean square error (MSE) and coverage for the fixed and random effects
| Group 0 | Group 1 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Posterior Mean | 95% HPD Interval | SD | Bias | MSE | Coverage | Posterior Mean | 95% HPD Interval | SD | Bias | MSE | Coverage | |
|
| ||||||||||||
| E[Albumin:Intercept] | 3.00 | (2.97,3.03) | 2.80 | −2.04 | 3.73 | 0.89 | 3.00 | (2.96,3.04) | 1.89 | 3.45 | 4.00 | 0.92 |
| E[Albumin:slope] | −9.63 | (−2.37,2.10) | 2.12 | −9.63 | 1.49 | 0.92 | −3.08 | (−3.01,2.38) | 1.31 | −3.08 | 1.88 | 0.92 |
| SD[Albumin:Intercept] | 4.19 | (0.34,9.07) | 4.21 | −2.31 | 1.06 | 0.84 | 4.28 | (0.68,8.68) | 2.10 | −2.22 | 1.19 | 0.76 |
| Corr[Albumin:Intercept,Albumin:slope] | −7.48 | (−8.53,8.51) | 2.02 | 5.72 | 1.26 | 1.00 | 2.61 | (−8.02,8.50) | 1.18 | 9.07 | 2.60 | 1.00 |
| Corr[Albumin:Intercept,log(Bilirubin):Intercept] | 1.50 | (−8.38,8.64) | 2.32 | 2.12 | 5.79 | 1.00 | 3.21 | (−8.46,8.64) | 1.10 | 2.00 | 5.28 | 1.00 |
| Corr[Albumin:Intercept,log(Bilirubin):slope] | 3.21 | (−7.43,7.95) | 2.46 | −1.79 | 7.77 | 0.97 | 1.01 | (−6.29,7.96) | 1.57 | −1.10 | 5.22 | 0.98 |
| SD[Albumin:slope] | 2.13 | (0.07,5.38) | 2.54 | −5.63 | 3.38 | 0.16 | 2.91 | (0.56,5.79) | 1.32 | −4.85 | 2.75 | 0.24 |
| Corr[Albumin:slope,log(Bilirubin):Intercept] | 1.18 | (−8.59,8.73) | 2.35 | 1.02 | 8.28 | 1.00 | −2.69 | (−8.80,8.37) | 1.06 | −2.84 | 8.65 | 1.00 |
| Corr[Albumin:slope,log(Bilirubin):slope] | −4.95 | (−8.46,7.84) | 2.98 | 1.83 | 6.26 | 0.99 | −8.38 | (−7.91,6.32) | 1.50 | 1.49 | 6.69 | 0.98 |
|
| ||||||||||||
| E[log(Bilirubin):Intercept] | 1.00 | (0.96,1.04) | 3.05 | −3.92 | 3.58 | 0.96 | 1.00 | (0.96,1.04) | 1.74 | 7.55 | 3.71 | 0.92 |
| E[log(Bilirubin):slope] | 8.35 | (−3.20,3.37) | 1.95 | 8.35 | 2.18 | 0.98 | 3.08 | (−3.69,4.28) | 1.50 | 3.08 | 4.18 | 0.93 |
| SD[log(Bilirubin):Intercept] | 3.35 | (0.03,9.02) | 3.39 | 2.23 | 1.11 | 0.99 | 1.79 | (0.00,5.29) | 1.36 | 6.65 | 2.21 | 0.98 |
| Corr[log(Bilirubin):Intercept,log(Bilirubin):slope] | −1.11 | (−8.24,8.29) | 2.61 | 1.64 | 5.96 | 1.00 | −1.66 | (−8.36,8.30) | 1.48 | 1.58 | 4.98 | 0.99 |
| SD[log(Bilirubin):slope] | 1.02 | (0.59,1.43) | 2.67 | −4.74 | 2.95 | 0.38 | 1.10 | (0.79,1.43) | 1.06 | −3.93 | 1.82 | 0.39 |
Note. These measurements are the average of 100 simulations.
Scenario 2 prediction accuracy from leave‐one‐out cross‐validation of random‐effects, marginal and conditional prediction; the reported values are the averages over the 100 simulated data sets
| Random | Marginal | Conditional | |
|---|---|---|---|
| Cut‐off | 0.56 | 0.26 | 0.58 |
| Sensitivity | 0.78 | 0.92 | 0.92 |
| Specificity | 0.70 | 0.89 | 0.88 |
| PCC | 0.72 | 0.90 | 0.89 |
| AUC | 0.74 | 0.96 | 0.95 |
| PPV | 0.55 | 0.69 | 0.66 |
| NPV | 0.90 | 0.98 | 0.93 |
Figure 4Receiver operating characteristic curves of the LoDA using the random‐effects (solid), marginal (dotted) and conditional (dot‐dashed) prediction methods for Scenario 2
Figure 5Histograms showing the sensitivity, specificity, PCC and AUC of each of the three approaches for each of the 100 simulated data sets under Scenario 2