| Literature DB >> 28762546 |
David M Hughes1, Arnošt Komárek2, Laura J Bonnett1, Gabriela Czanner1,3, Marta García-Fiñana1.
Abstract
Recently developed methods of longitudinal discriminant analysis allow for classification of subjects into prespecified prognostic groups using longitudinal history of both continuous and discrete biomarkers. The classification uses Bayesian estimates of the group membership probabilities for each prognostic group. These estimates are derived from a multivariate generalised linear mixed model of the biomarker's longitudinal evolution in each of the groups and can be updated each time new data is available for a patient, providing a dynamic (over time) allocation scheme. However, the precision of the estimated group probabilities differs for each patient and also over time. This precision can be assessed by looking at credible intervals for the group membership probabilities. In this paper, we propose a new allocation rule that incorporates credible intervals for use in context of a dynamic longitudinal discriminant analysis and show that this can decrease the number of false positives in a prognostic test, improving the positive predictive value. We also establish that by leaving some patients unclassified for a certain period, the classification accuracy of those patients who are classified can be improved, giving increased confidence to clinicians in their decision making. Finally, we show that determining a stopping rule dynamically can be more accurate than specifying a set time point at which to decide on a patient's status. We illustrate our methodology using data from patients with epilepsy and show how patients who fail to achieve adequate seizure control are more accurately identified using credible intervals compared to existing methods.Entities:
Keywords: allocation scheme; credible intervals; longitudinal discriminant analysis
Mesh:
Year: 2017 PMID: 28762546 PMCID: PMC5655752 DOI: 10.1002/sim.7397
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 2Marginal group membership probabilities over time for patient A (top panel). Histograms showing the posterior distribution of the probability of being in the refractory group are shown for each clinic visit below the top panel. The dotted vertical line denotes the time at which the patient was classified as refractory using the dynCI scheme with 99% credible intervals. The dotted horizontal line shows the required cutoff of 0.83 [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 3Marginal group membership probabilities over time for Patient B (top panel). Histograms showing the posterior distribution of the probability of being in the refractory group are shown for each clinic visit below the top panel. The dotted vertical line denotes the time at which the patient was classified as refractory using the dynCI scheme with 99% credible intervals. The dotted horizontal line shows the required cutoff of 0.83 [Colour figure can be viewed at wileyonlinelibrary.com]
Classification for the marginal prediction with a cutoff of 0.83 using A, the dynLoDA scheme, and the dynCI prediction with a level of B, 99%; C, 95%; D, 90%; and E, 50%
| (A) | |||||
|---|---|---|---|---|---|
| Classification | |||||
| Remission | Refractory | Total | |||
| True | Remission | 1384 | 126 | 1510 | |
| Status | Refractory | 12 | 163 | 175 | |
| Total | 1396 | 289 | 1685 | ||
| (B) 99% HPD CI | |||||
| Classification | |||||
| Remission | Refractory | Unclassified | Total | ||
| True | Remission | 1368 | 67 | 75 | 1510 |
| Status | Refractory | 8 | 151 | 16 | 175 |
| Total | 1376 | 218 | 91 | 1685 | |
| (C) 95% HPD CI | |||||
| True | Remission | 1368 | 90 | 52 | 1510 |
| Status | Refractory | 9 | 155 | 11 | 175 |
| Total | 1377 | 245 | 63 | 1685 | |
| (D) 90% HPD CI | |||||
| True | Remission | 1368 | 98 | 44 | 1510 |
| Status | Refractory | 10 | 156 | 9 | 175 |
| Total | 1378 | 254 | 53 | 1685 | |
| (E) 50% HPD CI | |||||
| True | Remission | 1361 | 131 | 18 | 1510 |
| Status | Refractory | 12 | 162 | 1 | 175 |
| Total | 1373 | 293 | 19 | 1685 | |
Abbreviations: CI, credible interval; HPD, highest posterior density.
A comparison of the prediction accuracy using dynLoDA and dynCI schemes
|
|
|
|
| ||
|---|---|---|---|---|---|
|
| 99% HPDCI | 95% HPDCI | 90% HPDCI | 50% HPDCI | |
| Cutoff | 0.83 | 0.83 | 0.83 | 0.83 | 0.83 |
|
|
|
|
|
|
|
| Sensitivity | 0.93 | 0.86 | 0.89 | 0.89 | 0.93 |
|
|
|
|
|
|
|
| Specificity | 0.92 | 0.91 | 0.91 | 0.91 | 0.90 |
|
|
|
|
|
|
|
| PCC | 0.92 | 0.90 | 0.90 | 0.90 | 0.90 |
| AUC | 0.97 | 0.98 | 0.98 | 0.97 | 0.97 |
|
|
|
|
|
|
|
| NPV | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 |
| Proportion unclassified | 0.00 | 0.05 | 0.04 | 0.03 | 0.01 |
| Mean lead time (days) | 675 | 565 | 595 | 614 | 661 |
| Mean prediction time (days) | 857 | 972 | 942 | 918 | 872 |
Note. The sensitivities and specificities highlighted in bold are calculated without considering the unclassified patients.
AUC, area under curve; HPDCI, highest posterior density credible interval; NPV, negative predictive value; PCC, probability of correct classification; PPV, positive predictive value.
Figure 1The longitudinal observations of 2 patients from the Standard and New Antiepleptic Drugs Dataset. Patient A was a 17‐year‐old male and patient B was an 8‐year‐old female. Both had generalised epilepsy diagnosed before June 6, 2001 [Colour figure can be viewed at wileyonlinelibrary.com]
Prediction accuracy using different allocation rules
|
| Youden | Max PCC | Max PPV | Max NPV |
| Youden | Max PCC | Max PPV | Max NPV | |
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| |||||||||
| Cutoff | 0.83 | 0.83 | 0.99 | 0.99 | 0.01 | 0.83 | 0.83 | 0.99 | 0.99 | 0.01 |
| Sensitivity | 0.95 | 0.95 | 0.84 | 0.84 | 1.00 | 0.93 | 0.93 | 0.77 | 0.77 | 1.00 |
| Specificity | 0.95 | 0.95 | 0.98 | 0.98 | 0.11 | 0.92 | 0.92 | 0.97 | 0.97 | 0.12 |
| PCC | 0.95 | 0.95 | 0.97 | 0.97 | 0.24 | 0.92 | 0.92 | 0.95 | 0.95 | 0.21 |
| AUC | 0.98 | 0.95 | 0.95 | 0.95 | 0.95 | 0.97 | 0.97 | 0.97 | 0.97 | 0.97 |
| PPV | 0.69 | 0.69 | 0.80 | 0.80 | 0.16 | 0.56 | 0.56 | 0.77 | 0.77 | 0.12 |
| NPV | 0.99 | 0.99 | 0.98 | 0.98 | 1.00 | 0.99 | 0.99 | 0.97 | 0.97 | 1.00 |
| Mean mead time, days | 565 | 565 | 378 | 378 | 1427 | 675 | 675 | 432 | 432 | 1427 |
| Mean prediction time, days | 972 | 972 | 1162 | 1162 | 108 | 857 | 857 | 1106 | 1106 | 107 |
|
|
| |||||||||
| Cutoff | 0.58 | 0.49 | 0.97 | 0.99 | 0.01 | 0.10 | 0.10 | 1.00 | 0.54 | 0.01 |
| Sensitivity | 0.91 | 0.94 | 0.65 | 0.58 | 1.00 | 0.83 | 0.83 | 0.00 | 0.16 | 0.95 |
| Specificity | 0.89 | 0.86 | 0.98 | 0.98 | 0.33 | 0.73 | 0.73 | 1.00 | 0.96 | 0.53 |
| PCC | 0.89 | 0.87 | 0.94 | 0.94 | 0.41 | 0.74 | 0.74 | 0.90 | 0.88 | 0.57 |
| AUC | 0.96 | 0.96 | 0.96 | 0.96 | 0.96 | 0.83 | 0.83 | 0.83 | 0.83 | 0.83 |
| PPV | 0.52 | 0.47 | 0.79 | 0.81 | 0.16 | 0.26 | 0.26 | 0.32 | 0.19 | |
| NPV | 0.99 | 0.99 | 0.96 | 0.95 | 1.00 | 0.97 | 0.97 | 0.90 | 0.91 | 0.99 |
| Mean lead time, days | 703 | 799 | 386 | 337 | 1301 | 1250 | 1250 | 1194 | 1256 | |
| Mean prediction time, days | 831 | 735 | 1148 | 1202 | 233 | 280 | 280 | 312 | 277 | |
|
|
| |||||||||
| Cutoff | 0.31 | 0.21 | 0.91 | 0.95 | 0.01 | 0.87 | 0.77 | 0.87 | 0.99 | 0.16 |
| Sensitivity | 0.79 | 0.87 | 0.26 | 0.20 | 0.98 | 0.76 | 0.81 | 0.76 | 0.42 | 0.95 |
| Specificity | 0.67 | 0.62 | 0.95 | 0.97 | 0.29 | 0.67 | 0.62 | 0.67 | 0.86 | 0.31 |
| PCC | 0.69 | 0.67 | 0.80 | 0.80 | 0.44 | 0.71 | 0.70 | 0.71 | 0.67 | 0.58 |
| AUC | 0.80 | 0.80 | 0.80 | 0.80 | 0.80 | 0.75 | 0.75 | 0.75 | 0.75 | 0.75 |
| PPV | 0.40 | 0.38 | 0.58 | 0.62 | 0.28 | 0.63 | 0.61 | 0.63 | 0.69 | 0.50 |
| NPV | 0.92 | 0.95 | 0.82 | 0.81 | 0.98 | 0.79 | 0.81 | 0.79 | 0.67 | 0.90 |
| Mean lead time, days | 929 | 937 | 887 | 866 | 943 | 588 | 593 | 588 | 546 | 615 |
| Mean prediction time, days | 599 | 595 | 653 | 662 | 590 | 953 | 947 | 953 | 998 | 927 |
|
| ||||||||||
| Cutoff | 0.99 | 0.93 | 0.91 | 0.96 | 0.02 | |||||
| Sensitivity | 0.76 | 0.89 | 0.90 | 0.85 | 0.99 | |||||
| Specificity | 0.57 | 0.50 | 0.49 | 0.53 | 0.06 | |||||
| PCC | 0.71 | 0.77 | 0.78 | 0.76 | 0.71 | |||||
| AUC | 0.70 | 0.70 | 0.70 | 0.70 | 0.70 | |||||
| PPV | 0.81 | 0.80 | 0.80 | 0.81 | 0.71 | |||||
| NPV | 0.51 | 0.67 | 0.69 | 0.61 | 0.80 | |||||
| Mean lead time, days | 248 | 279 | 286 | 266 | 297 | |||||
| Mean prediction time, days | 1309 | 1280 | 1272 | 1291 | 1261 | |||||
Note. Blank entries represent cases where no patients were classified as refractory. A 99% credibility interval was used for the dynCI scheme.
AUC, area under curve; NPV, negative predictive value; PCC, probability of correct classification; PPV, positive predictive value.
Figure 4Receiver operating characteristic curves for each classification rule. The dynCI results are based on the classified patients with a 99% credibility interval [Colour figure can be viewed at wileyonlinelibrary.com]