| Literature DB >> 22479598 |
Vanya M C A Van Belle1, Ben Van Calster, Dirk Timmerman, Tom Bourne, Cecilia Bottomley, Lil Valentin, Patrick Neven, Sabine Van Huffel, Johan A K Suykens, Stephen Boyd.
Abstract
BACKGROUND: Over time, methods for the development of clinical decision support (CDS) systems have evolved from interpretable and easy-to-use scoring systems to very complex and non-interpretable mathematical models. In order to accomplish effective decision support, CDS systems should provide information on how the model arrives at a certain decision. To address the issue of incompatibility between performance, interpretability and applicability of CDS systems, this paper proposes an innovative model structure, automatically leading to interpretable and easily applicable models. The resulting models can be used to guide clinicians when deciding upon the appropriate treatment, estimating patient-specific risks and to improve communication with patients. METHODS ANDEntities:
Mesh:
Year: 2012 PMID: 22479598 PMCID: PMC3315538 DOI: 10.1371/journal.pone.0034312
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Advantages and disadvantages of different classification methods in clinical decision making.
| LR | Nomogramafter LR | Score systemafter LR | (LS-)SVM | (LS-)SVMadditive kernel | Rule extraction after (LS-)SVM | ICS | |
| Interpretability | + | ++ | +++ | − − | + | − | +++ |
| Speed when used manually | − − | − | + | − − − | −− | + | ++ |
| Communication to patients | − | − | ++ | − − | − | + | ++ |
| Usable by patients | − | + | ++ | − − | − | ++ | +++ |
| Underlying model structure | simple | simple | simple | very flexible | flexible | flexible | flexible |
| Applicability | |||||||
|
| − | + | ++ | − − − | − − | ++ | +++ |
|
| + | + | + | + | + | + | ++ |
| Post-processing | yes | yes | no |
Logistic regression.
(Least-squares) Support Vector Machine.
Artificial Neural Network.
Post-processing in order to obtain interpretable and easily applicable models.
Figure 1Illustration of the effect of iteratively reweighted
regularization. The unweighted model results in the black solid functional form. After iteratively reweighted regularization, the estimated functional form becomes much sparser (see gray dashed line). Small and clinically irrelevant intervals are removed from the functional form.
Properties of different model implementations for clinical use.
| Representation | Table or figure | Paper or software | Color representation | Illustrative example |
| 1 | Table | Paper | No |
|
| 2 | Figure | Paper | No |
|
| 3 | Figure | Paper | No |
|
| 4 | Table | Software | No |
|
| 5 | Figure | Software | No |
|
| 6 | figure | Software | Yes |
|
Illustration of a table-based representation which can be filled out by hand for the ICS index in diagnosing malignancies in adnexal masses.
| Question | Points |
|
| |
| between 40 (included) and 60 years old? | 1 |
| older than 60 (included) years old? | 3 |
|
| |
| between 40 (included) and 75 mm? | 1 |
| between 75 (included) and 95 mm? | 3 |
| between 95 (included) and 140 mm? | 4 |
| between 140 (included) and 200 mm? | 5 |
| larger than 200 (included) mm? | 8 |
|
| |
| between 0.25 (included) and 0.55? | 4 |
| between 0.55 (included) and 0.9? | 7 |
| larger than 0.9 (included)? | 8 |
|
| |
| 4 or more papillations? | 4 |
|
| |
| yes | 3 |
|
| |
| irregular? | 3 |
|
| |
| yes | −5 |
|
| |
| 2? | 1 |
| 3? | 3 |
| 4? | 5 |
|
| |
| yes | 4 |
|
| |
| between 10 (included) and 20 mm? | 2 |
| more than 20 mm (included)? | 4 |
For each variable several questions, corresponding to the different variables intervals, are posed. If the answer to the question is yes, the points in the last column, need to be added to the score. The software based version is provided as Supporting Information: movie1.
Link between the scores obtained from Table 3 and the estimate of the risk.
| Score | Risk |
| ≤1 | <0.001 |
| 2 to 4 | 0.01 |
| 5 | 0.04 |
| 6 to 9 | 0.06 |
| 10 to 11 | 0.10 |
| 12 | 0.12 |
| 13 | 0.24 |
| 14 | 0.36 |
| 15 | 0.54 |
| 16 to 17 | 0.63 |
| 18 to 19 | 0.69 |
| 20 to 23 | 0.91 |
| 24 to 25 | 0.93 |
| 26 to 27 | 0.96 |
| 28 to 30 | 0.98 |
| ≥31 | >0.99 |
Figure 2Application of the ICS approach to the diagnosis of the malignancy of adnexal masses.
(a) Picture-based representation by means of bar charts (without color indications) representing the intervals in which the variable effect is estimated to be constant. The bottom bar represents the predicted risk associated with the final score, obtained by summing all contributions of all variables. A software implementation is provided as Movie S2 and Movie S3. (b) Estimated link function, linking the score with the risk of a malignant tumor. (c) Calibration of the ICS model on the test set. For each possible value of the predicted risk (some values were taken together in order to obtain at least 10% of the patients in each group), the observed percentage of malignancies is calculated (dots). A 95% confidence interval on the percentage of the observed malignancies is illustrated by means of the vertical lines.
Figure 3Application of the ICS approach to the prediction of non-viable pregnancies.
(a) Picture-based representation by means of color bars, representing the intervals in which the variable effect is estimated to be constant. For each of the represented bars, the points corresponding to the value of the patient's covariates are obtained. The total score is obtained by summing all points. The color bar at the bottom represents the predicted risk associated with the final score. (b) Estimated link function, linking the score with the risk of a non-viable pregnancy at the end of the first trimester. (c) Calibration of the ICS model on the test set. For each possible value of the predicted risk (some values were taken together in order to obtain at least 10% of the patients in each group), the observed percentage of non-viable pregnancies is calculated (dots). A 95% confidence interval on the percentage of the observed non-viable pregnancies is illustrated by means of the vertical lines. Fhr: fetal heart rate.
Summary of the performance of previously built models and the ICS model for the prediction of malignancy of adnexal masses.
| Model | AUC (95% CI) | Sensitivity | Specificity | LR+ | LR− | DOR (95% CI) |
| RMI | 0.911 (0.880–0.935) | 67.5 | 94.6 | 12.60 | 0.343 | 37 (24–57) |
| LR1 | 0.956 (0.939–0.968) | 92.2 | 86.5 | 6.84 | 0.091 | 75 (46–125) |
| LS-SVM RBF | 0.954 (0.935–0.967) | 89.4 | 89.9 | 8.85 | 0.118 | 75 (47–120) |
| RVM RBF | 0.951 (0.933–0.965) | 90.6 | 87.7 | 7.39 | 0.107 | 69 (43–111) |
| ICS | 0.958 (0.943–0.969) | 93.3 | 85.7 | 6.53 | 0.078 | 84 (51–150) |
The measures are calculated on the test set of 997 patients from external centers. Except for the AUC, all measures were calculated using the cut-off mentioned in the original paper.
Summary of the test set performance of the ICS-based score system and two classical score systems (M1 and M2) for the prediction of malignancy of adnexal masses.
| Model | AUC (95% CI) | R2 adj | # intervals |
| ICS | 0.958 (0.943–0.969) | 0.72 | 30 |
| LR | 0.963 (0.947–0.974) | 0.73 | |
| M1 | 0.961 (0.944–0.971) | 0.71 | 57 |
| M2 | 0.933 (0.911–0.949) | 0.58 | 36 |
The classical score systems are based on a logistic regression model using the variables selected with ICS (LR). In a second step, the variables are manually divided into intervals. M1 uses a high number of intervals for continuous variables, M2 uses fewer intervals. The ICS approach is able to obtain good performance using a small number of intervals. The classical score systems are able to obtain good performance provided that a large number of intervals is considered.
Summary of the test set performance of the ICS-based score system and two classical score systems (M1 and M2) for the prediction of non-viability of pregnancies.
| Model | AUC (95% CI) | R2 adj | # intervals |
| ICS | 0.924 (0.897–0.942) | 0.62 | 17 |
| LR | 0.940 (0.916–0.957) | 0.69 | |
| M1 | 0.897 (0.872–0.922) | 0.65 | 31 |
| M2 | 0.788 (0.749–0.823) | 0.36 | 23 |
The classical score systems are based on a logistic regression model using the variables selected with ICS (LR). In a second step, the variables are manually divided into intervals. M1 uses a high number of intervals for continuous variables, M2 uses fewer intervals. The ICS approach is able to obtain good performance using a small number of intervals. The classical score systems are able to obtain good performance provided that a large number of intervals is considered.