| Literature DB >> 31304389 |
David Chen1, Sijia Liu1, Paul Kingsbury1, Sunghwan Sohn1, Curtis B Storlie2, Elizabeth B Habermann2, James M Naessens2, David W Larson3, Hongfang Liu1.
Abstract
In recent years, there is increasing enthusiasm in the healthcare research community for artificial intelligence to provide big data analytics and augment decision making. One of the prime reasons for this is the enormous impact of deep learning for utilization of complex healthcare big data. Although deep learning is a powerful analytic tool for the complex data contained in electronic health records (EHRs), there are also limitations which can make the choice of deep learning inferior in some healthcare applications. In this paper, we give a brief overview of the limitations of deep learning illustrated through case studies done over the years aiming to promote the consideration of alternative analytic strategies for healthcare.Entities:
Keywords: Diagnosis; Prognosis; Signs and symptoms; Translational research
Year: 2019 PMID: 31304389 PMCID: PMC6550223 DOI: 10.1038/s41746-019-0122-0
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Measures of dataset size, number of variables, and percentage of missing values
| Dataset | # of cases | # of variables | Median # of time points | Median % Missing/variable | Max % Missing/variable | ||
|---|---|---|---|---|---|---|---|
| Time-varying | Tabular | Time-varying | Tabular | ||||
| CRS-PSC | 13399 | 117 | 71 | 21.3 | 99.9 | 100 | 49.6 |
| A-DR* | 4013 | 51 | 83 | 0 | 0 | 0 | 0 |
| CLL-TFT | 737 | 31 | 6 | 0 | 0 | 89.6 | 29.9 |
| ICU-M | 4000 | 41 | 24 | 21.6 | 24.0 | 95.7 | 94.9 |
| Opioid‡ | 142377 | 836 | 32 | 0 | 0 | 0 | 0 |
Percentage of missing values are split between time-varying data and the condensed tabular data
*Features from clinical notes, no mentions considered negative result
‡No codes considered negative result
Predictive power of each model and their associated training time
| App | Model | AUROC | Training time (s) |
|---|---|---|---|
| CRS-PSC | LR | 0.735 ± 0.004 | 5.3 |
| BN | 0.765 ± 0.004 | 48.6 | |
| SVM | 0.781 ± 0.003 | 33.1 | |
| RF | 0.812 ± 0.002 | 9.8 | |
| GBM | 13.8 | ||
| MLP | 0.795 ± 0.003 | 219 | |
| LSTM | 0.805 ± 0.004 | 703 | |
| A-DRR | LR | 0.875 ± 0.044 | 5.3 |
| SVM | 0.884 ± 0.011 | 23.1 | |
| RF | 10.8 | ||
| GBM | 0.945 ± 0.011 | 23.8 | |
| LSTM | 0.845 ± 0.034 | 301.4 | |
| CLL-TFT | LR | 0.795 ± 0.038 | 2.3 |
| SVM | 0.843 ± 0.036 | 5.7 | |
| RF | 3.3 | ||
| GBM | 0.817 ± 0.009 | 5.4 | |
| LSTM | 0.805 ± 0.017 | 46.5 | |
| ICU-M | LR | 0.476 ± 0.074 | 4.3 |
| SVM | 0.556 ± 0.051 | 12.7 | |
| RF | 0.618 ± 0.048 | 9.8 | |
| GBM | 16.4 | ||
| LSTM | 0.646 ± 0.043 | 243.5 | |
| Opioid | LR | 0.907 ± 0.002 | NA |
| SVM | 0.904 ± 0.002 | NA | |
| RF | 0.875 ± 0.003 | NA | |
| MLP | 0.909 ± 0.002 | NA | |
| LSTM | NA |
Bolded values indicate best achieved metric for each project. Training times were not measured during the Opioid project
Predictive power of each model based on percentage of data used to train
| % of Data (N) | LR | BN | SVM | RF | GBM | MLP |
|---|---|---|---|---|---|---|
| 10 (1340) | 0.728 | 0.748 | 0.731 | 0.776 | 0.804 | 0.742 |
| 20 (2680) | 0.727 | 0.749 | 0.758 | 0.797 | 0.811 | 0.762 |
| 40 (5360) | 0.730 | 0.753 | 0.773 | 0.798 | 0.809 | 0.778 |
| 60 (8039) | 0.732 | 0.759 | 0.772 | 0.801 | 0.812 | 0.786 |
| 80 (10,719) | 0.731 | 0.764 | 0.779 | 0.808 | 0.818 | 0.791 |
| 100 (13,399) | 0.735 | 0.765 | 0.781 | 0.812 | 0.822 | 0.795 |