| Literature DB >> 35345728 |
Richard W Issitt1, Mario Cortina-Borja2, William Bryant1, Stuart Bowyer1, Andrew M Taylor1, Neil Sebire1.
Abstract
Machine learning encompasses statistical approaches such as logistic regression (LR) through to more computationally complex models such as neural networks (NN). The aim of this study is to review current published evidence for performance from studies directly comparing logistic regression, and neural network classification approaches in medicine. A literature review was carried out to identify primary research studies which provided information regarding comparative area under the curve (AUC) values for the overall performance of both LR and NN for a defined clinical healthcare-related problem. Following an initial search, articles were reviewed to remove those that did not meet the criteria and performance metrics were extracted from the included articles. Teh initial search revealed 114 articles; 21 studies were included in the study. In 13/21 (62%) of cases, NN had a greater AUC compared to LR, but in most the difference was small and unlikely to be of clinical significance; (unweighted mean difference in AUC 0.03 (95% CI 0-0.06) in favour of NN versus LR. In the majority of cases examined across a range of clinical settings, LR models provide reasonable performance that is only marginally improved using more complex methods such as NN. In many circumstances, the use of a relatively simple LR model is likely to be adequate for real-world needs but in specific circumstances in which large amounts of data are available, and where even small increases in performance would provide significant management value, the application of advanced analytic tools such as NNs may be indicated.Entities:
Keywords: clinical informatics; electronic health records; logistic regression; machine learning; neural network; performance
Year: 2022 PMID: 35345728 PMCID: PMC8942139 DOI: 10.7759/cureus.22443
Source DB: PubMed Journal: Cureus ISSN: 2168-8184
Figure 1Flow diagram illustrating search strategy and outcome
Summary of studies
Studies directly comparing the performance of logistic regression (LR) and neural network (NN) machine learning models in clinical medicine, in terms of area under the receiver operating curve (AUC) using identical datasets for specific clinical classification scenarios. (ICU=intensive care unit, SAH=subarachnoid haemorrhage, ED=emergency department)
| Author | Clinical area | n | AUC LR | AUC NN |
| Ing et al. [ | Giant cell arteritis diagnosis | 1,201 | 0.87 | 0.86 |
| Kuang et al. [ | Alzheimer's disease progression | 361 | 0.81 | 0.9 |
| Owari & Miyatake [ | Lower back pain progression | 96 | 0.72 | 0.77 |
| Parsaeian et al. [ | Low back pain outcome | 17,294 | 0.75 | 0.75 |
| Abouzari et al. [ | Subdural haematoma outcome | 300 | 0.59 | 0.77 |
| Tang et al. [ | Cardiovascular autonomic dysfunction | 2.092 | 0.76 | 0.76 |
| Hsieh et al. [ | Pancreatic cancer diagnosis | 1,358,634 | 0.73 | 0.61 |
| McLaren et al. [ | Malignant breast lesion diagnosis | 71 | 0.8 | 0.82 |
| Lin et al. [ | ICU mortality | 1,496 | 0.72 | 0.75 |
| Sakai et al. [ | Appendicitis outcome | 169 | 0.72 | 0.74 |
| Erol et al. [ | Head injury outcome | 46 | 0.9 | 0.93 |
| Bassi et al. [ | Survival post cystectomy | 369 | 0.76 | 0.76 |
| Dumont et al. [ | Outcome post SAH | 91 | 0.93 | 0.96 |
| Doig et al. [ | ICU mortality | 422 | 0.82 | 0.82 |
| Botha et al. [ | Structural vascular disease diagnosis | 171 | 0.71 | 0.71 |
| Borzouei et al. [ | Thyroid disease diagnosis | 310 | 0.95 | 0.97 |
| Yao et al. [ | Diabetic retinopathy diagnosis | 530 | 0.77 | 0.84 |
| Chen et al. [ | Hip fracture outcome | 10.534 | 0.88 | 0.93 |
| Lin et al. [ | Adipose tissue volume | 5,772 | 0.77 | 0.9 |
| Tong et al. [ | Pancreatic cancer outcome | 221 | 0.85 | 0.92 |
| Sutradhar et al. [ | Cancer ED visits | 42,523 | 0.67 | 0.67 |
Figure 2Scatter plot for studies in Table 1
Scatter plot of the area under the receiver operating curve (ROC AUC) results for logistic regression (LR) and neural network (NN) classification models for studies presented in Table 1. The line indicates equal performance, with superior NN performance represented for points above the line and superior LR performance for those below the line. Point size represents study sample size (logarithmic scale). Overall, in the majority of studies, NN performance is similar or superior but the clinical significance remains uncertain.