| Literature DB >> 33937750 |
Fiona Leonard1, John Gilligan2, Michael J Barrett3,4.
Abstract
Introduction: Patients boarding in the Emergency Department can contribute to overcrowding, leading to longer waiting times and patients leaving without being seen or completing their treatment. The early identification of potential admissions could act as an additional decision support tool to alert clinicians that a patient needs to be reviewed for admission and would also be of benefit to bed managers in advance bed planning for the patient. We aim to create a low-dimensional model predicting admissions early from the paediatric Emergency Department. Methods and Analysis: The methodology Cross Industry Standard Process for Data Mining (CRISP-DM) will be followed. The dataset will comprise of 2 years of data, ~76,000 records. Potential predictors were identified from previous research, comprising of demographics, registration details, triage assessment, hospital usage and past medical history. Fifteen models will be developed comprised of 3 machine learning algorithms (Logistic regression, naïve Bayes and gradient boosting machine) and 5 sampling methods, 4 of which are aimed at addressing class imbalance (undersampling, oversampling, and synthetic oversampling techniques). The variables of importance will then be identified from the optimal model (selected based on the highest Area under the curve) and used to develop an additional low-dimensional model for deployment. Discussion: A low-dimensional model comprised of routinely collected data, captured up to post triage assessment would benefit many hospitals without data rich platforms for the development of models with a high number of predictors. Novel to the planned study is the use of data from the Republic of Ireland and the application of sampling techniques aimed at improving model performance impacted by an imbalance between admissions and discharges in the outcome variable.Entities:
Keywords: admission; emergency department; machine learning; paediatric; prediction; protocol
Year: 2021 PMID: 33937750 PMCID: PMC8085432 DOI: 10.3389/fdata.2021.643558
Source DB: PubMed Journal: Front Big Data ISSN: 2624-909X
Top 15 variables considered for inclusion across 26 studies.
| Araz et al. ( | Adult | ++ | o | ++ | o | ++ | o | o | ||||||||
| Barak-Corren et al. ( | Mixed | + | o | + | + | + | o | o | + | + | o | + | o | |||
| Barak-Corren et al. ( | Paediatric | + | o | + | + | + | + | + | + | + | + | o | ||||
| Cameron et al. ( | Adult | ++ | o | ++ | ++ | o | o | o | ++ | ++ | ||||||
| Considine et al. ( | Adult | ++ | o | o | + | + | ++ | |||||||||
| Dinh et al. ( | Adult | ++ | o | ++ | ++ | ++ | ++ | o | ++ | + | + | |||||
| Golmohammadi ( | Mixed | ++ | ++ | ++ | ++ | ++ | o | ++ | ||||||||
| Gorelick et al. ( | Paediatric | o | o | o | ++ | o | + | |||||||||
| Goto et al. ( | Paediatric | ++ | o | ++ | ++ | ++ | o | o | o | |||||||
| Goto et al. ( | Adult | ++ | o | o | ++ | ++ | ++ | |||||||||
| Graham et al. ( | Mixed | ++ | + | ++ | ++ | + | + | ++ | ||||||||
| Hong et al. ( | Not stated | ++ | ++ | ++ | o | + | o | o | + | ++ | ++ | o | ++ | o | o | |
| Kim et al. ( | Adult | ++ | + | ++ | ++ | ++ | ++ | ++ | ++ | ++ | ||||||
| Kraaijvanger et al. ( | Mixed | ++ | o | ++ | ++ | ++ | o | + | + | ++ | + | |||||
| LaMantia et al. ( | Adult | ++ | o | ++ | ++ | ++ | o | o | ||||||||
| Leegon et al. ( | Adult | o | o | o | o | o | o | o | ||||||||
| Leegon et al. ( | Paediatric | o | o | o | o | o | o | o | ||||||||
| Li et al. ( | Not stated | o | o | o | ++ | o | ||||||||||
| Lucke et al. ( | Adult | + | + | ++ | ++ | ++ | ++ | o | + | |||||||
| Marlais et al. ( | Paediatric | ++ | o | ++ | o | |||||||||||
| Parker et al. ( | Adult | ++ | + | ++ | + | ++ | + | + | + | + | ||||||
| Patel et al. ( | Paediatric | ++ | o | ++ | ++ | o | ||||||||||
| Peck et al. ( | Adult | ++ | + | ++ | ++ | |||||||||||
| Peck et al. ( | Adult | ++ | ++ | ++ | ++ | |||||||||||
| Rendell et al. ( | Adult | ++ | o | ++ | ++ | ++ | o | o | o | ++ | o | o | ||||
| Sun et al. ( | Mixed | ++ | + | ++ | ++ | + | + | + | + |
o considered or added to prediction model.
+ reported as significant in bivariant tests, if carried out (p < 0.001).
++ highlighted in respective study as the top variables of importance or strongest predictors in final models.
Most commonly used model evaluation methods and AUC results across the 26 studies.
| Araz et al. ( | X | X | X | X | 0.77, 0.79, 0.81, 0.83, 0.84, 0.86 | |||||
| Barak-Corren et al. ( | X | X | X | X | X | X | X | 0.79, 0.87, 0.91 | ||
| Barak-Corren et al. ( | X | X | X | X | X | X | X | X | 0.82, 0.83, 0.86, 0.96, 0.97 | |
| Cameron et al. ( | X | X | X | X | X | X | 0.88 | |||
| Considine et al. ( | X | |||||||||
| Dinh et al. ( | X | X | X | X | X | X | X | 0.82 | ||
| Golmohammadi ( | X | X | X | X | ||||||
| Gorelick et al. ( | X | X | X | X | X | X | 0.92 | |||
| Goto et al. ( | X | X | X | X | X | X | X | X | 0.78, 0.80 | |
| Goto et al. ( | X | X | X | X | X | 0.82, 0.83 | ||||
| Graham et al. ( | X | X | X | X | X | 0.82, 0.85, 0.86 | ||||
| Hong et al. ( | X | X | X | X | X | 0.86, 0.87, 0.91, 0.92 | ||||
| Kim et al. ( | X | X | X | 0.68, 0.75, 0.77, 0.80, 0.82, 0.84 | ||||||
| Kraaijvanger et al. ( | X | X | X | X | X | X | 0.76, 0.84, 0.87 | |||
| LaMantia et al. ( | X | X | 0.73 | |||||||
| Leegon et al. ( | X | X | X | X | X | 0.89 | ||||
| Leegon et al. ( | X | X | X | X | X | 0.90, 0.91 | ||||
| Li et al. ( | X | X | X | |||||||
| Lucke et al. ( | X | X | X | X | X | X | X | X | 0.77, 0.86 | |
| Marlais et al. ( | X | X | X | X | X | X | 0.81 | |||
| Parker et al. ( | X | X | X | X | X | X | 0.83 | |||
| Patel et al. ( | X | X | X | 0.72, 0.82, 0.83, 0.84 | ||||||
| Peck et al. ( | X | X | 0.84, 0.89 | |||||||
| Peck et al. ( | X | X | 0.80, 0.82, 0.86, 0.89 | |||||||
| Rendell et al. ( | X | X | 0.82, 0.83 (highest AUC out of 2 sets of models) | |||||||
| Sun et al. ( | X | X | X | X | X | X | 0.85 | |||
| Total | 23 | 19 | 19 | 18 | 15 | 14 | 9 | 4 | 4 |
AUC, Area under the curve; PPV, Positive predictive value; NPV, Negative predictive value; PLR, Positive likelihood ratio; NLR, Negative likelihood ratio.
Figure 1Design of experiment to identify the model with the highest Area Under the Curve (AUC) from 15 models which will be used to obtain the variables of importance for the creation of a low-dimensional model. The reference training set will have no additional sampling technique applied.