| Literature DB >> 33250568 |
Zhaozhi Qian1, Ahmed M Alaa2, Mihaela van der Schaar1,2,3.
Abstract
The coronavirus disease 2019 (COVID-19) global pandemic poses the threat of overwhelming healthcare systems with unprecedented demands for intensive care resources. Managing these demands cannot be effectively conducted without a nationwide collective effort that relies on data to forecast hospital demands on the national, regional, hospital and individual levels. To this end, we developed the COVID-19 Capacity Planning and Analysis System (CPAS)-a machine learning-based system for hospital resource planning that we have successfully deployed at individual hospitals and across regions in the UK in coordination with NHS Digital. In this paper, we discuss the main challenges of deploying a machine learning-based decision support system at national scale, and explain how CPAS addresses these challenges by (1) defining the appropriate learning problem, (2) combining bottom-up and top-down analytical approaches, (3) using state-of-the-art machine learning algorithms, (4) integrating heterogeneous data sources, and (5) presenting the result with an interactive and transparent interface. CPAS is one of the first machine learning-based systems to be deployed in hospitals on a national scale to address the COVID-19 pandemic-we conclude the paper with a summary of the lessons learned from this experience.Entities:
Keywords: Automated machine learning; COVID-19; Compartmental models; Gaussian processes; Healthcare; Resource planning
Year: 2020 PMID: 33250568 PMCID: PMC7685302 DOI: 10.1007/s10994-020-05921-4
Source DB: PubMed Journal: Mach Learn ISSN: 0885-6125 Impact factor: 2.940
Fig. 1Illustration of how the different components in CPAS address the diverse needs of stakeholders on various levels. On the regional level, “hospital trusts” refers to the NHS foundation trusts, organizations that manage several hospitals in a region
Fig. 2Schematic depiction for the individualized risk predictor. A patient’s features are fed into multiple pipelines in parallel. Each pipeline estimates the hazard function at a different time step . The pipeline for is illustrated in more details in the figure. The pipeline configuration specifies the algorithms and the associated hyperparameters. The configurations are determined by AutoPrognosis using Eq. 9 and may vary across
The algorithms considered in each stage of the pipeline, which includes MICE (Buuren and Groothuis-Oudshoorn 2010), MissForest (Stekhoven and Bühlmann 2012), GAIN (Yoon et al. 2018), PCA, Fast ICA (Hyvarinen 1999), Recursive elimination (Guyon et al. 2002), Elastic net (Zou and Hastie 2005), Random forest (Liaw and Wiener 2002), Xgboost (Chen and Guestrin 2016), Multi-layer Perceptron (MLP) (Hinton 1990), Isotonic regression (De Leeuw 1977), Bootstrap (Chernick et al. 2011), Platt scaling (Platt et al. 1999)
| Imputation | Feature selection | Prediction | Calibration |
|---|---|---|---|
| Median | Elastic net | ||
| MICE | PCA | Random forest | Bootstrap |
| MissForest | Fast ICA | Platt scaling | |
| Recursive elimination | MLP |
Algorithms in bold are the most frequently selected in each stage
Fig. 3Pictorial illustration of HGPCP. Left to right: The upper-layer GP models the contact rate based on community mobility . The compartmental model gives the deterministic trajectory of the five compartments based on . The lower layer GP uses the hospitalized compartment as prior and predicts the hospital admission
Fig. 4Illustration of the CPAS datasets and the training set up. a CHESS and ICNARC data are joined and linked to HES to form the hospital patient data (18,101 cases) and the ICU patient data (10,868 cases). AutoPrognosis uses these two patient level datasets to train the various predictive pipelines in the individualized risk predictor. The aggregated hospital admission data together with the community mobility data empowers HGPCP to forecast the trend of admission. b The daily hospital admission, ICU admission, fatalities and discharges as recorded in the CPAS data set. c The prevalence of comorbidities and complications of hospitalized COVID-19 patients
Performance in forecasting individualized risk profile using different feature sets and algorithms measured by AUC-ROC
| Model | Feature | ICU admission | Mortality | Ventilation |
|---|---|---|---|---|
| AutoPrognosis | All features | |||
| AutoPrognosis | CHESS only | 0.781 ± 0.002 | 0.836 ± 0.002 | 0.754 ± 0.003 |
| AutoPrognosis | Demographics | 0.770 ± 0.002 | 0.799 ± 0.003 | 0.702 ± 0.003 |
| Cox PH Model | All features | 0.771 ± 0.002 | 0.773 ± 0.003 | 0.690 ± 0.003 |
| Charlson index | – | 0.556 ± 0.013 | 0.596 ± 0.002 | 0.530 ± 0.006 |
The results in bold are significantly better than the rest
Performance in forecasting hospital admission
| Mar. 23 before peak | Mar. 30 at peak | Apr. 23 after peak | |||||||
|---|---|---|---|---|---|---|---|---|---|
| CPAS | GP | CM | CPAS | GP | CM | CPAS | GP | CM | |
| STH | 5.32 | 7.39 | 13.79 | 14.72 | 6.45 | 6.71 | |||
| SGH | 1.60 | 2.46 | 11.11 | 16.33 | 4.05 | 5.58 | |||
| NPH | 5.62 | 8.62 | 5.37 | 4.00 | 1.40 | 2.15 | |||
| KCH | 5.03 | 4.68 | 3.59 | 7.64 | 3.21 | 3.91 | |||
| RLH | 4.88 | 7.43 | 5.29 | 8.86 | 1.39 | 1.36 | |||
| National | 43.51 | 63.25 | 120.59 | 324.59 | 39.35 | 123.57 | |||
The candidate models are CPAS (HGPCP), GP (zero-mean GP) and CM (compartmental models). The first five rows refer to the performance in the five hospitals with most admitted patients. The last row refers to the national total admission. The lowest error for each task is bolded
Fig. 5The configuration interface of CPAS. The user enters the desired level of resolution and the region of interest. The user then inputs the assumed trend for future community mobility. The empirical feature distribution in the region of interest is displayed below for reference
Fig. 6The output interface of CPAS. CPAS displays the projected ICU demand with confidence intervals on the top. It then shows the intermediate prediction that leads to the projections. On the bottom-left, it shows the output of the aggregated trend forecaster. On the bottom-right, it shows the average risk profile for various outcomes given by the individualized risk predictor