| Literature DB >> 32545399 |
Adrian Xi Lin1, Andrew Fu Wah Ho2,3,4, Kang Hao Cheong5,6, Zengxiang Li7, Wentong Cai8, Marcel Lucas Chee9, Yih Yng Ng10,11, Xiaokui Xiao12, Marcus Eng Hock Ong13,14.
Abstract
The accurate prediction of ambulance demand provides great value to emergency service providers and people living within a city. It supports the rational and dynamic allocation of ambulances and hospital staffing, and ensures patients have timely access to such resources. However, this task has been challenging due to complex multi-nature dependencies and nonlinear dynamics within ambulance demand, such as spatial characteristics involving the region of the city at which the demand is estimated, short and long-term historical demands, as well as the demographics of a region. Machine learning techniques are thus useful to quantify these characteristics of ambulance demand. However, there is generally a lack of studies that use machine learning tools for a comprehensive modeling of the important demand dependencies to predict ambulance demands. In this paper, an original and novel approach that leverages machine learning tools and extraction of features based on the multi-nature insights of ambulance demands is proposed. We experimentally evaluate the performance of next-day demand prediction across several state-of-the-art machine learning techniques and ambulance demand prediction methods, using real-world ambulatory and demographical datasets obtained from Singapore. We also provide an analysis of this ambulatory dataset and demonstrate the accuracy in modeling dependencies of different natures using various machine learning techniques.Entities:
Keywords: ambulance deployment; complexity science; demand prediction; emergency medical services; emergency medicine; geospatial; health informatics; nonlinear dynamics
Mesh:
Year: 2020 PMID: 32545399 PMCID: PMC7312953 DOI: 10.3390/ijerph17114179
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Approach overview schematic.
Figure 2Sliding window for extraction of demand features.
Characteristics of ambulance demand dataset.
| Characteristics | Value |
|---|---|
| Incident Year | |
| 2006–2007 | 190,608 (13.6%) |
| 2008–2009 | 216,841 (15.5%) |
| 2010–2011 | 237,451 (17.0%) |
| 2012–2013 | 268,596 (19.2%) |
| 2014–2016 | 311,251 (22.3%) |
| 2016 | 172,009 (12.3%) |
| Patient Age (yrs) | 55 (34–73) |
| 2006–2007 | 51 (32–71) |
| 2008–2009 | 52 (32–71) |
| 2010–2011 | 53 (33–72) |
| 2012–2013 | 56 (35–73) |
| 2014–2016 | 57 (36–74) |
| 2016 | 58 (36–75) |
| Incident Classification | |
| Medical | 968,375 (69.3%) |
| Trauma | 391,986 (28.1%) |
| Assistance Not Required | 35,460 (2.54%) |
| Patient Incident Subclass | |
| Nervous System | 381,634 (27.3%) |
| No Medical Complaint/Un-Injured | 385,430 (27.6%) |
| Bone/Connective Tissue | 116,173 (8.32%) |
| Alcoholic Intoxication | 25,865 (1.85%) |
| Respiratory System | 132.163 (9.46%) |
| Reproductive System | 117,21 (0.839%) |
| Cardiovascular System | 115,587 (8.28%) |
| Digestive System | 98,129 (7.03%) |
| Poisoning/Drug Overdose | 6791 (0.486%) |
| Ear/Nose/Throat/Eye Condition | 5601 (0.401%) |
| Kidney/Urinary System | 16,433 (1.18%) |
| Blood Related | 5590 (0.400%) |
| Maternity/Childbirth | 5062 (0.362 %) |
| Liver/Biliary Tract | 1438 (0.103%) |
| Psychiatric Emergencies | 4413 (0.316%) |
| Endocrine System | 31,850 (2.28%) |
| Infectious Disease/Disorder of Skin | 4649 (0.333%) |
| Others | 35,240 (2.52%) |
| Unknown | 9474 (0.678%) |
| Unclassified | 2705 (0.194%) |
| Gender | |
| Male | 838,737 (60.0%) |
| Female | 554,237 (39.7%) |
| Unclassified | 2163 (0.213%) |
For continuous variables, data is presented in medians and interquartile ranges. For categorical variables, data is presented in frequencies and percentages.
Characteristics of engineered dataset.
| Characteristics | Value |
|---|---|
| Daily Regional Demand | 6.33 (0–10) |
| Total Regional Demands over Past 7 Days | 44. (4–69) |
| Total Regional Demands over Past 30 Days | 190 (20–294) |
Data is presented in means and interquartile ranges.
Figure 3Map of regional variance of daily demand in Singapore from 2006 to 2016.
Method accuracy comparisons
| Method | WAPE (%) | MAE | MSE |
|---|---|---|---|
| Regional Moving Average | 25.8 | 2.20 | 11.2 |
| Linear Regression |
|
|
|
| MLP | 24.6 | 2.10 |
|
| RBFN | 25.1 | 2.14 | 10.8 |
| SVR | 25.2 | 2.15 | 11.2 |
| LightGBM | 24.5 | 2.09 | 10.2 |
Bold indicates the best results for each column. WAPE: weighted absolute percentage error; MAE: mean absolute error; MSE: mean squared error; MLP: multilayer perceptron; RBFN: Radial Basis Function network; SVR: Support Vector Regression; LightGBM: Light Gradient Boosting Machine.
Feature importance.
| Feature | Gain-Based Importance | Mean Absolute SHAP Value |
|---|---|---|
| Region ID | 14,121,071 | 0.230 |
| Day of Week | 844,283 | 0.069 |
| Day of Month | 2,400,462 | 0.043 |
| Month of Year | 723,054 | 0.031 |
| Demand 1 Day Ago | 308,590 | 0.022 |
| Demand 2 Days Ago | 138,543 | 0.011 |
| Demand 3 Days Ago | 159,649 | 0.012 |
| Demand 4 Days Ago | 209,771 | 0.014 |
| Demand 5 Days Ago | 144,138 | 0.015 |
| Demand 6 Days Ago | 146,966 | 0.009 |
| Demand 7 Days Ago | 432,136 | 0.022 |
| Total Demand of the Week up to the Data Sample Day | 1,368,848 | 0.101 |
| Total Demand of the Month up to the Data Sample Day | 82,847 | 0.005 |
| Total Demand over Past 30 Days | 820,758,893 | 4.626 |
| Total Demand over Past 7 Days | 77,034,386 | 0.466 |
| Total Number of People Aged 50 and Above in the Year | 2,528,021 | 0.223 |
SHAP: SHapley Additive exPlanations; ID: identifier.
Accuracy comparisons on inclusion/exclusion of regional socioeconomic features.
| Dataset | WAPE (%) | MAE | MSE |
|---|---|---|---|
| SCDF-Engineered-Socio | 22.0 | 3.00 | 16.3 |
| SCDF-Engineered-Socio, excluding regional socioeconomic features | 22.0 | 3.00 | 16.3 |
WAPE: weighted absolute percentage error; MAE: mean absolute error; MSE: mean squared error; SCDF: Singapore Civil Defence Force.