| Literature DB >> 34179769 |
Brett Snider1, Bhumi Patel1, Edward McBean1.
Abstract
The worldwide rapid spread of the severe acute respiratory syndrome coronavirus 2 has affected millions of individuals and caused unprecedented medical challenges by putting healthcare services under high pressure. Given the global increase in number of cases and mortalities due to the current COVID-19 pandemic, it is critical to identify predictive features that assist identification of individuals most at-risk of COVID-19 mortality and thus, enable planning for effective usage of medical resources. The impact of individual variables in an XGBoost artificial intelligence model, applied to a dataset containing 57,390 individual COVID-19 cases and 2,822 patient deaths in Ontario, is explored with the use of SHapley Additive exPlanations values. The most important variables were found to be: age, date of the positive test, sex, income, dementia plus many more that were considered. The utility of SHapley Additive exPlanations dependency graphs is used to provide greater interpretation of the black-box XGBoost mortality prediction model, allowing focus on the non-linear relationships to improve insights. A "Test-date Dependency" plot indicates mortality risk dropped substantially over time, as likely a result of the improved treatment being developed within the medical system. As well, the findings indicate that people of lower income and people from more ethnically diverse communities, face an increased mortality risk due to COVID-19 within Ontario. These findings will help guide clinical decision-making for patients with COVID-19.Entities:
Keywords: COVID-19; SHAP (shapley additive explanation); XGBoost (extreme gradient boosting); artificial intelligence; co-morbidity; mortality
Year: 2021 PMID: 34179769 PMCID: PMC8222676 DOI: 10.3389/frai.2021.684609
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Characteristics of 57,390 Ontario residents with COVID-19.
| Variable | Description | Range of values |
|---|---|---|
| Age | Age in years, as of Jan 1, 2020 | 0–105 |
| Test date | Test date | Feb–Oct 2020 |
| Sex | Indicator variable for sex | 26,861 (M = 1, F = 0) |
| Hypertension | Chronic hypertension, as of Jan 1, 2020 | 15,778 (0,1) |
| LTC resident | LTC resident, as of Jan 1, 2020 | 5,179 (0,1) |
| Chronic_dementia | Chronic dementia diagnosed, as of Jan 1, 2020 | 4,746 (0,1) |
| Chronic_odd | Chronic diabetes diagnosed as of Jan 1, 2020 | 9,002 (0,1) |
| Ethnic concentration quint. | Calculated from Ontario marginalization index, based on census designation. Refers to visible minorities and/or recent immigrants | (0–5) |
| Commuter concentration quint | % Of people that commute within census designated area - converted to quintiles | (0–5) |
| Median income quint. | Median income within census-designated area - converted to quintiles | (0–5) |
| Charl | Charlson co-morbidity index. Only 2,059 patients with charl above 0. | (0–10) |
| Household size quint. | Avg. Household size within census-designated area - converted to quintiles (5 being the highest, 0 = missing DA info). | (0–5) |
| CKD | Chronic kidney disease. | 2,523 (0,1) |
| Cancer | Cancer index | 2,995 (0–1) |
| Chronic_copd | Chronic obstructive pulmonary disease | 4,030 (0–1) |
| Chronic_asthma | Asthma | 9,100 (0–1) |
| Chronic_chf | Congestive heart failure | 2,257 (0–1) |
| Stroke | If patient suffered a stroke previous to Jan 1, 2020 | 1,016 (0–1) |
| Cardiac ISCH | Cardiac ischemia | 1,916 (0–1) |
| Rural | Indicator if a patient lives in a rural residence | 1,746 (0–1) |
| Chronic_ra | Rheumatoid arthritis | 567 (0–1) |
| Tia | Transient Ischemic Attack | 722 (0–1) |
| immuno_comp | Immuno-compromised | 237 (0–1) |
| Thala | History of Thalassemia | 36 (0–1) |
| Cases recovered | 54,568 | |
| Cases died | 2,822 |
(0 referring to missing information).
FIGURE 1SHAP summary plot for XGBoost model.
FIGURE 2SHAP plot for Age.
FIGURE 3SHAP plot for Test Date.
FIGURE 4SHAP box-plot for median income.
FIGURE 5SHAP box-plot for ethnic concentration.