| Literature DB >> 33102426 |
Yun Li1,2, Melanie Alfonzo Horowitz3, Jiakang Liu4, Aaron Chew5, Hai Lan2,6, Qian Liu1,2, Dexuan Sha1,2, Chaowei Yang1,2.
Abstract
The global covid-19 pandemic puts great pressure on medical resources worldwide and leads healthcare professionals to question which individuals are in imminent need of care. With appropriate data of each patient, hospitals can heuristically predict whether or not a patient requires immediate care. We adopted a deep learning model to predict fatality of individuals tested positive given the patient's underlying health conditions, age, sex, and other factors. As the allocation of resources toward a vulnerable patient could mean the difference between life and death, a fatality prediction model serves as a valuable tool to healthcare workers in prioritizing resources and hospital space. The models adopted were evaluated and refined using the metrics of accuracy, specificity, and sensitivity. After data preprocessing and training, our model is able to predict whether a covid-19 confirmed patient is likely to be dead or not, given their information and disposition. The metrics between the different models are compared. Results indicate that the deep learning model outperforms other machine learning models to solve this rare event prediction problem.Entities:
Keywords: COVID-19; deep learning; fatality prediction; machine learning; pandemic; rare event
Mesh:
Year: 2020 PMID: 33102426 PMCID: PMC7556112 DOI: 10.3389/fpubh.2020.587937
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Comparison of the attributes included in the two filtered datasets.
| ID | ID issued to each deindividualized case in the dataset | ✓ | ✓ | High fever | 0—individual did not have a high fever (>39*C) | ✓ | |
| Age range | Age range individual's age falls into during time of positive COVID-19 test | ✓ | ✓ | Kidney S | 0—individual did not display kidney related symptoms | ✓ | |
| Gender | Reported gender of individual | ✓ | ✓ | Asymptomatic | 0—individual is displaying symptoms | ✓ | |
| Latitude | Latitude where case is reported | ✓ | Diabetes | 0—individual does not have diabetes | ✓ | ||
| Longitude | Longitude where case is reported | ✓ | Neuro | 0—individual does not have neurological chronic disease. | ✓ | ||
| Symptoms | 0—individual displayed no signs of symptoms | ✓ | No chronic Disease | 0—individual has chronic disease history | ✓ | ||
| Chronic Disease | 0—individual had no reported chronic disease history | ✓ | ✓ | Hypertension | 0—individual does not have hypertension | ✓ | |
| Outcome | 0—alive | ✓ | ✓ | Cancer | 0—individual does not have cancer | ✓ | |
| Respiratory S | 0—individual did not display respiratory symptoms | ✓ | Orthopedic CD | 0—individual does not have orthopedic related chronic disease | ✓ | ||
| weakness/pain | 0—individual had no weakness or pain | ✓ | Respiratory related CD | 0—individual does not have respiratory related chronic disease | ✓ | ||
| Low fever | 0—individual did not have a low fever (<39*C) | ✓ | Cardiac related CD | 0—individual does not have cardiac related chronic disease | ✓ | ||
| Gastrointestinal S | 0—individual did not display gastrointestinal symptoms | ✓ | Kidney related CD | 0—individual does not have kidney related chronic disease | ✓ | ||
| Other symptoms | 0—individual did not display other | ✓ | Blood related CD | 0—individual does not have blood related chronic disease | ✓ | ||
| Nausea | 0—individual did not experience nausea | ✓ | Prostate related CD | 0—individual does not have prostate related chronic disease | ✓ | ||
| Cardiac S | 0—individual did not display cardiac related symptoms | ✓ | Thyroid related CD | 0—individual does not have thyroid related chronic disease | ✓ |
Figure 1The network structure of an autoencoder.
Figure 2The workflow of fatality prediction.
Figure 3A comparison of the different models using the GitHub data.
Figure 4A correlation matrix of the variables used in the analysis of the Wolfram database.
Figure 5(A) Is used to gather a threshold for the autoencoder model. (B) The reconstruction error at the chosen threshold of 2.5 on validation dataset.
Figure 6A comparison of the resulting metrics of 7 models used in the Wolfram dataset.