| Literature DB >> 36160183 |
GholamReza Askari1,2, Mohammad Hossein Rouhani1,2, Mohammad Sattari3.
Abstract
The aim of this paper is to predict the patient hospitalization time with coronavirus disease 2019 (COVID-19). It uses various data mining techniques, such as random forest. Many rules were derived by applying these techniques to the dataset. The extracted rules mainly were related to people over 55 years old. The rule with the most support states that if the person is between 70 and 80 years old, has cardiovascular disease, and the gender is female; then, the person will be hospitalized for at least five days. The gradient boosting random forest technique has performed better than other techniques. As a limitation of the study, it can be pointed out that a few features were unavailable and had not been recorded. Patients with diabetes, chronic respiratory problems, and cardiovascular diseases have a relatively long hospitalization. So, the hospital manager should consider a suitable priority for these patients. Older people were also more likely to take part in the selection rules.Entities:
Year: 2022 PMID: 36160183 PMCID: PMC9507755 DOI: 10.1155/2022/6474883
Source DB: PubMed Journal: Int J Biomater ISSN: 1687-8787
The number of COVID-19 patients with each underlying disease.
| Underlying disease | The number of patients |
|---|---|
| Cancer | 184 |
| Cardiovascular disease | 1105 |
| Chronic liver disease | 73 |
| Chronic neurological disease | 287 |
| Chronic respiratory disease | 478 |
| Chronic kidney disease | 326 |
| Diabetes | 1094 |
| AIDS | 4 |
| Hypertension | 1098 |
| Obesity | 65 |
| Chronic blood disease | 68 |
| Other immunodeficiency diseases | 54 |
| Splenectomy | 4 |
Lists of the other attributes and their different values.
| Attributes | Type of values |
|---|---|
| Pregnancy | Yes-No |
|
| |
| Discharge | Outpatient |
| Hospitalization | |
|
| |
| Age | <18 |
| 18–55 | |
| 56–64 | |
| 65–69 | |
| 70–79 | |
| ≥80 | |
|
| |
| Gender | Female-male |
|
| |
| COVID-19 result | Negative |
| Positive | |
| Positive again | |
| Need for re-sampling | |
|
| |
| Length of hospital stay | <1 day |
| 1–3 days | |
| 4–5 days | |
| 6–8 days | |
| 9–10 days | |
| >10 day | |
The accuracy of different data mining techniques.
| Techniques | Accuracy (%) |
|---|---|
| ID3 | 72.28 |
| Random forest | 70.13 |
| Gradient boosting random forest | 73.51 |
Extracted selected rules regarding the prediction of the hospitalization time of COVID-19 patients.
| Extracted selected rules | Support | Confidence (%) |
|---|---|---|
| If the person is between 18 and 55 years old, has cancer, the gender is male, and the COVID-19 result is positive; then, the person will be hospitalized for between 1 and 3 days | 285 | 72 |
| If the person is between 70 and 80 years old, has a cardiovascular disease, and the gender is female; then, the person will be hospitalized for at least 5 days | 2570 | 76 |
| If the person is between 55 and 64 years old, has a chronic kidney disease, and the gender is male; then, the person will be hospitalized for between 1 and 5 days | 874 | 83 |
| If the person is between 18 and 54 years old and has a chronic liver disease; then, the person will be hospitalized for 4 to 5 days | 547 | 70 |
| If the person is over 65 years old, has a chronic neurological disease, and the gender is female; then, the person will be hospitalized for between 1 and 5 days | 475 | 75 |
| If the person is between 65 and 70 old, has a chronic respiratory disease, and the gender is male; then, the person will be hospitalized for between 1 and 8 days | 319 | 68 |
| If the person is between 18 and 54 years old and has diabetes; then, the person will be hospitalized for between 5 and 10 days | 727 | 81 |
| If the person is between 55 and 64 years old, has diabetes, and the gender is male; then, the person will be hospitalized for more than 8 days | 673 | 71 |
| If the person is a woman, over 80 years old, and has hypertension; then, the person will be hospitalized for less than five days | 823 | 69 |