| Literature DB >> 34189026 |
José A Guzmán-Torres1, Elia M Alonso-Guzmán1, Francisco J Domínguez-Mota1,2, Gerardo Tinoco-Guerrero2.
Abstract
Nowadays, society faces a catastrophic problem related to respiratory syndrome due to the coronavirus SARS-CoV-2: the Covid-19 disease. This virus has changed our coexistence rules and, in consequence, has reshaped the daily activities in modern societies. Thus, there are many efforts to understand the virus behaviour in order to reduce its negative impact, and these efforts produce an incredible amount of information and data sources every week. Data scientists, which use techniques such as Machine learning, are focusing their abilities to develop mathematical models for analysing this critical situation. This paper uses Machine Learning techniques as tools to help understand some specific new patterns in Covid patients that arise from unknown complex interactions in the transmission-dynamic models of the SARS-CoV-2 virus, and their relation with the corresponding social contact patterns which are often known or can be inferred from populations variables. One of the main motivations of this research is to find the diseases that cause an increase in the risk of death in infected people with the Covid-19 virus. Mexico is the case of study in this research. The general conditions of health that cause death are well known generally in the world. However, these conditions in each country can differ depending on different factors such as the general health status of people. The results show that the principal causes of death in Mexico are related to age, bad eating habits, chronic diseases, and contact with infected people having not proper care. Results from the analysis show a remarkable accuracy of 87%, which is considered satisfactory.Entities:
Keywords: Covid-19; Data science; Diseases; Machine learning
Year: 2021 PMID: 34189026 PMCID: PMC8223079 DOI: 10.1016/j.rinp.2021.104483
Source DB: PubMed Journal: Results Phys ISSN: 2211-3797 Impact factor: 4.476
Fig. 1Flow chart - Backward diagram process.
Dummy variables generated as a result of the one-hot encoding.
| Disease number | Disease | Disease number | Disease |
|---|---|---|---|
| 1 | AGE | 17 | INMUSUPR-IGNORED |
| 2 | RESULT-POSITIVE Covid-19 | 18 | INMUSUPR-YES |
| 3 | RESULT-PENDING RESULT | 19 | HYPERTENSION-IGNORED |
| 4 | SEX-WOMAN | 20 | HYPERTENSION-YES |
| 5 | KIND OF PATIENT-HOSPITALIZED | 21 | ANOTHER COM-IGNORED |
| 6 | PNEUMONIA-NOT SPECIFIED | 22 | ANOTHER COM-YES |
| 7 | PNEUMONIA-YES | 23 | CARDIOVASCULAR-IGNORED |
| 8 | PREGNANCY-NOT APPLIED | 24 | CARDIOVASCULAR-YES |
| 9 | PREGNANCY-IGNORED | 25 | OBESITY-IGNORED |
| 10 | PREGNANCY-YES | 26 | OBESITY-YES |
| 11 | DIABETES-IGNORED | 27 | RENAL CHRONIC-IGNORED |
| 12 | DIABETES-YES | 28 | RENAL CHRONIC-YES |
| 13 | COPD-IGNORED | 29 | TABACISM-IGNORED |
| 14 | COPD-YES | 30 | TABACISM-YES |
| 15 | ASTHMA-IGNORED | 31 | ANOTHER CASE-NOT SPECIFIED |
| 16 | ASTHMA-YES | 32 | ANOTHER CASE-YES |
Top ten conditions causing mortality in Covid patients in Mexico.
| Disease number | Disease |
|---|---|
| - | KIND OF PATIENT-HOSPITALIZED |
| 1 | AGE |
| 2 | ANOTHER CASE-NOT SPECIFIED |
| 3 | DIABETES-YES |
| 4 | PNEUMONIA-YES |
| 5 | ANOTHER CASE-YES |
| 6 | RESULT-POSITIVE Covid-19 |
| 7 | CARDIOVASCULAR-YES |
| 8 | HYPERTENSION-YES |
| 9 | INMUSUPR-YES |
| 10 | ANOTHER COM-YES |
Fig. 2Performance of ROC curve.
Classification report.
| Precision | Recall | F1-score | Support | |
|---|---|---|---|---|
| Class 0 | 0.87 | 0.87 | 0.87 | 23,511 |
| Class 1 | 0.87 | 0.86 | 0.87 | 23,611 |
| Accuracy | 0.87 | 47,122 | ||
| Macro avg | 0.87 | 0.87 | 0.87 | 47,122 |
| Weighted avg | 0.87 | 0.87 | 0.87 | 47,122 |
Fig. 3Predicted vs Actual diagnostics.