| Literature DB >> 34966184 |
Demian da Silveira Barcellos1, Giovane Matheus Kayser Fernandes2, Fábio Teodoro de Souza3,4.
Abstract
There is an ongoing need for scientific analysis to help governments and public health authorities make decisions regarding the COVID-19 pandemic. This article presents a methodology based on data mining that can offer support for coping with epidemic diseases. The methodological approach was applied in São Paulo, Rio de Janeiro and Manaus, the cities in Brazil with the most COVID-19 deaths until the first half of 2021. We aimed to predict the evolution of COVID-19 in metropolises and identify air quality and meteorological variables correlated with confirmed cases and deaths. The statistical analyses indicated the most important explanatory environmental variables, while the cluster analyses showed the potential best input variables for the forecasting models. The forecast models were built by two different algorithms and their results have been compared. The relationship between epidemiological and environmental variables was particular to each of the three cities studied. Low solar radiation periods predicted in Manaus can guide managers to likely increase deaths due to COVID-19. In São Paulo, an increase in the mortality rate can be indicated by drought periods. The developed models can predict new cases and deaths by COVID-19 in studied cities. Furthermore, the methodological approach can be applied in other cities and for other epidemic diseases.Entities:
Mesh:
Year: 2021 PMID: 34966184 PMCID: PMC8716530 DOI: 10.1038/s41598-021-04029-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Sequential diagram of the proposed data approach for public health management.
Figure 2Analysis of the main components in Manaus (generated by the Statistica 7 software—https://statistica.software.informer.com/).
Figure 3Analysis of the main components in SP(generated by the Statistica 7 software—https://statistica.software.informer.com/).
Figure 4Dendrogram and K-means clusters of SP (generated by the Statistica 7 software—https://statistica.software.informer.com/).
Figure 5Accuracy comparison of the CBA and J48 prediction models.
Predictive models generated by the CBA.
| New canfirmed (NC) | C% | S% | New deaths (ND) | C% | S% |
|---|---|---|---|---|---|
IF: 800_ < _NC_ < _2200 and 40_ < _isol_avg_index_ < _45 and 15_ < _dew_pointMax_ < _18 THEN→ 800_ < _NC7_SP_ < _2200 | 81.5 | 5 | IF: soil_wat_avail_ < _55 and ND_ > _75 and tAvg _ < _20 THEN→ ND7_SP_ > _75 | 100 | 6.4 |
IF: 3_<_pot_evapo_<_5 and urMax_>_91 and death_rate_<_9 THEN→ NC2_RJ_<_240 (C%: 81.2, S%: 5.6) | 81.2 | 5.6 | IF: soil_wat_avail_ < _55 and ND_ > _75 and May THEN→ ND7_SP_ > _75 | 100 | 5.6 |
IF: NC_<_240 and urMax_>_91 and March THEN→ NC6_RJ_<_240 (C%: 100, S%: 5) | 100 | 5 | IF: ND_ > _75 and tMax _ < _25 and 0_ < _sum_rain_t3_ < _5 THEN→ ND7_SP_ > _75 | 100 | 6.4 |
IF: NC_ < _240 and death_rate_ < _9 and March THEN→ NC7_RJ_ < _240 | 100 | 6 | IF: 36_ < _ urMin _ < _48 and NC_ < _240 and death_rate_ < _9 THEN→ ND4_RJ_ < _25 | 96.9 | 6.8 |
IF: NC_ < _240 and urMax _ > _91 and March THEN→ NC7_RJ_ < _240 | 100 | 5.1 | IF: NC_ < _240 and death_rate_ < _9 and March THEN→ ND7_RJ_ < _25 | 100 | 6 |
IF: dew_pointMin _>_16 and NC_<_240 and March THEN→ NC7_RJ_<_240 | 100 | 5 | IF: NC_ < _240 and urMax _ > _91 and March THEN→ ND7_RJ_ < _25 | 100 | 5.1 |
IF: death_rate_ < _5 and total_conf_ > _85000 THEN→ NC2_Manaus_ > _400 | 100 | 8 | IF: ND_ > _15 and total_deaths_ > _3300 and sum_rain_t5_ > _45 THEN→ ND2_Manaus_ > _15 | 90.1 | 9.1 |
IF: death_rate_ < _5 and total_deaths_ > _3300 and avg_rad_t5_ < _12 THEN→ NC6_Manaus_ > _400 | 100 | 6.7 | IF: ND_ > _15 and total_deaths_ > _3300 and tMax _ < _31 THEN→ ND1_Manaus_ > _15 | 93.7 | 10.2 |
IF:ND_ > _15 and NC_ > _400 and 5_ < _sum_rain_t3_ < _26 THEN→ NC1_Manaus_ > _400 | 90.3 | 6.3 | IF: ND_ > _15 and NC_ > _400 and avg_rad_t7_ < _12 THEN→ ND6_Manaus_ > _15 | 100 | 10.8 |
IF:death_rate_ < _5 and total_deaths_ > _3300 and avg_rad_t3_ < _12 THEN→ NC4_Manaus_ > _400 | 100 | 6.7 | IF: dew_pointMin _ > _21 and ND_ > _15 and NC_ > _400 THEN→ ND7_Manaus_ > _15 | 93.5 | 9.9 |
IF:death_rate_ < _5 and total_deaths_ > _330 and sum_rain_t7_ > _70 THEN→ NC6_Manaus_ > _400 | 100 | 6.7 | IF: drought_=_0 and ND_ > _15 and NC_ > _400 THEN→ ND6_Manaus_ > _15 | 94.2 | 11.2 |