| Literature DB >> 34250321 |
Darius Phiri1, Serajis Salekin2, Vincent R Nyirenda3, Matamyo Simwanda1, Manjula Ranagalage4,5, Yuji Murayama5.
Abstract
The global pandemic emergent from SARS-COV-2 (COVID-19) has continued to cause both health and socioeconomic challenges worldwide. However, there is limited information on the factors affecting the dynamics of COVID-19, especially in developing countries, including African countries. In this study, we have focused on understanding the association of COVID-19 cases with environmental and socioeconomic factors in Zambia - a sub-Saharan African country. We used Zambia's district-level COVID-19 data, covering 18 March 2020 (i.e., from first reported cases) to 17 July 2020. Geospatial approaches were used to organise, extract and establish the dataset, while a classification tree (CT) technique was employed to analyse the factors associated with the COVID-19 cases. The analyses were conducted in two stages: (1) the binary analysis of occurrences of COVID-19 (i.e., COVID-19 or No COVID-19), and (2) a risk level analysis which grouped the number of cases into four risk levels (high, moderate, low and very low). The results showed that the distribution of COVID-19 cases in Zambia was significantly influenced by the socioeconomic factors compared to environmental factors. More specifically, the binary model showed that distance to the airport, population density and distance to the town centres were the most combination influential factors, while the risk level analysis indicated that areas with high rates of human immunodeficient virus (HIV) infection had relatively high chances of having many COVID-19 cases compared to areas with low HIV rates. The districts that are far from major urban establishments and that experience higher temperatures have lower chances of having COVID-19 cases. This study makes two major contributions towards the understanding of COVID-19 dynamics: (1) the methodology presented here can be effectively applied in other areas to understand the association of environmental and socioeconomic factors with COVID-19 cases, and (2), the findings from this study present the empirical evidence of the relationship between COVID-19 cases and their associated environmental and socioeconomic factors. Further studies are needed to understand the relationship of this disease and the associated factors in different cultural settings, seasons and age groups, especially as the COVID-19 cases increase and spread in many countries.Entities:
Keywords: Africa; COVID-19; Classification Tree; GIS, Zambia; HIV/AIDS
Year: 2021 PMID: 34250321 PMCID: PMC8256674 DOI: 10.1016/j.sciaf.2021.e00827
Source DB: PubMed Journal: Sci Afr ISSN: 2468-2276
Fig. 1The location of the study site - Zambia. The map also shows the location of the neighboring countries, airports, border posts and the major towns in Zambia.
Fig. 2The distribution of COVID-19 cases in Zambia at district level as of 17 July 2020 (Modified from the Ministry of Health daily reports).
The response variables used for both the binary and risk level Classification Tree models for assessing COVID-19 cases in Zambia.
| Model | Level | Range of case | Number of Districts |
|---|---|---|---|
| Binary | Presence of COVID-19 | ≥1 | 68 |
| Absence of COVID-19 | 0 | 44 | |
| Risk levels | High | >100 | 9 |
| Moderate | 50–100 | 21 | |
| Low | 5–50 | 24 | |
| Very low | <5 | 12 |
The description of socioeconomic and environmental factors used in this study with their spatial and temporal resolutions, and sources.
| Category | Factors and units | Range | Spatial resolution | Temporal resolution | Sources |
|---|---|---|---|---|---|
| Topographic | Elevation (m) | 325–2296 | 30 m | _ | USGS |
| Slope (°) | 0–57.42 | 30 m | _ | USGS | |
| Aspect (°) | -1–359.90 | 30 m | _ | USGS | |
| Climatic | Total annual precipitation (mm) | 590–1503 | 1 km | 1970–2000 | |
| Solar radiance (w m−2) | 15763–20511 | 1 km | 1970–2000 | ||
| Maximum temperature (°C) | 19.78–33.63 | 1 km | 1970–2000 | ||
| Minimum temperature (°C) | 8.60–21.57 | 1 km | 1970–2000 | ||
| Mean temperature (°C) | 14.30–26.30 | 1 km | 1970–2000 | ||
| Social | District status (urban, rural) | - | District level | Central Statistics Office of Zambia (CSO) | |
| Total population (count) | 25, 294–1,701,640 | District level | 1969–2010 | CSO | |
| Human immunodeficiency virus (HIV) rate | District level | 2010 | CSO | ||
| Population density (persons km-2) | 2.70–4,841.60 | District level | 2010 | CSO | |
| Crop yield (tonnes) | 451.93–67,600.78 | District level | 1990–2010 | CSO | |
| Population change (count) | 17,369–1,359, 354 | District level | 1990–2010 | CSO | |
| Proximity | Euclidean distance to active mine centres (km) | 0–280.35 | 30 m | _ | Ministry of Mines for Zambia |
| Euclidean distance to waterbody edges (km) | 0–108.62 | 30 m | _ | Forest Department (FD) | |
| Euclidean distance to town centres (km) | 0–82.56 | 30 m | _ | FD | |
| Accessibility | Euclidean distance to road (km) | 0–104.25 | 30 m | _ | Road Development Agency of Zambia (RDA) |
| Euclidean distance to railway (km) | 0–108.67 | 30 m | _ | FD | |
| Euclidean distance to rivers (km) | 0–128.62 | 30 m | _ | FD | |
| Euclidean distance to border towns (km) | 0–355 | 30 m | - | Ministry of Tourism | |
| Distance to airports (km) | 0–305 | 30 m | Ministry of Tourism |
Fig. 3Graphic representation of selected factors used in this study. Note that distance to the airport, borders, towns and mines were calculated using the Euclidean distance approach.
Confusion matrices for accuracy assessment of binary model (COVID-19/NO COVID-19)1.
| Reference | ||||
|---|---|---|---|---|
| Prediction | COVID-19 | No COVID-19 | Total | UA (%) |
| COVID 19 | 15 | 0 | 15 | 100 |
| No COVID-19 | 2 | 5 | 7 | 71 |
| Total | 17 | 5 | 22 | |
| PA (%) | 88 | 100 | OA = 91% |
Fig. 4The CT model showing the relationship between COVID-19 cases and the environmental and socioeconomic factors. The dark grey shade represents the districts where COVID-19 was reported, while the light grey shade represents the districts without COVID-19. Dist Airport refers to distance to the nearest airport, Popu Density refers to population density, Dist Mine refers to distance to the mines, Dist Town is distance to the nearest town, Total Precip is total precipitation, while Dist Boarder refers to distance to the boarder.
Confusion matrices for accuracy assessment of the four risk levels2.
| Reference | ||||||
|---|---|---|---|---|---|---|
| Prediction | High | Moderate | Low | Very low | Total | UA (%) |
| High | 3 | 1 | 0 | 0 | 4 | 75 |
| Moderate | 0 | 6 | 1 | 0 | 7 | 86 |
| Low | 1 | 1 | 8 | 0 | 10 | 80 |
| Very low | 0 | 0 | 0 | 1 | 1 | 100 |
| Total | 4 | 8 | 9 | 1 | 22 | |
| PA (%) | 75 | 75 | 89 | 100 | OA = 82% |
Fig. 5The CT model showing the relationship between four COVID-19 risk levels and the environmental and socioeconomic factors. Where Dist Airport refers to distance to the nearest airport, HIV cases refers to the number of people living with HIV, Dist Mine refers to distance to the mines, Dist Town refers to distance to the nearest towns, Dist Border is distance to the border, while Mean Temp is Mean Temperature.