| Literature DB >> 33009733 |
Chigozie Louisa Jane Ugwu1, Temesgen Zewotir1.
Abstract
Although malaria burden has declined globally following scale up of intervention, the disease has remained a leading cause of hospitalization and deaths among children aged under-5 years in Nigeria. Malaria is known to be related to climate and environmental conditions. Previous research has usually studied the effects of these factors, neglecting possible correlation between them, high correlation among variables is a source of multicollinearity that induces overfitting in regression modelling. In this paper, a factor analysis was first introduced to circumvent the issue of multicollinearity and a Generalized Additive Model (GAM) was subsequently explored to identify the important risk factors that might influence the prevalence of childhood malaria in Nigeria. The GAM incorporated the complexity of the survey data, while simultaneously modelling the nonlinear and spatial random effects to allow a more precise identification of the major malaria risk factors that influence the geographical distribution of the disease. From our findings, the three latent factor components (constituted by humidity, precipitation, potential evapotranspiration, and wet days/maximum and minimum temperature/proximity to permanent waters, respectively) were significantly associated with malaria prevalence. Our analysis also detected statistically significant and nonlinear effect of altitude: the risk of malaria increased with lower values but declined sharply with higher values. A significant spatial variability in under-5 malaria prevalence across the survey clusters was also observed; malaria burden was higher in the northern part of Nigeria. Investigating the impact of important risk factors and geographical location on childhood malaria is of high relevance for the sustainable development goals (SDGs) 2015-2030 Agenda on malaria eradication, and we believe that the information obtained from this study and the generated risk maps can be useful to effectively target intervention efforts to high-risk areas based on climate and environmental context.Entities:
Keywords: Factor analysis; GAMs; hot-spots; multicollinearity; nonlinear effects; spatial autocorrelation
Year: 2020 PMID: 33009733 PMCID: PMC7758859 DOI: 10.2991/jegh.k.200814.001
Source DB: PubMed Journal: J Epidemiol Glob Health ISSN: 2210-6006
Figure 1Locations map of where the survey dataset was collected based on the 2015 MIS–DHS in Nigeria’s 37 States, including the Federal Capital Territory (FCT).
Climate and environmental covariates, and their definitions
| Enhanced Vegetation Index (EVI) | Nigeria Demographic and Health Survey (NDHS) Spatial Analysis data | The average vegetation index value within the 2 km (urban) or 10 km (rural) buffer surrounding the DHS cluster at the time of measurement (year). The enhanced vegetation index was calculated by measuring the density of green leaves in the near-infrared and visible bands. |
| Proximity to waters (Coast/Large Lakes) | NDHS Spatial Analysis data | Straight-line distance to the nearest major water body. Based on the World Vector Shorelines, CIA World Data Bank II, and Atlas of the Cryosphere. |
| Population density | NDHS Spatial Analysis data | Estimates of human population density is the number of persons/km2 based on counts consistent with national censuses and population registers. |
| Precipitation | NDHS Spatial Analysis data | The average precipitation measured within the 2 km (urban) or 10 km (rural) buffer surrounding the DHS survey cluster each year. |
| Travel time to nearest settlement >50,000 inhabitants | NDHS Spatial Analysis data | The average time (min) required to reach a high-density urban center from the area within the 2 km (urban) or 10 km (rural) buffer surrounding the DHS cluster location, based on year 2015 infrastructure data. |
| Minimum temperature | NDHS Spatial Analysis data | The average annual maximum temperature within the 2 km (urban) or 10 km (rural) buffer surrounding the DHS cluster location. The maximum temperature is calculated from the modeled mean temperature and the modeled diurnal temperature range. |
| Maximum temperature | NDHS Spatial Analysis data | The average annual minimum temperature within the 2 km (urban) or 10 km (rural) buffer surrounding the DHS cluster location. The minimum temperature is calculated from the modeled mean temperature and the modeled diurnal temperature range. |
| Potential Evapotranspiration (PET) | NDHS Spatial Analysis data | The average annual PET within the 2 km (urban) or 10 km (rural) buffer surrounding the DHS cluster location, synthetic measurement that was calculated using a variation of the Penman–Monteith formula. |
| Cluster altitude | NDHS Spatial Analysis | Measure of surface altitude (m). The data were interpolated using a thin plate smoothing spline algorithm with altitude, longitude and latitude as independent variables. |
| Wet days | NDHS Spatial Analysis data | The average number of days receiving rainfall within the 2 km (urban) or 10 km (rural) buffer surrounding the DHS cluster location. It combines the number of observed days with rainfall from weather stations with the number of days that should have received rainfall. |
| Urban–rural settlement | NDHS Spatial Analysis data | This is urban–rural population classification of the area within the 2 km (urban) or 10 km (rural) buffer surrounding the DHS survey cluster location. It is the representation of the degree of urbanization concept obtained using urban-rural settlement classification model adopted by the Global Human Settlement Layer (GHSL) project. |
Correlations between the climate and environmental variables (all correlations were significant at 5% level)
| Precp | 1.000 | |||||||||||
| Hum | 0.989 | 1.000 | ||||||||||
| EVI | 0.633 | 0.623 | 1.000 | |||||||||
| MaxT | 0.055 | −0.033 | −0.055 | 1.000 | ||||||||
| MinT | 0.463 | 0.394 | 0.271 | 0.872 | 1.000 | |||||||
| PET | −0.606 | −0.667 | −0.565 | 0.677 | 0.250 | 1.000 | ||||||
| ProxW | −0.616 | −0.633 | −0.518 | 0.310 | −0.099 | 0.791 | 1.000 | |||||
| UR | 0.131 | 0.181 | −0.067 | −0.250 | −0.100 | −0.319 | −0.218 | 1.000 | ||||
| TR | 0.031 | 0.036 | −0.038 | 0.106 | 0.087 | 0.101 | 0.006 | −0.469 | 1.000 | |||
| PopD | −0.145 | −0.085 | −0.307 | −0.452 | −0.392 | −0.270 | −0.092 | 0.601 | −0.310 | 1.000 | ||
| Alt | −0.487 | −0.533 | −0.226 | 0.186 | −0.160 | 0.545 | 0.620 | −0.160 | −0.069 | −0.127 | 1.000 | |
| WetD | 0.957 | 0.959 | 0.686 | 0.070 | 0.499 | −0.629 | 0.624 | 0.184 | −0.002 | −0.109 | −0.443 | 1.000 |
Indicate correlation among variables.
Precp, precipitation; Hum, humidity; EVI, enhanced vegetation index; MaxT, maximum temperature; MinT, minimum temperature; PET, potential evapotranspiration; ProxW, proximity-to-water; UR, urban–rural settlement; TR, travel times; PopD, population density; Alt, cluster altitude; WetD, wet days.
Results of the estimated Varimax-rotated factor loadings, applying factor analysis for the highly correlated covariates
| Annual precipitation | 0.9467 | 0.1313 | −0.2584 |
| Humidity | 0.9567 | 0.0459 | −0.2656 |
| Maximum temperature | −0.0209 | 0.9741 | 0.2151 |
| Minimum temperature | 0.3518 | 0.9244 | −0.0765 |
| Potential evapotranspiration | −0.5819 | 0.5589 | 0.5625 |
| Proximity to permanent water bodies | −0.4099 | 0.1113 | 0.9004 |
| Wet days | 0.9268 | 0.1577 | −0.2978 |
Factors with high loadings.
Approximate significance of the smooth terms
| S (Wet events) | 9.0000 | 27.1206 | 0.0013 |
| S (Temperature variation) | 9.0000 | 19.3509 | 0.0298 |
| S (Proximity to water bodies) | 9.0000 | 43.2202 | <0.0001 |
| S (Cluster altitude) | 8.0000 | 82.5703 | <0.0001 |
| S (Vegetation density) | 7.0000 | 20.5699 | 0.0372 |
| S (Travel time) | 8.0000 | 99.0031 | <0.0001 |
| S (Population density) | 8.0000 | 22.8964 | 0.0035 |
| S (Urban–rural settlement) | 3.0000 | 29.8106 | <0.0001 |
| S (Spatial effects) | 24.4341 | 187.9308 | <0.0001 |
Figure 2Smoothing plots of relationships between under-5 malaria prevalence and climate-environmental factors in GAM with 95% confidence bands. Panel A: wet events component (precipitation, humidity, potential evapotranspiration, and wet days). Panel B: temperature variation component (maximum and minimum temperature). Panel C: distance to permanent water bodies. Panel D: cluster altitude. Panel E: enhanced vegetation index and panel F: travel times.
Figure 3Smoothing plots of relationships between under-5 malaria prevalence and climate and environmental factors with 95% confidence bands. Panel A: population density. Panel B: urban–rural settlement.
Figure 4Predicted risk map of malaria prevalence of under-5 children for the six geopolitical regions in Nigeria. Shown is the colorimetric scale representing the risk of malaria per kilometer square.
Figure 5Predicted risk map of malaria prevalence of under-5 children for the 37 states in Nigeria. Shown is the colorimetric scale representing the risk of malaria per kilometer square.