| Literature DB >> 34306999 |
Justice Moses K Aheto1,2, Henry Ofori Duah3, Pascal Agbadi4, Emmanuel Kweku Nakua5.
Abstract
Malaria is among the leading causes of mortality and morbidity among children in Ghana. Therefore, identifying the predictors of malaria prevalence in children under-five is among the priorities of the global health agenda. In Ghana, the paradigm shifts from using traditional statistics to machine learning techniques to identifying predictors of malaria prevalence are scarce. Thus, the present study used machine learning techniques to identify variables to build the best fitting predictive model of malaria prevalence in Ghana. We analysed the data on 2867 under-five children with malaria RDT results from the 2019 Ghana Malaria Indicator Survey. LASSO, Ridge, and Elastic Net regression methods were used to select variables to build predictive models. The R freeware version 4.0.2 was used. One out of four children tested positive for malaria (25.04%). The logit models based on selected features by LASSO, Ridge, and Elastic Net contained eleven, fifteen, and thirteen features, respectively. The LASSO regression model is preferred because it contains the smallest number of predictors and the smallest prediction error. The significant predictors of malaria among children were being older than 24 months, residing in the poorest household, being severely anaemic, residing in households without electricity, and residing in a rural area. The predictors identified in our study deserve policy attention and interventions to strengthen malaria control efforts in Ghana. The machine learning techniques employed in our study, especially the LASSO regression technique could be beneficial for identifying predictors of malaria prevalence in this group of children.Entities:
Keywords: Elastic net; Ghana; LASSO; Malaria; RIDGE
Year: 2021 PMID: 34306999 PMCID: PMC8258678 DOI: 10.1016/j.pmedr.2021.101475
Source DB: PubMed Journal: Prev Med Rep ISSN: 2211-3355
Descriptive statistics.
| Study variables | N (%) | N (%) | |
|---|---|---|---|
| Negative (-VE) | 2149 (74.96%) | ||
| Positive (+VE) | 718 (25.04) | ||
| Total | 2867 | ||
| -VE | +VE | ||
| < 24 months | 769 (82.51) | 163 (17.49) | χ2 = 46.05, p ≤ 0.001 |
| 24–48 months | 972 (72.65) | 366 (27.35) | |
| > 48 months | 408 (68.34) | 189 (31.66) | |
| 0–1 | 910 (77.65) | 265 (22.35) | χ2 = 25.04, p ≤ 0.005 |
| 2–3 | 1085 (73.81) | 385 (26.19) | |
| > 3 | 154 (68.44) | 71 (31.56) | |
| No | 312 (80.0) | 78 (20.00) | χ2 = 6.12, p ≤ 0.05 |
| Yes | 1837 (74.16) | 640 (25.84) | |
| no | 546 (82.98) | 112 (17.02) | χ2 = 43.23, p ≤ 0.001 |
| all children | 1044 (70.92) | 428 (29.08) | |
| some children | 247 (71.18) | 247 (28.82) | |
| no net in household | 312 (80.00) | 78 (20.00) | |
| Male | 1533 (74.74) | 518 (25.26) | χ2 = 0.17, p = 0.68 |
| Female | 616 (75.49) | 200 (24.51) | |
| male | 1081 (73.84) | 383 (26.16) | χ2 = 1.99, p = 0.16 |
| Female | 1068 (76.12) | 335 (23.88) | |
| no | 1758 (74.40) | 605 (25.60) | χ2 = 2.24, p = 0.13 |
| Yes | 391 (77.58) | 113 (22.42) | |
| Poorest | 619 (64.48) | 341 (35.52) | χ2 = 214.24, p ≤ 0.001 |
| Poorer | 420 (67.42) | 203 (32.58) | |
| Middle | 431 (77.38) | 126 (22.62) | |
| Richer | 369 (90.00) | 41 (10) | |
| Richest | 310 (97.79) | 7 (2.21) | |
| Severe | 10 (20.41) | 39 (79.59) | χ2 = 267.76, p ≤ 0.001 |
| Moderate | 504 (59.57) | 342 (40.43) | |
| Mild | 603 (77.91) | 171 (22.09) | |
| Not anaemic | 1032 (86.14) | 166 (13.86) | |
| No | 475 (59.45) | 324 (40.55) | χ2 = 141.90, p ≤ 0.001 |
| Yes | 1674 (80.95) | 394 (19.05) | |
| No | 822 (65.29) | 437 (34.71) | χ2 = 111.74, p ≤ 0.001 |
| Yes | 1327 (82.52) | 281 (17.48) | |
| Urban | 944 (89.06) | 116 (10.94) | χ2 = 178.12, p ≤ 0.001 |
| Rural | 1205 (66.69) | 602 (33.31) | |
| Western | 205 (71.18) | 83 (28.82) | χ2 = 123.90, p ≤ 0.001 |
| Central | 174 (67.97) | 82 (32.03) | |
| Greater Accra | 184 (98.92) | 2 (1.08) | |
| Volta | 172 (68.53) | 79 (31.47) | |
| Eastern | 160 (71.11) | 65 (28.89) | |
| Ashanti | 240 (85.71) | 40 (14.29) | |
| Brong Ahafo | 165 (62.03) | 101 (37.97) | |
| Northern | 406 (79.45) | 105 (20.55) | |
| Upper East | 210 (70.00) | 90 (30.00) | |
| Upper West | 233 (76.64) | 71 (23.36) | |
| No child | 858 (81.87) | 190 (18.13) | χ2 = 42.48, p ≤ 0.001 |
| 1–2 children | 1157 (71.20) | 468 (28.80) | |
| >3 children | 134 (69.07) | 60 (30.93) | |
| No | 973 (80.28) | 239 (19.72) | χ2 = 31.70, p ≤ 0.001 |
| Yes | 1176 (71.06) | 479 (28.94) | |
| χ2 = 17.03, p ≤ 0.001 | |||
| < 6 members | 1038 (78.46) | 285 (21.54) | |
Members | 822 (72.61) | 310 (27.39) | |
| > 9 members | 289 (70.15) | 123 (29.85) | |
Lasso, Ridge, and Elastic Net.
| LASSO | RIDGE | ELASTIC NET | |
|---|---|---|---|
| alpha = 1 | alpha = 0 | alpha = 0.4186508 | |
| (Intercept) | 1.043406015 | 0.81622002 | 0.957635393 |
| Region | −0.125009418 | −0.11751371 | −0.125348706 |
| Urban-rural residence | 0.797182998 | 0.79393783 | 0.806272194 |
| Has electricity in HH | −0.334558348 | −0.35767631 | −0.353889442 |
| Has Television in HH | . | −0.06166143 | −0.001048464 |
| Sex of HH | 0.019345178 | 0.07802045 | 0.047635449 |
| Has mosquito bed net for sleeping | . | −0.08209747 | . |
| Household wealth index | −0.356708586 | −0.31547715 | −0.352298541 |
| sex of household member | . | −0.02806722 | . |
| Anaemia level | −0.821035391 | −0.77787099 | −0.817498792 |
| Dwelling sprayed against mosquito last 12 months | −0.350619294 | −0.38498364 | −0.367712726 |
| Number of children who slept under mosquito bed net previous night | 0.022397129 | 0.05179602 | 0.025362654 |
| Number of U5C in household | 0.003430091 | 0.03509552 | 0.02308751 |
| Insecticide-treated net | 0.136570517 | 0.18949137 | 0.157816524 |
| Child Age | 0.651001735 | 0.619984 | 0.652697693 |
| number of household members | . | 0.02590606 | 0.000677887 |
Fig. 1The binomial deviance versus the log(Lambda) plots. Note: 1st row: LASSO; 2nd row: RIDGE; 3rd row: ELASTIC NET.
Explained variance and area under the curve results for LASSO, RIDGE, and Elastic Net.
| Model | R-Square | RMSE (95% CI) | SD | AUC Value | Number |
|---|---|---|---|---|---|
| Lasso | 0.196989 | 0.9489 (0.9286, 0.9691) | 0.0202 | 81.20% | 11 |
| Ridge | 0.1972966 | 1.0366 (1.0194, 1.0537) | 0.0172 | 81.20% | 15 |
| Elastic net | 0.1971316 | 0.9531 (0.9342, 0.9721) | 0.0190 | 81.20% | 13 |
Fig. 2Area under the curve values for the LASSO, RIDGE, AND Elastic Net feature selected models.
Fig. 3Area under the ROC curve comparing the predictive ability of the training and the validation sets.
Regressors of Malaria Prevalence in Ghana.
| LASSO | RIDGE | ELASTIC NET | |
|---|---|---|---|
| Variables | |||
| (Intercept) | 4.35 [1.81, 11.10] | 4.76 [1.92, 12.56] | 4.33 [1.79, 11.12] |
| < 24 months | 1 | 1 | 1 |
| 24–48 months | 2.63 [2.06, 3.36] | 2.61 [2.05, 3.34] | 2.61 [2.05, 3.35] |
| > 48 months | 4.28 [3.19, 5.77] | 4.26 [3.18, 5.74] | 4.26 [3.18, 5.75] |
| 0–1 | 1 | 1 | 1 |
| 2–3 | 1.12 [0.90, 1.39] | 1.09 [0.87, 1.36] | 1.09 [0.87, 1.36] |
| > 3 | 1.54 [0.99, 2.38] | 1.33 [0.81, 2.18] | 1.34 [0.81, 2.19] |
| Male | 1 | 1 | 1 |
| Female | 0.97 [0.77, 1.22] | 0.97 [0.77, 1.23] | 0.98 [0.77, 1.23] |
| no | 1 | 1 | 1 |
| Yes | 0.83 [0.57, 1.20] | 0.83 [0.57, 1.20] | 0.83 [0.57, 1.20] |
| Poorest | 1 | ||
| Poorer | 0.81 [0.61, 1.09] | 0.82 [0.61, 1.11] | 0.82 [0.61, 1.10] |
| Middle | 0.59 [0.40, 0.86] | 0.60 [0.40, 0.90] | 0.60 [0.40, 0.90] |
| Richer | 0.33 [0.20, 0.52] | 0.34 [0.20, 0.56] | 0.33 [0.20, 0.56] |
| Richest | 0.10 [0.04, 0.23] | 0.11 [0.04, 0.24] | 0.11 [0.04, 0.24] |
| Severe | 1 | ||
| Moderate | 0.16 [0.07, 0.34] | 0.16 [0.07, 0.34] | 0.16 [0.07, 0.33] |
| Mild | 0.05 [0.02, 0.11] | 0.05 [0.02, 0.11] | 0.05 [0.02, 0.11] |
| not anaemic | 0.03 [0.01, 0.06] | 0.03 [0.01, 0.06] | 0.03 [0.01, 0.06] |
| No | 1 | ||
| Yes | 0.68 [0.53, 0.87] | 0.68 [0.52, 0.90] | 0.68 [0.52, 0.89] |
| Urban | 1 | ||
| Rural | 2.09 [1.58, 2.77] | 2.09 [1.58, 2.77] | 2.08 [1.57, 2.76] |
| Western | 1 | 1 | 1 |
| Central | 0.90 [0.58, 1.38] | 0.90 [0.58, 1.38] | 0.90 [0.58, 1.38] |
| Greater Accra | 0.08 [0.01, 0.29] | 0.08 [0.01, 0.28] | 0.08 [0.01, 0.28] |
| Volta | 0.72 [0.47, 1.11] | 0.72 [0.47, 1.10] | 0.72 [0.47, 1.10] |
| Eastern | 0.94 [0.60, 1.46] | 0.95 [0.61, 1.47] | 0.94 [0.60, 1.47] |
| Ashanti | 0.49 [0.30, 0.79] | 0.49 [0.30, 0.78] | 0.49 [0.30, 0.78] |
| Brong Ahafo | 0.81 [0.53, 1.23] | 0.80 [0.53, 1.22] | 0.80 [0.53, 1.22] |
| Northern | 0.17 [0.11, 0.26] | 0.16 [0.10, 0.26] | 0.16 [0.10, 0.26] |
| Upper East | 0.37 [0.23, 0.57] | 0.36 [0.23, 0.56] | 0.36 [0.23, 0.56] |
| Upper West | 0.31 [0.18, 0.54] | 0.30 [0.17, 0.52] | 0.30 [0.17, 0.52] |
| No child | 1 | 1 | 1 |
| 1–2 children | 1.15 [0.75, 1.76] | 1.18 [0.75, 1.86] | 1.12 [0.72, 1.72] |
| >3 children | 1.02 [0.55, 1.85] | 1.04 [0.56, 1.94] | 0.98 [0.53, 1.79] |
| No | 1 | 1 | 1 |
| Yes | 1.04 [0.69, 1.56] | 1.08 [0.72, 1.63] | 1.08 [0.74, 1.63] |
| no | — | 1 | 1 |
| yes | — | 0.97 [0.74, 1.28] | 0.98 [0.74, 1.29] |
| — | |||
| < 6 members | — | 1 | 1 |
| 6–9 members | — | 1.04 [0.83, 1.31] | 1.04 [0.83, 1.31] |
| > 9 members | — | 1.25 [0.86, 1.79] | 1.25 [0.86, 1.79] |
| — | — | ||
| male | — | 1 | — |
| female | — | 0.95 [0.78, 1.16] | — |
| — | — | ||
| No | — | 1 | — |
| Yes | — | 0.87 [0.59, 1.28] | — |