| Literature DB >> 32801191 |
Adeyinka Emmanuel Adegbosin1, Bela Stantic2, Jing Sun3.
Abstract
OBJECTIVES: To explore the efficacy of machine learning (ML) techniques in predicting under-five mortality (U5M) in low-income and middle-income countries (LMICs) and to identify significant predictors of U5M.Entities:
Keywords: community child health; deep learning; machine learning; maternal medicine; random forest; under-five mortality
Mesh:
Year: 2020 PMID: 32801191 PMCID: PMC7430449 DOI: 10.1136/bmjopen-2019-034524
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Architecture of the deep neural network (DNN)-convolution neural network (CNN) ensemble model. FC, fully connected.
Hyperparameters choice and selected values through grid search
| Hyperparemeter | Available choice | Selected choice |
| Optimizer | Adam, Adadelta, RMSprop | Adam |
| Dropout rate | 0.1, 0.2, 0.3, 0.4, 0.5 | 0.2 |
| Learning rate | 0.002, 0.004, 0.006 | 0.004 |
Descriptive analysis of the study population
| Variables | n (%) | M (SD) |
| Age of child (years) | 1.89 (1.40) | |
| Gender | ||
| Male | 775 957 (51.0) | |
| Female | 744 061 (49.0) | |
| Residence | ||
| Urban | 413 705 (27.3) | |
| Rural | 1 100 211 (72.7) | |
| Wealth quintile | ||
| Poorest | 334 135 (23.7) | |
| Poorer | 302 747 (21.5) | |
| Middle | 283 295 (20.9) | |
| Richer | 260 049 (18.4) | |
| Richest | 231 177 (16.4) | |
| Maternal education level | ||
| No education | 670 115 (44.6) | |
| Primary | 430 147 (28.6) | |
| Secondary | 342 889 (22.8) | |
| Higher | 60 517 (4) | |
| Postnatal check received | ||
| Yes | 283 683 (52.6) | |
| No | 255 752 (47.4) | |
| Prenatal care received | ||
| Yes | 839 821 (79) | |
| No | 223 913 (21) | |
| Delivery care received | ||
| Yes | 1 393 930 (95.8) | |
| No | 60 774 (4.2) | |
| Tetanus injection before birth | ||
| Yes | 387 999 (75.5) | |
| No | 125 889 (24.5) | |
| Duration of breast feeding | ||
| <6 months | 517 671 (36.4) | |
| 6 months or more | 903 170 (63.6) |
N=1 520 018.
Figure 2Feature importance using random forest.
Performance comparison (without feature selection)
| Performance metrics | LR | DNN | CNN | CNN-DNN |
| Sensitivity | 0.47 | 0.67 | 0.66 | 0.68 |
| Specificity | 0.53 | 0.84 | 0.83 | 0.83 |
| Precision | 0.35 | 0.58 | 0.57 | 0.62 |
| F1-score | 0.38 | 0.62 | 0.60 | 0.63 |
| AUC | 0.93 | 0.97 | 0.97 | 0.97 |
AUC, area under the curve; CNN, convolution neural network; DNN, deep neural network; LR, logistic regression.
Figure 3Micro-average receiver operating characteristic (ROC) curve before feature selection. CNN, convolution neural network; DNN, deep neural network; LR, logistic regression.
Metrics comparison after feature selection
| Performance metrics | LR | DNN | CNN | CNN-DNN |
| Sensitivity | 0.47 | 0.69 | 0.68 | 0.71 |
| Specificity | 0.53 | 0.83 | 0.83 | 0.83 |
| Precision | 0.35 | 0.62 | 0.62 | 0.67 |
| F1-score | 0.38 | 0.63 | 0.62 | 0.67 |
| AUC | 0.93 | 0.97 | 0.97 | 0.97 |
AUC, area under the curve; CNN, convolution neural network; DNN, deep neural network; LR, logistic regression.
Figure 4Micro-average receiver operating characteristic (ROC) curve after feature selection. CNN, convolution neural network; DNN, deep neural network; LR, logistic regression.