| Literature DB >> 30181112 |
Subhash Chandir1,2, Danya Arif Siddiqi3, Owais Ahmed Hussain4, Tahira Niazi4, Mubarak Taighoon Shah3, Vijay Kumar Dharma3, Ali Habib4, Aamir Javed Khan3.
Abstract
BACKGROUND: Despite the availability of free routine immunizations in low- and middle-income countries, many children are not completely vaccinated, vaccinated late for age, or drop out from the course of the immunization schedule. Without the technology to model and visualize risk of large datasets, vaccinators and policy makers are unable to identify target groups and individuals at high risk of dropping out; thus default rates remain high, preventing universal immunization coverage. Predictive analytics algorithm leverages artificial intelligence and uses statistical modeling, machine learning, and multidimensional data mining to accurately identify children who are most likely to delay or miss their follow-up immunization visits.Entities:
Keywords: artificial intelligence; dropouts; immunizations; machine learning; predictive analytics
Year: 2018 PMID: 30181112 PMCID: PMC6231754 DOI: 10.2196/publichealth.9681
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Figure 1Procedures of the Study.
Figure 2Derivation procedure for extracting training and validation cohort data. ZM: Zindagi Mehfooz.
Baseline characteristics of the training and validation data cohorts.
| Characteristics of the participants | Training cohort (N=47,554) | Validation cohort (N=11,889) | |||||
| Enrollment age (weeks), mean (SD) | 12.92 (15.9) | 12.93 (15.9) | |||||
| Gender (female), n (%) | 20,425 (42.95) | 5049 (42.47) | |||||
| BCGa | 24,744 (52.03) | 6195 (52.11) | |||||
| Pentavalent-1 | 8955 (18.83) | 2236 (18.81) | |||||
| Others | 13,855 (29.14) | 3458 (29.08) | |||||
| Urdu | 846 (1.78) | 208 (1.75) | |||||
| Unknown | 46,561 (97.91) | 11,644 (97.94) | |||||
| Others | 147 (0.31) | 37 (0.31) | |||||
| Korangi | 41,225 (86.69) | 10,296 (86.60) | |||||
| Muzafargarh Town | 1693 (3.56) | 445 (3.74) | |||||
| Others | 4636 (9.75) | 1148 (9.66) | |||||
| Karachi | 45,415 (95.50) | 11,334 (95.33) | |||||
| Muzafargarh | 1996 (4.20) | 519 (4.37) | |||||
| Others | 43 (0.30) | 36 (0.30) | |||||
| Early | 16 (0.07) | 4 (0.07) | |||||
| Late | 17,126 (70.19) | 4254 (69.61) | |||||
| Timely | 7258 (29.75) | 1852 (30.32) | |||||
| Early | 11 (0.12) | 1 (0.02) | |||||
| Late | 8892 (99.73) | 2220 (99.78) | |||||
| Timely | 13 (0.15) | 4 (0.18) | |||||
| Early | 9 (0.22) | 2 (0.20) | |||||
| Late | 4099 (99.15) | 996 (99.20) | |||||
| Timely | 26 (0.63) | 6 (0.60) | |||||
| Early | 14 (0.38) | 3 (0.34) | |||||
| Late | 4338 (99.31) | 883 (99.21) | |||||
| Timely | 11 (0.30) | 4 (0.45) | |||||
| Early | 6 (0.14) | 1 (0.09) | |||||
| Late | 4338 (99.20) | 1113 (99.02) | |||||
| Timely | 29 (0.66) | 10 (0.89) | |||||
| <1 month | 5465 (11.49) | 1386 (11.66) | |||||
| 1-9 months | 35,972 (75.64) | 8949 (75.27) | |||||
| >1 year | 6117 (12.86) | 1554 (13.07) | |||||
aBCG: Bacillus Calmette–Guérin.
bExcludes records with invalid dates.
Figure 3Flow diagram of all the study predictive models.
Performance of the study models predicting the likelihood of defaulting from the follow-up immunization visits. Higher C-statistics results in better algorithm discrimination.
| Model | Area under the curve C-statistic | 95% CI |
| Recursive partitioning | 0.791 | 0.784-0.798 |
| Support vector machines | 0.786 | 0.777-0.792 |
| Random forests | 0.750 | 0.742-0.756 |
| C-Forest | 0.782 | 0.775-0.789 |
Performance metrics of all the study predictive models.
| Model | Accuracy (%) | Sensitivity (%) | Specificity (%) | Precision (%) | Negative predicted value (%) |
| Recursive partitioning | 78.9 | 74.0 | 84.2 | 83.4 | 75.1 |
| Support vector machines | 78.8 | 88.9 | 68.0 | 74.9 | 85.1 |
| Random forests | 75.6 | 94.9 | 54.9 | 69.3 | 91.0 |
| C-Forest | 78.6 | 90.5 | 65.8 | 74.0 | 86.6 |
Figure 4Receiver operating characteristic for all the study predictive models.