| Literature DB >> 30523337 |
Carinna Hockham1,2, Samir Bhatt3, Roshan Colah4, Malay B Mukherjee4, Bridget S Penman5,6, Sunetra Gupta5, Frédéric B Piel5,7.
Abstract
Sickle-cell anaemia (SCA) is a neglected chronic disorder of increasing global health importance, with India estimated to have the second highest burden of the disease. In the country, SCA is particularly prevalent in scheduled populations, which comprise the most socioeconomically disadvantaged communities. We compiled a geodatabase of a substantial number of SCA surveys carried out in India over the last decade. Using generalised additive models and bootstrapping methods, we generated the first India-specific model-based map of sickle-cell allele frequency which accounts for the district-level distribution of scheduled and non-scheduled populations. Where possible, we derived state- and district-level estimates of the number of SCA newborns in 2020 in the two groups. Through the inclusion of an additional 158 data points and 1.3 million individuals, we considerably increased the amount of data in our mapping evidence-base compared to previous studies. Highest predicted frequencies of up to 10% spanned central India, whilst a hotspot of ~12% was observed in Jammu and Kashmir. Evidence was heavily biased towards scheduled populations and remained limited for non-scheduled populations, which can lead to considerable uncertainties in newborn estimates at national and state level. This has important implications for health policy and planning. By taking population composition into account, we have generated maps and estimates that better reflect the complex epidemiology of SCA in India and in turn provide more reliable estimates of its burden in the vast country. This work was supported by European Union's Seventh Framework Programme (FP7//2007-2013)/European Research Council [268904 - DIVERSITY]; and the Newton-Bhabha Fund [227756052 to CH].Entities:
Mesh:
Year: 2018 PMID: 30523337 PMCID: PMC6283872 DOI: 10.1038/s41598-018-36077-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic overview of database generation procedures and geostatistical modelling processes. Pink diamonds represent input data; green boxes denote methodological steps; blue rods depict model outputs. *Historical map of malaria endemicity and contemporary map of malaria. **Two urban accessibility metrics, nighttime lights and travel time to the nearest city (Supplementary Information S2).
Figure 2(a) A map of the sickle-cell surveys included in our database (n = 249). Data points are coloured according to the βS allele frequency reported in the study sample. The size of the data points relates to their sample size. A spatial jitter of up to 0.3° latitude and longitude decimal degrees coordinates was applied to improve visualisation of the data. (b) Map of median predicted βS allele frequency estimates at a resolution of 10 km × 10 km. State boundaries are displayed in dark grey.
Summary results of the selected GAM for the scheduled and non-scheduled training datasets.
| Estimate | ||
|---|---|---|
|
| ||
|
| −3.0898 | < 0.0001 |
|
|
|
|
| 22.1800 | <0.0001 | |
| R2 = 0.5170 | ||
| GCV = 1.0038 | ||
| MSE = 0.7555 | ||
| AIC = 495.9284 | ||
|
| ||
|
| −4.0709 | <0.0001 |
|
|
|
|
| 10.0900 | <0.0001 | |
| R2 = 0.6890 | ||
| GCV = 1.0728 | ||
| MSE = 0.6639 | ||
| AIC = 150.4534 | ||
The intercept, smoothing term f and its corresponding p-value, adjusted R2, Generalised Cross Validation (GCV) score, mean squared error (MSE) and Akaike Information Criterion (AIC) are given.
Figure 3Map of the estimated number of scheduled newborns born with SCA in India, (a) by state and, (b) by district, in 2020. The medians of the predictive probability distribution of the areal estimates are displayed. The district shaded grey in Tamil Nadu in (b) is that where the 95% CI was very large (>1000). State boundaries are displayed in dark grey and district boundaries in light grey.
Figure 4Map of the estimated number of non-scheduled newborns born with SCA in India, (a) by state and, (b) by district, in 2020. The medians of the predictive probability distribution of the areal estimates are displayed. The states and districts shaded grey are those where our estimates were highly variable (95% CI > 10 000 and > 1000, respectively (Supplementary Figure S9). State boundaries are displayed in dark grey and district boundaries in light grey.