| Literature DB >> 35239641 |
Timos Papadopoulos1,2, Emilia Vynnycky1,3,4.
Abstract
The basic reproduction number (R0) of an infection determines the impact of its control. For many endemic infections, R0 is often estimated from appropriate country-specific seroprevalence data. Studies sometimes pool estimates from the same region for settings lacking seroprevalence data, but the reliability of this approach is unclear. Plausibly, indicator-based approaches could predict R0 for such settings. We calculated R0 for rubella for 98 settings and correlated its value against 66 demographic, economic, education, housing and health-related indicators. We also trained a random forest regression algorithm using these indicators as the input and R0 as the output. We used the mean-square error to compare the performances of the random forest, simple linear regression and a regional averaging method in predicting R0 using 4-fold cross validation. R0 was <5, 5-10 and >10 for 81, 14 and 3 settings respectively, with no apparent regional differences and in the limited available data, it was usually lower for rural than urban areas. R0 was most correlated with educational attainment, and household indicators for the Pearson and Spearman correlation coefficients respectively and with poverty-related indicators followed by the crude death rate considering the Maximum Information Coefficient, although the correlation for each was relatively weak (Pearson correlation coefficient: 0.4, 95%CI: (0.24,0.48) for educational attainment). A random forest did not perform better in predicting R0 than simple linear regression, depending on the subsets of training indicators and studies, and neither out-performed a regional averaging approach. R0 for rubella is typically low and using indicators to estimate its value is not straightforward. A regional averaging approach may provide as reliable an estimate of R0 for settings lacking seroprevalence data as one based on indicators. The findings may be relevant for other infections and studies estimating the disease burden and the impact of interventions for settings lacking seroprevalence data.Entities:
Mesh:
Year: 2022 PMID: 35239641 PMCID: PMC8893344 DOI: 10.1371/journal.pcbi.1008858
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Estimates of the basic reproduction number for each of the studies.
The blue circles reflect the values calculated from study-specific seroprevalence data (study-specific point estimates), the red circles reflect the default R estimates and the vertical markers reflect R estimated from the regional point estimate of the force of infection. The red and blue bars reflect the 95% ranges of the study-specific and default estimates based on bootstrapping.
Summary of the top ten correlation coefficients and MIC for the association between the basic reproduction number and the indicators, obtained considering estimates of the basic reproduction number for all countries.
The columns labelled “CC” hold the coefficient, with the 95% range obtained by bootstrapping; the column labelled “N” holds the number of data points used to calculate the coefficient.
| Rank | Pearson | Spearman | MIC | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Indicator | CC | N | Indicator | CC | N | Indicator | CC | N | ||
| 1 | Educational attainment, at least completed upper secondary, population 25+, total (%) (cumulative) | 0.4 | 88 | Number of households 5 persons—Proportion over All households | -0.45 | 56 | Poverty gap at $1.90 a day (2011 PPP) (%) | 0.37 | 92 | |
| 2 | Educational attainment, at least completed upper secondary, population 25+, female (%) (cumulative) | 0.4 | 88 | Number of households 5 persons—Per capita | -0.41 | 56 | Poverty headcount ratio at $5.50 a day (2011 PPP) (% of population) | 0.36 | 92 | |
| 3 | Educational attainment, at least completed upper secondary, population 25+, male (%) (cumulative) | 0.39 | 88 | Number of households 6 persons and over—Per capita | -0.33 | 53 | Crude death rate per 1000 population | 0.35 | 98 | |
| 4 | Number of households 5 persons—Proportion over All households | -0.34 | 56 | Physicians (per 1,000 people) | 0.32 | 98 | Life expectancy at birth (both sexes) | 0.34 | 98 | |
| 5 | Physicians (per 1,000 people) | 0.33 | 98 | Prevalence of underweight, weight for age (% of children under 5) | -0.31 | 94 | Poverty gap at $3.20 a day (2011 PPP) (% of population) | 0.34 | 92 | |
| 6 | Proportion of the population aged 65+ | 0.32 | 98 | Immunization, measles (% of children ages 12–23 months) | 0.3 | 98 | Poverty headcount ratio at $1.90 a day (2011 PPP) (% of population) | 0.34 | 92 | |
| 7 | Number of households 5 persons—Per capita | -0.28 | 56 | Number of households 6 persons and over—Proportion over All households | -0.29 | 53 | Poverty gap at $5.50 a day (2011 PPP) (% of population) | 0.33 | 92 | |
| 8 | Proportion of the population aged 0–14 | -0.27 | 98 | Health expenditure, total (% of GDP) | 0.28 | 98 | Poverty headcount ratio at $3.20 a day (2011 PPP) (% of population) | 0.33 | 92 | |
| 9 | Population living in slums (% of urban population) | -0.27 | 75 | Urban population (% of total) | 0.28 | 98 | Proportion of the population aged 0–4 | 0.32 | 98 | |
| 10 | Number of households 6 persons and over—Proportion over All households | -0.26 | 53 | Educational attainment, at least completed upper secondary, population 25+, total (%) (cumulative) | 0.27 | 88 | Physicians (per 1,000 people) | 0.31 | 98 | |
Fig 2Indicators in each category with the largest Spearman correlation with R (correlation value and CI in red font).
The Pearson correlation regression line is plotted in red.
Summary of the top five correlation coefficients and MIC for the association between the basic reproduction number and the indicators, obtained considering estimates of the basic reproduction number just for countries in the African region or the Americas.
The columns labelled “CC” hold the coefficient, with the 95% range obtained by bootstrapping; the column labelled “N” holds the number of data points used to calculate the coefficient.
| Rank | Pearson | Spearman | MIC | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Indicator | CC | N | Indicator | CC | N | Indicator | CC | N | |
|
| |||||||||
| 1 | Unemployment, total (% of total labor force) (national estimate) | 0.35 | 24 | Poverty headcount ratio at national poverty lines (% of population) | -0.4 | 24 | Number of households 3 persons—Proportion over All households | 1.0 | 6 |
| 2 | Poverty headcount ratio at national poverty lines (% of population) | -0.33 | 24 | Income share held by highest 10% | 0.38 | 24 | Number of households 4 persons—Proportion over All households | 1.0 | 6 |
| 3 | Income share held by highest 10% | 0.31 | 24 | Health expenditure, total (% of GDP) | 0.38 | 24 | Immunization, measles (% of children ages 12–23 months) | 0.55 | 24 |
| 4 | Poverty gap at $1.90 a day (2011 PPP) (%) | -0.3 | 24 | Income share held by highest 20% | 0.33 | 24 | Unemployment, total (% of total labor force) (national estimate) | 0.52 | 24 |
| 5 | Poverty gap at $3.20 a day (2011 PPP) (% of population) | -0.29 | 24 | Educational attainment, at least completed upper secondary, population 25+, male (%) (cumulative) | -0.32 | 19 | Physicians (per 1,000 people) | 0.49 | 24 |
|
| |||||||||
| 1 | Exclusive breastfeeding (% of children under 6 months) | 0.45 | 23 | Urban population (% of total) | 0.61 | 26 | Immunization, DPT (% of children ages 12–23 months) | 0.58 | 26 |
| 2 | Immunization, DPT (% of children ages 12–23 months) | 0.45 | 26 | Exclusive breastfeeding (% of children under 6 months) | 0.59 | 23 | Exclusive breastfeeding (% of children under 6 months) | 0.55 | 23 |
| 3 | Unemployment, total (% of total labor force) (modeled ILO estimate) | -0.42 | 26 | Prevalence of underweight, weight for age (% of children under 5) | -0.57 | 26 | Poverty gap at $3.20 a day (2011 PPP) (% of population) | 0.53 (0.41, 0.53) | 26 |
| 4 | Educational attainment, at least completed upper secondary, population 25+, total (%) (cumulative) | 0.42 | 24 | Unemployment, total (% of total labor force) (modeled ILO estimate) | -0.57 | 26 | Poverty headcount ratio at $3.20 a day (2011 PPP) (% of population) | 0.53 | 26 |
| 5 | Urban population (% of total) | 0.42 | 26 | Immunization, DPT (% of children ages 12–23 months) | 0.49 | 26 | Adjusted net enrollment rate, primary (% of primary school age children) | 0.48 | 25 |
Fig 3Mean value (blue and orange dots) and minimum-maximum value range (blue and orange line) of the MSE of the predicted R over the 10 repetitions of the 4-fold cross-validation experiment using the non-imputed dataset.
The blue color corresponds to the 66 linear regression results (one for each indicator) and the orange color to the 5 random forest results (one for each subset of indicators). The green dots indicate the average MSE for R as calculated using the default approach compared to that calculated using study-specific seroprevalence data.
Fig 4Mean value (blue and orange dots) and minimum-maximum value range (blue and orange line) of the MSE of the predicted R over the 10 repetitions of the 4-fold cross-validation experiment using the imputed dataset.
The blue color corresponds to the 66 linear regression results (one for each indicator) and the orange color to the 5 random forest results (one for each subset of indicators). The green dots indicate the average MSE for R as calculated using the default approach compared to that calculated using study-specific seroprevalence data.
Fig 5Performance (as quantified by the MSE in each of the 10 splits to folds) of the default set of parameter values used in the random forest (yellow x) compared to the optimal parameters for the random forest identified through nested optimisation (blue circle).
Blue lines are the MSE minimum to maximum range for the 96 points of the parameters grid (hence this minimum is the optimal result of the non-nested optimization method).
Fig 6Summary of the force of infection estimates used to calculate the basic reproduction number for each study in each region.
The bars reflect the 95% ranges based on bootstrapping.