| Literature DB >> 23071852 |
Fernando Abad-Franch1, Gustavo H Grimmer, Vanessa S de Paula, Luiz T M Figueiredo, Wornei S M Braga, Sérgio L B Luz.
Abstract
BACKGROUND: Arboviral diseases are major global public health threats. Yet, our understanding of infection risk factors is, with a few exceptions, considerably limited. A crucial shortcoming is the widespread use of analytical methods generally not suited for observational data--particularly null hypothesis-testing (NHT) and step-wise regression (SWR). Using Mayaro virus (MAYV) as a case study, here we compare information theory-based multimodel inference (MMI) with conventional analyses for arboviral infection risk factor assessment. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2012 PMID: 23071852 PMCID: PMC3469468 DOI: 10.1371/journal.pntd.0001846
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Mayaro virus seroprevalence in a rural Amazonian settlement: descriptive and bivariate null hypothesis-testing statistics.
| Variable | Levels | Seropositive | Total | % ⊕ | OR (95%CI) | Test (d.f.) | p-value | |
| Yes | No | |||||||
| Overall | - | 119 | 151 | 270 | 44.1 | - | - | - |
| Sex | Female | 63 | 83 | 146 | 43.2 | 1 (reference) | ||
| Male | 56 | 68 | 124 | 45.2 | 1.08 (0.67–1.76) | FET | 0.810 | |
| Age | 1-yr increment | - | - | - | - | 1.004 (0.99–1.02) | LR χ2 = 0.37 (1) | 0.540 |
| Age class | 0–3 | 7 | 12 | 19 | 36.8 | 1 (reference) | ||
| 4–7 | 17 | 20 | 37 | 45.9 | 1.46 (0.48–4.70) | |||
| 8–12 | 21 | 27 | 48 | 43.8 | 1.33 (0.45–4.14) | |||
| 13–17 | 9 | 14 | 23 | 39.1 | 1.10 (0.31–3.95) | |||
| 18–29 | 15 | 23 | 38 | 39.5 | 1.12 (0.36–3.60) | |||
| 30–64 | 43 | 49 | 92 | 46.7 | 1.50 (0.55–4.36) | |||
| 65+ | 7 | 6 | 13 | 53.9 | 2.00 (0.48–8.75) | LR χ2 = 1.79 (6) | 0.938 | |
|
| No | 106 | 115 | 221 | 48.0 | 1 (reference) | ||
| Yes | 13 | 36 | 49 | 26.5 | 0.39 (0.20–0.78) | FET |
| |
|
| Village | 78 | 11 | 89 | 87.6 | 1 (reference) | ||
| Upland | 39 | 102 | 141 | 27.7 | 0.05 (0.02–0.11) | |||
| Riverbanks | 2 | 38 | 40 | 5.0 | 0.01 (0.001–0.03) | LR χ2 = 121.7 (2) |
| |
|
| No | 62 | 51 | 113 | 54.9 | 1 (reference) | ||
| Yes | 57 | 100 | 157 | 36.3 | 0.47 (0.29–0.77) | FET |
| |
|
| No | 52 | 34 | 86 | 60.5 | 1 (reference) | ||
| Yes | 67 | 117 | 184 | 36.4 | 0.37 (0.22–0.63) | FET |
| |
| Cats | No | 95 | 111 | 206 | 46.1 | 1 (reference) | ||
| Yes | 24 | 40 | 64 | 37.5 | 0.70 (0.39–1.25) | FET | 0.251 | |
|
| No | 105 | 93 | 198 | 53.0 | 1 (reference) | ||
| Yes | 14 | 58 | 72 | 19.4 | 0.21 (0.11–0.41) | FET |
| |
|
| Open | 98 | 138 | 236 | 41.5 | 1 (reference) | ||
| Closed | 21 | 13 | 34 | 61.8 | 2.27 (1.09–4.76) | FET |
| |
| Waste disposal | Adequate | 106 | 129 | 235 | 45.1 | 1 (reference) | ||
| Inadequate | 13 | 22 | 35 | 37.1 | 0.72 (0.35–1.50) | FET | 0.466 | |
| Crop-plot | No | 14 | 14 | 28 | 50.0 | 1 (reference) | ||
| Yes | 105 | 137 | 242 | 43.4 | 0.77 (0.33–1.68) | FET | 0.550 | |
⊕: seropositive; OR: unadjusted odds ratio; 95%CI: 95% confidence interval; d.f.: degrees of freedom; FET: Fisher's exact test; LR: likelihood-ratio test.
Variable names and p-values in bold typeface indicate covariates that entered the saturated model used as the starting point in backward step-wise regression; see text for details.
Effect size estimates from the step-wise multivariate logistic regression ‘minimum adequate’ model.
| Term |
| Adjusted OR |
| Intercept | −1.30 (0.32) | - |
| Residence area | ||
| Village | 2.93 (0.38) | 1 (reference) |
| Upland | −0.49 (0.31) | 0.03 (0.01–0.08) |
| Riverbanks | −2.44 (0.51) | 0.005 (0.001–0.02) |
| Regular bednet use | −1.02 (0.28) | 0.36 (0.21–0.62) |
β: slope coefficient; SE: standard error; OR: odds ratio; 95%CI : 95% confidence interval.
Odds ratios estimated as OR = exp(β−β Reference);
With respect to the other two area categories considered together.
Individual-covariate model set.
| Model | AICc | ΔAICc | Likelihood |
|
|
| Bednet | 366.75 | 0 | 1 | 0.497 | 2 |
| Age+Bednet | 368.68 | 1.93 | 0.381 | 0.189 | 3 |
| Sex+Bednet | 368.70 | 1.95 | 0.377 | 0.188 | 3 |
| Age+Sex+Bednet | 370.66 | 3.91 | 0.142 | 0.070 | 4 |
| Null | 372.51 | 5.77 | 0.056 | 0.028 | 1 |
| Age | 374.17 | 7.42 | 0.024 | 0.012 | 2 |
| Sex | 374.43 | 7.69 | 0.021 | 0.011 | 2 |
| Age+Sex | 376.13 | 9.38 | 0.009 | 0.005 | 3 |
AICc: Akaike's Information Criterion corrected for small sample size; ΔAICc: variation in AICc relative to the best-performing model; Likelihood: likelihood of the model, given the data; wi: Akaike weights; K: number of estimable parameters.
Household-covariate model set.
| Model | AICc | ΔAICc | Likelihood |
|
|
| Area+Crop | 249.16 | 0 | 1 | 0.293 | 5 |
| Area+Crop+Toilet/latrine | 249.17 | 0.01 | 0.998 | 0.292 | 6 |
| Area+Crop+Waste | 250.01 | 0.85 | 0.655 | 0.192 | 6 |
| Area+Crop+Toilet/latrine+Waste | 250.06 | 0.90 | 0.640 | 0.187 | 7 |
| Area | 254.85 | 5.68 | 0.058 | 0.017 | 4 |
| Area+Toilet/latrine | 255.47 | 6.31 | 0.043 | 0.013 | 5 |
| Area+Toilet/latrine+Waste | 257.06 | 7.90 | 0.019 | 0.006 | 6 |
| Toilet/latrine | 369.63 | 120.47 | 0.000 | 0.000 | 2 |
| Toilet/latrine+Waste | 371.05 | 121.89 | 0.000 | 0.000 | 3 |
| Crop+Toilet/latrine | 371.52 | 122.36 | 0.000 | 0.000 | 3 |
| Null | 372.51 | 123.35 | 0.000 | 0.000 | 1 |
| Crop+Toilet/latrine+Waste | 373.02 | 123.86 | 0.000 | 0.000 | 4 |
| Area+Waste | 373.75 | 124.59 | 0.000 | 0.000 | 5 |
| Waste | 373.75 | 124.59 | 0.000 | 0.000 | 2 |
| Crop | 374.10 | 124.94 | 0.000 | 0.000 | 2 |
| Crop+Waste | 375.49 | 126.33 | 0.000 | 0.000 | 3 |
AICc: Akaike's Information Criterion corrected for small sample size; ΔAICc: variation in AICc relative to the best-performing model; Likelihood: likelihood of the model, given the data; wi: Akaike weights; K: number of estimable parameters.
Domestic animal-covariate model set.
| Model | AICc | ΔAICc | Likelihood |
|
|
| Pig+Dog | 346.38 | 0 | 1 | 0.371 | 3 |
| Pig+Dog+Fowl | 347.94 | 1.56 | 0.458 | 0.170 | 4 |
| Pig+Dog+Cat | 348.44 | 2.06 | 0.358 | 0.133 | 4 |
| Pig | 348.74 | 2.36 | 0.307 | 0.114 | 2 |
| Pig+Fowl | 349.51 | 3.13 | 0.209 | 0.077 | 3 |
| Pig+Dog+Cat+Fowl | 350.01 | 3.63 | 0.163 | 0.060 | 5 |
| Pig+Cat | 350.55 | 4.18 | 0.124 | 0.046 | 3 |
| Pig+Cat+Fowl | 351.54 | 5.16 | 0.076 | 0.028 | 4 |
| Dog+Fowl | 358.92 | 12.54 | 0.002 | 0.001 | 3 |
| Dog | 360.79 | 14.41 | 0.001 | 0.000 | 2 |
| Dog+Cat+Fowl | 360.98 | 14.60 | 0.001 | 0.000 | 4 |
| Dog+Cat | 362.69 | 16.32 | 0.000 | 0.000 | 3 |
| Fowl | 365.34 | 18.96 | 0.000 | 0.000 | 2 |
| Cat+Fowl | 367.23 | 20.86 | 0.000 | 0.000 | 3 |
| Null | 372.51 | 26.13 | 0.000 | 0.000 | 1 |
| Cat | 373.06 | 26.68 | 0.000 | 0.000 | 2 |
AICc: Akaike's Information Criterion corrected for small sample size; ΔAICc: variation in AICc relative to the best-performing model; Likelihood: likelihood of the model, given the data; wi: Akaike weights; K: number of estimable parameters.
Combined analysis: models with ΔAICc<2 in the 128-model set used for inference (see Table S1 for the complete model set).
| Model | AICc | ΔAICc | Likelihood |
|
|
| Area+Crop+Bednet | 238.92 | 0 | 1 | 0.070 | 6 |
| Area+Crop+Bednet+Toilet/latrine | 239.03 | 0.10 | 0.949 | 0.067 | 7 |
| Area+Crop+Bednet+Pig+Toilet/latrine | 239.16 | 0.24 | 0.889 | 0.062 | 8 |
| Area+Crop+Bednet+Pig | 239.32 | 0.40 | 0.819 | 0.056 | 7 |
| Area+Bednet | 239.55 | 0.62 | 0.732 | 0.051 | 5 |
| Area+Crop+Bednet+Pig+Toilet/latrine+Waste | 239.71 | 0.78 | 0.675 | 0.047 | 9 |
| Area+Pig+Bednet | 239.80 | 0.87 | 0.646 | 0.045 | 6 |
| Area+Pig+Bednet+Toilet/latrine | 239.85 | 0.92 | 0.630 | 0.044 | 7 |
| Area+Bednet+Toilet/latrine | 239.88 | 0.96 | 0.619 | 0.043 | 6 |
| Area+Crop+Bednet+Pig+Waste | 240.05 | 1.13 | 0.569 | 0.040 | 8 |
| Area+Crop+Bednet+Dog+Toilet/latrine | 240.21 | 1.29 | 0.526 | 0.037 | 8 |
| Area+Crop+Bednet+Dog | 240.24 | 1.32 | 0.517 | 0.036 | 7 |
| Area+Crop+Bednet+Waste | 240.26 | 1.33 | 0.513 | 0.036 | 7 |
| Area+Crop+Bednet+Toilet/latrine+Waste | 240.30 | 1.37 | 0.503 | 0.035 | 8 |
| Area+Crop+Bednet+Dog+Pig+Toilet/latrine | 240.75 | 1.83 | 0.401 | 0.028 | 9 |
AICc: Akaike's Information Criterion corrected for small sample size; ΔAICc: variation in AICc relative to the best-performing model; Likelihood: likelihood of the model, given the data; wi: Akaike weights; K: number of estimable parameters.
The ‘minimum adequate’ model selected by step-wise regression analysis.
Figure 1The models in the 128-model set used for inference on risk factors for Mayaro virus infection.
Models were ranked according to variation in the Akaike's information criterion value of each model with respect to the best-performing model in the set (i.e., ranked by ΔAICc). Arrows highlight ΔAICc ‘leaps’ associated with the removal of the two most important covariates, residence area and bednet use. The position and ΔAICc value of the saturated model (Full) and the intercept-only model (Null) are also indicated. Note that the y-axis is in log10 scale.
Model-averaged effect-sizes (β coefficients) from the final 128-model set.
| Factor |
| SE | Lower 95%CI | Upper 95%CI |
| Village | 2.93 | 0.41 | 2.21 | 3.82 |
| Upland | −0.56 | 0.33 | −1.16 | 0.14 |
| Riverbanks | −2.37 | 0.55 | −3.63 | −1.48 |
| Bednet use | −0.95 | 0.28 | −1.53 | −0.44 |
| Crop-plot | 0.39 | 0.22 | −0.04 | 0.84 |
| Toilet/latrine | 0.19 | 0.13 | −0.07 | 0.44 |
| Waste | 0.09 | 0.09 | −0.09 | 0.28 |
| Pig | −0.14 | 0.10 | −0.35 | 0.05 |
| Dog | −0.05 | 0.08 | −0.21 | 0.11 |
SE: standard error; 95%CI: 95% confidence interval.
See main text for the definition of covariates;
With respect to the other two area categories considered together.
Figure 2Effects of covariates on Mayaro virus seropositivity, averaged over the 128 models in the final set.
Covariates describe: residence area (Village: village-like household clusters; Upland: upland areas; Riverbanks: better-preserved riverbank areas); bednet use (Bednet); crop-plot ownership (Crop-plot); use of a closed toilet/latrine (Toilet/latrine); adequate solid waste disposal (Waste); and the keeping/rearing of pigs (Pig) or dogs (Dog). Estimates are presented as odds ratios (solid circles) and 95% confidence intervals. The dotted line at odds ratio = 1 represents no effect; values >1 indicate a positive effect (increased risk of infection), and values <1 a negative (protective) effect.