| Literature DB >> 31357990 |
Christopher C Stanley1,2, Lawrence N Kazembe3, Mavuto Mukaka4,5, Kennedy N Otwombe1,6, Andrea G Buchwald7, Michael G Hudgens8, Don P Mathanga2, Miriam K Laufer9, Tobias F Chirwa1.
Abstract
BACKGROUND: Modelling risk of malaria in longitudinal studies is common, because individuals are at risk for repeated infections over time. Malaria infections result in acquired immunity to clinical malaria disease. Prospective cohorts are an ideal design to relate the historical exposure to infection and development of clinical malaria over time, and analysis methods should consider the longitudinal nature of the data. Models must take into account the acquisition of immunity to disease that increases with each infection and the heterogeneous exposure to bites from infected Anopheles mosquitoes. Methods that fail to capture these important factors in malaria risk will not accurately model risk of malaria infection or disease.Entities:
Keywords: Cohort studies; Longitudinal analysis; Longitudinal studies; Plasmodium falciparum
Mesh:
Year: 2019 PMID: 31357990 PMCID: PMC6664716 DOI: 10.1186/s12936-019-2885-9
Source DB: PubMed Journal: Malar J ISSN: 1475-2875 Impact factor: 2.979
Fig. 1Flow chart for study screening process of articles
Fig. 2Total number of articles from 1996 to 2017 with lowess curve. The number of articles published increased over the 22-year study period, with the highest number found between 2012 and 2016
Frequency of articles using a particular statistical method
| Method | All (n = 197) | 1996–2006 (n = 49) | 2007–2017 (n = 148) |
|---|---|---|---|
| N (%) | N (%) | N (%) | |
| Descriptive statistics only | 17 (8.6) | 5 (10.2) | 12 (8.1) |
| Contingency tables | 102 (51.8) | 32 (65.3) | 70 (47.3) |
| Multiway tables | 50 (25.4) | 16 (32.7) | 34 (23.0) |
| Epidemiologic statistics | 37 (18.8) | 12 (24.5) | 25 (16.9) |
| t-tests | 82 (41.6) | 23 (46.9) | 59 (39.9) |
| Analysis of variance | 18 (9.1) | 1 (2.0) | 17 (11.5) |
| Simple linear regression | 6 (3.1) | 5 (10.2) | 1 (0.7) |
| Non-parametric tests | 48 (24.4) | 11 (22.5) | 37 (25.0) |
| Non-parametric correlation | 21 (10.7) | 3 (6.1) | 18 (12.2) |
| Multiple regression | 71 (36.0) | 15 (30.6) | 56 (37.8) |
| Poisson regression | 49 (24.9) | 8 (16.3) | 41 (27.7) |
| Negative binomial regression | 22 (11.2) | 3 (6.1) | 19 (12.8) |
| Survival analysis | |||
| Kaplan–Meier estimator | 59 (30.0) | 11 (22.5) | 48 (32.4) |
| Log-rank test | 36 (18.3) | 7 (14.3) | 29 (19.6) |
| Cox model | 62 (31.5) | 9 (18.4) | 53 (35.8) |
| Other survival methods | 4 (2.0) | 3 (6.1) | 1 (0.7) |
| Longitudinal analysis | |||
| Generalized estimating equations | 41 (20.8) | 9 (18.4) | 32 (21.6) |
| Mixed-effects regression | 24 (12.2) | 2 (4.1) | 22 (14.9) |
| Recurrent analysis | |||
| Shared frailty model | 2 (1.0) | – | 2 (1.4) |
| Andersen–Gill (AG) model | 4 (2.0) | – | 4 (2.7) |
Odds ratio of using a statistical method during 2007–2017 compared to 1996–2006 adjusted for article sample size
| Method | Odds ratio (OR) | 95% CI |
|---|---|---|
| Descriptive statistics only | 0.81 | 0.26–2.42 |
| Contingency table | 0.49 | 0.27–0.91 |
| Kaplan–Meier estimator | 1.80 | 0.88–3.67 |
| Cox model | 2.57 | 1.21–5.42 |
| Log-rank | 1.60 | 0.66–3.84 |
| Poisson model | 1.78 | 0.85–3.72 |
| GEE | 1.16 | 0.56–2.39 |
| Mixed-effects model | 3.85 | 1.13–6.39 |
CI confidence interval
Fig. 3Time series plot with percentage of articles using a particular statistical method, from 1996 to 2017. The percentage of articles using Cox models, Kaplan–Meier estimators increased over entire period while GEE, and mixed-effects models increased before remaining stable
Categories of statistical methods used to assess the statistical content of articles
| Category | Brief description | Include |
|---|---|---|
| No statistical methods or descriptive statistics only | Describe basic features of data to provide simple measures of summaries | No statistical content, or descriptive statistics only e.g., percentages, means, standard deviations, standard errors, histograms |
| Contingency tables | Cross tabulations used to summarize the relationship between categorical data | Chi-square test, Fisher’s exact test, McNemar’s test |
| Epidemiologic statistics | Measures of association for outcome of interest such as disease and some exposure(s) | Relative risk, risk ratios, rate ratios, risk difference, rate difference, odds ratio, log odds, risk difference, attributable risk fraction, sensitivity, specificity |
| Multiway tables | Extend two-way relationships to include three or more variables | Mantel–Haenszel procedure, log-linear models, logistic regression |
| t-test | Assess mean differences between groups | One-sample, matched-pair, two-sample t-tests |
| Pearson’s correlation | Measures linear correlation between two variables | Classical product-moment correlation |
| Simple linear regression | Regression that summarizes relationships between two continuous variables, an explanatory and a response | Least-squares regression with one predictor and one response variable |
| Multiple regression | Extends the simple regression to include two or more explanatory variables for a response | Polynomial regression and stepwise regression |
| Analysis of variance | Assess within and between group differences in means | Analysis of variance, Analysis of covariance, simple linear contrasts, F-tests |
| Multiple comparisons | Methods for handling multiple inferences on same data sets | Bonferroni techniques, Scheffé’s contrasts, Duncan multiple-range methods, Newman–Keuls procedure |
| Non-parametric test | Tests used when data is not assumed to follow a particular distribution, and are based on ranks of data | Sign test, Wilcoxon signed-rank test, Mann–Whitney test, Kruskal–Wallis test, Friedman test, Kolmogorov–Smirnov test |
| Non-parametric correlation | Measure strength and direction of association between two variables | Spearman’s rho, Kendall’s tau, monotone regression, test for trend |
| Survival analysis | Methods where outcome variable is the time until the occurrence of an event | Actuarial life table, Kaplan–Meier estimator for survival, survival function, Cox model, other parametric survival models, rate adjustment, log-rank test, Breslow’s test |
| Sensitivity analysis | Examines sensitivity of outcome to small changes in parameters of model or in other assumptions | Sample size, multiple outcomes, model distribution assumptions |
| Transformation | Use of data transformation often in regression | Natural logarithm, square, cubic |
| Cluster analysis | Involves dividing a multivariate dataset into “natural” clusters (groups) for in-depth assessment | Hierarchical, K-means, two-step clustering |
| Repeated-measures analysis | Approaches that account for correlation for within-participant observations and non-constant variance in response over time | Generalized estimating equations (GEE), mixed-effects models, repeated measures ANOVA |
| Other | Other methods not specified in above | Receiver-operating characteristic, principal component analysis, power analysis, propensity score |