| Literature DB >> 28591190 |
Matthew J Hayat1, Amanda Powell2, Tessa Johnson1,3, Betsy L Cadwell3.
Abstract
Statistical literacy and knowledge is needed to read and understand the public health literature. The purpose of this study was to quantify basic and advanced statistical methods used in public health research. We randomly sampled 216 published articles from seven top tier general public health journals. Studies were reviewed by two readers and a standardized data collection form completed for each article. Data were analyzed with descriptive statistics and frequency distributions. Results were summarized for statistical methods used in the literature, including descriptive and inferential statistics, modeling, advanced statistical techniques, and statistical software used. Approximately 81.9% of articles reported an observational study design and 93.1% of articles were substantively focused. Descriptive statistics in table or graphical form were reported in more than 95% of the articles, and statistical inference reported in more than 76% of the studies reviewed. These results reveal the types of statistical methods currently used in the public health literature. Although this study did not obtain information on what should be taught, information on statistical methods being used is useful for curriculum development in graduate health sciences education, as well as making informed decisions about continuing education for public health professionals.Entities:
Mesh:
Year: 2017 PMID: 28591190 PMCID: PMC5462407 DOI: 10.1371/journal.pone.0179032
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Public health journals reviewed and number of articles from 2013 included in the study sample.
| Journal | Sections included | 5-year Impact Factor (as cited in 2013) | Eligible Articles | Study Sample | |
|---|---|---|---|---|---|
| Proportion by Journal | Sampled Articles | ||||
| All substantive research articles | 6.894 | 308 | 30.1 | 64 | |
| 6.067 | 296 | 29.0 | 62 | ||
| 4.997 | 141 | 13.8 | 30 | ||
| 5.086 | 79 | 7.7 | 16 | ||
| All substantive research articles | 4.245 | 75 | 7.3 | 16 | |
| 5.092 | 62 | 6.1 | 14 | ||
| All substantive research articles | 8.000 | 62 | 6.1 | 14 | |
Frequency of reported types of studies and use of descriptive and inferential statistics (n = 216).
| Count | Percent | |
|---|---|---|
| Article type (select one) | ||
| Methodological | 8 | 3.7 |
| Substantive | 201 | 93.1 |
| Other | 7 | 3.2 |
| Study Design (select one) | ||
| Observational | 177 | 81.9 |
| Experimental | 12 | 5.6 |
| Systematic Review or Meta-Analysis | 17 | 7.9 |
| Other | 10 | 4.6 |
| Analytic Framework | ||
| Primary Data Collection | 38 | 17.6 |
| Summary Statistics | ||
| Descriptive Statistics | 198 | 91.7 |
| Tables | 206 | 95.4 |
| Charts/Figures/Graphs | 133 | 61.6 |
| Epidemiological Statistics | ||
| Prevalence | 37 | 17.1 |
| Relative Risk | 27 | 12.5 |
| Odds Ratio | 88 | 40.7 |
| Incidence | 19 | 8.8 |
| Mortality | 25 | 11.6 |
| Hazard Ratio | 32 | 14.8 |
| Other | ||
| Statistical Inference | ||
| p-values | 156 | 72.2 |
| Confidence Intervals | 165 | 76.4 |
| *,**,*** p-value reporting used | 39 | 18.1 |
| Multiple Testing | 11 | 5.1 |
| Statistical Tests | ||
| t-test | 28 | 13.0 |
| Pearson’s Chi-square or Fisher’s Exact | 56 | 25.9 |
| Kaplan Meir | 5 | 2.3 |
| Log-rank | 2 | 1.0 |
| Nonparametric (any) | 17 | 7.9 |
| Correlation | 34 | 15.7 |
| Other |
1Other article types included program evaluation, evaluation assessment, and a study design overview
2Other study designs reported included qualitative studies
3Other epidemiological statistics with more than one reported use included sensitivity, specificity, positive predictive value, negative predictive value
4Other statistical tests with more than one reported use included the bootstrap, Cronbach’s alpha, Cochran’s Q, Hardy-Weinberg equilibrium test
Frequency of use reported for statistical modeling and related advanced statistical techniques (n = 216).
| Count | Percent | |
|---|---|---|
| Statistical Modeling, Independent | ||
| Analysis of Variance (ANOVA) | 21 | 9.7 |
| Linear Regression | 42 | 19.4 |
| Logistic Regression | 83 | 38.4 |
| Poisson Regression | 16 | 7.4 |
| Cox Proportional Hazards Regression | 33 | 15.3 |
| Nonlinear Regression | 1 | <1 |
| Other | 22 | 10.2 |
| Statistical Modeling, Dependent | ||
| General Linear Mixed Model | 15 | 6.9 |
| Generalized Linear Mixed Model | 22 | 10.2 |
| Marginal Models (Generalized Estimating Equations/GEE) | 16 | 7.4 |
| Other | 5 | 2.3 |
| Complex Statistical Modeling Techniques | ||
| Structural Equation Modeling | 2 | 1.0 |
| Mixture Model | 1 | <1 |
| Latent Class Model | 3 | 1.4 |
| PCA/Factor Analysis | 9 | 4.2 |
| Other | 4 | 1.9 |
| Missing Data | ||
| Casewise Deletion | 66 | 30.6 |
| Mean (Single) Imputation | 10 | 4.6 |
| Multiple Imputation | 12 | 5.6 |
| Other | 9 | 4.2 |
1Other statistical models with more than one reported use included difference in difference, loglinear, ordinal logistic, multinomial logistic, and negative binomial regression
2No methods other than those listed were reported more than once
Frequency of use reported for statistical software packages (n = 216).
| Count | Percent | |
|---|---|---|
| Statistical Software | ||
| SAS | 75 | 34.7 |
| SUDAAN | 9 | 4.2 |
| SPSS | 18 | 8.3 |
| STATA | 54 | 25.0 |
| R | 18 | 8.3 |
| Other | 32 | 14.8 |
1Other packages with more than one reported use included ARCGIS, HLM, IVEWARE, MPlus, BUGS