| Literature DB >> 29900177 |
Pentti Nieminen1, Tuula Toljamo2, Hannu Vähänikkilä1.
Abstract
Data analysis methods play an important role in respiratory research. We evaluated the application and complexity of data analytical methods in high-impact respiratory journals and compared the statistical reporting in these respiratory articles with reports published in other eminent medical journals. This study involved a total of 160 papers published in 2015 in the European Respiratory Journal, American Journal of Respiratory and Critical Care Medicine, Chest and Thorax, and 680 papers published between 2007-2015 in other medical journals including the Lancet and New England Journal of Medicine. We manually reviewed the articles to determine the way in which they reported the methods applied in data analysis. The statistical intensity in the respiratory journals was equal to that in eminent medical journals. Traditional ways of testing statistical significance were widely used in respiratory articles. Statistical procedures were not always described in sufficient detail, and the prominent respiratory journals did not display different profiles with respect to their statistical content. Readers of the prominent respiratory journals need to possess a substantial level of statistical expertise if they wish to critically evaluate the design, methodology, data analysis and interpretation of the findings published in these journals.Entities:
Year: 2018 PMID: 29900177 PMCID: PMC5994482 DOI: 10.1183/23120541.00140-2017
Source DB: PubMed Journal: ERJ Open Res ISSN: 2312-0541
Basic statistical methods used in medical research by research goal and type of outcome variable
| Mean± | Median, interquartile range | Proportion | Kaplan–Meier curve | |
| Comparing means with independent samples t-test | Comparing medians with Mann–Whitney test | Cross-tabulation with Chi-squared test | Kaplan–Meier curves and log-rank test | |
| Comparing means with one-way ANOVA | Comparing medians with Kruskal–Wallis test | Cross-tabulation with Chi-squared test | Kaplan–Meier curves and log-rank test | |
| t-test for repeated measurements | Wilcoxon test | McNemar's test | ||
| Repeated-measures ANOVA | Friedman test | |||
| Pearson correlation | Spearman correlation | Cross-tabulation with Chi-squared test, risk ratio or odds ratio statistics | ||
| Multivariable linear regression | Negative binomial regression | Logistic regression | Cox proportional hazard regression | |
Reproduced and modified from [10].
Respiratory and other medical journals surveyed for use of statistical techniques
|
| 13.204 | 2015 | 40 |
|
| 6.044 | 2015 | 40 |
|
| 10.569 | 2015 | 40 |
|
| 8.272 | 2015 | 40 |
|
| 72.406 | 2007, 2008, 2010, 2017 | 160 |
|
| 47.831 | 2008, 2009, 2011, 2017 | 160 |
|
| 14.176 | 2011, 2016 | 80 |
|
| 30.410 | 2010 | 40 |
|
| 2.431 | 2009 | 40 |
|
| 7.738 | 2011 | 40 |
|
| 4.755 | 2010 | 40 |
|
| 3.456 | 2010 | 40 |
|
| 1.693 | 2013 | 40 |
|
| 1.339 | 2015 | 40 |
Distribution of study design, sample size and main outcome by the evaluated journals
| 40 | 40 | 40 | 40 | 320 | 360 | 840 | |
|
Cross-sectional survey | 14 (35.0) | 10 (25.0) | 10 (25.0) | 12 (30.0) | 43 (13.4) | 107 (29.7) | 196 (23.3) |
|
Longitudinal cohort study | 14 (35.0) | 10 (25.0) | 11 (27.5) | 10 (25.0) | 49 (15.3) | 62 (17.2) | 156 (18.6) |
|
Case–control | 4 (10.0) | 7 (17.5) | 3 (7.5) | 3 (7.5) | 7 (2.2) | 18 (5.0) | 42 (5.0) |
|
Interventional study (clinical trial) | 3 (7.5) | 7 (17.5) | 6 (15.0) | 11 (27.5) | 173 (54.1) | 60 (16.7) | 260 (30.9) |
|
Laboratory work | 4 (10.0) | 5 (12.5) | 1 (2.5) | 2 (5.0) | 7 (2.2) | 69 (19.2) | 88 (10.5) |
|
Meta-analysis | 0 | 1 (2.5) | 5 (12.5) | 1 (2.5) | 15 (4.7) | 18 (5.0) | 40 (4.8) |
|
Other | 1 (2.5) | 0 | 4 (10.0) | 1 (2.5) | 26 (8.1) | 26 (7.2) | 58 (6.9) |
|
<30 | 4 (10.0) | 8 (20.0) | 7 (17.5) | 5 (12.5) | 34 (10.6) | 65 (18.1) | 120 (14.3) |
|
30–99 | 9 (22.5) | 9 (22.5) | 7 (17.5) | 9 (22.5) | 27 (8.4) | 62 (17.2) | 123 (14.6) |
|
100–300 | 12 (30.0) | 7 (17.5) | 9 (22.5) | 7 (17.5) | 39 (12.2) | 45 (12.5) | 117 (13.9) |
|
>300 | 15 (37.5) | 16 (40.0) | 17 (42.5) | 19 (47.5) | 206 (64.4) | 145 (40.3) | 423 (50.4) |
|
Missing | 0 | 0 | 0 | 0 | 14 (4.4) | 43 (11.9) | 57 (6.8) |
|
Not significant | 6 (15.0) | 4 (10.0) | 7 (17.5) | 5 (12.5) | 72 (22.5) | 41 (11.4) | 135 (16.1) |
|
Significant | 30 (75.0) | 28 (70.0) | 21 (52.5) | 27 (67.5) | 162 (50.6) | 218 (60.6) | 486 (57.8) |
|
Not evaluated | 4 (10.0) | 8 (20.0) | 12 (30.0) | 8 (20.0) | 86 (26.9) | 101 (28.0) | 219 (26.1) |
Data are presented as number of articles (%), unless stated otherwise. AJRCCM: American Journal of Respiratory and Critical Care Medicine; ERJ: European Respiratory Journal; NEJM: New England Journal of Medicine.
Statistical methods and reporting in the evaluated journals
| 160 | 320 | 360 | 840 | ||
| 100 (62.5) | 219 (68.4) | 221 (61.4) | 540 (64.3) | 0.140 | |
| 38 (23.8) | 144 (45.0) | 104 (28.9) | 286 (34.0) | <0.001 | |
| 126 (78.8) | 191 (59.7) | 215 (59.7) | 532 (63.3) | <0.001 | |
| 111 (69.4) | 221 (69.1) | 199 (55.3) | 531 (63.2) | <0.001 | |
| 18 (11.3) | 35 (10.9) | 36 (10.0) | 89 (10.6) | 0.884 | |
| 38 (23.8) | 25 (7.8) | 75 (20.8) | 138 (16.4) | <0.001 | |
| 96 (60.0) | 205 (64.1) | 165 (45.8) | 466 (55.5) | <0.001 | |
| 9 (5.6) | 13 (4.1) | 35 (9.7) | 57 (6.8) | 0.010 | |
| 24 (15.0) | 58 (18.1) | 57 (15.8) | 139 (16.5) | 0.614 | |
| 27 (16.9) | 31 (9.7) | 69 (19.2) | 127 (15.1) | 0.002 | |
| 1 (0.6) | 5 (1.6) | 4 (1.1) | 10 (1.2) | 0.712 | |
| 0 | 2 (0.6) | 0 | 2 (0.2) | 0.181 |
Data are presented as n (%), unless stated otherwise. NEJM: New England Journal of Medicine; ANN: artificial neural networks.
FIGURE 1Intensity of statistical methods and reporting subdivided by the journal groups. NEJM: New England Journal of Medicine. The horizontal line in the middle of the box is the median value of the scores and the lower and upper boundaries indicate the 25th and 75th percentiles, respectively. The box plot also displays outliers, articles with values more than 1.5 box-lengths from the lower edge of the box are designated with a circle. The largest and smallest observed values that are not outliers are also shown; lines (whiskers) are drawn from the ends of the box to those values.