| Literature DB >> 22125459 |
Shunquan Wu1, Zhichao Jin, Xin Wei, Qingbin Gao, Jian Lu, Xiuqiang Ma, Cheng Wu, Qian He, Meijing Wu, Rui Wang, Jinfang Xu, Jia He.
Abstract
Statistical methods are vital to biomedical research. Our aim was to find out whether progress has been made in the last decade in the use of statistical methods in Chinese medical research. We reviewed 10 leading Chinese medical journals published in 1998 and in 2008. Regarding statistical methods, using a multiple t-test for multiple group comparison was the most common error in the t-test in both years, which significantly decreased in 2008. In contingency tables, no significant level adjustment for multiple comparison significantly decreased in 2008. In ANOVA, over a quarter of articles misused the method of multiple pair-wise comparison in both years, and no significant difference was seen between the two years. In the rank transformation nonparametric test, the error of using multiple pair-wise comparison for multiple group comparison became less common. Many mistakes were found in the randomised controlled trial (56.3% in 1998; 67.9% in 2008), non- randomised clinical trial (57.3%; 58.6%), basic science study (72.9%; 65.5%), case study or case series study (48.4%; 47.2%), and cross-sectional study (57.1%; 44.2%). Progress has been made in the use of statistical methods in Chinese medical journals, but much is yet to be done.Entities:
Keywords: errors; medical journals; misuse; statistical methods
Mesh:
Year: 2011 PMID: 22125459 PMCID: PMC3217588 DOI: 10.1100/2011/139494
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Errors/defects in statistical methods.
| Types of errors | Incorrect use in 1998 | Incorrect use in 2008 |
|
| OR* (1998/2008) | 95% CI** |
|---|---|---|---|---|---|---|
|
| 305 (62.0%) | 253 (44.4%) | 32.83 | <0.001 | 2.04 | (1.60,2.61) |
| (1) Using multiple | 153 (31.1%) | 129 (22.6 | 9.71 | 0.002 | 1.54 | (1.17,2.03) |
| (2) Using paired | 40 (8.1%) | 34 (6.0%) | 1.91 | 0.167 | 1.40 | (0.87,2.24) |
| (3) Using | 89 (18.1%) | 60 (10.5%) | 12.52 | <0.001 | 1.88 | (1.32,2.67) |
| (4) Using | 52 (10.6%) | 33 (5.8%) | 8.19 | 0.004 | 1.92 | (1.22,3.03) |
| (5) Using | 73 (14.8%) | 60 (10.5%) | 4.48 | 0.034 | 1.48 | (1.03,2.13) |
| Others | 28 (5.7%) | 8 (1.4%) | 14.82 | <0.001 | 4.24 | (1.91,9.39) |
|
| ||||||
| Contingency tables | 154 (48.3%) | 169 (32.3%) | 21.35 | <0.001 | 1.96 | (1.47,2.60) |
| (1) No continuity correction or Fisher exact test if needed | 52 (16.3%) | 53 (10.1%) | 6.90 | 0.009 | 1.73 | (1.15,2.61) |
| (2) No significant level adjustment for multiple comparison | 82 (25.7%) | 74 (14.2%) | 17.53 | <0.001 | 2.10 | (1.48,2.98) |
| (3) Misusing Chi-square test for paired fourfold table | 10 (3.1%) | 12 (2.3%) | 0.55 | 0.458 | 1.38 | (0.59,3.23) |
| (4) Using Chi-square test for ranked data | 29 (9.1%) | 31 (5.9%) | 3.00 | 0.083 | 1.59 | (0.94,2.69) |
| (5) Ignorance of stratification factors | 12 (3.8%) | 12 (2.3%) | 1.54 | 0.215 | 1.66 | (0.74,3.75) |
| (6) Using | 8 (2.5%) | 4 (0.8%) | 3.13 | 0.077 | 3.34 | (0.98,11.18) |
| Others | 21 (6.6%) | 19 (3.6%) | 3.81 | 0.051 | 1.87 | (0.99,3.53) |
|
| ||||||
| ANOVA*** | 128 (63.4%) | 263 (59.0%) | 1.12 | 0.289 | 1.20 | (0.85,1.70) |
| (1) Using one-factorial ANOVA to analyse data from multifactorial designs | 10 (5.0%) | 31 (7.0%) | 0.94 | 0.333 | 0.70 | (0.34,1.45) |
| (2) Ignoring the setting of ANOVA for completely random design data | 25 (12.4%) | 53 (11.9%) | 0.03 | 0.858 | 1.05 | (0.63,1.74) |
| (3) No multiple pair-wise comparison of ANOVA when needed | 25 (12.4%) | 28 (6.3%) | 6.89 | 0.009 | 2.11 | (1.20,3.72) |
| (4) Misusing the method of multiple pair-wise comparison of ANOVA | 51 (25.3%) | 132 (29.6%) | 1.30 | 0.255 | 0.80 | (0.55,1.17) |
| (5) Using ANOVA to analyse repeated-measures data | 45 (22.3%) | 63 (14.1%) | 6.65 | 0.010 | 1.74 | (1.14,2.67) |
| Others | 16 (7.9%) | 10 (2.2%) | 11.64 | 0.001 | 3.75 | (1.67,8.42) |
|
| ||||||
| Rank transformation nonparametric test | 29 (43.3%) | 33 (17.7%) | 17.57 | <0.001 | 3.56 | (1.93,6.57) |
| (1) Using multiple pair-wise comparison for multiple group comparison | 14 (20.9%) | 20 (10.7%) | 4.43 | 0.035 | 2.21 | (1.04,4.47) |
| (2) Using wrong type of rank sum test for different study types | 4 (6.0%) | 3 (1.6%) | 2.07 | 0.150 | 3.89 | (0.85,17.88) |
| Others | 20 (29.9%) | 6 (3.2%) | 38.11 | <0.001 | 12.84 | (4.88,33.77) |
*OR: odds ratio; **CI: confidence interval; ***ANOVA: analysis of variance.
Incorrect use of n (%): for each statistical method, n is the number of articles using this statistical methods incorrectly and the percentage = n/the number of papers using this statistical methods × 100%; for each error under certain statistical methods, n is the number of articles with this mistake and the percentage = n/the number of papers using these statistical methods × 100%.
Statistical methods under different study designs.
| Types of study design | 1998 | 2008 | ||
|---|---|---|---|---|
| Articles that used statistical methods | Incorrect use of statistical methods | Articles that used statistical methods | Incorrect use of statistical methods | |
| Systematic review | 0 | 0 | 6 (100.0%) | 0 (0%) |
| RCT* | 64 (98.0%) | 36 (56.3%) | 56 (93.3%) | 38 (67.9%) |
| Non-RCT** | 82 (91.1%) | 47 (57.3%) | 58 (95.1%) | 34 (58.6%) |
| Cohort study | 47 (79.7%) | 28 (59.6%) | 80 (92.0%) | 17 (21.3%) |
| Case-control study | 254 (92.4%) | 148 (58.3%) | 276 (97.2%) | 129 (46.7%) |
| Cross-sectional study | 56 (74.7%) | 32 (57.1%) | 52 (88.1%) | 23 (44.2%) |
| Case study or case series study | 122 (31.9%) | 59 (48.4%) | 233 (48.9%) | 110 (47.2%) |
| Diagnostic test | 47 (74.6%) | 20 (42.6%) | 63 (82.9%) | 25(39.7%) |
| Basic science study | 240 (74.1%) | 175 (72.9%) | 409 (87.4%) | 268 (65.5%) |
| Total | 912 (68.3%) | 545 (59.8%) | 1233 (78.1%) | 644 (52.2%) |
*RCT: randomised controlled trial.
**Non-RCT: nonrandomised clinical trial.
N = total articles (1998:1335; 2008:1578).
All articles n (%): n is the number of articles of each type of study design and percentage = n/N × 100%.
Articles that used statistical methods n (%): n is the number of articles using statistical methods in each type of study design and the percentage = n/the number of articles of each type of study design, incorrect use of statistical methods n (%): n is the number of articles using statistical methods incorrectly and percentage = n/the number of articles using statistical methods in each type of study design.