| Literature DB >> 20482841 |
Luis Carlos Silva-Ayçaguer1, Patricio Suárez-Gil, Ana Fernández-Somoano.
Abstract
BACKGROUND: The null hypothesis significance test (NHST) is the most frequently used statistical method, although its inferential validity has been widely criticized since its introduction. In 1988, the International Committee of Medical Journal Editors (ICMJE) warned against sole reliance on NHST to substantiate study conclusions and suggested supplementary use of confidence intervals (CI). Our objective was to evaluate the extent and quality in the use of NHST and CI, both in English and Spanish language biomedical publications between 1995 and 2006, taking into account the International Committee of Medical Journal Editors recommendations, with particular focus on the accuracy of the interpretation of statistical significance and the validity of conclusions.Entities:
Mesh:
Year: 2010 PMID: 20482841 PMCID: PMC2886084 DOI: 10.1186/1471-2288-10-44
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Sizes of the populations (and the samples) for selected journals and periods.
| Clinical | General Medicine | Public Health and Epidemiology | |||||
|---|---|---|---|---|---|---|---|
| 1995-1996 | 623 (62) | 125 (60) | 346 (62) | 238 (61) | 315 (60) | 169 (60) | 1816 (365) |
| 2000-2001 | 600 (60) | 146 (60) | 519 (62) | 196 (61) | 286 (60) | 145 (61) | 1892 (364) |
| 2005-2006 | 537 (59) | 144 (59) | 474 (62) | 158 (62) | 212 (61) | 167 (60) | 1692 (363) |
| Total | 1760 (181) | 415 (179) | 1339 (186) | 592 (184) | 813 (181) | 481 (181) | 5400 (1092) |
G&O: Obstetrics & Gynecology; REC: Revista Española de Cardiología; BMJ: British Medical Journal; MC: Medicina Clínica; IJE: International Journal of Epidemiology; AP: Atención Primaria.
Figure 1Flow chart of the selection process for eligible papers.
Prevalence of NHST and CI across periods, languages and research areas.
| Total of papers | P-values and no CI | CI and P-values | CI and no P-values | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| n | % (95%CI) | n | % (95%CI) | n | % (95%CI) | |||||
| Period | 1995-1996 | 285 | 119 | 41 (35 to 47) | 138 | 49 (43 to 55) | 28 | 10 (6 to13) | ||
| 2000-2001 | 278 | 101 | 38 (31 to 44) | 150 | 51 (44 to 58) | 27 | 11 (6 to 15) | |||
| 2005-2006 | 306 | 65 | 21 (16 to 26) | 198 | 65 (59 to 71) | 43 | 14 (9 to 17) | |||
| Language | Spanish | 396 | 156 | 39 (34 to 43) | 211 | 54 (49 to 59) | 29 | 7 (5 to 10) | ||
| English | 473 | 129 | 32 (28 to 36) | 275 | 55 (51 to 60) | 69 | 12 (10 to 15) | |||
| Area | Clinical | 300 | 166 | 52 (45 to 58) | 125 | 45 (39 to 51) | 9 | 3 (1 to 6) | ||
| General Medicine | 278 | 69 | 22 (17 to 27) | 170 | 61 (55 to 67) | 39 | 17 (12 to 22) | |||
| Public Health and Epidemiology | 291 | 50 | 18 (13 to 23) | 191 | 65 (59 to 71) | 50 | 17 (13 to 22) | |||
CI: Confidence Interval
Frequency of occurrence of the significance fallacy across periods, languages and research areas.
| Criteria | Categories | Number of papers | Frequency of occurrence of the | % |
|---|---|---|---|---|
| Period | 1995-1996 | 285 | 224 | 80 (75 to 85) |
| 2000-2001 | 278 | 210 | 78 (72 to 83) | |
| 2005-2006 | 306 | 216 | 70 (64 to 75) | |
| Language | Spanish | 396 | 295 | 73 (69 to 78) |
| English | 473 | 355 | 76 (73 to 80) | |
| Area | Clinical | 300 | 248 | 81(76 to 86) |
| General Medicine | 278 | 200 | 72 (66 to 77) | |
| Public | 291 | 202 | 71 (66 to 76) | |
CI: Confidence Interval
Frequency of use of numerical results in conclusions across periods, languages and research areas.
| Criteria | Categories | Number of papers | Frequency of use of numerical results | % |
|---|---|---|---|---|
| Period | 1995-1996 | 285 | 44 | 15 (10 to 19) |
| 2000-2001 | 278 | 48 | 15 (10 to 20) | |
| 2005-2006 | 306 | 45 | 12,1 (8 to 16) | |
| Language | Spanish | 396 | 85 | 21 (17 to 25) |
| English | 473 | 52 | 12 (9 to 15) | |
| Area | Clinical | 300 | 58 | 16 (12 to 21) |
| General Medicine | 278 | 39 | 13 (9 to 17) | |
| Public Health and Epidemiology | 291 | 40 | 12 (8 to 15) | |
CI: Confidence Interval
Frequency of presence of the term Significance (or statistical significance) in conclusions across periods, languages and research areas.
| Criteria | Categories | Number of papers | Frequency of presence of significance | % |
|---|---|---|---|---|
| Period | 1995-1996 | 285 | 35 | 14 (9 to 19) |
| 2000-2001 | 278 | 32 | 12 (8 to 16) | |
| 2005-2006 | 306 | 41 | 14 (9 to 19) | |
| Language | Spanish | 396 | 39 | 10 (7 to 13) |
| English | 473 | 69 | 15 (11 to 18) | |
| Area | Clinical | 300 | 44 | 16 (11 to 20) |
| General Medicine | 278 | 30 | 11 (7 to 15) | |
| Public Health and Epidemiology | 291 | 34 | 12 (8 to 16) | |
CI: Confidence Interval