| Literature DB >> 29354483 |
Thomas F Heston1, Jackson M King2.
Abstract
A statistically significant research finding should not be defined as a P-value of 0.05 or less, because this definition does not take into account study power. Statistical significance was originally defined by Fisher RA as a P-value of 0.05 or less. According to Fisher, any finding that is likely to occur by random variation no more than 1 in 20 times is considered significant. Neyman J and Pearson ES subsequently argued that Fisher's definition was incomplete. They proposed that statistical significance could only be determined by analyzing the chance of incorrectly considering a study finding was significant (a Type I error) or incorrectly considering a study finding was insignificant (a Type II error). Their definition of statistical significance is also incomplete because the error rates are considered separately, not together. A better definition of statistical significance is the positive predictive value of a P-value, which is equal to the power divided by the sum of power and the P-value. This definition is more complete and relevant than Fisher's or Neyman-Peason's definitions, because it takes into account both concepts of statistical significance. Using this definition, a statistically significant finding requires a P-value of 0.05 or less when the power is at least 95%, and a P-value of 0.032 or less when the power is 60%. To achieve statistical significance, P-values must be adjusted downward as the study power decreases.Entities:
Keywords: Biostatistics; Clinical significance; Positive predictive value; Power; Statistical significance
Year: 2017 PMID: 29354483 PMCID: PMC5746664 DOI: 10.5662/wjm.v7.i4.112
Source DB: PubMed Journal: World J Methodol ISSN: 2222-0682
Figure 1According to the classical definition, research findings are considered statistically significant when the difference observed falls in the upper or lower tails of the frequency distribution, represented above in black.
Figure 2If the observed difference is greater than x, then we consider that the finding is statistically significant and the null hypothesis is rejected. If the difference found is less than x, then we accept the null hypothesis and reject the alternative hypothesis. The area in black represents a Type I error which occurs when the difference is greater than x, but the null hypothesis is in fact true. The lined area represents a Type II error which occurs when the difference found is less than x, but the alternative hypothesis is in fact true.
Statistically significant research findings can represent a true positive or false positive
| Study findings | Alternative hypothesis true | Null hypothesis true | |
| Significant | True positive | False positive | |
| Insignificant | False negative | True negative | |
Similarly, statistically insignificant findings may represent a true or false negative.
When the P-value is utilized to determine whether or not a finding is statistically significant, 1-beta represents the sensitivity for identifying the alternative hypothesis, and 1-alpha represents the specificity
| Study findings | Alternative hypothesis true | Null hypothesis true | |
| Significant | 1 - beta (power) | Alpha (exact | |
| Insignificant | Beta | 1 - alpha | |
A Type I error corresponds to 1-specificity and a Type II error corresponds to 1-sensitivity when study findings are determined to be significant or insignificant based upon the P-value
| Study findings | Alternative hypothesis true | Null hypothesis true | |
| Significant | Correct | Type I error | |
| Insignificant | Type II error | Correct | |
This 2 × 2 contingency table shows the corresponding values for a research study where a study finding is determined to be significant based upon a P-value of 0.05 and when the study’s power is 80%
| Study findings | Alternative hypothesis true | Null hypothesis true | |
| Significant | 0.8 | 0.05 | |
| Insignificant | 0.2 | 0.95 | |
P-values corrected for study power
| 0.95 | 0.05 |
| 0.9 | 0.047 |
| 0.85 | 0.045 |
| 0.8 | 0.042 |
| 0.75 | 0.039 |
| 0.7 | 0.037 |
| 0.65 | 0.034 |
| 0.6 | 0.032 |