| Literature DB >> 32998683 |
Abstract
BACKGROUND: Researchers often misinterpret and misrepresent statistical outputs. This abuse has led to a large literature on modification or replacement of testing thresholds and P-values with confidence intervals, Bayes factors, and other devices. Because the core problems appear cognitive rather than statistical, we review some simple methods to aid researchers in interpreting statistical outputs. These methods emphasize logical and information concepts over probability, and thus may be more robust to common misinterpretations than are traditional descriptions.Entities:
Keywords: Bias; Cognitive science; Confidence intervals; Data interpretation; Evidence; Hypothesis tests; Information; Models, statistical; P-values; Statistical significance
Mesh:
Year: 2020 PMID: 32998683 PMCID: PMC7528258 DOI: 10.1186/s12874-020-01105-9
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Comparison of P-value and S-value scales. Top labels: Data compatibility with test model as measured by P-values. Bottom labels: Information against test model as measured by S-values
P-values and binary S-values, with corresponding maximum-likelihood ratios (MLR) and deviance (likelihood-ratio) statistics for a simple test hypothesis H under background assumptions A
| Maximum-likelihood ratio against | Deviance statistic 2ln(MLR) | ||
|---|---|---|---|
| 0.99 | 0.014 | 1.00 | 0.00016 |
| 0.90 | 0.15 | 1.01 | 0.016 |
| 0.50 | 1.00 | 1.26 | 0.45 |
| 0.25 | 2.00 | 1.94 | 1.32 |
| 0.10 | 3.32 | 3.87 | 2.71 |
| 0.05 | 4.32 | 6.83 | 3.84 |
| 0.025 | 5.32 | 12.3 | 5.02 |
| 0.01 | 6.64 | 27.6 | 6.63 |
| 0.005 | 7.64 | 51.4 | 7.88 |
| 0.0001 | 13.3 | 1935 | 15.1 |
| 5 sigmaa (~ 2.9 in 10 million) | 21.7 | 5.2 × 105 | 26.3 |
| 1 in 100 million (GWAS) | 26.6 | 1.4 × 107 | 32.8 |
| 6 sigmaa (~ 1 in a billion) | 29.9 | 1.3 × 108 | 37.4 |
a5 and 6 sigma cutoffs are the upper standard-normal tail probabilities at 5 and 6 standard deviations above the mean [51]
Reanalysis of the Brown et al. HDPS results [34]a
| Test Hypothesis (H) | Maximum-likelihood ratio | Likelihood-ratio statistic | ||
|---|---|---|---|---|
| Halving of hazard, HR = 0.5 | 1.6 × 10− 6 | 19.3 | 1.0 × 105 | 23.1 |
| No association (null), HR = 1 | 0.0505 | 4.31 | 6.77 | 3.82 |
| Point estimate, HR = 1.61 | 1.00 | 0.00 | 1.00 | 0.00 |
| Doubling of hazard, HR = 2 | 0.373 | 1.42 | 1.49 | 0.79 |
| Tripling of hazard, HR = 3 | 0.01 | 6.56 | 26.2 | 6.53 |
| Quintupling of hazard, HR = 5 | 3.3 × 10−6 | 18.2 | 5.0 × 104 | 21.7 |
aComputed from the normal approximations given in the Appendix
P-values, S-values, maximum-likelihood ratios, and likelihood-ratio statistics for several test hypotheses about the hazard ratio (HR) computed from Brown et al. HDPS results [34].
Fig. 2P-Values for a range of hazard ratios (HR). A compatibility graph in which P-values are plotted across alternative hazard ratios. Computed from results in Brown et al. [34]. Compatibility intervals (CI) in percents can be read moving from the right-hand axis to the bottom (HR) axis. HR = 1 represents no association
Fig. 3S-Values (surprisals) for a range of hazard ratios (HR). An information graph in which S-values are plotted across alternative hazard ratios. Computed from results in Brown et al. [34]. HR = 1 represents no association