| Literature DB >> 33865412 |
Agustín Ciapponi1, José M Belizán2,3, Gilda Piaggio4,3, Sanni Yaya5,6.
Abstract
This article challenges the "tyranny of P-value" and promote more valuable and applicable interpretations of the results of research on health care delivery. We provide here solid arguments to retire statistical significance as the unique way to interpret results, after presenting the current state of the debate inside the scientific community. Instead, we promote reporting the much more informative confidence intervals and eventually adding exact P-values. We also provide some clues to integrate statistical and clinical significance by referring to minimal important differences and integrating the effect size of an intervention and the certainty of evidence ideally using the GRADE approach. We have argued against interpreting or reporting results as statistically significant or statistically non-significant. We recommend showing important clinical benefits with their confidence intervals in cases of point estimates compatible with results benefits and even important harms. It seems fair to report the point estimate and the more likely values along with a very clear statement of the implications of extremes of the intervals. We recommend drawing conclusions, considering the multiple factors besides P-values such as certainty of the evidence for each outcome, net benefit, economic considerations and values and preferences. We use several examples and figures to illustrate different scenarios and further suggest a wording to standardize the reporting. Several statistical measures have a role in the scientific communication of studies, but it is time to understand that there is life beyond the statistical significance. There is a great opportunity for improvement towards a more complete interpretation and to a more standardized reporting.Entities:
Year: 2021 PMID: 33865412 PMCID: PMC8052638 DOI: 10.1186/s12978-021-01131-w
Source DB: PubMed Journal: Reprod Health ISSN: 1742-4755 Impact factor: 3.223
Fig. 1Probability density function of the difference between two sample means. The point estimate is the most likely value of the parameter of interest across the CI. A confidence interval (CI) is a range of values used to estimate a population parameter and is associated with a specific confidence. With a CI of 95% confidence, there is a 95% probability that any given CI will contain the true population parameter and a 5% chance that it won’t (two tails of 2.5%)
Fig. 2Interpretation of results for different scenarios according to statistical and clinical thresholds or minimal important difference (MID). Minimal (clinically) important difference (MID). The blue squares indicate the point estimate of the effect of a new treatment compared with a standard treatment, and the blue lines on either side of it the 95% confidence interval
Suggested narrative statements for phrasing conclusions
| Certainty of the evidence | Effect size | Suggested statements for conclusions (replace X with intervention, choose ‘reduce’ or ‘increase’ depending on the direction of the effect, replace ‘outcome’ with name of outcome, include ‘when compared with Y’ when needed) |
|---|---|---|
| High | Large | X results in a large reduction/increase in outcome |
| Moderate | X reduces/increases outcome | |
| X results in a reduction/increase in outcome | ||
| Small important effect | X reduces/increases outcome slightly X results in a slight reduction/increase in outcome | |
| Unimportant or no effect | X results in little to no difference in outcome X does not reduce/increase outcome | |
| Moderate | Large | X likely results in a large reduction/increase in outcome X probably results in a large reduction/increase in outcome |
| Moderate | X likely reduces/increases outcome X probably reduces/increases outcome X likely results in a reduction/increase in outcome X probably results in a reduction/increase in outcome | |
| Small important effect | X probably reduces/increases outcome slightly | |
| X likely reduces/increases outcome slightly | ||
| X probably results in a slight reduction/increase in outcome | ||
| X likely results in a slight reduction/increase in | ||
| Unimportant or no effect | X likely results in little to no difference in outcome X probably results in little to no difference in outcome X likely does not reduce/increase outcome X probably does not reduce/increase outcome | |
| Low | Large | X may result in a large reduction/increase in outcome |
| The evidence suggests X results in a large reduction/increase in outcome | ||
| Moderate | X may reduce/increase outcome The evidence suggests X reduces/increases outcome X may result in a reduction/increase in outcome The evidence suggests X results in a reduction/increase in outcome | |
| Small important effect | X may reduce/increase outcome slightly The evidence suggests X reduces/increases outcome slightly X may result in a slight reduction/increase in outcome The evidence suggests X results in a slight reduction/increase in outcome | |
| Unimportant or no effect | X may result in little to no difference in outcome The evidence suggests that X results in little to no difference in outcome X may not reduce/increase outcome The evidence suggests that X does not reduce/increase outcome | |
| Very low | Any effect | The evidence is very uncertain about the effect of X on outcome X may reduce/increase/have little to no effect on outcome but the evidence is very uncertain |
1. Include confidence intervals (and exact P-values when relevant) but do not report results as being statistically significant or non-significant 2. Refer to a minimal important difference, as soundly as possible, to establish clinical or practical significance 3. Present the effect estimates for each outcome together with the certainty of the evidence of the effect (high, moderate, low, and very low) 4. To draw conclusions, consider the multiple factors besides P-values such as certainty of evidence, net benefit, economic considerations and values and preferences 5. Present the results consistently, using similar words and expressions, such as those suggested in Table |