| Literature DB >> 32195200 |
Amrita Sil1, Jayadev Betkerur2, Nilay Kanti Das3.
Abstract
Biomedical research relies on proving (or disproving) a research hypothesis, and P value becomes a cornerstone of "null hypothesis significance testing." P value is the maximum probability of getting the observed outcome by chance. For a statistical test to achieve significance, the error by chance must be less than 5%. The pros are the P value that gives the strength of evidence against the null hypothesis. We can reject a null hypothesis depending on a small P value. However, the value of P is a function of sample size. When the sample size is large, the P value is destined to be small or "significant." P value is condemned by one school of thought who claims that focusing more on P value undermines the generalizability and reproducibility of research. For such a situation, presently, the scientific world is inclined in knowing the effect size, confidence interval, and the descriptive statistics; thus, researchers need to highlight them along with the P value. In spite of all the criticism, it needs to be understood that P value carries paramount importance in "precise" understanding of the estimation of the difference calculated by "null hypothesis significance testing." Choosing the correct test for assessing the significance of the difference is profoundly important. The choice can be arrived by asking oneself three questions, namely, the type of data, whether the data is paired or not, and on the number of study groups (two or more). It is worth mentioning that association between variables, agreement between assessments, time-trend cannot be arrived by calculating the P value alone but needs to highlight the correlation and regression coefficients, odds ratio, relative risk, etc. Copyright:Entities:
Keywords: Confidence interval; P value; hypothesis testing; non-parametric data; null hypothesis; null hypothesis significance testing; parametric data
Year: 2019 PMID: 32195200 PMCID: PMC6859766 DOI: 10.4103/idoj.IDOJ_368_19
Source DB: PubMed Journal: Indian Dermatol Online J ISSN: 2229-5178
2 × 2 table showing schematically the concept of errors
| Researcher’s decision | ||
|---|---|---|
| Fail to reject null hypothesis | Reject null hypothesis | |
| Reality | ||
| Null is true | Correct decision | Type I error (α) |
| Null is false | Type II error (β) | Correct decision, Power (1-β) |
The concept of P
| Parameter | Group A | Group B | |
|---|---|---|---|
| Age | |||
| Mean±Standard deviation | 31.06±13.98 | 33.02±12.05 | 0.442 |
| Urticaria activity score (UAS) | |||
| Mean±Standard deviation | 4.81±3.63 | 6.92±4.05 | 0.009 |
Figure 1Graphical representation of 95% confidence interval of mean in a normally distributed (bell-shaped curve) population. The white both ways arrow area represents the 95% CI. The ends of the interval are the “confidence limits”
Figure 2Concept of P-value with 95% confidence interval
Explanation of Figure 2
| Serial no | Scenario | Explanation | Interpretation |
|---|---|---|---|
| Scenario 1 | Not statistically significant, | Rejected | |
| Scenario 2 | Statistically significant, Clinically relevant, precise | Clinically relevant difference → can be accepted | |
| Scenario 3 | Statistically significant, not clinically relevant, precise | Observed change not clinically relevant though | |
| Scenario 4 | Not statistically significant, clinically relevant, imprecise | The |
Figure 3Null hypothesis significance tests to be used while testing the difference between groups of numerical parametric data
Figure 5Null hypothesis significance tests to be used while testing the difference between groups of categorical data