Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Common pitfalls in statistical analysis: Clinical versus statistical significance.

Literature DB >> 26229754

Common pitfalls in statistical analysis: Clinical versus statistical significance.

Priya Ranganathan¹, C S Pramesh², Marc Buyse³.

Abstract

In clinical research, study results, which are statistically significant are often interpreted as being clinically important. While statistical significance indicates the reliability of the study results, clinical significance reflects its impact on clinical practice. The third article in this series exploring pitfalls in statistical analysis clarifies the importance of differentiating between statistical significance and clinical significance.

Entities: Chemical Disease Species

Keywords: Biostatistics; confidence intervals; data interpretation; statistical

Year: 2015 PMID： 26229754 PMCID： PMC4504060 DOI： 10.4103/2229-3485.159943

Source DB: PubMed Journal: Perspect Clin Res ISSN： 2229-3485

One of the common problems faced by readers (and authors!) of medical articles is in the interpretation of the word “significance.” The term “statistical significance” is often misinterpreted as a “clinically important” result. The confusion stems from the fact that many people equate “significance” with its literal meaning of “importance,” whereas in statistics, it has a far more restrictive connotation. This article explains the idea of the statistical significance and differentiates it from clinical relevance or importance, which is an entirely different concept. In the previous article, in this series, we looked at different ways of expressing statistical significance (“P” values versus confidence intervals).[1] Measures of statistical significance quantify the probability of a study's results being due to chance. Clinical significance, on the other hand, refers to the magnitude of the actual treatment effect (i.e., the difference between the intervention and control groups, also known as the “treatment effect size”), which will determine whether the results of the trial are likely to impact current medical practice. The “P” value, frequently used to measure statistical significance, is the probability that the study results are due to chance rather than to a real treatment effect. The conventional cut off for the “P” value to be considered statistically significant is of 0.05 (or 5%). What a P < 0.05 implies is that the possibility of the results in a study being due to chance is <5%. In clinical practice, the “clinical significance” of a result is dependent on its implications on existing practice-treatment effect size being one of the most important factors that drives treatment decisions. LeFort suggests that the clinical significance should reflect “the extent of change, whether the change makes a real difference to subject lives, how long the effects last, consumer acceptability, cost-effectiveness, and ease of implementation”.[2] While there are established, traditionally accepted values for statistical significance testing, this is lacking for evaluating clinical significance.[3] More often than not, it is the judgment of the clinician (and the patient) which decides whether a result is clinically significant or not. Statistical significance is heavily dependent on the study's sample size; with large sample sizes, even small treatment effects (which are clinically inconsequential) can appear statistically significant; therefore, the reader has to interpret carefully whether this “significance” is clinically meaningful. A study published in the Journal of Clinical Oncology compared overall survival in 569 patients with advanced pancreatic cancer who were randomised to receive erlotinib plus gemcitabine versus gemcitabine alone.[4] Median survival was found to be “significantly” prolonged in the erlotinib/gemcitabine arm (6.24 months vs. 5.91 months, P = 0.038). The P = 0.038 means that there is only a 3.8% chance that this observed difference between the groups occurred by chance (which is less than the traditional cut-off of 5%) and therefore, statistically significant. In this example, the clinical relevance of this “positive” study is the “treatment effect” or difference in median survival between 6.24 and 5.91 months – a mere 10 days, which most oncologists would agree is a clinically irrelevant “improvement” in outcomes, especially when considering the added toxicity and costs involved with the combination. Most journals now endorse the use of the CONSORT statement for reporting of parallel-group randomized trials, which emphasizes the need for reporting of the estimated effect size and its precision (such as 95% confidence interval) for each primary and secondary outcome.[5] Readers should bear in mind that interpretation of study results should take into account the clinical significance by looking at the actual treatment effect (with confidence intervals) and should not just be based on “P” values and statistical significance.

5 in total

1. CONSORT 2010 statement: updated guidelines for reporting parallel group randomized trials.

Authors: Kenneth F Schulz; Douglas G Altman; David Moher
Journal: Ann Intern Med Date: 2010-03-24 Impact factor: 25.391

2. Statistical and clinical significance, and how to use confidence intervals to help interpret both.

Authors: Judith Fethney
Journal: Aust Crit Care Date: 2010-03-29 Impact factor: 2.737

3. The statistical versus clinical significance debate.

Authors: S M LeFort
Journal: Image J Nurs Sch Date: 1993

4. Erlotinib plus gemcitabine compared with gemcitabine alone in patients with advanced pancreatic cancer: a phase III trial of the National Cancer Institute of Canada Clinical Trials Group.

Authors: Malcolm J Moore; David Goldstein; John Hamm; Arie Figer; Joel R Hecht; Steven Gallinger; Heather J Au; Pawel Murawa; David Walde; Robert A Wolff; Daniel Campos; Robert Lim; Keyue Ding; Gary Clark; Theodora Voskoglou-Nomikos; Mieke Ptasynski; Wendy Parulekar
Journal: J Clin Oncol Date: 2007-04-23 Impact factor: 44.544

5. Common pitfalls in statistical analysis: "P" values, statistical significance and confidence intervals.

Authors: Priya Ranganathan; C S Pramesh; Marc Buyse
Journal: Perspect Clin Res Date: 2015 Apr-Jun

5 in total

31 in total

1. Association of Systemic Inflammatory and Anti-inflammatory Responses with Adverse Outcomes in Acute Pancreatitis: Preliminary Results of an Ongoing Study.

Authors: Deepesh Sharma; Aparna Jakkampudi; Ratnakar Reddy; Panyala Balakumar Reddy; Aasish Patil; H V V Murthy; G Venkat Rao; D Nageshwar Reddy; Rupjyoti Talukdar
Journal: Dig Dis Sci Date: 2017-10-27 Impact factor: 3.199

Review 2. Statistical data presentation: a primer for rheumatology researchers.

Authors: Durga Prasanna Misra; Olena Zimba; Armen Yuri Gasparyan
Journal: Rheumatol Int Date: 2020-11-17 Impact factor: 2.631

3. The accuracy of teledentistry in caries detection in children - A diagnostic study.

Authors: Mohammad AlShaya; Deema Farsi; Nada Farsi; Najat Farsi
Journal: Digit Health Date: 2022-06-22

Review 4. Randomized Controlled Trials in Lung, Gastrointestinal, and Breast Cancers: An Overview of Global Research Activity.

Authors: J Connor Wells; Adam Fundytus; Shubham Sharma; Wilma M Hopman; Joseph C Del Paggio; Bishal Gyawali; Deborah Mukherji; Nazik Hammad; C S Pramesh; Ajay Aggarwal; Richard Sullivan; Christopher M Booth
Journal: Curr Oncol Date: 2022-04-07 Impact factor: 3.109

5. Impact of macronutrient supplements on later growth of children born preterm or small for gestational age: A systematic review and meta-analysis of randomised and quasirandomised controlled trials.

Authors: Luling Lin; Emma Amissah; Gregory D Gamble; Caroline A Crowther; Jane E Harding
Journal: PLoS Med Date: 2020-05-26 Impact factor: 11.069

6. Deprivation matters: understanding associations between neighbourhood deprivation, unhealthy food outlets, unhealthy dietary behaviours and child body size using structural equation modelling.

Authors: Victoria Egli; Matthew Hobbs; Jordan Carlson; Niamh Donnellan; Lisa Mackay; Daniel Exeter; Karen Villanueva; Caryn Zinn; Melody Smith
Journal: J Epidemiol Community Health Date: 2020-02-26 Impact factor: 3.710

7. Does the implementation of a care pathway for patients with hip or knee osteoarthritis lead to fewer diagnostic imaging and referrals by general practitioners? A pre-post-implementation study of claims data.

Authors: Esther H A van den Bogaart; Mariëlle E A L Kroese; Marieke D Spreeuwenberg; Ramon P G Ottenheijm; Patrick Deckers; Dirk Ruwaard
Journal: BMC Fam Pract Date: 2019-11-09 Impact factor: 2.497

8. Pediatric Reference Intervals for Transferrin Saturation in the CALIPER Cohort of Healthy Children and Adolescents.

Authors: Victoria Higgins; Man Khun Chan; Khosrow Adeli
Journal: EJIFCC Date: 2017-03-08

9. Custom Focal Trough in Cone-Beam Computed Tomography Reformatted Panoramic Versus Digital Panoramic for Mental Foramen Position to Aid Implant Planning.

Authors: Khaled Beshtawi; Emad Qirresh; Mohamed Parker; Shoayeb Shaik
Journal: J Clin Imaging Sci Date: 2020-06-08

10. BMI trajectory in childhood is associated with asthma incidence at young adulthood mediated by DNA methylation.

Authors: Rutu Rathod; Hongmei Zhang; Wilfried Karmaus; Susan Ewart; Latha Kadalayil; Caroline Relton; Susan Ring; S Hasan Arshad; John W Holloway
Journal: Allergy Asthma Clin Immunol Date: 2021-07-23 Impact factor: 3.406