Literature DB >> 26229754

Common pitfalls in statistical analysis: Clinical versus statistical significance.

Priya Ranganathan1, C S Pramesh2, Marc Buyse3.   

Abstract

In clinical research, study results, which are statistically significant are often interpreted as being clinically important. While statistical significance indicates the reliability of the study results, clinical significance reflects its impact on clinical practice. The third article in this series exploring pitfalls in statistical analysis clarifies the importance of differentiating between statistical significance and clinical significance.

Entities:  

Keywords:  Biostatistics; confidence intervals; data interpretation; statistical

Year:  2015        PMID: 26229754      PMCID: PMC4504060          DOI: 10.4103/2229-3485.159943

Source DB:  PubMed          Journal:  Perspect Clin Res        ISSN: 2229-3485


One of the common problems faced by readers (and authors!) of medical articles is in the interpretation of the word “significance.” The term “statistical significance” is often misinterpreted as a “clinically important” result. The confusion stems from the fact that many people equate “significance” with its literal meaning of “importance,” whereas in statistics, it has a far more restrictive connotation. This article explains the idea of the statistical significance and differentiates it from clinical relevance or importance, which is an entirely different concept. In the previous article, in this series, we looked at different ways of expressing statistical significance (“P” values versus confidence intervals).[1] Measures of statistical significance quantify the probability of a study's results being due to chance. Clinical significance, on the other hand, refers to the magnitude of the actual treatment effect (i.e., the difference between the intervention and control groups, also known as the “treatment effect size”), which will determine whether the results of the trial are likely to impact current medical practice. The “P” value, frequently used to measure statistical significance, is the probability that the study results are due to chance rather than to a real treatment effect. The conventional cut off for the “P” value to be considered statistically significant is of 0.05 (or 5%). What a P < 0.05 implies is that the possibility of the results in a study being due to chance is <5%. In clinical practice, the “clinical significance” of a result is dependent on its implications on existing practice-treatment effect size being one of the most important factors that drives treatment decisions. LeFort suggests that the clinical significance should reflect “the extent of change, whether the change makes a real difference to subject lives, how long the effects last, consumer acceptability, cost-effectiveness, and ease of implementation”.[2] While there are established, traditionally accepted values for statistical significance testing, this is lacking for evaluating clinical significance.[3] More often than not, it is the judgment of the clinician (and the patient) which decides whether a result is clinically significant or not. Statistical significance is heavily dependent on the study's sample size; with large sample sizes, even small treatment effects (which are clinically inconsequential) can appear statistically significant; therefore, the reader has to interpret carefully whether this “significance” is clinically meaningful. A study published in the Journal of Clinical Oncology compared overall survival in 569 patients with advanced pancreatic cancer who were randomised to receive erlotinib plus gemcitabine versus gemcitabine alone.[4] Median survival was found to be “significantly” prolonged in the erlotinib/gemcitabine arm (6.24 months vs. 5.91 months, P = 0.038). The P = 0.038 means that there is only a 3.8% chance that this observed difference between the groups occurred by chance (which is less than the traditional cut-off of 5%) and therefore, statistically significant. In this example, the clinical relevance of this “positive” study is the “treatment effect” or difference in median survival between 6.24 and 5.91 months – a mere 10 days, which most oncologists would agree is a clinically irrelevant “improvement” in outcomes, especially when considering the added toxicity and costs involved with the combination. Most journals now endorse the use of the CONSORT statement for reporting of parallel-group randomized trials, which emphasizes the need for reporting of the estimated effect size and its precision (such as 95% confidence interval) for each primary and secondary outcome.[5] Readers should bear in mind that interpretation of study results should take into account the clinical significance by looking at the actual treatment effect (with confidence intervals) and should not just be based on “P” values and statistical significance.
  5 in total

1.  CONSORT 2010 statement: updated guidelines for reporting parallel group randomized trials.

Authors:  Kenneth F Schulz; Douglas G Altman; David Moher
Journal:  Ann Intern Med       Date:  2010-03-24       Impact factor: 25.391

2.  Statistical and clinical significance, and how to use confidence intervals to help interpret both.

Authors:  Judith Fethney
Journal:  Aust Crit Care       Date:  2010-03-29       Impact factor: 2.737

3.  The statistical versus clinical significance debate.

Authors:  S M LeFort
Journal:  Image J Nurs Sch       Date:  1993

4.  Erlotinib plus gemcitabine compared with gemcitabine alone in patients with advanced pancreatic cancer: a phase III trial of the National Cancer Institute of Canada Clinical Trials Group.

Authors:  Malcolm J Moore; David Goldstein; John Hamm; Arie Figer; Joel R Hecht; Steven Gallinger; Heather J Au; Pawel Murawa; David Walde; Robert A Wolff; Daniel Campos; Robert Lim; Keyue Ding; Gary Clark; Theodora Voskoglou-Nomikos; Mieke Ptasynski; Wendy Parulekar
Journal:  J Clin Oncol       Date:  2007-04-23       Impact factor: 44.544

5.  Common pitfalls in statistical analysis: "P" values, statistical significance and confidence intervals.

Authors:  Priya Ranganathan; C S Pramesh; Marc Buyse
Journal:  Perspect Clin Res       Date:  2015 Apr-Jun
  5 in total
  31 in total

1.  Association of Systemic Inflammatory and Anti-inflammatory Responses with Adverse Outcomes in Acute Pancreatitis: Preliminary Results of an Ongoing Study.

Authors:  Deepesh Sharma; Aparna Jakkampudi; Ratnakar Reddy; Panyala Balakumar Reddy; Aasish Patil; H V V Murthy; G Venkat Rao; D Nageshwar Reddy; Rupjyoti Talukdar
Journal:  Dig Dis Sci       Date:  2017-10-27       Impact factor: 3.199

Review 2.  Statistical data presentation: a primer for rheumatology researchers.

Authors:  Durga Prasanna Misra; Olena Zimba; Armen Yuri Gasparyan
Journal:  Rheumatol Int       Date:  2020-11-17       Impact factor: 2.631

3.  The accuracy of teledentistry in caries detection in children - A diagnostic study.

Authors:  Mohammad AlShaya; Deema Farsi; Nada Farsi; Najat Farsi
Journal:  Digit Health       Date:  2022-06-22

Review 4.  Randomized Controlled Trials in Lung, Gastrointestinal, and Breast Cancers: An Overview of Global Research Activity.

Authors:  J Connor Wells; Adam Fundytus; Shubham Sharma; Wilma M Hopman; Joseph C Del Paggio; Bishal Gyawali; Deborah Mukherji; Nazik Hammad; C S Pramesh; Ajay Aggarwal; Richard Sullivan; Christopher M Booth
Journal:  Curr Oncol       Date:  2022-04-07       Impact factor: 3.109

5.  Impact of macronutrient supplements on later growth of children born preterm or small for gestational age: A systematic review and meta-analysis of randomised and quasirandomised controlled trials.

Authors:  Luling Lin; Emma Amissah; Gregory D Gamble; Caroline A Crowther; Jane E Harding
Journal:  PLoS Med       Date:  2020-05-26       Impact factor: 11.069

6.  Deprivation matters: understanding associations between neighbourhood deprivation, unhealthy food outlets, unhealthy dietary behaviours and child body size using structural equation modelling.

Authors:  Victoria Egli; Matthew Hobbs; Jordan Carlson; Niamh Donnellan; Lisa Mackay; Daniel Exeter; Karen Villanueva; Caryn Zinn; Melody Smith
Journal:  J Epidemiol Community Health       Date:  2020-02-26       Impact factor: 3.710

7.  Does the implementation of a care pathway for patients with hip or knee osteoarthritis lead to fewer diagnostic imaging and referrals by general practitioners? A pre-post-implementation study of claims data.

Authors:  Esther H A van den Bogaart; Mariëlle E A L Kroese; Marieke D Spreeuwenberg; Ramon P G Ottenheijm; Patrick Deckers; Dirk Ruwaard
Journal:  BMC Fam Pract       Date:  2019-11-09       Impact factor: 2.497

8.  Pediatric Reference Intervals for Transferrin Saturation in the CALIPER Cohort of Healthy Children and Adolescents.

Authors:  Victoria Higgins; Man Khun Chan; Khosrow Adeli
Journal:  EJIFCC       Date:  2017-03-08

9.  Custom Focal Trough in Cone-Beam Computed Tomography Reformatted Panoramic Versus Digital Panoramic for Mental Foramen Position to Aid Implant Planning.

Authors:  Khaled Beshtawi; Emad Qirresh; Mohamed Parker; Shoayeb Shaik
Journal:  J Clin Imaging Sci       Date:  2020-06-08

10.  BMI trajectory in childhood is associated with asthma incidence at young adulthood mediated by DNA methylation.

Authors:  Rutu Rathod; Hongmei Zhang; Wilfried Karmaus; Susan Ewart; Latha Kadalayil; Caroline Relton; Susan Ring; S Hasan Arshad; John W Holloway
Journal:  Allergy Asthma Clin Immunol       Date:  2021-07-23       Impact factor: 3.406

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.