Effect of removing outliers on statistical inference: implications to interpretation of experimental data in medical research.

Todd W Gress, James Denvir, Joseph I Shapiro.

Abstract

BACKGROUND: Data editing with elimination of "outliers" is commonly performed in the biomedical sciences. The effects of this type of data editing could influence study results, and with the vast and expanding amount of research in medicine, these effects would be magnified.
METHODS AND RESULTS: We first performed an anonymous survey of medical school faculty at institutions across the United States and found that indeed some form of outlier exclusion was performed by a large percentage of the respondents to the survey. We next performed Monte Carlo simulations of excluding high and low values from samplings from the same normal distribution. We found that removal of one pair of "outliers", specifically removal of the high and low values of the two samplings, respectively, had measurable effects on the type I error as the sample size was increased into the thousands. We developed an adjustment to the t score that accounts for the anticipated alteration of the type I error (t_adj = t_obs - 2(log(n)^0.5/n^0.5)), and propose that this be used when outliers are eliminated prior to parametric analysis.
CONCLUSION: Data editing with elimination of outliers that includes removal of high and low values from two samples, respectively, can have significant effects on the occurrence of type I error. This type of data editing could have profound effects in high volume research fields, particularly in medicine, and we recommend an adjustment to the t score be used to reduce the potential for error.
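
The simulation and adjustment described in the abstract can be sketched roughly as follows. This is an illustrative reconstruction, not the authors' code, and it rests on several assumptions that the abstract does not state: the logarithm is taken as the natural log, n is the per-group sample size before trimming, the t statistic is the pooled-variance two-sample form with the sign chosen so that trimming pushes it upward, and a normal critical value stands in for the t distribution. The names `t_statistic`, `adjusted_t`, and `simulate` are hypothetical.

```python
import math
from statistics import NormalDist

import numpy as np


def t_statistic(x, y):
    """Pooled-variance two-sample t statistic for mean(x) - mean(y)."""
    nx, ny = len(x), len(y)
    pooled_var = ((nx - 1) * x.var(ddof=1) + (ny - 1) * y.var(ddof=1)) / (nx + ny - 2)
    return (x.mean() - y.mean()) / math.sqrt(pooled_var * (1 / nx + 1 / ny))


def adjusted_t(t_obs, n):
    """t_adj = t_obs - 2 * sqrt(log(n)) / sqrt(n); natural log is an assumption."""
    return t_obs - 2 * math.sqrt(math.log(n)) / math.sqrt(n)


def simulate(n, n_sims=2000, alpha=0.05, seed=0):
    """Estimate two-sided rejection rates when both samples share one distribution.

    Returns the fraction of simulations rejected using the untrimmed t,
    the trimmed t, and the trimmed t after the adjustment.
    """
    rng = np.random.default_rng(seed)
    # Normal approximation to the critical value (an assumption; fine for large n).
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    counts = {"raw": 0, "trimmed": 0, "adjusted": 0}
    for _ in range(n_sims):
        a = rng.standard_normal(n)
        b = rng.standard_normal(n)
        t_raw = t_statistic(b, a)
        # Remove the high value from the first sample and the low value
        # from the second, which biases mean(b) - mean(a) upward.
        a_trim = np.delete(a, a.argmax())
        b_trim = np.delete(b, b.argmin())
        t_trim = t_statistic(b_trim, a_trim)
        t_adj = adjusted_t(t_trim, n)
        counts["raw"] += int(abs(t_raw) > z_crit)
        counts["trimmed"] += int(abs(t_trim) > z_crit)
        counts["adjusted"] += int(abs(t_adj) > z_crit)
    return {name: count / n_sims for name, count in counts.items()}


if __name__ == "__main__":
    for n in (20, 100, 1000):
        print(n, simulate(n))
```

Comparing the "raw" and "trimmed" rates at several values of n gives a rough sense of how removing a single pair of extreme values shifts the false-positive rate, and the "adjusted" rate shows what the correction does under these assumptions. Exact numbers depend on the seed and the number of simulations.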

Keywords:  experimental design; non-parametric; normal distribution; outliers; parametric

Year: 2018; PMID: 32923665; PMCID: PMC7485938; DOI: 10.18590/mjm.2018.vol4.iss2.9

Source DB: PubMed; Journal: Marshall J Med; ISSN: 2379-9536


