An open-source software program for performing Bonferroni and related corrections for multiple comparisons.

Abstract

Increased type I error resulting from multiple statistical comparisons remains a common problem in the scientific literature and may result in the reporting and promulgation of spurious findings. One approach to this problem is to correct groups of P-values for "family-wide significance" using a Bonferroni correction or the less conservative Bonferroni-Holm correction, or to correct for the "false discovery rate" with a Benjamini-Hochberg correction. Although several solutions are available for performing these corrections through commercially available software, there are no widely available, easy-to-use open-source programs to perform these calculations. In this paper, we present an open-source program written in Python 3.2 that performs calculations for standard Bonferroni, Bonferroni-Holm, and Benjamini-Hochberg corrections.

Keywords: Bonferroni correction; software program; type I error

Year: 2011 PMID: 22276243 PMCID: PMC3263024 DOI: 10.4103/2153-3539.91130

Source DB: PubMed Journal: J Pathol Inform

BACKGROUND

When multiple hypotheses are tested in a single experiment, the risk of type I error is increased and with it the risk of promulgating spurious “significant” findings.[1-3] The likelihood of obtaining at least one false positive result increases with the number of tests performed. For example, the probability of obtaining at least one false positive result when performing 10 independent tests is given by

$1 - P(A)^{10} = 1 - 0.95^{10} = 0.4013$ (1)

where P(A) is the confidence level of each test (here 0.95).

Although the problems associated with multiple testing are well known, numerous studies still fail to correct their reported P-values. For instance, Bennett et al. found that only between 60% and 74% of the neuroimaging articles published in several major journals corrected for multiple comparisons.[4] Similarly, Austin et al. demonstrated that failure to account for multiple testing resulted in statistically significant, yet implausible results.[5] In both cases the results were no longer significant after correcting for multiple testing.

The lack of attention paid to this problem in the pathology literature stands in stark contrast to its recognition in other fields such as ecology, where there has been intense interest for over two decades since the seminal publication by Rice.[6] That being said, even within the field of ecology this topic still engenders debate.[7] A systematic exploration of this problem in the pathology literature has not been undertaken; however, we have previously reported on a convenience sample of 800 publications from the pathology literature in 2003, of which 37 presented multiple comparisons. Twenty-one of these 37 did not attempt to control for increased type I error due to multiple comparisons.[8]

One means of reducing the type I error from multiple testing is the Bonferroni correction, which controls the family-wise error rate (FWER). The FWER is the probability of at least one type I error among the entire set of hypotheses.
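The calculation in equation (1) can be reproduced for any number of tests. The following is a minimal sketch, assuming the tests are independent and share the same significance level; the function name `family_wise_error_rate` is hypothetical and is not taken from the authors' program.

```python
def family_wise_error_rate(alpha, n_tests):
    """Probability of at least one false positive among n_tests
    independent tests, each performed at significance level alpha.

    Assumption: the tests are independent, as in equation (1).
    """
    confidence = 1.0 - alpha           # P(A) in equation (1)
    return 1.0 - confidence ** n_tests


if __name__ == "__main__":
    # Show how the uncorrected error rate grows with the number of tests.
    for n in (1, 5, 10, 20):
        print(n, round(family_wise_error_rate(0.05, n), 4))
```

For 10 tests at alpha = 0.05 this gives 0.4013, in agreement with equation (1).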
The Bonferroni correction is calculated as follows:

$P_{corrected} = P_{original} \cdot n$ (2)

where n is the number of hypotheses tested. There is a lack of consensus as to what actually represents a “family” of statistical tests; however, it has been suggested that if it is appropriate to place multiple P-values in the same table, it may be appropriate to correct all values in that table for multiple comparisons.[6]

Because the Bonferroni correction is conservative with regard to statistical power, other methods of correcting for multiple testing have been developed. Another method that controls the FWER is the Bonferroni-Holm correction.[9] The Bonferroni-Holm correction is calculated as follows:

$P_{corrected} = P_{original} \cdot (n - k + 1)$ (3)

where n is the number of hypotheses tested, and k is the ordered rank of the uncorrected P-values (from smallest P-value to largest P-value).

Rather than controlling the probability of one or more type I errors in the entire experiment, some of the more recent approaches to the multiple testing problem have focused on controlling the false discovery rate (FDR) in the experiment. By controlling the proportion of type I errors among the rejected hypotheses, this approach has the advantage of further increasing statistical power, and is especially suitable when conducting numerous hypothesis tests.[10,11] The Benjamini-Hochberg method[12] is a commonly used way to control the FDR of an experiment. It is calculated as follows:

$P_{corrected} = P_{original} \cdot \frac{n}{k}$ (4)

where n is the number of hypotheses tested, and k is the rank of the uncorrected P-value (from smallest to largest).

Several commercial statistical software packages are capable of performing one or more of these corrections, as is at least one open-source program (GNU R); however, the cost of the commercial packages, and the learning curves involved, may discourage researchers from using these programs.
Online tools are also available (e.g., http://www.quantitativeskills.com/sisa/calculations/bonfer.htm) but are limited in scope and available options and rely on continued access to the publisher's website.
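The three corrections described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' source code: the function names are hypothetical, and two conventions are assumed that the formulas in the text do not state, namely that corrected P-values are capped at 1 and that they are kept monotone with respect to rank (a running maximum for Bonferroni-Holm, a running minimum for Benjamini-Hochberg), as is commonly done in statistical packages.

```python
def bonferroni(p_values):
    """Bonferroni: multiply each P-value by n (capped at 1, an assumption)."""
    n = len(p_values)
    return [min(1.0, p * n) for p in p_values]


def _rank_order(p_values):
    """Indices of p_values sorted from smallest to largest P-value."""
    return sorted(range(len(p_values)), key=lambda i: p_values[i])


def holm(p_values):
    """Bonferroni-Holm: P * (n - k + 1), where k is the ascending rank.

    Assumption: a running maximum keeps corrected values monotone.
    """
    n = len(p_values)
    corrected = [0.0] * n
    running_max = 0.0
    for k, i in enumerate(_rank_order(p_values), start=1):
        running_max = max(running_max, min(1.0, p_values[i] * (n - k + 1)))
        corrected[i] = running_max
    return corrected


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg: P * n / k, where k is the ascending rank.

    Assumption: a running minimum (from the largest rank down) keeps
    corrected values monotone.
    """
    n = len(p_values)
    order = _rank_order(p_values)
    corrected = [0.0] * n
    running_min = 1.0
    for k in range(n, 0, -1):
        i = order[k - 1]
        running_min = min(running_min, p_values[i] * n / k)
        corrected[i] = running_min
    return corrected


if __name__ == "__main__":
    example = [0.01, 0.04, 0.03, 0.005]  # made-up P-values for illustration
    for name, func in (("Bonferroni", bonferroni),
                       ("Bonferroni-Holm", holm),
                       ("Benjamini-Hochberg", benjamini_hochberg)):
        print(name, [round(p, 4) for p in func(example)])
```

Each function returns the corrected P-values in the same order as the input, so they can be compared directly with the chosen alpha level.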

“Bonferroni Calculator” software

Using the open-source programming language Python v3.2, we developed a program capable of performing Bonferroni, Bonferroni-Holm, and Benjamini-Hochberg corrections for any number of P-values. The user is prompted for a set of P-values and the desired significance (alpha) level. From the main menu, the user may choose to display the results of the desired correction on the screen or to export the corrected P-values to the hard disk (text and csv file types).

The source code is available free as a supplementary file to this article (which may serve as a literature reference for the program). A copy of the source code may also be obtained by email from the corresponding author. The program requires the free programming language Python 3.2, which runs on Microsoft Windows, Mac OS, and Linux/Unix operating systems. It may be downloaded from http://www.python.org/getit/releases/3.2/. The program is also available for free by emailing the senior author at christopher.naugler@cls.ab.ca. Detailed instructions and a FAQ are available at https://sites.google.com/site/christophernaugler/.

To use the Bonferroni Calculator software, place the files “Bonferroni Calculator.py” and “Lesack and Naugler.txt” in a folder on your hard drive. In Windows, the program will run from the command line by double-clicking on the “Bonferroni Calculator.py” icon; however, the preferred method is to right-click on the icon and select “Edit with IDLE” from the dropdown list. Press F5 to run the software, and then maximize the size of the window. Follow the instructions on the screen. If the option to save the results to files is selected, these will be found in the same folder as the “Bonferroni Calculator.py” icon. The program is also available from the authors as a stand-alone executable file.
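The workflow described above (read a set of P-values and an alpha level, then display or export the corrected values) might look roughly like the following. This is a hypothetical sketch and does not reproduce the Bonferroni Calculator source: the function names, the CSV column names, and the rule that a result is flagged significant when the corrected P-value is less than or equal to alpha are all assumptions made for illustration. Only the standard Bonferroni correction is shown.

```python
import csv


def parse_p_values(text):
    """Parse a comma- or whitespace-separated string of P-values."""
    values = [float(token) for token in text.replace(",", " ").split()]
    for p in values:
        if not 0.0 <= p <= 1.0:
            raise ValueError("P-value outside [0, 1]: {}".format(p))
    return values


def build_rows(p_values, alpha=0.05):
    """Return (original, Bonferroni-corrected, significant) per P-value.

    Assumption: 'significant' means corrected P-value <= alpha.
    """
    n = len(p_values)
    rows = []
    for p in p_values:
        corrected = min(1.0, p * n)
        rows.append((p, corrected, corrected <= alpha))
    return rows


def export_csv(path, rows):
    """Write the rows to a CSV file with a header line."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["p_original", "p_corrected", "significant"])
        writer.writerows(rows)
```

A caller could, for example, pass the result of `build_rows(parse_p_values("0.01, 0.04"), alpha=0.05)` to `export_csv` with a file name of their choosing.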

3 in total

Review 1. Adjusting for multiple testing--when and how?

Authors: R Bender; S Lange
Journal: J Clin Epidemiol Date: 2001-04 Impact factor: 6.437

2. Testing multiple statistical hypotheses resulted in spurious associations: a study of astrological signs and health.

Authors: Peter C Austin; Muhammad M Mamdani; David N Juurlink; Janet E Hux
Journal: J Clin Epidemiol Date: 2006-07-11 Impact factor: 6.437

3. ANALYZING TABLES OF STATISTICAL TESTS.

Authors: William R Rice
Journal: Evolution Date: 1989-01 Impact factor: 3.694
