Literature DB >> 25162562

Controlling false positive rates in research and its clinical implications.

Rafael Simas, Felipe Maestri, David Normando.   

Abstract

Entities:  

Mesh:

Year:  2014        PMID: 25162562      PMCID: PMC4296621          DOI: 10.1590/2176-9451.19.3.024-025.ebo

Source DB:  PubMed          Journal:  Dental Press J Orthod        ISSN: 2176-9451


× No keyword cloud information.
Statistical analysis is, in fact, an error analysis. A statistical test does not guarantee reliable results, it only quantifies the probability of error of a given conclusion.[1] While reading the articles of this journal, you will find a p-value. For instance, the article by Garib et al[2] describes the p-values for a given variable at two different moments: this p-value, also known as false-positive rate,[1] demonstrates the probability of error when asserting that there is a difference before and after expansion. Every research is subjected to some degree of error, given that we are not investigating an entire population, but only a fraction, a sample. For this reason, when we compare two samples undergoing different treatment procedures with a view to identifying the most efficient therapy, we will always have the chance of having reached a wrong conclusion. Therefore, the lower the p-value is, the smaller the chance of error and, as a result, the more certain we are to assure that treatment "A" is more efficient than "B". But, how can we control a false-positive error? Initially, we have to decide on the significance level (α) we expect to establish. In Dentistry, we usually set a significance level not greater than 5% (α = 5%). Nevertheless, should we increase the number of comparisons of a given study, we increase the chances of yielding outcomes that are due just to chance and, as a consequence, finding a false-positive result. The lottery is a good example. The chances of winning are little, less than 5%. However, the more we bet, the higher our chances of winning. In statistical tests, there is a dramatic increase in false-positive rates, in which the number of comparisons is directly proportional to the number of false-positive results, as shown in Table 1.
Table 1

Number of comparisons (tests) and increase in false-positive rates.

# testsα valueFW α
10.050.05
30.050.14
60.050.26
100.050.4
150.050.54

αfw= 1 - (1 - αpc)c

C = # of comparisons, αpc stands for error type I (0.05).

Number of comparisons (tests) and increase in false-positive rates. αfw= 1 - (1 - αpc)c C = # of comparisons, αpc stands for error type I (0.05). Thus, when we make several comparisons using a simple statistical test, we significantly increase the chances of yielding a false-positive result. Table 1 demonstrates that the chances of yielding a false-positive result are of 40% for a study involving 10 comparisons. In these cases, some adjustments are necessary to keep the significance level set at 5%. One of the procedures employed to correct false-positive rates is the Bonferroni correction. It consists of dividing the significance level by the number of comparisons made in a given study.[3] Suppose we carried out a comparative analysis of five cephalometric variables between two groups using an independent t-test. By dividing the significance level initially set at 0.05 or 5% by 5, the new level of error will be adjusted to 0.01 or 1%. Thus, differences will be considered significant for a p-value lower than or equal to 0.01. Nevertheless, Bonferroni correction results in a much more inflexible significance level than necessary, thus increasing the chances of yielding a false-negative rate.[4] In 1995, Benjamini and Hochberg[5] (BH) suggested another method to counteract false-positive rates when multiple comparisons with univariate statistical analysis are carried out. In this procedure, the researcher has to accept a minor false-positive rate and set this rate before the procedure. Suppose we compared 10 cephalometric measures between two populations A and B. After the number of comparisons is established, we determine the p-value for each analysis and organize these values in ascending order. The value of i = 1 (0.01) will be lower than the p-value, with i = 10 being the highest value. Table 2 shows the p-values in ascending order. After values are properly ranked, we apply the Benjamini-Hochberg formula: (i/m).Q (Q = false-positive acceptance rate; m = total number of comparisons). This formula allows us to correct the p-value and eliminate potential false-positive rates. With a view to obtaining the Q value, we divide the number of comparisons with P < 0.05 by the number of comparisons with P > 0.05. Table 3 shows that after finding the Q value and applying the Benjamini-Hocheberg formula, we find the corrected p-value for each comparison (i = 1, i = 2, etc.). Subsequently, we arrange the data in a table similar to Table 3, including the initial p-value and the p-value corrected by means of the formula. This method allows us to determine which comparisons are significant, in which case only those with a p-value lower than [(i/m).Q] are significant.6 Table 3 shows that comparisons 1 and 2 are the only ones with p-value lower than [(i/m).Q].
Table 2
ComparisonsP-value
i = 10.01
i = 20.017
i = 30.2
i = 40.22
i = 50.23
i = 60.3
i = 70.35
i = 80.4
i = 90.45
i = 100.5
Table 3
ComparisonsP-value(i/m). Q
i = 10.010.025
i = 20.0170.05
i = 30.20.075
i = 40.220.1
i = 50.230.125
i = 60.30.15
i = 70.350.175
i = 80.40.2
i = 90.450.225
i = 100.50.25

Q= 2/8 = 0.25.

Q= 2/8 = 0.25. In this same example, should we use Bonferroni correction to counteract error type I, comparisons 1 and 2 would probably not be significant, since α = 5% divided by the number of comparisons (ten) would result in 0.05/10 = 0.005. This value would be lower than comparisons 1 and 2 corrected by the BH technique, which demonstrates how strict Bonferroni's procedure is. Choosing the wrong statistical test may lead clinicians to jump to conclusions. For instance, a given treatment may be considered the best one as a result of statistical analysis. Thus, statistical analysis is the key to reach more reliable clinical results. Employing more simple statistical procedures, such as the t-test, to carry out multiple comparisons, creates the need to counteract type I error (false-positive). Therefore, it is reasonable to conclude that multiple comparisons require one to carefully choose the test as well as the corrections to be employed.
  2 in total

1.  The false discovery rate: a key concept in large-scale genetic studies.

Authors:  James J Chen; Paula K Roberson; Michael J Schell
Journal:  Cancer Control       Date:  2010-01       Impact factor: 3.302

2.  Immediate periodontal bone plate changes induced by rapid maxillary expansion in the early mixed dentition: CT findings.

Authors:  Daniela Gamba Garib; Maria Helena Ocké Menezes; Omar Gabriel Silva Filho; Patricia Bittencourt Dutra Santos
Journal:  Dental Press J Orthod       Date:  2014 May-Jun
  2 in total
  2 in total

1.  Vitamin D levels and vitamin D receptor variants are associated with chronic heart failure in Chinese patients.

Authors:  Yi Hao; Yueling Chen
Journal:  J Clin Lab Anal       Date:  2019-02-04       Impact factor: 2.352

2.  Clinical Characteristics in Early Childhood Associated with a Nevus-Prone Phenotype in Adults from Tropical Australia: Two Decades of Follow-Up of the Townsville Preschool Cohort Study.

Authors:  Ramez Barsoum; Simone L Harrison
Journal:  Int J Environ Res Public Health       Date:  2020-11-23       Impact factor: 3.390

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.