Literature DB >> 8465801

p values, hypothesis tests, and likelihood: implications for epidemiology of a neglected historical debate.

S N Goodman1.   

Abstract

It is not generally appreciated that the p value, as conceived by R. A. Fisher, is not compatible with the Neyman-Pearson hypothesis test in which it has become embedded. The p value was meant to be a flexible inferential measure, whereas the hypothesis test was a rule for behavior, not inference. The combination of the two methods has led to a reinterpretation of the p value simultaneously as an "observed error rate" and as a measure of evidence. Both of these interpretations are problematic, and their combination has obscured the important differences between Neyman and Fisher on the nature of the scientific method and inhibited our understanding of the philosophic implications of the basic methods in use today. An analysis using another method promoted by Fisher, mathematical likelihood, shows that the p value substantially overstates the evidence against the null hypothesis. Likelihood makes clearer the distinction between error rates and inferential evidence and is a quantitative tool for expressing evidential strength that is more appropriate for the purposes of epidemiology than the p value.

Mesh:

Year:  1993        PMID: 8465801     DOI: 10.1093/oxfordjournals.aje.a116700

Source DB:  PubMed          Journal:  Am J Epidemiol        ISSN: 0002-9262            Impact factor:   4.897


  48 in total

Review 1.  Sifting the evidence-what's wrong with significance tests?

Authors:  J A Sterne; G Davey Smith
Journal:  BMJ       Date:  2001-01-27

2.  Automated detection of informative combined effects in genetic association studies of complex traits.

Authors:  Nadia Tahri-Daizadeh; David-Alexandre Tregouet; Viviane Nicaud; Nicolas Manuel; François Cambien; Laurence Tiret
Journal:  Genome Res       Date:  2003-08       Impact factor: 9.043

Review 3.  Disease-modifying therapy in MS: a critical review of the literature. Part I: Analysis of clinical trial errors.

Authors:  Douglas S Goodin
Journal:  J Neurol       Date:  2004-09       Impact factor: 4.849

4.  Statistics in brief: when to use and when not to use a threshold p value.

Authors:  Bruno Falissard
Journal:  Clin Orthop Relat Res       Date:  2012-01       Impact factor: 4.176

5.  A practical solution to the pervasive problems of p values.

Authors:  Eric-Jan Wagenmakers
Journal:  Psychon Bull Rev       Date:  2007-10

6.  A Bayesian measure of the probability of false discovery in genetic epidemiology studies.

Authors:  Jon Wakefield
Journal:  Am J Hum Genet       Date:  2007-07-03       Impact factor: 11.025

7.  Up from 'false positives' in genetic-and other-epidemiology.

Authors:  Olli S Miettinen
Journal:  Eur J Epidemiol       Date:  2008-10-22       Impact factor: 8.082

8.  The researcher and the consultant: a dialogue on null hypothesis significance testing.

Authors:  Andreas Stang; Charles Poole
Journal:  Eur J Epidemiol       Date:  2013-12       Impact factor: 8.082

9.  Methods of linkage analysis--and the assumptions underlying them [see comment].

Authors:  R C Elston
Journal:  Am J Hum Genet       Date:  1998-10       Impact factor: 11.025

Review 10.  Association chain graphs: modelling etiological pathways.

Authors:  Michael Höfler; Hans-Ulrich Wittchen; Roselind Lieb; Jürgen Hoyer; Robert H Friis
Journal:  Int J Methods Psychiatr Res       Date:  2003       Impact factor: 4.035

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.