Literature DB >> 26719631

Illustrations on Using the Distribution of a P-value in High Dimensional Data Analyses.

Xiaojun Hu1, Gary L Gadbury2, Qinfang Xiang1, David B Allison3.   

Abstract

Several statistical methods have recently been developed that use the distribution of P-values from multiple tests of hypotheses to analyze data from high-dimensional experiments. These methods are only as valid as the P-values that were derived from test statistics. If an incorrect distribution for a test statistic was used, the P-value will not be valid and the distribution of P-values from multiple test statistics could give misleading results. Moreover, if the correct distribution of a test statistic is used, a distribution of P-values may still give misleading results if P-values are correlated. A primary focus of this paper is on the distribution of a P-value under a null hypothesis, and the test statistic that is considered is the number of rejected null hypotheses. Two issues are demonstrated using six data examples, two that are simulated and four from actual microarray experiments. The results provide some insight into how much of an effect might be introduced into a distribution of P-values if invalid P-values are computed or if P-values are correlated. Additional illustration is given regarding the distribution of a P-value under an alternative hypothesis and some approaches to modeling it are presented.

Entities:  

Keywords:  FDR; correlation; microarray; multiple testing; type I error

Year:  2010        PMID: 26719631      PMCID: PMC4692473     

Source DB:  PubMed          Journal:  Adv Appl Stat Sci        ISSN: 0974-6811


  16 in total

1.  Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-values.

Authors:  Stan Pounds; Stephan W Morris
Journal:  Bioinformatics       Date:  2003-07-01       Impact factor: 6.937

Review 2.  Microarray data analysis: from disarray to consolidation and consensus.

Authors:  David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal:  Nat Rev Genet       Date:  2006-01       Impact factor: 53.242

3.  Correlation between gene expression levels and limitations of the empirical bayes methodology for finding differentially expressed genes.

Authors:  Xing Qiu; Lev Klebanov; Andrei Yakovlev
Journal:  Stat Appl Genet Mol Biol       Date:  2005-11-22

4.  The behavior of the P-value when the alternative hypothesis is true.

Authors:  H M Hung; R T O'Neill; P Bauer; K Köhne
Journal:  Biometrics       Date:  1997-03       Impact factor: 2.571

5.  Identifying important results from multiple statistical tests.

Authors:  R A Parker; R B Rothenberg
Journal:  Stat Med       Date:  1988-10       Impact factor: 2.373

6.  Microarray profiling of isolated abdominal subcutaneous adipocytes from obese vs non-obese Pima Indians: increased expression of inflammation-related genes.

Authors:  Y H Lee; S Nair; E Rousseau; D B Allison; G P Page; P A Tataranni; C Bogardus; P A Permana
Journal:  Diabetologia       Date:  2005-07-30       Impact factor: 10.122

7.  Novel tumor necrosis factor alpha-regulated genes in rheumatoid arthritis.

Authors:  Huang-Ge Zhang; Karren Hyde; Grier P Page; Jacob P L Brand; Juling Zhou; Shaohua Yu; David B Allison; Hui-Chen Hsu; John D Mountz
Journal:  Arthritis Rheum       Date:  2004-02

8.  Assessing stability of gene selection in microarray data analysis.

Authors:  Xing Qiu; Yuanhui Xiao; Alexander Gordon; Andrei Yakovlev
Journal:  BMC Bioinformatics       Date:  2006-02-01       Impact factor: 3.169

9.  HDBStat!: a platform-independent software suite for statistical analysis of high dimensional biology data.

Authors:  Prinal Trivedi; Jode W Edwards; Jelai Wang; Gary L Gadbury; Vinodh Srinivasasainagendra; Stanislav O Zakharkin; Kyoungmi Kim; Tapan Mehta; Jacob P L Brand; Amit Patki; Grier P Page; David B Allison
Journal:  BMC Bioinformatics       Date:  2005-04-06       Impact factor: 3.169

10.  Evaluating statistical methods using plasmode data sets in the age of massive public databases: an illustration using false discovery rates.

Authors:  Gary L Gadbury; Qinfang Xiang; Lin Yang; Stephen Barnes; Grier P Page; David B Allison
Journal:  PLoS Genet       Date:  2008-06-20       Impact factor: 5.917

View more
  3 in total

1.  A comparison of methods for multiple degree of freedom testing in repeated measures RNA-sequencing experiments.

Authors:  Elizabeth A Wynn; Brian E Vestal; Tasha E Fingerlin; Camille M Moore
Journal:  BMC Med Res Methodol       Date:  2022-05-28       Impact factor: 4.612

2.  Inappropriate fiddling with statistical analyses to obtain a desirable p-value: tests to detect its presence in published literature.

Authors:  Gary L Gadbury; David B Allison
Journal:  PLoS One       Date:  2012-10-08       Impact factor: 3.240

3.  Characterization of Children's Exposure to Extremely Low Frequency Magnetic Fields by Stochastic Modeling.

Authors:  Marta Bonato; Marta Parazzini; Emma Chiaramello; Serena Fiocchi; Laurent Le Brusquet; Isabelle Magne; Martine Souques; Martin Röösli; Paolo Ravazzani
Journal:  Int J Environ Res Public Health       Date:  2018-09-08       Impact factor: 3.390

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.