Literature DB >> 15691856

Empirical Bayes screening of many p-values with applications to microarray studies.

Susmita Datta1, Somnath Datta.   

Abstract

MOTIVATION: Statistical tests for the detection of differentially expressed genes lead to a large collection of p-values one for each gene comparison. Without any further adjustment, these p-values may lead to a large number of false positives, simply because the number of genes to be tested is huge, which might mean wastage of laboratory resources. To account for multiple hypotheses, these p-values are typically adjusted using a single step method or a step-down method in order to achieve an overall control of the error rate (the so-called familywise error rate). In many applications, this may lead to an overly conservative strategy leading to too few genes being flagged.
RESULTS: In this paper we introduce a novel empirical Bayes screening (EBS) technique to inspect a large number of p-values in an effort to detect additional positive cases. In effect, each case borrows strength from an overall picture of the alternative hypotheses computed from all the p-values, while the entire procedure is calibrated by a step-down method so that the familywise error rate at the complete null hypothesis is still controlled. It is shown that the EBS has substantially higher sensitivity than the standard step-down approach for multiple comparison at the cost of a modest increase in the false discovery rate (FDR). The EBS procedure also compares favorably when compared with existing FDR control procedures for multiple testing. The EBS procedure is particularly useful in situations where it is important to identify all possible potentially positive cases which can be subjected to further confirmatory testing in order to eliminate the false positives. We illustrated this screening procedure using a data set on human colorectal cancer where we show that the EBS method detected additional genes related to colon cancer that were missed by other methods. This novel empirical Bayes procedure is advantageous over our earlier proposed empirical Bayes adjustments due to the following reasons: (i) it offers an automatic screening of the p-values the user may obtain from a univariate (i.e., gene by gene) analysis package making it extremely easy to use for a non-statistician, (ii) since it applies to the p-values, the tests do not have to be t-tests; in particular they could be F-tests which might arise in certain ANOVA formulations with expression data or even nonparametric tests, (iii) the empirical Bayes adjustment uses nonparametric function estimation techniques to estimate the marginal density of the transformed p-values rather than using a parametric model for the prior distribution and is therefore robust against model mis-specification. AVAILABILITY: R code for EBS is available from the authors upon request. SUPPLEMENTARY INFORMATION: http://www.stat.uga.edu/~datta/EBS/supp.htm

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15691856     DOI: 10.1093/bioinformatics/bti301

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

Review 1.  Laser capture sampling and analytical issues in proteomics.

Authors:  Howard B Gutstein; Jeffrey S Morris
Journal:  Expert Rev Proteomics       Date:  2007-10       Impact factor: 3.940

2.  Adaptive choice of the number of bootstrap samples in large scale multiple testing.

Authors:  Wenge Guo; Shyamal Peddada
Journal:  Stat Appl Genet Mol Biol       Date:  2008-03-24

3.  False discovery rates: a new deal.

Authors:  Matthew Stephens
Journal:  Biostatistics       Date:  2017-04-01       Impact factor: 5.899

Review 4.  Gene--environment-wide association studies: emerging approaches.

Authors:  Duncan Thomas
Journal:  Nat Rev Genet       Date:  2010-04       Impact factor: 53.242

5.  Using DNA microarrays to assay part function.

Authors:  Virgil A Rhodius; Carol A Gross
Journal:  Methods Enzymol       Date:  2011       Impact factor: 1.600

6.  Ventral tegmental transcriptome response to intermittent nicotine treatment and withdrawal in BALB/cJ, C57BL/6ByJ, and quasi-congenic RQI mice.

Authors:  Csaba Vadasz; Mariko Saito; Danielle O'Brien; Jiri Zavadil; Grant Morahan; Goutam Chakraborty; Ray Wang
Journal:  Neurochem Res       Date:  2007-03       Impact factor: 3.996

Review 7.  Microproteomics: analysis of protein diversity in small samples.

Authors:  Howard B Gutstein; Jeffrey S Morris; Suresh P Annangudi; Jonathan V Sweedler
Journal:  Mass Spectrom Rev       Date:  2008 Jul-Aug       Impact factor: 10.946

8.  Statistical Methods for Proteomic Biomarker Discovery based on Feature Extraction or Functional Modeling Approaches.

Authors:  Jeffrey S Morris
Journal:  Stat Interface       Date:  2012-01-01       Impact factor: 0.582

9.  Identification of significant features in DNA microarray data.

Authors:  Eric Bair
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2013-07

10.  Presenting the uncertainties of odds ratios using empirical-Bayes prediction intervals.

Authors:  Wan-Yu Lin; Wen-Chung Lee
Journal:  PLoS One       Date:  2012-02-21       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.