Literature DB >> 15339301

Multiple-testing strategy for analyzing cDNA array data on gene expression.

Robert R Delongchamp1, John F Bowyer, James J Chen, Ralph L Kodell.   

Abstract

An objective of many functional genomics studies is to estimate treatment-induced changes in gene expression. cDNA arrays interrogate each tissue sample for the levels of mRNA for hundreds to tens of thousands of genes, and the use of this technology leads to a multitude of treatment contrasts. By-gene hypotheses tests evaluate the evidence supporting no effect, but selecting a significance level requires dealing with the multitude of comparisons. The p-values from these tests order the genes such that a p-value cutoff divides the genes into two sets. Ideally one set would contain the affected genes and the other would contain the unaffected genes. However, the set of genes selected as affected will have false positives, i.e., genes that are not affected by treatment. Likewise, the other set of genes, selected as unaffected, will contain false negatives, i.e., genes that are affected. A plot of the observed p-values (1 - p) versus their expectation under a uniform [0, 1] distribution allows one to estimate the number of true null hypotheses. With this estimate, the false positive rates and false negative rates associated with any p-value cutoff can be estimated. When computed for a range of cutoffs, these rates summarize the ability of the study to resolve effects. In our work, we are more interested in selecting most of the affected genes rather than protecting against a few false positives. An optimum cutoff, i.e., the best set given the data, depends upon the relative cost of falsely classifying a gene as affected versus the cost of falsely classifying a gene as unaffected. We select the cutoff by a decision-theoretic method analogous to methods developed for receiver operating characteristic curves. In addition, we estimate the false discovery rate and the false nondiscovery rate associated with any cutoff value. Two functional genomics studies that were designed to assess a treatment effect are used to illustrate how the methods allowed the investigators to determine a cutoff to suit their research goals.

Mesh:

Substances:

Year:  2004        PMID: 15339301     DOI: 10.1111/j.0006-341X.2004.00228.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  14 in total

1.  Analysis of gene expression in pathophysiological states: balancing false discovery and false negative rates.

Authors:  Andrew W Norris; C Ronald Kahn
Journal:  Proc Natl Acad Sci U S A       Date:  2006-01-09       Impact factor: 11.205

2.  Optimal screening for promising genes in 2-stage designs.

Authors:  B Moerkerke; E Goetghebeur
Journal:  Biostatistics       Date:  2008-03-18       Impact factor: 5.899

3.  A threshold neurotoxic amphetamine exposure inhibits parietal cortex expression of synaptic plasticity-related genes.

Authors:  J F Bowyer; A R Pogge; R R Delongchamp; J P O'Callaghan; K M Patel; K E Vrana; W M Freeman
Journal:  Neuroscience       Date:  2006-10-13       Impact factor: 3.590

4.  Post hoc power estimation in large-scale multiple testing problems.

Authors:  Sonja Zehetmayer; Martin Posch
Journal:  Bioinformatics       Date:  2010-02-25       Impact factor: 6.937

5.  Estimating effect sizes in genome-wide association studies.

Authors:  József Bukszár; Edwin J C G van den Oord
Journal:  Behav Genet       Date:  2010-01-06       Impact factor: 2.805

6.  False-Negative-Rate Based Approach for Selecting Top Single-Nucleotide Polymorphisms in the First Stage of a Two-Stage Genome-Wide Association Study.

Authors:  Zhuying Huang; Jian Wang; Chih-Chieh Wu; Richard S Houlston; Melissa L Bondy; Sanjay Shete
Journal:  Stat Interface       Date:  2011       Impact factor: 0.582

7.  Sample size reassessment for a two-stage design controlling the false discovery rate.

Authors:  Sonja Zehetmayer; Alexandra C Graf; Martin Posch
Journal:  Stat Appl Genet Mol Biol       Date:  2015-11

8.  Association of HADHA expression with the risk of breast cancer: targeted subset analysis and meta-analysis of microarray data.

Authors:  Manju Mamtani; Hemant Kulkarni
Journal:  BMC Res Notes       Date:  2012-01-12

9.  Designing toxicogenomics studies that use DNA array technology.

Authors:  Robert R Delongchamp; Cruz Velasco; Varsha G Desai; Taewon Lee; James C Fuscoe
Journal:  Bioinform Biol Insights       Date:  2008-08-14

10.  Transcriptome profiling of peripheral blood cells identifies potential biomarkers for doxorubicin cardiotoxicity in a rat model.

Authors:  Valentina K Todorova; Marjorie L Beggs; Robert R Delongchamp; Ishwori Dhakal; Issam Makhoul; Jeanne Y Wei; V Suzanne Klimberg
Journal:  PLoS One       Date:  2012-11-27       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.