Liang Chen1, Tiejun Tong, Hongyu Zhao. 1. Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA. liang.chen@usc.edu
Abstract
MOTIVATION: Multiple comparison adjustment is a significant and challenging statistical issue in large-scale biological studies. In previous studies, dependence among genes is largely ignored. However, such dependence may be strong for some genomic-scale studies such as genetical genomics [also called expression quantitative trait loci (eQTL) mapping] in which thousands of genes are treated as quantitative traits and mapped to different genetical markers. Besides the dependence among markers, the dependence among the expression levels of genes can also have a significant impact on data analysis and interpretation. RESULTS: In this article, we propose to consider both the mean as well as the variance of false discovery number for multiple comparison adjustment to handle dependence among hypotheses. This is achieved by developing a variance estimator for false discovery number, and using the upper bound of false discovery proportion (uFDP) for false discovery control. More importantly, we introduce a weighted version of uFDP (wuFDP) control to improve the statistical power of eQTL identification. In addition, the wuFDP approach can better control false positives than false discovery rate (FDR) and uFDP approaches when markers are in linkage disequilibrium. The relative performance of uFDP control and wuFDP control is illustrated through simulation studies and real data analysis. SUPPLEMENTARY INFORMATION: Supplementary figures, tables and appendices are available at Bioinformatics online.
MOTIVATION: Multiple comparison adjustment is a significant and challenging statistical issue in large-scale biological studies. In previous studies, dependence among genes is largely ignored. However, such dependence may be strong for some genomic-scale studies such as genetical genomics [also called expression quantitative trait loci (eQTL) mapping] in which thousands of genes are treated as quantitative traits and mapped to different genetical markers. Besides the dependence among markers, the dependence among the expression levels of genes can also have a significant impact on data analysis and interpretation. RESULTS: In this article, we propose to consider both the mean as well as the variance of false discovery number for multiple comparison adjustment to handle dependence among hypotheses. This is achieved by developing a variance estimator for false discovery number, and using the upper bound of false discovery proportion (uFDP) for false discovery control. More importantly, we introduce a weighted version of uFDP (wuFDP) control to improve the statistical power of eQTL identification. In addition, the wuFDP approach can better control false positives than false discovery rate (FDR) and uFDP approaches when markers are in linkage disequilibrium. The relative performance of uFDP control and wuFDP control is illustrated through simulation studies and real data analysis. SUPPLEMENTARY INFORMATION: Supplementary figures, tables and appendices are available at Bioinformatics online.
Authors: Leonid Bystrykh; Ellen Weersing; Bert Dontje; Sue Sutton; Mathew T Pletcher; Tim Wiltshire; Andrew I Su; Edo Vellenga; Jintao Wang; Kenneth F Manly; Lu Lu; Elissa J Chesler; Rudi Alberts; Ritsert C Jansen; Robert W Williams; Michael P Cooke; Gerald de Haan Journal: Nat Genet Date: 2005-02-13 Impact factor: 38.330
Authors: Eric E Schadt; Stephanie A Monks; Thomas A Drake; Aldons J Lusis; Nam Che; Veronica Colinayo; Thomas G Ruff; Stephen B Milligan; John R Lamb; Guy Cavet; Peter S Linsley; Mao Mao; Roland B Stoughton; Stephen H Friend Journal: Nature Date: 2003-03-20 Impact factor: 49.962
Authors: Barbara E Stranger; Matthew S Forrest; Andrew G Clark; Mark J Minichiello; Samuel Deutsch; Robert Lyle; Sarah Hunt; Brenda Kahl; Stylianos E Antonarakis; Simon Tavaré; Panagiotis Deloukas; Emmanouil T Dermitzakis Journal: PLoS Genet Date: 2005-12-16 Impact factor: 5.917