Estimating the proportion of true null hypotheses for multiple comparisons.

Abstract

Whole genome microarray investigations (e.g. differential expression, differential methylation, ChIP-Chip) provide opportunities to test millions of features in a genome. Traditional multiple comparison procedures such as familywise error rate (FWER) controlling procedures are too conservative. Although false discovery rate (FDR) procedures have been suggested as having greater power, the control itself is not exact and depends on the proportion of true null hypotheses. Because this proportion is unknown, it has to be estimated accurately (small bias, small variance), preferably using a simple calculation that can be made accessible to the general scientific community. We propose an easy-to-implement method for estimating the proportion of true null hypotheses and make the R code available. This estimate has relatively small bias and small variance, as demonstrated by comparing it with four existing procedures on simulated and real data. Although presented here in the context of microarrays, this estimate is applicable to many multiple comparison situations.

Keywords: epigenomics; false discovery rate; microarray; multiple comparisons; type I error rate

Year: 2008 PMID: 19259400 PMCID: PMC2623313

Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351

Introduction

Genomic technologies are producing vast amounts of biological data that are the basis for investigations that require repetitive testing of the same hypothesis. Because the number of tests performed (e.g. for differential expression) is so large, the multiple comparison procedures that control the familywise error rate are sometimes too strict for biological applications (e.g. differential methylation). In fact, many biologists would rather accept several more false positives (i.e. type I errors; false rejections of the null hypothesis) than lose important information.

In an attempt to address the multiple comparison issue, Benjamini and Hochberg (1995) introduced an error rate measure called the False Discovery Rate (FDR). Specifically, a family of m hypothesis tests is considered, of which m0 null hypotheses are true. The proportion of erroneously rejected null hypotheses among all the rejected null hypotheses can be captured by the random variable Q = V/R, where R is the number of rejected hypotheses and V is the number of false rejections (type I errors). Benjamini and Hochberg (1995) formally define the FDR to be the expected proportion of falsely rejected hypotheses among all the rejections, FDR = E(Q) = E(V/R), where Q = 0 when R = 0 (no rejections). If we let p(1) ≤ p(2) ≤ ··· ≤ p(m) be the ordered p-values and H(i) be the null hypothesis corresponding to p(i), then in Benjamini and Hochberg’s (BH) FDR controlling procedure (Benjamini and Hochberg, 1995), K is the largest k such that p(k) ≤ (k/m)α, where α is the pre-chosen FDR significance level. If K exists, all null hypotheses H(i), i = 1, ···, K, are rejected. If no such K exists, then no hypotheses are rejected. The BH FDR controlling procedure controls the FDR at exactly the level (m0/m)α ≤ α, and hence conservatively at α, for independent test statistics and for any configuration of false null hypotheses (Benjamini and Yekutieli, 2001; Storey et al. 2004).
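The BH step-up rule described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' R code: the function name `benjamini_hochberg` and the example p-values are assumptions made here for demonstration.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Return the sorted indices of the hypotheses rejected by the BH step-up rule."""
    m = len(pvalues)
    # order[k-1] is the index of the k-th smallest p-value, p_(k).
    order = sorted(range(m), key=lambda i: pvalues[i])
    K = 0
    for k, idx in enumerate(order, start=1):
        if pvalues[idx] <= (k / m) * alpha:
            K = k  # remember the largest k with p_(k) <= (k/m) * alpha
    # Reject H_(1), ..., H_(K); if K stayed 0, nothing is rejected.
    return sorted(order[:K])


# Made-up p-values, for illustration only.
example_p = [0.025, 0.011, 0.6, 0.012, 0.9]
rejected = benjamini_hochberg(example_p, alpha=0.05)
print(rejected)
```

In this toy input the smallest p-value (0.011) exceeds its own threshold (0.01) but is still rejected, because the rule rejects every hypothesis up to the largest k that meets its threshold.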
In 2000 Benjamini and Hochberg proposed an adaptive procedure which provides more power than the original FDR controlling procedure by comparing each p(k) with (k/m̂0)α, where m̂0 is an estimate of m0. If the estimated value of m0 is such that m̂0 ≥ m0 with probability one, then the adaptive BH FDR controlling procedure will lead to FDR ≤ α. Because the accuracy and variation of the estimate of m0, or π0 = m0/m, directly affect the performance of the adaptive FDR controlling procedure, our focus is on the estimation and effect of π0. We propose a simple and easy-to-implement method for estimating the proportion of true null hypotheses. The performance of this estimate is compared to existing methods via simulated and real data.

Specifically, Benjamini and Hochberg (2000) estimated the number of true hypotheses from the observed p-values using the Lowest SLope (LSL) estimator. Their approach was based on a modification of the graphical method of Schweder and Spjotvoll (1982). Alternatively, Storey (2002) proposed an estimate of π0 by assuming the p-values corresponding to true null hypotheses are uniformly distributed on the interval (0,1) and selecting a reasonable tuning parameter 0 ≤ λ < 1. Finally, Langaas et al. (2005) derived estimators based on nonparametric maximum likelihood estimation of the p-value density, under the restriction of decreasing and convex decreasing densities.

Although Benjamini and Hochberg’s original and adaptive FDR controlling procedures were developed for independent statistics, these procedures can also be applied to some dependence structures (Benjamini and Yekutieli, 2001). Simulations have also demonstrated that they can be used for situations where there is a weak correlation structure among the genes (Storey et al. 2004). However, because of the small number of biological replicates used in most microarray experiments, it is very difficult to measure the correlation structure among a set or family of genes. Reiner et al. (2003) proposed a procedure for the general case, but it is conservative when compared to the adaptive FDR controlling procedures.
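As a rough sketch of the adaptive procedure described above, the only change from the original BH rule is that each ordered p-value is compared with (k/m̂0)α instead of (k/m)α. The function name `adaptive_bh`, the clamping of m̂0 to the range [1, m], and the example p-values are assumptions of this sketch.

```python
def adaptive_bh(pvalues, pi0_hat, alpha=0.05):
    """Adaptive BH: compare p_(k) with (k / m0_hat) * alpha, where m0_hat = pi0_hat * m."""
    m = len(pvalues)
    # Assumption of this sketch: keep m0_hat inside [1, m] so the threshold is well defined.
    m0_hat = min(max(pi0_hat * m, 1.0), float(m))
    order = sorted(range(m), key=lambda i: pvalues[i])
    K = 0
    for k, idx in enumerate(order, start=1):
        if pvalues[idx] <= (k / m0_hat) * alpha:
            K = k  # largest k meeting the adaptive threshold
    return sorted(order[:K])


# Made-up p-values, for illustration only.
example_p = [0.001, 0.012, 0.018, 0.028, 0.5, 0.6, 0.7, 0.8, 0.9, 0.95]
original = adaptive_bh(example_p, pi0_hat=1.0)   # pi0_hat = 1 reproduces the original BH rule
adaptive = adaptive_bh(example_p, pi0_hat=0.6)   # a smaller pi0_hat relaxes every threshold
print(original, adaptive)
```

With π̂0 = 1 only the smallest p-value is rejected in this toy example, while π̂0 = 0.6 rejects the four smallest, which illustrates where the extra power of the adaptive procedure comes from.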

Methods

Storey’s approach

Our approach is motivated by the work of Storey (2002), where the proportion of true null hypotheses, π0, is estimated by

$$\hat{\pi}_0(\lambda) = \frac{W(\lambda)}{(1-\lambda)\,m}, \qquad (2)$$

where W(λ) = #{p_i : p_i > λ}, and 0 ≤ λ < 1 is a tuning parameter. The rationale for this estimate is that p-values corresponding to true null hypotheses are uniformly distributed on the interval (0,1), so that most of the p-values close to 1 should come from true null hypotheses. Thus, for a reasonable λ, there are about m0(1 – λ) such p-values in the interval (λ,1], so that W(λ) ≈ m0(1 – λ). Black (2004) pointed out that Equation (2) is an unbiased estimate of π0 for all values of λ if all the null hypotheses are true and the p-values have a uniform distribution on the interval (0,1). However, there is an upward bias when the p-values come from both true null and true alternative hypotheses.

As it turns out, choosing the tuning parameter λ in Equation (2) is very important since there is a bias-variance trade-off. When λ → 0, the variance of π̂0(λ) becomes smaller and the bias of this estimate increases. When λ → 1, the bias of π̂0(λ) becomes smaller, and the variance of this estimate increases. To address this point, Storey et al. (2004) proposed a bootstrap method that automatically chooses λ when computing π̂0(λ). Instead of choosing one specific λ, Storey and Tibshirani (2003) proposed an estimate of π0 using

$$\hat{\pi}_0 = \lim_{\lambda \to 1} \hat{\pi}_0(\lambda),$$

so that the bias is small and there is a balance between bias and variance. For this approach, π̂0(λ) is computed over the grid λ = 0, 0.05, 0.10,…, 0.90 and then a natural cubic smoothing spline is fit to these values for the purpose of estimating the overall trend of π̂0(λ) as λ → 1. In the QVALUE (http://faculty.washington.edu/~jstorey/) software, the predicted value of π̂0(λ) at λ = 0.90 is chosen as the estimate of π0.
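The estimate in Equation (2) and its dependence on λ can be illustrated with a short Python sketch. The function name `storey_pi0` and the idealized p-values (60 null p-values on an even grid over (0,1), 40 alternative p-values spread evenly over (0,0.2), so that π0 = 0.6) are assumptions chosen to make the arithmetic easy to follow; this is not the QVALUE implementation.

```python
def storey_pi0(pvalues, lam):
    """Equation (2): W(lam) / ((1 - lam) * m), with W(lam) = #{p_i > lam}."""
    m = len(pvalues)
    w = sum(1 for p in pvalues if p > lam)
    return w / ((1.0 - lam) * m)


# Idealized, deterministic p-values (an assumption for illustration):
# 60 "null" p-values evenly spread over (0, 1) and 40 "alternative" ones over (0, 0.2).
null_p = [(j - 0.5) / 60 for j in range(1, 61)]
alt_p = [0.2 * (j - 0.5) / 40 for j in range(1, 41)]
pvals = null_p + alt_p

for lam in (0.0, 0.1, 0.5):
    print(lam, round(storey_pi0(pvals, lam), 4))
```

For small λ the alternative p-values still fall above λ and inflate the estimate. Once λ exceeds the largest alternative p-value, the estimate settles at the true proportion 0.6. Real data are noisier, which is where the variance side of the trade-off appears.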

Average estimate approach

As mentioned previously, the estimate

$$\hat{\pi}_0(\lambda) = \frac{W(\lambda)}{(1-\lambda)\,m},$$

where 0 ≤ λ < 1, has a large bias and small variance when λ is small, and a small bias and large variance when λ is large. Suppose that for each λ_i, where 0 < λ_1 < λ_2 < ··· < λ_n < 1, we compute π̂0(λ_i) as in Equation (2); then

$$\hat{\pi}_0(\lambda_i) = \pi_0 + \varepsilon_i,$$

where E[ε_i] = δ_i, δ_i ≥ δ_{i+1}, Var[ε_i] = σ_i², and σ_i² ≤ σ_{i+1}². Therefore, a natural choice is to consider the average of π̂0(λ_i) over the values of λ_i,

$$\hat{\pi}_0 = \frac{1}{n}\sum_{i=1}^{n}\hat{\pi}_0(\lambda_i).$$

The bias of π̂0, (1/n) Σ_{i=1}^{n} δ_i, is smaller than δ_1 (the bias of the estimate of π0 at λ = λ_1) and at the same time, π̂0 has a smaller variance. Considering the average of π̂0(λ) over a range of λ to estimate π0 reduces the problem to choosing the range of λ.

Define 0 = t_1 < t_2 < ··· < t_B < t_{B+1} = 1 as equally spaced points in the interval [0,1] such that the interval [0,1] is divided into B small intervals with equal length 1/B. Specifically, t_i = (i–1)/B. For example, when B = 10, t_1 = 0, t_2 = 0.1,…, t_10 = 0.9. For each t_i (i = 1,…,B), π̂0(t_i) is an estimate of π0 via Equation (2) with λ = t_i. The goal then becomes finding a subset of the t_i such that a new estimate of π0 is obtained by taking the average of the corresponding values of π̂0(t_i). Let NB_i denote the number of p-values which are greater than or equal to t_i, and let NS_i represent the number of p-values in the interval [t_i, t_{i+1}). Therefore,

$$\hat{\pi}_0(t_i) = \frac{NB_i}{(1-t_i)\,m},$$

where i = 1,…,B. If the NB_i p-values come from the null distribution, then on average there are NB_i/(B–i+1) p-values in each of the (B–i+1) small intervals on [t_i,1]. In other words, there are NB_i/(B–i+1) p-values in each small interval [t_j, t_{j+1}) for i ≤ j ≤ B. Since the p-values corresponding to the true alternative hypotheses are smaller than those corresponding to the true null hypotheses, there are more p-values in the intervals [t_i, t_{i+1}) with small index i. For small i, NS_i is usually greater than NB_i/(B–i+1). Therefore, starting from i = 1, we find the first i such that NS_i ≤ NB_i/(B–i+1). If such i exists, t_i can be considered as the change point and we assume all the p-values bigger than t_i come from the true null hypotheses. From this π0 can be estimated by

$$\hat{\pi}_0 = \frac{1}{B-i+1}\sum_{j=i}^{B}\hat{\pi}_0(t_j),$$

where i = min{i : NS_i ≤ NB_i/(B–i+1)}. In order to find the range of λ, only a lower bound of λ is required. The large values of t_i are used to ensure that the estimate of π0 has small bias. This is equivalent to fitting a straight line with slope 0 in the bottom right part of a π̂0(t) versus t plot, such that the intercept provides the estimate of π0. A simple modification of this approach is to estimate π0 by taking the average of π̂0(t_j) from j = i–1 to B, that is,

$$\hat{\pi}_0(B) = \frac{1}{B-i+2}\sum_{j=i-1}^{B}\hat{\pi}_0(t_j), \qquad (8)$$

where i = min{i : NS_i ≤ NB_i/(B–i+1)}. This modification increases the upward bias and decreases the variance, as π̂0(t_{i–1}) has smaller variance and bigger bias than π̂0(t_j) for j = i,…,B.

A remaining challenge is how to choose B, that is, how many values of λ should be used in the interval [0,1]. Recall that a motivating factor of the proposed average estimate approach is to balance the bias and variance. The natural way to measure both the bias and variance is the mean-squared error, E[π̂0(B) − π0]². Since the true value of π0 is unknown and the theoretical result is intractable, we take a bootstrap approach in the following way:

1. For each B ∈ I, I = {5, 10, 20, 50, 100}, compute π̂0(B) as in Equation (8).

2. Form N bootstrap samples of the p-values, and compute the bootstrap estimates π̂0^{*b}(B) for b = 1,…,N and B ∈ {5, 10, 20, 50, 100}.

3. For each B ∈ I, estimate its respective mean-squared error as

$$\widehat{MSE}(B) = \frac{1}{N}\sum_{b=1}^{N}\left[\hat{\pi}_0^{*b}(B) - \bar{\pi}_0\right]^2, \quad \text{where} \quad \bar{\pi}_0 = \frac{1}{|I|}\sum_{B \in I}\hat{\pi}_0(B).$$

4. Let B̂ = argmin_{B ∈ I} MSE(B); then the estimate of π0 is π̂0 = π̂0(B̂).

Notice that in step three the value of π0 is estimated by the average of the π̂0(B) over a range of B.
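The change-point average in Equation (8) and the bootstrap choice of B can be sketched in Python as follows. This is an illustrative reading of the description above, not the authors' R code. The names `average_pi0` and `choose_b_bootstrap`, the handling of a change point found in the very first interval (where t_{i–1} does not exist), and the idealized example p-values are all assumptions of this sketch.

```python
import random


def average_pi0(pvalues, B=10):
    """Average estimate of pi0 over t_j = (j - 1) / B, starting one step before the change point."""
    m = len(pvalues)
    # ns[i]: number of p-values in the i-th interval (0-based); p = 1 goes into the last one.
    ns = [0] * B
    for p in pvalues:
        ns[min(int(p * B), B - 1)] += 1
    # nb[i]: number of p-values at or above the left end of interval i.
    nb = [0] * B
    running = 0
    for i in range(B - 1, -1, -1):
        running += ns[i]
        nb[i] = running
    pi0_t = [nb[i] / ((1.0 - i / B) * m) for i in range(B)]
    # Change point: first interval whose count does not exceed the count expected under the null.
    # With 0-based i there are (B - i) intervals left; the last interval always qualifies.
    change = next(i for i in range(B) if ns[i] <= nb[i] / (B - i))
    # Assumption: if the change point is the first interval, simply average over everything.
    start = max(change - 1, 0)
    tail = pi0_t[start:]
    return sum(tail) / len(tail)


def choose_b_bootstrap(pvalues, candidates=(5, 10, 20, 50, 100), n_boot=100, seed=0):
    """Pick B by the bootstrap mean-squared error criterion in steps 1 to 4."""
    rng = random.Random(seed)
    m = len(pvalues)
    point = {B: average_pi0(pvalues, B) for B in candidates}
    target = sum(point.values()) / len(candidates)  # stands in for the unknown pi0
    sq_err = {B: 0.0 for B in candidates}
    for _ in range(n_boot):
        sample = [pvalues[rng.randrange(m)] for _ in range(m)]  # resample with replacement
        for B in candidates:
            sq_err[B] += (average_pi0(sample, B) - target) ** 2
    best_B = min(candidates, key=lambda B: sq_err[B] / n_boot)
    return best_B, point[best_B]


# Idealized p-values (assumption): 60 null values over (0, 1), 40 alternative values over (0, 0.2).
pvals = [(j - 0.5) / 60 for j in range(1, 61)] + [0.2 * (j - 0.5) / 40 for j in range(1, 41)]
est_b10 = average_pi0(pvals, B=10)
best_B, est_boot = choose_b_bootstrap(pvals)
print(round(est_b10, 4), best_B, round(est_boot, 4))
```

With B = 10 the change point in this toy data set is the third interval, so the average starts at t = 0.1 and gives roughly 0.625, slightly above the true proportion 0.6, which matches the small upward bias the modification is designed to produce.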

Results

Simulation studies

To investigate the performance of the proposed average estimate approach, a simulation study was performed. Taking m = 1,000 (i.e. 1,000 genes are tested for differential expression), let π0 vary over a wide range, say π0 = 0.50, 0.60,…, 0.90, which are reasonable values for microarray experiments. Hypotheses, H0: μ_i = 0 versus H1: μ_i > 0, are tested for independent random variables Z_i (i = 1,…,m) from the null distribution N(0,1) and the alternative distribution N(2,1). Specifically, mπ0 and m(1 – π0) random variables have mean 0 and 2, respectively. For each test, the p-value is computed as p_i = P(Z > z_i), where Z is a random variable from a standard normal distribution N(0,1) and z_i is the observed value of Z_i. For each value of π0, l = 1,000 data sets were simulated.

For the choice of B, B is either fixed (i.e. B = 5, 10, 20, 50, and 100) or chosen by the proposed bootstrap approach. For each of the l = 1,000 simulated data sets, when B is fixed, the estimate of π0 is computed via Equation (8), that is,

$$\hat{\pi}_0(B) = \frac{1}{B-i+2}\sum_{j=i-1}^{B}\hat{\pi}_0(t_j),$$

where i = min{i : NS_i ≤ NB_i/(B–i+1)}. If such i does not exist, π0 is estimated by the average of π̂0(t_{B–1}) and π̂0(t_B). For the bootstrap approach to automatically choose B, the range of B is 5, 10, 20, 50, 100.

For completeness, the performance of the proposed average estimate approach is compared with several existing procedures: (1) Benjamini and Hochberg’s lowest slope estimate (LSL) (Benjamini and Hochberg, 2000), (2) Storey’s bootstrap estimate (Storeyboot) (Storey et al. 2004), (3) Storey and Tibshirani’s smoother estimate (STsmoother) (Storey and Tibshirani, 2003), and (4) Langaas et al.’s nonparametric maximum likelihood estimate (convest) (Langaas et al. 2005). For procedures 2 and 3, the QVALUE software (http://faculty.washington.edu/~jstorey/) was employed. For procedure 4, the R function ‘convest’ was taken from the R package ‘limma’, which is part of the Bioconductor project (http://www.bioconductor.org). Table 1 summarizes the simulation results. The bias is estimated by the average of π̂0,i − π0 over the l simulated data sets, and the standard deviation by the standard deviation of the l estimates π̂0,i.

Table 1

The estimate of the proportion of true null hypotheses is compared for: Benjamini and Hochberg’s lowest slope approach (LSL), Storey’s π̂0(λ) estimate with λ selected via bootstrapping (Storeyboot), Storey and Tibshirani’s smoother method (STsmoother), Langaas et al.’s nonparametric maximum likelihood approach (convest), and the proposed average estimate approach with fixed values of B = 5, 10, 20, 50, 100 and with B chosen via the bootstrapping procedure (Bboot). There are 1,000 simulated data sets, each with a total of m = 1,000 hypothesis tests, for each value of π0.

π₀	0.5	0.6	0.7	0.8	0.9
	Estimates of π₀
LSL	0.7151	0.7889	0.8561	0.9184	0.9683
Storey_boot	0.4814	0.5789	0.6765	0.7728	0.8660
ST_smoother	0.4951	0.5939	0.6980	0.7993	0.8973
convest	0.4963	0.5938	0.6947	0.7921	0.8882
B = 5	0.5132	0.6113	0.7136	0.8086	0.9058
B = 10	0.5082	0.6084	0.7083	0.8045	0.9052
B = 20	0.5141	0.6128	0.7115	0.8076	0.9064
B = 50	0.5196	0.6175	0.7156	0.8106	0.9078
B = 100	0.5243	0.6210	0.7180	0.8122	0.9085
B_boot	0.5195	0.6175	0.7148	0.8113	0.9082

	Standard deviation of π₀ estimates
LSL	0.0323	0.0269	0.0225	0.0155	0.0092
Storey_boot	0.0467	0.0491	0.0513	0.0522	0.0549
ST_smoother	0.0513	0.0570	0.0608	0.0654	0.0656
convest	0.0331	0.0364	0.0337	0.0321	0.0328
B = 5	0.0335	0.0356	0.0420	0.0428	0.0382
B = 10	0.0391	0.0390	0.0402	0.0412	0.0366
B = 20	0.0331	0.0343	0.0358	0.0371	0.0331
B = 50	0.0293	0.0309	0.0321	0.0334	0.0315
B = 100	0.0272	0.0291	0.0307	0.0321	0.0312
B_boot	0.0301	0.0301	0.0313	0.0313	0.0311

In the bias and standard deviation estimates, π̂0,i is the estimate of π0 from the i-th simulated data set, and π0 is the true value. As Table 1 shows, the LSL approach has the largest upward bias, which guarantees that Benjamini and Hochberg’s adaptive FDR controlling procedure controls the FDR below a pre-chosen FDR level. However, the FDR can be much lower than the pre-chosen FDR level. The LSL approach also has the smallest variation in most settings. The other three existing approaches (2-4) all underestimate the proportion of true null hypotheses. The proposed average estimate approach provides estimates of π0 that have an upward but relatively small bias and relatively small variance, regardless of whether B is fixed or automatically chosen via the bootstrap procedure. When B increases, the bias generally increases and the variation decreases. Both the small upward bias and the small variance provide evidence that the proposed average estimate approach has better properties when compared to the other approaches.

The average of the true false discovery rate (FDR) from 1,000 simulations is also compared in this simulation study by applying Benjamini and Hochberg’s adaptive FDR controlling procedure (Benjamini and Hochberg, 2000) with π0 estimated using the five methods mentioned above (Fig. 1). The FDR significance level was chosen as α = 0.05. For the purpose of comparison, the original BH FDR controlling procedure (Benjamini and Hochberg, 1995) and the adaptive FDR controlling procedure with the incorporation of the true value of π0 were also applied to the p-values. It can be seen that the original BH FDR controlling procedure has the lowest FDR, as expected. Because Benjamini and Hochberg’s lowest slope approach overestimates π0, its FDR is well below the pre-chosen α, although this approach has a higher FDR than the original BH procedure. Storey’s bootstrap estimate, the smoother estimate and the convest estimate produce higher FDRs than the pre-chosen level because all three methods underestimate π0. Because our proposed average estimate approach slightly overestimates π0, its FDR is below but very close to the pre-chosen significance level α = 0.05. Table 1 also demonstrates that the proposed average estimate has relatively small variation.
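The simulation design and the bias and standard deviation summaries can be sketched in Python as follows. To keep the example short and self-contained, Storey's estimate at a fixed λ = 0.5 is used as a stand-in estimator, and only 200 data sets are generated for a single value of π0. The seed, the function names and these reductions are assumptions of this sketch, so its output is not comparable to Table 1.

```python
import random
from statistics import NormalDist, mean, stdev

STD_NORMAL = NormalDist()  # the N(0, 1) reference distribution


def simulate_pvalues(m, pi0, rng, effect=2.0):
    """One data set: m * pi0 statistics from N(0, 1), the rest from N(effect, 1)."""
    m0 = round(m * pi0)
    pvalues = []
    for i in range(m):
        mu = 0.0 if i < m0 else effect
        z = rng.gauss(mu, 1.0)
        pvalues.append(1.0 - STD_NORMAL.cdf(z))  # one-sided p-value P(Z > z)
    return pvalues


def storey_pi0(pvalues, lam=0.5):
    """Stand-in estimator: Storey's estimate at a fixed lambda."""
    return sum(p > lam for p in pvalues) / ((1.0 - lam) * len(pvalues))


rng = random.Random(2008)  # arbitrary seed, for reproducibility only
true_pi0 = 0.8
estimates = [storey_pi0(simulate_pvalues(1000, true_pi0, rng)) for _ in range(200)]

bias = mean(estimates) - true_pi0  # average of (estimate - truth)
sd = stdev(estimates)              # spread of the estimates across data sets
print(round(bias, 4), round(sd, 4))
```

Any of the estimators compared in the study could be substituted for `storey_pi0` to obtain the corresponding bias and standard deviation under the same design.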

Figure 1

Simulation results of the False Discovery Rate (FDR) at significance level α = 0.05 for seven procedures: Benjamini and Hochberg’s FDR controlling procedure with incorporation of the true π0 (BHπ0), Benjamini and Hochberg’s FDR controlling procedure (BH), Benjamini and Hochberg’s adaptive approach with incorporation of the estimate of π0 obtained by the proposed average estimate procedure where B is chosen via bootstrapping (Bboot), Benjamini and Hochberg’s lowest slope approach (LSL), Storey’s bootstrapping approach (Storeyboot), Storey and Tibshirani’s smoother method (STsmoother), and Langaas et al.’s nonparametric maximum likelihood estimate (convest), respectively. The black straight line represents FDR = 0.05. The total number of hypothesis tests is m = 1,000 and the size of the simulation study is 1,000 for each value of π0.

The power of the five adaptive FDR controlling procedures is compared (Fig. 2). The power of a procedure is measured by average power, which is defined to be the ratio of the average number of correct rejections of true alternative hypotheses to the total number of true alternative hypotheses. Formally, average power = E(S)/(m – m0), where S is the number of correctly rejected hypotheses. As illustrated, the power decreases when π0 increases for all of the FDR controlling procedures. The original BH procedure has the lowest power, while Benjamini and Hochberg’s adaptive procedure with the LSL estimate has the second lowest power. It is not surprising that the Storeyboot procedure has the largest statistical power, because the FDR of this procedure exceeds the pre-chosen FDR significance level (Fig. 1).
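The realized false discovery proportion V/R and the average power E(S)/(m – m0) can be estimated by simulation along the following lines. This Python sketch applies the adaptive BH rule with the true m0 plugged in (the BHπ0 reference procedure). The function name `one_run`, the seed, and the use of 100 replicates at a single π0 are assumptions made to keep the example small.

```python
import random
from statistics import NormalDist, mean

STD_NORMAL = NormalDist()


def one_run(m, pi0, alpha, rng, effect=2.0):
    """Simulate one data set and return (V/R, S/(m - m0)) for adaptive BH with the true m0."""
    m0 = round(m * pi0)
    tests = []
    for i in range(m):
        is_null = i < m0
        z = rng.gauss(0.0 if is_null else effect, 1.0)
        tests.append((1.0 - STD_NORMAL.cdf(z), is_null))
    tests.sort(key=lambda t: t[0])  # order by p-value
    K = 0
    for k, (p, _) in enumerate(tests, start=1):
        if p <= (k / m0) * alpha:
            K = k  # largest k with p_(k) <= (k / m0) * alpha
    V = sum(is_null for _, is_null in tests[:K])  # false rejections
    S = K - V                                     # correct rejections
    fdp = V / K if K > 0 else 0.0                 # Q = 0 when R = 0
    return fdp, S / (m - m0)


rng = random.Random(7)  # arbitrary seed
runs = [one_run(1000, 0.8, 0.05, rng) for _ in range(100)]
fdr_hat = mean(f for f, _ in runs)    # Monte Carlo estimate of the FDR
power_hat = mean(s for _, s in runs)  # Monte Carlo estimate of the average power
print(round(fdr_hat, 3), round(power_hat, 3))
```

Replacing `m0` in the threshold by an estimate m̂0 = π̂0·m gives the corresponding quantities for any of the π0 estimators under comparison.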

Figure 2

Simulation results for the evaluation of statistical power at significance level α = 0.05 for seven procedures: Benjamini and Hochberg’s FDR controlling procedure with incorporation of the true π0 (BHπ0), Benjamini and Hochberg’s FDR controlling procedure (BH), Benjamini and Hochberg’s adaptive approach with incorporation of the estimate of π0 obtained by the proposed average estimate procedure where B is chosen via bootstrapping (Bboot), Benjamini and Hochberg’s lowest slope approach (LSL), Storey’s bootstrapping approach (Storeyboot), Storey and Tibshirani’s smoother method (STsmoother) and Langaas et al.’s nonparametric maximum likelihood estimate (convest), respectively. The total number of hypothesis tests is m = 1,000, and the size of the simulation study is 1,000 for each value of π0.

Microarray data application

The same five methods for estimating π0 were also applied to the training samples of the leukemia data of Golub et al. (1999), which consist of 27 patients with acute lymphoblastic leukemia (ALL) and 11 patients with acute myeloid leukemia (AML). The samples were assayed using Affymetrix Hgu6800 chips and the gene expression data of 7129 genes (Affymetrix probes) are available from the R package golubEsets. For each gene, a simple two-sample t-test was employed for testing differential gene expression and the p-value was computed. Table 2 gives the estimate of the proportion of true null hypotheses and the number of statistically significant genes.

Table 2

The estimate of the proportion of true null hypotheses and the number of statistically significant genes for the leukemia data (Golub et al. 1999) at significance level α = 0.05 after applying Benjamini and Hochberg’s adaptive FDR controlling procedure with π0 estimated using five methods: Benjamini and Hochberg’s lowest slope approach (LSL), Storey’s π̂0(λ) estimate with λ selected via bootstrapping (Storeyboot), Storey and Tibshirani’s smoother method (STsmoother), Langaas et al.’s convest approach (convest), and the proposed average approach with B chosen via the bootstrapping procedure (Bboot). A two-sample t-test was used to compute the p-values.

Method	Estimate of π₀	Number of significant genes
LSL	0.899	584
Storey_boot	0.595	787
ST_smoother	0.583	791
convest	0.595	787
B_boot	0.604	776

From this real data analysis, it can be seen that Benjamini and Hochberg’s LSL approach conservatively overestimates π0, and hence leads to the lowest power in terms of the number of rejections. Our proposed average approach provides a slightly larger estimate than Storey’s bootstrap approach, the smoother estimate and the nonparametric maximum likelihood approach (convest), even though all four end up with a similar number of rejections.

Summary

As array technology improves, it is anticipated that the number of features per array will only increase, hence multiple comparisons will continue to be a challenging problem. Specific to microarrays, the false discovery rate (FDR) is preferred to the family-wise error rate (FWER) because the FDR controlling procedures have more statistical power than the FWER controlling procedures, even at the cost of a few more type I errors (i.e. false positives). Since Benjamini and Hochberg (1995) proposed their FDR controlling procedure, a variety of methods have been proposed to estimate π0, the proportion of true null hypotheses. As seen here, overestimating π0 controls the FDR below the specified rate. When our estimate of π0, or those of others, is incorporated into Benjamini and Hochberg’s FDR controlling procedure, the adaptive FDR controlling procedure has more power and an FDR close to the pre-chosen level.

In this work, we have compared several methods for estimating the proportion of true null hypotheses (π0). Benjamini and Hochberg’s lowest slope approach (Benjamini and Hochberg, 2000) overestimates π0. Storey’s estimate π̂0(λ) (Storey, 2002) also overestimates π0 for any fixed value 0 ≤ λ < 1. When λ → 1, the bias becomes smaller, and the variance becomes bigger. In order to find the optimal λ such that π̂0(λ) has small variation, Storey proposed a bootstrapping method (Storey et al. 2004). However, this method underestimates π0 and the downward bias increases as the true value of π0 gets bigger. Storey and Tibshirani (2003) proposed a smoother method to estimate π0 such that the estimate has small bias. Unfortunately, this method also underestimates π0, although the bias is very small. Furthermore, the variation of this estimate is relatively large, which makes the adaptive FDR controlling procedure unstable. More recently, Langaas et al. (2005) proposed an estimate based on the nonparametric maximum likelihood function of the p-value density restricted to convex decreasing densities. However, this method also underestimates π0, most likely because the distribution of the p-values is not decreasing for large p-values and tends to be flat.

Using the limitations of the existing approaches for estimating π0 as the motivation, we propose the average estimate approach, which takes the average of the estimates of π0 over a range of equally spaced points on the interval [0,1]. While our average estimate approach has a slightly larger bias, it also has smaller variation than the other methods apart from the LSL approach. Furthermore, when compared to the other methods it is easy to implement (e.g. in Excel) when the number of points used in the approach is fixed (say, B = 10), and can be automated to choose B via a bootstrap procedure (R code available: www.stat.purdue.edu/~doerge). When our proposed estimated value of π0 is incorporated into Benjamini and Hochberg’s adaptive FDR controlling procedure, more statistical power is gained such that the FDR can be controlled below, yet extremely close to, a desired level α.

References

1. Identifying differentially expressed genes using false discovery rate controlling procedures.

Authors: Anat Reiner; Daniel Yekutieli; Yoav Benjamini
Journal: Bioinformatics Date: 2003-02-12 Impact factor: 6.937

2. Statistical significance for genomewide studies.

Authors: John D Storey; Robert Tibshirani
Journal: Proc Natl Acad Sci U S A Date: 2003-07-25 Impact factor: 11.205

3. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Authors: T R Golub; D K Slonim; P Tamayo; C Huard; M Gaasenbeek; J P Mesirov; H Coller; M L Loh; J R Downing; M A Caligiuri; C D Bloomfield; E S Lander
Journal: Science Date: 1999-10-15 Impact factor: 47.728
