
Evaluating statistical significance in a meta-analysis by using numerical integration.

Yin-Chun Lin¹, Yu-Jen Liang¹, Hsin-Chou Yang¹,²,³,⁴.

Abstract

Meta-analysis is a method for enhancing statistical power through the integration of information from multiple studies. Various methods for integrating p-values (i.e., statistical significance), including Fisher's method under an independence assumption, the permutation method, and the decorrelation method, have been broadly used in bioinformatics and computational biotechnology studies. However, these methods have limitations related to statistical assumptions, computing efficiency, and the accuracy of statistical significance estimation. In this study, we proposed a numerical integration method and examined its theoretical properties. Simulation studies were conducted to evaluate its Type I error, statistical power, computational efficiency, and estimation accuracy, and the results were compared with those of other methods. The results demonstrate that our proposed method performs well in terms of Type I error, statistical power, computing efficiency (regardless of sample size), and statistical significance estimation accuracy. P-value data from multiple large-scale genome-wide association studies (GWASs) and transcriptome-wide association studies (TWASs) were analyzed. The results demonstrate that our proposed method can be used to identify critical genomic regions associated with rheumatoid arthritis and asthma, increase statistical significance in individual GWASs and TWASs, and control false positives more effectively than Fisher's method under an independence assumption. We created the software package Pbine, available on GitHub (https://github.com/Yinchun-Lin/Pbine).


Keywords: Decorrelation; Fisher’s method; Genome-wide association study (GWAS); Major histocompatibility complex (MHC); Meta-analysis; North American Rheumatoid Arthritis Consortium (NARAC); P-value combination; Permutation; Single nucleotide polymorphism (SNP); Transcriptome-wide association study (TWAS); Wellcome Trust Case Control Consortium (WTCCC)

Year: 2022 PMID: 35860413 PMCID: PMC9283883 DOI: 10.1016/j.csbj.2022.06.055

Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155

Introduction

Fisher’s p-value combination method [1], which integrates the statistical significance (i.e., the p-values) of many statistical hypothesis tests, was originally proposed to examine a joint null hypothesis (i.e., an intersection of the individual null hypotheses) in a meta-analysis [2]. The test statistic of Fisher’s method is defined as negative two times the sum of the log-transformed p-values (i.e., $F = -2\sum_{k=1}^{K} \log p_k$ for $K$ p-values $p_1, \ldots, p_K$) [1]. This method has been broadly applied in bioinformatics and computational biotechnology studies, such as meta-analyses of genome-wide association studies (GWASs) [3], [4], [5] and of transcriptome-wide association studies (TWASs) [6], [7], [8]. If all the p-values independently follow a Uniform(0,1) distribution under the null hypothesis, the test statistic $F$ follows a chi-squared distribution with $2K$ degrees of freedom. As such, the exact p-value ($p_F$) of $F$ can be derived. However, when the p-values are correlated, the mathematical derivation of the sampling distribution of $F$ becomes intractable. Notably, ignoring the correlation among p-values inflates false positives in the subsequent statistical inference [9]. The permutation procedure [10], which involves non-parametric resampling without replacement based on a set of observed data, has been applied to generate a null distribution and calculate an empirical p-value of $F$ when the independence assumption of p-values is violated. The permutation procedure is simple in concept and robust to various correlation structures of p-values. However, this method has some limitations. For instance, in a GWAS with a large sample size, which is used to evaluate the phenotype–genotype relationship, permutations over the phenotype status of study samples require intensive computation. Permutations over single nucleotide polymorphisms (SNPs) may distort the inherent structure of SNPs (i.e., linkage disequilibrium). 
In addition, numerous permutations and intensive computation are required to correct for multiple testing in GWASs and TWASs [11], [12]. The permutation procedure also requires raw data (permutations cannot be performed on p-value data alone), which are not always available. For instance, in a meta-analysis of GWASs, the p-values of single-locus association tests from public genomic databases (e.g., GWAS Catalog [13] and GWAS Central [14]) are available and can be combined to infer the genetic association between SNPs and a phenotype of interest. However, GWAS Catalog and GWAS Central provide no raw genotype or phenotype data. Without the need for raw data, a decorrelation procedure [15] transforms dependent p-values into independent p-values on the basis of only the p-value data and the correlation structure of the p-values. The distribution of $F$ can thus be derived and computed rapidly. However, in this study, we demonstrated that the order in which the p-values are combined influences the result (the order-noninterchangeable property). For example, if two p-values with a correlation coefficient of 0.5 are combined, the decorrelation procedure provides one p-value of $F$; if the order of the combination is reversed, it provides a different p-value. Considering the limitations of the permutation and decorrelation methods, we proposed a numerical integration method ($p_N$) to evaluate the statistical significance of $F$. Herein, the theoretical properties of a p-value combination are examined. Type I error, statistical power, computational efficiency, and estimation accuracy were evaluated through simulation studies and compared with those of Fisher’s method under an independence assumption ($p_F$), the permutation method, and the decorrelation method. Real-world examples of meta-analyses of GWASs and meta-analyses of TWASs are given. The R code, Pbine, is provided on GitHub at https://github.com/Yinchun-Lin/Pbine.
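As a point of reference for Fisher’s method under independence, the following minimal Python sketch computes the statistic (negative two times the sum of the log p-values) and its chi-squared p-value. It is not part of Pbine; the function names are hypothetical, and the closed-form tail probability is the standard one for a chi-squared distribution with an even number of degrees of freedom.

```python
import math


def fisher_statistic(pvalues):
    """Fisher's statistic: negative two times the sum of log p-values."""
    return -2.0 * sum(math.log(p) for p in pvalues)


def chi2_upper_tail_even_df(x, k):
    """Upper-tail probability of a chi-squared variable with 2*k degrees
    of freedom, using the closed form exp(-x/2) * sum_{j<k} (x/2)^j / j!."""
    half = x / 2.0
    term, total = 1.0, 1.0
    for j in range(1, k):
        term *= half / j
        total += term
    return math.exp(-half) * total


def fisher_pvalue(pvalues):
    """Combined p-value, valid only if the p-values are independent."""
    return chi2_upper_tail_even_df(fisher_statistic(pvalues), len(pvalues))


# Hypothetical input: two independent p-values.
p_combined = fisher_pvalue([0.01, 0.02])
```

For two p-values the result simplifies to c(1 − log c), where c is the product of the two p-values, which gives a convenient check on the independence case.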

Methods

Proposed method

Let $p = (p_1, \ldots, p_K)$ denote a list of dependent p-values from K hypothesis tests with a correlation matrix $\Sigma$. Because the correlation is approximately invariant under monotone transformations [15], $p$ is related through a transformation T to a list of independent p-values; T is defined by the Cholesky factor of $\Sigma$ and by $\Phi$, the cumulative distribution function of a standard normal random variable. Consider the p-values calculated from the raw data observations for testing the K null hypotheses. In our method, the p-value of F ($p_N$) is calculated by integrating the joint probability density function of the dependent p-values (Text S1) over a region determined by the observed p-values (Eq. (1)). Note that the p-value of F is fixed under any permutation of the observed p-values. For the reduced case of two p-values with a single correlation coefficient, the p-value of F is derived in Text S1; the resulting expression involves the derivative of the cumulative distribution function of a standard normal variate. A logarithmic transformation of the p-values was considered to avoid integration over a curvilinear region, and the p-value in Eq. (1) can be rewritten accordingly. If the correlation of the two p-values is zero, the p-value in Eq. (1) reduces to that of Fisher’s method. The detailed derivation of the p-value (Text S1), its symmetric and limiting behavior (Text S2), and a comparison of the p-value of our proposed method ($p_N$) with the individual p-values (Text S3 and Fig. S1) are provided.
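The idea of evaluating the significance of F by numerical integration can be illustrated for two p-values. The sketch below is not the authors’ implementation and does not reproduce Eq. (1); it assumes that the two p-values are linked by a Gaussian copula with correlation `rho` (one-sided p-values of correlated standard normal statistics) and integrates the probability that the product of the two p-values is at most the observed product. The function name, the midpoint rule, and the number of steps are illustrative choices.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()


def combined_pvalue_two(p1, p2, rho, n_steps=20000):
    """P(U1 * U2 <= p1 * p2), where U1 = Phi(Z1), U2 = Phi(Z2) and
    (Z1, Z2) are standard normal with correlation rho (assumed model)."""
    if not -1.0 < rho < 1.0:
        raise ValueError("rho must lie strictly between -1 and 1")
    c = p1 * p2
    scale = math.sqrt(1.0 - rho * rho)
    # For U1 <= c the event holds for every U2, contributing probability c.
    # For U1 = u > c we need U2 <= c / u; given Z1 = z1, Z2 is normal
    # with mean rho * z1 and standard deviation sqrt(1 - rho^2).
    step = (1.0 - c) / n_steps
    total = 0.0
    for i in range(n_steps):
        u = c + (i + 0.5) * step  # midpoint rule on (c, 1)
        z1 = STD_NORMAL.inv_cdf(u)
        z2_cut = STD_NORMAL.inv_cdf(c / u)
        total += STD_NORMAL.cdf((z2_cut - rho * z1) / scale)
    return c + step * total


p_independent = combined_pvalue_two(0.05, 0.10, rho=0.0)
p_correlated = combined_pvalue_two(0.05, 0.10, rho=0.5)
```

Because the result depends on the observed p-values only through their product, swapping the two inputs leaves it unchanged, and with `rho = 0` it agrees with Fisher’s method for two p-values.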

Simulation studies

To evaluate the Type I error, power, and computation time of the p-value combination methods, bivariate linear regression models were applied to generate data in the simulation studies. Model parameters were assigned or estimated according to the purpose of each simulation study. The p-values of our numerical integration method ($p_N$), Fisher’s method under an independence assumption ($p_F$), and the decorrelation method were calculated and compared with those of the benchmark method, the permutation method.

Evaluation of Type I error

A bivariate linear regression model was applied to generate two phenotypes and gene expression data in the simulation study, where N denotes the total number of samples and G indicates the total number of genes. The random error terms of the two phenotypes have a specified correlation coefficient. The noise terms were assumed to be independent white noises following a standard normal distribution. Gene expression was generated independently. Here, the regression coefficients were set under the null hypothesis, and the sample size, gene number, and number of simulation replications were fixed. Given a specified correlation coefficient, we generated the phenotypes and gene expression data. In each of the simulated datasets, two p-values were obtained by testing the two null hypotheses, one per phenotype, through Student’s t test. The results demonstrate that the methods control Type I error well for uncorrelated p-values (Fig. 1A). For positively correlated p-values, Fisher’s method under an independence assumption ($p_F$; red line) exhibited an inflated Type I error, particularly as the correlation of the p-values increased. Our method and the decorrelation procedure exhibited a Type I error similar to that of the benchmark permutation method; the false-positive rate was 0.05–0.06. This indicates that p-value dependency must be considered when the sampling distribution of a p-value combination is derived. Therefore, our proposed method controls Type I error well.
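The inflation of Type I error under an independence assumption can be illustrated with a small Monte Carlo sketch. This is a simplification of the simulation described above, not a reproduction of it: instead of fitting regressions, it draws two correlated standard normal test statistics under the null, converts them to one-sided p-values, and applies Fisher’s method for two p-values. The significance level of 0.05, the number of replications, and all names are assumptions made for illustration.

```python
import math
import random
from statistics import NormalDist


def fisher_pvalue_two(p1, p2):
    """Fisher's combined p-value for two p-values under independence."""
    c = p1 * p2
    return c * (1.0 - math.log(c))


def fisher_type1_error(rho, n_sim=20000, alpha=0.05, seed=1):
    """Fraction of null replications rejected by Fisher's method when the
    two test statistics have correlation rho (assumed Gaussian model)."""
    rng = random.Random(seed)
    phi = NormalDist().cdf
    scale = math.sqrt(1.0 - rho * rho)
    rejections = 0
    for _ in range(n_sim):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rho * z1 + scale * rng.gauss(0.0, 1.0)
        # Guard against a p-value of exactly zero before taking logs.
        p1 = max(phi(z1), 1e-300)
        p2 = max(phi(z2), 1e-300)
        if fisher_pvalue_two(p1, p2) <= alpha:
            rejections += 1
    return rejections / n_sim


rate_uncorrelated = fisher_type1_error(rho=0.0)
rate_correlated = fisher_type1_error(rho=0.9)
```

Under this simplified model the rejection rate stays near the nominal level when the correlation is zero and rises above it as the correlation grows, which is the qualitative pattern reported for Fisher’s method in Fig. 1A.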

Fig. 1

(A) Type I error of the p-value combination methods. The x-axis represents the correlation. The y-axis represents the Type I error. (B) Statistical power of the p-value combination methods. The x-axis represents the correlation. The y-axis represents the statistical power. (C) Computation time of the proposed method ($p_N$) and the permutation method. The x-axis indicates the number of permutations in thousands (K). The y-axis indicates the computation time. Red, green, and blue lines denote gene numbers G of 10,000, 20,000, and 30,000, respectively. Squares, triangles, and circles denote sample sizes N of 3,000, 5,000, and 7,000, respectively. Solid, dotted, and dot-dashed lines represent correlation coefficients of 0.1, 0.5, and 0.9, respectively. (D)–(F) Estimation accuracy of the p-value. The x-axis represents the p-values of the benchmark permutation method. The y-axis indicates the p-values of the other three p-value combination methods, shown in brown, blue, and green. The results based on correlation coefficients of 0.3, 0.5, and 0.7 are arranged from left to right.

Evaluation of statistical power

A bivariate linear regression model was applied to generate two phenotypes and gene expression data in the simulation study, where N indicates the total number of samples and G indicates the total number of genes. We assumed that the random error terms would each follow an independent standard normal distribution. Gene expression was generated independently. The regression coefficients, sample size, number of genes, and number of simulation replications were specified. Two p-values were obtained by testing the two null hypotheses, one per phenotype. Further details are discussed in Text S4 and Fig. S2. The results demonstrated that the four methods have similar power and that the power increases with the correlation (Fig. 1B). Fisher’s method under an independence assumption ($p_F$) has the highest power, particularly for a higher correlation; however, the high power is accompanied by an inflated Type I error, as mentioned in the previous section (Fig. 1A). The decorrelation method has the lowest power and deviates from the benchmark permutation method, particularly for a high correlation. The proposed method ($p_N$) has high power similar to that of the benchmark.

Evaluation of computation time

The simulation model for the evaluation of Type I error was then applied. We compared the computation time of the proposed method ($p_N$) with that of the benchmark permutation method for various sample sizes (N = 3,000, 5,000, and 7,000), gene numbers (G = 10,000, 20,000, and 30,000), correlation coefficients (0.1, 0.5, and 0.9), and numbers of permutations (2,000–20,000, in increments of 2,000). The results show that the computation time of the proposed method ($p_N$) does not change with the sample size, number of genes, or correlation coefficient (Fig. 1C). The computation time of the permutation method increases with the sample size, number of genes, and number of permutations, but it did not change as the correlation coefficient increased; it required approximately 2,500 h to compute 30,000 pairs of p-value combinations. Our proposed method was more computationally efficient, requiring < 10 h for computation. Because Fisher’s method under an independence assumption ($p_F$) and the decorrelation method do not involve complex resampling or integration procedures, they are computationally efficient.
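The dependence of the permutation method on the sample size and the number of permutations can be seen in a minimal sketch of a permutation test for a single gene. This is an illustrative toy, not the benchmark procedure used in the study: the association test uses a large-sample normal approximation for the correlation between expression and each phenotype rather than Student’s t test, and the data, names, and number of permutations are hypothetical.

```python
import math
import random
from statistics import NormalDist, mean

PHI = NormalDist().cdf


def assoc_pvalue(x, y):
    """Two-sided p-value for zero correlation between x and y, using the
    large-sample approximation r * sqrt(n) ~ N(0, 1) (an assumption)."""
    n = len(x)
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    r = sxy / math.sqrt(sxx * syy)
    return max(2.0 * PHI(-abs(r) * math.sqrt(n)), 1e-300)


def fisher_stat(pvalues):
    return -2.0 * sum(math.log(p) for p in pvalues)


def permutation_pvalue(x, y1, y2, n_perm=199, seed=1):
    """Empirical p-value of Fisher's statistic. Shuffling x breaks its
    link to both phenotypes while keeping the pairing of y1 and y2,
    so the correlation between the two tests is preserved."""
    rng = random.Random(seed)
    observed = fisher_stat([assoc_pvalue(x, y1), assoc_pvalue(x, y2)])
    x_perm = list(x)
    exceed = 0
    for _ in range(n_perm):  # each pass costs O(sample size)
        rng.shuffle(x_perm)
        stat = fisher_stat([assoc_pvalue(x_perm, y1), assoc_pvalue(x_perm, y2)])
        if stat >= observed:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)


# Hypothetical toy data: one gene, two phenotypes with correlated errors.
data_rng = random.Random(0)
n = 200
x = [data_rng.gauss(0, 1) for _ in range(n)]
shared = [data_rng.gauss(0, 1) for _ in range(n)]
y1 = [a + 0.7 * s + 0.7 * data_rng.gauss(0, 1) for a, s in zip(x, shared)]
y2 = [a + 0.7 * s + 0.7 * data_rng.gauss(0, 1) for a, s in zip(x, shared)]
p_empirical = permutation_pvalue(x, y1, y2)
```

Each gene requires its own set of permutations, and each permutation recomputes the tests over all samples, so the cost grows with the number of genes, the sample size, and the number of permutations. The smallest attainable empirical p-value is 1/(n_perm + 1), which is why small significance thresholds demand many permutations.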

Evaluation of estimation accuracy

We applied the model for the evaluation of Type I error and compared the estimation accuracy of three p-value combination methods (Fisher’s method under an independence assumption, the decorrelation method, and the proposed method) with that of the benchmark permutation method. Correlation coefficients of 0.3, 0.5, and 0.7 were considered. The results demonstrate that all three methods deviate from the benchmark as the correlation increases (Fig. 1D–1F). In addition, the decorrelation method exhibits a certain proportion of outliers (Fig. 1D–1F). The biased estimation can be explained by the order-noninterchangeable property (Text S5) and the non-uniformity property (Fig. S3) of the decorrelation method. Compared with Fisher’s method and the decorrelation method, the proposed method ($p_N$) is closest to the benchmark. Thus, the proposed method provides a more accurate estimate than do Fisher’s method and the decorrelation method.
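The order-noninterchangeable property can be made concrete for two p-values. The sketch below assumes a common form of the decorrelation procedure: p-values are mapped to normal scores, the scores are multiplied by the inverse of the Cholesky factor of the correlation matrix, mapped back to p-values, and then combined with Fisher’s method. Whether this matches the procedure of [15] in every detail is an assumption, and the function name and inputs are hypothetical.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()


def decorrelated_fisher_two(p_first, p_second, rho):
    """Decorrelate two p-values in the given order, then apply Fisher's
    method for two independent p-values (assumed form of the procedure)."""
    z1 = STD_NORMAL.inv_cdf(1.0 - p_first)
    z2 = STD_NORMAL.inv_cdf(1.0 - p_second)
    # Inverse Cholesky step for the 2 x 2 correlation matrix: the first
    # score is kept, the second is adjusted for the first.
    z2_star = (z2 - rho * z1) / math.sqrt(1.0 - rho * rho)
    q1 = p_first
    q2 = 1.0 - STD_NORMAL.cdf(z2_star)
    c = q1 * q2
    return c * (1.0 - math.log(c))


# Hypothetical p-values combined in both orders with correlation 0.5.
p_order_a = decorrelated_fisher_two(0.01, 0.20, rho=0.5)
p_order_b = decorrelated_fisher_two(0.20, 0.01, rho=0.5)
```

The two orders give different combined p-values because the Cholesky factor treats the first p-value differently from the second; with zero correlation the adjustment vanishes and both orders coincide with Fisher’s method.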

Real data applications

Meta-GWAS for rheumatoid arthritis

This meta-analysis identified SNPs associated with rheumatoid arthritis on the basis of two large-scale population-based GWASs: the North American Rheumatoid Arthritis Consortium (NARAC) data [16] and the Wellcome Trust Case Control Consortium (WTCCC) data [17]. In each of the two GWASs, a logistic regression analysis with covariate adjustment for sex and SNP coding based on an additive genetic model was performed to examine the genetic associations between rheumatoid arthritis disease status and individual SNP markers. At each SNP, p-values based on the NARAC and WTCCC data were obtained separately. We applied Fisher’s method ($p_F$) and our method ($p_N$) to combine the two p-values at each SNP locus in the NARAC and WTCCC data. There were 4,963 statistical tests (one for each of the 4,963 SNPs on chromosome 6) in this meta-GWAS. Bonferroni’s correction [18] for multiple testing was performed to obtain adjusted p-values for $p_F$ and $p_N$ separately. The results demonstrate that both methods could be used to identify the major histocompatibility complex (MHC) region on chromosome 6p21.3 (Fig. 2), which is strongly associated with rheumatoid arthritis [19], [20]. Our method enabled us to identify the SNP rs9391858, which is truly associated with rheumatoid arthritis [21] (Fig. 2); however, the individual GWASs could not detect this SNP in either NARAC or WTCCC (Fig. S4). Fisher’s method, but not ours, identified six false-positive SNPs: rs2394102, rs11752073, rs12697946, rs9394169, rs3818528, and rs3130014 (Fig. 2).
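The Bonferroni adjustment used here is simple arithmetic and can be sketched as follows. The number of tests (4,963) comes from the text; the significance level of 0.05 is an assumption for illustration, as are the function names and the example p-values.

```python
def bonferroni_adjust(pvalues, n_tests=None):
    """Bonferroni-adjusted p-values: multiply by the number of tests
    and cap the result at 1."""
    m = len(pvalues) if n_tests is None else n_tests
    return [min(1.0, p * m) for p in pvalues]


def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold that keeps the family-wise
    error rate at alpha."""
    return alpha / n_tests


# 4,963 SNPs on chromosome 6; alpha = 0.05 is an assumed level.
threshold = bonferroni_threshold(0.05, 4963)

# Hypothetical combined p-values for three SNPs out of the 4,963 tests.
adjusted = bonferroni_adjust([1e-8, 2e-5, 0.01], n_tests=4963)
```

Comparing a raw p-value with the per-test threshold is equivalent to comparing the adjusted p-value with the assumed significance level.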

Fig. 2

Manhattan plots for chromosome 6 in the meta-GWAS for rheumatoid arthritis. This meta-GWAS contained 4,963 SNPs on chromosome 6. Fisher’s method ($p_F$) and our method ($p_N$) were employed. Each point indicates a SNP. The x-axis indicates the physical position of a SNP. The y-axis indicates the p-value on a –log10 scale. The green lines indicate false-positive events identified by Fisher’s method ($p_F$) but not by our method ($p_N$); they involved six SNPs: rs11752073, rs2394102, rs3130014, rs3818528, rs9394169, and rs12697946. The orange lines indicate false-positive events identified by both Fisher’s method ($p_F$) and our method ($p_N$). The light blue line indicates the SNP known to be associated with rheumatoid arthritis and identified by Fisher’s method ($p_F$) and our method ($p_N$), but not by either of the two individual studies (Fig. S4). The red dashed line indicates the significance level after Bonferroni correction for multiple testing.

Meta-TWAS for rheumatoid arthritis

This meta-analysis was performed to identify genes differentially expressed between patients with rheumatoid arthritis and normal controls of European descent on the basis of two large-scale TWASs: Eyre et al. [22] and Stahl et al. [23]. In each TWAS, the gene-level p-values used to examine the association of rheumatoid arthritis disease status with gene expression were downloaded from webTWAS [24]. However, raw gene expression data were unavailable on the website. When the p-value data were downloaded, Elastic-net was used as the model for transcriptome data prediction in the MetaXcan framework [25]. We applied Fisher’s method ($p_F$) and our method ($p_N$) to combine the two p-values from the two TWASs. There were 3,175 statistical tests (one for each of the 3,175 genes overlapping between the two studied TWAS datasets) in this meta-TWAS. Bonferroni’s correction [18] for multiple testing was performed to obtain adjusted p-values for $p_F$ and $p_N$ separately. The results indicate that both methods identified the MHC region as a key genomic region for rheumatoid arthritis (Fig. 3). Fisher’s method identified 11 genes outside the MHC region. Except for ANKRD55, which was a true positive, the other 10 genes were false positives. Our method produced no false positives but failed to detect ANKRD55. Neither our method nor Fisher’s method identified additional rheumatoid arthritis genes beyond those detected by the individual TWASs; nevertheless, our method demonstrated a more significant signal for several rheumatoid arthritis genes. For instance, this was the case for AFF3 on chromosome 2 and IRF5 on chromosome 7 when the p-values in the TWAS of Eyre et al. [22], the TWAS of Stahl et al. [23], and this meta-TWAS were compared (Fig. S5). Thus, this real data analysis demonstrated the usefulness of our meta-TWAS for inference.

Fig. 3

Manhattan plots for 22 autosomes in the meta-TWAS for rheumatoid arthritis. This meta-TWAS contained 3,175 genes overlapping between the two studied TWAS datasets. Fisher’s method ($p_F$) and our method ($p_N$) were employed. Each point indicates a gene. The x-axis indicates the physical position of a gene in an autosome. The y-axis indicates the p-value on a –log10 scale. The 11 purple lines indicate the genes for which Fisher’s method ($p_F$) reported a genetic association but our method ($p_N$) did not. All of these genes except ANKRD55 were false positives. The red dashed line indicates the significance level after Bonferroni correction for multiple testing.

Meta-TWAS for asthma

We evaluated the performance of the proposed method in combining p-values from more than two studies. We downloaded the gene-level p-value data of four large-scale studies for asthma from webTWAS [24]: Canela-Xandri et al. [26], Zhu et al. [27], Zhu et al. [28], and Demenais et al. [29]. We analyzed 15 genes, including 10 asthma-associated genes in the MHC region (HLA-G, ATP6V1G2, TAP1, TRIM10, HLA-DRB1, LST1, HLA-DRB5, DDX39B, MSH5, and HLA-A) and five genes that are located outside the MHC region and have not been reported to be associated with asthma (PHIP, EED, PYGB, SMARCD2, and BORCS8) (Table S1). We applied Fisher’s method ($p_F$) and our method ($p_N$) to identify differentially expressed genes in this meta-TWAS. The p-values adjusted with Bonferroni’s correction for multiple testing are provided (Table 1). Fisher’s method ($p_F$) identified all ten asthma-associated genes but also identified a high proportion of false-positive genes. Our method ($p_N$) identified most of the asthma-associated genes, except for HLA-DRB5 (adjusted p-value = 0.069), and controlled false positives well. The results suggest that our method performs well, and better than Fisher’s method, in combining more than two p-values.

Table 1

Group	Gene name	Chromosome	pF	pNw	pN
Asthma-related	HLA-G	Chr. 6	0.0000127	0.0244885	0.0012742
Asthma-related	ATP6V1G2	Chr. 6	0.0005959	0.0030811	0.0168662
Asthma-related	HLA-DRB5	Chr. 6	0.0059697	0.0386449	0.0689027
Asthma-related	TAP1	Chr. 6	0.0003276	0.0025073	0.0118823
Asthma-related	TRIM10	Chr. 6	0.0000009	0.0002722	0.0000589
Asthma-related	HLA-DRB1	Chr. 6	0.0000018	0.0004108	0.0003191
Asthma-related	LST1	Chr. 6	0.0000009	0.0002963	0.0000931
Asthma-related	DDX39B	Chr. 6	0.0000008	0.0001398	0.0000000
Asthma-related	MSH5	Chr. 6	0.0005163	0.0009750	0.0154954
Asthma-related	HLA-A	Chr. 6	0.0016827	0.0046854	0.0318339
Asthma-unrelated	PHIP	Chr. 6	0.0418319	0.0986147	0.2299088
Asthma-unrelated	EED	Chr. 11	0.0094608	0.0100037	0.0915084
Asthma-unrelated	PYGB	Chr. 20	0.0124950	0.0805022	0.1086624
Asthma-unrelated	SMARCD2	Chr. 17	0.0131833	0.0153122	0.1123372
Asthma-unrelated	BORCS8	Chr. 19	0.0233820	0.1422346	0.1601654

Our method performs better than Fisher’s method when combining more than two p-values. Ten asthma-related genes and five asthma-unrelated genes were analyzed in this meta-TWAS. Adjusted p-values of Fisher’s method (pF), our method with equal weights (pN), and our method with unequal weights (pNw) are provided. All the p-values were adjusted using Bonferroni’s correction for multiple testing. The numbers marked in bold are statistically significant.

Furthermore, our method can assign different weights to p-values in different TWASs. Here, the sample size of a study relative to the total sample size in the four TWASs was used as its weight; that is, a higher weight was assigned to a study with a larger sample size. Our weighted method (pNw) identified all ten asthma-associated genes, including HLA-DRB5, which was not detected by the equal-weighted method (pN). However, the weighted method produced some false-positive findings, such as EED and SMARCD2, in this analysis.

Conclusion and discussion

In this study, we proposed a novel numerical integration method ($p_N$) to evaluate statistical significance by combining correlated p-values from multiple studies in a meta-analysis. The proposed method is simple in concept and flexible with respect to various correlation structures. Our theoretical investigation and simulation studies demonstrated that our proposed method performs well in terms of Type I error, statistical power, computing efficiency (regardless of the sample size), and statistical significance estimation accuracy. Real applications in large-scale GWASs and TWASs for rheumatoid arthritis and asthma facilitated efficient identification of critical genomic regions, such as the MHC region, and of genes associated with rheumatoid arthritis and asthma that were not reported in the previous GWASs or TWASs. We developed Pbine (https://github.com/Yinchun-Lin/Pbine) for meta-analysis based on a combination of p-values from multiple studies. Our method can combine p-values from more than two studies with different sample sizes and precision so as to reflect different levels of importance and information. The incorporation of unequal weights into a p-value combination for more than two studies has been implemented in Pbine. The results of our simulation studies and real data analyses demonstrated that our method outperforms Fisher’s method. We discussed the weakness of the decorrelation method, namely its order-noninterchangeable property. Although the decorrelation p-value can be calculated in either an ascending or a descending order of p-values, we showed that the p-values of these two procedures violate the uniformity property under a null distribution: the ascending-order method tends to yield more false negatives and the descending-order method more false positives, particularly when the between-study correlation of p-values is high (Fig. S3). When we applied the ascending-order and descending-order decorrelation methods in the meta-GWAS and meta-TWAS for rheumatoid arthritis, we did find a number of false-positive and false-negative findings (Table S2). 
In addition to the methods discussed in this paper, studies have reported other p-value combination methods [30]. Some of these methods depend on p-value independency assumptions [31], [32], parametric assumptions [33], [34], and mathematical approximations such as Satterthwaite’s approximation [35], [36]. These methods may be efficient in computation. However, when their assumptions are violated, their performance is negatively affected by inflated Type I error, particularly when significance level is low [37]. In addition, several methods have been developed on the basis of a generalization of Fisher’s product p-value method, such as the weighted [38], truncated [15], and rank-truncated [39] product p-value methods. Our method can be generalized to more complicated cases.

CRediT authorship contribution statement

Yin-Chun Lin: Methodology, Software, Formal analysis, Writing – original draft, Writing – review & editing. Yu-Jen Liang: Data curation, Resources. Hsin-Chou Yang: Conceptualization, Methodology, Writing – original draft, Writing – review & editing, Resources, Supervision, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
