Literature DB >> 31596850

Bayesian multivariate reanalysis of large genetic studies identifies many new associations.

Michael C Turchin1, Matthew Stephens1,2.   

Abstract

Genome-wide association studies (GWAS) have now been conducted for hundreds of phenotypes of relevance to human health. Many such GWAS involve multiple closely-related phenotypes collected on the same samples. However, the vast majority of these GWAS have been analyzed using simple univariate analyses, which consider one phenotype at a time. This is despite the fact that, at least in simulation experiments, multivariate analyses have been shown to be more powerful at detecting associations. Here, we conduct multivariate association analyses on 13 different publicly-available GWAS datasets that involve multiple closely-related phenotypes. These data include large studies of anthropometric traits (GIANT), plasma lipid traits (GlobalLipids), and red blood cell traits (HaemgenRBC). Our analyses identify many new associations (433 in total across the 13 studies), many of which replicate when follow-up samples are available. Overall, our results demonstrate that multivariate analyses can help make more effective use of data from both existing and future GWAS.

Entities:  

Mesh:

Year:  2019        PMID: 31596850      PMCID: PMC6802844          DOI: 10.1371/journal.pgen.1008431

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


Introduction

Genome wide association studies (GWAS) have been widely used to identify genetic factors—particularly single nucleotide polymorphisms (SNPs) and copy number variations (CNVs)—associated with human disease risk and other phenotypes of interest [1, 2]. Indeed, at time of writing over 24,000 such associations have been identified as ‘genome-wide significant’ [3]. The vast majority of these many genetic association analyses consider only one phenotype at a time (“univariate association analysis”). This is despite the fact that measurements on multiple phenotypes are often available, and joint association analysis of multiple phenotypes (“multivariate association analysis”) can substantially increase power [4-8]. There are likely multiple reasons for the preponderance of univariate analyses. One possible reason is that initial association analyses are usually performed under tight time constraints, and at a time when many other analysis issues (e.g. quality control, population stratification) are competing for attention. In these conditions it makes sense to focus on the simplest possible approach that will quickly yield new associations, without overly worrying about loss of efficiency. In addition analysts may be legitimately concerned that deviation from the most widely adopted analysis pipeline may invite unwanted additional reviewer attention. Nonetheless, we believe that multivariate association analysis has an important role to play in making the most of costly and time-consuming GWAS studies. One way forward is to conduct multivariate analyses of previously-published GWAS, checking for additional associations that may have been missed by the initial univariate association analyses. This is greatly facilitated by the fact that many GWAS now make summary data from single-SNP tests freely available [9-13], and that simple multivariate analysis can be conducted using such summary data [14-16]. Here we demonstrate the potential benefits of reanalyzing published GWAS using multivariate methods. Specifically we apply multivariate methods from [14] to reanalyze 13 different GWAS whose initial publications reported only univariate results. In most cases our multivariate analyses find many new associations. For example, in GIANT 2014/5 we find over 150 new associations. In studies with multiple data releases, we find that new multivariate associations found in initial releases typically replicate in subsequent releases, supporting that many of the new associations are likely real. We also demonstrate that the multivariate approach is not equivalent to simply relaxing the univariate GWAS significance threshold. Finally, we point out some limitations of the specific framework we used here, and suggest some alternative strategies that may help address those limitations in future multivariate GWAS analyses.

Results

Multivariate association analyses

To facilitate multivariate association analyses using the methods from [14], we implemented them in an R package bmass (Bayesian multivariate analysis of summary statistics). The software requires as input univariate GWAS summary statistics, for the same set of SNPs, on d related phenotypes. (The derivations in [14] are for quantitative phenotypes, but the methods can also be applied to summary data from binary phenotypes, which can be interpreted as making a normal approximate to the likelihood for the effect sizes as in [17].) Then, for each SNP, it attempts to categorize each phenotype as belonging to one of three categories: Unassociated, Directly Associated, or Indirectly Associated with the SNP. The difference between D and I is that an indirect association disappears after controlling for associations with other phenotypes (see Methods and S1 Fig). For d phenotypes, there are 3 possible assignments of phenotypes to these 3 categories, and each assignment corresponds to a different “model” γ. For example, one model corresponds to the “null” that all phenotypes are Unassociated; another model corresponds to the model that all phenotypes are Directly associated; another model corresponds to just the first phenotype being Directly associated, etc. The goal of the association analysis is to determine which of these models is consistent with the data and, in particular, to assess overall evidence against the null model. The support in the data for model γ, relative to the null model, is summarized by a Bayes Factor (BF). Large values of BF indicate strong evidence for model γ compared against the null. One advantage of Bayes Factors over p-values is that the Bayes Factors from different models can be easily compared and combined. For example, the overall evidence against the null is given by the (weighted) average of these BFs: where the weights w are chosen to reflect the relative plausibility of each model γ. In bmass we implemented the Empirical Bayes approach from [14] that learns appropriate weights from the data (see Methods).

Comparisons with published univariate analyses

To provide a benchmark against which to compare our multivariate analysis results, we compiled a list of “previous univariate associations”: SNPs that were both reported as significant in the original publication and exceeded the original publication’s definition for genome-wide significance in at least one phenotype in the publicly-available (univariate) summary data analyzed here. This does not include all SNPs reported in every original publication because in some studies SNPs became genome-wide significant only after adding additional samples not included in the publicly available summary data. We used these previous univariate associations to determine a significance threshold for our multivariate associations. Specifically, we declared a multivariate association as significant if its BFav exceeds that of any previous univariate association’s BFav in the same study [14]. The rationale is that the evidence for these multivariate associations exceeds the evidence for previously-reported genome-wide significant associations, which are generally regarded as likely to be (mostly) real associations. Finally, we defined a list of “new multivariate associations”, which are SNPs that are significant in our multivariate analysis but are not a “previous univariate association”. To avoid double-counting of signals due to linkage disequilibrium (LD), we pruned the list of new multivariate associations so that they are all at least 0.5Mb apart. For additional details, see Methods.

Many new loci identified in reanalyzing 13 publicly available GWAS studies

We applied bmass to 13 publicly available GWAS studies, representing 10 different collections of phenotypes (Table 1). Phenotypic collections include blood lipid traits (GlobalLipids: [9, 18]), body morphological traits (GIANT: [10–12, 19–21]), red blood cell traits (HaemgenRBC: [13, 22]), blood pressure traits [23, 24], bone density traits [25], and kidney function traits [26, 27]. For three of these phenotypic collections (GlobalLipids, GIANT, and HaemgenRBC), two different releases were available from the source consortiums. We conducted basic QC as described in Methods.
Table 1

Dataset summary.

DatasetReleaseNPhenotypes
GlobalLipids201095454LDL, HDL, TC, TGa
2013188577LDL, HDL, TC, TG
GIANT201077167Height, BMI, WHRadjBMIb
2014/5224459Height, BMI, WHRadjBMI
HaemgenRBC2012135367RBC, PCV, MCV, MCH, MCHC, Hbc
2016173480RBC, PCV, MCV, MCH, MCHC, Hb
ICBP201169395SBP, DBP, PP, MAPd
MAGIC201046186FstIns, FstGlu, HOMA_B, HOMA_IRe
GEFOS201532965FA, FN, LSf
GIS201448972Iron, Sat, TrnsFrn, Log10Frtng
SSGAC2016343072NEB_Pooled, AFB_Pooledh
CKDGen2010/167093Crea, Cys, CKD, UACR, MAi
ENIGMA2201530717ICV, Accumbens, Amygdala, Caudate, Hippocampus, Pallidum, Putamen, Thalamusj

N is the maximum number of samples contributing to each study.

a—Low-Density Lipoproteins (LDL), High-Density Lipoproteins (HDL), Total Cholesterol (TC), Total Triglycerides (TG)

b—Body Mass Index (BMI), Waist-Hip Ratio adjusted for BMI (WHRadjBMI)

c—Red Blood Cell Count (RBC), Packed Cell Volume (PCV), Mean Cell Volume (MCV), Mean Cell Haemoglobin (MCH), Mean Cell Haemoglobin Concentration (MCHC), Haemoglobin (Hb)

d—Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Pulse Pressure (PP), Mean Arterial Pressure (MAP)

e—Fasting Insulin (FstIns), Fasting Glucose (FstGlu), Homeostatic Model Assessment of Beta Cell Function (HOMA_B), Homeostatic Model Assessment of Insulin Resistance Function (HOMA_IR)

f—Forearm Bone Mineral Density (FA), Femoral Neck Bone Mineral Density (FN), Lumbar Spine Bone Mineral Density (LS)

g—Serum Iron (Iron), Serum Transferrin Saturation (Sat), Serum Transferrin (TrnsFrn), Log-Transformed Ferritin (Log10Frtn)

h—Number of Children Ever Born, Male & Female (NEB_Pooled), Age at First Birth, Male & Female (AFB_Pooled)

i—Serum Creatine (Crea), Serum Cystatin (Cys), Chronic Kidney Disease (CKD), Urinary Albumin-to-Creatine Ratio (UACR), Microalbuminuria (MA)

j—Intracranial Volume (ICV), specified subcortical brain structures refer to MRI-derived volume measurements for each one

N is the maximum number of samples contributing to each study. a—Low-Density Lipoproteins (LDL), High-Density Lipoproteins (HDL), Total Cholesterol (TC), Total Triglycerides (TG) b—Body Mass Index (BMI), Waist-Hip Ratio adjusted for BMI (WHRadjBMI) c—Red Blood Cell Count (RBC), Packed Cell Volume (PCV), Mean Cell Volume (MCV), Mean Cell Haemoglobin (MCH), Mean Cell Haemoglobin Concentration (MCHC), Haemoglobin (Hb) d—Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Pulse Pressure (PP), Mean Arterial Pressure (MAP) e—Fasting Insulin (FstIns), Fasting Glucose (FstGlu), Homeostatic Model Assessment of Beta Cell Function (HOMA_B), Homeostatic Model Assessment of Insulin Resistance Function (HOMA_IR) f—Forearm Bone Mineral Density (FA), Femoral Neck Bone Mineral Density (FN), Lumbar Spine Bone Mineral Density (LS) g—Serum Iron (Iron), Serum Transferrin Saturation (Sat), Serum Transferrin (TrnsFrn), Log-Transformed Ferritin (Log10Frtn) h—Number of Children Ever Born, Male & Female (NEB_Pooled), Age at First Birth, Male & Female (AFB_Pooled) i—Serum Creatine (Crea), Serum Cystatin (Cys), Chronic Kidney Disease (CKD), Urinary Albumin-to-Creatine Ratio (UACR), Microalbuminuria (MA) j—Intracranial Volume (ICV), specified subcortical brain structures refer to MRI-derived volume measurements for each one Our multivariate analyses identify, in total, hundreds of new associations. The numbers of previous univariate associations and new multivariate associations are summarized in Fig 1 (see also Table 2). For example, we identify 162 new multivariate associations in GIANT2014/5, 65 in GlobalLipids2013, and 60 in HaemgenRBC2016. These represent power increases from 10% to 45% compared with previous univariate analyses.
Fig 1

Number of independent significant SNPs, by study.

The barplot shows the number of independent SNPs that were significant in previous univariate analyses (blue) and the number of additional significant associations in our new multivariate analyses (red). For univariate analysis, significance levels were set by the original study. For multivariate analyses, we declared a SNP to be significant if its weighted average Bayes Factor (BFav) exceeded that of the smallest BFav among the previous univariate significant SNPs. We considered SNPs more than.5Mb apart to be independent. See Table 1 and Methods for phenotype details, Methods for further analysis details, and S2–S4 Tables for lists of significant SNPs from each dataset.

Table 2

Summary of new multivariate associations identified.

—SNP Associations—
DatasetReleasePrevious UnivariateNew MultivariateBFav ThreshOverlap With Next Release
GlobalLipids2010102194.3513/19
2013145654.29-
GIANT2010144604.1149/60
2014/57241624.49-
HaemgenRBC201263165.219/16
2016610604.68-
ICBP201122225.24-
MAGIC20101216.90-
GEFOS201534135.06-
GIS2014857.04-
SSGAC2016915.43-
CKDGen2010/12864.10-
ENIGMA22015537.48-

Previous Univariate: the number of previous genome-wide significant univariate associations based on the publicly available summary data. New Multivariate: the number of new genome-wide significant multivariate associations. BFav Thresh: the Bayes Factor threshold used in declaring new multivariate associations to be significant. Overlap With Next Release: for GlobalLipids2010, GIANT2010, and HaemgenRBC2012, the last column shows the number of new multivariate associations that overlap with the univariate GWAS associations in the next release from the same consortium; overlap is defined as being within 50kb of the univariate GWAS variant.

Number of independent significant SNPs, by study.

The barplot shows the number of independent SNPs that were significant in previous univariate analyses (blue) and the number of additional significant associations in our new multivariate analyses (red). For univariate analysis, significance levels were set by the original study. For multivariate analyses, we declared a SNP to be significant if its weighted average Bayes Factor (BFav) exceeded that of the smallest BFav among the previous univariate significant SNPs. We considered SNPs more than.5Mb apart to be independent. See Table 1 and Methods for phenotype details, Methods for further analysis details, and S2–S4 Tables for lists of significant SNPs from each dataset. Previous Univariate: the number of previous genome-wide significant univariate associations based on the publicly available summary data. New Multivariate: the number of new genome-wide significant multivariate associations. BFav Thresh: the Bayes Factor threshold used in declaring new multivariate associations to be significant. Overlap With Next Release: for GlobalLipids2010, GIANT2010, and HaemgenRBC2012, the last column shows the number of new multivariate associations that overlap with the univariate GWAS associations in the next release from the same consortium; overlap is defined as being within 50kb of the univariate GWAS variant.

Replication of multivariate associations across releases

To demonstrate that many of these new multivariate associations are likely to be real we take advantage of three datasets that each have two releases separated by several years (GlobalLipids, GIANT, and HaemgenRBC). In each case we performed multivariate association analysis of the earlier release and checked how the new multivariate associations fared in univariate analyses of the later release (Fig 2). Since later releases include the samples from earlier releases, to assess “replication” we focus on whether the association in the new release is more significant than the original release—that is, whether the signal in the new (non-overlapping) samples provides additional evidence over and above the original signal. By this measure the results show high replication rates for the new multivariate associations: in total, 84 of 94 new associations have smaller minimum univariate p-values in the later release (at exactly the same SNP), and indeed the majority of these reach univariate GWAS significance in the later release.
Fig 2

Replication of new multivariate associations.

The figure shows results based on earlier and later releases from studies with multiple releases (GlobalLipids, GIANT, and HaemgenRBC). Each point represents a new multivariate association identified in our multivariate analysis of the earlier release. The x- and y-axes show the minimum (across phenotypes) of the -log10 univariate p-values from the earlier release (x-axis) vs. the later release (y-axis). Dashed red lines represent the univariate significance GWAS thresholds used for each study’s releases. Across all three studies, 84 out of 94 new multivariate associations from the earlier releases have smaller minimum univariate p-values in the later release, and 68 out of 84 new multivariate associations that did not reach GWAS significance in the earlier release do so in the later release (see S5 Table for a per-dataset breakdown).

Replication of new multivariate associations.

The figure shows results based on earlier and later releases from studies with multiple releases (GlobalLipids, GIANT, and HaemgenRBC). Each point represents a new multivariate association identified in our multivariate analysis of the earlier release. The x- and y-axes show the minimum (across phenotypes) of the -log10 univariate p-values from the earlier release (x-axis) vs. the later release (y-axis). Dashed red lines represent the univariate significance GWAS thresholds used for each study’s releases. Across all three studies, 84 out of 94 new multivariate associations from the earlier releases have smaller minimum univariate p-values in the later release, and 68 out of 84 new multivariate associations that did not reach GWAS significance in the earlier release do so in the later release (see S5 Table for a per-dataset breakdown).

Multivariate analysis is different from multiple univariate analyses

Because multivariate analysis takes account of joint patterns across phenotypes, its ranking of significance of SNPs can change compared with that from the univariate p-values alone. That is, multivariate analysis is not simply equivalent to multiple univariate analyses. To illustrate this we examined, in three well-powered studies, the associations that would be declared significant if the univariate significance threshold were relaxed, and assessed which of them would also be significant in our multivariate analysis (i.e. we assess whether, if we go deeper into the univariate results, we find the same SNPs as appear in our multivariate results). The results are shown in Fig 3. Although there is, understandably, substantial overlap between the significant SNPs, any non-trivial relaxation of the univariate threshold includes many SNPs that are not multivariate significant in our analysis; for example, at a univariate threshold of 5 × 10−7 only 66% of the univariate significant SNPs are also multivariate significant across these three studies. This demonstrates that, indeed, our multivariate approach reorders significance of SNPs compared with multiple univariate analyses.
Fig 3

Comparison of new multivariate hits vs. relaxing univariate p-value threshold.

For each data set the graph shows how many associations become significant as the univariate p-value threshold is relaxed (moving from right to left on the x-axis), and how many of these are declared as new multivariate hits in our analysis. In both cases results are pruned to avoid counting associations of SNPs in strong LD; see Methods for details. The appearance of appreciable blue areas indicates that the multivariate analysis is reordering the significance of SNPs compared with performing multiple univariate analyses.

Comparison of new multivariate hits vs. relaxing univariate p-value threshold.

For each data set the graph shows how many associations become significant as the univariate p-value threshold is relaxed (moving from right to left on the x-axis), and how many of these are declared as new multivariate hits in our analysis. In both cases results are pruned to avoid counting associations of SNPs in strong LD; see Methods for details. The appearance of appreciable blue areas indicates that the multivariate analysis is reordering the significance of SNPs compared with performing multiple univariate analyses.

Reanalysis also identifies new univariate associations

During our multivariate reanalyses we noticed many SNPs that appeared to be genome-wide univariate significant but were—somewhat mysteriously—not reported as such by the original studies (i.e. SNPs whose univariate p-values crossed the significance threshold, as defined by the given study, in at least one trait). S1 Table reports 79 such associations. There may be many reasons why such variants went unreported, but one reason may be physical proximity to a variant with a stronger signal. Indeed, more than half of the variants described above are within 1Mb of a previously-reported univariate GWAS association. Refraining from reporting multiple near-by associations seems a reasonable—if conservative—strategy to avoid reporting redundant associations due to LD. Further, even when redundant associations due to LD can be ruled out (e.g. by directly examining LD rather than by simply using physical distance), it might be assumed that multiple nearby associated variants may all act through the same biological mechanism and therefore provide redundant biological insights. However, we found that multi-phenotype patterns of association can differ between nearby SNPs, suggesting that they act through different mechanisms. To highlight just one example, consider rs7515577—which is an original univariate association in GlobalLipids2010—and rs12038699—which is a new multivariate association in GlobalLipids2013. We note that rs12038699 actually reached univariate genome-wide significance in the GlobalLipids2013 dataset, but was not reported (S6 Table). This is possibly because the latter SNP is relatively close, in genomic terms, to the former SNP (549kb). However, these SNPs are not in strong LD (r2 = .08), and so these associations almost certainly represent non-redundant associations. This is further supported by the effect sizes in each phenotype, which clearly reveal very different multivariate patterns of effect sizes among phenotypes (S2 Fig & S6 Table). Indeed the very different multivariate patterns of effect size suggest that not only are these associations non-redundant but likely involve different biological mechanisms as well. These results suggest that, moving forward, it may pay to be more careful in designing filters designed to avoid reporting redundant associations, and that multi-phenotype analyses may have a helpful role to play here.

Limitations

One goal of the multivariate approach introduced in [14] was to increase interpretability of multivariate analyses; in particular, the goal was to not only test for associations but also to help explain the associations by partitioning the phenotypes into “Unassociated”, “Directly Associated”, and “Indirectly Associated” categories. In principle one might hope to use these classifications to gain insights into the relationships among phenotypes and also perhaps to identify different “types” of multivariate association—effectively clustering associations into different groups. However, in practice we find that these discrete classifications are often not as helpful as one might hope. One reason is the difficulty of reliably distinguishing between direct and indirect effects [14]. Another reason is widespread associations with multiple phenotypes. Indeed, we find that, consistently across data sets, the most common multivariate models involve associations—either direct or indirect—with many phenotypes (S7 Table) and many SNPs are classified as being associated with many phenotypes (Fig 4A). Further, SNPs are very rarely confidently classified as “Unassociated” with any phenotype (Fig 4B). This last observation can be explained by the fact that it is essentially impossible to distinguish ‘unassociated’ from ‘weakly associated’. Nonetheless when all SNPs show similar classifications it is difficult to get insights into different patterns of multivariate association.
Fig 4

Distribution, across significant SNPs, of number of phenotypes that are confidently associated (A) or confidently unassociated (B).

Results are shown for three well-powered datasets: GlobalLipids2013, GIANT2014/5, and HaemgenRBC2016. Here “confident” means with probability >0.95, so a SNP is considered “confidently associated” with a phenotype if the sum of its probabilities in the “Directly Associated” and “Indirectly Associated” categories exceeds 0.95 (A), and is considered confidently unassociated with the phenotype if this probability is less than 0.05 (B). The set of significant SNPs includes both previous univariate associations and new multivariate associations.

Distribution, across significant SNPs, of number of phenotypes that are confidently associated (A) or confidently unassociated (B).

Results are shown for three well-powered datasets: GlobalLipids2013, GIANT2014/5, and HaemgenRBC2016. Here “confident” means with probability >0.95, so a SNP is considered “confidently associated” with a phenotype if the sum of its probabilities in the “Directly Associated” and “Indirectly Associated” categories exceeds 0.95 (A), and is considered confidently unassociated with the phenotype if this probability is less than 0.05 (B). The set of significant SNPs includes both previous univariate associations and new multivariate associations. Moving forward, rather than relying on the discrete classifications of “Unassociated”, “Directly Associated”, and “Indirectly Associated” to identify different patterns of multivariate association, we believe it will be more fruitful to use multivariate methods that take a more quantitative approach, such as identifying different patterns of effect size (including direction of effect) among phenotypes [28]. Focusing on effect sizes has the potential to be much more informative than discrete classification, which can hide effect size differences. For example, when multiple SNPs are classified as associated with all phenotypes, they can still show very different patterns of estimated effect sizes/direction (see S3 Fig). Another limitation of our multivariate methods is that they can lead to (what appear to be) false positive associations when applied to test SNPs with very low minor allele frequencies. Specifically we saw examples where very low-frequency SNPs (e.g. MAF < .001) showed strong signals of multivariate association despite showing very little signal in any univariate test. Although such results are not impossible, we believe that most of these cases were likely false positives, and we applied a MAF cut-off (of 0.01 or 0.005) to avoid these issues. Consequently we recommend caution in interpreting results of multivariate analyses at very low-frequency SNPs, and more generally we recommend that multivariate results be compared against univariate results to check they make sense—highly significant multivariate associations that do not also show at least a moderate level of univariate association should be treated with caution.

Discussion

We reanalyzed 13 publicly available GWAS datasets using a Bayesian multivariate approach and identified many new genetic associations. Turning genetic associations into biological discoveries remains, of course, a challenging problem. Nonetheless, our results suggest that the increased power of multivariate association analysis that has been reported in many simulation studies [8, 14, 29] also translates to discovery of many new associations in practice. Our results exploit the public availability of summary data from several large GWAS. Despite progress toward easier availability of individual-level data for large studies [30], in many cases summary data remain much easier to obtain and work with; there are big practical advantages as well to modular pipelines that first compute summary data and then use these as inputs to subsequent (more sophisticated) analyses. For example, the multivariate analyses we present here are simplified by assuming that the summary data were computed while adequately adjusting for population stratification and other relevant covariates (indeed, our current bmass software implementation does not allow adjustment for covariates, and so any adjustment must be done in the univariate analyses). And our results illustrate the potential for reanalysis of summary data to yield novel inferences. In this regard we also emphasize the importance of consortia releasing carefully-chosen summaries. For example, Z-scores are much more helpful than p-values because they preserve information on the direction of the effect. Even better would be both the effect size and standard error that created the Z-score. More generally, it is always helpful to include additional key meta-data (e.g. the reference allele, or effect allele, the minor allele frequency, and sample size). The specific multivariate methods used here were derived under the assumption that the summary data from each phenotype has been obtained from the same sampled individuals (which is true, at least approximately, for studies analyzed here). However, multivariate analysis of summary data is also possible even when data were obtained from different samples for each phenotype. The main difference between these settings is that, for data from overlapping samples, the “noise” is correlated as well as the signal: i.e. the summary data are correlated under the null due to sample overlap, and correlated under the alternative due to both sample overlap and any shared genetic effects. In contrast, for data from non-overlapping samples the noise is uncorrelated (whereas the signal may remain correlated if genetic factors are shared). Our methods use data at (empirically) null SNPs to estimate the noise correlation, and so their overall assessment of associations should be relatively robust to whether samples for different phenotypes overlap (however, our definitions of D (direct) vs I (indirect) associations requires the same samples to be measured across phenotypes.) Moving forward, we expect multivariate association analyses to play an increasingly important role in detecting and understanding genetic associations and relationships among phenotypes. Large studies are now collecting, and making available, rich human genetic and phenotypic information on many complex phenotypes, most notably the UKBioBank [30]. In addition, there are increasingly large studies linking genetic variation and molecular phenotypes such as gene expression (e.g. the GTEx project [31]), as well as epigenetic modifications and transcript degradation [32-35]. Analysis of multiple molecular traits can help yield insights into causal connections among traits [36], and joint analysis of molecular traits with complex phenotypes may also shed light on functional mechanisms (as in “co-localization” analyses [16, 37–39]). Even simply moving from single phenotype to pairwise analysis can shed considerable light on sharing of genetic effects and possible causal connections [15, 40]. These increasingly-complex new data also bring new analytic and computational challenges. Here we have restricted our analyses to relatively small sets of closely-related traits, and indeed the specific multivariate framework we used here—which performs an exhaustive search over all possible multivariate models—is fully tractable for only moderate numbers of traits (up to about 10). Scaling methods up to dealing with larger number of traits may well be helpful for some settings, and recent multivariate analysis methods can deal with dozens of outcomes [28, 41]. In addition, developing multivariate methods to perform fine-mapping of genetic associations simultaneously across multiple phenotypes [42] seems an important and challenging area for future work.

URLs

bmass R package: https://github.com/mturchin20/bmass.

Methods

GWAS datasets

Below are specific details regarding retrieval and data-processing for each dataset analyzed. Where applicable, these details include the sample size (N), minor allele frequency (MAF), and p-value thresholds that were applied (based on the thresholds used in the original publications). For each dataset variants were dropped if they satisfied at least one of the following criteria: did not contain information for every phenotype; had missing MAF; were fixed (MAF of 0); had effect size exactly 0 (i.e. direction of effect would be indeterminable); or did not contain the same reference and alternative alleles across each phenotype. For a handful of studies, external databases were used to retrieve chromosome, basepair information, and MAF based on rsID#; in these studies SNPs for which this information could not be retrieved were also dropped. GlobalLipids2010 [18]: Original merged, processed, and GWAS-hit annotated summary data from [14] for HDL, LDL, TG, and TC was downloaded from https://github.com/stephens999/multivariate (dtlesssignif.annot.txt and RSS0.txt). GlobalLipids2013 [9]: Summary data for HDL, LDL, TG, and TC was downloaded from http://csg.sph.umich.edu/abecasis/public/lipids2013/. We used a minimum N threshold of 50,000, a MAF threshold of 1%, and a univariate significant GWAS p-value threshold of 5 × 10−8. All variants were oriented to the HDL minor allele. The final merged and QC’d datafile contained 2,004,701 SNPs. rsID#’s of published GWAS SNPs were retrieved for all four phenotypes from https://www.nature.com/ng/journal/v45/n11/full/ng.2797.html via Supplementary Tables 2 and 3. GIANT2010 [19-21]: Summary data for Height, BMI, and WHRadjBMI were downloaded from https://www.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files. We used a minimum N threshold of 50,000, a MAF threshold of 1%, and a univariate significant GWAS p-value threshold of 5 × 10−8. Chromosome and basepair position per variant were retrieved from dbSNP130 [43]. All variants were oriented to the Height minor allele. The final merged and QC’ed datafile contained 2,363,881 SNPs. rsID#’s of published GWAS SNPs were retrieved for Height from https://www.nature.com/nature/journal/v467/n7317/full/nature09410.html via Supplementary Table 1, for BMI from https://www.nature.com/ng/journal/v42/n11/full/ng.686.html via Table 1, and for WHRadjBMI from https://www.nature.com/ng/journal/v42/n11/full/ng.685.html via Table 1. GIANT2014/5 [10-12]: Summary data for Height, BMI, and WHRadjBMI were downloaded from https://www.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files. We used a minimum N threshold of 50,000, a MAF threshold of 1%, and a univariate significant GWAS p-value threshold of 5 × 10−8. Chromosome and basepair position per variant were retrieved from dbSNP130 [43]. All variants were oriented to the Height minor allele. The final merged and QC’ed datafile contained 2,340,715 SNPs. rsID#’s of published GWAS SNPs were retrieved for Height from https://www.nature.com/ng/journal/v46/n11/full/ng.3097.html via Supplementary Table 1, for BMI from https://www.nature.com/nature/journal/v518/n7538/full/nature14177.html via Supplementary Tables 1 and 2, and for WHRadjBMI from https://www.nature.com/nature/journal/v518/n7538/full/nature14132.html via Supplementary Table 4. HaemgenRBC2012 [22]: Summary data for RBC, PCV, MCV, MCH, MCHC, and Hb were downloaded from the European Genome-Phenome Archive via accession number EGAS00000000132 (https://www.ebi.ac.uk/ega/studies/EGAS00000000132). We used a minimum N threshold of 10,000, a MAF threshold of 1%, and a univariate significant GWAS p-value threshold of 1 × 10−8. Chromosome, basepair position, and MAF per variant were retrieved from HapMap release 22 [44]. All variants were oriented to the RBC minor allele. The final merged and QC’ed datafile contained 2,327,567 SNPs. rsID#’s of published GWAS SNPs were retrieved for all six phenotypes from https://www.nature.com/nature/journal/v492/n7429/full/nature11677.html via Table 1. HaemgenRBC2016 [13]: Summary data for RBC, PCV, MCV, MCH, MCHC, and Hb were shared via personal communication with the authors. We used a MAF threshold of 1% and a univariate significant GWAS p-value threshold of 8.319×10−9. Since sample size was not provided per variant, the following overall study sample sizes were used as proxies per phenotype: 172,952 for RBC, 172,433 for PCV, 173,039 for MCV, 172,332 for MCH, for 172,925 MCHC, and 172,851 for Hb. All variants were oriented to the RBC minor allele. Only SNPs were analyzed. The final merged and QC’ed datafile contained 8,649,095 SNPs. We then used these summary data to create a list of (non-redundant) “Previous univariate associations”. This was done separately for each phenotype by collecting all SNPs that exceeded the univariate significant GWAS p-value threshold and greedily pruning the SNPs: i.e. we went down the list, removing SNPs that were less significant than another SNP within 500kb. The pruned lists of previous univariate associations for each phenotype were then combined to produce the final SNP list of “published GWAS results”. Published CNVs that tagged regions that were not identified by this ‘final published SNP list’ were also included to avoid erroneously claiming downstream a region as a ‘new unpublished result’; these CNVs however were only used to mask additional loci as being ‘nearby a published univariate GWAS result’ and for nothing else in the bmass analysis pipeline. ICBP2011 [23, 24]: Summary data for SBP, DBP, PP, and MAP were downloaded from dbGaP via accession number phs000585.v1.p1 (https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000585.v1.p1). We used a minimum N threshold of 10,000, a MAF threshold of 1%, and a univariate significant GWAS p-value threshold of 5 × 10−8. Chromosome and basepair position per variant were retrieved from HapMap release 21 [44]. All variants were oriented to the SBP minor allele. The final merged and QC’ed datafile contained 2,387,851 SNPs. rsID#’s of published GWAS SNPs were retrieved for SBP and DBP from https://www.nature.com/nature/journal/v478/n7367/full/nature10405.html via Supplementary Table 5, and for PP and MAP from https://www.nature.com/ng/journal/v43/n10/full/ng.922.html via Table 1 and Supplementary Table 2F. Additionally, we gratefully acknowledge the International Consortium for Blood Pressure Genome-Wide Association Studies (Nature. 2011 Sep 11;478(7367):103-9, Nat Genet. 2011 Sep 11;43(10):1005-11) for generating and sharing these data. MAGIC2010 [45]: Summary data for FstIns, FstGlu, HOMA_B, and HOMA_IR were downloaded from https://www.magicinvestigators.org/downloads/. We used a MAF threshold of 1% and a univariate significant GWAS p-value threshold of 5 × 10−8. Since sample size was not provided per variant, the overall study sample size of 46,186 was used as a proxy. Chromosome and basepair position per variant were retrieved from HapMap release 22 [44]. All variants were oriented to the FstIns minor allele. The final merged and QC’ed datafile contained 2,333,328 SNPs. rsID#’s of published GWAS SNPs were retrieved for all four phenotypes from https://www.nature.com/ng/journal/v42/n2/full/ng.520.html via Table 1. GEFOS2015 [25]: Summary data for FA, FN, and LS were downloaded from http://www.gefos.org/?q=content/data-release-2015. We used a MAF threshold of.5% and a univariate significant GWAS p-value threshold of 1.2 × 10−8. Since sample size was not provided per variant, the overall study sample size of 32,965 was used as a proxy. All variants were oriented to the FA minor allele. The final merged and QC’ed datafile contained 8,938,035 SNPs. rsID#’s of published GWAS SNPs were retrieved for all four phenotypes from https://www.nature.com/nature/journal/v526/n7571/full/nature14878.html via Supplementary Table 13. GIS2014 [46]: Summary data for Iron, Sat, TrnsFrn, and Log10Frtn were shared via personal communication with the authors. We used a MAF threshold of 1% and a univariate significant GWAS p-value threshold of 5 × 10−8. Since sample size was not provided per variant, the overall study sample size of 48,972 was used as a proxy. All variants were oriented to the Iron minor allele. The final merged and QC’ed datafile contained 1,985,313 SNPs. rsID#’s of published GWAS SNPs were retrieved for all four phenotypes from https://www.nature.com/articles/ncomms5926/ via Table 1. SSGAC2016 [47]: Summary data for NEB_Pooled and AFB_Pooled were downloaded from https://www.thessgac.org/data. We used a MAF threshold of 1% and a univariate significant GWAS p-value threshold of 5 × 10−8. Since sample size was not provided per variant, the following overall study sample sizes were used as proxies per phenotype: 251,151 for NEB_Pooled and 343,072 for AFB_Pooled. All variants were oriented to the NEB_Pooled minor allele. The final merged and QC’ed datafile contained 2,395,561 SNPs. rsID#’s of published GWAS SNPs were retrieved for all four phenotypes from https://www.nature.com/ng/journal/v48/n12/full/ng.3698.html via Table 1. CKDGen2010/1 [26, 27]: Summary data for Crea, Cys, CKD, UACR, and MA were downloaded from https://www.nhlbi.nih.gov/research/intramural/researchers/pi/fox-caroline/datasets. We used a MAF threshold of 1% and a univariate significant GWAS p-value threshold of 5 × 10−8. Since sample size was not provided per variant, the following overall study sample sizes were used as proxies per phenotype: 67,093 for Crea, 20,957 for Cys, 62,237 for CKD, 31,580 for UACR, and 30,482 for MA. All variants were oriented to the Crea minor allele. The final merged and QC’ed datafile contained 2,333,498 SNPs. rsID#’s of published GWAS SNPs were retrieved for Crea, Cys, and CKD from https://www.nature.com/ng/journal/v42/n5/full/ng.568.html via Table 2. ENIGMA22015 [48]: Summary data for ICV, Accumbens, Amygdala, Caudate, Hippocampus, Pallidum, Putamen, and Thalamus were downloaded from http://enigma.ini.usc.edu/research/download-enigma-gwas-results/. We used a minimum N threshold of 10,000, a MAF threshold of 1% and a univariate significant GWAS p-value threshold of 5 × 10−8. All variants were oriented to the ICV minor allele. The final merged and QC’ed datafile contained 6,271,117 SNPs. rsID#’s of published GWAS SNPs were retrieved for all 8 phenotypes from https://www.nature.com/nature/journal/v520/n7546/full/nature14101.html via Table 1.

bmass

bmass implements in an R package the statistical methods described in [14], which should be consulted for full details. In particular, the sections “Computation” and “Detailed Methods (Global Lipids Analysis)” in [14] describe how multivariate analyses are applied to GWAS summary data, and bmass implements the data analysis pipeline described in the “Detailed Methods (Global Lipids Analysis)” section. The bmass R package also includes two vignettes to help users begin to process GWAS summary data and implement these methods.

Additional details for Fig 3

For each dataset we made a list of “marginally-significant” SNPs, with p-values smaller than 1 × 10−6 but not genome-wide significant at the relevant datasets’ GWAS threshold. We then greedily pruned these lists of marginally-significant SNPs: that is we repeatedly went through the lists removing SNPs that were less significant than another SNP within 500kb. We then removed any SNPs that were within 500kb of a new multivariate association, and merged the resulting list with the list of new multivariate associations, and sorted this merged list of SNPs by their minimum univariate p-value. This results in a non-redundant list of marginally-significant SNPs—some of which are new multivariate associations and some of which are not—sorted by their smallest univariate p-value. The plot shows how the number of SNPs of each type varies as the p-value threshold is relaxed from the GWAS threshold to 10−6 (the HaemgenRBC2016 results show only the top 500 SNPs due to the abundance of SNPs between 8.31 × 10−9 and 1 × 10−6).

Graphical model of multivariate categories.

Shown here is a Directed Acyclic Graphical (DAG) model of our multivariate categories in the context of our vector of phenotypes Y (e.g. Y = {Y, Y, Y}) and their connections with the variant of interest g. The relationships described in-text can be seen here. Y, our unassociated phenotypes, have no connection with g. Y, our directly associated phenotypes, have a direct connection with g. And Y, our indirectly associated phenotypes, have a connection with g only by going through Y first. Note that if Y were not observed, Y would appear as a direct connection. (PDF) Click here for additional data file.

Refining association signals—GlobalLipids2013 rs7515577 and rs12038699.

Shown are the -log10 univariate p-values from the GlobalLipids2013 analysis for both the previous univariate association rs7515577 (“Previous Univariate SNP”) and the new multivariate association rs12038699 (“New Multivariate SNP”) across all four phenotypes analyzed. rs7515577 is represented as a triangle and rs12038699 is represented as a square. Also shown are the -log10 univariate p-values of SNPs within 1Mb of the midpoint between rs7515577 and rs12038699. Color-coding of the SNPs represent the degree of linkage disequilibrium between variants and the new association rs12038699 based on the GBR cohort of 1000Genomes [49]; for color coding details, see legend. (PDF) Click here for additional data file.

Effect size heterogeneity among SNPs with identical multivariate model assignments.

Shown are the phenotype effect sizes (points), and ±2 standard errors (bars), for four significantly associated SNPs from HaemgenRBC2016. All four SNPs were classified as being “associated” with all six phenotypes (i.e. marginal posterior probability of association >= 95% for each phenotype). However, they clearly show different patterns of effect sizes. Therefore focusing simply on binary calls of “associated” vs “unassociated” can hide different patterns of multivariate association. (PDF) Click here for additional data file.

Summary of associations in each dataset.

a Number of new multivariate associations discovered by our analysis. Note that we required a multivariate association to be at least 500kb from a previous reported association to be considered “new”. b Univariate GWAS significance p-value threshold used by the original study publication. c These are new multivariate SNPs that were not reported by the original study despite having a univariate association (in the public summary data) that was genome-wide significant by the original study’s univariate significance threshold. d A “previous association” means an association reported by the original GWAS; “near” means within 1Mb (but these are all more than 500kb away from a previous association since our classification of new multivariate SNPs requires this). (PDF) Click here for additional data file.

Lists of new bmass multivariate associations, per dataset.

Attached Excel sheets list new bmass associations for each dataset analyzed. (XLS) Click here for additional data file.

Lists of retrieved univariate associations from original publications, per dataset.

Attached Excel sheets list the rsID#’s of the univariate significant SNPs that were retrieved from the original publication(s) associated with each dataset (see Methods for details). (XLS) Click here for additional data file.

Results for previous univariate associations, per dataset.

Attached Excel sheets give bmass results for previous univariate associations, per dataset. Note that these results may not include all SNPs from S3a–S3m Table, because some SNPs were dropped during QC and other SNPs were dropped because they did not reach univariate significance in the publicly available summary data (see Methods for details). (XLS) Click here for additional data file. Shown are example metrics of how well our new multivariate associations replicate in datasets that allow such an evaluation. Specifically, for three of the studies used (GlobalLipids, GIANT, and HaemgenRBC), there are multiple dataset releases. To examine how well our new multivariate bmass associations replicate, we compared the results from the first releases (“1st”) with the univariate GWAS associations of the second releases (“2nd”). In essence, each of these approaches aim to increase power—one by using a multivariate approach (bmass) and the other by increasing sample size (the 2nd releases)—thus allowing us to compare the results against one another. Univariate p-Value Threshold: univariate GWAS significance p-value thresholds used by the original publication(s) for both the earlier (1st) and later (2nd) releases. New Multivariate SNPs in 1st: number of new multivariate associations from the earlier release. Lower Univariate p-Value in 2nd: number of new multivariate associations from the earlier release that also have lower univariate p-values in the later release. Below 2nd Univariate Threshold: number of new multivariate associations from the earlier release that also cross the later release’s univariate GWAS significance threshold. (PDF) Click here for additional data file.

p-Values for rs7515577 & rs12038699 in 2010 and 2013 GlobalLipids releases.

In the 2010 release rs7515577 has a univariate p-value that crosses the 5 × 10−8 threshold (TC), whereas rs12038699 does not. Since rs12038699 is near to rs7515577 it may get masked for future analyses; however in the 2013 data rs12038699 not only has a lower minimum univariate p-value, but also has a different multivariate p-value pattern as compared to rs7515577. Both these signals suggest that rs12038699 should be viewed as a separate GWAS hit for GlobalLipids2013. (PDF) Click here for additional data file.

Top multivariate model examples per SNP.

List of multivariate models that most frequently have the highest posterior probabilities per SNP. Top 5 models are shown from across both the previous univariate associations analyzed and the new multivariate associations discovered in the GlobalLipids2013, GIANT2014/5, and HaemgenRBC2016 datasets. Phenotype ordering is shown in the header, where 0, 1, and 2 refer to the multivariate categories of Unassociated, Directly Associated, and Indirectly Associated. n is the number of SNPs that show the specified model as having the largest posterior probability, with Mean Posterior displaying the average posterior probability of the given model across the n SNPs, and Original Prior showing the prior established for the given model from training on all the previous univariate associations from that dataset. (PDF) Click here for additional data file. 10 Jul 2019 * Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. * Dear Dr Turchin, Thank you very much for submitting your Research Article entitled 'Bayesian multivariate reanalysis of large genetic studies identifies many new associations' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important topic but identified some aspects of the manuscript that should be improved. We therefore ask you to modify the manuscript according to the review recommendations before we can consider your manuscript for acceptance. Your revisions should address the specific points made by each reviewer. In addition we ask that you: 1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. 2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images. We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] Please let us know if you have any questions while making these revisions. Yours sincerely, Michael P. Epstein Associate Editor PLOS Genetics Hua Tang Section Editor: Natural Variation PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: Turchin & Stephens show that many more associations can be identified with their Bayesian multivariate analysis. This paper advocates an important message that statistical power can easily improved from univariate to multivariate analysis. Overall the paper is well written and results are striking. One interesting question is that whether the performance gain is from the bayesian framework (i.e., bmass) or from multivariate analysis, a simple way to address that is to apply another multivariate method and compare its results with that from bmass. Reviewer #2: Overview: This manuscript takes the much needed step to show consortiums what they miss by not considering multivariate analysis of related phenotypes. The authors re-analyzed several publicly available datasets to compare findings from multivariate analysis of multiple phenotypes against those from single-trait analyses. Further, the authors empirically validate their additional findings from multivariate analysis by replicating those findings in larger, more recent datasets (e.g. in studies with multiple releases where later releases have larger sample sizes). The authors additionally emphasize that multivariate analysis is different from multiple single-trait analyses. I have a few discussion-type comments that will likely help other researchers adopt multivariate analyses in GWAS (which seems to be the goal of this manuscript) and a few minor comments for the authors to consider. *Major comments* 1. The authors give a couple reasons for why multivariate analyses are not adopted more widely. One reason that I have encountered from consortiums is that two traits may have widely different sample sizes, and no one wants to “sacrifice” higher sample size of one trait by trying to jointly analyze the two traits. This idea is probably stemming from the fact that multivariate analysis based on individual-level data requires the two traits to be measured on the same set of individuals. So, I think it will be useful to emphasize that these days, multivariate analysis can be done using methods based on GWAS summary statistics that do not have such a requirement. However, it also comes with additional questions that needs discussion: (a) if the consortiums have individual-level phenotype-genotype data where each trait has widely varying sample sizes, should the analysts first obtain single-trait single-SNP GWAS summary statistics and then proceed with multivariate analysis (using methods based on summary data)? Is this 2-step process an efficient approach? What would the authors recommend? (b) are the multivariate methods based on summary data robust across wide differences in sample sizes? Or can the authors give a thumb-rule (that analysts can follow) on the ratio of sample sizes of the traits beyond which summary-data-based-multivariate-methods may not be robust and may not prove more effective than univariate analysis? In my opinion, another hurdle in the adoption of multivariate analysis in consortiums: usually the multivariate analyses focus on genetic variants with MAF 1% or more (due to concerns about type I error inflation) while univariate analyses can conveniently go to lower MAFs without as much worry about inflation. Consequently, univariate analyses may end up detecting significantly associated genetic regions that are not even considered for multivariate analyses in the first place (due to more stringent MAF screening). Apparently then multivariate analysis is not giving anything over and above the findings from univariate analysis - the end result is the publication of univariate analysis results only. How would the authors sell multivariate methods to consortiums in such a context where univariate analysis appears more effective? 2. Since the authors are advocating multivariate analyses (which I completely agree with), I think readers will find it useful if the authors can provide a general multivariate analysis pipeline that consortiums can broadly follow. For instance, a general univariate analysis pipeline involves (1) linear or logistic regression of each trait, (2) then clumping of significant variants into unique loci, (3) then performing conditional analyses to determine which signals are independent from signals that are known from past studies, (4) then trying to find biological relevance of independent new signals for the corresponding trait, etc. On the other hand, for multivariate analysis, to avoid the problem of sacrificing higher sample size of one trait (as pointed out in the previous comment), (1) the first step will still be univariate analysis, (2) then multivariate analysis using say BFav approach based on the summary stats from first step (is BFav only applicable to continuous traits?), (3) then clumping of unique loci, (4) then performing conditional analyses? How to do conditional analyses with BFav approach? Which SNP-trait associations from past studies should be considered for conditional analyses? For the 5th step, how should one go about determining biological relevance of a multivariate signal? What next steps can be taken? 3. The authors defined “new multivariate association” as significant “if its BFav exceeds that of any previous univariate association’s BFav in the same study (Stephens, 2013). The rationale is that the evidence for these multivariate associations exceeds the evidence for univariate associations that are generally accepted as likely to be real.” - Is this, in some sense, equivalent to (in the frequentist world) defining significance by looking at the multivariate p-value; if it is smaller than the minimum of the univariate p-values, then we have a new multivariate association? 
If so, are we going to call this situation new multivariate association: p_Trait1 = 0.01, p_Trait2=0.005 and p_multivariate=0.001? 
If yes, the definition does not seem right because in GWAS, we look at more stringent significance thresholds. Can the authors explain in more detail? 4. Figure 2: Some of the new multivariate SNPs are below the 45-degree line. If I am interpreting it correctly, these SNPs have larger negative log-transformed p-values (i.e., more significant) in the earlier release than the later release (later release with larger sample size). When can this happen? Are these variants low-frequency? *Minor Comments* 1. Since the authors recommend re-analyzing published GWAS using multivariate methods that use single-SNP summary data, I think it is worth emphasizing that consortiums ensure detailed summary data files are made publicly available. For instance, I have often come across publicly available summary data files that are almost useless because they have missing MAF or risk allele frequency, or missing info on the risk allele or the reference allele, or missing info on the Z statistics (or beta estimates) making direction of effect indeterminable. 2. “in total, 84 of 94 new associations have smaller univariate p-values in the later release, and indeed the majority of these reach univariate GWAS significance in the later release.” - Do the authors mean exactly the same variants have smaller univariate p-values in later release compared to the first release? 3. “at a univariate threshold of 5e-7 only 66% of the univariate significant SNPs are also multivariate significant across these three studies.” - Was 5e-7 threshold used for determining multivariate significance as well? What’s the equivalent BFav value (since the multivariate approach used the BFav Bayesian approach)? 4. Can the authors explain the following choices? (a) Looks like the author screened our variants with MAF < 1% for all datasets but one. The MAF threshold for GEFOS2015 is 0.5%. (b) “We used a minimum N threshold of 50,000”. (c) The authors used “a univariate significant GWAS p-value threshold of 5e-8” for most studies but not all (e.g., for HaemgenRBC2016, threshold used is 8.319e-9). 5. “There may be many reasons why such variants went unreported, but one reason may be physical proximity to a variant with a stronger signal.” - Is it possible that such variants were not reported because they were no longer significant in subsequent conditional analyses? 6. Supplementary Figure 3 plots effect sizes for each trait. Does the BFav approach provide effect size estimates as well? Reviewer #3: This is a nicely written paper applying a multivariate method to 13 datasets to search for new associations. The paper presents a large number of new findings, with the ultimate goal of encouraging increased use of the multivariate method. General comments: The performance of this multivariate approach is illustrated by looking at very closely related phenotypes, such as blood cell subtypes and anthropometric measures. Would this method still yield substantial power gains when investigating joint associations for less similar traits (i.e.: BMI and different cancer types)? Is the power of this approach related to the degree of correlation (genetic and/or phenotypic) between the traits? The authors note that “we declared a multivariate association as significant if its BFav exceeds that of any previous univariate association’s BFav in the same study.” Thus, the multivariate is deemed significant if it has a Bayes factor larger than any one univariate result. Is this criterion too liberal, and might it result in false positives for the multivariate approach? It is unclear why examining associations “that would be declared significant if the univariate significance threshold were relaxed” and comparing these to multivariate association results illustrates that multivariate analysis is not the same as multiple univariate analyses. I agree that these are not the same thing, but wouldn’t it make sense to look at whether a given SNP is associated with traits 1-3 in multiple univariate analyses and then see if it is also identified in the multivariate analysis of the same traits? The “Reanalysis also identifies new univariate associations” part is a little vague. Does this mean that there were significant SNPs in the original summary stats files that were simply not reported? Or are these new SNPs that were identified in the multivariate analysis, but were only associated with one trait? Showing replication is very important to support the use of the multivariate approach. When using a later release of the same study/consortium to replicate associations, were the older studies excluded from the new release? What is the degree of overlap between the two, and can this be considered independent replication? If the initial study is included in the latter release, wouldn’t one expect the univariate p-value to decrease based on the larger sample size regardless of it being a true signal? When evaluating the independence of new multivariate associations, it seems that pruning was based on distance (SNPs at least 0.5 Mb apart), but was LD was explicitly considered based on an r2 cut-off? Why were different significance thresholds used for different studies? HaemgenRBC 2012: p<1�10-8, HaemgenRBC 2016: p<8.319�10-9, GEFOS2015: p<1.2�10-8 In the Reanalysis section the authors state “Indeed the very different multivariate patterns of effect size suggest that not only are these associations non-redundant but likely involve different biological mechanisms as well.” Is there evidence to support this statement? Do we really think that different effect sizes mean different mechanisms? ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No 12 Aug 2019 Submitted filename: TurchinStephens2019_ReviewerResponse2.pdf Click here for additional data file. 4 Sep 2019 [EXSCINDED] * Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. * Dear Dr Turchin, Thank you very much for submitting your revised Research Article entitled 'Bayesian multivariate reanalysis of large genetic studies identifies many new associations' to PLOS Genetics. The peer reviewers were satisfied with most of your revisions, although one reviewer requested some additional minor edits. We therefore ask you to modify the manuscript accordingly before we can move forward with formal acceptance of your work. In addition we ask that you: 1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. 2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images. We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] Please let us know if you have any questions while making these revisions. Yours sincerely, Michael P. Epstein Associate Editor PLOS Genetics Hua Tang Section Editor: Natural Variation PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: My comments have been addressed. Reviewer #2: I thank the authors for addressing my previous comments. With regards to the authors’ “response to reviewers”, it would have been less time intensive to review if the authors highlighted the text they added in the manuscript. Or, at least mention the relevant page numbers wherever “We have now included […] in the manuscript” is mentioned in their response. I’d request the authors to consider giving this info and/or an annotated manuscript in any subsequent review. *Minor Comments* 1. The authors added “More generally, although not necessarily essential for our analyses here, it is always helpful to include additional key meta-data (e.g. the reference allele, or effect allele, the minor allele frequency, and sample size).” However, earlier the authors mentioned in section 6.1 that they dropped variants from GWAS datasets that had missing MAF or for which reference/alt alleles did not match across phenotypes. So, I am not sure why the authors say meta-data such as “reference allele, or effect allele, the minor allele frequency” are “not necessarily essential” for their analyses. It may not be necessary for implementing their R program bmass but these info seem essential for subsequent interpretation (without which, there’s high risk of erroneous conclusions about statistical significance - e.g., mixing up effect alleles across traits can inflate results; results may not be trustworthy if the MAF is low/rare). 2. In the response, the authors mention “In particular our current framework does not immediately have options for including covariates in the multivariate analysis, so correcting for confounders such as population structure should be done on the first set of univariate analyses. We have added specific mention of this to the Discussion.” In the Discussion, I found this text: “in many cases summary data remain much easier to obtain and work with; there are big practical advantages as well to modular pipelines that first compute summary data and then use these as inputs to subsequent (more sophisticated) analyses. For example, the multivariate analyses we present here are simplified by assuming that the summary data were computed while adequately adjusting for population stratification.” I may have missed other relevant text but the above quoted text from Discussion does not seem to emphasize/convey the message that any necessary covariate adjustment *has* to be done before obtaining the single-trait GWAS summary statistics, and cannot be done at the multivariate analysis step. So, if one has access to GWAS summary data from a study that did not adjust for population stratification, one cannot expect it to be taken care of by the multivariate analysis. 3. In the response, the authors mention “Under the scenario where a consortium finds that running a multivariate analysis does not indicate any new SNPs as being genome-wide significant, then we would not criticize them for choosing to focus on univariate analyses. We have added a discussion of this to the manuscript.” I have probably missed but I couldn’t find this discussion in the manuscript. 4. It is not clear if the BFav approach is only applicable to continuous traits (if I am not mistaken, the authors have analyzed continuous traits only in this manuscript). If so, it is worth emphasizing. Reviewer #3: No additional comments. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No 13 Sep 2019 Submitted filename: TurchinStephens2019_ReviewerResponse_Rnd2_v2.pdf Click here for additional data file. 17 Sep 2019 Dear Dr Turchin, We are pleased to inform you that your manuscript entitled "Bayesian multivariate reanalysis of large genetic studies identifies many new associations" has been editorially accepted for publication in PLOS Genetics. Congratulations! Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional accept, but your manuscript will not be scheduled for publication until the required changes have been made. Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. If you have a press-related query, or would like to know about one way to make your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics! Yours sincerely, Michael P. Epstein Associate Editor PLOS Genetics Hua Tang Section Editor: Natural Variation PLOS Genetics www.plosgenetics.org Twitter: @PLOSGenetics ---------------------------------------------------- Comments from the reviewers (if applicable): ---------------------------------------------------- Data Deposition If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website. The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-19-00888R2 More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support. Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present. ---------------------------------------------------- Press Queries If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org. 2 Oct 2019 PGENETICS-D-19-00888R2 Bayesian multivariate reanalysis of large genetic studies identifies many new associations Dear Dr Turchin, We are pleased to inform you that your manuscript entitled "Bayesian multivariate reanalysis of large genetic studies identifies many new associations" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work! With kind regards, Kaitlin Butler PLOS Genetics On behalf of: The PLOS Genetics Team Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom plosgenetics@plos.org | +44 (0) 1223-442823 plosgenetics.org | Twitter: @PLOSGenetics
  49 in total

1.  Joint Bayesian inference of risk variants and tissue-specific epigenomic enrichments across multiple complex human diseases.

Authors:  Yue Li; Manolis Kellis
Journal:  Nucleic Acids Res       Date:  2016-07-12       Impact factor: 16.971

2.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets.

Authors:  Zhihong Zhu; Futao Zhang; Han Hu; Andrew Bakshi; Matthew R Robinson; Joseph E Powell; Grant W Montgomery; Michael E Goddard; Naomi R Wray; Peter M Visscher; Jian Yang
Journal:  Nat Genet       Date:  2016-03-28       Impact factor: 38.330

3.  Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution.

Authors:  Iris M Heid; Anne U Jackson; Joshua C Randall; Thomas W Winkler; Lu Qi; Valgerdur Steinthorsdottir; Gudmar Thorleifsson; M Carola Zillikens; Elizabeth K Speliotes; Reedik Mägi; Tsegaselassie Workalemahu; Charles C White; Nabila Bouatia-Naji; Tamara B Harris; Sonja I Berndt; Erik Ingelsson; Cristen J Willer; Michael N Weedon; Jian'an Luan; Sailaja Vedantam; Tõnu Esko; Tuomas O Kilpeläinen; Zoltán Kutalik; Shengxu Li; Keri L Monda; Anna L Dixon; Christopher C Holmes; Lee M Kaplan; Liming Liang; Josine L Min; Miriam F Moffatt; Cliona Molony; George Nicholson; Eric E Schadt; Krina T Zondervan; Mary F Feitosa; Teresa Ferreira; Hana Lango Allen; Robert J Weyant; Eleanor Wheeler; Andrew R Wood; Karol Estrada; Michael E Goddard; Guillaume Lettre; Massimo Mangino; Dale R Nyholt; Shaun Purcell; Albert Vernon Smith; Peter M Visscher; Jian Yang; Steven A McCarroll; James Nemesh; Benjamin F Voight; Devin Absher; Najaf Amin; Thor Aspelund; Lachlan Coin; Nicole L Glazer; Caroline Hayward; Nancy L Heard-Costa; Jouke-Jan Hottenga; Asa Johansson; Toby Johnson; Marika Kaakinen; Karen Kapur; Shamika Ketkar; Joshua W Knowles; Peter Kraft; Aldi T Kraja; Claudia Lamina; Michael F Leitzmann; Barbara McKnight; Andrew P Morris; Ken K Ong; John R B Perry; Marjolein J Peters; Ozren Polasek; Inga Prokopenko; Nigel W Rayner; Samuli Ripatti; Fernando Rivadeneira; Neil R Robertson; Serena Sanna; Ulla Sovio; Ida Surakka; Alexander Teumer; Sophie van Wingerden; Veronique Vitart; Jing Hua Zhao; Christine Cavalcanti-Proença; Peter S Chines; Eva Fisher; Jennifer R Kulzer; Cecile Lecoeur; Narisu Narisu; Camilla Sandholt; Laura J Scott; Kaisa Silander; Klaus Stark; Mari-Liis Tammesoo; Tanya M Teslovich; Nicholas John Timpson; Richard M Watanabe; Ryan Welch; Daniel I Chasman; Matthew N Cooper; John-Olov Jansson; Johannes Kettunen; Robert W Lawrence; Niina Pellikka; Markus Perola; Liesbeth Vandenput; Helene Alavere; Peter Almgren; Larry D Atwood; Amanda J Bennett; Reiner Biffar; Lori L Bonnycastle; Stefan R Bornstein; Thomas A Buchanan; Harry Campbell; Ian N M Day; Mariano Dei; Marcus Dörr; Paul Elliott; Michael R Erdos; Johan G Eriksson; Nelson B Freimer; Mao Fu; Stefan Gaget; Eco J C Geus; Anette P Gjesing; Harald Grallert; Jürgen Grässler; Christopher J Groves; Candace Guiducci; Anna-Liisa Hartikainen; Neelam Hassanali; Aki S Havulinna; Karl-Heinz Herzig; Andrew A Hicks; Jennie Hui; Wilmar Igl; Pekka Jousilahti; Antti Jula; Eero Kajantie; Leena Kinnunen; Ivana Kolcic; Seppo Koskinen; Peter Kovacs; Heyo K Kroemer; Vjekoslav Krzelj; Johanna Kuusisto; Kirsti Kvaloy; Jaana Laitinen; Olivier Lantieri; G Mark Lathrop; Marja-Liisa Lokki; Robert N Luben; Barbara Ludwig; Wendy L McArdle; Anne McCarthy; Mario A Morken; Mari Nelis; Matt J Neville; Guillaume Paré; Alex N Parker; John F Peden; Irene Pichler; Kirsi H Pietiläinen; Carl G P Platou; Anneli Pouta; Martin Ridderstråle; Nilesh J Samani; Jouko Saramies; Juha Sinisalo; Jan H Smit; Rona J Strawbridge; Heather M Stringham; Amy J Swift; Maris Teder-Laving; Brian Thomson; Gianluca Usala; Joyce B J van Meurs; Gert-Jan van Ommen; Vincent Vatin; Claudia B Volpato; Henri Wallaschofski; G Bragi Walters; Elisabeth Widen; Sarah H Wild; Gonneke Willemsen; Daniel R Witte; Lina Zgaga; Paavo Zitting; John P Beilby; Alan L James; Mika Kähönen; Terho Lehtimäki; Markku S Nieminen; Claes Ohlsson; Lyle J Palmer; Olli Raitakari; Paul M Ridker; Michael Stumvoll; Anke Tönjes; Jorma Viikari; Beverley Balkau; Yoav Ben-Shlomo; Richard N Bergman; Heiner Boeing; George Davey Smith; Shah Ebrahim; Philippe Froguel; Torben Hansen; Christian Hengstenberg; Kristian Hveem; Bo Isomaa; Torben Jørgensen; Fredrik Karpe; Kay-Tee Khaw; Markku Laakso; Debbie A Lawlor; Michel Marre; Thomas Meitinger; Andres Metspalu; Kristian Midthjell; Oluf Pedersen; Veikko Salomaa; Peter E H Schwarz; Tiinamaija Tuomi; Jaakko Tuomilehto; Timo T Valle; Nicholas J Wareham; Alice M Arnold; Jacques S Beckmann; Sven Bergmann; Eric Boerwinkle; Dorret I Boomsma; Mark J Caulfield; Francis S Collins; Gudny Eiriksdottir; Vilmundur Gudnason; Ulf Gyllensten; Anders Hamsten; Andrew T Hattersley; Albert Hofman; Frank B Hu; Thomas Illig; Carlos Iribarren; Marjo-Riitta Jarvelin; W H Linda Kao; Jaakko Kaprio; Lenore J Launer; Patricia B Munroe; Ben Oostra; Brenda W Penninx; Peter P Pramstaller; Bruce M Psaty; Thomas Quertermous; Aila Rissanen; Igor Rudan; Alan R Shuldiner; Nicole Soranzo; Timothy D Spector; Ann-Christine Syvanen; Manuela Uda; André Uitterlinden; Henry Völzke; Peter Vollenweider; James F Wilson; Jacqueline C Witteman; Alan F Wright; Gonçalo R Abecasis; Michael Boehnke; Ingrid B Borecki; Panos Deloukas; Timothy M Frayling; Leif C Groop; Talin Haritunians; David J Hunter; Robert C Kaplan; Kari E North; Jeffrey R O'Connell; Leena Peltonen; David Schlessinger; David P Strachan; Joel N Hirschhorn; Themistocles L Assimes; H-Erich Wichmann; Unnur Thorsteinsdottir; Cornelia M van Duijn; Kari Stefansson; L Adrienne Cupples; Ruth J F Loos; Inês Barroso; Mark I McCarthy; Caroline S Fox; Karen L Mohlke; Cecilia M Lindgren
Journal:  Nat Genet       Date:  2010-10-10       Impact factor: 38.330

Review 4.  10 Years of GWAS Discovery: Biology, Function, and Translation.

Authors:  Peter M Visscher; Naomi R Wray; Qian Zhang; Pamela Sklar; Mark I McCarthy; Matthew A Brown; Jian Yang
Journal:  Am J Hum Genet       Date:  2017-07-06       Impact factor: 11.025

5.  New genetic loci link adipose and insulin biology to body fat distribution.

Authors:  Dmitry Shungin; Thomas W Winkler; Damien C Croteau-Chonka; Teresa Ferreira; Adam E Locke; Reedik Mägi; Rona J Strawbridge; Tune H Pers; Krista Fischer; Anne E Justice; Tsegaselassie Workalemahu; Joseph M W Wu; Martin L Buchkovich; Nancy L Heard-Costa; Tamara S Roman; Alexander W Drong; Ci Song; Stefan Gustafsson; Felix R Day; Tonu Esko; Tove Fall; Zoltán Kutalik; Jian'an Luan; Joshua C Randall; André Scherag; Sailaja Vedantam; Andrew R Wood; Jin Chen; Rudolf Fehrmann; Juha Karjalainen; Bratati Kahali; Ching-Ti Liu; Ellen M Schmidt; Devin Absher; Najaf Amin; Denise Anderson; Marian Beekman; Jennifer L Bragg-Gresham; Steven Buyske; Ayse Demirkan; Georg B Ehret; Mary F Feitosa; Anuj Goel; Anne U Jackson; Toby Johnson; Marcus E Kleber; Kati Kristiansson; Massimo Mangino; Irene Mateo Leach; Carolina Medina-Gomez; Cameron D Palmer; Dorota Pasko; Sonali Pechlivanis; Marjolein J Peters; Inga Prokopenko; Alena Stančáková; Yun Ju Sung; Toshiko Tanaka; Alexander Teumer; Jana V Van Vliet-Ostaptchouk; Loïc Yengo; Weihua Zhang; Eva Albrecht; Johan Ärnlöv; Gillian M Arscott; Stefania Bandinelli; Amy Barrett; Claire Bellis; Amanda J Bennett; Christian Berne; Matthias Blüher; Stefan Böhringer; Fabrice Bonnet; Yvonne Böttcher; Marcel Bruinenberg; Delia B Carba; Ida H Caspersen; Robert Clarke; E Warwick Daw; Joris Deelen; Ewa Deelman; Graciela Delgado; Alex Sf Doney; Niina Eklund; Michael R Erdos; Karol Estrada; Elodie Eury; Nele Friedrich; Melissa E Garcia; Vilmantas Giedraitis; Bruna Gigante; Alan S Go; Alain Golay; Harald Grallert; Tanja B Grammer; Jürgen Gräßler; Jagvir Grewal; Christopher J Groves; Toomas Haller; Goran Hallmans; Catharina A Hartman; Maija Hassinen; Caroline Hayward; Kauko Heikkilä; Karl-Heinz Herzig; Quinta Helmer; Hans L Hillege; Oddgeir Holmen; Steven C Hunt; Aaron Isaacs; Till Ittermann; Alan L James; Ingegerd Johansson; Thorhildur Juliusdottir; Ioanna-Panagiota Kalafati; Leena Kinnunen; Wolfgang Koenig; Ishminder K Kooner; Wolfgang Kratzer; Claudia Lamina; Karin Leander; Nanette R Lee; Peter Lichtner; Lars Lind; Jaana Lindström; Stéphane Lobbens; Mattias Lorentzon; François Mach; Patrik Ke Magnusson; Anubha Mahajan; Wendy L McArdle; Cristina Menni; Sigrun Merger; Evelin Mihailov; Lili Milani; Rebecca Mills; Alireza Moayyeri; Keri L Monda; Simon P Mooijaart; Thomas W Mühleisen; Antonella Mulas; Gabriele Müller; Martina Müller-Nurasyid; Ramaiah Nagaraja; Michael A Nalls; Narisu Narisu; Nicola Glorioso; Ilja M Nolte; Matthias Olden; Nigel W Rayner; Frida Renstrom; Janina S Ried; Neil R Robertson; Lynda M Rose; Serena Sanna; Hubert Scharnagl; Salome Scholtens; Bengt Sennblad; Thomas Seufferlein; Colleen M Sitlani; Albert Vernon Smith; Kathleen Stirrups; Heather M Stringham; Johan Sundström; Morris A Swertz; Amy J Swift; Ann-Christine Syvänen; Bamidele O Tayo; Barbara Thorand; Gudmar Thorleifsson; Andreas Tomaschitz; Chiara Troffa; Floor Va van Oort; Niek Verweij; Judith M Vonk; Lindsay L Waite; Roman Wennauer; Tom Wilsgaard; Mary K Wojczynski; Andrew Wong; Qunyuan Zhang; Jing Hua Zhao; Eoin P Brennan; Murim Choi; Per Eriksson; Lasse Folkersen; Anders Franco-Cereceda; Ali G Gharavi; Åsa K Hedman; Marie-France Hivert; Jinyan Huang; Stavroula Kanoni; Fredrik Karpe; Sarah Keildson; Krzysztof Kiryluk; Liming Liang; Richard P Lifton; Baoshan Ma; Amy J McKnight; Ruth McPherson; Andres Metspalu; Josine L Min; Miriam F Moffatt; Grant W Montgomery; Joanne M Murabito; George Nicholson; Dale R Nyholt; Christian Olsson; John Rb Perry; Eva Reinmaa; Rany M Salem; Niina Sandholm; Eric E Schadt; Robert A Scott; Lisette Stolk; Edgar E Vallejo; Harm-Jan Westra; Krina T Zondervan; Philippe Amouyel; Dominique Arveiler; Stephan Jl Bakker; John Beilby; Richard N Bergman; John Blangero; Morris J Brown; Michel Burnier; Harry Campbell; Aravinda Chakravarti; Peter S Chines; Simone Claudi-Boehm; Francis S Collins; Dana C Crawford; John Danesh; Ulf de Faire; Eco Jc de Geus; Marcus Dörr; Raimund Erbel; Johan G Eriksson; Martin Farrall; Ele Ferrannini; Jean Ferrières; Nita G Forouhi; Terrence Forrester; Oscar H Franco; Ron T Gansevoort; Christian Gieger; Vilmundur Gudnason; Christopher A Haiman; Tamara B Harris; Andrew T Hattersley; Markku Heliövaara; Andrew A Hicks; Aroon D Hingorani; Wolfgang Hoffmann; Albert Hofman; Georg Homuth; Steve E Humphries; Elina Hyppönen; Thomas Illig; Marjo-Riitta Jarvelin; Berit Johansen; Pekka Jousilahti; Antti M Jula; Jaakko Kaprio; Frank Kee; Sirkka M Keinanen-Kiukaanniemi; Jaspal S Kooner; Charles Kooperberg; Peter Kovacs; Aldi T Kraja; Meena Kumari; Kari Kuulasmaa; Johanna Kuusisto; Timo A Lakka; Claudia Langenberg; Loic Le Marchand; Terho Lehtimäki; Valeriya Lyssenko; Satu Männistö; André Marette; Tara C Matise; Colin A McKenzie; Barbara McKnight; Arthur W Musk; Stefan Möhlenkamp; Andrew D Morris; Mari Nelis; Claes Ohlsson; Albertine J Oldehinkel; Ken K Ong; Lyle J Palmer; Brenda W Penninx; Annette Peters; Peter P Pramstaller; Olli T Raitakari; Tuomo Rankinen; D C Rao; Treva K Rice; Paul M Ridker; Marylyn D Ritchie; Igor Rudan; Veikko Salomaa; Nilesh J Samani; Jouko Saramies; Mark A Sarzynski; Peter Eh Schwarz; Alan R Shuldiner; Jan A Staessen; Valgerdur Steinthorsdottir; Ronald P Stolk; Konstantin Strauch; Anke Tönjes; Angelo Tremblay; Elena Tremoli; Marie-Claude Vohl; Uwe Völker; Peter Vollenweider; James F Wilson; Jacqueline C Witteman; Linda S Adair; Murielle Bochud; Bernhard O Boehm; Stefan R Bornstein; Claude Bouchard; Stéphane Cauchi; Mark J Caulfield; John C Chambers; Daniel I Chasman; Richard S Cooper; George Dedoussis; Luigi Ferrucci; Philippe Froguel; Hans-Jörgen Grabe; Anders Hamsten; Jennie Hui; Kristian Hveem; Karl-Heinz Jöckel; Mika Kivimaki; Diana Kuh; Markku Laakso; Yongmei Liu; Winfried März; Patricia B Munroe; Inger Njølstad; Ben A Oostra; Colin Na Palmer; Nancy L Pedersen; Markus Perola; Louis Pérusse; Ulrike Peters; Chris Power; Thomas Quertermous; Rainer Rauramaa; Fernando Rivadeneira; Timo E Saaristo; Danish Saleheen; Juha Sinisalo; P Eline Slagboom; Harold Snieder; Tim D Spector; Kari Stefansson; Michael Stumvoll; Jaakko Tuomilehto; André G Uitterlinden; Matti Uusitupa; Pim van der Harst; Giovanni Veronesi; Mark Walker; Nicholas J Wareham; Hugh Watkins; H-Erich Wichmann; Goncalo R Abecasis; Themistocles L Assimes; Sonja I Berndt; Michael Boehnke; Ingrid B Borecki; Panos Deloukas; Lude Franke; Timothy M Frayling; Leif C Groop; David J Hunter; Robert C Kaplan; Jeffrey R O'Connell; Lu Qi; David Schlessinger; David P Strachan; Unnur Thorsteinsdottir; Cornelia M van Duijn; Cristen J Willer; Peter M Visscher; Jian Yang; Joel N Hirschhorn; M Carola Zillikens; Mark I McCarthy; Elizabeth K Speliotes; Kari E North; Caroline S Fox; Inês Barroso; Paul W Franks; Erik Ingelsson; Iris M Heid; Ruth Jf Loos; L Adrienne Cupples; Andrew P Morris; Cecilia M Lindgren; Karen L Mohlke
Journal:  Nature       Date:  2015-02-12       Impact factor: 49.962

6.  Whole-genome sequencing identifies EN1 as a determinant of bone density and fracture.

Authors:  Hou-Feng Zheng; Vincenzo Forgetta; Yi-Hsiang Hsu; Karol Estrada; Alberto Rosello-Diez; Paul J Leo; Chitra L Dahia; Kyung Hyun Park-Min; Jonathan H Tobias; Charles Kooperberg; Aaron Kleinman; Unnur Styrkarsdottir; Ching-Ti Liu; Charlotta Uggla; Daniel S Evans; Carrie M Nielson; Klaudia Walter; Ulrika Pettersson-Kymmer; Shane McCarthy; Joel Eriksson; Tony Kwan; Mila Jhamai; Katerina Trajanoska; Yasin Memari; Josine Min; Jie Huang; Petr Danecek; Beth Wilmot; Rui Li; Wen-Chi Chou; Lauren E Mokry; Alireza Moayyeri; Melina Claussnitzer; Chia-Ho Cheng; Warren Cheung; Carolina Medina-Gómez; Bing Ge; Shu-Huang Chen; Kwangbom Choi; Ling Oei; James Fraser; Robert Kraaij; Matthew A Hibbs; Celia L Gregson; Denis Paquette; Albert Hofman; Carl Wibom; Gregory J Tranah; Mhairi Marshall; Brooke B Gardiner; Katie Cremin; Paul Auer; Li Hsu; Sue Ring; Joyce Y Tung; Gudmar Thorleifsson; Anke W Enneman; Natasja M van Schoor; Lisette C P G M de Groot; Nathalie van der Velde; Beatrice Melin; John P Kemp; Claus Christiansen; Adrian Sayers; Yanhua Zhou; Sophie Calderari; Jeroen van Rooij; Chris Carlson; Ulrike Peters; Soizik Berlivet; Josée Dostie; Andre G Uitterlinden; Stephen R Williams; Charles Farber; Daniel Grinberg; Andrea Z LaCroix; Jeff Haessler; Daniel I Chasman; Franco Giulianini; Lynda M Rose; Paul M Ridker; John A Eisman; Tuan V Nguyen; Jacqueline R Center; Xavier Nogues; Natalia Garcia-Giralt; Lenore L Launer; Vilmunder Gudnason; Dan Mellström; Liesbeth Vandenput; Najaf Amin; Cornelia M van Duijn; Magnus K Karlsson; Östen Ljunggren; Olle Svensson; Göran Hallmans; François Rousseau; Sylvie Giroux; Johanne Bussière; Pascal P Arp; Fjorda Koromani; Richard L Prince; Joshua R Lewis; Bente L Langdahl; A Pernille Hermann; Jens-Erik B Jensen; Stephen Kaptoge; Kay-Tee Khaw; Jonathan Reeve; Melissa M Formosa; Angela Xuereb-Anastasi; Kristina Åkesson; Fiona E McGuigan; Gaurav Garg; Jose M Olmos; Maria T Zarrabeitia; Jose A Riancho; Stuart H Ralston; Nerea Alonso; Xi Jiang; David Goltzman; Tomi Pastinen; Elin Grundberg; Dominique Gauguier; Eric S Orwoll; David Karasik; George Davey-Smith; Albert V Smith; Kristin Siggeirsdottir; Tamara B Harris; M Carola Zillikens; Joyce B J van Meurs; Unnur Thorsteinsdottir; Matthew T Maurano; Nicholas J Timpson; Nicole Soranzo; Richard Durbin; Scott G Wilson; Evangelia E Ntzani; Matthew A Brown; Kari Stefansson; David A Hinds; Tim Spector; L Adrienne Cupples; Claes Ohlsson; Celia M T Greenwood; Rebecca D Jackson; David W Rowe; Cynthia A Loomis; David M Evans; Cheryl L Ackert-Bicknell; Alexandra L Joyner; Emma L Duncan; Douglas P Kiel; Fernando Rivadeneira; J Brent Richards
Journal:  Nature       Date:  2015-09-14       Impact factor: 49.962

7.  RNA splicing is a primary link between genetic variation and disease.

Authors:  Yang I Li; Bryce van de Geijn; Anil Raj; David A Knowles; Allegra A Petti; David Golan; Yoav Gilad; Jonathan K Pritchard
Journal:  Science       Date:  2016-04-28       Impact factor: 47.728

8.  Defining the role of common variation in the genomic and biological architecture of adult human height.

Authors:  Andrew R Wood; Tonu Esko; Jian Yang; Sailaja Vedantam; Tune H Pers; Stefan Gustafsson; Audrey Y Chu; Karol Estrada; Jian'an Luan; Zoltán Kutalik; Najaf Amin; Martin L Buchkovich; Damien C Croteau-Chonka; Felix R Day; Yanan Duan; Tove Fall; Rudolf Fehrmann; Teresa Ferreira; Anne U Jackson; Juha Karjalainen; Ken Sin Lo; Adam E Locke; Reedik Mägi; Evelin Mihailov; Eleonora Porcu; Joshua C Randall; André Scherag; Anna A E Vinkhuyzen; Harm-Jan Westra; Thomas W Winkler; Tsegaselassie Workalemahu; Jing Hua Zhao; Devin Absher; Eva Albrecht; Denise Anderson; Jeffrey Baron; Marian Beekman; Ayse Demirkan; Georg B Ehret; Bjarke Feenstra; Mary F Feitosa; Krista Fischer; Ross M Fraser; Anuj Goel; Jian Gong; Anne E Justice; Stavroula Kanoni; Marcus E Kleber; Kati Kristiansson; Unhee Lim; Vaneet Lotay; Julian C Lui; Massimo Mangino; Irene Mateo Leach; Carolina Medina-Gomez; Michael A Nalls; Dale R Nyholt; Cameron D Palmer; Dorota Pasko; Sonali Pechlivanis; Inga Prokopenko; Janina S Ried; Stephan Ripke; Dmitry Shungin; Alena Stancáková; Rona J Strawbridge; Yun Ju Sung; Toshiko Tanaka; Alexander Teumer; Stella Trompet; Sander W van der Laan; Jessica van Setten; Jana V Van Vliet-Ostaptchouk; Zhaoming Wang; Loïc Yengo; Weihua Zhang; Uzma Afzal; Johan Arnlöv; Gillian M Arscott; Stefania Bandinelli; Amy Barrett; Claire Bellis; Amanda J Bennett; Christian Berne; Matthias Blüher; Jennifer L Bolton; Yvonne Böttcher; Heather A Boyd; Marcel Bruinenberg; Brendan M Buckley; Steven Buyske; Ida H Caspersen; Peter S Chines; Robert Clarke; Simone Claudi-Boehm; Matthew Cooper; E Warwick Daw; Pim A De Jong; Joris Deelen; Graciela Delgado; Josh C Denny; Rosalie Dhonukshe-Rutten; Maria Dimitriou; Alex S F Doney; Marcus Dörr; Niina Eklund; Elodie Eury; Lasse Folkersen; Melissa E Garcia; Frank Geller; Vilmantas Giedraitis; Alan S Go; Harald Grallert; Tanja B Grammer; Jürgen Gräßler; Henrik Grönberg; Lisette C P G M de Groot; Christopher J Groves; Jeffrey Haessler; Per Hall; Toomas Haller; Goran Hallmans; Anke Hannemann; Catharina A Hartman; Maija Hassinen; Caroline Hayward; Nancy L Heard-Costa; Quinta Helmer; Gibran Hemani; Anjali K Henders; Hans L Hillege; Mark A Hlatky; Wolfgang Hoffmann; Per Hoffmann; Oddgeir Holmen; Jeanine J Houwing-Duistermaat; Thomas Illig; Aaron Isaacs; Alan L James; Janina Jeff; Berit Johansen; Åsa Johansson; Jennifer Jolley; Thorhildur Juliusdottir; Juhani Junttila; Abel N Kho; Leena Kinnunen; Norman Klopp; Thomas Kocher; Wolfgang Kratzer; Peter Lichtner; Lars Lind; Jaana Lindström; Stéphane Lobbens; Mattias Lorentzon; Yingchang Lu; Valeriya Lyssenko; Patrik K E Magnusson; Anubha Mahajan; Marc Maillard; Wendy L McArdle; Colin A McKenzie; Stela McLachlan; Paul J McLaren; Cristina Menni; Sigrun Merger; Lili Milani; Alireza Moayyeri; Keri L Monda; Mario A Morken; Gabriele Müller; Martina Müller-Nurasyid; Arthur W Musk; Narisu Narisu; Matthias Nauck; Ilja M Nolte; Markus M Nöthen; Laticia Oozageer; Stefan Pilz; Nigel W Rayner; Frida Renstrom; Neil R Robertson; Lynda M Rose; Ronan Roussel; Serena Sanna; Hubert Scharnagl; Salome Scholtens; Fredrick R Schumacher; Heribert Schunkert; Robert A Scott; Joban Sehmi; Thomas Seufferlein; Jianxin Shi; Karri Silventoinen; Johannes H Smit; Albert Vernon Smith; Joanna Smolonska; Alice V Stanton; Kathleen Stirrups; David J Stott; Heather M Stringham; Johan Sundström; Morris A Swertz; Ann-Christine Syvänen; Bamidele O Tayo; Gudmar Thorleifsson; Jonathan P Tyrer; Suzanne van Dijk; Natasja M van Schoor; Nathalie van der Velde; Diana van Heemst; Floor V A van Oort; Sita H Vermeulen; Niek Verweij; Judith M Vonk; Lindsay L Waite; Melanie Waldenberger; Roman Wennauer; Lynne R Wilkens; Christina Willenborg; Tom Wilsgaard; Mary K Wojczynski; Andrew Wong; Alan F Wright; Qunyuan Zhang; Dominique Arveiler; Stephan J L Bakker; John Beilby; Richard N Bergman; Sven Bergmann; Reiner Biffar; John Blangero; Dorret I Boomsma; Stefan R Bornstein; Pascal Bovet; Paolo Brambilla; Morris J Brown; Harry Campbell; Mark J Caulfield; Aravinda Chakravarti; Rory Collins; Francis S Collins; Dana C Crawford; L Adrienne Cupples; John Danesh; Ulf de Faire; Hester M den Ruijter; Raimund Erbel; Jeanette Erdmann; Johan G Eriksson; Martin Farrall; Ele Ferrannini; Jean Ferrières; Ian Ford; Nita G Forouhi; Terrence Forrester; Ron T Gansevoort; Pablo V Gejman; Christian Gieger; Alain Golay; Omri Gottesman; Vilmundur Gudnason; Ulf Gyllensten; David W Haas; Alistair S Hall; Tamara B Harris; Andrew T Hattersley; Andrew C Heath; Christian Hengstenberg; Andrew A Hicks; Lucia A Hindorff; Aroon D Hingorani; Albert Hofman; G Kees Hovingh; Steve E Humphries; Steven C Hunt; Elina Hypponen; Kevin B Jacobs; Marjo-Riitta Jarvelin; Pekka Jousilahti; Antti M Jula; Jaakko Kaprio; John J P Kastelein; Manfred Kayser; Frank Kee; Sirkka M Keinanen-Kiukaanniemi; Lambertus A Kiemeney; Jaspal S Kooner; Charles Kooperberg; Seppo Koskinen; Peter Kovacs; Aldi T Kraja; Meena Kumari; Johanna Kuusisto; Timo A Lakka; Claudia Langenberg; Loic Le Marchand; Terho Lehtimäki; Sara Lupoli; Pamela A F Madden; Satu Männistö; Paolo Manunta; André Marette; Tara C Matise; Barbara McKnight; Thomas Meitinger; Frans L Moll; Grant W Montgomery; Andrew D Morris; Andrew P Morris; Jeffrey C Murray; Mari Nelis; Claes Ohlsson; Albertine J Oldehinkel; Ken K Ong; Willem H Ouwehand; Gerard Pasterkamp; Annette Peters; Peter P Pramstaller; Jackie F Price; Lu Qi; Olli T Raitakari; Tuomo Rankinen; D C Rao; Treva K Rice; Marylyn Ritchie; Igor Rudan; Veikko Salomaa; Nilesh J Samani; Jouko Saramies; Mark A Sarzynski; Peter E H Schwarz; Sylvain Sebert; Peter Sever; Alan R Shuldiner; Juha Sinisalo; Valgerdur Steinthorsdottir; Ronald P Stolk; Jean-Claude Tardif; Anke Tönjes; Angelo Tremblay; Elena Tremoli; Jarmo Virtamo; Marie-Claude Vohl; Philippe Amouyel; Folkert W Asselbergs; Themistocles L Assimes; Murielle Bochud; Bernhard O Boehm; Eric Boerwinkle; Erwin P Bottinger; Claude Bouchard; Stéphane Cauchi; John C Chambers; Stephen J Chanock; Richard S Cooper; Paul I W de Bakker; George Dedoussis; Luigi Ferrucci; Paul W Franks; Philippe Froguel; Leif C Groop; Christopher A Haiman; Anders Hamsten; M Geoffrey Hayes; Jennie Hui; David J Hunter; Kristian Hveem; J Wouter Jukema; Robert C Kaplan; Mika Kivimaki; Diana Kuh; Markku Laakso; Yongmei Liu; Nicholas G Martin; Winfried März; Mads Melbye; Susanne Moebus; Patricia B Munroe; Inger Njølstad; Ben A Oostra; Colin N A Palmer; Nancy L Pedersen; Markus Perola; Louis Pérusse; Ulrike Peters; Joseph E Powell; Chris Power; Thomas Quertermous; Rainer Rauramaa; Eva Reinmaa; Paul M Ridker; Fernando Rivadeneira; Jerome I Rotter; Timo E Saaristo; Danish Saleheen; David Schlessinger; P Eline Slagboom; Harold Snieder; Tim D Spector; Konstantin Strauch; Michael Stumvoll; Jaakko Tuomilehto; Matti Uusitupa; Pim van der Harst; Henry Völzke; Mark Walker; Nicholas J Wareham; Hugh Watkins; H-Erich Wichmann; James F Wilson; Pieter Zanen; Panos Deloukas; Iris M Heid; Cecilia M Lindgren; Karen L Mohlke; Elizabeth K Speliotes; Unnur Thorsteinsdottir; Inês Barroso; Caroline S Fox; Kari E North; David P Strachan; Jacques S Beckmann; Sonja I Berndt; Michael Boehnke; Ingrid B Borecki; Mark I McCarthy; Andres Metspalu; Kari Stefansson; André G Uitterlinden; Cornelia M van Duijn; Lude Franke; Cristen J Willer; Alkes L Price; Guillaume Lettre; Ruth J F Loos; Michael N Weedon; Erik Ingelsson; Jeffrey R O'Connell; Goncalo R Abecasis; Daniel I Chasman; Michael E Goddard; Peter M Visscher; Joel N Hirschhorn; Timothy M Frayling
Journal:  Nat Genet       Date:  2014-10-05       Impact factor: 38.330

Review 9.  Progress and promise in understanding the genetic basis of common diseases.

Authors:  Alkes L Price; Chris C A Spencer; Peter Donnelly
Journal:  Proc Biol Sci       Date:  2015-12-22       Impact factor: 5.349

10.  Multivariate simulation framework reveals performance of multi-trait GWAS methods.

Authors:  Heather F Porter; Paul F O'Reilly
Journal:  Sci Rep       Date:  2017-03-13       Impact factor: 4.379

View more
  4 in total

1.  Assessing Digital Phenotyping to Enhance Genetic Studies of Human Diseases.

Authors:  Christopher DeBoever; Yosuke Tanigawa; Matthew Aguirre; Greg McInnes; Adam Lavertu; Manuel A Rivas
Journal:  Am J Hum Genet       Date:  2020-04-09       Impact factor: 11.025

Review 2.  Fine-mapping genetic associations.

Authors:  Anna Hutchinson; Jennifer Asimit; Chris Wallace
Journal:  Hum Mol Genet       Date:  2020-09-30       Impact factor: 6.150

3.  Enrichment analyses identify shared associations for 25 quantitative traits in over 600,000 individuals from seven diverse ancestries.

Authors:  Samuel Pattillo Smith; Sahar Shahamatdar; Wei Cheng; Selena Zhang; Joseph Paik; Misa Graff; Christopher Haiman; T C Matise; Kari E North; Ulrike Peters; Eimear Kenny; Chris Gignoux; Genevieve Wojcik; Lorin Crawford; Sohini Ramachandran
Journal:  Am J Hum Genet       Date:  2022-03-28       Impact factor: 11.043

4.  Genetic basis of lacunar stroke: a pooled analysis of individual patient data and genome-wide association studies.

Authors:  Matthew Traylor; Elodie Persyn; Liisa Tomppo; Sofia Klasson; Vida Abedi; Mark K Bakker; Nuria Torres; Linxin Li; Steven Bell; Loes Rutten-Jacobs; Daniel J Tozer; Christoph J Griessenauer; Yanfei Zhang; Annie Pedersen; Pankaj Sharma; Jordi Jimenez-Conde; Tatjana Rundek; Raji P Grewal; Arne Lindgren; James F Meschia; Veikko Salomaa; Aki Havulinna; Christina Kourkoulis; Katherine Crawford; Sandro Marini; Braxton D Mitchell; Steven J Kittner; Jonathan Rosand; Martin Dichgans; Christina Jern; Daniel Strbian; Israel Fernandez-Cadenas; Ramin Zand; Ynte Ruigrok; Natalia Rost; Robin Lemmens; Peter M Rothwell; Christopher D Anderson; Joanna Wardlaw; Cathryn M Lewis; Hugh S Markus
Journal:  Lancet Neurol       Date:  2021-03-25       Impact factor: 59.935

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.