Literature DB >> 27701450

Multivariate Analysis of Anthropometric Traits Using Summary Statistics of Genome-Wide Association Studies from GIANT Consortium.

Haeil Park1,2, Xiaoyin Li1, Yeunjoo E Song1, Karen Y He1, Xiaofeng Zhu1.   

Abstract

Meta-analysis of single trait for multiple cohorts has been used for increasing statistical power in genome-wide association studies (GWASs). Although hundreds of variants have been identified by GWAS, these variants only explain a small fraction of phenotypic variation. Cross-phenotype association analysis (CPASSOC) can further improve statistical power by searching for variants that contribute to multiple traits, which is often relevant to pleiotropy. In this study, we performed CPASSOC analysis on the summary statistics from the Genetic Investigation of ANthropometric Traits (GIANT) consortium using a novel method recently developed by our group. Sex-specific meta-analysis data for height, body mass index (BMI), and waist-to-hip ratio adjusted for BMI (WHRadjBMI) from discovery phase of the GIANT consortium study were combined using CPASSOC for each trait as well as 3 traits together. The conventional meta-analysis results from the discovery phase data of GIANT consortium studies were used to compare with that from CPASSOC analysis. The CPASSOC analysis was able to identify 17 loci associated with anthropometric traits that were missed by conventional meta-analysis. Among these loci, 16 have been reported in literature by including additional samples and 1 is novel. We also demonstrated that CPASSOC is able to detect pleiotropic effects when analyzing multiple traits.

Entities:  

Mesh:

Year:  2016        PMID: 27701450      PMCID: PMC5049793          DOI: 10.1371/journal.pone.0163912

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

For over a decade, genome-wide association studies (GWASs) have been a major tool for detecting genetic variants underlying complex traits [1], including various anthropometric traits. Anthropometric traits such as height and body mass index (BMI) are highly heritable [2, 3], but the genetic loci that have been reported by large GWASs so far only account for a small portion of heritability, suggesting additional variants still need to be uncovered. Up until now, most GWASs were performed by examining a single trait association, although it has been frequently observed that the same variant or gene can be associated with multiple traits [4]. During the past decade, the number of meta-analysis of GWASs using multiple cohorts has increased in order to maximize the statistical power in detecting significant genetic loci associated with a trait [5]. Several large-scale meta-analyses have been conducted for anthropometric traits, such as height, BMI, and waist-to-hip ratio (WHR) [6-11]. Among these three anthropometric traits, height is a classic polygenic trait that has been heavily studied in order to understand the genetic architecture of complex traits or diseases [9]. BMI is a convenient measure of overall adiposity as well as obesity, which is a risk factor for many diseases. WHR is a measure of body fat distribution and WHR adjusted for BMI (WHRadjBMI) is positively associated with mortality. Up until recently, 423, 97, and 49 loci have been identified to be associated with height, BMI, and WHRadjBMI in European populations, respectively [9-11]. Several multivariate analysis approaches for analyzing multiple phenotypes have been developed to further improve statistical power and provide biological interpretation of results, and their pros and cons were well reviewed [4]. Statistical methods of integrating evidence from summary statistics of multiple traits have recently been developed [12-14]. In traditional meta-analysis studies, the samples from different cohorts are assumed to be independent. Overlapped or related samples between different cohorts and correlated traits within the same cohorts would result in correlation among effect sizes, and both would lead to an inflated type I error rate. However, this issue can be resolved by estimating the correlation of effect sizes using summary statistics from genetic markers across the genome [12, 13, 15]. Solovieff et al. [4] coined the association of a genetic variant with multiple traits as cross-phenotype (CP) association. Zhu et al. demonstrated that power could be improved by testing multiple traits using both simulated and real data [12, 16]. CP association implies potential pleiotropy where a variant is associated with multiple traits regardless of underlying causes. Thus, CP association is more general than pleiotropy. Statistical software package for CP association analysis (CPASSOC) was developed to meta-analyze association evidence of genetic variant with correlated traits [12]. In this paper, we hypothesized that genetic variants commonly underlying the three anthropometric traits can be identified by combining all three traits when single trait analyses may not have enough power to detect them. Since the three traits are correlated, conventional meta-analysis assuming independence will result in an inflated type I error. In addition, it is unclear whether there are any overlapped or related samples among GIANT consortium studies, which would lead to correlated summary statistics among cohorts. Findings from meta-analysis accounting for correlation among cohorts will result in a more accurate estimation of genetic effects on traits. In this study, we compared the results of CPASSOC with that from conventional meta-analysis of GIANT consortium studies when using the same discovery phase data set. We performed meta-analysis with summary statistics of height, BMI, and WHRadjBMI obtained from the GIANT consortium [17] using the CPASSOC software.

Results

GIANT consortium provided sex-specific summary statistics for the three traits: height, BMI and WHRadjBMI [17], which were downloaded from the GIANT consortium website (https://www.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files). The correlations among the six cohorts were calculated using the summary statistics of common SNPs after linkage disequilibrium pruning (Materials and Methods, S1 Table). The correlation between male and female summary statistics is 0.03, 0.096 and 0.0126 for WHRadjBMI, BMI and height, respectively (S2 Table). The correlations for BMI and height are still high despite of excluding the SNPs with absolute Z score larger than 1.96. The large correlations between male and female summary statistics for height and BMI suggest that many genetic variants with small effect sizes contribute both trait variations besides the possible overlapped or related samples. The correlation was much smaller between traits than within traits (S2 Table). We next calculated statistics S and S in CPASSOC (Materials and Methods). S makes an assumption that genetic effect is homogenous across traits and cohorts while S assumes genetic heterogeneity. Fig 1 presents the Q-Q plots for height, BMI, and WHRadjBMI individually as well as combined together, using S and S. The genomic control values (λ) for statistics S, are 1.078, 1.079, 1.025 and 1.092 for height, BMI, WHRadjBMI and combining the three traits, respectively. For S, the λ values are 0.998, 1.006, 0.994 and 0.992 for height, BMI, WHRadjBMI and combining three traits, respectively. We did not observe any inflation for S but slight inflation was observed for height and BMI for S. Both height and BMI have many causal variants, which can inflate the corresponding λ values. In the original and the latest GIANT reports [18, 19], the λ values were 1.42 for height and 1.526 for BMI. Our observed λ values for both S and S are much smaller than that in GIANT reports. Thus, our observed association evidence is unlikely due to the effects of population stratification. We observed that females have substantially more genetic contribution than males for WHRadjBMI (Fig 1C). Table 1 compares the number of significant loci (P < 5 × 10−8) detected by CPASSOC and conventional meta-analysis by GIANT. For height, both conventional meta-analysis and S identified 116 loci and 3 of them were missed by either method, while S identified 89 loci. For BMI, S and S identified 20 and 17 loci, respectively, while the conventional meta-analysis detected 18 loci. For WHRadjBMI, S and S identified 11 and 14 loci, respectively, while meta-analysis detected 11 loci. The detailed P-values and effect sizes for these variants in CPASSOC and conventional meta-analysis are presented in S3 Table. P-values for combining the three traits, S and S identified 55 and 129 loci, respectively, while conventional meta-analysis identified 122 loci for three traits together. The detailed P-values and effect sizes for these variants in CPASSOC and conventional meta-analysis are presented in S4 Table. The reason for substantially fewer variants detected by S is due to different effect directions in the three traits.
Fig 1

Q-Q plots of GIANT discovery result, CPASSOC S and S

(A) Height. (B) BMI. (C) WHRadjBMI. (D) Combining three traits.

Table 1

The number of genome-wide significant loci identified by CPASSOC for height, BMI, and WHRadjBMI from sex-specific data of discovery phase.

GIANT Consortium Studiesb
TraitCPASSOC MethodaP < 5 × 10−8P > 5 × 10−8Total
HeightSHomP < 5 × 10−81133116
P > 5 × 10−833
Total1163
SHetP < 5 × 10−889089
P > 5 × 10−82727
Total1160
BMISHomP < 5 × 10−817320
P > 5 × 10−811
Total183
SHetP < 5 × 10−816117
P > 5 × 10−822
Total181
WHRadjBMISHomP < 5 × 10−810111
P > 5 × 10−811
Total111
SHetP < 5 × 10−811314
P > 5 × 10−800
Total113

Note: CPASSOC (cross-phenotype association), GIANT (genetic investigation of anthropometric traits), BMI (body mass index), WHRadjBMI (waist-to-hip ratio adjusted for body mass index)

aCPASSOC was applied to meta-analyze male and female data for each of the three traits.

bThe result of conventional meta-analyses of discovery phase data for each of the three traits.

Q-Q plots of GIANT discovery result, CPASSOC S and S

(A) Height. (B) BMI. (C) WHRadjBMI. (D) Combining three traits. Note: CPASSOC (cross-phenotype association), GIANT (genetic investigation of anthropometric traits), BMI (body mass index), WHRadjBMI (waist-to-hip ratio adjusted for body mass index) aCPASSOC was applied to meta-analyze male and female data for each of the three traits. bThe result of conventional meta-analyses of discovery phase data for each of the three traits. The Manhattan plots for the trait specific analysis are presented in Fig 2. The three loci identified by S but not by conventional meta-analysis for height are rs4676386, rs2597513, and rs8181166 (Table 2 and Fig 2A). Variants rs2597513 and rs8181166 were reported to be associated with height in meta-analyses combining discovery and follow-up phases [6], which had a substantially increased sample size. The region of rs4676386 has been reported in [9], and the reported variant rs4344931 was located 43,541 bp away from rs4676386. Those two variants were in weak linkage disequilibrium (r2 = 0.18).
Fig 2

Manhattan plots of conventional meta-analysis, S and S for three traits.

The loci reaching genome-wide significance by S and S but not by conventional meta-analysis are marked with corresponding SNP names. (A) Height by conventional meta-analysis, S and S; (B) BMI by conventional meta-analysis, S and S; (C) WHRadjBMI by conventional meta-analysis, S and S.

Table 2

Additional genome-wide significant loci identified by CPASSOC for each trait but not conventional meta-analysis of GIANT consortium.

CPASSOCConventional Meta-analysis of GIANT consortiumSex-specific Meta-analysis of GIANT consortium
SHOMSHETMaleFemale
SNPChrPosition (bp)Effect / other alleleFrequency (effect allele)P-valueP-valueP-valueBeta (Males)SE (Males)P-value (Males)Beta (Females)SE (Females)P-value (Females)
Heightrs46763862240835569A/T0.444..19E-082.16E-73.10E-070.0220.00635.90E-030.0270.00584.99E-06
rs2597513313514336T/C0.882.51E-081.30E-071.14E-07-0.0410.023.88E-05-0.040.00932.11E-05
rs8181166986501713C/G0.521.73E-089.04E-081.09E-070.0170.0636.3E-030.0320.0587.21E-08
BMIrs13078807385835000A/G0.823.77E-082.65E-071.06E-07-0.0250.00791.4E-03-0.0370.00761.22E-06
rs131073254102267552T/C0.123.15E-082.22E-071.37E-070.0620.0141.1E-050.0540.0141.6E-04
rs38102911947065746A/G0.634.13E-082.89E-071.4E-070.0310.00732.59E-050.0280.00717.46E-05
rs178063132052478024A/G0.731.65E-061.83E-081.07E-060.00480.0070.490.040.00673.5E-09
WHRadjBMIrs9864077364719215T/C0.761.96E-084.62E-081.85E-070.0180.00730.0160.0380.00711.06E-07
rs4684854312447383C/G0.431.3E-054.13E-081E-044E-040.00710.960.040.00712.36E-08
rs23015733129587076T/C0.931.21E-072.88E-113.68E-060.00250.0120.830.0740.0229.33E-11

Manhattan plots of conventional meta-analysis, S and S for three traits.

The loci reaching genome-wide significance by S and S but not by conventional meta-analysis are marked with corresponding SNP names. (A) Height by conventional meta-analysis, S and S; (B) BMI by conventional meta-analysis, S and S; (C) WHRadjBMI by conventional meta-analysis, S and S. For BMI, S identified rs13078807, rs13107325, and rs3810291 and S identified rs17806313, all of them were missed by conventional meta-analysis (Table 2 and Fig 2B). Among these four variants, rs13078807, rs13107325, and rs3810291 were validated in the follow-up stage [7] and rs17806313 was only significant in females [17] (Table 2). For WHRadjBMI, both S and S identified rs9864077, which was missed by conventional meta-analysis. S identified two additional loci (rs4684854, rs2301573, Table 2 and Fig 2C). Both rs4684854 and rs2301573 were not genome-wide significant in the combined analysis of discovery and replication phases [8], but they were genome-wide significant in females only (P = 2.36 × 10−08 and P = 9.93 × 10−11, respectively) (Table 2) [17]. When combining three gender specific traits, 7 independent variants were identified by S and S but were missed by the conventional meta-analysis (Table 3 and Fig 3). Variants rs9324162 and rs17391694 were detected by both S and S and were located more than 500 kb apart. These two variants were in weak linkage disequilibrium (r2 = 0.4). Thus, we consider them as two separate signals. Variant rs17391694 was reported to be associated with height by the GIANT study using conventional meta-analysis when combining discovery and follow-up phase data [6] [17]. The remaining five variants in Table 3 were only identified by S. SNP rs6441170 as well as rs13107325 were reported in later GIANT studies for height [9] and BMI [10], respectively. Variant rs17806313 was significantly associated with BMI in females only (Table 3). SNP rs4488509 is in high linkage disequilibrium (r2 = 0.84) with reported SNP rs4640244 in later GIANT studies for height [6], and they are located 450 bp away. Variant rs7842858, has not been reported previously. SNP rs7842858 is located on TOX (Fig 4), which has been reported to be associated with obesity and diabetes [20].
Table 3

Genome-wide significant loci that were only detected by CPASSOC when combining 3 traits but not by conventional meta-analysis of GIANT consortium from discovery phase.

CPASSOCConventional Meta-analysis of GIANT consortiumSex-specific Meta-analysis of GIANT consortium
HeightBMIWHRadjBMIHeightBMIWHRadjBMI
SNPChrPosition (bp)Effect / other alleleFrequency (effect allele)P-valueP-valueP-valueP-valueP-valueP-value (Males)P-value (Females)P-value (Males)P-value (Females)P-value (Males)P-value (Females)
rs9324162177489119A/G0.841.56E-087.97E-103.52E-063.51E-050.472.94E-051.5E-031.8E-032.7E-030.630.11
rs17391694178157942T/C0.121.47E-084.51E-095.92E-071.52E-040.383.91E-055.2E-040.0121.1E-030.850.35
rs64411703158098171T/C0.650.0613.22E-084.87E-070.00110.195.9E-041.11E-050.0190.0840.110.87
rs131073254102267552T/C0.120.333.49E-082.5E-031.37E-070.160.0160.021.1E-051.6E-040.10.73
rs7842858a858987111T/C0.646.95E-052.49E-082.33E-060.6670.023.1E-031E-046.9E-030.0410.180.047
rs44885091721381361A/T0.440.421.94E-085.33E-077.63E-040.131.55E-051.4E-040.0164.1E-030.20.48
rs178063132052478024A/G0.732.06E-041.48E-080.6091.07E-050.270.0950.410.493.5E-090.0230.53

aNew locus that has not been reported previously.

Fig 3

Manhattan plots of S and S for combining three gender specific traits.

The loci reaching genome-wide significance by S and S but not by the conventional meta-analysis are marked with corresponding SNP names. (A) S (B) S

Fig 4

Regional plots of the novel locus rs7842858 identified by S.

Manhattan plots of S and S for combining three gender specific traits.

The loci reaching genome-wide significance by S and S but not by the conventional meta-analysis are marked with corresponding SNP names. (A) S (B) S aNew locus that has not been reported previously. To compare the variants identified by S and S, we plotted the forest plots of the effect sizes for the SNPs in Tables 2 and 3, which were missed by the conventional meta-analysis in GIANT. We observed that the effect sizes were similar for variants only detected by S (Fig 5A, rs4676386, rs2597513, rs8181166, rs13078807, rs13107325, and rs3810291). All the variants detected by both S and S showed some degree of heterogeneity (Fig 5B, rs9864077, rs9324162, and rs17391694). The variants detected only by S showed large amount of heterogeneity (Fig 5C, right panel of SNPs). In particularly, variants rs17806313, rs4684854 and rs2301573 only have genetic effects in females for BMI and WHRadjBMI. Variants rs644170 and rs13107325 have positive effects on BMI but negative effects on height and WHRadjBMI. Variant rs7842858 has a negative effect on height and WHRadjBMI in both genders, negative effect on BMI in males, and positive effect on BMI in females (Fig 5). The effect of rs4488509 is negative for height and positive for BMI and WHRadjBMI. The effect of rs17806313 shows substantial heterogeneity between males and females in each of the three traits.
Fig 5

The forest plots of the effect sizes for the SNPs presented in Table 2 and 3.

(A): variants only detected by S; (B): variants detected by both S and S; (C) variants only detected by S (right panel).

The forest plots of the effect sizes for the SNPs presented in Table 2 and 3.

(A): variants only detected by S; (B): variants detected by both S and S; (C) variants only detected by S (right panel).

Discussion

In this study, we performed CPASSOC analysis using the summary statistics available from a GIANT consortium study [17]. Our results showed that CPASSOC was able to identify most of the loci detected by conventional meta-analysis of GIANT consortium. CPASSOC was also able to identify 17 loci reaching genome-wide significance (P < 5 × 10−8) that were missed by conventional meta-analysis for a single trait analysis when using the same discovery phase data. All 10 loci (Table 2) detected by CPASSOC that combine the effects of males and females for height and BMI have been replicated by GIANT using either additional replication samples or are significant in either males or female [6, 7, 11]. Our results showed that CPASSOC analysis can improve statistical power over conventional meta-analysis by combining males and females for individual traits. When combining male and female summary statistics for height, the statistic S identified fewer loci than either S or the conventional meta-analysis method, suggesting that there is rare heterogeneity between males and females for height. However, the current study only analyzed autosomal variants rather than variants on the sex chromosome. When combining the three traits, S identified 55 loci while S identified 129 loci. The substantial fewer number of variants detected by S suggests that many variants have different effect directions in the three traits; therefore, the association evidence by S is diluted. This result further indicates that the statistic S would be less powerful when combining multiple different traits, where substantial heterogeneity exists (Fig 5). In our analysis of combining height, BMI and WHRadjBMI, CPASSOC was able to identify additional 7 loci (Table 3) that were missed by conventional meta-analysis (Table 3 and Fig 3). Five variants (rs9324162, rs17391694, rs6441170, 13107325, and rs17806313) have been reported to be associated with height, BMI or in gender-specific analysis by the GIANT consortium using additional data [6, 9, 10, 21]. CPASSOC also identified one novel locus (rs7842858) that has not been reported to be associated with any of the three traits previously. This variant shows substantial heterogeneity among the three traits. In particular, variant rs7842858 has a positive effect on the three traits in males but negative effect in females (Fig 5), which leads to the failure of detection by either conventional single trait meta-analysis or combining the three traits by S. SNP rs7842858 (8:58987111) resides within TOX (Fig 4), which has been reported to be associated with Type 2 Diabetes in Chinese Han population [20]. It is possible that this variant affects obesity variation and therefore indirectly contributes to Type 2 Diabetes. However, this hypothesis requires further studies. The CPASSOC analysis of combining height, BMI and WHRadjBMI also demonstrated pleotropic effects for all 7 loci (Fig 5). This is because the association evidence of these loci could not be detected by single trait analysis using the GIANT discovery data. The association evidence could only be detected when combining the three traits. We also observed substantial genetic heterogeneity among these three traits. When pleotropic effects exist, CPASSOC analyses of multiple traits clearly improve statistical power to detect these genetic effects. In particular, statistic S is able to detect many variants with pleotropic effects; however, it may lose power when there is no heterogeneity, as observed in single trait analysis. Our analysis applied a significance level 5 × 10−8, which is widely used in GWAS including the original GIANT publications. This criterion makes the results identified by CPASSOC comparable with that from GIANT results. Alternatively, the methods by [22], which requires genotype data, may result better genome-wide significance level. Since we only analyzed the summary statistics obtained from GIANT website, we did not applied the method by [22]. When we used a genome-wide significance level 1.0 × 10−7, we were able to identify additional 134 loci (S5 Table). These variants should be further examined in independent studies. The CPASSOC [12] was designed to combine the association evidence across multiple, sometimes seemingly distinct phenotypes. Cross phenotype association will increase statistical power when analyzed traits share common variants or common genetic pathways, which may reflect the relevance of pleiotropy [4]. Thus, cross phenotype association analysis is preferred when a genetic variant affects more than one trait either through biological mediated pleiotropy. For example, in genetic study of hypertension, different measurements of blood pressure such as systolic, diastolic blood pressure, and hypertension status can be analyzed using CPASSOC [12].The hypertension related traits can be analyzed together with cardiovascular traits because of sharing potential common pathways. In the current study, we analyzed the anthropometric traits by cross phenotype analysis because these traits are likely share common biological pathways. Many traits have demonstrates different biology between males and females [17]. Sex-specific analysis is able to detect specific variants to a sex group and therefore such analysis should always be performed. On the other hand, males and female share common biology and combined analysis will increase a study sample size and consequently increase statistical power for identifying variants shared by both males and females. Our results show that CPASSOC analysis can detect sex-specific variants as well as the variants shared by both sexes. We also observed much larger correlations between male and female summary statistics for height and BMI compared to that of WHRadjBMI (S2 Table), although we pruned SNPs with linkage disequilibrium and excluded SNPs with large summary statistics. These high correlations for height and BMI may not be surprising because of the contributions of many genetic variants with small effect sizes for both traits. In comparison, the lower correlation between male and female summary statistics for WHRadjBMI may suggest that most of genetic contribution is shared between BMI and WHR. There are several recently developed statistical approaches in analyzing multiple traits at summary statistics level, including the ASSET [23] and the Cross Phenotype Meta-Analysis (CPMA) [24]. The ASSET is suitable for identifying a subset of associated traits while CPASSOC directly evaluates the aggregated association evidence between a SNP to multiple phenotypes. CPASSOC is much faster than the ASSET computationally because ASSET has to search all possible subsets. When the number of traits and studies increased, the number of possible subsets can grow exponentially. The Cross Phenotype Meta-Analysis (CPMA) was developed to test whether there is association of a SNP to multiple phenotypes, or a true pleiotropy effect. When one of traits is not associated with a SNP, CPMA will not have statistical power. It will be interesting to perform these two tests to search for the subsets of traits associated with a SNP or to test for pleiotropy effect in the GIANT consortium. In conclusion, CPASSOC identified 17 loci associated with anthropometric traits that were missed by conventional meta-analysis for a single anthropometric trait in GIANT consortium studies that used the same discovery phase data as we did. CPASSOC is also able to detect pleiotropic effects when analyzing multiple traits.

Materials and Methods

Datasets

The summary statistics of height, BMI and WHRadjBMI were downloaded from the GIANT consortium website (https://www.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files). The downloaded data include sex-specific [17] and sex-combined [6-8] GWASs meta-analysis summary statistics from the discovery phase. For discovery stage in sex- specific studies, 46 studies (up to 60,586 males, 73,137 females) on height and BMI and 32 studies (up to 34,629 males, 42,969 females) on WHR were included [17]. In the discovery phase of sex-combined studies, summary statistics were collected from 46 GWASs in a meta-analysis of 133,653 individuals (60,587 males and 73,066 females) for height [6], 46 studies with up to 123,865 individuals for BMI [7], and 32 studies with up to 77,167 individuals (34,601 males and 42,735 females) for WHRadjBMI [8].

Cross-phenotype association analysis

We applied the CPASSOC package developed by Zhu et al. [12] to combine association evidence of both sexes with height, BMI and WHRadjBMI. CPASSOC can integrate association evidence from summary statistics of multiple traits. It uses summary-level data from single SNP-trait association of GWASs to detect which variant is associated with at least one trait. This method improves statistical power by analyzing multiple phenotypes and it can be executed with the summary statistics from GWASs. CPASSOC provides two statistics, S and S. S is similar to the fixed effect meta-analysis method [25] but accounting for the correlation of summary statistics among cohorts induced by potential overlapped or related samples. In brief, assuming we have summary statistical results of GWAS from J cohorts with K phenotypic traits. In each cohort, single SNP-trait association was analyzed for each trait separately. Let T be a summary statistic for a SNP, j cohort and k trait. Let T = (T11,⋯,T, ⋯,T1,⋯, T) represents a vector of test statistics for testing the association of a SNP with K traits. We used a Wald test statistic , where are the estimated coefficient and corresponding standard error for the k trait in the j cohort. S is then defined as which follows a χ2 distribution with one degree of freedom, where e = (1,…,1) has length J × K and W is a diagonal matrix of weights for the individual test statistics. We used the sample sizes for the weights, i.e., for the sample size n of the j cohort. To define S we first let where T(τ) is the sub-vector of T satisfying |T| > τ for a given τ>0, and R(τ) is a sub-matrix of R representing the correlation matrix, and W(τ) be the diagonal submatrix of W, corresponding to T(τ). Here we let . Then the test statistic is S = maxS(τ). The asymptotic distribution of S does not follow a standard distribution but can be evaluated using simulation. S is an extension of S but power can be improved when the genetic effect sizes vary for different traits. The distribution of S under the null hypothesis can be obtained through simulations or approximated by an estimated beta distribution. We first applied both S and S to combine sex-specific summary statistics for each of the three traits and compared the results with those from conventional meta-analysis of the same discovery phase data in GIANT consortium studies [6-8]. We next applied both S and S for combining all the sex-specific summary statistics of the three traits: height, BMI and WHRadjBMI. We hypothesized that meta-analyzing multiple traits would allow us to identify additional variants that are likely to be missed by the conventional meta-analyses for a single trait. To perform CPASSOC analysis, a correlation matrix is required to account for the correlation among phenotypes or induced by overlapped or related samples from different cohorts. Zhu et al. [12] suggested using a set of SNPs in linkage equilibrium to estimate the correlation coefficients. We selected the SNP set based on linkage disequilibrium (LD) pattern in the ARIC European American (EA) data (downloaded from dbGaP http://www.ncbi.nlm.nih.gov/gap). In brief, the ARIC EA cohort includes 9,707 individuals with approximately 840,000 SNPs genotyped on the Affymetrix Array 6.0 [26, 27]. We first applied pairwise LD pruning with r2 threshold of 0.2 using the software PLINK (http://pngu.mgh.harvard.edu/purcell/plink/). SNPs with large effect sizes may represent true association, and consequently may inflate correlation among summary statistics. Therefore, we removed SNPs whose summary statistics Z scores were greater than 1.96 or less than -1.96. The final SNP sets for correlation estimation include 81,322 SNPs for height, 82,012 SNPs for BMI, and 81,130 SNPs for WHRadjBMI. We chose the common sets of SNPs for both sexes and three traits that can be mapped to dbSNP human Build 142 to perform the CPASSOC analyses. The numbers of SNPs used in this study are presented in S1 Table. We reported loci that reached genome-wide significance (P < 5 × 10−8) by CPASSOC from sex-specific data [17], but not by sex-combined conventional meta-analysis [6-8] when using the same samples from the discovery phase. Here we applied the same significant level P = 5 × 10−8 as in GWAS because CPASSOC performs the same number of tests although multiple traits are analyzed. To do this, for a SNP reaching P < 5 × 10−8 by either S or S, we examined the region within 500 kb of each side of the SNP. The SNP was considered to be identified only by CPASSOC if no SNPs that are genome-wide significant with conventional meta-analysis from the discovery phase data were found in the 1.0 Mb region, and it is not in LD with the index SNPs of the GIANT studies. We performed meta-analysis by combining male and female data for each trait separately, as well as by combining all the three traits and both sexes.

The number of SNPs mappable to dbSNP human Build 142, which were used in meta-analysis with CPASSOC.

aSNPs used for conventional meta-analyses of sex-combined discovery phase data in the GIANT consortium studies. bIntersection set of SNPs used for conventional meta-analyses of sex-specific discovery phase data in the GIANT consortium studies. cIntersection set of SNPs between sex-combined and sex-specific study. dIntersection set of SNPs among height, BMI, and WHRadjBMI. (DOCX) Click here for additional data file.

Correlations between male and female cohort for each trait and those between combinations of sex and trait.

(DOCX) Click here for additional data file.

SNPs representing identified through either conventional meta-analysis (sex combined and sex-specific) or CPASSOC for height, BMI, and WHRadjBMI.

(XLSX) Click here for additional data file.

SNPs representing identified through either CPASSOC for combining 3 sex-specific traits or conventional meta-analysis of GIANT consortium (sex-combined and sex-specific) in discovery phase data.

(XLSX) Click here for additional data file.

SNPs with P-values between 1E-7 and 5E-8 for either S or S from CPASSOC when combining three traits.

(XLSX) Click here for additional data file.
  25 in total

1.  A subset-based approach improves power and interpretation for the combined analysis of genetic association studies of heterogeneous traits.

Authors:  Samsiddhi Bhattacharjee; Preetha Rajaraman; Kevin B Jacobs; William A Wheeler; Beatrice S Melin; Patricia Hartge; Meredith Yeager; Charles C Chung; Stephen J Chanock; Nilanjan Chatterjee
Journal:  Am J Hum Genet       Date:  2012-05-04       Impact factor: 11.025

2.  Genome-wide association analysis of blood-pressure traits in African-ancestry individuals reveals common associated genes in African and non-African populations.

Authors:  Nora Franceschini; Ervin Fox; Zhaogong Zhang; Todd L Edwards; Michael A Nalls; Yun Ju Sung; Bamidele O Tayo; Yan V Sun; Omri Gottesman; Adebawole Adeyemo; Andrew D Johnson; J Hunter Young; Ken Rice; Qing Duan; Fang Chen; Yun Li; Hua Tang; Myriam Fornage; Keith L Keene; Jeanette S Andrews; Jennifer A Smith; Jessica D Faul; Zhang Guangfa; Wei Guo; Yu Liu; Sarah S Murray; Solomon K Musani; Sathanur Srinivasan; Digna R Velez Edwards; Heming Wang; Lewis C Becker; Pascal Bovet; Murielle Bochud; Ulrich Broeckel; Michel Burnier; Cara Carty; Daniel I Chasman; Georg Ehret; Wei-Min Chen; Guanjie Chen; Wei Chen; Jingzhong Ding; Albert W Dreisbach; Michele K Evans; Xiuqing Guo; Melissa E Garcia; Rich Jensen; Margaux F Keller; Guillaume Lettre; Vaneet Lotay; Lisa W Martin; Jason H Moore; Alanna C Morrison; Thomas H Mosley; Adesola Ogunniyi; Walter Palmas; George Papanicolaou; Alan Penman; Joseph F Polak; Paul M Ridker; Babatunde Salako; Andrew B Singleton; Daniel Shriner; Kent D Taylor; Ramachandran Vasan; Kerri Wiggins; Scott M Williams; Lisa R Yanek; Wei Zhao; Alan B Zonderman; Diane M Becker; Gerald Berenson; Eric Boerwinkle; Erwin Bottinger; Mary Cushman; Charles Eaton; Fredrik Nyberg; Gerardo Heiss; Joel N Hirschhron; Virginia J Howard; Konrad J Karczewsk; Matthew B Lanktree; Kiang Liu; Yongmei Liu; Ruth Loos; Karen Margolis; Michael Snyder; Bruce M Psaty; Nicholas J Schork; David R Weir; Charles N Rotimi; Michele M Sale; Tamara Harris; Sharon L R Kardia; Steven C Hunt; Donna Arnett; Susan Redline; Richard S Cooper; Neil J Risch; D C Rao; Jerome I Rotter; Aravinda Chakravarti; Alex P Reiner; Daniel Levy; Brendan J Keating; Xiaofeng Zhu
Journal:  Am J Hum Genet       Date:  2013-08-22       Impact factor: 11.025

3.  Meta-analysis of correlated traits via summary statistics from GWASs with an application in hypertension.

Authors:  Xiaofeng Zhu; Tao Feng; Bamidele O Tayo; Jingjing Liang; J Hunter Young; Nora Franceschini; Jennifer A Smith; Lisa R Yanek; Yan V Sun; Todd L Edwards; Wei Chen; Mike Nalls; Ervin Fox; Michele Sale; Erwin Bottinger; Charles Rotimi; Yongmei Liu; Barbara McKnight; Kiang Liu; Donna K Arnett; Aravinda Chakravati; Richard S Cooper; Susan Redline
Journal:  Am J Hum Genet       Date:  2014-12-11       Impact factor: 11.025

4.  Pervasive sharing of genetic effects in autoimmune disease.

Authors:  Chris Cotsapas; Benjamin F Voight; Elizabeth Rossin; Kasper Lage; Benjamin M Neale; Chris Wallace; Gonçalo R Abecasis; Jeffrey C Barrett; Timothy Behrens; Judy Cho; Philip L De Jager; James T Elder; Robert R Graham; Peter Gregersen; Lars Klareskog; Katherine A Siminovitch; David A van Heel; Cisca Wijmenga; Jane Worthington; John A Todd; David A Hafler; Stephen S Rich; Mark J Daly
Journal:  PLoS Genet       Date:  2011-08-10       Impact factor: 5.917

Review 5.  Pleiotropy in complex traits: challenges and strategies.

Authors:  Nadia Solovieff; Chris Cotsapas; Phil H Lee; Shaun M Purcell; Jordan W Smoller
Journal:  Nat Rev Genet       Date:  2013-06-11       Impact factor: 53.242

6.  The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators.

Authors: 
Journal:  Am J Epidemiol       Date:  1989-04       Impact factor: 4.897

7.  Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium: Design of prospective meta-analyses of genome-wide association studies from 5 cohorts.

Authors:  Bruce M Psaty; Christopher J O'Donnell; Vilmundur Gudnason; Kathryn L Lunetta; Aaron R Folsom; Jerome I Rotter; André G Uitterlinden; Tamara B Harris; Jacqueline C M Witteman; Eric Boerwinkle
Journal:  Circ Cardiovasc Genet       Date:  2009-02

8.  An Adaptive Association Test for Multiple Phenotypes with GWAS Summary Statistics.

Authors:  Junghi Kim; Yun Bai; Wei Pan
Journal:  Genet Epidemiol       Date:  2015-10-22       Impact factor: 2.135

9.  Hundreds of variants clustered in genomic loci and biological pathways affect human height.

Authors:  Hana Lango Allen; Karol Estrada; Guillaume Lettre; Sonja I Berndt; Michael N Weedon; Fernando Rivadeneira; Cristen J Willer; Anne U Jackson; Sailaja Vedantam; Soumya Raychaudhuri; Teresa Ferreira; Andrew R Wood; Robert J Weyant; Ayellet V Segrè; Elizabeth K Speliotes; Eleanor Wheeler; Nicole Soranzo; Ju-Hyun Park; Jian Yang; Daniel Gudbjartsson; Nancy L Heard-Costa; Joshua C Randall; Lu Qi; Albert Vernon Smith; Reedik Mägi; Tomi Pastinen; Liming Liang; Iris M Heid; Jian'an Luan; Gudmar Thorleifsson; Thomas W Winkler; Michael E Goddard; Ken Sin Lo; Cameron Palmer; Tsegaselassie Workalemahu; Yurii S Aulchenko; Asa Johansson; M Carola Zillikens; Mary F Feitosa; Tõnu Esko; Toby Johnson; Shamika Ketkar; Peter Kraft; Massimo Mangino; Inga Prokopenko; Devin Absher; Eva Albrecht; Florian Ernst; Nicole L Glazer; Caroline Hayward; Jouke-Jan Hottenga; Kevin B Jacobs; Joshua W Knowles; Zoltán Kutalik; Keri L Monda; Ozren Polasek; Michael Preuss; Nigel W Rayner; Neil R Robertson; Valgerdur Steinthorsdottir; Jonathan P Tyrer; Benjamin F Voight; Fredrik Wiklund; Jianfeng Xu; Jing Hua Zhao; Dale R Nyholt; Niina Pellikka; Markus Perola; John R B Perry; Ida Surakka; Mari-Liis Tammesoo; Elizabeth L Altmaier; Najaf Amin; Thor Aspelund; Tushar Bhangale; Gabrielle Boucher; Daniel I Chasman; Constance Chen; Lachlan Coin; Matthew N Cooper; Anna L Dixon; Quince Gibson; Elin Grundberg; Ke Hao; M Juhani Junttila; Lee M Kaplan; Johannes Kettunen; Inke R König; Tony Kwan; Robert W Lawrence; Douglas F Levinson; Mattias Lorentzon; Barbara McKnight; Andrew P Morris; Martina Müller; Julius Suh Ngwa; Shaun Purcell; Suzanne Rafelt; Rany M Salem; Erika Salvi; Serena Sanna; Jianxin Shi; Ulla Sovio; John R Thompson; Michael C Turchin; Liesbeth Vandenput; Dominique J Verlaan; Veronique Vitart; Charles C White; Andreas Ziegler; Peter Almgren; Anthony J Balmforth; Harry Campbell; Lorena Citterio; Alessandro De Grandi; Anna Dominiczak; Jubao Duan; Paul Elliott; Roberto Elosua; Johan G Eriksson; Nelson B Freimer; Eco J C Geus; Nicola Glorioso; Shen Haiqing; Anna-Liisa Hartikainen; Aki S Havulinna; Andrew A Hicks; Jennie Hui; Wilmar Igl; Thomas Illig; Antti Jula; Eero Kajantie; Tuomas O Kilpeläinen; Markku Koiranen; Ivana Kolcic; Seppo Koskinen; Peter Kovacs; Jaana Laitinen; Jianjun Liu; Marja-Liisa Lokki; Ana Marusic; Andrea Maschio; Thomas Meitinger; Antonella Mulas; Guillaume Paré; Alex N Parker; John F Peden; Astrid Petersmann; Irene Pichler; Kirsi H Pietiläinen; Anneli Pouta; Martin Ridderstråle; Jerome I Rotter; Jennifer G Sambrook; Alan R Sanders; Carsten Oliver Schmidt; Juha Sinisalo; Jan H Smit; Heather M Stringham; G Bragi Walters; Elisabeth Widen; Sarah H Wild; Gonneke Willemsen; Laura Zagato; Lina Zgaga; Paavo Zitting; Helene Alavere; Martin Farrall; Wendy L McArdle; Mari Nelis; Marjolein J Peters; Samuli Ripatti; Joyce B J van Meurs; Katja K Aben; Kristin G Ardlie; Jacques S Beckmann; John P Beilby; Richard N Bergman; Sven Bergmann; Francis S Collins; Daniele Cusi; Martin den Heijer; Gudny Eiriksdottir; Pablo V Gejman; Alistair S Hall; Anders Hamsten; Heikki V Huikuri; Carlos Iribarren; Mika Kähönen; Jaakko Kaprio; Sekar Kathiresan; Lambertus Kiemeney; Thomas Kocher; Lenore J Launer; Terho Lehtimäki; Olle Melander; Tom H Mosley; Arthur W Musk; Markku S Nieminen; Christopher J O'Donnell; Claes Ohlsson; Ben Oostra; Lyle J Palmer; Olli Raitakari; Paul M Ridker; John D Rioux; Aila Rissanen; Carlo Rivolta; Heribert Schunkert; Alan R Shuldiner; David S Siscovick; Michael Stumvoll; Anke Tönjes; Jaakko Tuomilehto; Gert-Jan van Ommen; Jorma Viikari; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael A Province; Manfred Kayser; Alice M Arnold; Larry D Atwood; Eric Boerwinkle; Stephen J Chanock; Panos Deloukas; Christian Gieger; Henrik Grönberg; Per Hall; Andrew T Hattersley; Christian Hengstenberg; Wolfgang Hoffman; G Mark Lathrop; Veikko Salomaa; Stefan Schreiber; Manuela Uda; Dawn Waterworth; Alan F Wright; Themistocles L Assimes; Inês Barroso; Albert Hofman; Karen L Mohlke; Dorret I Boomsma; Mark J Caulfield; L Adrienne Cupples; Jeanette Erdmann; Caroline S Fox; Vilmundur Gudnason; Ulf Gyllensten; Tamara B Harris; Richard B Hayes; Marjo-Riitta Jarvelin; Vincent Mooser; Patricia B Munroe; Willem H Ouwehand; Brenda W Penninx; Peter P Pramstaller; Thomas Quertermous; Igor Rudan; Nilesh J Samani; Timothy D Spector; Henry Völzke; Hugh Watkins; James F Wilson; Leif C Groop; Talin Haritunians; Frank B Hu; Robert C Kaplan; Andres Metspalu; Kari E North; David Schlessinger; Nicholas J Wareham; David J Hunter; Jeffrey R O'Connell; David P Strachan; H-Erich Wichmann; Ingrid B Borecki; Cornelia M van Duijn; Eric E Schadt; Unnur Thorsteinsdottir; Leena Peltonen; André G Uitterlinden; Peter M Visscher; Nilanjan Chatterjee; Ruth J F Loos; Michael Boehnke; Mark I McCarthy; Erik Ingelsson; Cecilia M Lindgren; Gonçalo R Abecasis; Kari Stefansson; Timothy M Frayling; Joel N Hirschhorn
Journal:  Nature       Date:  2010-09-29       Impact factor: 49.962

10.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations.

Authors:  Danielle Welter; Jacqueline MacArthur; Joannella Morales; Tony Burdett; Peggy Hall; Heather Junkins; Alan Klemm; Paul Flicek; Teri Manolio; Lucia Hindorff; Helen Parkinson
Journal:  Nucleic Acids Res       Date:  2013-12-06       Impact factor: 16.971

View more
  13 in total

1.  An Analytic Solution to the Computation of Power and Sample Size for Genetic Association Studies under a Pleiotropic Mode of Inheritance.

Authors:  Derek Gordon; Douglas Londono; Payal Patel; Wonkuk Kim; Stephen J Finch; Gary A Heiman
Journal:  Hum Hered       Date:  2017-03-18       Impact factor: 0.444

2.  Adjustment for covariates using summary statistics of genome-wide association studies.

Authors:  Tao Wang; Xiaonan Xue; Xianhong Xie; Kenny Ye; Xiaofeng Zhu; Robert C Elston
Journal:  Genet Epidemiol       Date:  2018-09-20       Impact factor: 2.135

3.  Mendelian randomization and pleiotropy analysis.

Authors:  Xiaofeng Zhu
Journal:  Quant Biol       Date:  2020-10-21

Review 4.  Statistical methods to detect pleiotropy in human complex traits.

Authors:  Sophie Hackinger; Eleftheria Zeggini
Journal:  Open Biol       Date:  2017-11       Impact factor: 6.411

5.  A systematic SNP selection approach to identify mechanisms underlying disease aetiology: linking height to post-menopausal breast and colorectal cancer risk.

Authors:  Rachel J J Elands; Colinda C J M Simons; Mona Riemenschneider; Aaron Isaacs; Leo J Schouten; Bas A Verhage; Kristel Van Steen; Roger W L Godschalk; Piet A van den Brandt; Monika Stoll; Matty P Weijenberg
Journal:  Sci Rep       Date:  2017-01-24       Impact factor: 4.379

6.  Evaluation of shared genetic aetiology between osteoarthritis and bone mineral density identifies SMAD3 as a novel osteoarthritis risk locus.

Authors:  Sophie Hackinger; Katerina Trajanoska; Unnur Styrkarsdottir; Eleni Zengini; Julia Steinberg; Graham R S Ritchie; Konstantinos Hatzikotoulas; Arthur Gilly; Evangelos Evangelou; John P Kemp; David Evans; Thorvaldur Ingvarsson; Helgi Jonsson; Unnur Thorsteinsdottir; Kari Stefansson; Andrew W McCaskie; Roger A Brooks; Jeremy M Wilkinson; Fernando Rivadeneira; Eleftheria Zeggini
Journal:  Hum Mol Genet       Date:  2017-10-01       Impact factor: 6.150

7.  An iterative approach to detect pleiotropy and perform Mendelian Randomization analysis using GWAS summary statistics.

Authors:  Xiaofeng Zhu; Xiaoyin Li; Rong Xu; Tao Wang
Journal:  Bioinformatics       Date:  2021-06-16       Impact factor: 6.937

8.  Ancestry-specific associations identified in genome-wide combined-phenotype study of red blood cell traits emphasize benefits of diversity in genomics.

Authors:  Chani J Hodonsky; Antoine R Baldassari; Stephanie A Bien; Laura M Raffield; Heather M Highland; Colleen M Sitlani; Genevieve L Wojcik; Ran Tao; Marielisa Graff; Weihong Tang; Bharat Thyagarajan; Steve Buyske; Myriam Fornage; Lucia A Hindorff; Yun Li; Danyu Lin; Alex P Reiner; Kari E North; Ruth J F Loos; Charles Kooperberg; Christy L Avery
Journal:  BMC Genomics       Date:  2020-03-14       Impact factor: 4.547

9.  Multi-trait genome-wide association study identifies new loci associated with optic disc parameters.

Authors:  Pieter W M Bonnemaijer; Elisabeth M van Leeuwen; Adriana I Iglesias; Puya Gharahkhani; Veronique Vitart; Anthony P Khawaja; Mark Simcoe; René Höhn; Angela J Cree; Rob P Igo; Aslihan Gerhold-Ay; Stefan Nickels; James F Wilson; Caroline Hayward; Thibaud S Boutin; Ozren Polašek; Tin Aung; Chiea Chuen Khor; Najaf Amin; Andrew J Lotery; Janey L Wiggs; Ching-Yu Cheng; Pirro G Hysi; Christopher J Hammond; Alberta A H J Thiadens; Stuart MacGregor; Caroline C W Klaver; Cornelia M van Duijn
Journal:  Commun Biol       Date:  2019-11-27

10.  Nontrivial Replication of Loci Detected by Multi-Trait Methods.

Authors:  Zheng Ning; Yakov A Tsepilov; Sodbo Zh Sharapov; Zhipeng Wang; Alexander K Grishenko; Xiao Feng; Masoud Shirali; Peter K Joshi; James F Wilson; Yudi Pawitan; Chris S Haley; Yurii S Aulchenko; Xia Shen
Journal:  Front Genet       Date:  2021-02-03       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.