| Literature DB >> 23359524 |
Sophie van der Sluis1, Danielle Posthuma, Conor V Dolan.
Abstract
To date, the genome-wide association study (GWAS) is the primary tool to identify genetic variants that cause phenotypic variation. As GWAS analyses are generally univariate in nature, multivariate phenotypic information is usually reduced to a single composite score. This practice often results in loss of statistical power to detect causal variants. Multivariate genotype-phenotype methods do exist but attain maximal power only in special circumstances. Here, we present a new multivariate method that we refer to as TATES (Trait-based Association Test that uses Extended Simes procedure), inspired by the GATES procedure proposed by Li et al (2011). For each component of a multivariate trait, TATES combines p-values obtained in standard univariate GWAS to acquire one trait-based p-value, while correcting for correlations between components. Extensive simulations, probing a wide variety of genotype-phenotype models, show that TATES's false positive rate is correct, and that TATES's statistical power to detect causal variants explaining 0.5% of the variance can be 2.5-9 times higher than the power of univariate tests based on composite scores and 1.5-2 times higher than the power of the standard MANOVA. Unlike other multivariate methods, TATES detects both genetic variants that are common to multiple phenotypes and genetic variants that are specific to a single phenotype, i.e. TATES provides a more complete view of the genetic architecture of complex traits. As the actual causal genotype-phenotype model is usually unknown and probably phenotypically and genetically complex, TATES, available as an open source program, constitutes a powerful new multivariate strategy that allows researchers to identify novel causal variants, while the complexity of traits is no longer a limiting factor.Entities:
Mesh:
Year: 2013 PMID: 23359524 PMCID: PMC3554627 DOI: 10.1371/journal.pgen.1003235
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Schematic representation of the simulation settings and results.
Schematic representation of the simulation settings (a–f) and radar plot (g) of the power to detect 1 genetic variant (GV) explaining .5% of the phenotypic variance in 12 simulation settings. The power radar plot (power running from 0 (midpoint) to 1 (outer edge)) displays the power for the univariate sum score analyses (blue), MANOVA (green), and TATES (red). The phenotypic correlation structure was either due to one common factor (a,e), multiple underlying latent factors (b,c,d), or a network model (f). Within these phenotypic settings, the GV either affected multiple phenotypes via a common factor (a,b,c), or affect a single phenotype directly (d,e,f). Power results for 12 simulation settings and a GV explaining .5% of the variance are highlighted (g, colour labels corresponds to colour simulation settings; see Tables S1, S2, S3, S4, S5, S6, S7, S8, S9, S10, S11, S12 for more GV effect sizes). Specifically, gA1–3: 1-factor models with GV effect on the factor. gA1: mix of dichotomous, ordinal and continuous phenotypes correlating .36 to .81; gA2: continuous phenotypes correlating .56; gA3: continuous phenotypes correlating .12. gE1–3: 1-factor models with GV effect specific to 1 phenotype. gE1: phenotypes correlate .56 (like gA2); gE2: phenotypes correlate .30; gE3, phenotypes correlate .12 (like gA3). gF1–F3: network models with GV effect specific to 1 phenotype. gF1: phenotypes correlate .56 (like gA2 and gE1); gF2: phenotypes correlate .12 (like gA3 and gE3); gF3: 4 clusters of phenotypes that within clusters correlate .55, and between clusters correlate .13. gC1: 2-factor model, 10 phenotypes per factor, correlating .36–.81 within factors, and a factorial correlation of .5. GV affects only the 2nd factor. gB1: 4-factor model, 5 phenotypes per factor, correlating .81 within factors, and factorial correlations of .1. GV affects only the 4th factor. gD1: like gB1 but GV effect specific to 1 phenotype.