| Literature DB >> 25519348 |
Hua Zhou1, Jin Zhou2, Eric M Sobel3, Kenneth Lange4.
Abstract
The linkage era left a rich legacy of pedigree samples that can be used for modern genome-wide association sequencing (GWAS) or next-generation sequencing (NGS) studies. Family designs are naturally equipped to detect rare variants, control for population stratification, and facilitate the study of parent-of-origin effects. Unfortunately, pedigree likelihoods are notoriously hard to compute, and current software for association mapping in pedigrees is prohibitively slow in processing dense marker maps. In a recent release of the comprehensive genetic analysis software MENDEL, we implemented an ultra-fast score test for association mapping with pedigree-based GWAS or NGS study data. Our implementation (a) works for random sample data, pedigree data, or a mix of both;(b) allows for covariate adjustment, including correction for population stratification;(c) accommodates both univariate and multivariate quantitative traits; and (d) allows missing values in multivariate traits. In this paper, we assess the capabilities of MENDEL on the Genetic Analysis Workshop 18 sequencing data. For instance, when jointly testing the 4 longitudinally measured diastolic blood pressure traits, it takes MENDEL less than 51 minutes on a standard laptop computer to read, quality check, and analyze a data set with 959 individuals and 8.3 million single-nucleotide polymorphisms (SNPs). Our analysis reveals association of one SNP in the q32.2 region of chromosome 1. MENDEL is freely available on http://www.genetics.ucla.edu/software.Entities:
Year: 2014 PMID: 25519348 PMCID: PMC4143629 DOI: 10.1186/1753-6561-8-S1-S93
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Summary of environmental effects for traits systolic blood pressure(top), diastolic blood pressure(middle) and Q1 (bottom) in simulation replicate SIMPHEN.1
| SBP | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| LM | 119.360 | 0.135 | 13.088 | 0.284 | −19.547 | 0.387 | -- | 139.558 | 42.4% |
| LM | 75.781 | −0.052 | 1.893 | −0.109 | −8.201 | 0.124 | -- | 81.632 | 4.8% |
| LM | 38.642 | −0.087 | −2.508 | 0.270 | 8.904 | 0.005 | -- | 85.260 | 21.9% |
Numbers in parenthesis are p-values.
DBP, diastolic blood pressure; SBP, systolic blood pressure.
Figure 1Results of power and size study. Top: Box plots of −log10(p-values) from score tests for the 6 functional variants in MAP4 based on 200 simulation replicates. The red (left) ones use the decorrelated residuals (method 1). The blue (right) ones use the correlated residuals (method 2). The horizontal line represents the 0.05 significance level. Bottom: Empirical power and type I error.
Figure 2Results of pedigree genome-wide association sequencing for testing traits systolic blood pressure (SBP), diastolic blood pressure (DBP) and Q1 in simulation replicate SIMPHEN.1 on the 1,213,657 single-nucleotide polymorphisms on chromosome 3 and 849 individuals. Top: Run times on a standard laptop. Bottom: Manhattan plot (left) and QQ plot (right) for the traits . The horizontal line represents the genome-wide significance level. Plots for SBP and Q1 are similar and are omitted here.
Figure 3Results for pedigree genome-wide association sequencing of 8,348,674 single-nucleotide polymorphisms for the real diastolic blood pressure (DBP) traits. Top: Environmental effects fitted from linear model (LM) and linear mixed model (LMM). Numbers in parenthesis are p-values. Bottom: Manhattan plot (left) and quartile-quartileplot (right). The horizontal line represents the genome-wide significance level.