| Literature DB >> 25519388 |
Juan M Peralta1, Marcio Almeida2, Jack W Kent2, John Blangero2.
Abstract
We propose a novel variance component approach for the analysis of next-generation sequencing data. Our method is based on the detection of the proportion of the trait phenotypic variance that can be explained by the introduction of a new variance component that accounts for the local gene-specific departure of the empirical kinship relationship matrix, estimated from single-nucleotide polymorphism (SNP) genotypes, from their theoretical expectation based on the genealogical information in the pedigree. We tested our method with simulated phenotypes and imputed SNP genotypes from the Genetic Analysis Workshop 18 data set. We observed considerable variation in the differences between theoretical and gene-specific kinship estimates that proved to be informative for our test and allowed us to detect the MAP4 causal gene at a genome-wide significance level. The distribution of our test statistic show no inflation under the null hypothesis and results from a random set of genes suggest that the detection of MAP4 is both sensitive and specific. The use of 2 different strategies for the selection of the SNPs used to derive the gene-specific empirical kinship relationship matrices provides us with suggestive evidence that our method is performing as an empirical test of linkage.Entities:
Year: 2014 PMID: 25519388 PMCID: PMC4143638 DOI: 10.1186/1753-6561-8-S1-S49
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Distribution of the gene-specific differences between TKEs) and EKEs. Differences between TKE and EKE values were averaged by gene for a sample of 100 random and 12 SBP_1 causal genes. The negative sign indicates that the gene-specific EKE average is larger than the TKE average.
Estimated effects on the simulated SBP_1 trait for known causal genes
| Gene | Strategy | |||||||
|---|---|---|---|---|---|---|---|---|
| Intragenic | Nonsyn | |||||||
| h2r | h2r_p | geff | geff_p | h2r | h2r_p | geff | geff_p | |
| 0.17 | 3.90 × 10−6 | 0.10955 | 7.20 × 10−6 | 0.18 | 7.00 × 10−7 | 0.10382 | 1.00 × 10−7 | |
| 0.26 | 4.16 × 10−8 | 0.04702 | 6.52 × 10−3 | 0.31 | 2.28 × 10−10 | 0.01147 | 1.71 × 10−1 | |
| 0.28 | 6.97 × 10−9 | 0.03575 | 6.55 × 10−3 | 0.32 | 3.44 × 10−11 | 0 | 1 | |
| 0.29 | 4.19 × 10−9 | 0.01755 | 9.24 × 10−2 | 0.32 | 3.44 × 10−11 | 0 | 1 | |
| 0.30 | 9.51 × 10−10 | 0.01615 | 9.29 × 10−2 | 0.27 | 7.20 × 10−9 | 0.03433 | 1.26 × 10−3 | |
| 0.30 | 8.37 × 10−10 | 0.00906 | 1.59 × 10−1 | 0.32 | 3.44 × 10−11 | 0 | 1 | |
| 0.32 | 4.12 × 10−11 | 0.00037 | 4.76 × 10−1 | 0.32 | 3.44 × 10−11 | 0 | 1 | |
| 0.32 | 3.44 × 10−11 | 0 | 1 | 0.21 | 4.01 × 10−11 | 0.17969 | 1.90 × 10−1 | |
| 0.32 | 3.44 × 10−11 | 0 | 1 | 0.32 | 3.44 × 10−11 | 0 | 1 | |
| 0.32 | 3.44 × 10−11 | 0 | 1 | 0.32 | 3.44 × 10−11 | 0 | 1 | |
| 0.32 | 3.44 × 10−11 | 0 | 1 | 0.32 | 3.44 × 10−11 | 0 | 1 | |
| 0.32 | 3.44 × 10−11 | 0 | 1 | 0.30 | 9.46 × 10−11 | 0.00949 | 1.25 × 10−1 | |
geff, Gene-specific effect estimate (); geff_p, significance of the gene-specific effect estimate; h2r, trait heritability estimate (); h2r_p, significance of the trait heritability estimate.
Top 10 most significant results for genes in a combined sample of 100 random and 12 causal genes
| Rank | Strategy | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Intragenic | Nonsyn | |||||||||
| Gene | h2r | h2r_p | geff | geff_p | Gene | h2r | h2r_p | geff | geff_p | |
| 1 | 0.17 | 3.90 × 10−6 | 0.10955 | 7.20 × 10−6 | MAP4* | 0.18 | 7.00 × 10−7 | 0.10382 | 1.00 × 10−7 | |
| 2 | 0.18 | 6.16 × 10−11 | 0.20337 | 4.64 × 10−3 | TNN* | 0.27 | 7.20 × 10−9 | 0.03433 | 1.26 × 10−3 | |
| 3 | 0.26 | 4.16 × 10−8 | 0.04702 | 6.52 × 10−3 | LSM12 | 0.15 | 3.40 × 10−11 | 0.26452 | 4.96 × 10−3 | |
| 4 | 0.28 | 6.97 × 10−9 | 0.03575 | 6.55 × 10−3 | NAT6 | 0.30 | 3.20 × 10−11 | 0.02515 | 1.16 × 10−2 | |
| 5 | 0.28 | 1.12 × 10−10 | 0.03592 | 8.39 × 10−3 | AK123654 | 0.15 | 2.27 × 10−10 | 0.25783 | 1.37 × 10−2 | |
| 6 | 0.28 | 1.05 × 10−8 | 0.03547 | 2.25 × 10−2 | OR2T27 | 0.28 | 1.05 × 10−10 | 0.04869 | 1.46 × 10−2 | |
| 7 | 0.30 | 1.07 × 10−10 | 0.03072 | 4.20 × 10−2 | HSPA9 | 0.15 | 8.69 × 10−12 | 0.26952 | 5.04 × 10−2 | |
| 8 | 0.31 | 8.85 × 10−10 | 0.01913 | 4.53 × 10−2 | LOC389493 | 0.21 | 1.52 × 10−10 | 0.16663 | 5.12 × 10−2 | |
| 9 | 0.18 | 1.85 × 10−10 | 0.20838 | 5.94 × 10−2 | SRD5A1 | 0.32 | 2.25 × 10−11 | 0.01056 | 1.12 × 10−1 | |
| 10 | 0.28 | 1.91 × 10−8 | 0.03356 | 6.07 × 10−2 | PSMD5* | 0.30 | 9.46 × 10−11 | 0.00949 | 1.25 × 10−1 | |
geff, Gene-specific effect estimate (); geff_p, significance of the gene-specific effect estimate; h2r, trait heritability estimate (); h2r_p, significance of the trait heritability estimate.
*Known causal gene for SBP_1 in the simulated data set.
Figure 2Q-Q plot of the . The p values for the gene-specific effect estimates were calculated using SNPs selected with the intragenic strategy for a random sample of 5000 genes, using the Q1 trait, a trait highly heritable but not influenced by any of the GAW18 SNPs.