| Literature DB >> 22373401 |
Yan V Sun1, Wei Zhao, Kerby A Shedden, Sharon Lr Kardia.
Abstract
Using the exome sequencing data from 697 unrelated individuals and their simulated disease phenotypes from Genetic Analysis Workshop 17, we develop and apply a gene-based method to identify the relationship between a gene with multiple rare genetic variants and a phenotype. The method is based on the Mantel test, which assesses the correlation between two distance matrices using a permutation procedure. Using up to 100,000 permutations to estimate the statistical significance in 200 replicate data sets, we found that the method had 5.1% type I error at an α level of 0.05 and had various power to detect genes with simulated genetic associations. FLT1 and KDR had the most significant correlations with Q1 and were replicated 170 and 24 times, respectively, in 200 simulated data sets using a Bonferroni corrected p-value of 0.05 as a threshold. These results suggest that the distance correlation method can be used to identify genotype-phenotype association when multiple rare genetic variants in a gene are involved.Entities:
Year: 2011 PMID: 22373401 PMCID: PMC3287845 DOI: 10.1186/1753-6561-5-S9-S120
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Power of identifying nine genes with simulated genetic association
| Gene | Power | Number of causal SNPs/total SNPs | ||||
|---|---|---|---|---|---|---|
| All SNPs | MAF < 5% | MAF ≥ 5% | ||||
| 0.255 | 0 | 0 | 5/18 | 5/17 | 0/1 | |
| 0.01 | 0 | 0 | 2/10 | 2/8 | 0/2 | |
| 1 | 0.97 | 0.95 | 11/35 | 10/32 | 1/3 | |
| 0.135 | 0.01 | 0 | 2/10 | 2/10 | 0/0 | |
| 0.17 | 0 | 0 | 4/8 | 4/8 | 0/0 | |
| 0.09 | 0 | 0 | 3/21 | 3/17 | 0/4 | |
| 0.955 | 0.525 | 0.245 | 10/16 | 9/15 | 1/1 | |
| 0.225 | 0.005 | 0.005 | 1/6 | 1/6 | 0/0 | |
| 0.775 | 0 | 0 | 1/1 | 1/1 | 0/0 | |
Most significant genes correlated with Q1 and Affected
| Trait | Gene | Chromosome | Gene start (bp) | Gene end (bp) | Gene length (bp) | Number of SNPs | Number of significant testsa |
|---|---|---|---|---|---|---|---|
| Q1 (SNPs with MAF < 0.05) | 13 | 27774389 | 27967265 | 192877 | 32 | 49 | |
| 18 | 37829928 | 37789197 | 126250 | 7 | 13 | ||
| 4 | 55639406 | 55686519 | 47114 | 15 | 12 | ||
| 12 | 10889715 | 11215480 | 325766 | 17 | 11 | ||
| Q1 (SNPs with MAF < 0.05)b | 13 | 27774389 | 27967265 | 192877 | 32 | 39 | |
| Q1 (nonsynonymous SNPs with MAF < 0.05)b | 13 | 27774389 | 27967265 | 192877 | 19 | 170 | |
| 4 | 55639406 | 55686519 | 47114 | 10 | 24 | ||
| Affected (nonsynonymous SNPs with MAF < 0.05) | 13 | 27774389 | 27967265 | 192877 | 19 | 13 |
a Number of significant tests out of 200 simulated data sets. The threshold of statistical significance is a Bonferroni-corrected p-value of 0.05 (1.74 × 10−5 for SNPs with MAF < 0.05 and 2.48 × 10−5 for nonsynonymous SNPs with MAF < 0.05).
b Adjusted for Age, Sex, Smoke, and first two principal components.