| Literature DB >> 27980666 |
Tae-Hwi Schwantes-An1, Heejong Sung1, Jeremy A Sabourin1, Cristina M Justice1, Alexa J M Sorant1, Alexander F Wilson1.
Abstract
In this study, the effects of (a) the minor allele frequency of the single nucleotide variant (SNV), (b) the degree of departure from normality of the trait, and (c) the position of the SNVs on type I error rates were investigated in the Genetic Analysis Workshop (GAW) 19 whole exome sequence data. To test the distribution of the type I error rate, 5 simulated traits were considered: standard normal and gamma distributed traits; 2 transformed versions of the gamma trait (log10 and rank-based inverse normal transformations); and trait Q1 provided by GAW 19. Each trait was tested with 313,340 SNVs. Tests of association were performed with simple linear regression and average type I error rates were determined for minor allele frequency classes. Rare SNVs (minor allele frequency < 0.05) showed inflated type I error rates for non-normally distributed traits that increased as the minor allele frequency decreased. The inflation of average type I error rates increased as the significance threshold decreased. Normally distributed traits did not show inflated type I error rates with respect to the minor allele frequency for rare SNVs. There was no consistent effect of transformation on the uniformity of the distribution of the location of SNVs with a type I error.Entities:
Year: 2016 PMID: 27980666 PMCID: PMC5133479 DOI: 10.1186/s12919-016-0060-7
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Fig. 1Frequency of SNVs by MAF
Fig. 2Distribution of type I error rate by MAF. Type I error rate versus MAF. Different color/symbols indicate different null traits. Each point indicates average type I error rate at the critical level of 10-5 of SNVs grouped by MAF; extremely rare variants (MAF < 0.0025) are classified by number of occurrences of the rare allele, while rare (0.0025 ≤ MAF < 0.05) and common (MAF ≥ 0.05) ones are classified by MAF range
Estimatesa of the mean, variance, skewness, and kurtosis of each simulated trait
| Distribution | Mean | Variance | Skewness | Kurtosis |
|---|---|---|---|---|
| Normal | −0.003 | 1.00 | −0.001 | 3.00 |
| Gamma | 59.97 | 1197.63 | 1.15 | 4.97 |
| Log10 gamma | 1.70 | 0.07 | −0.62 | 3.77 |
| RITbgamma | 0.0 | 1.00 | 0.0 | 2.98 |
| Q1 | 41.34 | 113.36 | −0.03 | 2.99 |
aThese values are averages over 200 replicates for each trait
bRank-based inverse normal transformation
P-values for goodness of fit tests of uniformity of the distribution of SNVs with any type I error
| Test among | Normal | Gamma | Log10 gamma | RIT gamma | Q1 |
|---|---|---|---|---|---|
| Chromosomes | 0.113 | 0.880 | 0.338 | 0.006 | 0.008 |
| 10 Mb intervals | 0.234 | 0.966 | 0.088 | 0.001 | 0.007 |