| Literature DB >> 32066829 |
Huma Asif1, Ney Alliey-Rodriguez2, Sarah Keedy2, Carol A Tamminga3, John A Sweeney4, Godfrey Pearlson5, Brett A Clementz6, Matcheri S Keshavan7, Peter Buckley8, Chunyu Liu9, Benjamin Neale10, Elliot S Gershon11,12.
Abstract
An important issue affecting genome-wide association studies with deep phenotyping (multiple correlated phenotypes) is determining the suitable family-wise significance threshold. Straightforward family-wise correction (Bonferroni) of p < 0.05 for 4.3 million genotypes and 335 phenotypes would give a threshold of p < 3.46E-11. This would be too conservative because it assumes all tests are independent. The effective number of tests, both phenotypic and genotypic, must be adjusted for the correlations between them. Spectral decomposition of the phenotype matrix and LD-based correction of the number of tested SNPs are currently used to determine an effective number of tests. In this paper, we compare these calculated estimates with permutation-determined family-wise significance thresholds. Permutations are performed by shuffling individual IDs of the genotype vector for this dataset, to preserve correlation of phenotypes. Our results demonstrate that the permutation threshold is influenced by minor allele frequency (MAF) of the SNPs, and by the number of individuals tested. For the more common SNPs (MAF > 0.1), the permutation family-wise threshold was in close agreement with spectral decomposition methods. However, for less common SNPs (0.05 < MAF ≤ 0.1), the permutation threshold calculated over all SNPs was off by orders of magnitude. This applies to the number of individuals studied (here 777) but not to very much larger numbers. Based on these findings, we propose that the threshold to find a particular level of family-wise significance may need to be established using separate permutations of the actual data for several MAF bins.Entities:
Mesh:
Year: 2020 PMID: 32066829 PMCID: PMC7429341 DOI: 10.1038/s41380-020-0670-3
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 13.437
Fig. 1Observed permutation p-values as a function of minor allele frequency in 777 individuals and 335 correlated phenotypes. SNPs with P value ≤ 1.0E-06 are extracted from permuted genotype data and plotted against their minor allele frequency (MAF). Each red dot represents a P value of one SNP.
Estimated effective number of independent phenotypes
| Methods | Total number of phenotypes | Effective number of phenotypes | References |
|---|---|---|---|
| Bonferroni | 335 | 335 | (Bonferroni, 1936) |
| Nyholt spectral decomposition method | 335 | 286 | (Nyholt, 2004) |
| Li and Ji method (2005) | 335 | 122 | (Li and Ji, 2005) |
Estimated effective number of independent SNPs
| Methods | Total number of SNPs | Effective number of SNPs | References |
|---|---|---|---|
| Bonferroni | 4.3M | 4.3M | (Bonferroni, 1936) |
| Gao et al’s method (MGao) (Composite LD correlation between the SNPs) | 4.3M | 1.22M | (Gao et al., 2008) |
| Li et al’s method (MLi) (Block wise LD of genotypes) | 4.3M | 0.90M | (Li et al., 2012) |
| Han et al’s method (LD in sliding window of genotypes) (MHan) | 4.3M | 1.75M | (Han et al., 2009) |
GWAS significance thresholds for correlated genotypes and phenotypes in this data set
| Methods | GWAS threshold for one phenotype | Effective number of phenotypic tests | Family-wise threshold (alpha=0.05) |
|---|---|---|---|
| Bonferroni correction | 1.16E-08 | 335 | 3.46E-11 |
| Nyholt method for phenotypes with Bonferroni correction of genotypes | 1.16E-08 | 286 | 4.06E-11 |
| Li and Ji method for phenotypes with Bonferroni correction of genotypes | 1.16E-08 | 122 | 9.51E-11 |
| Han et al (LD in sliding window of genotypes, Li and Ji for phenotypes) | 2.86E-08 | 122 | 2.34E-10 |
| Li et al (Block wise LD of genotypes, Li and Ji for phenotypes) | 5.54E-08 | 122 | 4.54E-10 |
| Gao et al (LD of genotypes, Li and Ji for phenotypes) | 4.08E-08 | 122 | 3.34E-10 |
| Permutation of genotype (subject indices) all allele frequencies | 5.29E-08 | NA | 8.39E-13 |
| Permutation of genotype (subject indices) (MAF > 0.1) | 5.29E-08 | NA | 1.93E-10 |
| Permutation of genotype (subject indices) (SNPs with MAF ≤ 0.1) | 1.51E-08 | NA | 8.39E-13 |
Fig. 2:Family-wise Permutation p-values as a function of minor allele frequency (MAF) and sample sizes.
We selected SNPs each at MAFs of 4–10%, 20–25%, 30–35% and 40–45% and examine the threshold for different sample sizes (N=777 and N= 77700). The effect of minor allele frequency is attenuated in the larger sample size.