| Literature DB >> 29740473 |
Jo Nishino1,2, Yuta Kochi2,3, Daichi Shigemizu1,2,4,5, Mamoru Kato2,6, Katsunori Ikari2,7, Hidenori Ochi2,8,9, Hisashi Noma2,10, Kota Matsui2,11, Takashi Morizono5, Keith A Boroevich5, Tatsuhiko Tsunoda1,2,5,12, Shigeyuki Matsui2,11,12.
Abstract
Genome-wide association studies (GWAS) suggest that the genetic architecture of complex diseases consists of unexpectedly numerous variants with small effect sizes. However, the polygenic architectures of many diseases have not been well characterized due to lack of simple and fast methods for unbiased estimation of the underlying proportion of disease-associated variants and their effect-size distribution. Applying empirical Bayes estimation of semi-parametric hierarchical mixture models to GWAS summary statistics, we confirmed that schizophrenia was extremely polygenic [~40% of independent genome-wide SNPs are risk variants, most within odds ratio (OR = 1.03)], whereas rheumatoid arthritis was less polygenic (~4 to 8% risk variants, significant portion reaching OR = 1.05 to 1.1). For rheumatoid arthritis, stratified estimations revealed that expression quantitative loci in blood explained large genetic variance, and low- and high-frequency derived alleles were prone to be risk and protective, respectively, suggesting a predominance of deleterious-risk and advantageous-protective mutations. Despite genetic correlation, effect-size distributions for schizophrenia and bipolar disorder differed across allele frequency. These analyses distinguished disease polygenic architectures and provided clues for etiological differences in complex diseases.Entities:
Keywords: effect-size distribution; genome-wide association study (GWAS); polygenic disease architecture; polygenicity; semi-parametric hierarchical mixture model
Year: 2018 PMID: 29740473 PMCID: PMC5928254 DOI: 10.3389/fgene.2018.00115
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Evaluation of SP-HMM estimation by simulations. The true proportion of associated SNPs is set to be π = 0.1. Effect-size distribution of associated SNPs is the normal distribution with a variance of 0.032. Various sample sizes (n = 3,000–100,000 cases and controls) were used. The total number of SNPs is 100,000. (A) Estimated proportion of disease-associated variants, . “qvalue”: results obtained by qvalue R package (Bioconductor v.3.4). (B) Estimated effect-size distributions, ĝ. Average curves over 100 simulations for each sample size are shown. The specified (true) effect-size distributions are given in dotted lines.
Estimated proportions of disease-associated SNPs, , and liability-scale variance explained by SNPs, .
| Rheumatoid arthritis (Asian) | 3.6 (1.8) | 14.0 (1.8) |
| Rheumatoid arthritis (European) | 8.1 (2.4) | 20.2 (1.5) |
| Coronary artery disease (CARDIoGRAM) | 15.9 (3.7) | 20.9 (1.3) |
| Coronary artery disease (C4D) | 26.1 (3.5) | 22.2 (1.4) |
| Schizophrenia | 43.0 (1.1) | 40.2 (0.7) |
| Bipolar disorder | 39.6 (2.2) | 50.0 (1.9) |
Estimates for the P-value-based SNP sets are shown.
For .
Estimated based on 100 parametric bootstrap samples based on the estimated SP-HMM.
Figure 2Estimated effect-size distributions for disease-associated SNPs, . P-value-based pruned SNP sets are used. (A) Rheumatoid arthritis (Asian). (B) Rheumatoid arthritis (European). (C) Coronary artery disease (CARDIoGRAM). (D) Coronary artery disease (C4D). (E) Schizophrenia. (F) Bipolar disorder.
Figure 3Estimated effect size distributions for eQTL-SNPs and non-eQTL-SNPs, . Green and orange graphs show the results for the eQTL-SNP and non-eQTL-SNP sets, respectively. Estimated proportion of disease-associated SNPs, , correspond to the areas under the curves. (A) Rheumatoid arthritis (Asian). (B) Rheumatoid arthritis (European). (C) Coronary artery disease (CARDIoGRAM). (D) Coronary artery disease (C4D). (E) Schizophrenia. (F) Bipolar disorder.
Figure 4Estimated effect-size distributions, , by derived allele frequency (DAF) bins. The upper panels (heatmap colors) for each GWAS results show . The lower panels show means of . (A) Rheumatoid arthritis (Asian). (B) Rheumatoid arthritis (European). (C) Coronary artery disease (CARDIoGRAM). (D) Coronary artery disease (C4D). (E) Schizophrenia. (F) Bipolar disorder.