| Literature DB >> 36105080 |
Xiaoyi Li1, Han Xiao1, Yujia Ma1, Zechen Zhou1, Dafang Chen1.
Abstract
Genome-wide association studies (GWAS) have identified several common variants associated with polycystic ovary syndrome (PCOS). However, the etiology behind PCOS remains incomplete. Available evidence suggests a potential genetic correlation between PCOS and type 2 diabetes (T2D). The publicly available data may provide an opportunity to enhance the understanding of the PCOS etiology. Here, we quantified the polygenic overlap between PCOS and T2D using summary statistics of PCOS and T2D and then identified the novel genetic variants associated with PCOS behind this phenotypic association. A bivariate causal mixture model (MiXeR model) found a moderate genetic overlap between PCOS and T2D (Dice coefficient = 44.1% and after adjusting for body mass index, 32.1%). The conditional/conjunctional false discovery rate method identified 11 potential risk variants of PCOS conditional on associations with T2D, 9 of which were novel and 6 of which were jointly associated with two phenotypes. The functional annotation of these genetic variants supports a significant role for genes involved in lipid metabolism, immune response, and the insulin signaling pathway. An expression quantitative trait locus functionality analysis successfully repeated that 5 loci were significantly associated with the expression of candidate genes in many tissues, including the whole blood, subcutaneous adipose, adrenal gland, and cerebellum. We found that SCN2A gene is co-localized with PCOS in subcutaneous adipose using GWAS-eQTL co-localization analyses. A total of 11 candidate genes were differentially expressed in multiple tissues of the PCOS samples. These findings provide a new understanding of the shared genetic architecture between PCOS and T2D and the underlying molecular genetic mechanism of PCOS.Entities:
Keywords: conditional FDR; conjunctional FDR; pleiotropy; polycystic ovary syndrome; type 2 diabetes
Year: 2022 PMID: 36105080 PMCID: PMC9464923 DOI: 10.3389/fgene.2022.905716
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Flow diagram of the overall study design. GWAS summary statistics on PCOS and T2D with or without adjustment of BMI were collected from publicly available databases and open literature reports. We quantified the polygenic overlap between PCOS and T2D. Then, we visualized the genetic pleiotropy enrichment between both traits and identified the genetic loci associated with PCOS or/and T2D. Finally, four bioinformatics approaches were applied to explore the potential function of the identified genetic loci.
FIGURE 2Venn diagrams of shared causal variants (gray). (A) T2D unadjusted by BMI (red) and PCOS (blue); (B) T2D adjusted by BMI (red) and PCOS (blue). The estimated numbers of causal variants are shown in thousands, explaining 90% of SNP heritability in each phenotype, followed by the standard error. The Dice coefficients, an overall measure on a 0–100% scale of the polygenic overlap, are 44.1% (A) and 32.1% (B), respectively. The genetic correlations ( ) estimated by the MiXeR model are 0.41 (A) and 0.21 (B), respectively.
FIGURE 3Genetic pleiotropy enrichment of PCOS conditional on T2D (without adjustment for BMI) and vice versa. Stratified conditional Q–Q plot of nominal versus empirical and negative log10-transformed p-values in the primary phenotype as a functional significance of association with the secondary phenotype at the level of p < 1(All SNPs, blue), p < 0.1 (orange), p < 0.01 (green), and p < 0.001 (purple) corresponding to , respectively. (A) PCOS conditioned on T2D; (B) T2D conditioned on PCOS. Dotted lines demonstrate the null hypothesis. The summary statistics of T2D without adjustment of BMI were selected for plotting those conditional Q–Q plots.
FIGURE 4Manhattan plot showing 11 genetic loci associated with PCOS or/and T2D. Manhattan plots showing the negative log10-transformed condFDR values (A) and negative log10-transformed conjFDR values (B) for each SNP on the y axis and chromosomal positions along the x axis. The most significant SNP lead in each LD block is enlarged and encircled in black, whereas the small points represent other SNPs. Labels correspond to lead SNPs previously reported for published loci (black) and for newly identified loci (red).
Genetic susceptibility loci associated with PCOS identified by the conditional FDR and conjunctional FDR methods.
| Loc | CHRPOS | SNP | Ref/Alt | Functional category | Nearest gene | Mapped gene |
|
| GWAS P-value | GWAS Z-scorei | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| T2D | PCOS | T2D | PCOS | |||||||||
| 1 | 2:165,737,889 | rs1509096 | G/A | intergenic | RNA5SP111 | SLC38A11 | 9.43E-03 | 9.53E-03 | 1.80E-05 | 1.40E-05 | −4.34 | −4.29 |
| 2 | 3:12,349,924 | rs13061415 | C/T | intronic | PPARG | PPARG | 3.09E-02 | 3.09E-02 | 6.60E-09 | 5.60E-05 | 4.03 | 5.80 |
| 3 | 3:123,010,775 | rs4234212 | C/T | intronic | ADCY5 | SEC22A | 4.64E-02 | 4.64E-02 | 5.80E-05 | 9.30E-05 | 3.91 | −4.02 |
| 4 | 5:43,618,391 | rs138484257 | G/T | intronic | NNT | NNT | 1.46E-02 | 9.69E-01 | 5.30E-01 | 9.80E-08 | −5.33 | 0.63 |
| 5 | 8:11,625,205 | rs804274 | C/A | intergenic | NEIL2 | CTSB | 6.60E-03 | 7.47E-01 | 7.30E-02 | 2.10E-07 | −5.19 | 1.79 |
| 6 | 11:30,339,461 | rs7929660 | G/A | intergenic | ARL14EP | ARL14EP | 9.72E-03 | 5.65E-01 | 2.70E-02 | 6.80E-07 | −4.97 | 2.21 |
| 7 | 11:32,449,098 | rs5030174 | G/A | intronic | WT1 | WT1 | 1.67E-02 | 1.67E-02 | 1.50E-05 | 2.70E-05 | −4.20 | 4.33 |
| 8 | 11:83,562,895 | rs12808938 | G/T | intronic | DLG2 | DLG2 | 2.43E-02 | 6.11E-01 | 3.50E-02 | 1.60E-06 | 4.80 | 2.11 |
| 9 | 16:53,822,502 | rs7190396 | G/T | intronic | FTO | FTO | 1.97E-03 | 1.97E-03 | 7.60E-74 | 2.50E-06 | −4.71 | −18.18 |
| 10 | 16:81,463,967 | rs2432581 | G/A | intergenic | CMIP | CMIP | 1.56E-02 | 3.32E-02 | 1.00E-04 | 2.50E-05 | 4.21 | 3.89 |
| 11 | 20:56,125,891 | rs1474758 | C/A | intergenic | PCK1 | PCK1 | 2.49E-02 | 9.17E-01 | 3.50E-01 | 2.50E-07 | −5.16 | −0.93 |
Note:
Loc, locus.
CHRPOS, chromosome position in the human reference genome build37 (or hg19).
Ref/Alt, reference allele/alternative allele.
Mapped gene, gene that is most likely to be associated with lead SNP, defined by the V2G score.
FDRPCOS|T2D, conditional FDR of PCOS conditioned on T2D.
FDRPCOS&T2D, conjunctional FDR of PCOS and T2D.
FIGURE 5Distribution of the functional annotation for candidate SNPs. (A) Distribution of the functional consequences of independent SNPs. (B) Distribution of RegulomeDB scores, with a lower score indicating a higher likelihood of having a regulatory function. (C) Minimum chromatin state across 127 tissue and cell types for SNPs in a shared genomic region, with lower states indicating higher accessibility and states 1–7 referring to open chromatin states.