| Literature DB >> 31488892 |
Tatsuo Masuda1,2, Siew-Kee Low3,4, Masato Akiyama3,5, Makoto Hirata6, Yutaka Ueda2, Koichi Matsuda7, Tadashi Kimura2, Yoshinori Murakami8, Michiaki Kubo9, Yoichiro Kamatani3,10, Yukinori Okada11,12.
Abstract
We performed genome-wide association studies of five gynecologic diseases using data of 46,837 subjects (5236 uterine fibroid, 645 endometriosis, 647 ovarian cancer (OC), 909 uterine endometrial cancer (UEC), and 538 uterine cervical cancer (UCC) cases allowing overlaps, and 39,556 shared female controls) from Biobank Japan Project. We used the population-specific imputation reference panel (n = 3541), yielding 7,645,193 imputed variants. Analyses performed under logistic model, linear mixed model, and model incorporating correlations identified nine significant associations with three gynecologic diseases including four novel findings (rs79219469:C > T, LINC02183, P = 3.3 × 10-8 and rs567534295:C > T, BRCA1, P = 3.1 × 10-8 with OC, rs150806792:C > T, INS-IGF2, P = 4.9 × 10-8 and rs140991990:A > G, SOX9, P = 3.3 × 10-8 with UCC). Random-effect meta-analysis of the five GWASs correcting for the overlapping subjects suggested one novel shared risk locus (rs937380553:A > G, LOC730100, P = 2.0 × 10-8). Reverse regression analysis identified three additional novel associations (rs73494486:C > T, GABBR2, P = 4.8 × 10-8, rs145152209:A > G, SH3GL3/BNC1, P = 3.3 × 10-8, and rs147427629:G > A, LOC107985484, P = 3.8 × 10-8). Estimated heritability ranged from 0.026 for OC to 0.220 for endometriosis. Genetic correlations were relatively strong between OC and UEC, endometriosis and OC, and uterine fibroid and OC (rg > 0.79) compared with relatively weak correlations between UCC and the other four (rg = -0.08 ~ 0.25). We successfully identified genetic associations with gynecologic diseases in the Japanese population. Shared genetic effects among multiple related diseases may help understanding the pathophysiology.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31488892 PMCID: PMC6906293 DOI: 10.1038/s41431-019-0495-1
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Characteristics of the genome-wide association studies of five gynecologic diseases
| Diseases | No. cases | No. controlsa | Disease prevalenceb | ||
|---|---|---|---|---|---|
| Uterine fibroid | 5 236 | 39 556 | 1.89 × 10−1 | 0.26 | 1.70 × 10−1 (2.57 × 10−2) |
| Endometriosis | 645 | 39 556 | 6.94 × 10−2 | 0.47–0.51 | 2.20 × 10−1 (1.30 × 10−1) |
| Ovarian cancer | 647 | 39 556 | 1.22 × 10−2 | 0.40 | 2.60 × 10−2 (7.95 × 10−2) |
| Uterine endometrial cancer | 909 | 39 556 | 1.64 × 10−2 | 0.52 | 1.26 × 10−1 (6.19 × 10−2) |
| Uterine cervical cancer | 538 | 39 556 | 1.28 × 10−2 | 0.11–0.34 | 1.17 × 10−1 (9.75 × 10−2) |
The numbers are rounded to three significant digits for disease prevalence and hSNP
aShared female controls among the five GWAS
bDisease prevalence among Japanese population
cHeritability previously estimated from twin, family or population-based case-control studies
dHeritability estimated from the GWAS data adjusted for disease prevalence
Fig. 1Manhattan plots of the five GWASs of gynecologic diseases. Manhattan plots of the GWAS of the five gynecologic diseases among Japanese. The y-axis indicates -log10(P) of association of each variant calculated by three methods, including the logistic regression model using mach2dat, the linear mixed model using BOLT-LMM, and incorporating correlations using MTAG, displayed from left to right. Horizontal dashed grey lines indicate genome-wide significance threshold (P < 5.0 × 10−8). Dots colored in red indicates genome-wide significant loci. Asterisks indicate the novel findings
Genetic variants significantly associated with gynecologic diseases
| Disease | SNPa | Chr | Position (bp)b | Genes | Ref | AltAllele Freqd | Imputation Rsq | mach2dat | BOLT-LMM | MTAG | Novel loci | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OR (95% CI) | OR (95% CI) | OR (95% CI) | ||||||||||||
| Uterine fibroid | rs7412010 | 1 | 22 436 446 | G | 0.58 | 0.98 | 1.19 (1.13–1.24) | 2.0 × 10−11 | 1.15 (1.10–1.19) | 1.2 × 10−12 | 1.03 (1.02–1.05) | 5.4 × 10−8 | ||
| rs12415148 | 10 | 105 680 586 | T | 0.078 | 0.97 | 1.24 (1.14–1.35) | 8.6 × 10−7 | 1.25 (1.16–1.33) | 3.5 × 10−10 | 1.05 (1.03–1.07) | 2.0 × 10−5 | |||
| rs12225799 | 11 | 241 124 | C | 0.13 | 0.99 | 0.71 (0.65–0.76) | 1.7 × 10−17 | 0.76 (0.72–0.80) | 1.1 × 10−21 | 0.93 (0.91–0.94) | 4.0 × 10−17 | |||
| rs17332320 | 22 | 40 711 620 | G | 0.36 | 1.00 | 1.18 (1.13–1.24) | 4.4 × 10−11 | 1.15 (1.10–1.19) | 1.6 × 10−12 | 1.05 (1.03–1.06) | 3.9 × 10−13 | |||
| Ovarian cancer | rs79219469 | 16 | 54 587 853 | C | 0.039 | 0.94 | 1.85 (1.48–2.32) | 6.9 × 10−8 | 2.25 (1.69–2.99) | 3.3 × 10−8 | 1.10 (1.06–1.14) | 1.5 × 10−7 | ||
| rs567534295 | 17 | 41 200 107 | C | 0.012 | 0.79 | 2.83 (1.94–4.12) | 5.7 × 10−8 | 4.75 (2.73–8.24) | 3.1 × 10−8 | 1.19 (1.12–1.26) | 4.6 × 10−8 | |||
| Uterine cervical cancer | rs140668832 | 6 | 30 479 914 | the MHC region | A | 0.11 | 0.95 | 1.73 (1.45–2.05) | 5.8 × 10−10 | 1.92 (1.57–2.35) | 2.9 × 10−10 | 1.07 (1.05–1.09) | 1.8 × 10−9 | |
| rs117670375 | 6 | 30 687 472 | the MHC region | C | 0.11 | 0.94 | 1.73 (1.46–2.06) | 5.2 × 10−10 | 1.92 (1.57–2.35) | 3.0 × 10−10 | 1.07 (1.05–1.09) | 1.9 × 10−9 | ||
| rs150806792 | 11 | 2 179 342 | C | 0.012 | 0.92 | 2.79 (1.91–4.06) | 1.1 × 10−7 | 4.96 (2.79–8.83) | 4.9 × 10−8 | 1.19 (1.12–1.26) | 8.3 × 10−8 | |||
| rs140991990 | 17 | 70 097 851 | A | 0.015 | 0.72 | 2.76 (1.89–4.05) | 1.8 × 10−7 | 4.94 (2.80–8.71) | 3.3 × 10−8 | 1.16 (1.10–1.23) | 6.8 × 10−8 | |||
The underlined odds ratios (OR) and P-values indicate the analytic method which showed the most significant association for each SNP
Among the three methods, mach2dat, BOLT-LMM, and MTAG. OR in BOLT-LMM is adjusted using case fraction u, using the formula; log(OR) = β/(u × (1-u))
aVariants significantly associated with gynecologic diseases
bBased on hg19
cReference (Ref) and alternative (Alt) alleles on forward strand
dAlternative allele frequency among control subjects
Fig. 2Manhattan plots of meta-analysis of the five GWASs of gynecologic diseases. Manhattan plots of association P-values of all cases versus shared controls (top), cross-trait random effect meta-analysis of the five GWASs of gynecologic diseases controlling for the overlapping samples using RE2C (middle), and the reverse regression analysis using SCOPA (bottom). Analyses under the logistic regression model using mach2dat and under the linear mixed model using BOLT-LMM are displayed from left to right. Horizontal dashed grey lines indicate genome-wide significance threshold (P < 5.0 × 10−8). Dots colored in red indicates genome-wide significant loci. An asterisk indicates the novel findings
Meta-analysis results of the five GWASs of gynecologic diseases
| SNPa | Chr | Position (bp)b | Genes | Ref | Alt Allele Freqd | Imputation Rsq | mach2dat | BOLT-LMM | Novel loci | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All cases vs controls | RE2C | All cases vs controls | RE2C | ||||||||||||
| OR (95% CI) | OR (95% CI) | ||||||||||||||
| rs7412010 | 1 | 22 436 446 | G | 0.58 | 0.98 | 1.14 (1.10–1.19) | 2.3 × 10−10 | 4.3 × 10−11 | 77.8 | 1.12 (1.08–1.15) | 5.0 × 10−11 | 3.7 × 10−10 | 91.7 | ||
| rs937380553 | 2 | 52 063 361 | A | 0.014 | 0.94 | 1.22 (1.03–1.45) | 2.1 × 10−2 | 8.9 × 10−8 | 83.0 | 1.16 (1.01–1.33) | 3.6 × 10−2 | 2.0 × 10−8 | 77.4 | * | |
| rs202217993 | 6 | 29 386 975 | the MHC region | GT | 0.10 | 0.95 | 1.20 (1.12–1.28) | 2.7 × 10−7 | 1.8 × 10−8 | 69.2 | 1.18 (1.11–1.25) | 1.2 × 10−8 | 1.8 × 10−9 | 76.2 | |
| rs17179851 | 6 | 29 924 440 | the MHC region | T | 0.25 | 0.93 | 1.14 (1.08–1.19) | 8.8 × 10−8 | 9.3 × 10−8 | 44.3 | 1.11 (1.07–1.15) | 6.8 × 10−8 | 6.8 × 10−7 | 74.1 | |
| rs117670375 | 6 | 30 687 472 | the MHC region | C | 0.11 | 0.94 | 1.18 (1.10–1.26) | 1.0 × 10−6 | 2.8 × 10−9 | 81.3 | 1.16 (1.10–1.22) | 1.3 × 10−7 | 2.0 × 10−10 | 79.2 | |
| rs12225799 | 11 | 241 124 | C | 0.13 | 0.99 | 0.78 (0.73–0.84) | 1.5 × 10−13 | 2.1 × 10−15 | 83.7 | 0.81 (0.78–0.86) | 3.3 × 10−16 | 3.7 × 10−17 | 95.3 | ||
| rs141244868 | 11 | 244 817 | GA | 0.13 | 0.98 | 0.78 (0.73–0.84) | 1.9 × 10−13 | 1.6 × 10−15 | 84.0 | 0.81 (0.78–0.86) | 4.7 × 10−16 | 4.2 × 10−17 | 95.3 | ||
| rs112251865 | 22 | 40 665 225 | C | 0.36 | 0.99 | 1.12 (1.08–1.17) | 3.0 × 10−8 | 2.3 × 10−9 | 75.5 | 1.10 (1.07–1.14) | 7.6 × 10−9 | 3.3 × 10−8 | 91.4 | ||
| rs17332320 | 22 | 40 711 620 | G | 0.36 | 1.00 | 1.12 (1.08–1.17) | 2.4 × 10−8 | 2.9 × 10−7 | 77.3 | 1.10 (1.07–1.14) | 6.7 × 10−9 | 1.5 × 10−8 | 91.6 | ||
The underlined odds ratios (OR) and P-values indicate the analytic method which showed the most significant association for each SNP among the four methods, joint analysis of all cases versus shared controls and random-effect meta-analysis using mach2dat and BOLT-LMM, respectively
OR in BOLT-LMM is adjusted using case fraction u, using the formula; log(OR) = β/(u × (1-u)) RE2C does not produce OR
aVariants significantly associated with gynecologic diseases.
bBased on hg19
cReference (Ref) and alternative (Alt) alleles on forward strand
dAlternative allele frequency among control subjects
eI statistics were calculated using Metasoft (v2.0.1)
Summary statistics of the detected variants in SCOPA
| SNPa | Chr | Position (bp)b | Genes | Ref / Altc | AltAlleleFreqd | Imputation Rsq | Diseases in the best model | Novel loci | |
|---|---|---|---|---|---|---|---|---|---|
| rs7412010 | 1 | 22 436 446 | C | 0.42 | 0.98 | UF | 7.0E–11 | ||
| rs6432216 | 2 | 11 702 960 | T | 0.27 | 1.00 | UF + Endometriosis | 4.6E–08 | ||
| rs937380553 | 2 | 52 063 361 | A | 0.014 | 0.94 | Endometriosis + OC + UEC | 3.6E–11 | ||
| rs9257985 | 6 | 29 652 253 | The MHC region | A | 0.14 | 1.00 | UF + UCC | 1.5E–08 | |
| rs117670375 | 6 | 30 687 472 | The MHC region | C | 0.11 | 0.95 | UCC | 1.4E–08 | |
| rs2507968 | 6 | 31 372 718 | The MHC region | G | 0.74 | 0.95 | UF + UCC | 2.5E–09 | |
| rs9271215 | 6 | 32 579 277 | The MHC region | C | 0.53 | 0.90 | UF + UCC | 2.4E–08 | |
| rs116832992 | 7 | 31 784 755 | T | 0.059 | 0.91 | Endometriosis + UCC | 3.2E–09 | ||
| rs73494486 | 9 | 101 341 851 | C | 0.14 | 0.95 | UF + OC | 4.8E–08 | ||
| rs138060871 | 11 | 241 284 | G | 0.085 | 0.86 | UF | 7.8E–16 | ||
| rs145152209 | 15 | 84 077 212 | A | 0.075 | 0.96 | UF | 3.3E–08 | ||
| 17:41200107 | 17 | 41 200 107 | C | 0.012 | 0.79 | OC | 1.0E–08 | ||
| 17:70097851 | 17 | 70 097 851 | A | 0.016 | 0.73 | UCC | 3.9E–08 | ||
| rs147427629 | 21 | 40 419 321 | G | 0.023 | 0.84 | OC + UEC | 3.8E–08 | * | |
| rs17332320 | 22 | 40 711 620 | G | 0.36 | 1.00 | UF + Endometriosis | 4.7E–13 |
UF Uterine fibroid, OC ovarian cancer, UEC uterine endometrial cancer, UCC uterine cervical cancer
aVariants significantly associated with gynecologic diseases
bBased on hg19
cReference (Ref) and alternative (Alt) alleles on forward strand
dAlternative allele frequency among control subjects
eSCOPA produces P-value of the best model
Fig. 3Cross-trait evaluation of genetic correlation among five gynecologic diseases. Genetic correlations among five gynecologic diseases calculated under the linear mixed model by Haseman–Elston regression using PCGC-s. Correlation is expressed by the color and size of square on the right upper triangle, while represented in digits on the left lower triangle. Asterisks indicate that the real output value exceeded one but was set to one for display purpose