| Literature DB >> 30557369 |
Varalee Yodsurang1, Yaqi Tang1,2, Yukie Takahashi1,3, Chizu Tanikawa4, Yoichiro Kamatani5, Atsushi Takahashi5, Yukihide Momozawa5, Nobuo Fuse6, Junichi Sugawara6, Atsushi Shimizu7, Akimune Fukushima7, Asahi Hishida8, Norihiro Furusyo9, Mariko Naito8,10, Kenji Wakai8, Taiki Yamaji11, Norie Sawada11, Motoki Iwasaki11, Shoichiro Tsugane12, Makoto Hirata4, Yoshinori Murakami13, Michiaki Kubo5, Koichi Matsuda1,4.
Abstract
Genome-wide association studies (GWAS) have identified greater than 30 variants associated with ovarian cancer, but most of these variants were investigated in European populations. Here, we integrated GWAS and subsequent functional analyses to identify the genetic variants with potential regulatory effects. We conducted GWAS for ovarian cancer using 681 Japanese cases and 17,492 controls and found that rs137672 on 22q13.1 exhibited a strong association with a P-value of 1.05 × 10(-7) and an odds ratio of 0.573 with a 95% confidence interval of 0.466-0.703. In addition, three previously reported SNPs, i.e., rs10088218, rs9870207 and rs1400482, were validated in the Japanese population (P < 0.05) with the same risk allele as noted in previous studies. Functional studies including regulatory feature analysis and electrophoretic mobility shift assay (EMSA) revealed two regulatory SNPs in 22q13.1, rs2072872 and rs6509, that affect the binding affinity to some nuclear proteins in ovarian cancer cells. The plausible regulatory proteins whose motifs could be affected by the allele changes of these two SNPs were also proposed. Moreover, the protective G allele of rs6509 was associated with a decreased SYNGR1 expression level in normal ovarian tissues. Our findings elucidated the regulatory variants in 22q13.1 that are associated with ovarian cancer risk.Entities:
Mesh:
Year: 2018 PMID: 30557369 PMCID: PMC6296504 DOI: 10.1371/journal.pone.0209096
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of study population.
| Sample type | Source | Age (mean ± SD) | Platform | Number of SNPs | |
|---|---|---|---|---|---|
| Ovarian cancer cases | Biobank Japan | 30 | 54.43±11.8 | HumanExome_v10 | 247,870 |
| OmniExpress | 730,525 | ||||
| 571 | 56.99±11.25 | HumanExome_v11 | 242,901 | ||
| OmniExpress | 730,525 | ||||
| 12 | 70.17±9.33 | OmniExpressExome_v10 | 951,117 | ||
| 68 | 58.03±12.57 | OmniExpressExome_v12 | 964,193 | ||
| Controls | 1) JPHC | 5,019 | 53.49 ± 7.80 | OmniExpressExome_v12 | 964,193 |
| 2) J-MICC | 7,049 | 54.25 ± 9.41 | OmniExpressExome_v12 | 964,193 | |
| 3) ToMMo | 2,852 | 58.28 ± 11.95 | OmniExpressExome_v12 | 964,193 | |
| 4) IMM | 2,572 | 62.21 ± 10.03 | OmniExpressExome_v12 | 964,193 | |
a JPHC, Japan Public Health Center-based Prospective Study; J-MICC, Japan Multi-Institutional Collaborative Cohort study; ToMMo, Tohoku Medical Megabank Organization; IMM, Iwate Tohoku Medical Megabank Organization.
Fig 1Genome-wide association results for ovarian cancer in a Japanese population.
(A) Manhattan plot of 7,521,072 imputed SNPs on chromosomes 1 to 22. The red line indicates a threshold P-value of 1 × 10−5. Among 201 SNPs with P-values less than the threshold, rs137672 on chromosome 22 exhibited the lowest P-value (P = 1.05 × 10−7). (B) Quantile-quantile plot. The genomic inflation factor lambda (λ) was 1.035.
Associations of lead SNPs in 24 regions meeting the criteria (P < 1 × 10−5 and Rsq > 0.3).
| Locus | SNP | Eff/non allele | Case freq | Ctrl freq | Rsq | OR (95% CI) | Gene | Relative location | |
|---|---|---|---|---|---|---|---|---|---|
| 1p22.1 | rs185345278 | A/G | 0.9664 | 0.9784 | 0.3344 | 0.348 (0.221–0.547) | 4.86 × 10−6 | -23582 | |
| 1p12 | rs12031579 | G/A | 0.9177 | 0.9453 | 0.6395 | 0.548 (0.432–0.696) | 7.88 × 10−7 | -60859 | |
| 1q22 | rs188625872 | C/T | 0.9885 | 0.9966 | 0.8794 | 0.266 (0.154–0.460) | 2.01 × 10−6 | 0 | |
| 1q24.3 | rs12117623 | C/A | 0.4070 | 0.4694 | 0.9943 | 0.762 (0.683–0.850) | 1.46 × 10−6 | 0 | |
| 2p11.2 | rs17027263 | C/T | 0.8437 | 0.8828 | 0.9998 | 0.693 (0.596–0.806) | 1.69 × 10−6 | 0 | |
| 3q27.3 | rs6801612 | A/G | 0.7460 | 0.6830 | 0.9997 | 1.328 (1.174–1.503) | 7.57 × 10−6 | 1722 | |
| 5p15.32 | rs12658731 | G/A | 0.7030 | 0.6438 | 0.7678 | 1.405 (1.227–1.608) | 8.57 × 10−7 | 0 | |
| 5q31.2 | rs147867139 | A/G | 0.9671 | 0.9825 | 0.7531 | 0.455 (0.322–0.642) | 7.65 × 10−6 | 29634 | |
| 5q35.3 | rs60982503 | G/A | 0.7852 | 0.7345 | 0.9017 | 1.384 (1.204–1.591) | 4.66 × 10−6 | -9381 | |
| 6p24.3 | rs303051 | A/G | 0.5521 | 0.6053 | 0.9928 | 0.776 (0.696–0.867) | 6.21 × 10−6 | 0 | |
| 6p21.31 | rs56855829 | C/T | 0.9877 | 0.9964 | 0.8154 | 0.263 (0.151–0.459) | 2.37 × 10−6 | 0 | |
| 6q16.1 | rs4599655 | A/C | 0.8281 | 0.8635 | 0.9934 | 0.718 (0.621–0.829) | 7.69 × 10−6 | 68769 | |
| 6q24.3 | rs73589840 | C/T | 0.9452 | 0.9671 | 0.9537 | 0.573 (0.450–0.731) | 7.42 × 10−6 | 0 | |
| 7p12.3 | rs181474944 | C/T | 0.9710 | 0.9850 | 0.6381 | 0.407 (0.276–0.600) | 5.60 × 10−6 | 3731 | |
| 7q21.13 | rs76926936 | T/G | 0.9261 | 0.9483 | 0.4682 | 0.520 (0.390–0.694) | 9.24 × 10−6 | 361650 | |
| 9q21.33 | rs10117922 | A/G | 0.6249 | 0.6750 | 0.7793 | 0.749 (0.661–0.849) | 7.35 × 10−6 | 0 | |
| 9q34.3 | rs10858374 | T/C | 0.8444 | 0.8894 | 0.9488 | 0.681 (0.583–0.795) | 1.03 × 10−6 | -95595 | |
| 10q25.3 | rs2615880 | A/C | 0.8385 | 0.8766 | 0.9776 | 0.713 (0.614–0.828) | 8.30 × 10−6 | 0 | |
| 12q15 | rs789336 | C/T | 0.6857 | 0.6082 | 0.9999 | 1.369 (1.217–1.540) | 1.46 × 10−7 | 69976 | |
| 16q12.2 | rs145065165 | G/A | 0.9640 | 0.9777 | 0.4136 | 0.348 (0.233–0.519) | 2.33 × 10−7 | 5394 | |
| 17p13.2 | rs11870446 | C/G | 0.8360 | 0.8762 | 0.7774 | 0.674 (0.570–0.796) | 3.37 × 10−6 | 0 | |
| 19q13.43 | rs12151036 | G/T | 0.9736 | 0.9881 | 0.9998 | 0.453 (0.322–0.636) | 5.27 × 10−6 | 0 | |
| 20q13.31 | rs1884920 | A/G | 0.4534 | 0.4973 | 0.5171 | 0.711 (0.611–0.827) | 8.29 × 10−6 | 8727 | |
| 22q13.1 | rs137672 | C/T | 0.9045 | 0.9399 | 0.7961 | 0.573 (0.466–0.703) | 1.05 × 10−7 | -8860 |
a Effect allele/non-effect allele.
b Effect allele frequency.
c Rsq, imputation quality score.
d OR, odd ratio (non-effect alleles were considered as references); 95% CI, 95% confidence interval.
e Relative location, the distance from the transcription start site of the nearest gene to the SNP
Associations of the previously reported SNPs in the study population.
| Locus | SNP | Eff/non allele | GWAS results in this study (Japanese) | Gene | Previous study | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Case freq | Ctrl freq | Rsq | OR (95% CI) | Concordance with this study | PMID (population) | |||||
| 2q13 | rs2165109 | A/C | 0.5403 | 0.5183 | 1.72 × 10−1 | 0.9929 | 1.079 (0.967–1.204) | opposite | 28346442 (EU) | |
| 2q13 | rs17041869 | A/G | 0.7439 | 0.7474 | 8.50 × 10−1 | 0.9987 | 0.988 (0.873–1.118) | opposite | 28346442 (EU) | |
| 2q14.1 | rs752590 | A/G | 0.8554 | 0.8459 | 2.60 × 10−1 | 0.8992 | 1.098 (0.934–1.293) | same | 26075790 (EU) | |
| 2q31.1 | rs711830 | G/A | 0.7832 | 0.7938 | 7.39 × 10−1 | 0.9972 | 0.978 (0.858–1.116) | same | 28346442 (EU) | |
| 2q31.1 | rs2072590 | C/A | 0.7830 | 0.7935 | 7.52 × 10−1 | 0.9970 | 0.979 (0.859–1.117) | same | 20852632 (EU) | |
| 2q31.1 | rs6755777 | G/T | 0.7828 | 0.7931 | 7.68 × 10−1 | 0.9982 | 0.980 (0.860–1.118) | same | 28346442 (EU) | |
| 3q25.31 | rs2665390 | T/C | 0.9934 | 0.9901 | 1.39 × 10−1 | 0.9994 | 1.655 (0.850–3.223) | same | 20852632 (EU) | |
| 3q28 | rs9870207 | A/G | 0.5387 | 0.5034 | 0.9994 | 1.124 (1.007–1.255) | same | 28346442 (EU) | ||
| 4q32.3 | rs13113999 | T/G | 0.9397 | 0.9439 | 2.94 × 10−1 | 0.7344 | 0.867 (0.664–1.132) | opposite | 28346442 (EU) | |
| 5p15.33 | rs10069690 | C/T | 0.7485 | 0.7550 | 2.22 × 10−1 | 0.9847 | 0.925 (0.816–1.049) | same | 25581431 (EU) | |
| 5p15.33 | rs7705526 | C/A | 0.6226 | 0.6361 | 1.27 × 10−1 | 0.8361 | 0.909 (0.805–1.027) | same | 28346442 (EU) | |
| 7p12.1 | rs2190503 | G/A | 0.8747 | 0.8781 | 1.91 × 10−1 | 0.9948 | 0.895 (0.760–1.056) | same | 24190013 (EU) | |
| 7p12.1 | rs6593140 | T/C | 0.8869 | 0.8875 | 3.15 × 10−1 | 0.9998 | 0.916 (0.771–1.088) | same | 24190013 (EU) | |
| 7p12.1 | rs2329554 | G/A | 0.6888 | 0.6946 | 4.71 × 10−1 | 0.9890 | 0.957 (0.852–1.077) | same | 24190013 (EU) | |
| 8q24.21 | rs9886651 | A/G | 0.7318 | 0.7160 | 4.96 × 10−1 | 0.9823 | 1.044 (0.923–1.181) | opposite | 28346442 (EU) | |
| 8q24.21 | rs1400482 | G/A | 0.9897 | 0.9830 | 0.9867 | 1.709 (1.001–2.919) | same | 28346442 (EU) | ||
| 8q24.21 | rs10088218 | G/A | 0.9897 | 0.9829 | 0.9988 | 1.713 (1.007–2.913) | same | 20852632 (EU) | ||
| 9p22.2 | rs3814113 | T/C | 0.7525 | 0.7519 | 8.30 × 10−1 | 0.9975 | 1.014 (0.895–1.150) | same | 19648919 (EU) | |
| 9q22.33 | rs1413299 | T/G | 0.7249 | 0.7326 | 7.96 × 10−1 | 0.9988 | 0.984 (0.872–1.111) | same | 25134534 (AS) | |
| 9q34.2 | rs633862 | T/C | 0.5470 | 0.5265 | 3.05 × 10−1 | 1.0000 | 1.058 (0.950–1.179) | same | 25134534 (AS) | |
| 10p11.21 | rs1192691 | G/T | 0.4723 | 0.4746 | 4.02 × 10−1 | 0.9902 | 0.954 (0.855–1.065) | opposite | 25134534 (AS) | |
| 12q14.2 | rs11175194 | G/A | 0.6417 | 0.6377 | 6.76 × 10−1 | 0.9998 | 1.024 (0.914–1.148) | same | 25134534 (AS) | |
| 12q22 | rs11108890 | C/A | 0.8847 | 0.8933 | 9.89 × 10−2 | 0.9994 | 0.867 (0.731–1.028) | same | 24190013 (EU) | |
| 13q14.2 | rs970651 | G/A | 0.7590 | 0.7533 | 8.57 × 10−1 | 0.9844 | 1.012 (0.893–1.147) | opposite | 24190013 (EU) | |
| 14q24.1 | rs17106154 | T/C | 0.6988 | 0.6817 | 2.61 × 10−1 | 0.9093 | 1.074 (0.949–1.215) | opposite | 24190013 (EU) | |
| 17q12 | rs7405776 | G/A | 0.6906 | 0.6873 | 8.46 × 10−1 | 0.6209 | 1.015 (0.875–1.178) | same | 28346442 (EU) | |
| 17q12 | rs757210 | C/T | 0.6854 | 0.6821 | 8.57 × 10−1 | 0.6656 | 1.013 (0.878–1.169) | opposite | 25581431 (EU) | |
| 17q12 | rs11651755 | T/C | 0.6799 | 0.6779 | 9.76 × 10−1 | 0.9990 | 1.002 (0.891–1.127) | same | 28346442 (EU) | |
| 17q21.31 | rs183211 | A/G | 0.7048 | 0.7051 | 7.30 × 10−1 | 0.8396 | 1.023 (0.899–1.165) | same | 25581431 (EU) | |
| 17q21.32 | rs9303542 | A/G | 0.8094 | 0.8094 | 3.63 × 10−1 | 0.9788 | 0.937 (0.815–1.077) | same | 25581431 (EU) | |
| 19p13.11 | rs2363956 | T/G | 0.6970 | 0.6894 | 2.52 × 10−1 | 0.9995 | 1.071 (0.953–1.205) | opposite | 20852633 (EU) | |
| 19p13.11 | rs1469713 | A/G | 0.7357 | 0.7374 | 9.17 × 10−1 | 0.9890 | 0.993 (0.878–1.124) | same | 28346442 (EU) | |
| 22q12.1 | rs6005807 | C/T | 0.9751 | 0.9685 | 1.32 × 10−1 | 0.9183 | 1.324 (0.920–1.907) | same | 28346442 (EU) | |
| 22q12.2 | rs9609538 | T/C | 0.8122 | 0.8025 | 2.06 × 10−1 | 0.9986 | 1.094 (0.952–1.258) | same | 24190013 (EU) | |
a Effect allele/non-effect allele.
b Effect allele frequency.
c Rsq, imputation quality score.
d OR, odd ratio (non-effect alleles were considered as references); 95% CI, 95% confidence interval.
e EU, European; AS, Asian.
Fig 2Regulatory feature analysis of 30 candidate SNPs at 22q13.1.
(A) The regional plot of 22q13.1, the most associated loci in a Japanese population, with an enlarged view of 30 candidate SNPs meeting the criteria (P < 1 × 10−5 and Rsq > 0.3). The SNPs surrounding the lead SNP (rs137672) are color-coded to reflect their correlation as indicated. Pairwise r2 values are obtained from 1000 Genomes East Asian data (March 2012 release). Genes, the position of exons and the direction of transcription obtained from the UCSC genome browser. (B). Thirty SNPs were marked on chromosome 22 (chr22:39,671,929–39,738,825; GRCh37/hg19) based on their positions. Nine yellow-highlighted SNPs, variant located in regulatory regions in 22q13.1. The regulatory features included 1) epigenetic markers, i.e., H3K4Me1, H3K4Me3, and H3K27Ac; 2) Ensembl regulatory build representing regions involved in gene regulation; 3) transcriptional regulation binding data.
Transcriptional regulations and eQTL analysis of the nine SNPs in regulatory regions in 22q13.1.
| SNP | r2 to rs6509 | Transcriptional regulations | eQTL in ovary | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| ReMap | Ensembl (data in ovary) | Ref/alt allele | |||||||||
| TF | Epigenome activity | TF | Beta | Beta | Beta | ||||||
| rs137620 | 1.000 | atf2 | N/A | - | T/C | 0.102 | 0.18 | 0.777 | -0.04 | 0.025 | 0.19 |
| rs137621 | 1.000 | atf2 | N/A | - | G/A | 0.079 | 0.19 | 0.784 | -0.04 | 0.015 | 0.21 |
| rs94852 | 1.000 | sin3a, bcl6 | N/A | - | T/C | 0.119 | 0.16 | 0.42 | -0.12 | 0.023 | 0.19 |
| rs2072872 | 1.000 | sin3a, bcl6, mllt1, smad5, chd1 | N/A | - | A/G | 0.14 | -0.2 | 0.368 | 0.17 | 0.297 | -0.11 |
| rs738331 | 1.000 | sin3a, bcl6, mllt1, smad5, chd1, max | Active promotor (G allele) | Cjun, Gabp, Jund, FOSL2, Egr1, CTCF, Yy1, JUN::FOS, SP1 | A/G | 0.14 | -0.2 | 0.368 | 0.17 | 0.298 | -0.11 |
| rs6509 | 1.000 | bcl6, chd1, mllt1, max, taf3 | Active promotor (T allele) | Yy1, Jund, SP1, CTCF, JUN::FOS, Cjun, FOSL2, Gabp, Egr1 | C/T | 0.065 | 0.2 | 0.648 | -0.07 | 0.005 | 0.23 |
| rs470082 | 1.000 | chd1, mllt1, max, taf3, bclaf1, smad5, bcor, brd4, atf2, mxi1, cdk8, myc, taf1, smc3, sin3a, rad21, cbfb, bcl6, chd8, phf8, med1 | Active promotor (T allele) | SP1, Yy1, JUN::FOS, Egr1, FOSL2, Gabp, Jund, Cjun, CTCF | C/T | 0.091 | 0.19 | 0.379 | -0.13 | 0.003 | 0.25 |
| rs5757613 | 0.543 | chd1, mllt1, max, taf3, bclaf1, smad5, bcor, brd4, atf2, mxi1, cdk8, myc, taf1, smc3, sin3a, rad21, cbfb, bcl6, chd8, phf8, med1 | Active promotor (A allele) | Cjun, Jund, Gabp, Egr1, FOSL2, CTCF, JUN::FOS, Yy1, SP1 | G/A | 0.116 | -0.22 | 0.164 | 0.26 | 0.171 | -0.15 |
| rs137627 | 1.000 | chd1, max, taf3, smad5, bcor, brd4, atf2, mxi1, cdk8, taf1, smc3, sin3a, rad21, bcl6, chd8, phf8, med1, ctcf, stag1 | Active promotor (C and A alleles) | JUN::FOS, FOSL2, SP1, Cjun, Gabp, Egr1, CTCF, Yy1, Jund | G/A | 0.049 | 0.22 | 0.383 | -0.13 | 0.002 | 0.27 |
a Pairwise linkage disequilibrium (r2) to rs6509 in Japanese.
b Transcriptional regulations in ReMap and Ensembl; TF, transcription factor.
c Reference/alternative allele.
d Beta, effect size representing the effect of alternative allele to the gene expression; (value > 0), increased expression; value < 0, decreased expression.
Fig 3Allele-specific binding of nuclear proteins to rs2072872 and rs6509.
EMSA using 31-bp labeled oligonucleotide probes flanking each SNP (SNP ± 15 bp). Sequences of oligonucleotide probes are listed in S2 Table. The shifted band indicated the interaction between nuclear protein extracted from SKOV3 cells and probes containing the SNP allele as indicated, i.e., A or G allele. The star indicates specific binding to the G allele of each SNP. Arrow indicates non-specific interaction found in every sample. The intensity of a shifted band was quantified based on the fold-change of the G allele with respect to the A allele using ImageJ software.
Fig 4The predicted transcription factor motifs containing rs2072872 and rs6509.
The SNP with flanking sequences (SNP ± 25 nucleotides) was searched for transcription factor binding site motifs using HOCOMOCO-11 collection (http://opera.autosome.ru) with a p-value cutoff = 0.0005 and fold-change cutoff = 4.0. The motifs built from ChIP-Seq data with quality A demonstrating high affinity (indicated by fold-change of G/A) for G allele are included. The SNP position is labeled in red.