Literature DB >> 29262571

Ovarian cancer variant rs2072590 is associated with HOXD1 and HOXD3 gene expression.

Liyuan Guo1, Yan Peng2, Lei Sun3, Xia Han4, Juan Xu4, Dongwei Mao4.   

Abstract

Ovarian cancer (OC) is a common cancer in women and the leading cause of deaths from gynaecological malignancies in the world. In addition to the candidate gene approach to identify OC susceptibility genes, the genome-wide association study (GWAS) methods have reported new variants that are associated with OC risk. The minor allele of rs2072590 at 2q31 was associated with an increased OC risk, and was primarily significant for serous subtype. The OC risk-associated SNP rs2072590 lies in non-coding DNA downstream of HOXD3 and upstream of HOXD1, and it tags SNPs in the HOXD3 3' UTR. We think that the non-coding rs2072590 variant may contribute to OC susceptibility by regulating the gene expression of HOXD1 and HOXD3. In order to investigate this association, we performed a bioinformatics analysis by a functional annotation of rs2072590 variant using RegulomeDB (version 1.1), HaploReg (version 4.1), and PhenoScanner (version 1.1). Using HaploReg, we identified 19 genetic variants tagged by rs2072590 variant with with r2 >= 0.8. Using RegulomeDB, we identified that three genetic variants are likely to affect TF binding + any motif + DNase Footprint + DNase peak. Other genetic variants are likely to affect TF binding + DNase peak. Using PhenoScanner (version 1.1), we identified that these 19 genetic variants could significantly regulate the expression of nearby genes, especially the HOXD1 and HOXD3 in human ovary tissue.

Entities:  

Keywords:  gene expression; genome-wide association study; ovarian cancer

Year:  2017        PMID: 29262571      PMCID: PMC5732737          DOI: 10.18632/oncotarget.21902

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


INTRODUCTION

Ovarian cancer (OC) is a common cancer in women and the leading cause of deaths from gynaecological malignancies in the world [1]. Like other human complex diseases, OC is caused by the combination of genetic variants and environmental factors, including the familial BRCA1 and BRCA2 mutations and common genetic variants of lower penetrance [1]. In addition to the candidate gene approach to identify OC susceptibility genes, the genome-wide association study (GWAS) methods have also reported new variants that are associated with OC risk [1]. However, the exact genetic mechanisms for these OC susceptibility variants are still unclear [2]. It is reported that the potential associations between gene expression and OC risk alleles may connect risk variants to their putative target genes/transcripts and biological pathways [2]. The minor allele of rs2072590 at 2q31 was associated with an increased OC risk (OR = 1.16, 95% CI 1.12–1.21, p = 4.5 × 10−14), and was primarily significant for serous subtype (OR = 1.20, 95% CI 1.14–1.25, p = 3.8×10−14) [3]. The 2q31 locus contains a family of homeobox (HOX) genes involved in regulating embryogenesis and organogenesis [3]. Altered expression of HOX genes has been reported in many cancers [3]. The OC risk-associated SNP rs2072590 lies in non-coding DNA downstream of HOXD3 and upstream of HOXD1, and it tags SNPs in the HOXD3 3′ UTR [3]. We think that the non-coding rs2072590 variant may contribute to OC susceptibility by regulating the gene expression of HOXD1 and HOXD3. In order to investigate this association, we conducted a functional annotation of rs2072590 variant using RegulomeDB (version 1.1) [4], HaploReg (version 4.1) [5], and PhenoScanner (version 1.1) [6].

RESULTS

LD analysis using HaploReg

Using the LD information from the 1000 Genomes Project (EUR), we got 19 genetic variants tagged by rs2072590 variant with with r2 >= 0.8. These 19 genetic variants are located around the HOXD4, HOXD3, AC009336.24 and HOXD-AS1. Here, we give the detailed information including the LD information about these variants in Table 1.
Table 1

rs2072590 and variants with r2 > = 0.8

SNPchromosomepos (hg38)LD (r2)LD (D’)RefAltGeneFunctional annotation
rs497250421761539980.890.98TCHOXD4
rs255180221761574300.850.93CGHOXD3
rs225289521761591920.960.98AGHOXD3
rs225289421761591940.880.96GCHOXD3
rs285753821761595330.981CTHOXD3
rs285754021761619700.980.99GTHOXD3
rs211355921761663710.970.99AGHOXD3intronic
rs71785221761668950.981CTHOXD3intronic
rs224913121761673670.981CTHOXD3intronic
rs285753221761685550.981AGHOXD3intronic
rs1051929217617202611TCHOXD3synonymous
rs711830217617258311AGHOXD33′-UTR
rs1318778217617310311CGHOXD3
rs1549334217617446911GAHOXD3
rs643357121761748500.981GTHOXD3
rs2072590217617790511ACAC009336.24intronic
rs675576621761784770.961TCAC009336.24intronic
rs675577721761784980.991TGAC009336.24intronic
rs156231521761807540.981TAHOXD-AS1intronic

AFR, African samples; AMR, Ad Mixed American samples; ASN, East Asian samples; EUR, European samples; LD, linkage disequilibrium; SNP, single nucleotide polymorphism; Ref = reference allele; Alt = altered allele.

AFR, African samples; AMR, Ad Mixed American samples; ASN, East Asian samples; EUR, European samples; LD, linkage disequilibrium; SNP, single nucleotide polymorphism; Ref = reference allele; Alt = altered allele.

Functional annotation using RegulomeDB

RegulomeDB was used to annotate these 19 genetic variants with known and predicted regulatory elements. The results showed that three genetic variants including rs1562315, rs2551802 and rs6433571 likely to affect TF binding + any motif + DNase Footprint + DNase peak, as described in Table 2. Other genetic variants are likely to affect TF binding + DNase peak. More detailed results are described in Table 2.
Table 2

Functional annotation results using RegulomeDB

SNPchromosomepos (hg38)RefAltRegulome DB Score
rs15623152176180754TA2b
rs25518022176157430CG2b
rs64335712176174850GT2b
rs10519292176172026TC4
rs13187782176173103CG4
rs15493342176174469GA4
rs20725902176177905AC4
rs22491312176167367CT4
rs22528942176159194GC4
rs22528952176159192AG4
rs28575382176159533CT4
rs28575402176161970GT4
rs67557662176178477TC4
rs67557772176178498TG4
rs7118302176172583AG4
rs21135592176166371AG5
rs28575322176168555AG5
rs49725042176153998TC5
rs7178522176166895CT5

1a, eQTL + TF binding + matched TF motif + matched DNase Footprint + DNase peak; 1b, eQTL + TF binding + any motif + DNase Footprint + DNase peak; 1c, eQTL + TF binding + matched TF motif + DNase peak; 1d, eQTL + TF binding + any motif + DNase peak; 1e, eQTL + TF binding + matched TF motif; 1f, eQTL + TF binding / DNase peak; 2a, TF binding + matched TF motif + matched DNase Footprint + DNase peak; 2b, TF binding + any motif + DNase Footprint + DNase peak; 2c, TF binding + matched TF motif + DNase peak; 3a, TF binding + any motif + DNase peak; 3b, TF binding + matched TF motif; 4, TF binding + DNase peak; 5, TF binding or DNase peak; 6, other.

1a, eQTL + TF binding + matched TF motif + matched DNase Footprint + DNase peak; 1b, eQTL + TF binding + any motif + DNase Footprint + DNase peak; 1c, eQTL + TF binding + matched TF motif + DNase peak; 1d, eQTL + TF binding + any motif + DNase peak; 1e, eQTL + TF binding + matched TF motif; 1f, eQTL + TF binding / DNase peak; 2a, TF binding + matched TF motif + matched DNase Footprint + DNase peak; 2b, TF binding + any motif + DNase Footprint + DNase peak; 2c, TF binding + matched TF motif + DNase peak; 3a, TF binding + any motif + DNase peak; 3b, TF binding + matched TF motif; 4, TF binding + DNase peak; 5, TF binding or DNase peak; 6, other.

Functional annotation using PhenoScanner

Using PhenoScanner (version 1.1), we identified that these 19 genetic variants could significantly regulate the expression of nearby genes including HOXD-AS1, HOXD3, HOXD1, HOXD4, ATP5G3, HOXD9, HOXD11, KIAA1715, MTX2, LINC01116, HOXD-AS2, HOXD8, and HOXD10 in 32 human tissues. These tissues include Adipose subcutaneous, Adipose visceral omentum, Artery tibial, Brain cerebellar hemisphere, Brain hippocampus, Brain nucleus accumbens basal ganglia, Brain putamen basal ganglia, Breast mammary tissue, Cells transformed fibroblasts, Colon sigmoid, Colon transverse, Esophagus gastroesophageal junction, Esophagus mucosa. Esophagus muscularis, Heart atrial appendage, Lung, Lymphoblastoid cell lines, Muscle skeletal, Nerve tibial, Ovary, Pancreas, Peripheral blood, Skin, Skin not sun exposed suprapubic, Skin sun exposed lower leg, Small intestine terminal ileum, Spleen, Stomach, Testis, Thyroid, Uterus and Whole blood. Interestingly, these genetic variants could significantly regulate the gene expression of HOXD1 and HOXD3 in human ovary tissue, as described in Table 3. More detailed results in 32 human tissues are described in Supplementary Table 1.
Table 3

19 genetic variants and gene expression in human ovary tissue

SNPPos (hg19)AllelesTissueGeneEnsemblNEffect AlleleBetaSEP
rs1051929chr2:177036754T/COvaryHOXD1ENSG00000128645.1185T0.48910.13840.000765
rs1051929chr2:177036754T/COvaryHOXD3ENSG00000128652.785T0.57010.15250.000396
rs1318778chr2:177037831C/GOvaryHOXD1ENSG00000128645.1185C0.49210.13670.000623
rs1318778chr2:177037831C/GOvaryHOXD3ENSG00000128652.785C0.57210.15070.000328
rs1549334chr2:177039197G/AOvaryHOXD1ENSG00000128645.1185G0.49210.13670.000623
rs1549334chr2:177039197G/AOvaryHOXD3ENSG00000128652.785G0.57210.15070.000328
rs1562315chr2:177045482T/AOvaryHOXD1ENSG00000128645.1185T0.44040.13340.00158
rs1562315chr2:177045482T/AOvaryHOXD3ENSG00000128652.785T0.51720.1470.000804
rs2072590chr2:177042633C/AOvaryHOXD1ENSG00000128645.1185C-0.49230.13670.00062
rs2072590chr2:177042633C/AOvaryHOXD3ENSG00000128652.785C-0.57240.15070.000327
rs2113559chr2:177031099G/AOvaryHOXD1ENSG00000128645.1185G-0.46230.14440.002132
rs2113559chr2:177031099G/AOvaryHOXD3ENSG00000128652.785G-0.54610.1590.001047
rs2249131chr2:177032095C/TOvaryHOXD1ENSG00000128645.1185C0.46290.14410.002068
rs2249131chr2:177032095C/TOvaryHOXD3ENSG00000128652.785C0.54640.15870.001019
rs2252894chr2:177023922C/GOvaryHOXD1ENSG00000128645.1185C-0.39230.14620.009254
rs2252895chr2:177023920A/GOvaryHOXD1ENSG00000128645.1185A0.47380.14390.001618
rs2252895chr2:177023920A/GOvaryHOXD3ENSG00000128652.785A0.51870.16050.001948
rs2551802chr2:177022158C/GOvaryHOXD1ENSG00000128645.1185C0.44330.13740.001984
rs2551802chr2:177022158C/GOvaryHOXD3ENSG00000128652.785C0.50680.15220.001447
rs2857532chr2:177033283A/GOvaryHOXD1ENSG00000128645.1185A0.46290.14410.002068
rs2857532chr2:177033283A/GOvaryHOXD3ENSG00000128652.785A0.54640.15870.001019
rs2857538chr2:177024261C/TOvaryHOXD1ENSG00000128645.1185C0.47150.14350.001652
rs2857538chr2:177024261C/TOvaryHOXD3ENSG00000128652.785C0.5320.15930.001399
rs2857540chr2:177026698G/TOvaryHOXD1ENSG00000128645.1185G0.41970.150.006799
rs2857540chr2:177026698G/TOvaryHOXD3ENSG00000128652.785G0.50230.16530.003439
rs4972504chr2:177018726C/TOvaryHOXD1ENSG00000128645.1185C-0.46580.14160.001635
rs4972504chr2:177018726C/TOvaryHOXD3ENSG00000128652.785C-0.50970.1580.001977
rs6433571chr2:177039578G/TOvaryHOXD1ENSG00000128645.1185G0.45840.13530.00121
rs6433571chr2:177039578G/TOvaryHOXD3ENSG00000128652.785G0.52540.14970.000827
rs6755766chr2:177043205C/TOvaryHOXD1ENSG00000128645.1185C-0.44810.13680.001702
rs6755766chr2:177043205C/TOvaryHOXD3ENSG00000128652.785C-0.53480.15020.000705
rs6755777chr2:177043226G/TOvaryHOXD1ENSG00000128645.1185G-0.44050.13340.001577
rs6755777chr2:177043226G/TOvaryHOXD3ENSG00000128652.785G-0.51710.1470.000807
rs711830chr2:177037311G/AOvaryHOXD1ENSG00000128645.1185G-0.49210.13670.000623
rs711830chr2:177037311G/AOvaryHOXD3ENSG00000128652.785G-0.57210.15070.000328
rs717852chr2:177031623T/COvaryHOXD1ENSG00000128645.1185T-0.46290.14410.002068
rs717852chr2:177031623T/COvaryHOXD3ENSG00000128652.785T-0.54640.15870.001019

DISCUSSION

Overall, the GWAS methods have reported new variants that are associated with OC risk [1]. However, the exact genetic mechanisms for these OC susceptibility variants are still unclear [2]. Evidence shows that the potential associations between gene expression and OC risk alleles may connect risk variants to their putative target genes/transcripts and biological pathways [2]. Zhao et al. selected seven OC risk variants including rs3814113 on 9p22, rs2072590 on 2q31, rs2665390 on 3q25, rs10088218, rs1516982, rs10098821 on 8q24, and rs2363956 on 19p13 [2]. They evaluated the associations between gene expression and OC risk alleles using the whole genome mRNA expression data in 121 lymphoblastoid cell lines from 74 non-related familial ovarian cancer patients, and 47 non-cancer unrelated family controls [2]. They identified two cis-associations between rs10098821 and c-Myc, and rs2072590 and HS.565379. The OC risk-associated SNP rs2072590 lies in non-coding DNA downstream of HOXD3 and upstream of HOXD1, and it tags SNPs in the HOXD3 3′ UTR [3]. However, Zhao et al. did not report any significant association between rs2072590 and HOXD1 or HOXD3. We think that the non-coding rs2072590 variant may contribute to OC susceptibility by regulating the gene expression of HOXD1 and HOXD3. Here, we conducted a functional annotation of rs2072590 variant using RegulomeDB (version 1.1) [4], HaploReg (version 4.1) [5], and PhenoScanner (version 1.1) [6]. Using HaploReg, we identified 19 genetic variants tagged by rs2072590 variant with with r2 >= 0.8. Using RegulomeDB, we identified that three genetic variants are likely to affect TF binding + any motif + DNase Footprint + DNase peak. Other genetic variants are likely to affect TF binding + DNase peak. Using PhenoScanner (version 1.1), we identified that these 19 genetic variants could significantly regulate the expression of nearby genes, especially the HOXD1 and HOXD3 in human ovary tissue. In addition to the OC, some other comprehensive functional annotation of human complex diseases have also been conducted including colorectal cancer [7, 8], prostate cancer [9-11], breast cancer [12], multiple sclerosis [13], and Alzheimer’s disease [14]. Collectively, we think that our results provide further insight into the genetic architecture of inherited susceptibility to OC, as did in previous studies [7-14].

MATERIALS AND METHODS

HaploReg is a tool for exploring annotations of the noncoding genome at variants on haplotype blocks [5]. HaploReg includes LD information from the 1000 Genomes Project, chromatin state and protein binding annotation from the Roadmap Epigenomics and the Encyclopedia of DNA Elements (ENCODE) projects, sequence conservation across mammals, the effect of SNPs on regulatory motifs, and the effect of SNPs on gene expression from eQTL studies [5]. We used HaploReg (version 4.1) to identify the rs2072590 tagged variants using the LD information from the 1000 Genomes Project (EUR) with r2 > = 0.8 [5]. RegulomeDB (version 1.1) is a database that annotates SNPs with known and predicted regulatory elements in the intergenic regions of the human genome [4]. Known and predicted regulatory DNA elements include regions of DNAase hypersensitivity, binding sites of transcription factors, and promoter regions that have been biochemically characterized to regulation transcription [4]. RegulomeDB (version 1.1) includes the public datasets from Gene Expression Omnibus (GEO), the ENCODE project, and published literature [4]. PhenoScanner (version 1.1) is a curated database holding publicly available results from large-scale GWAS [6]. The motivation for creating this tool is to facilitate “phenome scans”, the cross-referencing of genetic variants with a broad range of phenotypes, to help aid the understanding of disease pathways and biology [6]. The catalogue currently contains nearly 3 billion associations and over 10 million unique SNPs [6]. The results are aligned across traits to the same effect and non-effect alleles for each SNP [6].
  14 in total

1.  DNase hypersensitive sites and association with multiple sclerosis.

Authors:  Giulio Disanto; Geir Kjetil Sandve; Vito A G Ricigliano; Julia Pakpoor; Antonio J Berlanga-Taylor; Adam E Handel; Jens Kuhle; Lars Holden; Corey T Watson; Gavin Giovannoni; Lahiru Handunnetthi; Sreeram V Ramagopalan
Journal:  Hum Mol Genet       Date:  2013-10-02       Impact factor: 6.150

2.  Genome-wide association study identifies new susceptibility loci for epithelial ovarian cancer in Han Chinese women.

Authors:  Kexin Chen; Hongxia Ma; Lian Li; Rongyu Zang; Cheng Wang; Fengju Song; Tingyan Shi; Dianke Yu; Ming Yang; Wenqiong Xue; Juncheng Dai; Shuang Li; Hong Zheng; Chen Wu; Ying Zhang; Xiaohua Wu; Dake Li; Fengxia Xue; Haixin Li; Zhi Jiang; Jibin Liu; Yuexin Liu; Pei Li; Wen Tan; Jing Han; Jiang Jie; Quan Hao; Zhibin Hu; Dongxin Lin; Ding Ma; Weihua Jia; Hongbing Shen; Qingyi Wei
Journal:  Nat Commun       Date:  2014-08-19       Impact factor: 14.919

3.  A genome-wide association study identifies susceptibility loci for ovarian cancer at 2q31 and 8q24.

Authors:  Ellen L Goode; Georgia Chenevix-Trench; Honglin Song; Susan J Ramus; Maria Notaridou; Kate Lawrenson; Martin Widschwendter; Robert A Vierkant; Melissa C Larson; Susanne K Kjaer; Michael J Birrer; Andrew Berchuck; Joellen Schildkraut; Ian Tomlinson; Lambertus A Kiemeney; Linda S Cook; Jacek Gronwald; Montserrat Garcia-Closas; Martin E Gore; Ian Campbell; Alice S Whittemore; Rebecca Sutphen; Catherine Phelan; Hoda Anton-Culver; Celeste Leigh Pearce; Diether Lambrechts; Mary Anne Rossing; Jenny Chang-Claude; Kirsten B Moysich; Marc T Goodman; Thilo Dörk; Heli Nevanlinna; Roberta B Ness; Thorunn Rafnar; Claus Hogdall; Estrid Hogdall; Brooke L Fridley; Julie M Cunningham; Weiva Sieh; Valerie McGuire; Andrew K Godwin; Daniel W Cramer; Dena Hernandez; Douglas Levine; Karen Lu; Edwin S Iversen; Rachel T Palmieri; Richard Houlston; Anne M van Altena; Katja K H Aben; Leon F A G Massuger; Angela Brooks-Wilson; Linda E Kelemen; Nhu D Le; Anna Jakubowska; Jan Lubinski; Krzysztof Medrek; Anne Stafford; Douglas F Easton; Jonathan Tyrer; Kelly L Bolton; Patricia Harrington; Diana Eccles; Ann Chen; Ashley N Molina; Barbara N Davila; Hector Arango; Ya-Yu Tsai; Zhihua Chen; Harvey A Risch; John McLaughlin; Steven A Narod; Argyrios Ziogas; Wendy Brewster; Aleksandra Gentry-Maharaj; Usha Menon; Anna H Wu; Daniel O Stram; Malcolm C Pike; Jonathan Beesley; Penelope M Webb; Xiaoqing Chen; Arif B Ekici; Falk C Thiel; Matthias W Beckmann; Hannah Yang; Nicolas Wentzensen; Jolanta Lissowska; Peter A Fasching; Evelyn Despierre; Frederic Amant; Ignace Vergote; Jennifer Doherty; Rebecca Hein; Shan Wang-Gohrke; Galina Lurie; Michael E Carney; Pamela J Thompson; Ingo Runnebaum; Peter Hillemanns; Matthias Dürst; Natalia Antonenkova; Natalia Bogdanova; Arto Leminen; Ralf Butzow; Tuomas Heikkinen; Kari Stefansson; Patrick Sulem; Sören Besenbacher; Thomas A Sellers; Simon A Gayther; Paul D P Pharoah
Journal:  Nat Genet       Date:  2010-09-19       Impact factor: 38.330

4.  Functional annotation of risk loci identified through genome-wide association studies for prostate cancer.

Authors:  Yizhen Lu; Zheng Zhang; Hongjie Yu; S Lily Zheng; William B Isaacs; Jianfeng Xu; Jielin Sun
Journal:  Prostate       Date:  2010-12-06       Impact factor: 4.104

5.  Alzheimer's Disease Variants with the Genome-Wide Significance are Significantly Enriched in Immune Pathways and Active in Immune Cells.

Authors:  Qinghua Jiang; Shuilin Jin; Yongshuai Jiang; Mingzhi Liao; Rennan Feng; Liangcai Zhang; Guiyou Liu; Junwei Hao
Journal:  Mol Neurobiol       Date:  2016-01-09       Impact factor: 5.590

6.  Top associated SNPs in prostate cancer are significantly enriched in cis-expression quantitative trait loci and at transcription factor binding sites.

Authors:  Junfeng Jiang; Peilin Jia; Bairong Shen; Zhongming Zhao
Journal:  Oncotarget       Date:  2014-08-15

7.  PhenoScanner: a database of human genotype-phenotype associations.

Authors:  James R Staley; James Blackshaw; Mihir A Kamat; Steve Ellis; Praveen Surendran; Benjamin B Sun; Dirk S Paul; Daniel Freitag; Stephen Burgess; John Danesh; Robin Young; Adam S Butterworth
Journal:  Bioinformatics       Date:  2016-06-17       Impact factor: 6.937

8.  Associations between gene expression variations and ovarian cancer risk alleles identified from genome wide association studies.

Authors:  Hua Zhao; Jie Shen; Dan Wang; Yuqing Guo; Steven Gregory; Leonardo Medico; Qiang Hu; Li Yan; Kunle Odunsi; Shashikant Lele; Song Liu
Journal:  PLoS One       Date:  2012-11-02       Impact factor: 3.240

9.  Comprehensive functional annotation of seventy-one breast cancer risk Loci.

Authors:  Suhn Kyong Rhie; Simon G Coetzee; Houtan Noushmehr; Chunli Yan; Jae Mun Kim; Christopher A Haiman; Gerhard A Coetzee
Journal:  PLoS One       Date:  2013-05-22       Impact factor: 3.240

10.  Colorectal cancer risk genes are functionally enriched in regulatory pathways.

Authors:  Xi Lu; Mingming Cao; Su Han; Youlin Yang; Jin Zhou
Journal:  Sci Rep       Date:  2016-05-05       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.