| Literature DB >> 34930913 |
Hongji Dai1, Xinlei Chu1, Qian Liang2, Mengyun Wang3,4, Lian Li1, Yao Zhou2, Zhanye Zheng2, Wei Wang1, Zhao Wang2, Haixin Li5, Jianhua Wang2, Hong Zheng1, Yanrui Zhao1, Luyang Liu1, Hongcheng Yao6, Menghan Luo2, Qiong Wang1, Shan Kang7, Yan Li8, Ke Wang9, Fengju Song1, Ruoxin Zhang3,4, Xiaohua Wu3,4, Xi Cheng3,4, Wei Zhang10,11, Qingyi Wei12,13,14, Mulin Jun Li15,16, Kexin Chen17.
Abstract
Ovarian cancer survival varies considerably among patients, to which germline variation may also contribute in addition to mutational signatures. To identify genetic markers modulating ovarian cancer outcome, we performed a genome-wide association study in 2130 Chinese ovarian cancer patients and found a hitherto unrecognized locus at 3p26.1 to be associated with the overall survival (Pcombined = 8.90 × 10-10). Subsequent statistical fine-mapping, functional annotation, and eQTL mapping prioritized a likely casual SNP rs9311399 in the non-coding regulatory region. Mechanistically, rs9311399 altered its enhancer activity through an allele-specific transcription factor binding and a long-range interaction with the promoter of a lncRNA BHLHE40-AS1. Deletion of the rs9311399-associated enhancer resulted in expression changes in several oncogenic signaling pathway genes and a decrease in tumor growth. Thus, we have identified a novel genetic locus that is associated with ovarian cancer survival possibly through a long-range gene regulation of oncogenic pathways.Entities:
Year: 2021 PMID: 34930913 PMCID: PMC8688503 DOI: 10.1038/s41421-021-00342-6
Source DB: PubMed Journal: Cell Discov ISSN: 2056-5968 Impact factor: 38.079
Clinical characteristics of patients in each study.
| Characteristics | Discovery set | Replication set | Combined ( | ||
|---|---|---|---|---|---|
| SOCS-I ( | SOCS-II ( | TOCS-I ( | TOCS-II ( | ||
| Age at diagnosis, mean (SD) | 54.5 (9.9) | 54.7 (10.6) | 54.7 (9.6) | 52.9 (11.7) | 54.2 (10.7) |
| I | 13 (3.9) | 80 (7.9) | 32 (16.1) | 142 (24.3) | 267 (12.5) |
| II | 31 (9.2) | 126 (12.5) | 40 (20.1) | 63 (10.8) | 260 (12.2) |
| III | 258 (76.6) | 676 (67.0) | 103 (51.8) | 350 (59.8) | 1387 (65.1) |
| IV | 35 (10.4) | 127 (12.6) | 24 (12.1) | 30 (5.1) | 216 (10.1) |
| HGSOC | 303 (89.9) | 838 (83.1) | 109 (54.8) | 311 (53.2) | 1561 (73.3) |
| LGSOC | 2 (0.6) | 19 (1.9) | 6 (3.0) | 49 (8.4) | 76 (3.6) |
| ENOC | 13 (3.9) | 30 (3.0) | 58 (29.1) | 118 (20.2) | 219 (10.3) |
| CCOC | 7 (2.1) | 56 (5.6) | 5 (2.5) | 15 (2.6) | 83 (3.9) |
| MOC | 8 (2.4) | 29 (2.9) | 10 (5.0) | 38 (6.5) | 85 (4.0) |
| Other/unknown | 4 (1.2) | 37 (3.7) | 11 (5.5) | 54 (9.2) | 106 (5.0) |
| Mortality (%)a | 48.1 | 16.7 | 48.2 | 27.5 | 27.6 |
| Median survival time (months)b | 67 | 95 | 64 | 77 | 75 |
TOCS Tianjin Ovarian Cancer Study, SOCS Shanghai Ovarian Cancer Study, FIGO International Federation of Gynecology and Obstetrics, HGSOC High-Grade Serous Ovarian Cancer, LGSOC Low-Grade Serous Ovarian Cancer, ENOC ENdometrioid Ovarian Cancer, CCOC Clear Cell Ovarian Cancer, MOC Mucinous Ovarian Cancer.
aPercentage of the total deaths as recorded by the last date of follow-up.
bMedian survival time was estimated from the KM survival curve.
Fig. 1Results of genome-wide association study of survival in ovarian cancer patients.
a Manhattan plot of results of genome-wide association analysis for survival in the discovery stage. Each point represents the negative log P value for an association with overall survival. The horizontal blue line shows the suggestive threshold of 1 × 10−5. b Regional plot for association statistics at the 3p26.1 region. Results are shown for SNPs in the region 500 kb up- or downstream of the leading SNP rs7631664. Each dot represents the negative log P value for the association statistics. The top associated SNP is colored in purple (circle in the discovery stage and diamonds in the combined stages) and the remaining SNPs are colored according to linkage disequilibrium values (r2) with the top SNP in the discovery stage. c Kaplan–Meier estimates of the overall survival time for ovarian cancer patients stratified by genotypes of rs7631664. The left, middle, and right represent the discovery cohort from SOCS, replication cohort from TOCS, and combined cohort, respectively. All P < 0.001 for the log-rank test.
HRs and MSTs by rs7631664 genotypes in the discovery, replication, and combined GWAS samplesa.
| Genotypes | MST | HR (95% CI)b | HR (95% CI)c | |||
|---|---|---|---|---|---|---|
| 1.60 (1.32–1.94) | 1.77 × 10−6 | 1.59 (1.31–1.93) | 2.54 × 10−6 | |||
| AA | 962 (71.5) | 88 | ||||
| AG | 354 (26.3) | 53 | ||||
| GG | 29 (2.2) | 46 | ||||
| 1.56 (1.24–1.95) | 1.23 × 10−4 | 1.45 (1.15–1.82) | 1.45 × 10−3 | |||
| AA | 540 (68.9) | 78 | ||||
| AG | 224 (28.6) | 53 | ||||
| GG | 20 (2.6) | 19 | ||||
| 1.58 (1.36–1.83) | 8.90 × 10−10 | 1.53 (1.32–1.78) | 1.62 × 10−8 | |||
| AA | 1502 (70.5) | 86 | ||||
| AG | 578 (27.1) | 53 | ||||
| GG | 49 (2.3) | 38 |
HR hazard ratio, MST median survival time (months).
aGenotypes of rs7631664 were extracted from directly genotyping data.
bHR (95% CI). HR and P value was calculated using multivariable-adjusted Cox regression under a log-additive genetic model, adjusting for the top three principal components of population stratification.
cCox regression model was further adjusted by age and clinical stage.
dOne patient with missing genotype of rs7631664 in SOCS-1.
Fig. 2Functional annotations and prioritization of associated SNPs at 3p26.1 region.
a Regional plot of association signals in the combined stages and annotations with ovary-specific epigenomics data, including DNase-Seq, H3K27ac, and H3K4me1 histone modification ChIP-seq profiles for primary ovary tissue (data from Roadmap), and ATAC-seq profiles from three ovarian cancer cell lines (OVCA432, DOV13, and SKOV3). The region of rs9311399 overlaps with peaks of H3K27ac, H3K4me1, and DNase I hypersensitivity sites (DHSs) measured by DNase-Seq and ATAC-seq in ovary tissues/cells. b TF footprint analysis using ATAC-seq. Footprint analysis showed that the rs9311399 associated sequences could affect the TF footprint. The average profiles estimated by Sasquatch software showed the highest-scoring k-mer pair (blue = reference, red = variant). The k-mers were two sequences showing the biggest difference of protein footprint with and without the investigated variant. The number represented times that k-mer within open chromatin sites (indicated by #). c Luciferase reporter assay using vectors containing rs9311399 in OVCA432 cells. Luciferase signals were normalized to Renilla signals (n = 3). Luciferase reporter assay showed the effect allele A fragment had a higher activity, compared with the non-effect allele C fragment. d EMSA assay showed rs9311399-A caused enhanced protein binding relative to rs9311399-C using synthetic allele-specific probes and nuclear extracts from SKOV3 cells. The rs9311399-C allele was associated with a reduction of 20% in the intensity of the shifted protein band compared to the A allele. The competitor was the unlabeled probe with the corresponding genotype. Statistical comparisons of relative luciferase activity were undertaken using Student’s t tests. Data are shown as means ± sd with *P < 0.05, ***P < 0.001.
Fig. 3Allele-specific effect of rs9311399 on enhancer activity and protein binding.
a eQTL analysis from Chinese ovarian cancer patients showed the rs9311399 A allele was significantly associated with high expression of BHLHE40-AS1 in ovarian cancer tissues, P = 0.002 for linear regression adjusted by several confounding factors, and outlier samples in the 1.5 interquartile range (IQR) of gene expression range (measured by TPM) were removed. b Survival plot for patients in the TCGA dataset. The yellow line represents patients with low expression of mRNA for both BHLHE40 and BHLHE40-AS1 genes, while the blue line indicates those with high expression of at least one of these two genes. P = 0.034 for the log-rank test. c Silver staining of rs9311399 A/C DNA pulldown proteins and mass spectrometric analysis of rs9311399-A specific binding proteins. DNA pulldown of nuclear protein extract from SKOV3 cells with biotin-labeled rs9311399-A and rs9311399-C. The eluates were resolved on SDS-PAGE and silver-stained. The two differential bands that co-precipitated with rs9311399-A indicated by the black and red arrow were cut from the gel and followed by mass spectrometric analysis. Detailed results from the mass spectrometric analysis are shown in Supplementary Data S4. d Motif scanning using ATAC-seq identified HOXB4 and HOXB8 could be altered by rs9311399. e The rs9311399-A probe brought down more HOXB4 or HOXB8 proteins, compared with the rs9311399-C probe. DNA pulldown followed by immunoblotting with antibodies against the indicated proteins. For quantification, the intensity of HOXB4 or HOXB8 that co-precipitated with rs9311399-C was set at 1.00.
Fig. 4rs9311399 physically interacts with the promoter of BHLHE40-AS1 and BHLHE40.
a 4C plot and associated epigenomic annotations for rs9311399. Using the rs9311399 region as the bait in OVCA429, 4C result indicated that the rs9311399 fragment could interact with the shared promoter of BHLHE40-AS1 and BHLHE40. b Enrichment quantification of 3C analysis confirmed chromatin interactions between BHLHE40 promoter and rs9311399 locus. The primer sites were marked by black dots, and the bait primer was indicated by the black arrow. c Sanger sequencing of 3C ligation products and the rs9311399 locus from input and 3C samples. The sequence from 3C samples showed a strong allele-specific effect for rs9311399-A. The black arrow denotes the SNP location. d A schematic diagram shows the region surrounding the rs9311399 to be deleted by the two sgRNAs. e qRT-PCR analysis indicated mRNA levels of BHLHE40 and BHLHE40-AS1 were reduced after knockout of the rs9311399 locus among control and KO clones. Statistical comparisons of relative gene expression were undertaken using Student’s t tests. Data are shown as means ± sd with *P < 0.05, **P < 0.01.
Fig. 5Deletion of rs9311399-associated enhancer leads to alteration in cancer-related pathways and tumorigenic capacity.
a Volcano plot of RNA-seq for differentially expressed genes between OVCA432 WT and KO cells. b, c Pathways enrichment analysis of RNA-seq differentially expressed genes using KEGG and Reactome gene sets. d, e The representative images showed colony formation assay. Deletion of rs9311399-associated enhancer reduced colony formation in OVCA432 WT and KO cells. f siRNA-mediated knockdown of BHLHE40-AS1 attenuates cell proliferation in both OVCA432 and SKOV3 cells. Statistical comparisons of colony numbers or cell proliferation were undertaken using Student’s t tests. Data are shown as mean ± sd with **P < 0.01, ***P < 0.001.