| Literature DB >> 26934861 |
Mingyang Cai1,2,3, Sewoon Kim1, Kai Wang2,3,4, Peggy J Farnham5,6, Gerhard A Coetzee7,8, Wange Lu1.
Abstract
Genome-wide association studies (GWAS) have identified >100 independent susceptibility loci for prostate cancer, including the hot spot at 8q24. However, how genetic variants at this locus confer disease risk hasn't been fully characterized. Using circularized chromosome conformation capture (4C) coupled with next-generation sequencing and an enhancer at 8q24 as "bait", we identified genome-wide partners interacting with this enhancer in cell lines LNCaP and C4-2B. These 4C-identified regions are distributed in open nuclear compartments, featuring active histone marks (H3K4me1, H3K4me2 and H3K27Ac). Transcription factors NKX3-1, FOXA1 and AR (androgen receptor) tend to occupy these 4C regions. We identified genes located at the interacting regions, and found them linked to positive regulation of mesenchymal cell proliferation in LNCaP and C4-2B, and several pathways (TGF beta signaling pathway in LNCaP and p53 pathway in C4-2B). Common genes (e.g. MYC and POU5F1B) were identified in both prostate cancer cell lines. However, each cell line also had exclusive genes (e.g. ELAC2 and PTEN in LNCaP and BRCA2 and ZFHX3 in C4-2B). In addition, BCL-2 identified in C4-2B might contribute to the progression of androgen-refractory prostate cancer. Overall, our work reveals key genes and pathways involved in prostate cancer onset and progression.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26934861 PMCID: PMC4776156 DOI: 10.1038/srep22462
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Overview of 4C-seq process and bait region at 8q24 locus.
(A) “Bait” segment is shown in black while “capture” segments are shown in red, blue and orange. Two rounds of digestion are included, and inverse PCR is performed to amplify unknown captured sequence. (B) Upper panel shows a broad view of human 8q24 locus, and lower panel shows a detailed view of AcP10 region at 8q24 locus. AcP10 region is highlighted in cyan. The pair of primers for inverse PCR is shown in red triangles. In gene annotation track, coding gene (POU5F1B) is shown in blue, while non-coding genes (CASC8 and RP11-382A18.2) are shown in green.
Figure 2Consistency between replicates.
(A) Scatter plot showing interactions density. The numbers of interaction sites in each genomic bin (2Mb trans and 1Mb cis) in two replicates are plotted. Color scale indicates interaction density and Pearson correlation coefficient is shown in the panel. (B) Heatmap and dendrogram showing the interactions density and clustering of samples. Complete linkage method with Euclidean distance measure is used. Color scale indicates interaction density. BR: biological replicate.
Figure 3Distribution of 4C reads and sites.
(A) Stacked bar plot showing the percentage of mapped reads in cis and in trans in two replicates of LNCaP and C4-2B. (B) A Venn diagram showing the number of overlapping sites between two biological replicates in LNCaP and C4-2B. (C) Boxplot showing the distributions of overlapping between biological replicates (red dot for LNCaP and blue dot for C4-2B) and between randomly simulated data.
Summary of metrics in 4C-seq analysis in LNCaP and C4-2B.
| cell line | BR | # read pairs | # non-random sites | # overlapping non-random sites | # cis-sites | # overlapping cis-sites | # trans-sites | # overlapping trans-sites |
|---|---|---|---|---|---|---|---|---|
| LNCaP | 1 | 27,518,509 | 4,529 | 2,096 | 403 (8.9%) | 208 (9.9%) | 4,126 (91.1%) | 1,888 (90.1%) |
| 2 | 38,637,799 | 3,840 | 322 (8.4%) | 3,518 (91.6%) | ||||
| C4-2B | 1 | 25,060,263 | 2,536 | 897 | 208 (8.2%) | 102 (11.4%) | 2,328 (91.8%) | 795 (88.6%) |
| 2 | 39,250,858 | 2,041 | 181 (8.9%) | 1,860 (91.1%) | ||||
| Percentages are shown in brackets. Two biological replicates (BRs) were performed for each cell line. | ||||||||
Figure 4Significant 4C domains.
Genome-wide distribution of significant domains in LNCaP (red) and C4-2B (blue) is shown in tabular fashion (A) and circos plot (B).
Figure 5Enrichment of histone marks and transcription factor bindings.
(A) Boxplot of different open chromatin marks distribution around the interacting sites in LNCaP (red), C4-2B (blue), and random sites (grey). ChIP-seq tags within +/− 500 kb around interacting sites were counted and normalized to 10 million. (B) Boxplot of transcription factors binding profile around the interacting sites in LNCaP (red), C4-2B (blue), and random sites (grey). ChIP-seq tags within +/− 500 kb around interacting sites were counted and normalized to 10 million.
Figure 6Gene enrichment analysis.
Results of gene enrichment analysis in LNCaP (A) and C4-2B (B).
Significant enrichment of ontology annotations of genes associated with 4C sites in LNCaP (A) and C4-2B (B).
Prostate cancer associated seed genes (annotated by Phenolyzer) identified by 4C-seq in LNCaP and C4-2B.
| Cell line | prostate cancer seed genes from Phenolyzer |
|---|---|
| LNCaP | (159 genes) ELAC2(7), PTEN(10), MSR1(15), EHBP1(16), MAD1L1(22), |
| C4-2B | (84 genes) BRCA2(8), ZFHX3(18), HSD17B3(21), |
Numbers enclosed with brackets () are ranks calculated by Phenolyzer. In the table, genes are listed with decreasing rank, and genes exist in both LNCaP and C4-2B results are underlined.