| Literature DB >> 23346221 |
Min Ding1, Haiyun Wang, Jiajia Chen, Bairong Shen, Zhonghua Xu.
Abstract
Estrogen receptor (ER) is a crucial molecule symbol of breast cancer. Molecular interactions between ER complexes and DNA regulate the expression of genes responsible for cancer cell phenotypes. However, the positions and mechanisms of the ER binding with downstream gene targets are far from being fully understood. ChIP-Seq is an important assay for the genome-wide study of protein-DNA interactions. In this paper, we explored the genome-wide chromatin localization of ER-DNA binding regions by analyzing ChIP-Seq data from MCF-7 breast cancer cell line. By integrating three peak detection algorithms and two datasets, we localized 933 ER binding sites, 92% among which were located far away from promoters, suggesting long-range control by ER. Moreover, 489 genes in the vicinity of ER binding sites were identified as estrogen response elements by comparison with expression data. In addition, 836 single nucleotide polymorphisms (SNPs) in or near 157 ER-regulated genes were found in the vicinity of ER binding sites. Furthermore, we annotated the function of the nearest-neighbor genes of these binding sites using Gene Ontology (GO), KEGG, and GeneGo pathway databases. The results revealed novel ER-regulated genes pathways for further experimental validation. ER was found to affect every developed stage of breast cancer by regulating genes related to the development, progression, and metastasis. This study provides a deeper understanding of the regulatory mechanisms of ER and its associated genes.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23346221 PMCID: PMC3546463 DOI: 10.1155/2012/568950
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1The ChIP-Seq data analyzing pipeline.
The CHIP-Seq datasets.
| Dataset | Platform | Cell line | Sample information |
|---|---|---|---|
| GSE19013 | Illumina | MCF-7 | Ethanol treated |
| E2-treated | |||
| GSE14664 | Illumina | MCF-7 | ER_minus_ligand |
| ER_E2 |
An overview of the characteristics of different Chip-Seq peak detection algorithm.
| Algorithm | Profile | Background model | Control sample | Use control to compute FDR |
|---|---|---|---|---|
| F-Seq | Kernel density estimation (KDE) | √ | ||
| FindPeaks | Aggregation of overlapped tags | Monte Carlo | ||
| SISSRs | Window scan | Poisson | √ | |
| QuEST | Kernel density estimation (KDE) | √ | √ | |
| MACS | Tags shifted then window scan | dynamic Poisson | √ | √ |
| CisGenome | Strand-specific window scan | Negative binomial | √ | √ |
Number of ER binding sites identified by three ChIP-Seq peak calling programs (FDR < 0.01).
| Number of ER binding sites | |||||
|---|---|---|---|---|---|
| Dataset | Number of overlaped sites | ||||
| CisGenome | MACS | QuEST | |||
| GSE19013 | 8137 | 5583 | 5418 | 2019 | 933 |
| GSE14664 | 6773 | 7765 | 9280 | 5061 | |
Figure 2Comparison of QuEST, CisGenome, and MACS predicted result. (a) The FDR value in the dataset of GSE19013. (b) The FDR value in the dataset of GSE14664.
Figure 3The genomic binding sites of ER. (a) The consensus motif identified in the ERE binding sites. De novo motif search was performed using the MEME program. (b) The percentage of occurrences of ERE motifs in ER binding sites. (c) Comparison of the occurrences of ERE motifs between published and newly identified binding sites.
Figure 4Location analysis of ER binding sites. (a) locations relative to nearest-neighbor genes. (b) Genomic Locations of ER ChIP-Seq peaks. (c) Genomic locations of ER ChIP-Seq peaks within +10~+100 kb.
Breast cancer gene expression dataset and differently expressed genes number (q-value < 1%).
| Author | Journal | Array type | Sample | Sample | Differently expressed genes | |
|---|---|---|---|---|---|---|
| Upregulated | Downregulated | |||||
| Graham et al. [ | Clin Cancer Res | Affy | 15 | 15 | 709 | 333 |
| Wang et al. [ | Lancet | Affy | 209 | 77 | 2081 | 2537 |
| Lu et al. [ | Breast Cancer Res Treat | Affy | 76 | 53 | 5136 | 5445 |
|
| ||||||
| All | 5692 | 6101 | ||||
Figure 5Genomic Locations of differentially expressed genes in the vicinity of ER binding sites.
Figure 6Directed Acyclic Graphs (DAGs) of significantly enriched GO (Gene Ontology) categories (P < 0.01).
The comparison of top enriched GO categories between different expressed and other nearest-neighbor genes of ER binding sites (number of genes ≥ 100).
| Genes set | Biological process | Molecular function | Cellular component |
|---|---|---|---|
| Differently expressed | Biological regulation, metabolic process, | Protein binding, | Membrane, |
| Others | Biological regulation, metabolic process | Protein binding | Membrane |
KEGG pathways enriched with the nearest-neighbor genes of ER binding sites (P-value < 0.01).
| KEGG ID | Pathways name |
| Number of genes | Number of different expressed genes |
|---|---|---|---|---|
| hsa05200 | Pathways in cancer | 2.24 | 22 | 16 |
| hsa04510 | Focal adhesion | 0.0002 | 15 | 14 |
| hsa04360 | Axon guidance | 0.0009 | 11 | 8 |
| hsa04810 | Regulation of actin cytoskeleton | 0.0012 | 14 | 11 |
| hsa04010 | MAPK signaling pathway | 0.0022 | 15 | 12 |
| hsa04114 | Oocyte meiosis | 0.0024 | 9 | 8 |
| hsa04144 | Endocytosis | 0.0024 | 12 | 11 |
| hsa04115 | p53 signaling pathway | 0.0024 | 7 | 7 |
| hsa05216 | Thyroid cancer | 0.0024 | 5 | 4 |
| hsa05218 | Melanoma | 0.0033 | 7 | 3 |
| hsa04020 | Calcium signaling pathway | 0.004 | 11 | 4 |
| hsa04062 | Chemokine signaling pathway | 0.0064 | 11 | 9 |
| hsa04914 | Progesterone-mediated oocyte maturation | 0.0085 | 7 | 7 |
| hsa01100 | Metabolic pathways | 0.0086 | 35 | 28 |
| hsa00450 | Selenoamino acid metabolism | 0.0088 | 4 | 3 |
| hsa05414 | Dilated cardiomyopathy | 0.0096 | 7 | 7 |
| hsa03440 | Homologous recombination | 0.0097 | 4 | 3 |
Terms of the enriched GeneGo pathway maps (P-value < 0.01, FDR < 0.05).
| GeneGo pathway terms |
|
|---|---|
| Apoptosis and survival_APRIL and BAFF signaling | 1.29889 |
| Development_prolactin receptor signaling | 4.95517 |
| Development_glucocorticoid receptor signaling | 5.81237 |
| Development_ligand-independent activation of ESR1 and ESR2 | 0.000295251 |
| Immune response_IL-22 signaling pathway | 0.000381484 |
| Development_EPO-induced Jak-STAT pathway | 0.000531744 |
| Development_growth hormone signaling via STATs and PLC/IP3 | 0.000531744 |
| Cytoskeleton remodeling_keratin filaments | 0.000622315 |
| Development_GM-CSF signaling | 0.000660576 |
| Transcription_transcription regulation of aminoacid metabolism | 0.000752764 |