| Literature DB >> 30486775 |
Florencio Pazos1, Adrian Garcia-Moreno2, Juan C Oliveros2.
Abstract
BACKGROUND: Epigenetic phenomena are crucial for explaining the phenotypic plasticity seen in the cells of different tissues, developmental stages and diseases, all holding the same DNA sequence. As technology is allowing to retrieve epigenetic information in a genome-wide fashion, massive epigenomic datasets are being accumulated in public repositories. New approaches are required to mine those data to extract useful knowledge. We present here an automatic approach for detecting genomic regions with epigenetic variation patterns across samples related to a grouping of these samples, as a way of detecting regions functionally associated to the phenomenon behind the classification.Entities:
Keywords: Epigenetics; Epigenomics; Gene transcription regulation
Mesh:
Year: 2018 PMID: 30486775 PMCID: PMC6264639 DOI: 10.1186/s12864-018-5286-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Schema of the methodology used for detecting genome segments with an epigenetic pattern resembling a sample classification. For a given sample, different epigenomic markers are quantified in a genome-wide fashion (a). For a given region of the genome, all these markers are collapsed into a single epigenetic state (colors) using ChromHMM (b). This is done for hundreds of different samples (c). These samples can be classified according with different criteria (into three groups in this example: brown, pink and blue). The epigenetic pattern of a given segment of the genome (e) is compared with an equivalent pattern representing this sample classification (f) using a “mutual information” based approach. One of the patterns is shuffled thousands of times in order to generate a null distribution of MI scores from where to extract a p-value for the MI score of the segment of interest (g). The process is repeated for all other windows in the human genome (d). The genome segments with the highest MI values and significant p-values are taken as those related to the sample classification (h). Lung clipart source: Wikimedia Commons (http://commons.wikimedia.org/)
Enriched processes in the three datasets. For the brain dataset, only the top of the list is shown. The whole lists with additional information (e.g. genes in each process), are available as Additional file 3
| BRAIN | FETAL | CANCER | ||||||
|---|---|---|---|---|---|---|---|---|
| GO.ID | Term | Fisher Test | GO.ID | Term | Fisher Test | GO.ID | Term | Fisher Test |
| GO:0007399 | nervous system development | < 1e-30 | GO:0007166 | cell surface receptor signaling pathway | 1.80E-05 | GO:0000278 | mitotic cell cycle | 1.60E-04 |
| GO:0022008 | neurogenesis | 6.50E-27 | GO:0007167 | enzyme linked receptor protein signaling... | 1.50E-04 | GO:0098912 | membrane depolarization during atrial ca... | 1.90E-04 |
| GO:0031175 | neuron projection development | 7.90E-27 | GO:0009966 | regulation of signal transduction | 1.90E-04 | GO:0007049 | cell cycle | 2.60E-04 |
| GO:0030182 | neuron differentiation | 1.80E-26 | GO:0035272 | exocrine system development | 2.00E-04 | GO:0060850 | regulation of transcription involved in cell fate | 3.70E-04 |
| GO:0048699 | generation of neurons | 2.60E-26 | GO:0001503 | ossification | 2.10E-04 | GO:0006928 | movement of cell or subcellular componen... | 4.40E-04 |
| GO:0048666 | neuron development | 7.00E-26 | GO:0060605 | tube lumen cavitation | 3.00E-04 | GO:0007018 | microtubule-based movement | 5.90E-04 |
| GO:0048812 | neuron projection morphogenesis | 2.00E-23 | GO:0060662 | salivary gland cavitation | 3.00E-04 | GO:0086045 | membrane depolarization during AV node c... | 6.20E-04 |
| GO:0048468 | cell development | 2.50E-23 | GO:0023051 | regulation of signaling | 3.30E-04 | GO:0007417 | central nervous system development | 9.50E-04 |
| GO:0048731 | system development | 8.40E-22 | GO:0016055 | Wnt signaling pathway | 4.20E-04 | |||
| GO:0048667 | cell morphogenesis involved in neuron di... | 2.40E-21 | GO:0010646 | regulation of cell communication | 4.20E-04 | |||
| GO:0050808 | synapse organization | 2.40E-19 | GO:0022008 | neurogenesis | 5.10E-04 | |||
| GO:0048856 | anatomical structure development | 5.70E-19 | GO:0007169 | transmembrane receptor protein tyrosine ... | 5.20E-04 | |||
| GO:0030030 | cell projection organization | 1.10E-18 | GO:0048699 | generation of neurons | 6.30E-04 | |||
| GO:0007267 | cell-cell signaling | 2.50E-17 | GO:0023056 | positive regulation of signaling | 7.90E-04 | |||
| GO:0007275 | multicellular organismal development | 3.00E-17 | GO:0060137 | maternal process involved in parturition | 8.40E-04 | |||
| GO:0048858 | cell projection morphogenesis | 4.40E-17 | GO:2000794 | regulation of epithelial cell proliferat... | 8.40E-04 | |||
| GO:0007268 | synaptic transmission | 4.70E-17 | GO:0007435 | salivary gland morphogenesis | 8.80E-04 | |||
| GO:0023052 | signaling | 5.70E-17 | GO:0010647 | positive regulation of cell communicatio... | 9.50E-04 | |||
| GO:0032990 | cell part morphogenesis | 6.40E-17 | ||||||
| GO:0044700 | single organism signaling | 7.70E-17 | ||||||
| GO:0061564 | axon development | 9.80E-17 | ||||||
| GO:0000904 | cell morphogenesis involved in different... | 3.00E-16 | ||||||
| GO:0032502 | developmental process | 3.30E-16 | ||||||
| GO:0007409 | axonogenesis | 7.60E-16 | ||||||
| GO:0044767 | single-organism developmental process | 9.80E-16 | ||||||
| GO:0032989 | cellular component morphogenesis | 2.80E-15 | ||||||
| GO:0000902 | cell morphogenesis | 2.90E-15 | ||||||
| GO:0051960 | regulation of nervous system development | 5.30E-15 | ||||||
| GO:0050803 | regulation of synapse structure or activ... | 8.80E-15 | ||||||
| GO:0007154 | cell communication | 1.60E-14 | ||||||
| GO:0007416 | synapse assembly | 2.10E-14 | ||||||
| GO:0050804 | modulation of synaptic transmission | 7.00E-13 | ||||||
| GO:0044707 | single-multicellular organism process | 8.10E-13 |
Fig. 2Example of two genomic regions with an epigenetic pattern related to brain. The epigenetic state of each 400 bp portion (small boxes) is indicated with colors (legend at the bottom). The two discussed regions (5000 bp windows) are highlighted, surrounded by their epigenomic neighborhood. The sample classification is indicated with light blue (brain) and dark blue (others). The brain samples are further highlighted with a box. The sample names are on the right, colored using the tissue-based color schema of the Roadmap Epigenomics Consortium
Fig. 5Distribution of patterns of epigenetic change in the genomic segments detected for the three datasets. In the X axis, the patterns of epigenetic change are represented. The top box shows the majority epigenetic state for the segment in the samples of a given class (brain, fetal or cancer), and the box below that of the complementary set. The Y axis represents the proportion of detected genomic segments showing that particular pattern of change for the three datasets. For example, 20% of the detected segments in the “fetal” dataset (gold) are in “enhancer” state (yellow) in the fetal samples while “heterochromatized”(purple) in the adult samples
Fig. 3Example of two genomic regions with an epigenetic pattern related to fetal samples. Same representation as in Fig. 2 for fetal samples (dark blue) vs. others (light blue). By the Roadmap coloring of the samples it is evident that the “fetal” set contains now a variety of (fetal) tissues/organs
Fig. 4Example of two genomic regions with an epigenetic pattern related to cancer. Same representation as in Figs. 2 and 3 for cancer samples (dark blue) vs. healthy tissues (light blue)