| Literature DB >> 31597351 |
Daniil Nikitin1,2, Nikita Kolosov3, Anastasiia Murzina4, Karina Pats5, Anton Zamyatin6, Victor Tkachev7, Maxim Sorokin8,9, Philippe Kopylov10, Anton Buzdin11,12,13.
Abstract
BACKGROUND: Retroelements (REs) are mobile genetic elements comprising ~40% of human DNA. They can reshape expression patterns of nearby genes by providing various regulatory sequences. The proportion of regulatory sequences held by REs can serve a measure of regulatory evolution rate of the respective genes and molecular pathways.Entities:
Keywords: H3K4me1; enhancers of transcription; epigenetics; gene expression regulation; gene ontology; histone modifications; human genome evolution; molecular pathways; retroelements; retrotransposons
Mesh:
Substances:
Year: 2019 PMID: 31597351 PMCID: PMC6830109 DOI: 10.3390/cells8101219
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Figure 1Correlations of H3K4me1 histone modification profiles with the RNA sequencing gene expression data. The histone modification profiles (vertical axis) were compared with normalized gene expressions (abscissa) for 6 GM12878, 10 HepG2 and 22 K562 cell line RNA sequencing experiments. Pearson correlations were measured for (A) set of all known genes; (B) set of protein coding genes; (C) set of microRNA genes and (D) set of long non-coding RNA genes.
Figure 2Comparison of RE-linked regulatory enrichment scores between five cell lines investigated. Each panel represents Pearson correlation plot for corresponding score, labels denote cell lines investigated. (A) GRE score; (B) NGRE score; (C) PII score and (D) NPII score. Color scale represents Pearson correlation.
Figure 3Comparison of GRE (abscissa axis) and NGRE (axis of ordinates) scores for human genes. Color indicates different gene groups (red—RRE-enriched genes, blue—intermediate genes, green—RRE-deficient genes). Each dot represents single gene. Pearson correlation coefficient (r) and Pearson p-value (p) are shown separately.
Figure 4Comparison of PII scores (abscissa axis) and NPII scores (axis of ordinates) for human molecular pathways. Color indicates different pathway groups (red—RRE-enriched molecular pathways, blue—intermediate pathways, green—RRE-deficient molecular pathways). Each dot represents single pathway. Pearson correlation coefficient (r) and Pearson p-value (p) are shown separately.
Analysis of human non-coding RNA RE-linked regulation.
| Group | Non-Coding RNA Class | Number of Non-Coding RNA Genes in the Sample | Expected Number of Non-Coding RNA Genes in the Sample, Random Distribution Model | Hypergeometric | Hypergeometric | Conclusion |
|---|---|---|---|---|---|---|
| RRE-enriched | lncRNA | 145 | 74 | 2.22 × 10−8 | 0.99999998 | lncRNAs are overrepresented |
| RRE-deficient | lncRNA | 54 | 74 | 0.9951 | 0.0049 | lncRNAs are underrepresented |
| RRE-enriched | microRNA | 145 | 90 | 3.49 × 10−8 | 0.99999997 | microRNAs are overrepresented |
| RRE-deficient | microRNA | 94 | 90 | 0.33 | 0.67 | microRNAs are neither overrepresented nor underrepresented |
Figure 5Random control of GO terms enrichment in RRE-enriched (A) and RRE-deficient (B) gene sets.
RRE enrichment of the molecular processes according to GO and pathway analyses.
| Group of Processes | Pathway Analysis | GO Analysis | Overall Status | ||
|---|---|---|---|---|---|
| Enriched | Deficient | Enriched | Deficient | ||
| Posttranscriptional silencing by small RNAs | 1 | 0 | 1 | 0 | enriched |
| DNA Metabolism and Chromatin Structure | 7 | 6 | 4 | 3 | enriched |
| Sensory Perception and Neurotransmission | 3 | 0 | 2 | 0 | enriched |
| Lipids Metabolism | 12 | 8 | 3 | 0 | enriched |
| Endocytosis | 5 | 4 | 0 | 0 | enriched |
| Immune System | 11 | 21 | 2 | 8 | deficient |
| Protein Ubiquitination and Degradation | 0 | 5 | 0 | 2 | deficient |
| Cell Adhesion, Migration and Interaction | 13 | 15 | 0 | 12 | deficient |
| Metals Metabolism and Ion Transport | 0 | 3 | 3 | 4 | deficient |
| Cell Death | 7 | 13 | 0 | 11 | deficient |
| General Signaling Pathways | 33 | 38 | 0 | 15 | deficient |
| Hormones Signaling Pathways | 0 | 3 | 0 | 4 | deficient |
| Stress Response | 0 | 0 | 0 | 2 | deficient |
| Response to Viruses | 0 | 0 | 0 | 2 | deficient |
| Amino Acids Metabolism | 2 | 4 | 0 | 0 | deficient |
| Detoxication of Xenobiotics | 0 | 8 | 0 | 0 | deficient |
| Protein Aggregation and Import | 0 | 0 | 0 | 7 | deficient |
| Morphogenesis | 16 | 9 | 0 | 45 | ambiguous |
| Cytoskeleton | 7 | 2 | 0 | 7 | ambiguous |
| RNA Synthesis and Degradation | 3 | 0 | 6 | 23 | ambiguous |
| Translation, Protein Export and Folding | 0 | 3 | 3 | 0 | ambiguous |
| Cell Cycle and Mitosis | 22 | 3 | 2 | 6 | ambiguous |
| Other Processes | 14 | 10 | 7 | 18 | ambiguous |
Comparison of isolated groups of processes in current study and in TFBS study [21].
| Group of Processes | Current Study | TFBS Study | Comment | ||
|---|---|---|---|---|---|
| Overall Status | Type of Analysis | Status | Type of Analysis | ||
| (Pathways/GO/Consensus) | (Pathways/GO/Consensus) | ||||
|
| |||||
| Sensory Perception and Neurotransmission | enriched | consensus | enriched | consensus | |
| Lipids Metabolism | enriched | consensus | enriched | consensus | |
| Protein Ubiquitination and Degradation | deficient | consensus | deficient | consensus | Corresponds to “Translation and Protein Quality Control” in the TFBS study |
| Posttranscriptional silencing by small RNAs | enriched | consensus | enriched | consensus | Identified by Gorilla software and validated using hypergeometric enrichment in both studies |
| DNA Metabolism and Chromatin Structure | enriched | consensus | enriched | consensus | Corresponds to “DNA repair” in the TFBS study |
| General Signaling Pathways | deficient | consensus | deficient | consensus | |
|
| |||||
| Stress Response | deficient | GO | deficient | GO | |
| Cell Adhesion, Migration and Interaction | deficient | consensus | deficient | GO | |
| Cell Death | deficient | consensus | deficient | GO | |
| Protein Aggregation and Import | deficient | GO | deficient | GO | Corresponds to “Protein Localization and Modification” in the TFBS study |
| Response to Viruses | deficient | GO | deficient | consensus | |
|
| |||||
| Immune System | deficient | consensus | ambiguous | - | |
| Metals Metabolism and Ion Transport | deficient | consensus | enriched | GO | |
| Hormones Signaling Pathways | deficient | consensus | enriched | pathways | Corresponds to “Hormones” in the TFBS study |
| Amino Acids Metabolism | deficient | pathways | enriched | consensus | |
| Detoxication of Xenobiotics | deficient | pathways | enriched | consensus | |
| RNA Synthesis and Degradation | ambiguous | - | deficient | GO | |
| Translation, Protein Export and Folding | ambiguous | - | deficient | consensus | Corresponds to “Translation and Protein Quality Control” in the TFBS study |
| Cell Cycle and Mitosis | ambiguous | - | deficient | GO | |
|
| |||||
| Endocytosis | enriched | pathways | - | ||
| Morphogenesis | ambiguous | - | - | - | |
| Cytoskeleton | ambiguous | - | - | - | |
| Cellular immune response (T cells and NK cells) | - | - | enriched | consensus | |
| Fertilization | - | - | enriched | consensus | |
| Vitamin metabolism | - | - | enriched | pathways | |
| Molecular transport | - | - | enriched | pathways | |
| Sulfur metabolism and linked redox reactions | - | - | enriched | pathways | |
| Response to phorbol acetate | - | - | deficient | GO | |
| Electron transfer reactions | - | - | deficient | GO | |
| Mitochondria | - | - | deficient | GO | |
| Nucleic Base, Nucleosides and Nucleotides metabolism | - | - | deficient | consensus | |
| Carbohydrates metabolism | - | - | ambiguous | - | |