| Literature DB >> 35328433 |
Nickolai A Tchurikov1, Elena S Klushevskaya1, Ildar R Alembekov1, Anastasiia S Bukreeva1, Antonina N Kretova1, Vladimir R Chechetkin1, Galina I Kravatskaya1, Yuri V Kravatsky1,2.
Abstract
Small noncoding RNAs of different origins and classes play several roles in the regulation of gene expression. Here, we show that diverged and rearranged fragments of rDNA units are scattered throughout the human genome and that endogenous small noncoding RNAs are processed by the Microprocessor complex from specific regions of ribosomal RNAs shaping hairpins. These small RNAs correspond to particular sites inside the fragments of rDNA that mostly reside in intergenic regions or the introns of about 1500 genes. The targets of these small ribosomal RNAs (srRNAs) are characterized by a set of epigenetic marks, binding sites of Pol II, RAD21, CBP, and P300, DNase I hypersensitive sites, and by enrichment or depletion of active histone marks. In HEK293T cells, genes that are targeted by srRNAs (srRNA target genes) are involved in differentiation and development. srRNA target genes are enriched with more actively transcribed genes. Our data suggest that remnants of rDNA sequences and srRNAs may be involved in the upregulation or downregulation of a specific set of genes in human cells. These results have implications for diverse fields, including epigenetics and gene therapy.Entities:
Keywords: HEK293T; differentiation; epigenetics; rDNA fragments; small ribosomal RNAs (srRNAs); transcriptional activation; transcriptional silencing
Mesh:
Substances:
Year: 2022 PMID: 35328433 PMCID: PMC8954558 DOI: 10.3390/ijms23063014
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Characterization of srRNAs and their target genes. (A) Violin presentation of srRNA lengths and abundance, including all isolated molecules, unique molecules (without exact copies), and sense and antisense molecules. The complete data are presented in Table S1. (B) The distribution of sense (red curve) and antisense (blue curve) srRNAs along the length of a 43 kb rDNA unit. The vertical line rising from the 28S gene indicates the highest peak of srRNAs. (C) The top ten Gene Ontology (GO) biological process associations of srRNA target genes. The values to the right of the bars show the number of srRNA target genes associated with a process. The complete list of srRNA target genes is shown in Table S1. Table S2 shows the results of the GO search. (D) A Venn diagram showing the intersections between srRNA target genes and rDNA-contacting genes [9]. Table S3 shows the list of overlapping genes. (E) The top ten GO biological process associations of 426 genes are shown in (D). The values to the right of the bars show the number of srRNA target genes associated with a process. Table S4 shows the results of the corresponding GO search.
Figure 2Characterization of srRNA targets inside the intron of the UNC45B gene. (A) Divergent rDNA stretches in the region are shown at the top. The colors indicate the alignment score of NCBI BLAST. The dot plot in Figure S1 shows the position of a short rearranged 5′ fragment of the 28S gene inside a segment of chr17. The distribution of layered H3K27ac marks, genome segmentation from ENCODE, histone modifications, nucleosome position, and CpG methylation inside a region of chr17 are shown as in the UCSC Browser. (B) Four groups of overlapping sequences of srRNAs of length 23–43 nt correspond to the main peak of sense srRNAs from the 28S gene, as shown in Figure 1B.
Figure 3Properties of srRNA targets in HEK293T cells. (A) Profiles of DNase I sites, binding sites of different factors, histone marks, and rDNA-contacting sites around srRNA targets. The z-scored signals ±1.5 kb around srRNA targets are indicated. (B) The percentage of chromatin states (15-state model) in H1-derived neuronal progenitor-cultured cells in the whole genome (left dark bars) and at srRNA targets (right light bars) is shown. The color codes of the epigenetic states are shown in the order they appear at the srRNA sites. The labels present a state number and the percentage of the corresponding state. The statistical significance of the difference between epigenome states is tested with the independent-samples unequal-variances t-test. All cases with statistically significant differences (p < 0.005) are marked by an asterisk.
Figure 4Analysis of expression of srRNA target genes. (A) The scatter plot presents the expression levels of 1584 srRNA target genes and the numbers of corresponding srRNAs. The red dots indicate srRNA target genes. The median position and whiskers are shown in blue. The X-axis is not to scale. The list of all srRNAs is shown in Table S1. The names of genes are indicated if a single gene is targeted by a particular set of isolated srRNAs (see Section 4). (B) Violin plots showing the distribution of genes with respect to their expression levels for all HEK293T genes (red), random genes (violet), and srRNA target genes (blue). The numbers of corresponding genes are shown at the top.