| Literature DB >> 30517151 |
Saumya Agrawal1, Austen R D Ganley1,2.
Abstract
Ribosomal RNA gene repeats (rDNA) encode ribosomal RNA, a major component of ribosomes. Ribosome biogenesis is central to cellular metabolic regulation, and several diseases are associated with rDNA dysfunction, notably cancer, However, its highly repetitive nature has severely limited characterization of the elements responsible for rDNA function. Here we make use of phylogenetic footprinting to provide a comprehensive list of novel, potentially functional elements in the human rDNA. Complete rDNA sequences for six non-human primate species were constructed using de novo whole genome assemblies. These new sequences were used to determine the conservation profile of the human rDNA, revealing 49 conserved regions in the rDNA intergenic spacer (IGS). To provide insights into the potential roles of these conserved regions, the conservation profile was integrated with functional genomics datasets. We find two major zones that contain conserved elements characterised by enrichment of transcription-associated chromatin factors, and transcription. Conservation of some IGS transcripts in the apes underpins the potential functional significance of these transcripts and the elements controlling their expression. Our results characterize the conservation landscape of the human IGS and suggest that noncoding transcription and chromatin elements are conserved and important features of this unique genomic region.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30517151 PMCID: PMC6281188 DOI: 10.1371/journal.pone.0207531
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 2Primate rDNA repeat units.
A) Phylogenetic tree showing the relationships between primate species selected for rDNA phylogenetic footprinting [adapted from 91]. B) Human and primate rDNA unit structures are shown. The rRNA coding region (black line), including the 18S, 5.8S and 28S rRNA subunits (black boxes), and the IGS (grey line) are indicated along with the positions of repeat elements and a cdc27 pseudogene. Elements above the line are on the forward strand; those below on the reverse strand. The rRNA coding region/IGS coordinates and rDNA unit lengths are indicated.
Fig 3Sequence similarity plot of the primate rDNA.
The horizontal axis represents the position in the human rDNA; the vertical axis the level of sequence similarity between 0 (no identity) and 1 (all bases the same). A 50 bp sliding window with 1 bp increment was used to generate the similarity plot. Conserved regions in the IGS (purple boxes) were identified using phastCons. The positions of Alu elements (green boxes), microsatellites (grey boxes), a cdc27 pseudogene (pink box), the rRNA promoter (blue lines), previously identified IGS noncoding transcripts (green wiggly lines), c-Myc binding sites (orange lines), p53 binding site (green line), and Sal boxes (terminator elements; red lines) are indicated. Conserved regions with a black circle or triangle below are conserved in common marmoset and mouse rDNA, respectively.
Repeat composition of the primate rDNA sequences as a percent of total rDNA length (with genome-wide percent abundance in parentheses for comparison).
| Repeat Elements | Human | Chimpanzee | Gorilla | Orangutan | Gibbon | Macaque | Common Marmoset |
|---|---|---|---|---|---|---|---|
| 20.3 | 8.7 (0.8) | 6.6 (1.1) | 7.7 (0.8) | 7.7 (0.8) | 6.2 (0.8) | 10.4(0.9) | |
| 13.3 (10.6) | 13.1 (10.3) | 13.3 (8.3) | 13.6 (9.8) | 16.0 (10.6) | 14.2 (10.1) | 18.2 (11.0) | |
| 4.3 (20.4) | 1.6 (21.6) | 1.3 (19.8) | 1.1 (22.2) | 1.6 (21.8) | 1.5 (19.1) | 0.4 (21.8) | |
| 1.2 (8.3) | 0.7 (9.0) | 0.4 (8.4) | 0.9 (9.0) | 1.7 (8.7) | 12.2 (8.4) | 3.60 (1.0) |
a 9.34% if the 2 kb [TCTC]n microsatellite at 21,894–23,859 in Fig 2 is removed.