| Literature DB >> 20333211 |
Marc P Hoeppner1, Simon White, Daniel C Jeffares, Anthony M Poole.
Abstract
Small nucleolar RNAs (snoRNAs) and microRNAs (miRNAs) are integral to a range of processes, including ribosome biogenesis and gene regulation. Some are intron encoded, and this organization may facilitate coordinated coexpression of host gene and RNA. However, snoRNAs and miRNAs are known to be mobile, so intron-RNA associations may not be evolutionarily stable. We have used genome alignments across 11 mammals plus chicken to examine positional orthology of snoRNAs and miRNAs and report that 21% of annotated snoRNAs and 11% of miRNAs are positionally conserved across mammals. Among RNAs traceable to the bird-mammal common ancestor, 98% of snoRNAs and 76% of miRNAs are intronic. Comparison of the most evolutionarily stable mammalian intronic snoRNAs with those positionally conserved among primates reveals that the former are more overrepresented among host genes involved in translation or ribosome biogenesis and are more broadly and highly expressed. This stability is likely attributable to a requirement for overlap between host gene and intronic snoRNA expression profiles, consistent with an ancestral role in ribosome biogenesis. In contrast, whereas miRNA positional conservation is comparable to that observed for snoRNAs, intronic miRNAs show no obvious association with host genes of a particular functional category, and no statistically significant differences in host gene expression are found between those traceable to mammalian or primate ancestors. Our results indicate evolutionarily stable associations of numerous intronic snoRNAs and miRNAs and their host genes, with probable continued diversification of snoRNA function from an ancestral role in ribosome biogenesis.Entities:
Keywords: evolution; intron; miRNA; snoRNA
Year: 2009 PMID: 20333211 PMCID: PMC2817437 DOI: 10.1093/gbe/evp045
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Number of Annotated ncRNAs per Genome Versus Aligned Regionsa
| Total (Mb) | snoRNAs | miRNAs | Alignment (bp) | snoRNAs | miRNAs | Genome% | snoRNA% | miRNA% | |
| 2918 | 586 (267) | 565 (185) | 1160 | 348 (232) | 288 (154) | 39.75 | 59.39 | 50.97 | |
| 2385 | 490 (225) | 628 (249) | 1232 | 280 (194) | 366 (200) | 51.65 | 57.14 | 58.28 | |
| 2429 | 406 (182) | 612 (221) | 1219 | 221 (166) | 348 (182) | 50.18 | 54.43 | 56.86 | |
| 1051 | 148 (118) | 560 (256) | 498 | 106 (94) | 287 (183) | 47.38 | 71.62 | 51.25 | |
| 3253 | 717 (360) | 1664 (740) | 1834 | 450 (306) | 1046 (570) | 56.37 | 62.76 | 62.86 | |
| 3094 | 715 (280) | 1208 (422) | 1589 | 383 (240) | 610 (338) | 51.36 | 53.57 | 50.50 | |
| 3502 | 221 (137) | 375 (121) | 1605 | 166 (124) | 168 (88) | 45.85 | 44.27 | 44.80 | |
| 3421 | 949 (351) | 1081 (504) | 1427 | 475 (289) | 677 (421) | 41.72 | 50.05 | 62.63 | |
| 1918 | 2342 (259) | 605 (123) | 554 | 276 (177) | 154 (97) | 28.88 | 11.78 | 25.45 | |
| 2929 | 716 (333) | 1464 (550) | 1836 | 432 (282) | 889 (440) | 62.69 | 60.34 | 60.72 | |
| 3109 | 763 (280) | 1485 (451) | 1691 | 389 (228) | 763 (347) | 54.4 | 50.98 | 51.38 | |
| 2507 | 1023 (373) | 760 (290) | 1376 | 511 (319) | 446 (250) | 54.88 | 49.95 | 58.68 |
The 12 genome alignment for our study was obtained from the Ensembl database. The average proportion of each genome sequence present in synteny blocks is approximately 50% (genome% column). Approximately 50% of annotated snoRNAs (snoRNA% column) and miRNAs (miRNA% column) were included in synteny blocks and thus used in this study. Numbers in parentheses indicate intronically encoded RNAs.
FA comparison of human host gene function between the ancestor of mammals and primates suggests a diversification in the roles of snoRNA host genes in more recent evolutionary history. Whereas half of the host genes in the earliest mammals (mammalian ancestor, graphs on left) are involved in ribosome formation or protein production ([A] biological process: translation; [B] molecular function: RNA binding), no such bias can be found for snoRNA host genes traceable to the primate ancestor (minus those also in the mammalian ancestor, graphs on right). The scale on the y axis corresponds to the total number of genes from Homo sapiens used in each analysis. Only the top 10 categories are shown. E values for significantly overrepresented GO terms were calculated using GoStat (Beissbarth and Speed 2004).
FReconstruction of ancestral states for positionally conserved snoRNAs and miRNAs across 11 mammalian genomes. The tree is based on the phylogeny reported by Bininda-Emonds et al. (2007), with modifications as described in Materials and Methods. Counts of positionally conserved ncRNAs are derived from a maximum parsimony analysis using DolloP from the PHYLIP package (see Materials and Methods). The numbers of ncRNAs inferred from synteny to be present at each internal node are listed (red: snoRNAs, blue: miRNAs). The first number indicates the total count of orthologous mi/snoRNAs inferred to be present at a given node, followed by the number of intronic ncRNAs inferred to be present at a given node (a subset of the first value).
FRepresentation of snoRNA and miRNA families among the subset of positionally conserved ncRNAs in this study. Our analysis is based on genomic alignments and thus excludes approximately 50% of annotated snoRNAs and miRNAs located in nonsyntenic regions of the respective genomes (table 1). To estimate the coverage of snoRNA and miRNA families (as defined by Rfam and miRBase) in our analysis, we performed a per-node reconstruction of family presence/absence irrespective of copy number or positional conservation (genome) and compared these numbers (family count) with the family representation in our analysis (alignment). The results indicate that our study set provides good coverage (between 70% and 90%) of snoRNA and miRNA families. The remaining ncRNA families are likely located in regions not alignable across genomes.
Patterns of Intronic and Intergenic ncRNA Conservation
| ncRNA type | Present in | Intronic | Intergenic | Total |
| snoRNA | All primates | 293 | 81 | 374 |
| Mammalian ancestor | 102 | 2 | 104 | |
| miRNA | All primates | 351 | 139 | 490 |
| Mammalian ancestor | 72 | 20 | 92 |
ncRNAs conserved across primates (node 7, fig. 1) that were already present in the mammalian ancestor (node 2, fig. 1).
FFew genes carry multiple intronic ncRNAs. A per-node (fig. 1) inspection of the number of (A) miRNAs and (B) snoRNAs per host gene in Homo sapiens reveals that most host genes carry only a single ncRNA (black). This suggests de novo emergence and/or transduplications (including ncRNA retroposition) of existing families are more prevalent in mammals than cis-duplication.