| Literature DB >> 25822729 |
Paul P Gardner1, Mario Fasold2, Sarah W Burge3, Maria Ninova4, Jana Hertel5, Stephanie Kehr5, Tammy E Steeves6, Sam Griffiths-Jones4, Peter F Stadler7.
Abstract
Here we present the results of a large-scale bioinformatics annotation of non-coding RNA loci in 48 avian genomes. Our approach uses probabilistic models of hand-curated families from the Rfam database to infer conserved RNA families within each avian genome. We supplement these annotations with predictions from the tRNA annotation tool, tRNAscan-SE and microRNAs from miRBase. We identify 34 lncRNA-associated loci that are conserved between birds and mammals and validate 12 of these in chicken. We report several intriguing cases where a reported mammalian lncRNA, but not its function, is conserved. We also demonstrate extensive conservation of classical ncRNAs (e.g., tRNAs) and more recently discovered ncRNAs (e.g., snoRNAs and miRNAs) in birds. Furthermore, we describe numerous "losses" of several RNA families, and attribute these to either genuine loss, divergence or missing data. In particular, we show that many of these losses are due to the challenges associated with assembling avian microchromosomes. These combined results illustrate the utility of applying homology-based methods for annotating novel vertebrate genomes.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25822729 PMCID: PMC4378963 DOI: 10.1371/journal.pone.0121797
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Heatmaps showing the presence/absence and approximate genomic copy-number of “lost, divergent or missing RNAs” and the “unusually, well conserved RNAs”.
On the top we show the families that have been identified as surprising RNA losses, divergence or missing data. In several cases functionally related families have also been included, e.g. the RNA components of the major and minor spliceosomes: U1, U2, U4, U5 and U6; and U11, U12, U4atac, U5 and U6atac, respectively. Below we show the unusually, well conserved RNAs, these are predominantly lncRNAs.
A summary of ncRNA genes in human, chicken and all bird genomes.
This table contains the total number of annotated ncRNAs from different RNA types in human, the median number for each of the 48 birds and chicken. The number of chicken ncRNA that show evidence for expression is also indicated (the percentage is given in parentheses). The threshold for determining expression was selected based upon a false positive rate of less than 10%.
|
| ||||
|---|---|---|---|---|
| Number in human | median(48 birds) | Number in chicken | Chicken ncRNAs confirmed with RNA-seq | RNA type |
| 62 | 25.0 | 34 | 12 (35.3%) | Long non-coding RNA |
| 356 | 499.5 | 427 | 280 (65.6%) | microRNA |
| 281 | 120.0 | 106 | 90 (84.9%) | C/D box snoRNA |
| 336 | 85.5 | 68 | 48 (70.6%) | H/ACA box snoRNA |
| 34 | 13.0 | 12 | 12 (100.0%) | Small cajal body RNA |
| 1754 | 48.5 | 71 | 32 (45.1%) | Major spliceosomal RNA |
| 58 | 3.0 | 6 | 3 (50.0%) | Minor spliceosomal RNA |
| 525 | 82.0 | 122 | 88 (72.1%) | Cis-regulatory element |
| 316 | 6.5 | 9 | 3 (33.3%) | 7SK RNA |
| 1 | 0.0 | 2 | 0 (0.0%) | Telomerase RNA |
| 9 | 0.0 | 2 | 1 (50.0%) | Vault RNA |
| 892 | 3.0 | 3 | 2 (66.7%) | Y RNA |
| 1084 | 173.5 | 300 | 278 (92.7%) | Transfer RNA |
| 80 | 9.5 | 4 | 2 (50.0%) | Transfer RNA pseudogene |
| 941 | 3.0 | 4 | 2 (50.0%) | SRP RNA |
| 607 | 7.0 | 22 | 10 (45.5%) | Ribosomal RNA |
| 4 | 1.0 | 2 | 2 (100.0%) | RNase P/MRP RNA |
| 7340 | 1080.0 | 1194 | 865 (72.4%) | Total |
Fig 2The preservation of gene order (synteny) surrounding the HOTAIRM1 (RF01976) locus across the Avian and other Vertebrate lineages.