| Literature DB >> 19357091 |
Abstract
Small nucleolar RNAs (snoRNAs) are noncoding RNAs that direct 2'-O-methylation or pseudouridylation on ribosomal RNAs or spliceosomal small nuclear RNAs. These modifications are needed to modulate the activity of ribosomes and spliceosomes. A comprehensive repertoire of snoRNAs is needed to expand the knowledge of these modifications. The sequences corresponding to snoRNAs in 18-26-nt small RNA sequencing data have been rarely explored and remain as a hidden treasure for snoRNA annotation. Here, we showed the enrichment of small RNAs at Arabidopsis snoRNA termini and developed a computational approach to identify snoRNAs on the basis of this characteristic. The approach successfully uncovered the full-length sequences of 144 known Arabidopsis snoRNA genes, including some snoRNAs with improved 5'- or 3'-end annotation. In addition, we identified 27 and 17 candidates for novel box C/D and box H/ACA snoRNAs, respectively. Northern blot analysis and sequencing data from parallel analysis of RNA ends confirmed the expression and the termini of the newly predicted snoRNAs. Our study especially expanded on the current knowledge of box H/ACA snoRNAs and snoRNA species targeting snRNAs. In this study, we demonstrated that the use of small RNA sequencing data can increase the complexity and the accuracy of snoRNA annotation.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19357091 PMCID: PMC2685112 DOI: 10.1093/nar/gkp225
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Characteristics of novel box C/D snoRNAs predicted by small RNA sequencing data
| Name | Size (nt) | Location | Target site | Homology |
|---|---|---|---|---|
| U27-2 | 84 | IGR, cluster | Am42/U6 (D) | Am47 (Hs: mgU6-47) |
| Am28/18S (D′) | Am27 (Hs: U27); Am28 (Sc: snR74) | |||
| U46-1 | 79 | Intron of At3g51800 | Am2246/25S (D) | Am3739 (Hs: U46); Am2256 (Sc: snR63) |
| U46-2 | 83 | IGR, cluster | Am2246/25S (D) | Am3739 (Hs: U46); Am2256 (Sc: snR63) |
| snoR102-2 | 165 | 3′ UTR of At4g30993 | Gm41/U5 (D) | |
| snoR113 | 103 | IGR | Gm1446/25S (D′) | |
| snoR114 | 88 | IGR | Gm1191/25S (D) | |
| Gm1590/18S (D′) | ||||
| snoR115-1 | 68 | IGR, cluster | Um602/18S (D′) | |
| snoR115-2 | 69 | AS of At1g18740, cluster | Um602/18S (D′) | |
| snoR116 | 86 | IGR, cluster | Um123/18S (D′) | Um121 (Hs: mgh18S-121, Z17B) |
| snoR117 | 72 | IGR, cluster | Um2103/25S (D′) | |
| snoR118 | 93 | IGR, cluster | Cm1280/25S (D′) | |
| snoR119 | 75 | Intron of At5g01350, cluster | Am778/18S (D′) | |
| snoR120 | 74 | IGR, cluster | Um44/25S (D) | |
| snoR121 | 85 | IGR | Cm1510/25S (D) | Cm2409 (Hs: mgh28S-2409) |
| snoR122 | 74 | AS of At1g18740, cluster | Um168/18S(D′) | Um172 (Hs: U54A) |
| snoR123a | 80 | Intron of At4g31980, cluster | Um2483/25S(D′) | |
| snoR123b | 79 | Intron of At4g31980, cluster | Um2483/25S(D′) | |
| snoR124 | 68 | IGR, cluster | Gm244/18S(D′) | |
| snoR125 | 200 | IGR | Am31/U2 (D) | Am30 (Hs: mgU2-19/30) |
| snoR126 | 105 | 5′UTR of At5g65860 | Gm75/U6 (D) | |
| Gm84/U6 (D′) | ||||
| snoR127 | 138 | Intron of At5g27720 | Gm20/U2 (D′) | Gm19 (Hs: mgU2-19/30) |
| snoR128 | 92 | IGR, cluster | ||
| snoR129 | 88 | IGR, cluster | ||
| snoR130 | 120 | Intron of At3g07860 | ||
| snoR131 | 77 | IGR, cluster | ||
| snoR132 | 82 | IGR, cluster | ||
| snoR133 | 92 | IGR, cluster |
aThe location of snoRNAs is described as IGR (intergenic region), AS (antisense), UTR (untranslated region) and intron of coding genes.
Cluster indicates potential polycistronic snoRNA.
bThe target site is indicated as the methylated nucleotide followed by its position on rRNAs or snRNAs.
The location of antisense elements that are complementary to rRNAs or snRNAs is indicated in parentheses as D or D′.
cHomology indicates equivalent nucleotides on which methylation has been experimentally validated in humans (Hs) and/or yeast (Sc). Equivalent nucleotides are followed by snoRNAs guiding the modification in parentheses.
Characteristics of novel box H/ACA snoRNAs predicted by small RNA sequencing data
| Name | Size (nt) | Location | Target site | Homolgy |
|---|---|---|---|---|
| snoR80-2 | 140 | IGR, cluster | Ψ1130/25S (P1) | |
| Ψ999/25S (P2) | Ψ990 (Sc: snR49) | |||
| snoR80-3 | 138 | IGR, cluster | Ψ1130/25S (P1) | |
| Ψ999/25S (P2) | Ψ990 (Sc: snR49) | |||
| snoR86-2 | 142 | IGR, cluster | Ψ2555/25S (P1) | |
| Ψ360/18S (P2) | Ψ406 (Hs: U71) | |||
| snoR97-2 | 134 | IGR, cluster | Ψ2964/25S (P1) | |
| Ψ2707/25S (P2) | Ψ4263 (Hs: ACA2) | |||
| snoR103-2 | 132 | Intron of At4g30680 | Ψ50/U5 (P1) | Ψ46 (Hs: U85, U89) |
| Ψ47/U5 (P2) | Ψ43 (Hs: ACA57) | |||
| snoR134 | 144 | IGR, cluster | Ψ1104/18S (P1) | |
| Ψ1192/18S (P2) | Ψ1248 (Hs: ACA13); Ψ1191 (Sc: snR35) | |||
| snoR135 | 141 | At2g46192 (ncRNA), cluster | Ψ2181/25S (P2) | Ψ3674 (Hs:unknown); Ψ2191 (Sc: snR32) |
| snoR136 | 155 | IGR, cluster | Ψ2833/25S (P2) | Ψ4390 (Hs: E3) |
| snoR137 | 146 | IGR | Ψ1208/18S (P1) | |
| snoR138 | 156 | IGR | Ψ1479/18S (P1) | |
| snoR139 | 150 | IGR, cluster | Ψ762/18S (P1) | Ψ815 (Hs: ACA28); Ψ759 (Sc: snR80) |
| snoR140 | 142 | Intron of At5g08180, cluster | Ψ677/18S (P1) | |
| Ψ1215/18S (P2) | ||||
| snoR141 | 151 | IGR, cluster | Ψ1050/25S (P1) | Ψ1769 (Hs: ACA9); Ψ1042 (Sc: snR33) |
| Ψ2093/25S (P2) | ||||
| snoR142 | 154 | IGR, cluster | Ψ26/U6 (P1) | Ψ31 (Hs: ACA65) |
| Ψ2884/25S (P2) | Ψ4441 (Hs: ACA1) | |||
| snoR143 | 171 | CDS of At1g31835 | Ψ38/U2 (P2) | Ψ37 (Hs: ACA45) |
| snoR144 | 144 | IGR, cluster | Ψ1897/25S (P1) | |
| Ψ1255/25S (P2) | ||||
| snoR145 | 145 | At2g46192 (ncRNA), cluster |
aThe location of snoRNAs is described as ncRNA (noncoding RNA), IGR, CDS (coding region), and intron of coding genes.
Cluster indicates potential polycistronic snoRNA.
bThe target site of new box H/ACA snoRNAs is indicated as a pseudouridine, followed by its position on rRNAs or snRNAs.
The location of antisense elements that are complementary to rRNAs or snRNAs is indicated in parentheses as P1 (the 5′ internal loop) or P2 (the 3′ internal loop).
cHomology indicates equivalent nucleotides on which pseudouridylation has been experimentally validated in humans (Hs) and/or yeast (Sc).
Figure 2.Structures of novel box H/ACA snoRNAs predicted by small RNA sequencing data. The program of mfold was used to predict structures of newly identified snoR135, snoR139, snoR140 and snoR142 as described in ‘Materials and Methods’ section.
Figure 1.Enrichment of small RNAs at both termini of snoRNAs. (A) Definition of 5′-, body and 3′-end regions of a snoRNA. (B) The proportion of the sequence length, distinct small RNAs and total reads of small RNAs mapped to each region. (C) The distribution of small RNAs on a known box C/D snoRNA, U30. (D) The distribution of small RNAs on a known box H/ACA snoRNA, snoR74-2 with an incomplete 5′ end. Capital letters indicate the snoRNA sequences reported previously and lowercase letters indicate the flanking sequences. Short sequences are small RNAs followed by their read numbers. Conserved motifs are highlighted.
Number of snoRNAs identified by small RNA sequencing data
| snoRNA | Known | Novel |
|---|---|---|
| Box C/D | 124 (30)/161 | 27 |
| Subgroup I | 114 (26) | 24 |
| Subgroup II | 8 (3) | 2 |
| Subgroup III | 2 (1) | 1 |
| Box H/ACA | 20 (18)/43 | 17 |
The denominators of known snoRNAs are the total numbers of snoRNAs reported in previous studies.
The numbers of snoRNAs with predicted 5′ and/or 3′ ends deviating more than 3 nt from those in previous reports are in parentheses.
Figure 3.Validation of the predicted snoRNA 5′ ends by PARE data. The positions of predicted 5′ ends of 151 box C/D snoRNAs and 37 box H/ACA snoRNAs were set as 0. Positions upstream of the predicted 5′ ends were set as negative values and downstream as positive values. The read number of 5′ ends starting at each position (–10 to 10) was normalized to the total PARE read number of the 21-nt region. A gradient of red color was used to represent the frequency of 5′ ends starting at each position.
Figure 4.Northern blot analysis of novel snoRNAs predicted by small RNA sequencing data. (A) Aliquots of 10 μg total RNA for each lane were separated on 15% TBU gel for the northern blot analysis of eight new box C/D snoRNAs. snoR128: 92 nt; U46-1: 79 nt; snoR123b: 79 nt; snoR119: 75 nt; snoR127: 138 nt; snoR129: 88 nt; snoR125: 200 nt; U27-2: 84 nt. (B) Aliquots of 10 μg total RNA for each lane were separated on 6% TBU gels for the northern blot analyses of six new box H/ACA snoRNAs. snoR141: 151 nt; snoR140: 142 nt; snoR143: 171 nt; snoR145: 145 nt; snoR135: 141 nt; snoR139: 150 nt; Spliceosomal snRNA U6 was used as the loading control. S, 10-day-old seedlings; L, rosette leaves; F, flowers.