| Literature DB >> 20660015 |
Malali Gowda1, Cristiano C Nunes, Joshua Sailsbery, Minfeng Xue, Feng Chen, Cassie A Nelson, Douglas E Brown, Yeonyee Oh, Shaowu Meng, Thomas Mitchell, Curt H Hagedorn, Ralph A Dean.
Abstract
Small RNAs are well described in higher eukaryotes such as mammals and plants; however, knowledge in simple eukaryotes such as filamentous fungi is limited. In this study, we discovered and characterized methylguanosine-capped and polyadenylated small RNAs (CPA-sRNAs) by using differential RNA selection, full-length cDNA cloning and 454 transcriptome sequencing of the rice blast fungus Magnaporthe oryzae. This fungus causes blast, a devastating disease on rice, the principle food staple for over half the world's population. CPA-sRNAs mapped primarily to the transcription initiation and termination sites of protein-coding genes and were positively correlated with gene expression, particularly for highly expressed genes including those encoding ribosomal proteins. Numerous CPA-sRNAs also mapped to rRNAs, tRNAs, snRNAs, transposable elements and intergenic regions. Many other 454 sequence reads could not be mapped to the genome; however, inspection revealed evidence for non-template additions and chimeric sequences. CPA-sRNAs were independently confirmed using a high affinity variant of eIF-4E to capture 5'-methylguanosine-capped RNA followed by 3'-RACE sequencing. These results expand the repertoire of small RNAs in filamentous fungi.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20660015 PMCID: PMC2995040 DOI: 10.1093/nar/gkq583
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.CPA-sRNA isolation and size distribution. (A) Strategy for CPA-sRNA preparation from mycelial total RNA. The protocol ensures capture of RNA species that possess both a 5′-cap and a 3′-polyadenylated tail. The first treatment with BAP prevents RNA containing a 5′-free phosphate from being able to ligate to the 5′-linker. The use of (dT)20VN oligo for single-strand cDNA priming allows cDNA to be synthesized exclusively from RNA containing polyA. Following amplification by PCR, small cDNAs (<200 nt) were purified from a 3% agarose gel and subjected to 454 pyrosequencing. (B) Size distribution of CPA-sRNAs (≥16 nt) that matched to the M. oryzae genome (BLAST criteria: ≥80% coverage and ≥98% sequence identity). (C) CPA-sRNAs mapped to annotated protein-coding TUs. A vertical line represents the TSS and TTS for protein-coding genes.
Distribution of CPA-sRNAs mapped to genomic and mitochondrial features
| Read Count | Prorated | Features | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Sense | Antisense | Total | Sense | Antisense | Mapped | Total | Coverage (%) | |
| Genes | 10 265 | 8894 | 3507 | 9780 | 7579 | 2201 | 4327 | 11 043 | 39 |
| Introns | 681 | 456 | 313 | 378 | 239 | 139 | 467 | 19 651 | 2 |
| Exons | 9985 | 8685 | 3386 | 9401 | 7340 | 2062 | 4977 | 30 705 | 16 |
| 5′-UTR | 2981 | 2526 | 705 | 2323 | 1967 | 356 | 1325 | 11 241 | 12 |
| EST Supp. | 1247 | 1201 | 46 | 1105 | 1074 | 31 | 375 | 2558 | 15 |
| Unsupp. | 1853 | 1384 | 660 | 1217 | 893 | 325 | 950 | 8683 | 11 |
| CDS | 2227 | 1603 | 779 | 1590 | 1090 | 500 | 1581 | 11 054 | 14 |
| EST Supp. | 1676 | 1233 | 518 | 1166 | 847 | 320 | 1085 | 5199 | 21 |
| Unsupp. | 731 | 426 | 340 | 423 | 243 | 180 | 496 | 5855 | 8 |
| 3′-UTR | 6260 | 5653 | 2276 | 5489 | 4283 | 1206 | 2597 | 11 076 | 23 |
| EST Supp. | 3772 | 3643 | 148 | 2982 | 2887 | 94 | 801 | 2551 | 31 |
| Unsupp. | 3636 | 2090 | 2131 | 2507 | 1397 | 1111 | 1796 | 8525 | 21 |
| tRNA | 425 | 394 | 31 | 289 | 1396 | 6 | 287 | 341 | 84 |
| 5′-Leader | 261 | 237 | 24 | 192 | 191 | 1 | 151 | 341 | 44 |
| Mature | 227 | 226 | 1 | 188 | 188 | 0 | 274 | 341 | 80 |
| 3′-Term | 100 | 93 | 7 | 41 | 36 | 5 | 186 | 341 | 55 |
| rRNA | 1741 | 1740 | 1 | 1643 | 1642 | 1 | 47 | 48 | 98 |
| 5.8s | 82 | 82 | 0 | 82 | 82 | 0 | 3 | 3 | 100 |
| 8s | 66 | 66 | 0 | 46 | 46 | 0 | 41 | 41 | 100 |
| 18s | 661 | 660 | 1 | 593 | 592 | 1 | 1 | 2 | 50 |
| 28s | 932 | 932 | 0 | 922 | 922 | 0 | 2 | 2 | 100 |
| snRNA | 24 | 24 | 0 | 16 | 16 | 0 | 5 | 16 | 31 |
| Transp. Elements | 379 | 325 | 102 | 320 | 278 | 42 | 2087 | 3448 | 61 |
| Intergenic | 2778 | 2778 | – | 2498 | 2498 | – | – | – | – |
| Mitochondria | 43 | 41 | 3 | 43 | 40 | 3 | 14 | 37 | 36 |
| Genes | 11 | 8 | 3 | 8 | 6 | 3 | 8 | 15 | 53 |
| CDS | 3 | 3 | 0 | 3 | 3 | 0 | 2 | 16 | 13 |
| tRNA | 3 | 3 | 0 | 2 | 2 | 0 | 4 | 20 | 20 |
| Mature | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 20 | 5 |
| rRNA | 31 | 31 | 0 | 30 | 30 | 0 | 2 | 2 | 100 |
| Intergenic | 3 | 3 | – | 3 | 3 | – | – | – | – |
aRead Count is the number of CPA-sRNAS that map on to particular genomic features. Note: values for genome features are typically less than the sum of component sub-features due to reads mapping to multiple locations and/or features.
bProrating divides the weight of any given read between alignments and features. This takes into account situations where reads map to multiple locations and/or are annotated with more than one feature (see ‘Materials and Methods’ section and Supplementary Figure S1 for more detail). Both read counts and prorated are divided into sense/antisense with respect to the feature mapped.
cFor each feature, the number of members mapped by a CPA-sRNA is given, followed by the total number of possible features and the percentage mapped.
dValues indicate number of CPA-sRNAs (read count or prorated) that map to features (5′-UTR, CDS or 3′-UTR) within gene models that are supported by ESTs (EST supp.) or have no supporting EST evidence (Unsupp.).
etRNA entries include alignments to pseudo-tRNA.
Association of CPA-sRNAs with other transcriptional evidence
| Genome | Intergenic | Genes | ||||
|---|---|---|---|---|---|---|
| Read Count | Features | Read Count | Features | Read Count | Features | |
| EST or ESS | 10 277 | 7898 | 1130 | 744 | 10 149 | 4095 |
| ESS | 4747 | 2888 | 603 | 404 | 10 111 | 4035 |
| MPSS | 2293 | 1332 | 306 | 158 | 1626 | 712 |
| SAGE | 2932 | 1556 | 352 | 246 | 2150 | 1032 |
| EST | 8969 | 5010 | 756 | 340 | 8913 | 2951 |
aCPA-sRNAs that overlap the sequence of ESS and/or ESTs at the same genome (or intergenic) location.
bCPA-sRNAs that map to transcriptionally (ESS and/or EST) supported gene loci.
cNumber of CPA-sRNAs that map to a given feature containing specified transcriptional evidence.
dNumber of features containing specified transcriptional evidence associated with CPA-sRNAs.
eESS (expressed short sequences) are either MPSS or SAGE tag annotations.
Figure 2.CPA-sRNA validation using 3′-RACE. (A) Total RNA from M. oryzae was used to purify 5′ methylguanosine-capped RNAs using recombinant eIF4EK119A bound to beads (21). 5′ methylguanosine-capped RNA was treated with DNase I and single-stranded cDNA synthesized using an oligo (dT)20VN primer. PCR amplification was performed using a forward primer to the 5′-end of specific CPA-sRNAs and reverse primer specific to the oligo (dT)20VN linker. PCR products were analyzed on 3% agarose gels, bands eluted, cloned into pGEM-T vectors and Sanger sequenced. PCR products were resolved on a 3% agarose gels for (B) protein-coding mRNA (MGG_0383.6, MGG_6594.6, MGG_0469.6, MGG_0592.6, MGG_02597.6, MGG_07928.6, MGG_10680.6, MGG_14279.6 and MGG_01210.6); (C) tRNAs (Ala: MGG_20297.6, Cys: MGG_20209.6, Gln: MGG_20266.6 and Leu: MGG_20218.6); (D) rRNAs (18S and 28S); (E) snRNAs (U6 and U2) and (F) retroelements (MAGGY-LTR). A DNA ladder is shown on the left of each panel. Arrows indicate PCR products that were sequenced.
Figure 3.CPA-sRNA sequences validated for mRNA, tRNA and rRNA loci. (A) 454 and 3′ RACE PCR clone sequence location at the chitinase 18–11 gene (MGG_06594.6). A dashed line in the 3′-RACE sequence data represents the absence of intronic sequence (95 nt) for the chitinase gene. (B) CPA-sRNAs associated with Gln tRNA locus (MGG_20266.6). Underlined sequence data represent non-templated nucleotide. (C) CPA-sRNAs clustering at the 5′-end of 18S rRNA locus.
Figure 4.Correlation of expression and GO annotation of genes with mapped CPA-sRNAs. Correlation analysis of gene expression and number of mapped CPA-sRNAs based on (A, sense mapping; B, antisense mapping) bins and (C, sense mapping; D, antisense mapping) individual genes. Genes were grouped into 100 bins based on mycelial gene expression; each bin contains 99 genes. Average gene expression per bin was plotted verses the average number of CPA-sRNAs for each bin. Asterisks indicates the average number of CPA-sRNAs and expression for 65 genes annotated as ribosomal structural proteins. (E) Biological process and (F) molecular function gene ontology annotation for 127 genes with ≥10 mapped CPA-sRNAs. Numbers on top of vertical bars indicate number of genes per category.