| Literature DB >> 20333187 |
Xiaowei Sylvia Chen1, Lesley J Collins, Patrick J Biggs, David Penny.
Abstract
RNA interference (RNAi) is a set of mechanisms which regulate gene expression in eukaryotes. Key elements of RNAi are small sense and antisense RNAs from 19 to 26 nt generated from double-stranded RNAs. MicroRNAs (miRNAs) are a major type of RNAi-associated small RNAs and are found in most eukaryotes studied to date. To investigate whether small RNAs associated with RNAi appear to be present in all eukaryotic lineages, and therefore present in the ancestral eukaryote, we studied two deep-branching protozoan parasites, Giardia intestinalis and Trichomonas vaginalis. Little is known about endogenous small RNAs involved in RNAi of these organisms. Using Illumina Solexa sequencing and genome-wide analysis of small RNAs from these distantly related deep-branching eukaryotes, we identified 10 strong miRNA candidates from Giardia and 11 from Trichomonas. We also found evidence of Giardia short-interfering RNAs potentially involved in the expression of variant-specific surface proteins. In addition, eight new small nucleolar RNAs from Trichomonas are identified. Our results indicate that miRNAs are likely to be general in ancestral eukaryotes and therefore are likely to be a universal feature of eukaryotes.Entities:
Keywords: RNA evolution; ancestral eukaryotes; miRNA; protists
Year: 2009 PMID: 20333187 PMCID: PMC2817412 DOI: 10.1093/gbe/evp017
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FCoverage of RNA species by Solexa sequencing. The filtered reads were mapped against known RNA sequences in Giardia and Trichomonas, and the numbers of reads were counted for the corresponding transcripts. The majority of reads in both organisms are mapped to unknown transcripts, which potentially contain novel ncRNAs. (A) Reads count for Giardia. (B) Reads count for Trichomonas.
FPredicted miRNA candidates and their alignments with published miRNAs: (A) Giardia miRNA candidates and alignments; (B) Trichomonas miRNA candidates and alignments. Six Giardia miRNA candidates and seven Trichomonas miRNA candidates all show extensive sequence homology with known miRNAs from a number of organisms. The full names of known miRNAs are gga-miR-1791 MIMAT0007705 Gallus gallus miR-1791, dan-miR-311a MIMAT0008471 Drosophila ananassae miR-311a, gga-miR-202 MIMAT0003355 Gallus gallus miR-202, oan-miR-1336 MIMAT0006829 Ornithorhynchus anatinus miR-1336, cel-miR-34 MIMAT0000005 Caenorhabditis elegans miR-34, gga-miR-1673 MIMAT0007557 Gallus gallus miR-1673, fru-miR-152 MIMAT0003103 Fugu rubripes miR-152, tni-miR-152 MIMAT0003104 Tetraodon nigroviridis MIR-152, DRE-MIR-725 MIMAT0003753 Danio rerio MIR-725, ATH-MIR862-5p MIMAT0004307 Arabidopsis thalianaMIR862-5p, mmu-miR-743b-3p MIMAT0004840 Mus musculus miR-743b-3p, gma-miR1534 MIMAT0007397 Glycine max miR1534, mml-miR-891 MIMAT0006530 Macaca mulatta miR-891, and osa-miR1852 MIMAT0007772 Oryza sativa miR1852.
FPossible miRNA-3′ UTR target binding for Giardia and Trichomonas miRNA candidates resulting from the homology search: (A) Examples of Giardia miRNA candidates; (B) Examples of Trichomonas miRNA candidates. Examples of Giardia and Trichomonas miRNA candidates show extensive base pairing with predicted 3′ UTR targets. In general, Trichomonas candidates have longer base pairing compared with Giardia candidates, and all predicted target sites are within 50-nt 3′ to the stop codon. The candidate sequences are marked red on corresponding Solexa contigs.
miRNA Candidates and Predicted Targets from Giardia and Trichomonas
| Mir Candidate | Genomic Coordinates | Predicted Mature Sequence | Possible Targets |
| Gim1 | CH991782: 610570–610593 | AUCAACGUGACUGAUGCUGGCUCU | 3′ UTR GL50803_8508 hypothetical protein |
| Gim2 | CH991767: 1378309–1378332 | AUCUCGCACAUAUACCGGCCUCCU | Not found |
| Gim3 | CH991769: 132337–132314 | GUGCAGAGGCAUGGAGCACGGGAA | 3′ UTR GL50803_23634 hypothetical protein; 3′ UTR GL50803_9079 ERP2 |
| Gim4 | CH991779: 949975–949998 | GUGGUCUGCAUCUGGACCUUCACU | 3′ UTR GL50803_14602 hypothetical protein |
| Gim5 | CH991779: 908521–908498 | GGCCGUGUGGUUAGGUGGUUGUUG | 3′ UTR GL50803_48432 hypothetical protein; 3′ UTR GL50803_10843 Thymus-specific serine protease precursor |
| Gim6 | CH991782: 669487–669463 | GUGGUGAGUAGAAGUCAGAUUAUAA | 3′ UTR GL50803_15063 Long chain fatty acid CoA ligase 5; 3′ UTR GL50803_114815 Tenascin precursor |
| Gim7 | CH991769: 687079–687104; 687267–687242 | GCGGUCGCUUGGGUCCCAGCGGGUTC | 3′ UTR GL50803_88901 hypothetical protein |
| Gim8 | CH991813: 431–68 (5 copies in tandem) | GGUCGGUUAGCUCAGUCGGUAGAGCG | 3′ UTR GL50803_20250 hypothetical protein |
| Gim9 | CH991776: 417414–417439 | GUAGGAUGCCCCAGAGACUGCCGAG | 3′ UTR GL50803_13412 acidic ribosomal protein P0 |
| Gim10 | CH991782: 268174–268149 | AAACUCUCCGCACAGGGGCGCGCCUG | 3′ UTR GL50803_94658 hypothetical protein |
| Tvm1 | DS114515: 7733–7712 | GUAAUAGGUCGAGCUUGUGAAU | Not found |
| Tvm2 | DS177933: 152–132 (first of 50 copies) | CAAUUUGGGUAAAUGGUCAAU | 3′ UTR TVAG_416040 conserved hypothetical protein; mRNA TVAG_225980 conserved hypothetical protein |
| Tvm3 | DS176142: 693–673 (first of seven copies) | CAAUUCAGUCAUUCUUUCUGU | 3′ UTR TVAG_493570 conserved hypothetical protein; mRNA TVAG_450550 conserved hypothetical protein |
| Tvm4 | DS160029: 395–416 (first of 21 copies) | CAACAGACAUAAUGCUGAAUAG | mRNA TVAG_389700 conserved hypothetical protein |
| Tvm5 | DS113666: 32359–32338 | UAAUAUGGAAUCAGAAUGCAGU | Not found |
| Tvm6 | DS177803: 248–269 (first of many copies) | UCAUCCUUACCUCAGUCAUUGA | 3′ UTR TVAG_186870 conserved hypothetical protein |
| Tvm7 | DS177310: 460–481(first of 49 copies) | AUAUGGCAUAAUAGAACUUUGC | 3″ UTR TVAG_592550 conserved hypothetical protein; mRNA TVAG_089350 conserved hypothetical protein |
| Tvm8 | DS174663: 730–750(first of many copies) | UUGAAAAAUAAGAUGGUUCGC | 3′ UTR TVAG_080530 conserved hypothetical protein; mRNA TVAG_567710 conserved hypothetical protein |
| Tvm9 | DS162040: 804–784 (first of two copies) | UUUCAAUUGGACAAUUUGAAU | 3′ UTR TVAG_518140 conserved hypothetical protein; mRNA TVAG_258870 conserved hypothetical protein |
| Tvm10 | DS176757: 229–209 (first of many copies) | CUGUUUGGAAGUUGUAUCCAU | 3′ UTR TVAG_029730 conserved hypothetical protein |
| Tvm11 | DS177474: 213–232 (first of many copies) | UGCGACAAGCUUGCCCAUGG | 3′ UTR TVAG_140140 conserved hypothetical protein; mRNA TVAG_123455 conserved hypothetical protein |
FExamples of additional miRNA candidates from genomic sequence analysis: (A) Examples of Giardia miRNA candidates and predicted target binding. (B) Examples of Trichomonas miRNA candidates and predicted target binding. Regions on the predicted precursor sequences marked by red are covered by Solexa de novo contigs. The example of Trichomonas miRNA (B) shows a feature of many candidates that the mature miRNA may be in the loop region of the precursor hairpin. However, more evidence is needed to show if this feature is general in Trichomonas. The Giardia example (A) shows a typical stem location of the mature miRNA.
FExpression of Girep RNAs and alignment with VSP mRNA sequence. (A) Expression of sense and antisense strands of Girep RNAs From the figure, it is clear that at least one of each of the Girep sequences are transcribed at both sense and antisense strands, indicated by the reverse transcriptase–polymerase chain reaction (RT-PCR). The products of RT-PCR and positive control PCR all have multiple bands, indicating the tandem repeating pattern of Girep sequences. (B) General pattern of matching between a Girep sequence and a VSP mRNA. This figure shows the sequence alignment between the 222-nt repeating unit of Girep-1 and the repeating unit of Giardia VSP gene (GL50803_137740). The two sequences are highly similar at sequence level, indicating a strong relation between them.
Gireps and Solexa Short-Read Coverage
| Name | No. of Repeating Units | Length of Repeating Unit | Location | Solexa + Strand Hits | Solexa − Strand Hits |
| Girep-1 | 9 | 222 | CH991779:330469–332457 | 7 | 0 |
| Girep-2 | 8 | 222 | CH991779:395054–396828 | 14 | 1 |
| Girep-3 | 7 | 228 | CH991782:209411–210987 | 44 | 5 |
| Girep-4 | 8 | 228 | CH991804:1–1810 | 2 | 46 |
| Girep-5 | 10 | 225 | CH991763:274924–277375 | 13 | 2 |
FCommon structures of Trichomonas C/D box snoRNAs. Trichomonas C/D box snoRNAs adopt either of the above common structures. Shorter snoRNAs tend to have the left-hand side form and longer ones the right-hand side form. Shorter sequences usually have ribosomal recognition sites close to their 5′ ends, whereas longer sequences generally have less conserved D′ boxes and ribosomal recognition regions close to their 3′ ends.