| Literature DB >> 24228136 |
Jovan D Rebolledo-Mendez1, Radhika A Vaishnav, Nigel G Cooper, Robert P Friedland.
Abstract
Micro-RNAs regulate the expression of cellular and tissue phenotypes at a post-transcriptional level through a complex process involving complementary interactions between micro-RNAs and messenger-RNAs. Similar nucleotide interactions have been shown to occur as cross-kingdom events; for example, between plant viruses and plant micro-RNAs and also between animal viruses and animal micro-RNAs. In this study, this view is expanded to look for cross-kingdom similarities between plant virus and human micro-RNA sequences. A method to identify significant nucleotoide sequence similarities between plant viruses and hsa micro-RNAs was created. Initial analyses demonstrate that plant viruses contain nucleotide sequences which exactly match the seed sequences of human micro-RNAs in both parallel and anti-parallel directions. For example, the bean common mosaic virus strain NL4 from Colombia contains sequences that match exactly the seed sequence for micro-RNA of the hsa-mir-1226 in the parallel direction, which suggests a cross-kingdom conservation. Similarly, the rice yellow stunt viral cRNA contains a sequence that is an exact match in the anti-parallel direction to the seed sequence of hsa-micro-RNA let-7b. The functional implications of these results need to be explored. The finding of these cross-kingdom sequence similarities is a useful starting point in support of bench level investigations.Entities:
Keywords: Tobacco mosaic virus; bioinformatics; environment; evolutionary conservation; health; human miRNA; molecular mimicry; plant virus; post-transcriptional regulation; sequence homology
Year: 2013 PMID: 24228136 PMCID: PMC3821693 DOI: 10.4161/cib.24951
Source DB: PubMed Journal: Commun Integr Biol ISSN: 1942-0889

Figure 1. Pipeline method for determination of similarities between plant viruses and human miRs. Process pipeline developed for finding similarities between named plant viruses and human miRs. Step 1 includes the establishment of the two data sets including (A) plant viruses downloaded from DPVweb, and (B) hsa-miRs, downloaded from miRBase. Step 2 includes the use of NCBI BLASTn for sequence alignments to find sequence similarities between data sets A and B, where the maximum expectation value found is 9.5 and the shortest matching sequence is 6 nucleotides. Step 3 includes the use of filters based on rules: (A) plant viruses must have at least 1,221 nucleotides; (B) there must be at least 11 or more nucleotides of similarity in such match; and (C) the BLAST sequence should start in any of the nucleotides from within the miR seed region. The final database can be used for additional data integration and analyses. It was used here to perform clustering analyses and also for identification of viruses that contained sequences that showed similarities with miR seed sequence.

Figure 2. Overlapping clusters of plant viruses based on human miRs with which they have sequence similarities. Venn diagram showing the four subsets (A–D), based on the bit-score for all the plant viruses that had similarity sequences with human miRs. These data were obtained as the result from Step 3 (Fig. 1). The clusters represent the entire holdings in the database. The priority in this diagram was to show the BLAST bit-scores distributed among the different subsets of plant viruses. Subset (A) (color olive) contains 25 matching viruses with score of 28.4 bits and 14 nucleotides long; subset (B) (color dark khaki) contains 63 matching viruses with score of 30.2 bits and 13 nucleotides long; subset (C) (color dark green) contains 204 matching viruses with score of 24.3 bit and 12 nucleotide long; subset (D) (color pink) contains 615 matching viruses with score 22.3 bit and 11 nucleotides long; cluster I1 (color gray) contains 14 matching viruses with score of 24.3 bits (14 nucleotides) and 26.3 (13 nucleotides); and cluster (I2) (color bright green) contains 1 matching virus, rice tungro spherical (am234049) with scores of 26.3 bits (13 nucleotides long), 24.3 bits (12 nucleotides long) and 22.3 bits (11 nucleotides long).
Table 1. Plant virus ranking
| Rank | Virus Name | DPVweb platn virus index | Virus strain | hsa miRs |
|---|---|---|---|---|
| 1 | ab516283 | | mir-1253, mir-1248, let-7b, let-7a-1, let-7a-3, mir-107, let-7a-2, mir-103a-1, mir-1-1, let-7c, mir-103b-1, mir-103b-2 | |
| 2 | eu937520 | Strain T30 | *mir-122, mir-1244-1, mir-1244-2, mir-1244-3, let-7f-1, let-7f-2, mir-1184-1, mir-1184-2, mir-1184-3, mir-1228 | |
| fj525431,fj525433, fj525434,fj525435, | Isolate NZRB-M12, isolate NZRB-TH28, isolate NZRB-TH30, isolate NZRB-M17 | mir-1244-1, mir-1244-2, mir-1244-3, let-7f-1, mir-1184-1, mir-1184-2, mir-1184-3, let-7f-2, mir-1180, mir-1193 | ||
| fj525432 | Isolate NZRB-G90 | mir-1244-1, mir-1244-2, mir-1244-3, let-7f-1, mir-1184-1, mir-1184-2, mir-1184-3, let-7f-2, mir-1193 | ||
| af260651 | T30 | *mir-122, mir-1244-1, mir-1244-2, mir-1244-3, let-7f-1, mir-1184-1, mir-1184-2, mir-1184-3, let-7f-2 | ||
Ranking of the two plant viruses that have the most number of sequences matching across the human miRs. Rice yellow stunt virus viral cRNA followed by Citrus tristeza virus were the plant viruses that ranked the best matching similarities. The latest plant virus has seven strains that matched with has miRs, where mir-122 can be used to identify the difference between the fj strains, and af and eu strains.

Figure 3. Overlapping clusters of hsa-miRs based on the number of plant viruses which they have sequence similarities. Diagram showing the clustering of human miRs based on their sequence similarities with plant viruses. These data were obtained as an outcome from Step 3 (Fig. 1). This figure represents the entire holdings in the database. The priority for this particular result was set to show the subsets of hsa-miRs based on their BLAST bit-scores. Subset (A) (color bright green) contains 10 matching miRs with score of 28.2 and 14 nucleotides long; subset (B) (color dark khaki) contains 16 matching miRs with score of 26.3 bits and 13 nucleotides long; subset (C) (color dark green) contains 48 matching miRs with score of 24.3 bits and 12 nucleotides long; subset (D) (color pink) contains 73 matching miRs with 11 nucleotides long; and subset (I1)(color yellow green) contains 2 matching miRs that are contained in each other score subset.
Table 2. miR ranking
| Rank | miRs | Number of viruses/strains |
|---|---|---|
| 1 | hsa-mir-1253 | 136 |
| 2 | hsa-mir-1243 | 110 |
| 3 | hsa-mir-105-1 | 83 |
| 3 | hsa-mir-105-2 | 83 |
| 4 | hsa-mir-1207 | 81 |
Top rank ordered hsa-miRs that share identity with one or more plant viruses, with 11 or matching nucleotides including the complete seed region
Table 3. Best plus/plus matching ranking based on seed region
| DPVweb plant virus index | Virus name | Virus sequence | miR sequence | miR name |
|---|---|---|---|---|
| dq666332 | gtgagggcatgcag | gtgagggcatgcag | 1226 | |
| eu761198 | gtgagggcatgcag | gtgagggcatgcag | 1226 | |
| af017780 | caaatgctcagact | caaatgctcagact | 105-1 | |
| aj312438 | gtgagggcatgcag | gtgagggcatgcag | 1226 | |
| af017780 | caaatgctcagact | caaatgctcagact | 105-2 | |
| ay575773 | gtgagggcatgcag | gtgagggcatgcag | 1226 | |
| aj312437 | gtgagggcatgcag | gtgagggcatgcag | 1226 | |
| l32603 | cagttatcacagtg | cagttatcacagtg | 101-1 |
Highest ranking viruses which contain the complete seed sequence of hsa-miRs with plus/plus strand and a similarity of 14 nts. Among these, Bean common mosaic virus has different strains that have a high matching with the seed region of miRs. miR 1226 shares complete seed region matching with two different plant viruses.

Figure 4. Nucleotide sequences showing similarity between mir-105-1, mir-105-2 and sour cherry green ring mottle virus (af017780). The partial sequences for mir-105-1 and mir-105-2 are aligned with a section of af017780. Note that miR105-1 and miR105-2 are both quite similar, with the exception of a single nucleotide (dotted-lined box). Both mir-105-1 and mir-105-2 contain 81 nucleotides. The length of the plant virus is 8,372 nucleotides, but the significant subset, nucleotides 695–708, that accompanies the similarities with mir-105-1 and mir-105-2 is shown here. The solid-lined box shows the seed region matching completely in all three sequences.
Table 4. Selected plant viruses matched sequences
| Plant virus index | Plant virus name | Plant virus sequence | miR sequence | hsa miR |
|---|---|---|---|---|
| ay555269 | 224 cggtgctggat 234 | 2 cggtgctggat 12 | hsa-mir-1250 | |
| z29370 | 832 tcaccagccct 822 | 1 tcaccagccct 11 | hsa-mir-1226 | |
| am236832 | 683 actggatcaatt 672 | 2 actggatcaatt 13 | hsa-mir-1243 | |
| aj439544 | 6646 cagttatcacag 6657 | 1 cagttatcacag 12 | hsa-mir-101-1 | |
| aj890348 | 6757 acggatgtttg 6767 | 1 acggatgtttg 11 | hsa-mir-105-1 | |
| aj890348 | 6757 acggatgtttg 6767 | 1 acggatgtttg 11 | hsa-mir-105-2 | |
| u06789 | 850 gagggtcttgg 860 | 1 gagggtcttgg 11 | hsa-mir-1182 | |
| af395678 | 1054 gtgagggcatgcag 1067 | 1 gtgagggcatgcag 14 | hsa-mir-1226 | |
| aj515903 | 629 gtgagggcatgcag 642 | 1 gtgagggcatgcag 14 | hsa-mir-1226 | |
| s66280 | 193 gtgagggcatgcag 206 | 1 gtgagggcatgcag 14 | hsa-mir-1226 | |
| x00206 | 4830 tcaccagccct 4820 | 1 tcaccagccct 11 | hsa-mir-1226 | |
| af395678 | 267 cagatgatcta 257 | 2 cagatgatcta 12 | hsa-mir-1245b | |
| s66253 | 348 cagatgatcta 338 | 2 cagatgatcta 12 | hsa-mir-1245b | |
| dq198144 | 417 tcacacctgcc 427 | 1 tcacacctgcc 11 | hsa-mir-1228 | |
| eu170481 | 773 gtgttcacagc 763 | 2 gtgttcacagc 12 | hsa-mir-124-1 | |
| eu170481 | 773 gtgttcacagc 763 | 2 gtgttcacagc 12 | hsa-mir-124-2 | |
| eu170481 | 773 gtgttcacagc 763 | 2 gtgttcacagc 12 | hsa-mir-124-3 | |
| aj308228 | 531 ctgcgcaagct 521 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fn594778 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fn594779 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fn594780 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fn594781 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fn594782 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fn594783 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fn594784 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fn594785 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i | |
| fr671374 | 51 ctgcgcaagct 41 | 1 ctgcgcaagct 11 | hsa-let-7i |
Selected plant viruses in order of TMV, PVY, CPMV, and PMMoV, showing matching between plant virus and miR sequences.

Figure 5. Two strains of Tobacco mosaic virus, z29370 and ay555269, show sequences similarity to human miRs. The mir-1226 and virus z29370 have an antisense similarity (plus/minus), while mir-1250 and virus ay555269 have a sense (plus/plus) similarity. The seed region of the miRs is shown in the solid-line box. In (A) the similarity starts one nucleotide before the seed region, and it is followed by three nucleotides after such similarity. (B) shows the similarity is extended four nucleotides after the seed region.

Figure 6. Gene sets similarities across miRs relevant to miR-101/101ab, miR-105/105ab, and miR-1182. Clusters from the sets that contain the target genes, taken from miRBase, for the three miRs: miR-101/101ab (with 803 genes), miR-105/105ab (with 479 genes) and miR-1182 (with 110 genes). The cluster that belongs to all the three groups is formed by 6 genes/loci: LOC100507421, KLF12, C17orf39, RUNX1, ANKRD52, CAMK2G.

Figure 7. Gene sets similarities across miRs relevant to miR-101/101ab, miR-105/105ab, and miR-1243. Groups from the sets that contain the target genes, taken from miRBase, forthe three miRs: miR-101/101ab (with 803 genes), miR-105/105ab (with 479 genes) and miR-1243 (with 139 genes). The cluster that belongs to all the three groups is formed by three genes: SMAD2, SLMO2, NAP1L1.

Figure 8. Sequence similarities between pepper mild mottle virus and hsa-let-7i. The seed region is inside the solid-line box. The complete sequence similarity between the different strains of the virus and the hsa-mir-let-7i consists of 11 nucleotides. Every single PMMoV is complementary (Plus/Minus) to the miR. The virus strain aj308228 shares the similarity from nucleotide 531, while all the other strains do so from nucleotide 51 (antisense).