| Literature DB >> 25469507 |
Yun Zheng1, Shengpeng Wang1, Ramanjulu Sunkar2.
Abstract
Phased small interfering RNA (phasiRNA) generating loci (briefly as PHAS) in plants are a novel class of genes that are normally regulated by microRNAs (miRNAs). Similar to miRNAs, phasiRNAs encoded by PHAS play important regulatory roles by targeting protein coding transcripts in plant species. We performed a genome-wide discovery of PHAS loci in Chinese sacred lotus and identified a total of 106 PHAS loci. Of these, 47 loci generate 21 nucleotide (nt) phasiRNAs and 59 loci generate 24 nt phasiRNAs, respectively. We have also identified a new putative TAS3 and a putative TAS4 loci in the lotus genome. Our results show that some of the nucleotide-binding, leucine-rich repeat (NB-LRR) disease resistance proteins and MYB transcription factors potentially generate phasiRNAs. Furthermore, our results suggest that some large subunit (LSU) rRNAs can derive putative phasiRNAs, which is potentially resulted from crosstalk between small RNA biogenesis pathways that are employed to process rRNAs and PHAS loci, respectively. Some of the identified phasiRNAs have putative trans-targets with less than 4 mismatches, suggesting that the identified PHAS are involved in many different pathways. Finally, the discovery of 24 nt PHAS in lotus suggests that there are 24 nt PHAS in dicots.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25469507 PMCID: PMC4254747 DOI: 10.1371/journal.pone.0113790
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Some identified PHAS loci of Chinese sacred lotus.
| PHAS_ID | Start | End | TR | PR |
| FDR | P.S. | Locus Annotation | Ref. |
|
| |||||||||
| scaffold_106_1 | 375,169 | 376,010 | 94 | 9 |
|
| 15.6 | putative TAS3a |
|
| scaffold_107_1 | 1,397,214 | 1,397,907 | 87 | 11 |
|
| 18.1 | unknown | |
| scaffold_10_1 | 1,954,273 | 1,954,966 | 88 | 9 |
|
| 12.2 | putative TAS3c |
|
| scaffold_131_1 | 70,673 | 71,429 | 97 | 11 |
|
| 26.1 | Putative disease resistance protein RGA3 |
|
| scaffold_149_1 | 773,189 | 773,819 | 156 | 12 |
|
| 10.4 | LSU-rRNA | |
| scaffold_170_1 | 166,669 | 167,980 | 61 | 10 |
|
| 29.2 | unknown | |
| scaffold_326_1 | 253,709 | 254,360 | 170 | 13 |
|
| 12.9 | LSU-rRNA | |
| scaffold_326_2 | 254,166 | 254,838 | 58 | 6 |
|
| 11.5 | LSU-rRNA | |
| scaffold_326_3 | 324,761 | 325,370 | 162 | 12 |
|
| 21.2 | LSU-rRNA | |
| sf_39_1 | 2,038,471 | 2,039,238 | 143 | 12 |
|
| 15.9 | putative TAS4 |
|
| scaffold_65_1 | 675,292 | 676,027 | 73 | 10 |
|
| 27.0 | putative TAS3b |
|
| scaffold_87_1 | 119,206 | 120,004 | 33 | 5 |
|
| 13.5 | putative NB-LRR disease resistance protein |
|
| sf_88_1 | 539,406 | 540,015 | 101 | 9 |
|
| 13.0 | LSU-rRNA | |
| scaffold_8_1 | 9,281,139 | 9,281,769 | 12 | 6 |
|
| 10.8 | Putative disease resistance protein RPP13 |
|
| scaffold_8_2 | 9,353,049 | 9,353,721 | 90 | 12 |
|
| 13.6 | Protein of unknown function | |
| scaffold_90_1 | 1,885,970 | 1,886,621 | 25 | 7 |
|
| 10.8 | putative Transcription factor MYB90 |
|
| PHAS_sf_173_1 | 496,493 | 497,186 | 45 | 7 |
|
| 7.0 | putative MYB transcription factor |
|
| PHAS_sf_21_1 | 1,961,563 | 1,962,329 | 81 | 8 |
|
| 7.8 | putative WER, MYB transcription factor |
|
| PHAS_sf_122_1 | 581,162 | 581,771 | 4 | 4 |
|
| 5.6 | putative NB-LRR disease resistance protein |
|
|
| |||||||||
| scaffold_24_1 | 6,030,639 | 6,031,302 | 82 | 7 |
|
| 10.0 | intergenic region | |
| scaffold_252_1 | 376,585 | 377,296 | 49 | 5 |
|
| 12.8 | intergenic region | |
| scaffold_287_1 | 359,861 | 360,500 | 20 | 5 |
|
| 11.2 | unknown gene | |
| scaffold_39_1 | 2,038,529 | 2,039,168 | 81 | 7 |
|
| 10.2 | overlapped with sf_39_1 | |
| scaffold_42_1 | 2,581,628 | 2,582,459 | 29 | 4 |
|
| 12.7 | expressed sequence match | |
| scaffold_5_1 | 3,813,044 | 3,813,827 | 7 | 4 |
|
| 10.1 | intron of a protein of unknown function | |
| scaffold_88_1 | 1,770,018 | 1,770,657 | 96 | 8 |
|
| 15.1 | unknown gene |
The Start and End column list the start and end positions of the predicted PHAS loci in the scaffold. The TR and PR column list the total number of unique siRNAs and the number of phased unique siRNAs, respectively. The P-value and FDR column list the P-values evaluated with Equation 1 and the false discovery ratio using method in [35]. The P.S. column lists the phase scores calculated using Equation 2. The Ref. column lists related literature of the predicted PHAS loci.
Figure 1The category of predicted PHAS loci.
(a) The category of 21 nt PHAS loci. (b) The category of 24 nt PHAS loci.
Figure 2The read distributions and phase scores of some predicted PHAS loci from NB-LRR disease resistance proteins and a MYB transcription factor.
The red and green diamonds represent the number of 21 nt reads, vertical axis, that appeared at the position of the PHAS loci, horizontal axis, in the flower and leaf small RNA libraries, respectively. The vertical gray lines with distances of 21 nt are the phased positions from the position with highest phase scores of the PHAS loci. The yellow boxes in the read distribution panel represent the miRNA complementary sites. Sites pointed by miRNAs from above and under zero read line means miRNAs complement to the plus and minus strand of the predicted PHAS loci, respectively. The predicted miR2118a complementary sites are shown below the phase score panel. (a) to (c) Three potential PHAS loci, scaffold_87_1, scaffold_170_1 and scaffold_8_1, from NB-LRR disease resistance proteins. The blue sequences in the complementary sites of (a) and (b) are one of the phasiRNAs from the PHAS loci. The miR2118a site in Part c is at 176 nt 5′ side (upstream) of the PHAS locus scaffold_8_1. (d) scaffold_90_1, from an MYB90 transcription factor locus.
Figure 5Some LSU-rRNA loci that generate putative phasiRNAs.
(a) A schematic view of phasiRNAs, and annotated genes around two predicted PHAS loci scaffold_326_1 and scaffold_326_2. (b) to (e) The read distributions and phase scores of scaffold_326_1/2, scaffold_149_1, scaffold_326_3 and sf_88_1, respectively. The legend are the same as those in Figure 2. In the lower panel of Part b, the blue and red bars represent the phase scores of scaffold_326_1 and scaffold_326_2, respectively.
Figure 3Putative TAS3c locus and the predicted targets of tasiRNAs in Chinese sacred lotus.
(a) The sequences of the three putative TAS3 loci in Chinese sacred lotus. The red and blue regions are 5′ and 3′ miR390 complementary sites, respectively. The regions of upper case nucleotides are mature tasiRNAs that target ARF family members, or tasiARFs. The underlined region of TAS3c are a phased siRNA TAS3_D6-2(+) with position 12. (b) The 5′ miR390 binding sites on TAS3 transcripts. Only the commonly aligned nucleotides are aligned to miR390. (c) The 3′ miR390 binding sites on TAS3 transcripts. (d) The mature tasiRNAs that target ARF family members derived from TAS3a/b/c loci. (e) The reads distribution and phase score of TAS3c. Legend are the same as those in Figure 2. The vertical dashed lines with distances of 21 nt are phase positions that start from position 10 of the 3′ miR390 site. (f) The predicted targets of tasiRNAs. The yellow and blue circle includes 7 and 8 ARF family members that are targeted by TAS3a/b and TAC3c derived tasiRNAs, respectively. (g) The complementary site of TAS3_D6-2(+) and an ARF family member, NNU_003220-RA.
Figure 4Putative TAS4 (sf_39_1, NNU_012673-RA) derives both 21 nt and 24 nt phasiRNAs.
(a) The schematic view of the predicted 21 nt and 24 nt putative TAS4 and its derived phasiRNAs, as well as annotated genes. (b) The miR828 site on putative TAS4 (sf_39_1). The yellow and underlined region represent the 21 nt and 24 nt that are nearest to the miR828 site. The position pointed by an arrow is the expected phase start position that is triggered by miR828. (c) and (d) The read distribution and phase score of the 21 nt and 24 nt PHAS loci predicted. The legend are the same as those in Figure 2. The distances between vertical gray lines in Part c and d are 21 nt and 24 nt, respectively.
Figure 6The read distribution and phase scores of some predicted 24 nt PHAS loci.
The legend are the same as those in Figure 2 except that the diamonds represent 24 nt reads and the distances between vertical gray lines are 24 nt. (a) scaffold_24_1. (b) scaffold_252_1. (c) scaffold_287_1. (d) scaffold_42_1. (e) scaffold_5_1. (f) scaffold_88_1.
Some predicted targets of phasiRNAs derived from the predicted PHAS loci.
| PhasiRNA | Target ID | M. | Target Description |
|
| |||
| sf_39_1_siR4 | NNU_018790 | 0 | Similar to MYB90 Transcription factor MYB90 |
| sf_8_2_siR11 | NNU_017525 | 1 | Protein of unknown function |
| sf_8_2_siR13 | NNU_017525 | 1 | Protein of unknown function |
| sf_8_2_siR11 | NNU_022433 | 1 | Protein of unknown function |
| sf_170_1_siR16 | NNU_021230 | 1 | Similar to RGA3 Putative disease resistance protein RGA3 |
| sf_170_1_siR27 | NNU_021230 | 1 | Similar to RGA3 Putative disease resistance protein RGA3 |
| sf_87_1_siR3 | NNU_025043 | 1 | Similar to RGA4 Putative disease resistance protein RGA4 |
| sf_8_2_siR11 | NNU_020674 | 1 | Similar to At4g18490 Uncharacterized protein |
| sf_170_1_siR16 | NNU_004711 | 1 | Similar to RPPL1 Putative disease resistance |
| sf_90_1_siR5 | NNU_018789 | 1 | Similar to MYB114 Transcription factor MYB114 |
| sf_90_1_siR4 | NNU_018789 | 1 | Similar to MYB114 Transcription factor MYB114 |
| sf_107_1_siR3 | NNU_023201 | 1.5 | Protein of unknown function |
| sf_8_2_siR14 | NNU_022433 | 1.5 | Protein of unknown function |
| sf_87_1_siR3 | NNU_018068 | 1.5 | Similar to RPPL1 Putative disease resistance |
| sf_170_1_siR6 | NNU_021731 | 1.5 | Similar to ABCG15 ABC transporter G family member 15 |
| sf_170_1_siR17 | NNU_000750 | 1.5 | Similar to At5g15080 Probable receptor-like protein kinase |
| sf_107_1_siR1 | NNU_006211 | 1.5 | Similar to EX1 Protein EXECUTER 1, chloroplastic |
| sf_170_1_siR27 | NNU_002459 | 1.5 | Similar to RGA4 Putative disease resistance protein RGA4 |
| sf_87_1_siR3 | NNU_002459 | 1.5 | Similar to RGA4 Putative disease resistance protein RGA4 |
| sf_107_1_siR1 | NNU_005527 | 1.5 | Similar to At3g12360 Ankyrin repeat-containing protein |
| sf_87_1_siR3 | NNU_018017 | 1.5 | Similar to RPPL1 Putative disease resistance |
| sf_170_1_siR25 | NNU_012692 | 1.5 | Similar to RGA3 Putative disease resistance protein RGA3 |
| sf_90_1_siR4 | NNU_018784 | 1.5 | Protein of unknown function |
| sf_8_2_siR11 | NNU_004714 | 1.5 | Protein of unknown function |
| sf_90_1_siR5 | NNU_018786 | 1.5 | Similar to MYB90 Transcription factor MYB90 |
| sf_90_1_siR4 | NNU_018786 | 1.5 | Similar to MYB90 Transcription factor MYB90 |
| sf_8_1_siR6 | NNU_007200 | 1.5 | Similar to At4g15470 BI1-like protein |
| sf_106_1_siR10 | NNU_011742 | 1.5 | Similar to GATA18 GATA transcription factor 18 |
| sf_39_1_siR8 | NNU_008226 | 1.5 | Similar to SLC4A1AP Kanadaptin (Homo sapiens) |
| sf_10_1_siR10 | NNU_014786 | 1.5 | Similar to UBP22 Ubiquitin carboxyl-terminal hydrolase 22 |
| sf_8_2_siR13 | NNU_004714 | 1.5 | Protein of unknown function |
|
| |||
| sfd_88_1_siR7 | NNU_012891 | 2.0 | Similar to slr0992 Putative tRNA methyltransferase |
| sfd_5_1_siR8 | NNU_013967 | 2.0 | Similar to PDR12 Pleiotropic drug resistance protein 12 |
| sfd_24_1_siR3 | NNU_016850 | 2.5 | Similar to DNAJB13 DnaJ homolog subfamily B member 13 |
| sfd_88_1_siR6 | NNU_012697 | 2.5 | Similar to At1g68650 GDT1-like protein 5 |
| sfd_5_1_siR8 | NNU_016474 | 2.5 | Similar to TIFY6B Protein TIFY 6B |
| sfd_39_1_siR3 | NNU_003001 | 2.5 | Similar to MYB315 Myb-related protein 315 |
| sfd_88_1_siR7 | NNU_010637 | 2.5 | Similar to ABP19A Auxin-binding protein ABP19a |
| sfd_88_1_siR7 | NNU_011269 | 2.5 | Protein of unknown function |
| sfd_88_1_siR7 | NNU_005826 | 2.5 | Protein of unknown function |
| sfd_39_1_siR3 | NNU_017703 | 2.5 | Similar to arfA ADP-ribosylation factor 1 |
| sfd_88_1_siR8 | NNU_007725 | 2.5 | Similar to WRKY33 Probable WRKY transcription factor 33 |
| sfd_88_1_siR7 | NNU_010742 | 2.5 | Similar to NPR1 Regulatory protein NPR1 |
| sfd_88_1_siR7 | NNU_009141 | 2.5 | Similar to TPR2 Topless-related protein 2 |
The M. column is the number of mismatches of the miRNA complementary sites. The names of the 21 nt and 24 nt phasiRNAs have been abbreviated to start with sf_ and sfd_, respectively.