Literature DB >> 18474525

pssRNAMiner: a plant short small RNA regulatory cascade analysis server.

Xinbin Dai1, Patrick Xuechun Zhao.   

Abstract

In plants, short RNAs including approximately 21-nt microRNA (miRNA) and 21-nt trans-acting siRNA (ta-siRNA) compose a 'miRNA --> ta-siRNA --> target gene' cascade pathway that regulates gene expression at the posttranscriptional level. In this cascade, biogenesis of ta-siRNA clusters requires 21-nt intervals (i.e. phasing) and miRNA (phase-initiator) cleavage sites on its TAS transcript. Here, we report a novel web server, pssRNAMiner, which is developed to identify both the clusters of phased small RNAs as well as the potential phase-initiator. To detect phased small RNA clusters, the pssRNAMiner maps input small RNAs against user-specified transcript/genomic sequences, and then identifies phased small RNA clusters by evaluating P-values of hypergeometric distribution. To identify potential phase-initiators, pssRNAMiner aligns input phase-initiators with transcripts of TAS candidates using the Smith-Waterman algorithm. Potential cleavage sites on TAS candidates are further identified from complementary regions by weighting the alignment expectation and its distance to detected phased small RNA clusters. The pssRNAMiner web server is freely available at http://bioinfo3.noble.org/pssRNAMiner/.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18474525      PMCID: PMC2447807          DOI: 10.1093/nar/gkn297

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

In plants, small regulatory RNAs including miRNAs, heterochromatic siRNAs (hc-siRNAs), repeat-associated siRNAs (ra-siRNAs), natural sense–antisense transcript siRNAs and trans-acting siRNAs (ta-siRNAs) are involved in regulating gene expression through various mechanisms (1). Recent studies have demonstrated that these various small RNA molecules, in combination with cellular transcription factors, form the basis of network responsible for regulating the cellular gene expression. The ta-siRNA, a newly identified class of 21-nt short siRNAs, play an essential role in bridging the miRNA and siRNA pathways, which were previously believed to be two independent processes (2–5). Phase-initiators direct the cleavage of ta-siRNA gene (TAS) primary transcripts (precursors) and subsequently initiate the production of ta-siRNA clusters (4). To date, only a few known phase-initiators are ta-siRNAs as most of reported phase-initiators are miRNAs. Following cleavage of the precursor, the 3′ (or 5′)-cleavage products are converted into double-stranded RNA (dsRNA) by RDR6 and SGS3. The dsRNA is then processed into 21-nt increments, relative to the original cleavage site on both strands, by DCL4 in order to produce ta-siRNA clusters (3,4). The ta-siRNAs form an essential component of the RISC complex responsible for guiding AGO-dependent cleavage of the target transcript. Based on these data, several researchers have proposed a ‘miRNA → ta-siRNA → target gene’ cleavage cascade as an element of gene regulatory network in the model plant Arabidopsis (4–7). In this paper, we describe the development of the pssRNAMiner, a plant short small RNA regulatory cascade analysis server that is able to identify potential ta-siRNAs clusters and their phase-initiators. While miRNA biogenesis is dependent on the hairpin structure of the precursor, the formation of siRNA molecules occurs as a result of the processing of dsRNA by RNA-dependent RNA polymerases (RDRs). Therefore, as compared to miRNA, there is no effective computational approach to identify siRNAs from either genomic or transcript sequence. Since 21-nt phased ta-siRNAs are generated from TAS precursors, it is possible to map known small RNA molecules to candidate TAS precursors and then cluster these small RNAs based on their 21-nt phase properties (8,9). By evaluating the P-value of phase features, Chen et al. (6) identified and validated a number of known, and several previously unknown, ta-siRNA clusters as well as the associated TAS gene loci in Arabidopsis. TAS precursors require at least one valid cleavage site, which is targeted by a phase-initiator to generate ta-siRNAs. While some TAS genes possess multiple regions that are complementary with the phase-initiator, only one region effectively guides cleavage in vivo. A ‘two-hit trigger’ mechanism was recently proposed to correlate the number of complementary regions with the activity of phased-initiators based on evidences from studies performed in moss and Arabidopsis (7). Therefore, it is anticipated that analysis of both complementary regions as well as valid cleavage sites in TAS candidate precursors would facilitate screening of phased small RNA clusters. In this study, we describe the development of pssRNAMiner, a web-based server which identifies ta-siRNA clusters as well as their potential phase-initiators. This program requires that the user submit a set of small RNAs and specify one of listed transcript/genomic libraries for mapping. To identify phase-initiators, the user must submit at least one small RNA as candidate phase initiator. The pssRNAMiner is able to identify phased small RNA clusters as ta-siRNA candidates by evaluating the P-values of hypergeometric distribution. Furthermore, pssRNAMiner has the ability to identify potential phase-initiators based on the user input. To date, pssRNAMiner hosts 29 transcript/genomic sequence libraries from 20 species.

METHODS

Detection of phased small RNA clusters

The pssRNAMiner improves a previously described method of evaluating the P-value of random hypergeometric distribution to detect phased small RNA clusters (6). First, pssRNAMiner maps the input small RNAs on transcript sequences and records each position. Then, as described by Chen et al. (6), the algorithm slides on both strands of the transcript sequence to search each mapped small RNA and count the number of phased/nonphased positions with small RNA hits in a 231 bp region downstream of the 5′ start site of the small RNA. Equations (1) and (2), revised from Chen et al. (6), are used to calculate the P-value of phased small RNA clusters on the basis of a random hypergeometric distribution. Since the cleavage of phased small RNAs often occurs within 1–2 nt of the phased positions (8), we introduced a variable, s, in Equation (1) to reflect this shift. The addition of this variable will have the added effect of reducing the total number of nonphased position within the 231 bp region. n: Number of total positions having small RNA hits in 231-bp region; k: Number of phase positions having small RNA hits in 231-bp region; s: Maximum allowed offset from phase position

Analysis of cleavage sites guided by phase-initiator

In addition to detecting phased small RNA clusters, pssRNAMiner has the ability to predict whether the input candidate phase-initiators have the potential to guide the cleavage of TAS candidate precursors and trigger the biogenesis of phased small RNA clusters identified in the first step. To perform this analysis, pssRNAMiner first searches the complementary regions between the phase-initiator and TAS candidate precursor using the Smith–Waterman algorithm (10). It then ranks the alignment of complementary regions based on a scoring scheme described by Zhang (11,12). Since the Smith–Waterman algorithm is only able to identify the optimal alignment for each pair of query/target sequences, an iterative algorithm has been developed to obtain as many complementary regions as possible. During each iteration, the identified complementary region of the precursor sequences are masked by ‘N’ letters for the next iterative search until no further complementary region(s) can be identified in a given query/target sequence. Next, the server identifies whether these complementary regions have valid cleavage sites, which can result in the production of phased small RNA clusters. We applied two conditions in this analysis. First, the user-defined cleavage site in the complementary region must be located within a region calculated by B1 - D, B2 + D], where B1, start position of phased small RNA cluster on the TAS precursor; B2, end position of phased small RNA cluster; D, a user-specific maximum distance between cleavage site and start site of small RNA cluster. Second, the distance between the cleavage site and start site of the small RNA cluster must meet the 21-nt phase property (i.e. multiples of 21 nt).

Architecture and implementation of pssRNAMiner

The pssRNAMiner consists of two independent components, a backend pipeline that is responsible for directing the core functions and a web server. The core functions, which include the calculation of the P-value and analysis of cleavage sites directed by the phase-initiator, are performed by the backend pipeline written in Java and PERL. To perform a search for complementary regions, a third party software ssearch has been used for Smith–Waterman alignment. The web server was developed on top of a SQLite database. Scripts used to generate web interfaces and search results were written in PHP.

USER INTERFACES

To identify 21-nt phased small RNA clusters as ta-siRNA candidates from the existing small RNA dataset, the user is required to submit small RNA sequences in either a simple sequence format or multi-FASTA format. The pssRNAMiner is able to recognize small RNA sequences between 17 and 28 nt in length but more readily identifies sequences of 21 nt in length. In addition, the user needs to specify a transcript/genomic library for mapping (Figure 1a). Following submission, pssRNAMiner maps each of the small RNAs onto the transcript/genomic sequence and then, calculates a P-value, in order to evaluate the potential phased small RNA cluster. The pssRNAMiner lists the candidate clusters having P-values lower than a user specified cutoff threshold (Figure 2a). The list view page enables the use of a filtering/searching function, in order to remove false positive clusters, such as transposon or ribosomal RNAs. In addition, this function enables the user to list only those clusters of interest by filtering based on specific annotation keywords or IDs. The users can further inspect the details of each cluster, including phased/nonphased small RNA sequences, the location of small RNAs on TAS candidate precursors, functional annotation of TAS candidate and valid cleavage site, etc. by clicking on the individual cluster ID for a detailed view (Figure 2b and c). The users are able to further check potential target sequences of these detected ta-siRNA candidates by clicking ‘search’ at the end of detailed view.
Figure 1.

(a) Input interface for searching phased small RNA clusters. (b) Input interface for optional identification of phase-initiator for detected small RNA clusters.

Figure 2.

(a) Output: list view of phased small RNA clusters with phase-initiator cleavage sites on TAS candidate. (b) Output: detailed view of a specific phased small RNA clusters with phase-initiator cleavage sites on a TAS candidate. (c) Output: detailed view of the alignment of specific phased small RNA clusters with phase-initiator cleavage sites on a TAS candidate.

(a) Input interface for searching phased small RNA clusters. (b) Input interface for optional identification of phase-initiator for detected small RNA clusters. (a) Output: list view of phased small RNA clusters with phase-initiator cleavage sites on TAS candidate. (b) Output: detailed view of a specific phased small RNA clusters with phase-initiator cleavage sites on a TAS candidate. (c) Output: detailed view of the alignment of specific phased small RNA clusters with phase-initiator cleavage sites on a TAS candidate. Based on user preferences, pssRNAMiner can also identify potential cleavage site guided by phase-initiator on TAS candidate precursor. For pssRNAMiner to perform this function, it is necessary for the user to submit potential phase-initiators in multi-FASTA format and specify the maximum expectation for screening the complementary region between the phase-initiators and TAS candidate precursors. In addition, it is necessary for the user to specify the expected cleavage range in the complementary region (generally between 9 and 11 nt on miRNA sequence) (Figure 1b). After submission, the server aligns input phase-initiators with TAS candidate precursors to locate complementary regions. Next, the server searches valid cleavage sites from the complementary regions based on the distance between the cleavage site and the phased small RNA clusters and output a web page that lists clusters with cleavage site information (Figure 2a–c). To demonstrate the effectiveness of pssRNAMiner and to facilitate its use, we have integrated a number of published small RNA datasets into pssRNAMiner. These data can be found under the ‘dataset’ sub-menu. Each dataset has been pre-run against their corresponding transcript/genomic sequences, and therefore users may click links on the right-hand column of dataset table to view these precalculated results.

PERFORMANCE

Finally, to evaluate the performance of pssRNAMiner, we downloaded RDR small RNA MPSS libraries from the Arabidopsis MPSS Plus database (http://mpss.udel.edu/at/) (13,14). A total of 11 767 distinct small RNA signatures were analyzed based on the Arabidopsis TAIR7 cDNA release (ftp://ftp.arabidopsis.org/home/tair/Genes/TAIR7_genome_release/). When these data were investigated, pssRNAMiner detected 124 significantly phased small RNA clusters (P < 0.005) at 19 gene loci. Moreover, pssRNAMiner detected all the reported ta-siRNA gene loci with significant P-value, including TAS1a (AT2G27400), TAS1b (AT1G50055), TAS1c (AT2G39675), TAS2 (AT2G39681), TAS3a (AT3G17185) and PPR proteins (AT1G63080 and AT1G63130) (see Supplementary Material for the list of detected phased RNA clusters and gene loci). To investigate the ability of pssRNAMiner to identify valid cleavage site of phase-initiators, a reported phase-initiator, miRNA ‘ath-miR390’, was submitted for cleavage site analysis on TAS candidate precursors. The pssRNAMiner reported two complementary regions (expectation ≤ 5) with one region having a valid cleavage site on TAS3a. These results are consistent with previously published studies that demonstrated that the ath-miR390 initiates production of phased small RNA clusters on TAS3a by guiding precursor cleavage (2,4) (see detailed alignment in the Supplementary Material).

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  14 in total

1.  The use of MPSS for whole-genome transcriptional analysis in Arabidopsis.

Authors:  Blake C Meyers; Shivakundan Singh Tej; Tam H Vu; Christian D Haudenschild; Vikas Agrawal; Steve B Edberg; Hassan Ghazal; Shannon Decola
Journal:  Genome Res       Date:  2004-08       Impact factor: 9.043

2.  Endogenous trans-acting siRNAs regulate the accumulation of Arabidopsis mRNAs.

Authors:  Franck Vazquez; Hervé Vaucheret; Ramya Rajagopalan; Christelle Lepers; Virginie Gasciolli; Allison C Mallory; Jean-Louis Hilbert; David P Bartel; Patrice Crété
Journal:  Mol Cell       Date:  2004-10-08       Impact factor: 17.970

3.  microRNA-directed phasing during trans-acting siRNA biogenesis in plants.

Authors:  Edwards Allen; Zhixin Xie; Adam M Gustafson; James C Carrington
Journal:  Cell       Date:  2005-04-22       Impact factor: 41.582

4.  A two-hit trigger for siRNA biogenesis in plants.

Authors:  Michael J Axtell; Calvin Jan; Ramya Rajagopalan; David P Bartel
Journal:  Cell       Date:  2006-11-03       Impact factor: 41.582

5.  Endogenous siRNAs derived from a pair of natural cis-antisense transcripts regulate salt tolerance in Arabidopsis.

Authors:  Omar Borsani; Jianhua Zhu; Paul E Verslues; Ramanjulu Sunkar; Jian-Kang Zhu
Journal:  Cell       Date:  2005-12-29       Impact factor: 41.582

6.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

7.  A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana.

Authors:  Ramya Rajagopalan; Hervé Vaucheret; Jerry Trejo; David P Bartel
Journal:  Genes Dev       Date:  2006-12-15       Impact factor: 11.361

8.  SGS3 and SGS2/SDE1/RDR6 are required for juvenile development and the production of trans-acting siRNAs in Arabidopsis.

Authors:  Angela Peragine; Manabu Yoshikawa; Gang Wu; Heidi L Albrecht; R Scott Poethig
Journal:  Genes Dev       Date:  2004-10-01       Impact factor: 11.361

9.  miRU: an automated plant miRNA target prediction server.

Authors:  Yuanji Zhang
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

10.  Principles of microRNA-target recognition.

Authors:  Julius Brennecke; Alexander Stark; Robert B Russell; Stephen M Cohen
Journal:  PLoS Biol       Date:  2005-03       Impact factor: 8.029

View more
  16 in total

Review 1.  Role of phasiRNAs in plant-pathogen interactions: molecular perspectives and bioinformatics tools.

Authors:  S Jyothsna; Manickavelu Alagu
Journal:  Physiol Mol Biol Plants       Date:  2022-05-27

2.  Analysis of antisense expression by whole genome tiling microarrays and siRNAs suggests mis-annotation of Arabidopsis orphan protein-coding genes.

Authors:  Casey R Richardson; Qing-Jun Luo; Viktoria Gontcharova; Ying-Wen Jiang; Manoj Samanta; Eunseog Youn; Christopher D Rock
Journal:  PLoS One       Date:  2010-05-26       Impact factor: 3.240

3.  PhasiRNAnalyzer: an integrated analyser for plant phased siRNAs.

Authors:  Yuhan Fei; Jiejie Feng; Rui Wang; Baoyi Zhang; Hongsheng Zhang; Ji Huang
Journal:  RNA Biol       Date:  2021-02-04       Impact factor: 4.652

4.  An efficient algorithm for systematic analysis of nucleotide strings suitable for siRNA design.

Authors:  Ancha Baranova; Jonathan Bode; Ganiraju Manyam; Maria Emelianenko
Journal:  BMC Res Notes       Date:  2011-05-27

5.  Mining Functional Elements in Messenger RNAs: Overview, Challenges, and Perspectives.

Authors:  Firoz Ahmed; Vagner A Benedito; Patrick Xuechun Zhao
Journal:  Front Plant Sci       Date:  2011-11-30       Impact factor: 5.753

6.  Bioinformatics analysis of small RNAs in pima (Gossypium barbadense L.).

Authors:  Hongtao Hu; Dazhao Yu; Hong Liu
Journal:  PLoS One       Date:  2015-02-13       Impact factor: 3.240

7.  The Complexity of Posttranscriptional Small RNA Regulatory Networks Revealed by In Silico Analysis of Gossypium arboreum L. Leaf, Flower and Boll Small Regulatory RNAs.

Authors:  Hongtao Hu; Aaron M Rashotte; Narendra K Singh; David B Weaver; Leslie R Goertzen; Shree R Singh; Robert D Locy
Journal:  PLoS One       Date:  2015-06-12       Impact factor: 3.240

8.  MicroRNA and tasiRNA diversity in mature pollen of Arabidopsis thaliana.

Authors:  Robert Grant-Downton; Gael Le Trionnaire; Ralf Schmid; Josefina Rodriguez-Enriquez; Said Hafidh; Saher Mehdi; David Twell; Hugh Dickinson
Journal:  BMC Genomics       Date:  2009-12-30       Impact factor: 3.969

9.  Differential RNAi responses of Nicotiana benthamiana individuals transformed with a hairpin-inducing construct during Plum pox virus challenge.

Authors:  Christian Montes; Álvaro Castro; Paola Barba; Julia Rubio; Evelyn Sánchez; Denisse Carvajal; Carlos Aguirre; Eduardo Tapia; Paola DelÍ Orto; Veronique Decroocq; Humberto Prieto
Journal:  Virus Genes       Date:  2014-06-26       Impact factor: 2.198

Review 10.  Small RNAs in Plant Responses to Abiotic Stresses: Regulatory Roles and Study Methods.

Authors:  Yee-Shan Ku; Johanna Wing-Hang Wong; Zeta Mui; Xuan Liu; Jerome Ho-Lam Hui; Ting-Fung Chan; Hon-Ming Lam
Journal:  Int J Mol Sci       Date:  2015-10-15       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.