Literature DB >> 15608278

ASRP: the Arabidopsis Small RNA Project Database.

Adam M Gustafson1, Edwards Allen, Scott Givan, Daniel Smith, James C Carrington, Kristin D Kasschau.   

Abstract

Eukaryotes produce functionally diverse classes of small RNAs (20-25 nt). These include microRNAs (miRNAs), which act as regulatory factors during growth and development, and short-interfering RNAs (siRNAs), which function in several epigenetic and post-transcriptional silencing systems. The Arabidopsis Small RNA Project (ASRP) seeks to characterize and functionally analyze the major classes of endogenous small RNAs in plants. The ASRP database provides a repository for sequences of small RNAs cloned from various Arabidopsis genotypes and tissues. Version 3.0 of the database contains 1920 unique sequences, with tools to assist in miRNA and siRNA identification and analysis. The comprehensive database is publicly available through a web interface at http://asrp.cgrb.oregonstate.edu.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15608278      PMCID: PMC540081          DOI: 10.1093/nar/gki127

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Recent studies revealed that plants contain populations of small RNAs (20–25 nt) that belong to two major classes—microRNAs (miRNAs) and endogenous short-interfering RNAs (siRNAs). MIRNA gene transcripts adopt imperfect foldback structures and are processed by DICER-LIKE 1 (DCL1), resulting in 20–22 nt miRNAs. Mature miRNAs function as post-transcriptional regulators that guide either site-specific cleavage or non-degradative repression of target mRNAs (1). In many cases, disruption of miRNA-mediated control results in severe developmental abnormalities (2–10). siRNAs arise from endogenous transcripts that form dsRNA structures, or that are substrates for RNAi pathways. Processing of siRNAs often requires other DCL proteins, such as DCL3 (11). In addition, biogenesis of several classes of endogenous siRNAs requires RNA-dependent RNA polymerases, such as RDR2 (11). siRNA-generating loci often yield multiple, overlapping clusters of small RNAs, in contrast to MIRNA loci that generally yield a single miRNA. Endogenous siRNAs arise from repetitive sequences, transposons and retroelements, genomic regions containing inverted duplications, as well as other genic and intergenic regions. A subset of siRNAs also act to guide or assist formation of heterochromatin (12–14). A subclass of siRNAs has been shown to guide cleavage of specific target mRNAs in trans, similar to miRNAs. Biogenesis of trans-acting siRNAs (ta-siRNAs) requires DCL1 and RDR6 (15,16). In contrast to miRNA genes, ta-siRNA precursor transcripts do not form a foldback structure, but rather both sense and antisense small RNAs are processed from perfectly complementary RNA duplexes. Several small RNA libraries have been constructed from Arabidopsis thaliana plants with the primary goal to identify miRNAs and endogenous siRNAs (17–22). The aim of the Arabidopsis Small RNA Project (ASRP) is to analyze small RNAs from different tissues and genotypes of Arabidopsis, provide a public database of cloned small RNA sequences and develop web-based tools to assist in analysis of small RNA populations. These resources are intended to aid the identification of miRNAs and MIRNA genes, and to enable functional analysis of siRNA-producing regions of the genome.

DATABASE CONTENT

The ASRP database currently contains 5521 small RNA entries representing 1920 unique sequences. The collection represents small RNA sequences from both in-house cloning projects and sequences deposited in the miRNA registry (23). For sequences derived in-house, multiple small RNA libraries were constructed from Arabidopsis (Columbia-0 ecotype) at various developmental stages, including embryos, 3-day post germination seedlings, aerial tissues (including rosette leaves and apical meristems) and inflorescences (stages 1–12). To genetically enrich for miRNA populations, libraries were constructed from rdr2-1 and dcl3-1 mutants that have defects in the chromatin siRNA pathway. All unique sequences were given an independent ASRP database (DBE) identifier.

DATABASE ORGANIZATION

The ASRP database relies on freely available and open-source software. The ASRP graphical user interface (GUI) is composed of web pages delivered by an apache HTTP server (http://httpd.apache.org). In addition, the server incorporates mod_perl (http://perl.apache.org) and Mason (http://www.masonhq.org) to dynamically produce web pages based upon user input. The vast majority of the GUI is generated by custom Perl code that increasingly incorporates object-oriented coding practices to improve extensibility and re-usability of the individual software components. Bioperl (24) is used for specific tasks, such as parsing the GenBank files containing the Arabidopsis chromosomes. The GUI interacts with a custom database backend utilizing Structured Query Language (SQL) and the open source MySQL (http://www.mysql.com) database engine. Table structures and specific query statements conform to standard SQL language syntax and are portable to other SQL database engines. Currently, the ASRP database resides on a custom-configured server managed by the RedHat Linux AS operating system.

DATA ACCESS AND WEB INTERFACE

The ASRP database web interface enables users to view and analyze the small RNAs in text and graphical formats. Data for each small RNA is stored in MySQL database tables that are easily sorted and searched. Through the web interface, users may sort and view the small RNA data in the following ways: All small RNAs. This page displays basic information about all unique small RNAs in the database, including, if applicable, the miRNA or ta-siRNA name, number of loci in the Arabidopsis genome, number of near predicted loci in the Rice genome, number of potential mRNA targets and number of times isolated. More information about a specific small RNA is available by following the database number (DBE#) link. Small RNA clusters. Some small RNA loci are clustered in the Arabidopsis genome. This page displays clusters containing a minimum of four small RNA loci, with each within 500 nt of the next small RNA loci. From this page, the user can view the sequences and positions of the small RNAs in each cluster in text format or the cluster can be viewed graphically in relation to the Arabidopsis genome using an open access genome viewer (25). miRNAs. All small RNAs characterized as miRNAs are displayed in a similar format as section (i) (Figure 1A). The display page for each individual miRNA is split into four sections; general information, Arabidopsis MIRNA genes, predicted and validated target genes, and Oryza sativa MIRNA genes (Figure 1B). The general information section includes the sequence and source of the miRNA. The predicted foldback structure for the pre-miRNA, the flanking sequence around the MIRNA gene and the graphical genome view are available through links on the Arabidopsis MIRNA genes section (Figure 1C). Information about the predicted target genes, the target-miRNA binding site, and the computational or experimental validity of the target-miRNA binding site is displayed in the third section (Figure 1B). The fourth section displays information about the small RNA in O.sativa, including the predicted secondary structure of validated precursor miRNAs.
Figure 1

Windows from the ASRP database website. (A) A partial list of all miRNAs in the database. (B) Information specific to a single miRNA. (C) Display from the genome browser.

ta-siRNAs. All published ta-siRNAs are in the database. The page is similar in format to the miRNA page. General information, ta-siRNA-generating locus information, and predicted target genes are displayed. The user can view the information about the ta-siRNAs in a manner similar to the miRNA section of the database. Annotated small RNAs. Automated annotation programs such as RepeatMasker (http://ftp.genome.washington.edu/RM/RepeatMasker.html) are used to identify small RNAs that originate from genomic regions of highly repetitive sequences, as well as transposons and retroelements. The user can display and sort small RNAs by the specific class of annotated repeat element such as MuDR or SINE. In addition to the sorting features, the web interface provides users with a variety of searching capabilities. Quick searches enable users to locate specific miRNAs based on either the miRNA names or the ASRP database identifiers (DBE#). To search for small RNAs predicted to target specific Arabidopsis genes, or that originate from generic sequences, the locus identifiers (e.g. At3g60630) or user-defined FASTA formatted sequences are used, respectively. Finally, users can determine if a small RNA sequence is represented in the ASRP database by searching the sequence against the entire population of small RNAs.

AVAILABILITY

All small RNAs in the ASRP database are available through the publicly available website (http://asrp.cgrb.oregonstate.edu) or can be downloaded in FASTA format from the website download page (http://asrp.cgrb.oregonstate.edu/downloads/).

FURTHER DIRECTIONS

The ASRP database was created to serve as a repository and tool to facilitate the analysis of miRNAs and endogenous siRNAs and their targets. To increase accessibility of the database, we are working to more completely integrate the ASRP database with existing Arabidopsis resources, such as TAIR. In addition, integration of miRNAs, ta-siRNAs and endogenous siRNAs from the database with other research projects, such as genomic tilling microarrays and chromatin immunoprecipitation arrays (14,26), will enhance the information acquired from these experiments and further expand our understanding of small RNA function. There are still many unanswered questions concerning miRNAs, ta-siRNAs and endogenous siRNAs. The regulatory roles of miRNA-target gene interaction, the regulation of MIRNA gene expression, and the function of siRNAs in the regulation of chromatin structure and gene silencing are just a few questions currently being studied. Future plans include the integration of data from genome-scale microarray projects into the ASRP database (27). The scope of the database may widen with the addition of other plant genomes, libraries or computational analysis. The inclusion of additional plant genomes will enable a more in-depth study of miRNA evolution and conservation and activities of endogenous siRNAs.
  27 in total

1.  Radial patterning of Arabidopsis shoots by class III HD-ZIP and KANADI genes.

Authors:  John F Emery; Sandra K Floyd; John Alvarez; Yuval Eshed; Nathaniel P Hawker; Anat Izhaki; Stuart F Baum; John L Bowman
Journal:  Curr Biol       Date:  2003-10-14       Impact factor: 10.834

2.  Dissection of floral induction pathways using global expression analysis.

Authors:  Markus Schmid; N Henriette Uhlenhaut; François Godard; Monika Demar; Ray Bressan; Detlef Weigel; Jan U Lohmann
Journal:  Development       Date:  2003-10-22       Impact factor: 6.868

3.  Endogenous and silencing-associated small RNAs in plants.

Authors:  Cesar Llave; Kristin D Kasschau; Maggie A Rector; James C Carrington
Journal:  Plant Cell       Date:  2002-07       Impact factor: 11.277

4.  The microRNA Registry.

Authors:  Sam Griffiths-Jones
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

5.  A biochemical framework for RNA silencing in plants.

Authors:  Guiliang Tang; Brenda J Reinhart; David P Bartel; Phillip D Zamore
Journal:  Genes Dev       Date:  2003-01-01       Impact factor: 11.361

6.  The generic genome browser: a building block for a model organism system database.

Authors:  Lincoln D Stein; Christopher Mungall; ShengQiang Shu; Michael Caudy; Marco Mangone; Allen Day; Elizabeth Nickerson; Jason E Stajich; Todd W Harris; Adrian Arva; Suzanna Lewis
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

7.  MicroRNAs in plants.

Authors:  Brenda J Reinhart; Earl G Weinstein; Matthew W Rhoades; Bonnie Bartel; David P Bartel
Journal:  Genes Dev       Date:  2002-07-01       Impact factor: 11.361

8.  SGS3 and SGS2/SDE1/RDR6 are required for juvenile development and the production of trans-acting siRNAs in Arabidopsis.

Authors:  Angela Peragine; Manabu Yoshikawa; Gang Wu; Heidi L Albrecht; R Scott Poethig
Journal:  Genes Dev       Date:  2004-10-01       Impact factor: 11.361

9.  microRNA-mediated repression of rolled leaf1 specifies maize leaf polarity.

Authors:  Michelle T Juarez; Jonathan S Kui; Julie Thomas; Bradley A Heller; Marja C P Timmermans
Journal:  Nature       Date:  2004-03-04       Impact factor: 49.962

10.  Modulation of floral development by a gibberellin-regulated microRNA.

Authors:  Patrick Achard; Alan Herr; David C Baulcombe; Nicholas P Harberd
Journal:  Development       Date:  2004-07       Impact factor: 6.868

View more
  94 in total

1.  Plant secondary siRNA production determined by microRNA-duplex structure.

Authors:  Pablo A Manavella; Daniel Koenig; Detlef Weigel
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-30       Impact factor: 11.205

2.  Maternal siRNAs as regulators of parental genome imbalance and gene expression in endosperm of Arabidopsis seeds.

Authors:  Jie Lu; Changqing Zhang; David C Baulcombe; Z Jeffrey Chen
Journal:  Proc Natl Acad Sci U S A       Date:  2012-03-19       Impact factor: 11.205

3.  High-resolution experimental and computational profiling of tissue-specific known and novel miRNAs in Arabidopsis.

Authors:  Natalie W Breakfield; David L Corcoran; Jalean J Petricka; Jeffrey Shen; Juthamas Sae-Seaw; Ignacio Rubio-Somoza; Detlef Weigel; Uwe Ohler; Philip N Benfey
Journal:  Genome Res       Date:  2011-09-22       Impact factor: 9.043

4.  Comparative analysis of miRNAs and their targets across four plant species.

Authors:  Dorina Lenz; Patrick May; Dirk Walther
Journal:  BMC Res Notes       Date:  2011-11-08

5.  A pathway for the biogenesis of trans-acting siRNAs in Arabidopsis.

Authors:  Manabu Yoshikawa; Angela Peragine; Mee Yeon Park; R Scott Poethig
Journal:  Genes Dev       Date:  2005-08-30       Impact factor: 11.361

Review 6.  Genetic networks.

Authors:  Steven P Briggs; Tatjana Singer
Journal:  Plant Physiol       Date:  2005-06       Impact factor: 8.340

7.  DICER-LIKE 1 and DICER-LIKE 3 redundantly act to promote flowering via repression of FLOWERING LOCUS C in Arabidopsis thaliana.

Authors:  Robert J Schmitz; Lewis Hong; Kathleen E Fitzpatrick; Richard M Amasino
Journal:  Genetics       Date:  2007-06       Impact factor: 4.562

8.  VirtualPlant: a software platform to support systems biology research.

Authors:  Manpreet S Katari; Steve D Nowicki; Felipe F Aceituno; Damion Nero; Jonathan Kelfer; Lee Parnell Thompson; Juan M Cabello; Rebecca S Davidson; Arthur P Goldberg; Dennis E Shasha; Gloria M Coruzzi; Rodrigo A Gutiérrez
Journal:  Plant Physiol       Date:  2009-12-09       Impact factor: 8.340

9.  Transcriptome-wide analysis of uncapped mRNAs in Arabidopsis reveals regulation of mRNA degradation.

Authors:  Yuling Jiao; José Luis Riechmann; Elliot M Meyerowitz
Journal:  Plant Cell       Date:  2008-10-24       Impact factor: 11.277

10.  Transduction of RNA-directed DNA methylation signals to repressive histone marks in Arabidopsis thaliana.

Authors:  Hisataka Numa; Jong-Myong Kim; Akihiro Matsui; Yukio Kurihara; Taeko Morosawa; Junko Ishida; Yoshiki Mochizuki; Hiroshi Kimura; Kazuo Shinozaki; Tetsuro Toyoda; Motoaki Seki; Manabu Yoshikawa; Yoshiki Habu
Journal:  EMBO J       Date:  2009-12-10       Impact factor: 11.598

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.