Literature DB >> 30364952

Plant editosome database: a curated database of RNA editosome in plants.

Man Li1,2,3, Lin Xia1,2,3, Yuansheng Zhang1,2,3, Guangyi Niu1,2,3, Mengwei Li1,2,3, Pei Wang1,2,3, Yang Zhang1,2,3, Jian Sang1,2,3, Dong Zou1,2, Songnian Hu2,3, Lili Hao1,2, Zhang Zhang1,2,3,4.   

Abstract

RNA editing plays an important role in plant development and growth, enlisting a number of editing factors in the editing process and accordingly revealing the diversity of plant editosomes for RNA editing. However, there is no resource available thus far that integrates editosome data for a variety of plants. Here, we present Plant Editosome Database (PED; http://bigd.big.ac.cn/ped), a curated database of RNA editosome in plants that is dedicated to the curation, integration and standardization of plant editosome data. Unlike extant relevant databases, PED incorporates high-quality editosome data manually curated from related publications and organelle genome annotations. In the current version, PED integrates a complete collection of 98 RNA editing factors and 20 836 RNA editing events, covering 203 organelle genes and 1621 associated species. In addition, it contains functional effects of editing factors in regulating plant phenotypes and includes detailed experimental evidence. Together, PED serves as an important resource to help researchers investigate the RNA editing process across a wide range of plants and thus would be of broad utility for the global plant research community.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30364952      PMCID: PMC6324055          DOI: 10.1093/nar/gky1026

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

RNA editing is a post-transcriptional process that changes specific nucleotides of a transcript by nucleotide conversion/insertion/deletion. Studies have shown that a large number of plants undergo RNA editing, albeit to date confined to plant organelles (plastid and mitochondria), which mainly occurs in coding regions and may cause recoding effects (1). Meanwhile, there are also some editing sites locating in non-coding regions, which are essential for posttranscriptional regulations such as splicing (2). Collectively, RNA editing plays important roles in plant developmental processes (3), plant flowering (4), organelle biogenesis (5), response to particular environmental conditions (6,7) and signal transduction (8). It is believed that RNA editing acts as an indirect repair mechanism to correct DNA mutations on the RNA level, enlisting a number of editing factors in the editing process and consequently revealing the diversity of plant editosomes for RNA editing (see a review in (9)). Over the past several years, a variety of editing factors have been identified as essential components of plant RNA editosome, including pentatricopeptide repeat (PPR) proteins (10), RNA editing factor interacting proteins (RIPs)/multiple organellar RNA editing factors (MORFs) (11,12), organelle RNA recognition motif (ORRM) proteins (13), organelle zinc-finger (OZ) proteins (14), and protoporphyrinogen oxidase 1 (PPO1) (15). Undoubtedly, a complete picture of RNA editing factors is of fundamental significance to better understand the diversity of plant editosomes for RNA editing machinery and potential applications. To date, however, there is still no comprehensive collection of RNA editing factors in plants, although several databases have already been developed to integrate various RNA editing data. For instance, REDIdb (16) is specialized for collecting RNA editing events in plant organelles, PREPACT 3.0 (17) aims to facilitate RNA editing data analysis by integrating information on characterized editing factors for editing site recognition specificities, dbRES (18) integrates RNA editing sites for plant, metazoan, protozoa, fungi and virus, RESOPS (19) focuses on the correspondence of RNA editing sites of land plant organelles to protein three-dimensional structures, and REDIportal (20) and RADAR (21) are two popular databases containing A-to-I RNA editing data in human. Therefore, a specialized database of plant RNA editosomes that houses diverse plant RNA editing factors as well as their associated data is highly desirable. Here we present Plant Editosome Database (PED; http://bigd.big.ac.cn/ped), a curated database of plant RNA editosomes that is dedicated to the curation, integration and standardization of plant editosome data. Unlike extant relevant databases, PED features high-quality editosome data manually curated from related publications and organelle genome annotations. Accordingly, PED incorporates a complete collection of RNA editing factors and associated data across a wide range of plants, which would be of broad utility for the global plant research community.

METHODS AND MATERIALS

All editing events integrated in PED were derived from a total of 2651 organelle flat files downloaded from NCBI RefSeq Organelle Genome Resources (https://www.ncbi.nlm.nih.gov/genome/organelle/). Specifically, an abundance of editing information, including edited gene, editing position, editing region, editing type, codons before and after editing, amino acids before and after editing, etc., were extracted and parsed by a python script. To obtain high-confidence RNA editing information, genome sequences and related publications were then retrieved by manual curation when some RNA editing information were missing or questionable. As a result, 20 836 RNA editing events in 203 organelle genes were obtained, involving a total of 1621 plant species and 1673 organelle genomes (http://bigd.big.ac.cn/ped/dataset). In addition, RNA editing factors as well as their associated editing events were manually curated from published literature. Accordingly, a collection of 98 editing factors were obtained, which were classified into several groups (PPR protein, MORFs/RIPs, ORRM protein, OZ protein, etc.) based on their conserved motifs as detailed in (9). To facilitate RNA editing data curation in a standardized manner, a curation model containing controlled vocabularies and conclusion terms was developed, involving a variety of data items such as editing factor, factor family, associated editing events, edited gene(s), editing type, experimental information, mutant effect relative to wild type, etc. More details about this curation model are publicly available at http://bigd.big.ac.cn/ped/help. PED was constructed with MySQL (http://www.mysql.org; a free and popular relational database management system) as its database engine and web interfaces were developed by using JSP (JavaServer Pages; a technology facilitating rapid development of dynamic web pages based on the Java programming language) and AJAX (Asynchronous JavaScript and XML; a set of web development techniques to create asynchronous applications without interfering with the display and behavior of the existing page). In addition, Bootstrap (https://getbootstrap.com; an open source toolkit for developing with HTML, CSS and JavaScript) was adopted as a front-end framework, which provides a series of templates for designing web pages with consistent interface components.

DATABASE CONTENT AND USAGE

PED features comprehensive integration of RNA editing factors and their associated data manually curated from related publications and organelle annotation files. Collectively, the current version of PED houses a total of 98 RNA editing factors and 20 836 editing events, involving 203 organelle genes and covering 1621 plant species and 1673 plant organelles. In addition, PED includes interactions between editing factors and editing events in 8 model species, functional effects of editing factors in regulating plant phenotypes as well as detailed experimental evidence. PED contains 98 experimentally validated RNA editing factors and their associated edited genes manually curated from 94 publications (Figure 1), corresponding to a complete collection to date of known plant editing factor families as well as their associated experimental treatments and mutant phenotypes. To have a complete picture of plant RNA editing factors, PED is equipped with a specific page for editing factors (Figure 2A), demonstrating that, according to the current collection, the majority of plant editing factors are PPR proteins (72 in count) and the following are MORF/RIP and ORRM. Accordingly, it facilitates access to all collected factors and their associated information, including species, protein family, subclass, associated phenotype(s), edited gene(s), organelle and publication(s). For each specific editing factor, PED summarizes all relevant editing events and provides an abundance of detailed information, including editing position, editing region, editing type, changes of codons & amino acids (if available), editing effect (recoding or synonymous) and detailed information of the experiment evidence. Interestingly, DYW2, an editing factor in Arabidopsis thaliana encoding an atypical PPR-DYW protein, is found to have a larger number of edited genes (41 in count) according to the current implementation of PED. Together, all editing factors collected in PED are publicly accessible at http://bigd.big.ac.cn/ped/browse/factors.
Figure 1.

Statistics of experimentally validated RNA editing factors and associated edited genes in PED (as of August 2018).

Figure 2.

Screenshots of PED web pages, including (A) RNA editing factors, (B) edited genes and (C) species.

Statistics of experimentally validated RNA editing factors and associated edited genes in PED (as of August 2018). Screenshots of PED web pages, including (A) RNA editing factors, (B) edited genes and (C) species. PED collects a total of 203 edited genes associated with 20 836 RNA editing events (http://bigd.big.ac.cn/ped/browse/genes). It is revealed from the current version of PED that rpl2, a ribosomal protein L2 locating in mitochondria, chloroplast and plastid, contains a large number of editing events in more than 600 plant species, and that ndhB, an edited gene of NADH dehydrogenase subunit 2 locating in chloroplast and plastid, is related with 20 different editing factors. According to the current collection of PED, editing events are classified into five types: C-to-U (11, 903), U-to-C (2, 613), A-to-G (15), G-to-A (9) and G-to-U (6), agreeing well with previous studies that C-to-U and U-to-C conversions are most common in plants (22). For a given edited gene, PED provides both basic descriptive information and associated detailed information of editing events (Figure 2B). Basic information includes gene symbol, gene function, organelle type, functional category, number of associated species, number and list of experimentally validated editing factors, number of editing events occurring on edited gene, editing type, and external links to NCBI. Detailed information of editing events contains editing position, editing type, strand, change of codons & amino acids. For instance, atp6, a mitochondrial gene (ATP synthase F0 subunit 6), contains >300 RNA editing events among 27 species, associates with three editing factors (DYW2, EMP5, NUWA) and results in multiple different amino acid changes (e.g. proline to leucine, glutamine to stop codon), which, collectively, are helpful for users to investigate RNA editing patterns and explore the editing machinery across a wide range of species. PED also organizes editing information in terms of species (http://bigd.big.ac.cn/ped/browse/species), where each species has a summary of editing information (Figure 2C). Basic descriptive information includes organelle type, organelle accession number, number of edited genes, number of editing events, list of edited genes. Further detailed features of edited genes contain genomic position, strand, RNA type, editing type, change of codons & amino acids. Currently, PED incorporates 1673 plant organelle genomes from 1621 species, corresponding to a wide diversity of plants and algae, including monocots, eudicots, ferns, club-mosses, bryophytes, and other Viridaeplantae and accounting for a total of 20 836 RNA editing events. Of these RNA editing events currently stored in PED, 16 578 occur in mRNAs (16 351 in protein-coding sequences), 252 in tRNAs and 4006 in unannotated transcripts. Also, according to the current collection in PED, organelle genomes from Anthoceros angustus (chloroplast; 78 edited genes), Nelumbo nucifera (mitochondrion; 38 edited genes) and Nothoceros aenigmaticus (plastid; 88 edited genes), contain more editing events (1018, 771 and 226, respectively). In addition, the currently stored RNA editing events are mainly from chloroplast, accounting for a total of 10 916 events from 1255 chloroplasts genomes. In spite of the higher total number of editing sites in chloroplast, the frequency of editing is much lower than that in mitochondria, with 67 editing sites/per genome for mitochondria, but only about nine editing sites/per genome for chloroplast.

DISCUSSION AND FUTURE DEVELOPMENTS

PED is a curated database of RNA editosome in plants, featuring high-quality editosome data manually curated from published literature and organelle genome annotations. The current implementation of PED houses a complete collection of 98 RNA editing factors and integrates 203 edited genes and 20 836 editing events across a wide range of plant organelle genomes. As a featured resource of BIG Data Center (http://bigd.big.ac.cn) (23,24), PED is committed to the curation, integration and standardization of plant editosome data and thus has the great potential to help researchers conduct systematic investigations on RNA editing machinery in a variety of plant species. Considering the increasing number of published studies on plant RNA editing, a specialized team is responsible for PED to make it up-to-date and comprehensive. Future directions of PED include integration of more editing factors, and importantly, their potential interactions (25) through literature curation. Since some editing factors are evolutionarily related and belong to one common protein family, we will build phylogenetic trees for closely related editing factors, with the aim to facilitate researchers to study RNA editing across species. In addition, we will develop online tools to help users to explore the RNA editing process for a specific editing factor, compare RNA editing pattern between different factors/genes, and investigate the conservation of RNA editing sites across a variety of species. For any given editing factor, we will also develop new functionalities to visualize its associated edited gene(s) and editing sites across a wide range of species.
  25 in total

1.  Multiple organellar RNA editing factor (MORF) family proteins are required for RNA editing in mitochondria and plastids of plants.

Authors:  Mizuki Takenaka; Anja Zehrmann; Daniil Verbitskiy; Matthias Kugelmann; Barbara Härtel; Axel Brennicke
Journal:  Proc Natl Acad Sci U S A       Date:  2012-03-12       Impact factor: 11.205

Review 2.  Plant RNA editing.

Authors:  Anne-Laure Chateigner-Boutin; Ian Small
Journal:  RNA Biol       Date:  2010-03-26       Impact factor: 4.652

3.  A pentatricopeptide repeat protein is essential for RNA editing in chloroplasts.

Authors:  Emi Kotera; Masao Tasaka; Toshiharu Shikanai
Journal:  Nature       Date:  2005-01-20       Impact factor: 49.962

4.  Functional disruption of the pentatricopeptide protein SLG1 affects mitochondrial RNA editing, plant development, and responses to abiotic stresses in Arabidopsis.

Authors:  Hui Yuan; Dong Liu
Journal:  Plant J       Date:  2012-01-16       Impact factor: 6.417

5.  Intron RNA editing is essential for splicing in plant mitochondria.

Authors:  Benoît Castandet; David Choury; Dominique Bégu; Xavier Jordana; Alejandro Araya
Journal:  Nucleic Acids Res       Date:  2010-07-08       Impact factor: 16.971

6.  The mitochondrial PPR protein LOVASTATIN INSENSITIVE 1 plays regulatory roles in cytosolic and plastidial isoprenoid biosynthesis through RNA editing.

Authors:  Jianwei Tang; Keiko Kobayashi; Masashi Suzuki; Shogo Matsumoto; Toshiya Muranaka
Journal:  Plant J       Date:  2009-11-19       Impact factor: 6.417

7.  Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis.

Authors:  Claire Lurin; Charles Andrés; Sébastien Aubourg; Mohammed Bellaoui; Frédérique Bitton; Clémence Bruyère; Michel Caboche; Cédrig Debast; José Gualberto; Beate Hoffmann; Alain Lecharny; Monique Le Ret; Marie-Laure Martin-Magniette; Hakim Mireau; Nemo Peeters; Jean-Pierre Renou; Boris Szurek; Ludivine Taconnat; Ian Small
Journal:  Plant Cell       Date:  2004-07-21       Impact factor: 11.277

8.  RIP1, a member of an Arabidopsis protein family, interacts with the protein RARE1 and broadly affects RNA editing.

Authors:  Stephane Bentolila; Wade P Heller; Tao Sun; Arianne M Babina; Giulia Friso; Klaas J van Wijk; Maureen R Hanson
Journal:  Proc Natl Acad Sci U S A       Date:  2012-05-07       Impact factor: 11.205

9.  dbRES: a web-oriented database for annotated RNA editing sites.

Authors:  Tao He; Pufeng Du; Yanda Li
Journal:  Nucleic Acids Res       Date:  2006-11-06       Impact factor: 16.971

10.  RESOPS: a database for analyzing the correspondence of RNA editing sites to protein three-dimensional structures.

Authors:  Kei Yura; Sintawee Sulaiman; Yosuke Hatta; Masafumi Shionyu; Mitiko Go
Journal:  Plant Cell Physiol       Date:  2009-10-06       Impact factor: 4.927

View more
  12 in total

1.  Database Resources of the National Genomics Data Center in 2020.

Authors: 
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

2.  Databases for RNA Editing Collections.

Authors:  Claudio Lo Giudice; Luigi Mansi; Graziano Pesole; Ernesto Picardi
Journal:  Methods Mol Biol       Date:  2021

3.  Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2022.

Authors: 
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

4.  Plastid genome evolution in Amazonian açaí palm (Euterpe oleracea Mart.) and Atlantic forest açaí palm (Euterpe edulis Mart.).

Authors:  Amanda de Santana Lopes; Túlio Gomes Pacheco; Odyone Nascimento da Silva; Leila do Nascimento Vieira; Miguel Pedro Guerra; Eduardo Pacca Luna Mattar; Valter Antonio de Baura; Eduardo Balsanelli; Emanuel Maltempi de Souza; Fábio de Oliveira Pedrosa; Marcelo Rogalski
Journal:  Plant Mol Biol       Date:  2021-01-01       Impact factor: 4.076

Review 5.  The Regulation of RNA Modification Systems: The Next Frontier in Epitranscriptomics?

Authors:  Matthias R Schaefer
Journal:  Genes (Basel)       Date:  2021-02-26       Impact factor: 4.096

6.  Comparative Analysis of the Complete Chloroplast Genome of Mainland Aster spathulifolius and Other Aster Species.

Authors:  Swati Tyagi; Jae-A Jung; Jung Sun Kim; So Youn Won
Journal:  Plants (Basel)       Date:  2020-04-29

7.  Systematic analysis reveals cis and trans determinants affecting C-to-U RNA editing in Arabidopsis thaliana.

Authors:  Duan Chu; Lai Wei
Journal:  BMC Genet       Date:  2020-09-03       Impact factor: 2.797

8.  LIRBase: a comprehensive database of long inverted repeats in eukaryotic genomes.

Authors:  Lihua Jia; Yang Li; Fangfang Huang; Yingru Jiang; Haoran Li; Zhizhan Wang; Tiantian Chen; Jiaming Li; Zhang Zhang; Wen Yao
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

9.  Full Length Transcriptome Highlights the Coordination of Plastid Transcript Processing.

Authors:  Marine Guilcher; Arnaud Liehrmann; Chloé Seyman; Thomas Blein; Guillem Rigaill; Benoit Castandet; Etienne Delannoy
Journal:  Int J Mol Sci       Date:  2021-10-19       Impact factor: 5.923

10.  MORF9 Functions in Plastid RNA Editing with Tissue Specificity.

Authors:  Faan Tian; Jinfa Yu; Ya Zhang; Yakun Xie; Binghua Wu; Ying Miao
Journal:  Int J Mol Sci       Date:  2019-09-19       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.