Literature DB >> 16381833

MODOMICS: a database of RNA modification pathways.

Stanislaw Dunin-Horkawicz1, Anna Czerwoniec, Michal J Gajda, Marcin Feder, Henri Grosjean, Janusz M Bujnicki.   

Abstract

MODOMICS is the first comprehensive database resource for systems biology of RNA modification. It integrates information about the chemical structure of modified nucleosides, their localization in RNA sequences, pathways of their biosynthesis and enzymes that carry out the respective reactions. MODOMICS also provides literature information, and links to other databases, including the available protein sequence and structure data. The current list of modifications and pathways is comprehensive, while the dataset of enzymes is limited to Escherichia coli and Saccharomyces cerevisiae and sequence alignments are presented only for tRNAs from these organisms. RNAs and enzymes from other organisms will be included in the near future. MODOMICS can be queried by the type of nucleoside (e.g. A, G, C, U, I, m1A, nm5s2U, etc.), type of RNA, position of a particular nucleoside, type of reaction (e.g. methylation, thiolation, deamination, etc.) and name or sequence of an enzyme of interest. Options for data presentation include graphs of pathways involving the query nucleoside, multiple sequence alignments of RNA sequences and tabular forms with enzyme and literature data. The contents of MODOMICS can be accessed through the World Wide Web at http://genesilico.pl/modomics/.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16381833      PMCID: PMC1347447          DOI: 10.1093/nar/gkj084

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Naturally occurring RNAs contain numerous chemically altered nucleosides. They are formed by enzymatic modification of the primary transcripts during the complex RNA maturation process. To date, over 100 structurally distinguishable modified nucleosides originating from different types of RNAs from many diverse organisms of the three major phylogenetic domains of life have been reported and collected in the RNAMods database (1,2). The location and distribution of various types of modification vary greatly between different RNA molecules, organisms and organelles. The largest number of modified nucleosides with the greatest structural diversity is found in transfer RNAs (tRNAs) (3). Other types of RNA (rRNA, mRNA, snRNA, snoRNA and even the recently discovered miRNA) also contain modified nucleosides; however, they cannot match tRNAs with respect to abundance and diversity of modifications [see ref. (4) and the collection of reviews in books (5,6)].

DATABASE CONTENT

We have collected all the currently known RNA modifications from the RNAMods database (2) and the tRNA sequences from the Bayreuth database (3). We have extended the set of RNA modifications to include the most recent additions reported in the literature and we have carefully refined the type and location of modifications in Escherichia coli and Saccharomyces cerevisiae tRNA sequences based on an extensive survey of the published data. Furthermore, based on a very extensive literature search we have compiled the set of known and predicted modification reactions (and the responsible enzymes, if available) that lead from the unmodified precursors to all known modified nucleosides. For all RNA modification enzymes from E.coli and S.cerevisiae, we have collected the available information concerning alternative gene and protein names, amino acid sequence, 3D structure, cofactor requirements and literature (PubMed) references to the key analyses. To date, such a comprehensive set of metabolic reactions involving nucleosides in RNA has never been reported. Similarly, our collection of enzymes is much more complete, accurate and up-to-date than the subsets of proteins involved in metabolism of RNA modifications stored in general-purpose enzyme and genome databases, such as BRENDA (7), EcoCyc (8), SGD (9), etc.

DATABASE ORGANIZATION AND ACCESS

MODOMICS is a relational database that links together the datasets of modification reactions/pathways, RNA-modifying enzymes and the sequences of target RNAs, which can be queried via three convenient menus, ‘PATHWAYS’, ‘ENZYMES’ and ‘RNAs’. The modification pathways are represented as a set of graphs visualized with GRAPHVIZ (), in which the nodes represent the modified nucleosides, and the edges represent the transformations (real or putative) between them, e.g. enzymatic reactions. Currently, the pathway dataset comprises five graphs, corresponding to all known modifications of adenosine (A), cytidine (C), guanosine (G), uridine (U) and the queuosine pathway (Q), of which the graph of U modifications is clearly most diversified. The ‘PATHWAYS’ menu offers a variety of filtering options to display a whole graph or its fragment corresponding to reactions that occur in a particular kingdom of life (Eukaryota, Archaea, Bacteria or viruses) or an organellum (currently only mitochondrial modifications are supported), in a particular subset or RNAs (tRNAs, rRNAs, mRNAs, small RNAs or chromosomal RNAs), and hypermodifications that result from processing of an already modified nucleoside (e.g. I and its derivatives, Ψ and its derivatives, etc.). All nodes of the (sub)graph, e.g. the images of all nucleosides, are hyperlinked to dynamically generated windows comprising two panels (example shown in Figure 1). The upper panel displays basic information about the selected nucleoside, including its common name, one letter code in the tRNA sequence database, symbol used in MODOMICS and the chemical structure in the SMILES code hyperlinked to the corresponding image generated using the SMI2GIF script kindly provided by the Daylight Chemical Information Systems Inc (). The lower panel includes a subgraph of reactions leading to the selected nucleoside from its precursor(s) (unless the selected nucleoside is the precursor itself), and all known hypermodifications formed from that nucleoside.
Figure 1

A subgraph of adenosine modifications, showing the pathways of m1Im synthesis (14). Experimentally verified reactions catalyzed by known enzymes from the MODOMICS dataset are indicated by solid arrows, putative reactions or those catalyzed by unknown enzymes are indicated by broken arrows (in this example: m1I→m1Im). The phylogenetic occurrence of nucleosides is shown by colored letters: Archaea (red A), Bacteria (blue B) and Eukaryota (green E).

All edges of the (sub)graph, e.g. arrows that connect the images of nucleosides, are hyperlinked to static ‘reaction’ windows comprising one or more panels (example shown in Figure 2). The upper panel displays basic information about the selected reaction, namely the type of RNA in which it is conducted and its occurrence in the phylogenetic context (inferred from the knowledge of the phylogenetic distribution of the substrate and the product nucleoside). Other panels display information about enzymes known to catalyze the selected reaction in different substrates and in different organisms. Currently, the database of enzymes includes only proteins from E.coli and S.cerevisiae, but will be expanded in the future and may eventually comprise all orthologs of the functionally characterized enzymes identifiable in fully sequenced genomes. At this moment, however, MODOMICS includes only enzymes and corresponding protein cofactors that have been experimentally validated, of which most (but not all) have known amino acid sequences. The enzyme panels are also directly accessible from the ‘ENZYMES’ menu, which lists all entries of the current database in the tabular form, which can be sorted by the name(s) (e.g. Trm1, Dus2, MnmC, etc.), enzyme type (e.g. methyltransferase, deaminase, etc.) or the organism of origin. The enzyme database can be also searched by identifying entries, whose fields match the query formulated as a regular expression (e.g. ‘Trm*’, ‘*pus*’, ‘m1*’ or ‘*transferase’, etc.). The enzyme names include both the traditional one(s), which have been quite erratically inferred from different characteristics of the gene, the protein, the organism, the type of the reaction, or even the whole pathway (e.g. Tgs1p, Abd1p, Gar1p, Nop1p, MnmA, GidA, etc.), as well as a novel name given according to the newly developed, uniform nomenclature (H.G. and J.M.B., manuscript in preparation). The enzyme panel lists all relevant types of reactions between nucleosides in different substrates and provides the EC numbers (if available) and links to the corresponding entries in the BRENDA database (7). Other information about the enzyme concerns the sequence (if available) and includes the name of the open reading frame in the genome, the NCBI GenPept (10) and SwissProt (11) accession numbers, the experimentally solved structure in the PDB (12) (in the future it will also include experimentally validated computational models) and the literature information concerning experimental characterization of the protein, identification of the gene, and possibly a structural analysis, together with links to the relevant database entries.
Figure 2

Reaction panel illustrates the methylation of inosine (I) to N-methylinosine (m1I) catalyzed by the Trm5p enzyme at the position 37 of the anticodon loop of tRNAAla (anticodon IGC), which also introduces the m1G modification in a larger subset of other yeast tRNAs (15). The hyperlink to the G→m1G reaction reveals a similar panel (data not shown) with Trm5p, as well as three other enzymes, namely tRNA MTases Trm10p in S.cerevisiae (16) and TrmD (17), and rRNA MTase RlmA(I) (18) (both in E.coli). Note that in Archaeal tRNAs, the enzymatic formation of m1I, which exclusively occurs at the position 57 of the Ψ-loop, follows a completely different metabolic route [data not shown, for details see (14,19)].

The ‘RNAs’ menu allows displaying multiple sequence alignments of homologous RNA families, in which the modifications have been observed. Currently, this dataset is complementary to the enzyme dataset and includes only tRNA sequences from S.cerevisiae (both cytoplasmic and mitochondrial) and E.coli, and the corresponding unmodified tDNA sequences. In the future it will be extended to manually validated tRNA sequences from other model organisms, for which the tRNA data are available (3), as well as to tDNA sequences predicted from the fully sequenced genomes [e.g. the tRNomics dataset from ref. (13)]. It will also include other RNA sequences with identified modifications, such as rRNAs and various small non-coding RNAs. The content of the currently available tRNA sequence alignment can be filtered according to several options, including the organism and strain or taxon of origin, anticodon and amino acid specificity.

AVAILABILITY

The data are accessible freely for research purposes at the URL . Images of pathways are available by Email request to the first author (at sdh@genesilico.pl). Scientists interested in adding data (modifications, RNA sequences, enzymes, etc.) or in using the enzyme information in MODOMICS to predict the modification status of RNA molecules from the DNA sequence are encouraged to contact the senior authors (at iamb@genesilico.pl or henri.grosjean@lebs.cnrs-gif.fr). This article should be cited in research projects assisted by the use of MODOMICS.
  17 in total

1.  A primordial tRNA modification required for the evolution of life?

Authors:  G R Björk; K Jacobsson; K Nilsson; M J Johansson; A S Byström; O P Persson
Journal:  EMBO J       Date:  2001-01-15       Impact factor: 11.598

2.  tRNomics: analysis of tRNA genes from 50 genomes of Eukarya, Archaea, and Bacteria reveals anticodon-sparing strategies and domain-specific features.

Authors:  Christian Marck; Henri Grosjean
Journal:  RNA       Date:  2002-10       Impact factor: 4.942

3.  BRENDA, the enzyme database: updates and major new developments.

Authors:  Ida Schomburg; Antje Chang; Christian Ebeling; Marion Gremse; Christian Heldt; Gregor Huhn; Dietmar Schomburg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  A primordial RNA modification enzyme: the case of tRNA (m1A) methyltransferase.

Authors:  Martine Roovers; Johan Wouters; Janusz M Bujnicki; Catherine Tricot; Victor Stalon; Henri Grosjean; Louis Droogmans
Journal:  Nucleic Acids Res       Date:  2004-01-22       Impact factor: 16.971

Review 5.  Summary: the modified nucleosides of RNA.

Authors:  P A Limbach; P F Crain; J A McCloskey
Journal:  Nucleic Acids Res       Date:  1994-06-25       Impact factor: 16.971

6.  Chromosomal location and cloning of the gene (trmD) responsible for the synthesis of tRNA (m1G) methyltransferase in Escherichia coli K-12.

Authors:  A S Byström; G R Björk
Journal:  Mol Gen Genet       Date:  1982

7.  Identification of the yeast gene encoding the tRNA m1G methyltransferase responsible for modification at position 9.

Authors:  Jane E Jackman; Rebecca K Montange; Harmit S Malik; Eric M Phizicky
Journal:  RNA       Date:  2003-05       Impact factor: 4.942

8.  Identification of the rrmA gene encoding the 23S rRNA m1G745 methyltransferase in Escherichia coli and characterization of an m1G745-deficient mutant.

Authors:  C Gustafsson; B C Persson
Journal:  J Bacteriol       Date:  1998-01       Impact factor: 3.490

9.  Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms.

Authors:  Karen R Christie; Shuai Weng; Rama Balakrishnan; Maria C Costanzo; Kara Dolinski; Selina S Dwight; Stacia R Engel; Becket Feierbach; Dianna G Fisk; Jodi E Hirschman; Eurie L Hong; Laurie Issel-Tarver; Robert Nash; Anand Sethuraman; Barry Starr; Chandra L Theesfeld; Rey Andrada; Gail Binkley; Qing Dong; Christopher Lane; Mark Schroeder; David Botstein; J Michael Cherry
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

10.  A novel enzymatic pathway leading to 1-methylinosine modification in Haloferax volcanii tRNA.

Authors:  H Grosjean; F Constantinesco; D Foiret; N Benachenhou
Journal:  Nucleic Acids Res       Date:  1995-11-11       Impact factor: 16.971

View more
  95 in total

1.  A domain of the actin binding protein Abp140 is the yeast methyltransferase responsible for 3-methylcytidine modification in the tRNA anti-codon loop.

Authors:  Sonia D'Silva; Steffen J Haider; Eric M Phizicky
Journal:  RNA       Date:  2011-04-25       Impact factor: 4.942

2.  Actin-binding protein ABP140 is a methyltransferase for 3-methylcytidine at position 32 of tRNAs in Saccharomyces cerevisiae.

Authors:  Akiko Noma; Sanghyun Yi; Takayuki Katoh; Yoshimi Takai; Takeo Suzuki; Tsutomu Suzuki
Journal:  RNA       Date:  2011-04-25       Impact factor: 4.942

3.  Identification and modeling of a phosphatase-like domain in a tRNA 2'-O-ribosyl phosphate transferase Rit1p.

Authors:  Anna Czerwoniec; Janusz M Bujnicki
Journal:  Cell Cycle       Date:  2011-10-15       Impact factor: 4.534

4.  Global Epitranscriptomics Profiling of RNA Post-Transcriptional Modifications as an Effective Tool for Investigating the Epitranscriptomics of Stress Response.

Authors:  Rebecca E Rose; Manuel A Pazos; M Joan Curcio; Daniele Fabris
Journal:  Mol Cell Proteomics       Date:  2016-01-05       Impact factor: 5.911

5.  AtTrm5a catalyses 1-methylguanosine and 1-methylinosine formation on tRNAs and is important for vegetative and reproductive growth in Arabidopsis thaliana.

Authors:  Xiaohuan Jin; Zhengyi Lv; Junbao Gao; Rui Zhang; Ting Zheng; Ping Yin; Dongqin Li; Liangcai Peng; Xintao Cao; Yan Qin; Staffan Persson; Bo Zheng; Peng Chen
Journal:  Nucleic Acids Res       Date:  2019-01-25       Impact factor: 16.971

6.  Differentiation of 2'-O- and 3'-O-methylated ribonucleosides by tandem mass spectrometry.

Authors:  Qingchun Zhang; Yinsheng Wang
Journal:  J Am Soc Mass Spectrom       Date:  2006-06-05       Impact factor: 3.109

7.  High-resolution structure of the nitrile reductase QueF combined with molecular simulations provide insight into enzyme mechanism.

Authors:  Youngchang Kim; Min Zhou; Shiu Moy; Jennifer Morales; Mark A Cunningham; Andrzej Joachimiak
Journal:  J Mol Biol       Date:  2010-09-25       Impact factor: 5.469

8.  The tRNA recognition mechanism of folate/FAD-dependent tRNA methyltransferase (TrmFO).

Authors:  Ryota Yamagami; Koki Yamashita; Hiroshi Nishimasu; Chie Tomikawa; Anna Ochi; Chikako Iwashita; Akira Hirata; Ryuichiro Ishitani; Osamu Nureki; Hiroyuki Hori
Journal:  J Biol Chem       Date:  2012-10-24       Impact factor: 5.157

9.  Cysteine of sequence motif VI is essential for nucleophilic catalysis by yeast tRNA m5C methyltransferase.

Authors:  Hélène Walbott; Clotilde Husson; Sylvie Auxilien; Béatrice Golinelli-Pimpaneau
Journal:  RNA       Date:  2007-05-02       Impact factor: 4.942

10.  Flexible recognition of the tRNA G18 methylation target site by TrmH methyltransferase through first binding and induced fit processes.

Authors:  Anna Ochi; Koki Makabe; Kunihiro Kuwajima; Hiroyuki Hori
Journal:  J Biol Chem       Date:  2010-01-06       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.