Literature DB >> 19015153

sRNAMap: genomic maps for small non-coding RNAs, their regulators and their targets in microbial genomes.

Hsi-Yuan Huang1, Heng-Yi Chang, Chih-Hung Chou, Ching-Ping Tseng, Shinn-Ying Ho, Chi-Dung Yang, Yih-Wei Ju, Hsien-Da Huang.   

Abstract

Small non-coding RNAs (sRNAs) carry out a variety of biological functions and affect protein synthesis and protein activities in prokaryotes. Recently, numerous sRNAs and their targets were identified in Escherichia coli and in other bacteria. It is crucial to have a comprehensive resource concerning the annotation of small non-coding RNAs in microbial genomes. This work presents an integrated database, namely sRNAMap, to collect the sRNA genes, the transcriptional regulators of sRNAs and the sRNA target genes by integrating a variety of biological databases and by surveying literature. In this resource, we collected 397 sRNAs, 62 regulators/sRNAs and 60 sRNAs/targets in 70 microbial genomes. Additionally, more valuable information of the sRNAs, such as the secondary structure of sRNAs, the expressed conditions of sRNAs, the expression profiles of sRNAs, the transcriptional start sites of sRNAs and the cross-links to other biological databases, are provided for further investigation. Besides, various textual and graphical interfaces were designed and implemented to facilitate the data access in sRNAMap. sRNAMap is available at http://sRNAMap.mbc.nctu.edu.tw/.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 19015153      PMCID: PMC2686527          DOI: 10.1093/nar/gkn852

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Small non-coding RNAs (sRNAs), which are discovered in many organisms ranging from bacteria to mammals, play important regulatory roles in cell physiology including regulation of cell development, cell death and chromosome silencing (1). Many of them regulate gene expression at a posttranscriptional level, either by acting as antisense RNAs, by binding to complementary sequences of target transcripts, or by interacting with proteins (2). Figure 1 depicts the synthesis and the functions of small non-coding RNAs. The transcription of sRNAs is regulated by transcription factors. Furthermore, sRNAs can play regulatory roles in translation repression, translation activation, mRNA degradation and mRNA stability.
Figure 1.

The synthesis and functions of small non-coding RNAs collected in sRNAMap.

The synthesis and functions of small non-coding RNAs collected in sRNAMap. EcoCyc (3) and RegulonDB (4) integrate biological knowledge of the transcriptional regulation in Escherichia coli, as well as knowledge on the organization of the genes and regulatory signals into operons in the chromosome. ASAP (5) is developed to store genome sequences in conjunction with associated annotations and functional characterization data. NONCODE (6) is an integrated knowledge database dedicated to non-coding RNAs. In addition, Storz et al. (7) used northern blotting analysis to document a total of 79 small RNAs in E. coli. The increased investigations of important regulatory roles for sRNAs encoded far from their targets, acting on multiple targets, or both, has expanded interest in how to find such regulatory RNAs and how they work (8). Therefore, a resource collects the comprehensive annotation of small non-coding RNAs is crucial. We present an integrated database, sRNAMap, to collect the annotations of the sRNAs and the regulatory relationship between transcriptional regulator and sRNA, and between sRNA and its target genes. The design concept of the sRNAMap is illustrated in Figure 1. Additionally, more valuable information of sRNAs, such as the secondary structure of sRNAs, the expressed conditions of sRNAs, the expression profiles of sRNAs, the transcriptional start sites of sRNAs and the cross-links to other biological databases, are provided for further investigation. Besides, various textual and graphical interfaces were designed and implemented to facilitate the data access in sRNAMap.

DATABASE STATISTICS

The sRNAMap currently collects 397 sRNA genes, 62 regulator/sRNA regulations and 60 sRNA/target regulations in seventy microbial genomes. The detailed list of genome is given in Table S4. As given in Table 1, for instance, the number of experimentally validated sRNA genes in E. coli, Shigella boydii, Shigella flexneri and Yersinia pestis are 87, 35, 40 and 24, respectively. Table 2 gives the length distribution of the total known sRNA genes. Moreover, the sRNAMap analysed the transcriptional start sites of sRNA genes. Figure S3 (see Supplementary Materials) is the schematic diagram for the classification of transcription start sites of sRNA. In E. coli K-12 MG1655, 30 sRNAs have transcription start sites and 33 sRNAs have 49 putative transcription start sites, as given in Table S5.
Table 1.

The briefly statistics of small non-coding RNAs in sRNAMap

Species namesNo. of experimentally verified sRNAs
Escherichia coli87
EnterobacteriaPhage VT2-Sakai genomic DNA1
Enterobacter intermedius1
Enterobacter cloacae1
Enterobacter aerogenes1
Yersinia pseudotuberculosis21
Yersinia pestis24
Yersinia mollaretii ATCC 43 9691
Yersinia intermedia ATCC 29 9091
Yersinia frederiksenii ATCC 33 6411
Yersinia enterocolitica23
Yersinia bercovieri ATCC 43 9701
Shigella sonnei 04638
Shigella flexneri40
Shigella dysenteriae33
Shigella boydii35
Serratia proteamaculans 5682
Salmonella typhimurium15
Salmonella enteritidis2
Salmonella typhi10
Salmonella paratyphi A ATCC 915030
Salmonella enterica Subsp. Arizonae1
Salmonella choleraesuis SC-B674
Photorhabdus luminescens TTO11
Pectobacterium carotovorum2
Pectobacterium atrosepticum SCRI104315
Klebsiella pneumoniae4
Klebsiella oxytoca2
Table 2.

Nucleotide length distribution of sRNA genes

<100100 ∼ 200200 ∼ 300300 ∼ 400400 ∼ 500
No. of sRNAs334436685812
The briefly statistics of small non-coding RNAs in sRNAMap Nucleotide length distribution of sRNA genes

DATA GENERATION

The data generation flow of sRNAMap database is depicted in Figure 2. The data generation flow comprises two major parts: (i) integration of external data sources and (ii) integration of annotated tools. We collect the sRNA information from a variety of biological databases, such as RegulonDB, ASAP and NONCODE. Information of sRNAs including the accessions, names, genomic location, species, descriptions and sequences were obtained. Furthermore, the regulator/sRNA regulations and sRNA/target regulations were obtained from RegulonDB and NPInter (9). In addition to collecting data from external databases, we gather the sRNA information by surveying literatures. Besides, RNA secondary structures, cross-species comparisons and 37 expression profiles of sRNAs were integrated into the database. 308 computationally identified sRNAs (10,11) and 114 computationally identified regulator/sRNA regulations and sRNA/target regulations (12,13) were obtained.
Figure 2.

The data generation flow of sRNAMap.

The data generation flow of sRNAMap. Gene Expression Omnibus (GEO) (14) is a database repository of high-throughput gene expression data and hybridization arrays, chips, microarrays. The expression profiles related to sRNA were obtained and integrated. Besides, UCSC Archaeal Genome Browser (15), which is a popular web-based tool for quickly displaying a requested portion of a genome at any scale, was integrated to provide the sequence conservation of sRNAs. RNAfold (16) was applied to fold the RNA secondary structures of sRNAs. Moreover, RNALogo (17), which presents a graphical representation of the patterns in an aligned RNA sequence family with a consensus structure, was integrated for presenting the sRNA families. The cross-links to other biological databases are provided for each sRNA in the database. The integrated external data sources, the linked external data sources and the integrated annotated tools are listed in Table S1, Table S2 and Table S3 (see Supplementary Materials), respectively.

INTERFACE

The sRNAMap provides a variety of interfaces and graphical visualization to present the plentiful information of sRNAs. Users can submit keywords or sequences to search the database. For each sRNA gene, the database provides the sequence, the genomic location, promoter information, secondary structures, literatures, annotations, expression profiles, sequence conservation and its transcriptional regulatory network. Additionally, the sRNAMap has the regulator/sRNA page and the sRNA/target page which provide the experimental conditions and the regulator/sRNA regulations and sRNA/target regulations. Figure S1 shows the interface of sRNA genes in sRNAMap. sRNAMap also provides several browsing functions, such as the genome browser, the network browser, the expression profile browser, the computational sRNAs browser and the literature record browser (Figure S2, see Supplementary Materials).

DISCUSSIONS

sRNAMap is an integrated and comprehensive database comprising plentiful information about sRNA. Table 3 gives the comparison of sRNAMap with other databases related to sRNA including RegulonDB, ASAP, NONCODE, NPInter and Rfam (18). sRNAMAp aims on the annotation of small non-coding RNAs in microbial genomes, while Rfam mainly aims on the collection of non-coding RNA families and a variety of regulatory RNA structural motifs. Rfam currently collects 53 sRNA families. Our proposed sRNAMap collects 87 E. coli sRNAs and totally 397 sRNAs from 70 species. Moreover, sRNAMap also collects computational sRNA and supports information about RNA secondary structures, transcriptional start sites of sRNA and especially the expression profiles of sRNA. Consequently, we would like to say that sRNAMap provides more plentiful and effective information than Rfam and other databases in the aspect of sRNAs.
Table 3.

Comparing sRNAMap with other resources

RegulonDBASAPNONCODENPInterRfamsRNAMap
No. of sRNAs797213410353397
No. of relations
    Regulators/sRNAs165062
    sRNAs/targets264360
No. of species supported1 (E. coli)59 (Microbial genomes)21 (Microbial genomes)1 (E. coli)24870 (Microbial genomes)
Computational sRNAs supportedYes
Secondary structure of sRNAsYesYes
Transcription start site of sRNAs1 typea5 typesa
Expression profiles supportedYes
Transcriptional regulatory networkRegulators/sRNAs sRNAs/targetsRegulators/sRNAs sRNAs/targets
Sequence homology searchYesYes

aThe classification of transcriptional start sites of sRNA is described in Figure S1 (See Supplementary Materials).

Comparing sRNAMap with other resources aThe classification of transcriptional start sites of sRNA is described in Figure S1 (See Supplementary Materials).

AVAILABILITY

The sRNAMap database will be continuously maintained and updated. The database is now freely available at http://sRNAMap.mbc.nctu.edu.tw/.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

The National Science Council of the Republic of China (Contract No. NSC 96-3112-E-009-002, NSC 95-2311-B-009-004-MY3 and 97-2627-B-009-007); National Research Program for Genomic Medicine (NRPGM), Taiwan; MOE ATU (Partial). Funding for the open access publication charge: National Science Council of the Republic of China and MOE ATU. Conflict of Interest statement: None declared.
  18 in total

1.  A bioinformatics based approach to discover small RNA genes in the Escherichia coli genome.

Authors:  Shuo Chen; Elena A Lesnik; Thomas A Hall; Rangarajan Sampath; Richard H Griffey; Dave J Ecker; Lawrence B Blyn
Journal:  Biosystems       Date:  2002 Mar-May       Impact factor: 1.973

2.  Novel small RNA-encoding genes in the intergenic regions of Escherichia coli.

Authors:  L Argaman; R Hershberg; J Vogel; G Bejerano; E G Wagner; H Margalit; S Altuvia
Journal:  Curr Biol       Date:  2001-06-26       Impact factor: 10.834

Review 3.  Target identification of small noncoding RNAs in bacteria.

Authors:  Jörg Vogel; E Gerhart H Wagner
Journal:  Curr Opin Microbiol       Date:  2007-06-15       Impact factor: 7.934

4.  ASAP, a systematic annotation package for community analysis of genomes.

Authors:  Jeremy D Glasner; Paul Liss; Guy Plunkett; Aaron Darling; Tejasvini Prasad; Michael Rusch; Alexis Byrnes; Michael Gilson; Bryan Biehl; Frederick R Blattner; Nicole T Perna
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

5.  The UCSC Archaeal Genome Browser.

Authors:  Kevin L Schneider; Katherine S Pollard; Robert Baertsch; Andy Pohl; Todd M Lowe
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

6.  NPInter: the noncoding RNAs and protein related biomacromolecules interaction database.

Authors:  Tao Wu; Jie Wang; Changning Liu; Yong Zhang; Baochen Shi; Xiaopeng Zhu; Zhihua Zhang; Geir Skogerbø; Lan Chen; Hongchao Lu; Yi Zhao; Runsheng Chen
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  NCBI GEO: mining tens of millions of expression profiles--database and tools update.

Authors:  Tanya Barrett; Dennis B Troup; Stephen E Wilhite; Pierre Ledoux; Dmitry Rudnev; Carlos Evangelista; Irene F Kim; Alexandra Soboleva; Maxim Tomashevsky; Ron Edgar
Journal:  Nucleic Acids Res       Date:  2006-11-11       Impact factor: 16.971

8.  NONCODE v2.0: decoding the non-coding.

Authors:  Shunmin He; Changning Liu; Geir Skogerbø; Haitao Zhao; Jie Wang; Tao Liu; Baoyan Bai; Yi Zhao; Runsheng Chen
Journal:  Nucleic Acids Res       Date:  2007-11-13       Impact factor: 16.971

9.  RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation.

Authors:  Socorro Gama-Castro; Verónica Jiménez-Jacinto; Martín Peralta-Gil; Alberto Santos-Zavaleta; Mónica I Peñaloza-Spinola; Bruno Contreras-Moreira; Juan Segura-Salazar; Luis Muñiz-Rascado; Irma Martínez-Flores; Heladia Salgado; César Bonavides-Martínez; Cei Abreu-Goodger; Carlos Rodríguez-Penagos; Juan Miranda-Ríos; Enrique Morett; Enrique Merino; Araceli M Huerta; Luis Treviño-Quintanilla; Julio Collado-Vides
Journal:  Nucleic Acids Res       Date:  2007-12-23       Impact factor: 16.971

10.  RNALogo: a new approach to display structural RNA alignment.

Authors:  Tzu-Hao Chang; Jorng-Tzong Horng; Hsien-Da Huang
Journal:  Nucleic Acids Res       Date:  2008-05-21       Impact factor: 16.971

View more
  39 in total

1.  Tracing common origins of Genomic Islands in prokaryotes based on genome signature analyses.

Authors:  Mark Wj van Passel
Journal:  Mob Genet Elements       Date:  2011-09-01

2.  RNAcentral: A vision for an international database of RNA sequences.

Authors:  Alex Bateman; Shipra Agrawal; Ewan Birney; Elspeth A Bruford; Janusz M Bujnicki; Guy Cochrane; James R Cole; Marcel E Dinger; Anton J Enright; Paul P Gardner; Daniel Gautheret; Sam Griffiths-Jones; Jen Harrow; Javier Herrero; Ian H Holmes; Hsien-Da Huang; Krystyna A Kelly; Paul Kersey; Ana Kozomara; Todd M Lowe; Manja Marz; Simon Moxon; Kim D Pruitt; Tore Samuelsson; Peter F Stadler; Albert J Vilella; Jan-Hinnerk Vogel; Kelly P Williams; Mathew W Wright; Christian Zwieb
Journal:  RNA       Date:  2011-09-22       Impact factor: 4.942

3.  sRNATarBase: a comprehensive database of bacterial sRNA targets verified by experiments.

Authors:  Yuan Cao; Jiayao Wu; Qian Liu; Yalin Zhao; Xiaomin Ying; Lei Cha; Ligui Wang; Wuju Li
Journal:  RNA       Date:  2010-09-15       Impact factor: 4.942

4.  Multi label learning for prediction of human protein subcellular localizations.

Authors:  Lin Zhu; Jie Yang; Hong-Bin Shen
Journal:  Protein J       Date:  2009-12       Impact factor: 2.371

5.  Assessing computational tools for the discovery of small RNA genes in bacteria.

Authors:  Xiaojun Lu; Heidi Goodrich-Blair; Brian Tjaden
Journal:  RNA       Date:  2011-07-18       Impact factor: 4.942

6.  Ras-driven transcriptome analysis identifies aurora kinase A as a potential malignant peripheral nerve sheath tumor therapeutic target.

Authors:  Ami V Patel; David Eaves; Walter J Jessen; Tilat A Rizvi; Jeffrey A Ecsedy; Mark G Qian; Bruce J Aronow; John P Perentesis; Eduard Serra; Timothy P Cripe; Shyra J Miller; Nancy Ratner
Journal:  Clin Cancer Res       Date:  2012-07-18       Impact factor: 12.531

Review 7.  Recent advances in genetic engineering tools based on synthetic biology.

Authors:  Jun Ren; Jingyu Lee; Dokyun Na
Journal:  J Microbiol       Date:  2020-01-02       Impact factor: 3.422

8.  A novel antisense RNA regulates at transcriptional level the virulence gene icsA of Shigella flexneri.

Authors:  Mara Giangrossi; Gianni Prosseda; Chi Nhan Tran; Anna Brandi; Bianca Colonna; Maurizio Falconi
Journal:  Nucleic Acids Res       Date:  2010-02-03       Impact factor: 16.971

9.  Modularity of Escherichia coli sRNA regulation revealed by sRNA-target and protein network analysis.

Authors:  Timothy H Wu; Ian Yi-Feng Chang; Li-chieh Julie Chu; Hsuan-Cheng Huang; Wailap Victor Ng
Journal:  BMC Bioinformatics       Date:  2010-10-15       Impact factor: 3.307

10.  sRNAscanner: a computational tool for intergenic small RNA detection in bacterial genomes.

Authors:  Jayavel Sridhar; Narmada Sambaturu; Suryanarayanan Ramkumar Narmada; Radhakrishnan Sabarinathan; Hong-Yu Ou; Zixin Deng; Kanagaraj Sekar; Ziauddin Ahamed Rafi; Kumar Rajakumar
Journal:  PLoS One       Date:  2010-08-05       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.