| Literature DB >> 27174874 |
Ming Wang1, Joy Fleming2, Zihui Li3, Chuanyou Li3, Hongtai Zhang2, Yunxin Xue2, Maoshan Chen4, Zongde Zhang3, Xian-En Zhang5, Lijun Bi6.
Abstract
Deep-sequencing of bacterial transcriptomes using RNA-Seq technology has made it possible to identify small non-coding RNAs, RNA molecules which regulate gene expression in response to changing environments, on a genome-wide scale in an ever-increasing range of prokaryotes. However, a simple and reliable automated method for identifying sRNA candidates in these large datasets is lacking. Here, after generating a transcriptome from an exponential phase culture of Mycobacterium tuberculosis H37Rv, we developed and validated an automated method for the genome-wide identification of sRNA candidate-containing regions within RNA-Seq datasets based on the analysis of the characteristics of reads coverage maps. We identified 192 novel candidate sRNA-encoding regions in intergenic regions and 664 RNA transcripts transcribed from regions antisense (as) to open reading frames (ORF), which bear the characteristics of asRNAs, and validated 28 of these novel sRNA-encoding regions by northern blotting. Our work has not only provided a simple automated method for genome-wide identification of candidate sRNA-encoding regions in RNA-Seq data, but has also uncovered many novel candidate sRNA-encoding regions in M. tuberculosis, reinforcing the view that the control of gene expression in bacteria is more complex than previously anticipated.Entities:
Keywords: Mycobacterium tuberculosis; RNA-Seq; non-coding RNA; transcriptome
Mesh:
Substances:
Year: 2016 PMID: 27174874 PMCID: PMC4913526 DOI: 10.1093/abbs/gmw037
Source DB: PubMed Journal: Acta Biochim Biophys Sin (Shanghai) ISSN: 1672-9145 Impact factor: 3.848