Literature DB >> 25765651

Analysis of strand-specific RNA-seq data using machine learning reveals the structures of transcription units in Clostridium thermocellum.

Wen-Chi Chou1, Qin Ma1, Shihui Yang2, Sha Cao3, Dawn M Klingeman4, Steven D Brown4, Ying Xu5.   

Abstract

Identification of transcription units (TUs) encoded in a bacterial genome is essential to elucidation of transcriptional regulation of the organism. To gain a detailed understanding of the dynamically composed TU structures, we have used four strand-specific RNA-seq (ssRNA-seq) datasets collected under two experimental conditions to derive the genomic TU organization of Clostridium thermocellum using a machine-learning approach. Our method accurately predicted the genomic boundaries of individual TUs based on two sets of parameters measuring the RNA-seq expression patterns across the genome: expression-level continuity and variance. A total of 2590 distinct TUs are predicted based on the four RNA-seq datasets. Among the predicted TUs, 44% have multiple genes. We assessed our prediction method on an independent set of RNA-seq data with longer reads. The evaluation confirmed the high quality of the predicted TUs. Functional enrichment analyses on a selected subset of the predicted TUs revealed interesting biology. To demonstrate the generality of the prediction method, we have also applied the method to RNA-seq data collected on Escherichia coli and achieved high prediction accuracies. The TU prediction program named SeqTU is publicly available at https://code.google.com/p/seqtu/. We expect that the predicted TUs can serve as the baseline information for studying transcriptional and post-transcriptional regulation in C. thermocellum and other bacteria.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2015        PMID: 25765651      PMCID: PMC4446414          DOI: 10.1093/nar/gkv177

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  35 in total

1.  Transcriptome complexity in a genome-reduced bacterium.

Authors:  Marc Güell; Vera van Noort; Eva Yus; Wei-Hua Chen; Justine Leigh-Bell; Konstantinos Michalodimitrakis; Takuji Yamada; Manimozhiyan Arumugam; Tobias Doerks; Sebastian Kühner; Michaela Rode; Mikita Suyama; Sabine Schmidt; Anne-Claude Gavin; Peer Bork; Luis Serrano
Journal:  Science       Date:  2009-11-27       Impact factor: 47.728

Review 2.  Application of RNA-seq to reveal the transcript profile in bacteria.

Authors:  A C Pinto; H P Melo-Barbosa; A Miyoshi; A Silva; V Azevedo
Journal:  Genet Mol Res       Date:  2011

3.  The primary transcriptome of the major human pathogen Helicobacter pylori.

Authors:  Cynthia M Sharma; Steve Hoffmann; Fabien Darfeuille; Jérémy Reignier; Sven Findeiss; Alexandra Sittka; Sandrine Chabas; Kristin Reiche; Jörg Hackermüller; Richard Reinhardt; Peter F Stadler; Jörg Vogel
Journal:  Nature       Date:  2010-02-17       Impact factor: 49.962

4.  The transcription unit architecture of the Escherichia coli genome.

Authors:  Byung-Kwan Cho; Karsten Zengler; Yu Qiu; Young Seoub Park; Eric M Knight; Christian L Barrett; Yuan Gao; Bernhard Ø Palsson
Journal:  Nat Biotechnol       Date:  2009-11-01       Impact factor: 54.908

5.  The percentage of bacterial genes on leading versus lagging strands is influenced by multiple balancing forces.

Authors:  Xizeng Mao; Han Zhang; Yanbin Yin; Ying Xu
Journal:  Nucleic Acids Res       Date:  2012-06-26       Impact factor: 16.971

6.  A new framework for identifying cis-regulatory motifs in prokaryotes.

Authors:  Guojun Li; Bingqiang Liu; Qin Ma; Ying Xu
Journal:  Nucleic Acids Res       Date:  2010-12-11       Impact factor: 16.971

Review 7.  RNA degradome--its biogenesis and functions.

Authors:  Paulina Jackowiak; Martyna Nowacka; Pawel M Strozycki; Marek Figlerowicz
Journal:  Nucleic Acids Res       Date:  2011-06-07       Impact factor: 16.971

8.  Mycoplasma hyopneumoniae transcription unit organization: genome survey and prediction.

Authors:  Franciele Maboni Siqueira; Augusto Schrank; Irene Silveira Schrank
Journal:  DNA Res       Date:  2011-11-14       Impact factor: 4.458

9.  RNA-seq based transcriptional map of bovine respiratory disease pathogen "Histophilus somni 2336".

Authors:  Ranjit Kumar; Mark L Lawrence; James Watt; Amanda M Cooksey; Shane C Burgess; Bindu Nanduri
Journal:  PLoS One       Date:  2012-01-20       Impact factor: 3.240

10.  Deep sequencing-based discovery of the Chlamydia trachomatis transcriptome.

Authors:  Marco Albrecht; Cynthia M Sharma; Richard Reinhardt; Jörg Vogel; Thomas Rudel
Journal:  Nucleic Acids Res       Date:  2009-11-18       Impact factor: 16.971

View more
  9 in total

1.  Systematic analysis of the underlying genomic architecture for transcriptional-translational coupling in prokaryotes.

Authors:  Richa Bharti; Daniel Siebert; Bastian Blombach; Dominik G Grimm
Journal:  NAR Genom Bioinform       Date:  2022-09-27

2.  Bacterial regulon modeling and prediction based on systematic cis regulatory motif analyses.

Authors:  Bingqiang Liu; Chuan Zhou; Guojun Li; Hanyuan Zhang; Erliang Zeng; Qi Liu; Qin Ma
Journal:  Sci Rep       Date:  2016-03-15       Impact factor: 4.379

3.  SeqTU: A Web Server for Identification of Bacterial Transcription Units.

Authors:  Xin Chen; Wen-Chi Chou; Qin Ma; Ying Xu
Journal:  Sci Rep       Date:  2017-03-07       Impact factor: 4.379

4.  A machine learning classifier trained on cancer transcriptomes detects NF1 inactivation signal in glioblastoma.

Authors:  Gregory P Way; Robert J Allaway; Stephanie J Bouley; Camilo E Fadul; Yolanda Sanchez; Casey S Greene
Journal:  BMC Genomics       Date:  2017-02-06       Impact factor: 3.969

5.  A New Machine Learning-Based Framework for Mapping Uncertainty Analysis in RNA-Seq Read Alignment and Gene Expression Estimation.

Authors:  Adam McDermaid; Xin Chen; Yiran Zhang; Cankun Wang; Shaopeng Gu; Juan Xie; Qin Ma
Journal:  Front Genet       Date:  2018-08-14       Impact factor: 4.599

6.  RECTA: Regulon Identification Based on Comparative Genomics and Transcriptomics Analysis.

Authors:  Xin Chen; Anjun Ma; Adam McDermaid; Hanyuan Zhang; Chao Liu; Huansheng Cao; Qin Ma
Journal:  Genes (Basel)       Date:  2018-05-30       Impact factor: 4.096

Review 7.  Single-Cell RNA Sequencing of Plant-Associated Bacterial Communities.

Authors:  Qin Ma; Heike Bücking; Jose L Gonzalez Hernandez; Senthil Subramanian
Journal:  Front Microbiol       Date:  2019-10-29       Impact factor: 5.640

8.  Revisiting operons: an analysis of the landscape of transcriptional units in E. coli.

Authors:  Xizeng Mao; Qin Ma; Bingqiang Liu; Xin Chen; Hanyuan Zhang; Ying Xu
Journal:  BMC Bioinformatics       Date:  2015-11-04       Impact factor: 3.169

9.  Transcriptional Organization of the Stability Module of Broad-Host-Range Plasmid RA3, from the IncU Group.

Authors:  Ewa Lewicka; Patrycja Dolowy; Jolanta Godziszewska; Emilia Litwin; Marta Ludwiczak; Grazyna Jagura-Burdzy
Journal:  Appl Environ Microbiol       Date:  2020-08-03       Impact factor: 4.792

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.