Literature DB >> 25087770

SCOPE++: sequence classification of homoPolymer emissions.

James T Morton1, Patricia Abrudan2, Nathanial Figueroa3, Chun Liang4, John E Karro5.   

Abstract

BACKGROUND: mRNA polyadenylation, the addition of a poly(A) tail to the 3'-end of pre-mRNA, is a process critical to gene expression and regulation in eukaryotes. To understand the molecular mechanisms governing polyadenylation and other relevant biological processes, it is important to identify these poly(A) tails accurately in transcriptome sequencing data and differentiate them from artificial adapter sequences added in the sequencing process. But the annotation of these tails is complicated by the presence of sequencing errors and post-transcriptional modifications. While determining that a tail is present in a given transcript fragment is straight-forward, these obfuscations make the problem of boundary identification a challenge; conventional seed-and-extend algorithms struggle to accurately identify these poly(A) tail end-points. Further, all existing tools that we are aware of focus exclusively on the trimming of poly(A) tails, failing to provide the detailed information needed for studying the polyadenylation process.
RESULTS: We have created SCOPE++, an open-source tool for finding the precise border of poly(A) tails and other homopolymers in raw mRNA sequence reads. Based on a Hidden Markov Model (HMM) approach, SCOPE++ accurately identifies specific homopolymer sequences in error-prone EST/cDNA data or RNA-Seq data at a speed appropriate for large sequence sets.
CONCLUSIONS: We demonstrate that our tool can precisely identify poly(A) tails with near perfect accuracy at the speed required for high-throughput applications, providing a valuable resource for polyadenylation research.
Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Hidden Markov Model; Polyadenylation; Transcriptome

Mesh:

Substances:

Year:  2014        PMID: 25087770      PMCID: PMC4165746          DOI: 10.1016/j.ygeno.2014.07.005

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  16 in total

1.  EMBOSS: the European Molecular Biology Open Software Suite.

Authors:  P Rice; I Longden; A Bleasby
Journal:  Trends Genet       Date:  2000-06       Impact factor: 11.639

Review 2.  Translational control by CPEB: a means to the end.

Authors:  R Mendez; J D Richter
Journal:  Nat Rev Mol Cell Biol       Date:  2001-07       Impact factor: 94.444

3.  Nontemplated nucleotide addition prior to polyadenylation: a comparison of Arabidopsis cDNA and genomic sequences.

Authors:  Yongfeng Jin; Tengfei Bian
Journal:  RNA       Date:  2004-09-23       Impact factor: 4.942

4.  Novel extraction strategy of ribosomal RNA and genomic DNA from cheese for PCR-based investigations.

Authors:  Catherine Bonaïti; Sandrine Parayre; Françoise Irlinger
Journal:  Int J Food Microbiol       Date:  2005-11-02       Impact factor: 5.277

5.  Genome-wide landscape of polyadenylation in Arabidopsis provides evidence for extensive alternative polyadenylation.

Authors:  Xiaohui Wu; Man Liu; Bruce Downie; Chun Liang; Guoli Ji; Qingshun Q Li; Arthur G Hunt
Journal:  Proc Natl Acad Sci U S A       Date:  2011-07-11       Impact factor: 11.205

Review 6.  Ending the message: poly(A) signals then and now.

Authors:  Nick J Proudfoot
Journal:  Genes Dev       Date:  2011-09-01       Impact factor: 11.361

7.  Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation.

Authors:  Fatih Ozsolak; Philipp Kapranov; Sylvain Foissac; Sang Woo Kim; Elane Fishilevich; A Paula Monaghan; Bino John; Patrice M Milos
Journal:  Cell       Date:  2010-12-10       Impact factor: 41.582

8.  SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read.

Authors:  Juan Falgueras; Antonio J Lara; Noé Fernández-Pozo; Francisco R Cantón; Guillermo Pérez-Trabado; M Gonzalo Claros
Journal:  BMC Bioinformatics       Date:  2010-01-20       Impact factor: 3.169

9.  Widespread shortening of 3'UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells.

Authors:  Christine Mayr; David P Bartel
Journal:  Cell       Date:  2009-08-21       Impact factor: 41.582

10.  Poly(A)-tail profiling reveals an embryonic switch in translational control.

Authors:  Alexander O Subtelny; Stephen W Eichhorn; Grace R Chen; Hazel Sive; David P Bartel
Journal:  Nature       Date:  2014-01-29       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.