Literature DB >> 21775302

Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM).

Gregory R Grant1, Michael H Farkas, Angel D Pizarro, Nicholas F Lahens, Jonathan Schug, Brian P Brunk, Christian J Stoeckert, John B Hogenesch, Eric A Pierce.   

Abstract

MOTIVATION: A critical task in high-throughput sequencing is aligning millions of short reads to a reference genome. Alignment is especially complicated for RNA sequencing (RNA-Seq) because of RNA splicing. A number of RNA-Seq algorithms are available, and claim to align reads with high accuracy and efficiency while detecting splice junctions. RNA-Seq data are discrete in nature; therefore, with reasonable gene models and comparative metrics RNA-Seq data can be simulated to sufficient accuracy to enable meaningful benchmarking of alignment algorithms. The exercise to rigorously compare all viable published RNA-Seq algorithms has not been performed previously.
RESULTS: We developed an RNA-Seq simulator that models the main impediments to RNA alignment, including alternative splicing, insertions, deletions, substitutions, sequencing errors and intron signal. We used this simulator to measure the accuracy and robustness of available algorithms at the base and junction levels. Additionally, we used reverse transcription-polymerase chain reaction (RT-PCR) and Sanger sequencing to validate the ability of the algorithms to detect novel transcript features such as novel exons and alternative splicing in RNA-Seq data from mouse retina. A pipeline based on BLAT was developed to explore the performance of established tools for this problem, and to compare it to the recently developed methods. This pipeline, the RNA-Seq Unified Mapper (RUM), performs comparably to the best current aligners and provides an advantageous combination of accuracy, speed and usability. AVAILABILITY: The RUM pipeline is distributed via the Amazon Cloud and for computing clusters using the Sun Grid Engine (http://cbil.upenn.edu/RUM). CONTACT: ggrant@pcbi.upenn.edu; epierce@mail.med.upenn.edu SUPPLEMENTARY INFORMATION: The RNA-Seq sequence reads described in the article are deposited at GEO, accession GSE26248.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21775302      PMCID: PMC3167048          DOI: 10.1093/bioinformatics/btr427

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  Basal body dysfunction is a likely cause of pleiotropic Bardet-Biedl syndrome.

Authors:  Stephen J Ansley; Jose L Badano; Oliver E Blacque; Josephine Hill; Bethan E Hoskins; Carmen C Leitch; Jun Chul Kim; Alison J Ross; Erica R Eichers; Tanya M Teslovich; Allan K Mah; Robert C Johnsen; John C Cavender; Richard Alan Lewis; Michel R Leroux; Philip L Beales; Nicholas Katsanis
Journal:  Nature       Date:  2003-09-21       Impact factor: 49.962

2.  Activator-mediated recruitment of the MLL2 methyltransferase complex to the beta-globin locus.

Authors:  Celina Demers; Chandra-Prakash Chaturvedi; Jeffrey A Ranish; Gaetan Juban; Patrick Lai; Francois Morle; Ruedi Aebersold; F Jeffrey Dilworth; Mark Groudine; Marjorie Brand
Journal:  Mol Cell       Date:  2007-08-17       Impact factor: 17.970

3.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

4.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms.

Authors:  R Sachidanandam; D Weissman; S C Schmidt; J M Kakol; L D Stein; G Marth; S Sherry; J C Mullikin; B J Mortimore; D L Willey; S E Hunt; C G Cole; P C Coggill; C M Rice; Z Ning; J Rogers; D R Bentley; P Y Kwok; E R Mardis; R T Yeh; B Schultz; L Cook; R Davenport; M Dante; L Fulton; L Hillier; R H Waterston; J D McPherson; B Gilman; S Schaffner; W J Van Etten; D Reich; J Higgins; M J Daly; B Blumenstiel; J Baldwin; N Stange-Thomann; M C Zody; L Linton; E S Lander; D Altshuler
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

5.  Kabuki make-up syndrome: a syndrome of mental retardation, unusual facies, large and protruding ears, and postnatal growth deficiency.

Authors:  N Niikawa; N Matsuura; Y Fukushima; T Ohsawa; T Kajii
Journal:  J Pediatr       Date:  1981-10       Impact factor: 4.406

6.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

Review 7.  The ciliopathies: an emerging class of human genetic disorders.

Authors:  Jose L Badano; Norimasa Mitsuma; Phil L Beales; Nicholas Katsanis
Journal:  Annu Rev Genomics Hum Genet       Date:  2006       Impact factor: 8.929

8.  Rnnotator: an automated de novo transcriptome assembly pipeline from stranded RNA-Seq reads.

Authors:  Jeffrey Martin; Vincent M Bruno; Zhide Fang; Xiandong Meng; Matthew Blow; Tao Zhang; Gavin Sherlock; Michael Snyder; Zhong Wang
Journal:  BMC Genomics       Date:  2010-11-24       Impact factor: 3.969

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

View more
  175 in total

1.  Transgenerational epigenetic programming via sperm microRNA recapitulates effects of paternal stress.

Authors:  Ali B Rodgers; Christopher P Morgan; N Adrian Leu; Tracy L Bale
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-19       Impact factor: 11.205

2.  Designing alternative splicing RNA-seq studies. Beyond generic guidelines.

Authors:  Camille Stephan-Otto Attolini; Victor Peña; David Rossell
Journal:  Bioinformatics       Date:  2015-07-27       Impact factor: 6.937

3.  RASER: reads aligner for SNPs and editing sites of RNA.

Authors:  Jaegyoon Ahn; Xinshu Xiao
Journal:  Bioinformatics       Date:  2015-08-30       Impact factor: 6.937

4.  RNA Sequencing and Analysis.

Authors:  Kimberly R Kukurba; Stephen B Montgomery
Journal:  Cold Spring Harb Protoc       Date:  2015-04-13

Review 5.  Vision from next generation sequencing: multi-dimensional genome-wide analysis for producing gene regulatory networks underlying retinal development, aging and disease.

Authors:  Hyun-Jin Yang; Rinki Ratnapriya; Tiziana Cogliati; Jung-Woong Kim; Anand Swaroop
Journal:  Prog Retin Eye Res       Date:  2015-02-07       Impact factor: 21.198

6.  Serotonergic neuron regulation informed by in vivo single-cell transcriptomics.

Authors:  Jennifer M Spaethling; David Piel; Hannah Dueck; Peter T Buckley; Jacqueline F Morris; Stephen A Fisher; Jaehee Lee; Jai-Yoon Sul; Junhyong Kim; Tamas Bartfai; Sheryl G Beck; James H Eberwine
Journal:  FASEB J       Date:  2013-11-05       Impact factor: 5.191

7.  Generation of a microglial developmental index in mice and in humans reveals a sex difference in maturation and immune reactivity.

Authors:  Richa Hanamsagar; Mark D Alter; Carina S Block; Haley Sullivan; Jessica L Bolton; Staci D Bilbo
Journal:  Glia       Date:  2017-06-15       Impact factor: 7.452

8.  HOS15 Interacts with the Histone Deacetylase HDA9 and the Evening Complex to Epigenetically Regulate the Floral Activator GIGANTEA.

Authors:  Hee Jin Park; Dongwon Baek; Joon-Yung Cha; Xueji Liao; Sang-Ho Kang; C Robertson McClung; Sang Yeol Lee; Dae-Jin Yun; Woe-Yeon Kim
Journal:  Plant Cell       Date:  2019-01-03       Impact factor: 11.277

9.  LIM domain-binding 1 maintains the terminally differentiated state of pancreatic β cells.

Authors:  Benjamin N Ediger; Hee-Woong Lim; Christine Juliana; David N Groff; LaQueena T Williams; Giselle Dominguez; Jin-Hua Liu; Brandon L Taylor; Erik R Walp; Vasumathi Kameswaran; Juxiang Yang; Chengyang Liu; Chad S Hunter; Klaus H Kaestner; Ali Naji; Changhong Li; Maike Sander; Roland Stein; Lori Sussel; Kyoung-Jae Won; Catherine Lee May; Doris A Stoffers
Journal:  J Clin Invest       Date:  2016-12-12       Impact factor: 14.808

10.  Simulation-based comprehensive benchmarking of RNA-seq aligners.

Authors:  Giacomo Baruzzo; Katharina E Hayer; Eun Ji Kim; Barbara Di Camillo; Garret A FitzGerald; Gregory R Grant
Journal:  Nat Methods       Date:  2016-12-12       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.