Literature DB >> 29718481

The fractured landscape of RNA-seq alignment: the default in our STARs.

Sara Ballouz1, Alexander Dobin1, Thomas R Gingeras1, Jesse Gillis1.   

Abstract

Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods' good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner. We assess STAR's performance across a range of alignment parameters using common metrics, and then on biologically focused tasks. We find technical metrics such as fraction mapping or expression profile correlation to be uninformative, capturing properties unlikely to have any role in biological discovery. Surprisingly, we find that changes in alignment parameters within a wide range have little impact on both technical and biological performance. Yet, when performance finally does break, it happens in difficult regions, such as X-Y paralogs and MHC genes. We believe improved reporting by developers will help establish where results are likely to be robust or fragile, providing a better baseline to establish where methodological progress can still occur.

Entities:  

Mesh:

Year:  2018        PMID: 29718481      PMCID: PMC6007662          DOI: 10.1093/nar/gky325

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  31 in total

1.  Novel algorithms and the benefits of comparative validation.

Authors:  Robert Smith; Dan Ventura; John T Prince
Journal:  Bioinformatics       Date:  2013-04-14       Impact factor: 6.937

2.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

3.  compcodeR--an R package for benchmarking differential expression methods for RNA-seq data.

Authors:  Charlotte Soneson
Journal:  Bioinformatics       Date:  2014-05-09       Impact factor: 6.937

Review 4.  Crowdsourcing biomedical research: leveraging communities as innovation engines.

Authors:  Julio Saez-Rodriguez; James C Costello; Stephen H Friend; Michael R Kellen; Lara Mangravite; Pablo Meyer; Thea Norman; Gustavo Stolovitzky
Journal:  Nat Rev Genet       Date:  2016-07-15       Impact factor: 53.242

5.  Near-optimal probabilistic RNA-seq quantification.

Authors:  Nicolas L Bray; Harold Pimentel; Páll Melsted; Lior Pachter
Journal:  Nat Biotechnol       Date:  2016-04-04       Impact factor: 54.908

6.  Human genomics. The human transcriptome across tissues and individuals.

Authors:  Marta Melé; Pedro G Ferreira; Ferran Reverter; David S DeLuca; Jean Monlong; Michael Sammeth; Taylor R Young; Jakob M Goldmann; Dmitri D Pervouchine; Timothy J Sullivan; Rory Johnson; Ayellet V Segrè; Sarah Djebali; Anastasia Niarchou; Fred A Wright; Tuuli Lappalainen; Miquel Calvo; Gad Getz; Emmanouil T Dermitzakis; Kristin G Ardlie; Roderic Guigó
Journal:  Science       Date:  2015-05-08       Impact factor: 47.728

7.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

8.  A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium.

Authors: 
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

9.  Human splicing diversity and the extent of unannotated splice junctions across human RNA-seq samples on the Sequence Read Archive.

Authors:  Abhinav Nellore; Andrew E Jaffe; Jean-Philippe Fortin; José Alquicira-Hernández; Leonardo Collado-Torres; Siruo Wang; Robert A Phillips; Nishika Karbhari; Kasper D Hansen; Ben Langmead; Jeffrey T Leek
Journal:  Genome Biol       Date:  2016-12-30       Impact factor: 13.583

10.  Transcriptome and genome sequencing uncovers functional variation in humans.

Authors:  Tuuli Lappalainen; Michael Sammeth; Marc R Friedländer; Peter A C 't Hoen; Jean Monlong; Manuel A Rivas; Mar Gonzàlez-Porta; Natalja Kurbatova; Thasso Griebel; Pedro G Ferreira; Matthias Barann; Thomas Wieland; Liliana Greger; Maarten van Iterson; Jonas Almlöf; Paolo Ribeca; Irina Pulyakhina; Daniela Esser; Thomas Giger; Andrew Tikhonov; Marc Sultan; Gabrielle Bertier; Daniel G MacArthur; Monkol Lek; Esther Lizano; Henk P J Buermans; Ismael Padioleau; Thomas Schwarzmayr; Olof Karlberg; Halit Ongen; Helena Kilpinen; Sergi Beltran; Marta Gut; Katja Kahlem; Vyacheslav Amstislavskiy; Oliver Stegle; Matti Pirinen; Stephen B Montgomery; Peter Donnelly; Mark I McCarthy; Paul Flicek; Tim M Strom; Hans Lehrach; Stefan Schreiber; Ralf Sudbrak; Angel Carracedo; Stylianos E Antonarakis; Robert Häsler; Ann-Christine Syvänen; Gert-Jan van Ommen; Alvis Brazma; Thomas Meitinger; Philip Rosenstiel; Roderic Guigó; Ivo G Gut; Xavier Estivill; Emmanouil T Dermitzakis
Journal:  Nature       Date:  2013-09-15       Impact factor: 49.962

View more
  6 in total

1.  Systematic evaluation of differential splicing tools for RNA-seq studies.

Authors:  Arfa Mehmood; Asta Laiho; Mikko S Venäläinen; Aidan J McGlinchey; Ning Wang; Laura L Elo
Journal:  Brief Bioinform       Date:  2020-12-01       Impact factor: 11.622

2.  The y-ome defines the 35% of Escherichia coli genes that lack experimental evidence of function.

Authors:  Sankha Ghatak; Zachary A King; Anand Sastry; Bernhard O Palsson
Journal:  Nucleic Acids Res       Date:  2019-03-18       Impact factor: 16.971

3.  Sex Determination Using RNA-Sequencing Analyses in Early Prenatal Pig Development.

Authors:  Susana A Teixeira; Adriana M G Ibelli; Maurício E Cantão; Haniel C de Oliveira; Mônica C Ledur; Jane de O Peixoto; Daniele B D Marques; Karine A Costa; Luiz L Coutinho; Simone E F Guimarães
Journal:  Genes (Basel)       Date:  2019-12-05       Impact factor: 4.096

4.  SPEAQeasy: a scalable pipeline for expression analysis and quantification for R/bioconductor-powered RNA-seq analyses.

Authors:  Nicholas J Eagles; Emily E Burke; Jacob Leonard; Brianna K Barry; Joshua M Stolz; Louise Huuki; BaDoi N Phan; Violeta Larios Serrato; Everardo Gutiérrez-Millán; Israel Aguilar-Ordoñez; Andrew E Jaffe; Leonardo Collado-Torres
Journal:  BMC Bioinformatics       Date:  2021-05-01       Impact factor: 3.169

5.  Histone chaperone HIRA complex regulates retrotransposons in embryonic stem cells.

Authors:  Miao Zhang; Xin Zhao; Xiao Feng; Xiao Hu; Xuan Zhao; Wange Lu; Xinyi Lu
Journal:  Stem Cell Res Ther       Date:  2022-04-01       Impact factor: 6.832

6.  Splice-Junction-Based Mapping of Alternative Isoforms in the Human Proteome.

Authors:  Edward Lau; Yu Han; Damon R Williams; Cody T Thomas; Rajani Shrestha; Joseph C Wu; Maggie P Y Lam
Journal:  Cell Rep       Date:  2019-12-10       Impact factor: 9.423

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.