Brendan A Veeneman1, Sudhanshu Shukla2, Saravana M Dhanasekaran2, Arul M Chinnaiyan3, Alexey I Nesvizhskii4. 1. Department of Computational Medicine and Bioinformatics, Michigan Center for Translational Pathology. 2. Michigan Center for Translational Pathology, Department of Pathology. 3. Department of Computational Medicine and Bioinformatics, Michigan Center for Translational Pathology, Department of Pathology, Department of Urology and Howard Hughes Medical Institute, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA. 4. Department of Computational Medicine and Bioinformatics, Michigan Center for Translational Pathology, Department of Pathology.
Abstract
MOTIVATION: Discovery of novel splicing from RNA sequence data remains a critical and exciting focus of transcriptomics, but reduced alignment power impedes expression quantification of novel splice junctions. RESULTS: Here, we profile performance characteristics of two-pass alignment, which separates splice junction discovery from quantification. Per sample, across a variety of transcriptome sequencing datasets, two-pass alignment improved quantification of at least 94% of simulated novel splice junctions, and provided as much as 1.7-fold deeper median read depth over those splice junctions. We further demonstrate that two-pass alignment works by increasing alignment of reads to splice junctions by short lengths, and that potential alignment errors are readily identifiable by simple classification. Taken together, two-pass alignment promises to advance quantification and discovery of novel splicing events. CONTACT: arul@med.umich.edu, nesvi@med.umich.edu AVAILABILITY AND IMPLEMENTATION: Two-pass alignment was implemented here as sequential alignment, genome indexing, and re-alignment steps with STAR. Full parameters are provided in Supplementary Table 2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Discovery of novel splicing from RNA sequence data remains a critical and exciting focus of transcriptomics, but reduced alignment power impedes expression quantification of novel splice junctions. RESULTS: Here, we profile performance characteristics of two-pass alignment, which separates splice junction discovery from quantification. Per sample, across a variety of transcriptome sequencing datasets, two-pass alignment improved quantification of at least 94% of simulated novel splice junctions, and provided as much as 1.7-fold deeper median read depth over those splice junctions. We further demonstrate that two-pass alignment works by increasing alignment of reads to splice junctions by short lengths, and that potential alignment errors are readily identifiable by simple classification. Taken together, two-pass alignment promises to advance quantification and discovery of novel splicing events. CONTACT: arul@med.umich.edu, nesvi@med.umich.edu AVAILABILITY AND IMPLEMENTATION: Two-pass alignment was implemented here as sequential alignment, genome indexing, and re-alignment steps with STAR. Full parameters are provided in Supplementary Table 2. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras Journal: Bioinformatics Date: 2012-10-25 Impact factor: 6.937
Authors: Alberto Gatto; Carlos Torroja-Fungairiño; Francesco Mazzarotto; Stuart A Cook; Paul J R Barton; Fátima Sánchez-Cabo; Enrique Lara-Pezzi Journal: Nucleic Acids Res Date: 2014-02-25 Impact factor: 16.971
Authors: Pär G Engström; Tamara Steijger; Botond Sipos; Gregory R Grant; André Kahles; Gunnar Rätsch; Nick Goldman; Tim J Hubbard; Jennifer Harrow; Roderic Guigó; Paul Bertone Journal: Nat Methods Date: 2013-11-03 Impact factor: 28.547
Authors: Josep F Abril; Pär G Engström; Felix Kokocinski; Tamara Steijger; Tim J Hubbard; Roderic Guigó; Jennifer Harrow; Paul Bertone Journal: Nat Methods Date: 2013-11-03 Impact factor: 28.547
Authors: Sarah Djebali; Carrie A Davis; Angelika Merkel; Alex Dobin; Timo Lassmann; Ali Mortazavi; Andrea Tanzer; Julien Lagarde; Wei Lin; Felix Schlesinger; Chenghai Xue; Georgi K Marinov; Jainab Khatun; Brian A Williams; Chris Zaleski; Joel Rozowsky; Maik Röder; Felix Kokocinski; Rehab F Abdelhamid; Tyler Alioto; Igor Antoshechkin; Michael T Baer; Nadav S Bar; Philippe Batut; Kimberly Bell; Ian Bell; Sudipto Chakrabortty; Xian Chen; Jacqueline Chrast; Joao Curado; Thomas Derrien; Jorg Drenkow; Erica Dumais; Jacqueline Dumais; Radha Duttagupta; Emilie Falconnet; Meagan Fastuca; Kata Fejes-Toth; Pedro Ferreira; Sylvain Foissac; Melissa J Fullwood; Hui Gao; David Gonzalez; Assaf Gordon; Harsha Gunawardena; Cedric Howald; Sonali Jha; Rory Johnson; Philipp Kapranov; Brandon King; Colin Kingswood; Oscar J Luo; Eddie Park; Kimberly Persaud; Jonathan B Preall; Paolo Ribeca; Brian Risk; Daniel Robyr; Michael Sammeth; Lorian Schaffer; Lei-Hoon See; Atif Shahab; Jorgen Skancke; Ana Maria Suzuki; Hazuki Takahashi; Hagen Tilgner; Diane Trout; Nathalie Walters; Huaien Wang; John Wrobel; Yanbao Yu; Xiaoan Ruan; Yoshihide Hayashizaki; Jennifer Harrow; Mark Gerstein; Tim Hubbard; Alexandre Reymond; Stylianos E Antonarakis; Gregory Hannon; Morgan C Giddings; Yijun Ruan; Barbara Wold; Piero Carninci; Roderic Guigó; Thomas R Gingeras Journal: Nature Date: 2012-09-06 Impact factor: 49.962
Authors: Jacob C Ulirsch; Jeffrey M Verboon; Shideh Kazerounian; Michael H Guo; Daniel Yuan; Leif S Ludwig; Robert E Handsaker; Nour J Abdulhay; Claudia Fiorini; Giulio Genovese; Elaine T Lim; Aaron Cheng; Beryl B Cummings; Katherine R Chao; Alan H Beggs; Casie A Genetti; Colin A Sieff; Peter E Newburger; Edyta Niewiadomska; Michal Matysiak; Adrianna Vlachos; Jeffrey M Lipton; Eva Atsidaftos; Bertil Glader; Anupama Narla; Pierre-Emmanuel Gleizes; Marie-Françoise O'Donohue; Nathalie Montel-Lehry; David J Amor; Steven A McCarroll; Anne H O'Donnell-Luria; Namrata Gupta; Stacey B Gabriel; Daniel G MacArthur; Eric S Lander; Monkol Lek; Lydie Da Costa; David G Nathan; Andrei A Korostelev; Ron Do; Vijay G Sankaran; Hanna T Gazda Journal: Am J Hum Genet Date: 2018-11-29 Impact factor: 11.025
Authors: Samantha Gadd; Vicki Huff; Andrew D Skol; Lindsay A Renfro; Conrad V Fernandez; Elizabeth A Mullen; Corbin D Jones; Katherine A Hoadley; Kai Lee Yap; Nilsa C Ramirez; Sheena Aris; Quy H Phung; Elizabeth J Perlman Journal: Cell Rep Med Date: 2022-05-25
Authors: Zsolt Merényi; Máté Virágh; Emile Gluck-Thaler; Jason C Slot; Brigitta Kiss; Torda Varga; András Geösel; Botond Hegedüs; Balázs Bálint; László G Nagy Journal: Elife Date: 2022-02-14 Impact factor: 8.713
Authors: Sarah M Brotman; Chelsea K Raulerson; Swarooparani Vadlamudi; Kevin W Currin; Qiujin Shen; Victoria A Parsons; Apoorva K Iyengar; Tamara S Roman; Terrence S Furey; Johanna Kuusisto; Francis S Collins; Michael Boehnke; Markku Laakso; Päivi Pajukanta; Karen L Mohlke Journal: Am J Hum Genet Date: 2022-01-06 Impact factor: 11.043