MOTIVATION: Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. RESULTS: To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. AVAILABILITY AND IMPLEMENTATION: STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.
MOTIVATION: Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. RESULTS: To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. AVAILABILITY AND IMPLEMENTATION: STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.
Authors: Stefan Kurtz; Adam Phillippy; Arthur L Delcher; Michael Smoot; Martin Shumway; Corina Antonescu; Steven L Salzberg Journal: Genome Biol Date: 2004-01-30 Impact factor: 13.583
Authors: Bryan P Kline; Kathleen M Schieffer; Christine S Choi; Tara Connelly; Jeffrey Chen; Leonard Harris; Sue Deiling; Gregory S Yochum; Walter A Koltun Journal: Dig Dis Sci Date: 2018-12-03 Impact factor: 3.199
Authors: Joshua A Betts; Mahdi Moradi Marjaneh; Fares Al-Ejeh; Yi Chieh Lim; Wei Shi; Haran Sivakumaran; Romain Tropée; Ann-Marie Patch; Michael B Clark; Nenad Bartonicek; Adrian P Wiegmans; Kristine M Hillman; Susanne Kaufmann; Amanda L Bain; Brian S Gloss; Joanna Crawford; Stephen Kazakoff; Shivangi Wani; Shu W Wen; Bryan Day; Andreas Möller; Nicole Cloonan; John Pearson; Melissa A Brown; Timothy R Mercer; Nicola Waddell; Kum Kum Khanna; Eloise Dray; Marcel E Dinger; Stacey L Edwards; Juliet D French Journal: Am J Hum Genet Date: 2017-08-03 Impact factor: 11.025
Authors: Themis Alissafi; Lydia Kalafati; Maria Lazari; Anastasia Filia; Ismini Kloukina; Maria Manifava; Jong-Hyung Lim; Vasileia Ismini Alexaki; Nicholas T Ktistakis; Triantafyllos Doskas; George A Garinis; Triantafyllos Chavakis; Dimitrios T Boumpas; Panayotis Verginis Journal: Cell Metab Date: 2020-07-31 Impact factor: 27.287
Authors: Marta Garcia-Miralles; Michal Geva; Jing Ying Tan; Nur Amirah Binte Mohammad Yusof; Yoonjeong Cha; Rebecca Kusko; Liang Juin Tan; Xiaohong Xu; Iris Grossman; Aric Orbach; Michael R Hayden; Mahmoud A Pouladi Journal: JCI Insight Date: 2017-12-07
Authors: Samantha J Mascuch; Paul D Boudreau; Tristan M Carland; N Tessa Pierce; Joshua Olson; Mary E Hensler; Hyukjae Choi; Joseph Campanale; Amro Hamdoun; Victor Nizet; William H Gerwick; Teresa Gaasterland; Lena Gerwick Journal: J Nat Prod Date: 2017-12-07 Impact factor: 4.050
Authors: Gráinne Neary; Ashley W Blom; Anna I Shiel; Gabrielle Wheway; Jason P Mansell Journal: J Mater Sci Mater Med Date: 2018-07-21 Impact factor: 3.896