MOTIVATION: The RNA-seq paired-end read (PER) protocol samples transcript fragments longer than the sequencing capability of today's technology by sequencing just the two ends of each fragment. Deep sampling of the transcriptome using the PER protocol presents the opportunity to reconstruct the unsequenced portion of each transcript fragment using end reads from overlapping PERs, guided by the expected length of the fragment. METHODS: A probabilistic framework is described to predict the alignment to the genome of all PER transcript fragments in a PER dataset. Starting from possible exonic and spliced alignments of all end reads, our method constructs potential splicing paths connecting paired ends. An expectation maximization method assigns likelihood values to all splice junctions and assigns the most probable alignment for each transcript fragment. RESULTS: The method was applied to 2 x 35 bp PER datasets from cancer cell lines MCF-7 and SUM-102. PER fragment alignment increased the coverage 3-fold compared to the alignment of the end reads alone, and increased the accuracy of splice detection. The accuracy of the expectation maximization (EM) algorithm in the presence of alternative paths in the splice graph was validated by qRT-PCR experiments on eight exon skipping alternative splicing events. PER fragment alignment with long-range splicing confirmed 8 out of 10 fusion events identified in the MCF-7 cell line in an earlier study by (Maher et al., 2009). AVAILABILITY: Software available at http://www.netlab.uky.edu/p/bioinfo/MapSplice/PER.
MOTIVATION: The RNA-seq paired-end read (PER) protocol samples transcript fragments longer than the sequencing capability of today's technology by sequencing just the two ends of each fragment. Deep sampling of the transcriptome using the PER protocol presents the opportunity to reconstruct the unsequenced portion of each transcript fragment using end reads from overlapping PERs, guided by the expected length of the fragment. METHODS: A probabilistic framework is described to predict the alignment to the genome of all PER transcript fragments in a PER dataset. Starting from possible exonic and spliced alignments of all end reads, our method constructs potential splicing paths connecting paired ends. An expectation maximization method assigns likelihood values to all splice junctions and assigns the most probable alignment for each transcript fragment. RESULTS: The method was applied to 2 x 35 bp PER datasets from cancer cell lines MCF-7 and SUM-102. PER fragment alignment increased the coverage 3-fold compared to the alignment of the end reads alone, and increased the accuracy of splice detection. The accuracy of the expectation maximization (EM) algorithm in the presence of alternative paths in the splice graph was validated by qRT-PCR experiments on eight exon skipping alternative splicing events. PER fragment alignment with long-range splicing confirmed 8 out of 10 fusion events identified in the MCF-7 cell line in an earlier study by (Maher et al., 2009). AVAILABILITY: Software available at http://www.netlab.uky.edu/p/bioinfo/MapSplice/PER.
Authors: Michael F Berger; Joshua Z Levin; Krishna Vijayendran; Andrey Sivachenko; Xian Adiconis; Jared Maguire; Laura A Johnson; James Robinson; Roel G Verhaak; Carrie Sougnez; Robert C Onofrio; Liuda Ziaugra; Kristian Cibulskis; Elisabeth Laine; Jordi Barretina; Wendy Winckler; David E Fisher; Gad Getz; Matthew Meyerson; David B Jaffe; Stacey B Gabriel; Eric S Lander; Reinhard Dummer; Andreas Gnirke; Chad Nusbaum; Levi A Garraway Journal: Genome Res Date: 2010-02-23 Impact factor: 9.043
Authors: Martin Krzywinski; Jacqueline Schein; Inanç Birol; Joseph Connors; Randy Gascoyne; Doug Horsman; Steven J Jones; Marco A Marra Journal: Genome Res Date: 2009-06-18 Impact factor: 9.043
Authors: Christopher A Maher; Nallasivam Palanisamy; John C Brenner; Xuhong Cao; Shanker Kalyana-Sundaram; Shujun Luo; Irina Khrebtukova; Terrence R Barrette; Catherine Grasso; Jindan Yu; Robert J Lonigro; Gary Schroth; Chandan Kumar-Sinha; Arul M Chinnaiyan Journal: Proc Natl Acad Sci U S A Date: 2009-07-10 Impact factor: 11.205
Authors: Kai Wang; Darshan Singh; Zheng Zeng; Stephen J Coleman; Yan Huang; Gleb L Savich; Xiaping He; Piotr Mieczkowski; Sara A Grimm; Charles M Perou; James N MacLeod; Derek Y Chiang; Jan F Prins; Jinze Liu Journal: Nucleic Acids Res Date: 2010-08-27 Impact factor: 16.971
Authors: Yan Huang; Yin Hu; Corbin D Jones; James N MacLeod; Derek Y Chiang; Yufeng Liu; Jan F Prins; Jinze Liu Journal: J Comput Biol Date: 2013-03 Impact factor: 1.479
Authors: Susan M Corley; Karen L MacKenzie; Annemiek Beverdam; Louise F Roddam; Marc R Wilkins Journal: BMC Genomics Date: 2017-05-23 Impact factor: 3.969
Authors: Henrik N Kløverpris; Samuel W Kazer; Jenny Mjösberg; Jenniffer M Mabuka; Amanda Wellmann; Zaza Ndhlovu; Marisa C Yadon; Shepherd Nhamoyebonde; Maximilian Muenchhoff; Yannick Simoni; Frank Andersson; Warren Kuhn; Nigel Garrett; Wendy A Burgers; Philomena Kamya; Karyn Pretorius; Krista Dong; Amber Moodley; Evan W Newell; Victoria Kasprowicz; Salim S Abdool Karim; Philip Goulder; Alex K Shalek; Bruce D Walker; Thumbi Ndung'u; Alasdair Leslie Journal: Immunity Date: 2016-02-02 Impact factor: 31.745
Authors: Roberto Plebani; Gavin R Oliver; Marco Trerotola; Emanuela Guerra; Pamela Cantanelli; Luana Apicella; Andrew Emerson; Alessandro Albiero; Paul D Harkin; Richard D Kennedy; Saverio Alberti Journal: Neoplasia Date: 2012-11 Impact factor: 5.715
Authors: Andrew McPherson; Fereydoun Hormozdiari; Abdalnasser Zayed; Ryan Giuliany; Gavin Ha; Mark G F Sun; Malachi Griffith; Alireza Heravi Moussavi; Janine Senz; Nataliya Melnyk; Marina Pacheco; Marco A Marra; Martin Hirst; Torsten O Nielsen; S Cenk Sahinalp; David Huntsman; Sohrab P Shah Journal: PLoS Comput Biol Date: 2011-05-19 Impact factor: 4.475
Authors: Robert Kofler; Pablo Orozco-terWengel; Nicola De Maio; Ram Vinay Pandey; Viola Nolte; Andreas Futschik; Carolin Kosiol; Christian Schlötterer Journal: PLoS One Date: 2011-01-06 Impact factor: 3.752
Authors: Darshan Singh; Christian F Orellana; Yin Hu; Corbin D Jones; Yufeng Liu; Derek Y Chiang; Jinze Liu; Jan F Prins Journal: Bioinformatics Date: 2011-08-08 Impact factor: 6.937
Authors: Yin Hu; Yan Huang; Ying Du; Christian F Orellana; Darshan Singh; Amy R Johnson; Anaïs Monroy; Pei-Fen Kuan; Scott M Hammond; Liza Makowski; Scott H Randell; Derek Y Chiang; D Neil Hayes; Corbin Jones; Yufeng Liu; Jan F Prins; Jinze Liu Journal: Nucleic Acids Res Date: 2012-11-15 Impact factor: 16.971