Literature DB >> 20576625

A probabilistic framework for aligning paired-end RNA-seq data.

Yin Hu1, Kai Wang, Xiaping He, Derek Y Chiang, Jan F Prins, Jinze Liu.   

Abstract

MOTIVATION: The RNA-seq paired-end read (PER) protocol samples transcript fragments longer than the sequencing capability of today's technology by sequencing just the two ends of each fragment. Deep sampling of the transcriptome using the PER protocol presents the opportunity to reconstruct the unsequenced portion of each transcript fragment using end reads from overlapping PERs, guided by the expected length of the fragment.
METHODS: A probabilistic framework is described to predict the alignment to the genome of all PER transcript fragments in a PER dataset. Starting from possible exonic and spliced alignments of all end reads, our method constructs potential splicing paths connecting paired ends. An expectation maximization method assigns likelihood values to all splice junctions and assigns the most probable alignment for each transcript fragment.
RESULTS: The method was applied to 2 x 35 bp PER datasets from cancer cell lines MCF-7 and SUM-102. PER fragment alignment increased the coverage 3-fold compared to the alignment of the end reads alone, and increased the accuracy of splice detection. The accuracy of the expectation maximization (EM) algorithm in the presence of alternative paths in the splice graph was validated by qRT-PCR experiments on eight exon skipping alternative splicing events. PER fragment alignment with long-range splicing confirmed 8 out of 10 fusion events identified in the MCF-7 cell line in an earlier study by (Maher et al., 2009). AVAILABILITY: Software available at http://www.netlab.uky.edu/p/bioinfo/MapSplice/PER.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20576625      PMCID: PMC2916723          DOI: 10.1093/bioinformatics/btq336

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Integrative analysis of the melanoma transcriptome.

Authors:  Michael F Berger; Joshua Z Levin; Krishna Vijayendran; Andrey Sivachenko; Xian Adiconis; Jared Maguire; Laura A Johnson; James Robinson; Roel G Verhaak; Carrie Sougnez; Robert C Onofrio; Liuda Ziaugra; Kristian Cibulskis; Elisabeth Laine; Jordi Barretina; Wendy Winckler; David E Fisher; Gad Getz; Matthew Meyerson; David B Jaffe; Stacey B Gabriel; Eric S Lander; Reinhard Dummer; Andreas Gnirke; Chad Nusbaum; Levi A Garraway
Journal:  Genome Res       Date:  2010-02-23       Impact factor: 9.043

2.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

3.  Circos: an information aesthetic for comparative genomics.

Authors:  Martin Krzywinski; Jacqueline Schein; Inanç Birol; Joseph Connors; Randy Gascoyne; Doug Horsman; Steven J Jones; Marco A Marra
Journal:  Genome Res       Date:  2009-06-18       Impact factor: 9.043

4.  Chimeric transcript discovery by paired-end transcriptome sequencing.

Authors:  Christopher A Maher; Nallasivam Palanisamy; John C Brenner; Xuhong Cao; Shanker Kalyana-Sundaram; Shujun Luo; Irina Khrebtukova; Terrence R Barrette; Catherine Grasso; Jindan Yu; Robert J Lonigro; Gary Schroth; Chandan Kumar-Sinha; Arul M Chinnaiyan
Journal:  Proc Natl Acad Sci U S A       Date:  2009-07-10       Impact factor: 11.205

5.  Detection of splice junctions from paired-end RNA-seq data by SpliceMap.

Authors:  Kin Fai Au; Hui Jiang; Lan Lin; Yi Xing; Wing Hung Wong
Journal:  Nucleic Acids Res       Date:  2010-04-05       Impact factor: 16.971

6.  MapSplice: accurate mapping of RNA-seq reads for splice junction discovery.

Authors:  Kai Wang; Darshan Singh; Zheng Zeng; Stephen J Coleman; Yan Huang; Gleb L Savich; Xiaping He; Piotr Mieczkowski; Sara A Grimm; Charles M Perou; James N MacLeod; Derek Y Chiang; Jan F Prins; Jinze Liu
Journal:  Nucleic Acids Res       Date:  2010-08-27       Impact factor: 16.971

7.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

  7 in total
  13 in total

1.  Sensitive gene fusion detection using ambiguously mapping RNA-Seq read pairs.

Authors:  Marcus Kinsella; Olivier Harismendy; Masakazu Nakano; Kelly A Frazer; Vineet Bafna
Journal:  Bioinformatics       Date:  2011-02-16       Impact factor: 6.937

2.  A robust method for transcript quantification with RNA-seq data.

Authors:  Yan Huang; Yin Hu; Corbin D Jones; James N MacLeod; Derek Y Chiang; Yufeng Liu; Jan F Prins; Jinze Liu
Journal:  J Comput Biol       Date:  2013-03       Impact factor: 1.479

3.  Differentially expressed genes from RNA-Seq and functional enrichment results are affected by the choice of single-end versus paired-end reads and stranded versus non-stranded protocols.

Authors:  Susan M Corley; Karen L MacKenzie; Annemiek Beverdam; Louise F Roddam; Marc R Wilkins
Journal:  BMC Genomics       Date:  2017-05-23       Impact factor: 3.969

4.  Innate Lymphoid Cells Are Depleted Irreversibly during Acute HIV-1 Infection in the Absence of Viral Suppression.

Authors:  Henrik N Kløverpris; Samuel W Kazer; Jenny Mjösberg; Jenniffer M Mabuka; Amanda Wellmann; Zaza Ndhlovu; Marisa C Yadon; Shepherd Nhamoyebonde; Maximilian Muenchhoff; Yannick Simoni; Frank Andersson; Warren Kuhn; Nigel Garrett; Wendy A Burgers; Philomena Kamya; Karyn Pretorius; Krista Dong; Amber Moodley; Evan W Newell; Victoria Kasprowicz; Salim S Abdool Karim; Philip Goulder; Alex K Shalek; Bruce D Walker; Thumbi Ndung'u; Alasdair Leslie
Journal:  Immunity       Date:  2016-02-02       Impact factor: 31.745

5.  Long-range transcriptome sequencing reveals cancer cell growth regulatory chimeric mRNA.

Authors:  Roberto Plebani; Gavin R Oliver; Marco Trerotola; Emanuela Guerra; Pamela Cantanelli; Luana Apicella; Andrew Emerson; Alessandro Albiero; Paul D Harkin; Richard D Kennedy; Saverio Alberti
Journal:  Neoplasia       Date:  2012-11       Impact factor: 5.715

6.  deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.

Authors:  Andrew McPherson; Fereydoun Hormozdiari; Abdalnasser Zayed; Ryan Giuliany; Gavin Ha; Mark G F Sun; Malachi Griffith; Alireza Heravi Moussavi; Janine Senz; Nataliya Melnyk; Marina Pacheco; Marco A Marra; Martin Hirst; Torsten O Nielsen; S Cenk Sahinalp; David Huntsman; Sohrab P Shah
Journal:  PLoS Comput Biol       Date:  2011-05-19       Impact factor: 4.475

7.  PoPoolation: a toolbox for population genetic analysis of next generation sequencing data from pooled individuals.

Authors:  Robert Kofler; Pablo Orozco-terWengel; Nicola De Maio; Ram Vinay Pandey; Viola Nolte; Andreas Futschik; Carolin Kosiol; Christian Schlötterer
Journal:  PLoS One       Date:  2011-01-06       Impact factor: 3.752

8.  FDM: a graph-based statistical method to detect differential transcription using RNA-seq data.

Authors:  Darshan Singh; Christian F Orellana; Yin Hu; Corbin D Jones; Yufeng Liu; Derek Y Chiang; Jinze Liu; Jan F Prins
Journal:  Bioinformatics       Date:  2011-08-08       Impact factor: 6.937

9.  Detecting cancer outlier genes with potential rearrangement using gene expression data and biological networks.

Authors:  Mohammed Alshalalfa; Tarek A Bismar; Reda Alhajj
Journal:  Adv Bioinformatics       Date:  2012-06-28

10.  DiffSplice: the genome-wide detection of differential splicing events with RNA-seq.

Authors:  Yin Hu; Yan Huang; Ying Du; Christian F Orellana; Darshan Singh; Amy R Johnson; Anaïs Monroy; Pei-Fen Kuan; Scott M Hammond; Liza Makowski; Scott H Randell; Derek Y Chiang; D Neil Hayes; Corbin Jones; Yufeng Liu; Jan F Prins; Jinze Liu
Journal:  Nucleic Acids Res       Date:  2012-11-15       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.