Literature DB >> 24130305

ORMAN: optimal resolution of ambiguous RNA-Seq multimappings in the presence of novel isoforms.

Phuong Dao1, Ibrahim Numanagić, Yen-Yi Lin, Faraz Hach, Emre Karakoc, Nilgun Donmez, Colin Collins, Evan E Eichler, S Cenk Sahinalp.   

Abstract

MOTIVATION: RNA-Seq technology is promising to uncover many novel alternative splicing events, gene fusions and other variations in RNA transcripts. For an accurate detection and quantification of transcripts, it is important to resolve the mapping ambiguity for those RNA-Seq reads that can be mapped to multiple loci: >17% of the reads from mouse RNA-Seq data and 50% of the reads from some plant RNA-Seq data have multiple mapping loci. In this study, we show how to resolve the mapping ambiguity in the presence of novel transcriptomic events such as exon skipping and novel indels towards accurate downstream analysis. We introduce ORMAN ( O ptimal R esolution of M ultimapping A mbiguity of R N A-Seq Reads), which aims to compute the minimum number of potential transcript products for each gene and to assign each multimapping read to one of these transcripts based on the estimated distribution of the region covering the read. ORMAN achieves this objective through a combinatorial optimization formulation, which is solved through well-known approximation algorithms, integer linear programs and heuristics.
RESULTS: On a simulated RNA-Seq dataset including a random subset of transcripts from the UCSC database, the performance of several state-of-the-art methods for identifying and quantifying novel transcripts, such as Cufflinks, IsoLasso and CLIIQ, is significantly improved through the use of ORMAN. Furthermore, in an experiment using real RNA-Seq reads, we show that ORMAN is able to resolve multimapping to produce coverage values that are similar to the original distribution, even in genes with highly non-uniform coverage. AVAILABILITY: ORMAN is available at http://orman.sf.net

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24130305     DOI: 10.1093/bioinformatics/btt591

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  Compressive mapping for next-generation sequencing.

Authors:  Deniz Yorukoglu; Yun William Yu; Jian Peng; Bonnie Berger
Journal:  Nat Biotechnol       Date:  2016-04       Impact factor: 54.908

2.  Evidence for a Pan-Neurodegenerative Disease Response in Huntington's and Parkinson's Disease Expression Profiles.

Authors:  Adam Labadorf; Seung H Choi; Richard H Myers
Journal:  Front Mol Neurosci       Date:  2018-01-11       Impact factor: 5.639

Review 3.  Dynamics in Transcriptomics: Advancements in RNA-seq Time Course and Downstream Analysis.

Authors:  Daniel Spies; Constance Ciaudo
Journal:  Comput Struct Biotechnol J       Date:  2015-08-24       Impact factor: 7.271

4.  Evidence of Extensive Alternative Splicing in Post Mortem Human Brain HTT Transcription by mRNA Sequencing.

Authors:  Adam T Labadorf; Richard H Myers
Journal:  PLoS One       Date:  2015-10-23       Impact factor: 3.240

5.  Terminus enables the discovery of data-driven, robust transcript groups from RNA-seq data.

Authors:  Hirak Sarkar; Avi Srivastava; Héctor Corrada Bravo; Michael I Love; Rob Patro
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

6.  Improved haplotype inference by exploiting long-range linking and allelic imbalance in RNA-seq datasets.

Authors:  Emily Berger; Deniz Yorukoglu; Lillian Zhang; Sarah K Nyquist; Alex K Shalek; Manolis Kellis; Ibrahim Numanagić; Bonnie Berger
Journal:  Nat Commun       Date:  2020-09-16       Impact factor: 14.919

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.