Literature DB >> 28968634

SparseIso: a novel Bayesian approach to identify alternatively spliced isoforms from RNA-seq data.

Xu Shi1, Xiao Wang1, Tian-Li Wang2, Leena Hilakivi-Clarke3, Robert Clarke3, Jianhua Xuan1.   

Abstract

Motivation: Recent advances in high-throughput RNA sequencing (RNA-seq) technologies have made it possible to reconstruct the full transcriptome of various types of cells. It is important to accurately assemble transcripts or identify isoforms for an improved understanding of molecular mechanisms in biological systems.
Results: We have developed a novel Bayesian method, SparseIso, to reliably identify spliced isoforms from RNA-seq data. A spike-and-slab prior is incorporated into the Bayesian model to enforce the sparsity for isoform identification, effectively alleviating the problem of overfitting. A Gibbs sampling procedure is further developed to simultaneously identify and quantify transcripts from RNA-seq data. With the sampling approach, SparseIso estimates the joint distribution of all candidate transcripts, resulting in a significantly improved performance in detecting lowly expressed transcripts and multiple expressed isoforms of genes. Both simulation study and real data analysis have demonstrated that the proposed SparseIso method significantly outperforms existing methods for improved transcript assembly and isoform identification. Availability and implementation: The SparseIso package is available at http://github.com/henryxushi/SparseIso. Contact: xuan@vt.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Mesh:

Year:  2018        PMID: 28968634      PMCID: PMC5870564          DOI: 10.1093/bioinformatics/btx557

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Characterization of the human ESC transcriptome by hybrid sequencing.

Authors:  Kin Fai Au; Vittorio Sebastiano; Pegah Tootoonchi Afshar; Jens Durruthy Durruthy; Lawrence Lee; Brian A Williams; Harm van Bakel; Eric E Schadt; Renee A Reijo-Pera; Jason G Underwood; Wing Hung Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2013-11-26       Impact factor: 11.205

2.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

3.  Transcriptome assembly and isoform expression level estimation from biased RNA-Seq reads.

Authors:  Wei Li; Tao Jiang
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

4.  Biases in Illumina transcriptome sequencing caused by random hexamer priming.

Authors:  Kasper D Hansen; Steven E Brenner; Sandrine Dudoit
Journal:  Nucleic Acids Res       Date:  2010-04-14       Impact factor: 16.971

5.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

6.  Oestrogen receptors interact with the α-catalytic subunit of AMP-activated protein kinase.

Authors:  Yulia Lipovka; Hao Chen; Josef Vagner; Theodore J Price; Tsu-Shuen Tsao; John P Konhilas
Journal:  Biosci Rep       Date:  2015-09-15       Impact factor: 3.840

7.  Bayesian transcriptome assembly.

Authors:  Lasse Maretty; Jonas Andreas Sibbesen; Anders Krogh
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

8.  iReckon: simultaneous isoform discovery and abundance estimation from RNA-seq data.

Authors:  Aziz M Mezlini; Eric J M Smith; Marc Fiume; Orion Buske; Gleb L Savich; Sohrab Shah; Sam Aparicio; Derek Y Chiang; Anna Goldenberg; Michael Brudno
Journal:  Genome Res       Date:  2012-11-29       Impact factor: 9.043

9.  The UCSC Genome Browser database: 2014 update.

Authors:  Donna Karolchik; Galt P Barber; Jonathan Casper; Hiram Clawson; Melissa S Cline; Mark Diekhans; Timothy R Dreszer; Pauline A Fujita; Luvina Guruvadoo; Maximilian Haeussler; Rachel A Harte; Steve Heitner; Angie S Hinrichs; Katrina Learned; Brian T Lee; Chin H Li; Brian J Raney; Brooke Rhead; Kate R Rosenbloom; Cricket A Sloan; Matthew L Speir; Ann S Zweig; David Haussler; Robert M Kuhn; W James Kent
Journal:  Nucleic Acids Res       Date:  2013-11-21       Impact factor: 16.971

10.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions.

Authors:  Daehwan Kim; Geo Pertea; Cole Trapnell; Harold Pimentel; Ryan Kelley; Steven L Salzberg
Journal:  Genome Biol       Date:  2013-04-25       Impact factor: 13.583

View more
  3 in total

1.  IntAPT: integrated assembly of phenotype-specific transcripts from multiple RNA-seq profiles.

Authors:  Xu Shi; Andrew F Neuwald; Xiao Wang; Tian-Li Wang; Leena Hilakivi-Clarke; Robert Clarke; Jianhua Xuan
Journal:  Bioinformatics       Date:  2021-05-05       Impact factor: 6.937

2.  SAUTE: sequence assembly using target enrichment.

Authors:  Alexandre Souvorov; Richa Agarwala
Journal:  BMC Bioinformatics       Date:  2021-07-21       Impact factor: 3.169

3.  Anti-bias training for (sc)RNA-seq: experimental and computational approaches to improve precision.

Authors:  Philip Davies; Matt Jones; Juntai Liu; Daniel Hebenstreit
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.