Literature DB >> 29703885

Bayesian nonparametric discovery of isoforms and individual specific quantification.

Derek Aguiar1, Li-Fang Cheng2, Bianca Dumitrascu3, Fantine Mordelet4, Athma A Pai5,6, Barbara E Engelhardt7,8.   

Abstract

Most human protein-coding genes can be transcribed into multiple distinct mRNA isoforms. These alternative splicing patterns encourage molecular diversity, and dysregulation of isoform expression plays an important role in disease etiology. However, isoforms are difficult to characterize from short-read RNA-seq data because they share identical subsequences and occur in different frequencies across tissues and samples. Here, we develop BIISQ, a Bayesian nonparametric model for isoform discovery and individual specific quantification from short-read RNA-seq data. BIISQ does not require isoform reference sequences but instead estimates an isoform catalog shared across samples. We use stochastic variational inference for efficient posterior estimates and demonstrate superior precision and recall for simulations compared to state-of-the-art isoform reconstruction methods. BIISQ shows the most gains for low abundance isoforms, with 36% more isoforms correctly inferred at low coverage versus a multi-sample method and 170% more versus single-sample methods. We estimate isoforms in the GEUVADIS RNA-seq data and validate inferred isoforms by associating genetic variants with isoform ratios.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29703885      PMCID: PMC5923247          DOI: 10.1038/s41467-018-03402-w

Source DB:  PubMed          Journal:  Nat Commun        ISSN: 2041-1723            Impact factor:   14.919


  64 in total

Review 1.  Pre-mRNA splicing and human disease.

Authors:  Nuno André Faustino; Thomas A Cooper
Journal:  Genes Dev       Date:  2003-02-15       Impact factor: 11.361

Review 2.  Transcranial magnetic stimulation in neurology.

Authors:  Masahito Kobayashi; Alvaro Pascual-Leone
Journal:  Lancet Neurol       Date:  2003-03       Impact factor: 44.182

3.  De novo assembly and analysis of RNA-seq data.

Authors:  Gordon Robertson; Jacqueline Schein; Readman Chiu; Richard Corbett; Matthew Field; Shaun D Jackman; Karen Mungall; Sam Lee; Hisanaga Mark Okada; Jenny Q Qian; Malachi Griffith; Anthony Raymond; Nina Thiessen; Timothee Cezard; Yaron S Butterfield; Richard Newsome; Simon K Chan; Rong She; Richard Varhol; Baljit Kamoh; Anna-Liisa Prabhu; Angela Tam; YongJun Zhao; Richard A Moore; Martin Hirst; Marco A Marra; Steven J M Jones; Pamela A Hoodless; Inanc Birol
Journal:  Nat Methods       Date:  2010-10-10       Impact factor: 28.547

4.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

Review 5.  The role of regulatory variation in complex traits and disease.

Authors:  Frank W Albert; Leonid Kruglyak
Journal:  Nat Rev Genet       Date:  2015-02-24       Impact factor: 53.242

6.  Analysis and design of RNA sequencing experiments for identifying isoform regulation.

Authors:  Yarden Katz; Eric T Wang; Edoardo M Airoldi; Christopher B Burge
Journal:  Nat Methods       Date:  2010-11-07       Impact factor: 28.547

7.  Identification of alternative splicing markers for breast cancer.

Authors:  Julian P Venables; Roscoe Klinck; Anne Bramard; Lyna Inkel; Geneviève Dufresne-Martin; ChuShin Koh; Julien Gervais-Bird; Elvy Lapointe; Ulrike Froehlich; Mathieu Durand; Daniel Gendron; Jean-Philippe Brosseau; Philippe Thibault; Jean-Francois Lucier; Karine Tremblay; Panagiotis Prinos; Raymund J Wellinger; Benoit Chabot; Claudine Rancourt; Sherif Abou Elela
Journal:  Cancer Res       Date:  2008-11-15       Impact factor: 12.701

8.  Characterizing the genetic basis of transcriptome diversity through RNA-sequencing of 922 individuals.

Authors:  Alexis Battle; Sara Mostafavi; Xiaowei Zhu; James B Potash; Myrna M Weissman; Courtney McCormick; Christian D Haudenschild; Kenneth B Beckman; Jianxin Shi; Rui Mei; Alexander E Urban; Stephen B Montgomery; Douglas F Levinson; Daphne Koller
Journal:  Genome Res       Date:  2013-10-03       Impact factor: 9.043

9.  Annotation-free quantification of RNA splicing using LeafCutter.

Authors:  Yang I Li; David A Knowles; Jack Humphrey; Alvaro N Barbeira; Scott P Dickinson; Hae Kyung Im; Jonathan K Pritchard
Journal:  Nat Genet       Date:  2017-12-11       Impact factor: 38.330

10.  Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data.

Authors:  Jacob F Degner; John C Marioni; Athma A Pai; Joseph K Pickrell; Everlyne Nkadori; Yoav Gilad; Jonathan K Pritchard
Journal:  Bioinformatics       Date:  2009-10-06       Impact factor: 6.937

View more
  2 in total

1.  IntAPT: integrated assembly of phenotype-specific transcripts from multiple RNA-seq profiles.

Authors:  Xu Shi; Andrew F Neuwald; Xiao Wang; Tian-Li Wang; Leena Hilakivi-Clarke; Robert Clarke; Jianhua Xuan
Journal:  Bioinformatics       Date:  2021-05-05       Impact factor: 6.937

2.  Partitioning RNAs by length improves transcriptome reconstruction from short-read RNA-seq data.

Authors:  Francisca Rojas Ringeling; Shounak Chakraborty; Caroline Vissers; Derek Reiman; Akshay M Patel; Ki-Heon Lee; Ari Hong; Chan-Woo Park; Tim Reska; Julien Gagneur; Hyeshik Chang; Maria L Spletter; Ki-Jun Yoon; Guo-Li Ming; Hongjun Song; Stefan Canzar
Journal:  Nat Biotechnol       Date:  2022-01-10       Impact factor: 68.164

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.