A penalized likelihood approach for robust estimation of isoform expression.

Hui Jiang, Julia Salzman.

Abstract

Ultra high-throughput sequencing of transcriptomes (RNA-Seq) has enabled the accurate estimation of gene expression at the individual isoform level. However, systematic biases introduced during the sequencing and mapping processes, as well as incompleteness of the transcript annotation databases, may cause the estimates of isoform abundances to be unreliable and, in some cases, highly inaccurate. This paper introduces a penalized likelihood approach to detect and correct for such biases in a robust manner. Our model extends those previously proposed by introducing bias parameters for reads. An L1 penalty is used for the selection of non-zero bias parameters. We introduce an efficient algorithm for model fitting and analyze the statistical properties of the proposed model. Our experimental studies on both simulated and real datasets suggest that the model has the potential to improve isoform-specific gene expression estimates and identify incompletely annotated gene models.
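The abstract does not spell out the model, so the following is only a minimal sketch of the general idea, under assumptions that are not taken from the paper: read counts n_j in category j (for example, an exon or junction) are Poisson with rate exp(b_j) * (A theta)_j, where A is a hypothetical category-by-isoform sampling-rate matrix, theta holds the isoform abundances, and b_j is a per-category bias parameter penalized by lam * |b_j|. The alternating scheme below (one EM step for theta, then a closed-form thresholded update for b) is an illustrative choice, not the authors' algorithm, and the names `fit` and `objective` are hypothetical.

```python
import numpy as np

def objective(theta, b, A, n, lam):
    """Penalized negative Poisson log-likelihood (constant log n! dropped)."""
    mu = np.exp(b) * (A @ theta)
    return float(np.sum(mu - n * np.log(mu)) + lam * np.sum(np.abs(b)))

def fit(A, n, lam, n_iter=200):
    """Alternate between abundances theta and L1-penalized biases b.

    Assumed model (illustrative only): n_j ~ Poisson(exp(b_j) * (A @ theta)_j).
    """
    n_cat, n_iso = A.shape
    theta = np.ones(n_iso)
    b = np.zeros(n_cat)
    history = [objective(theta, b, A, n, lam)]
    for _ in range(n_iter):
        # One EM (multiplicative) step for theta with the biases held fixed.
        w = np.exp(b)
        m = A @ theta
        theta = theta * (A.T @ (n / m)) / (A.T @ w)
        # Exact minimizer over each b_j of m_j*exp(b_j) - n_j*b_j + lam*|b_j|:
        # b_j stays at zero unless the count differs from m_j by more than lam.
        m = A @ theta
        up = np.log(np.maximum(n - lam, 1e-300) / m)
        down = np.log((n + lam) / m)
        b = np.where(n - m > lam, up, np.where(m - n > lam, down, 0.0))
        history.append(objective(theta, b, A, n, lam))
    return theta, b, history

# Toy data: two isoforms, six read categories; the last count is inflated
# fivefold to mimic a category affected by bias or incomplete annotation.
A = np.array([[1, 0], [1, 0], [0, 1], [0, 1], [1, 1], [1, 1]], dtype=float)
n = np.array([10, 10, 5, 5, 15, 75], dtype=float)
theta, b, history = fit(A, n, lam=2.0)
print("theta:", theta)
print("bias:", b)
```

In this toy setup the L1 penalty plays the role described in the abstract: a category receives a non-zero bias only when its count departs from the fitted rate by more than `lam`, so larger values of `lam` select fewer biased categories.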

Keywords:  Isoform expression; Penalized likelihood; RNA-Seq; Robust estimation

Year: 2015; PMID: 27239250; PMCID: PMC4879778; DOI: 10.4310/SII.2015.v8.n4.a3

Source DB: PubMed; Journal: Stat Interface; ISSN: 1938-7989; Impact factor: 0.582


