Using non-uniform read distribution models to improve isoform expression inference in RNA-Seq.

Zhengpeng Wu, Xi Wang, Xuegong Zhang.

Abstract

MOTIVATION: RNA-Seq technology based on next-generation sequencing provides an unprecedented ability to study transcriptomes at high resolution and accuracy, and the potential to measure the expression of multiple isoforms from the same gene with high precision. Isoform expression can be inferred from RNA-Seq data by maximum likelihood estimation, using statistical models that assume sequenced reads are distributed uniformly along transcripts. The model needs to be modified for situations where RNA-Seq reads do not follow a uniform distribution.
RESULTS: We proposed two kinds of curves, the global bias curve (GBC) and the local bias curves (LBCs), to describe the non-uniformity of read distributions for all genes in a transcriptome and for each gene, respectively. Incorporating the bias curves into the uniform read distribution (URD) model, we introduced non-URD (N-URD) models to infer isoform expression levels. In a series of systematic simulation studies, the proposed models outperform the original model in recovering major isoforms and the expression ratio of alternative isoforms. We also applied the new model to real RNA-Seq datasets and found that its inferences on expression ratios of alternative isoforms are more reasonable. The experiments indicate that incorporating N-URD information can improve the accuracy of modeling and inferring isoform expression in RNA-Seq.

Year:  2010        PMID: 21169371     DOI: 10.1093/bioinformatics/btq696

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  45 in total

1.  Modeling RNA degradation for RNA-Seq with applications.

Authors:  Lin Wan; Xiting Yan; Ting Chen; Fengzhu Sun
Journal:  Biostatistics       Date:  2012-02-21       Impact factor: 5.899

2.  CEDER: accurate detection of differentially expressed genes by combining significance of exons using RNA-Seq.

Authors:  Lin Wan; Fengzhu Sun
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2012 Sep-Oct       Impact factor: 3.710

3.  WemIQ: an accurate and robust isoform quantification method for RNA-seq data.

Authors:  Jing Zhang; C-C Jay Kuo; Liang Chen
Journal:  Bioinformatics       Date:  2014-11-17       Impact factor: 6.937

4.  Clustering of mRNA-Seq data based on alternative splicing patterns.

Authors:  Marla Johnson; Elizabeth Purdom
Journal:  Biostatistics       Date:  2017-04-01       Impact factor: 5.899

5.  Transcriptome assembly and isoform expression level estimation from biased RNA-Seq reads.

Authors:  Wei Li; Tao Jiang
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

6.  A robust method for transcript quantification with RNA-seq data.

Authors:  Yan Huang; Yin Hu; Corbin D Jones; James N MacLeod; Derek Y Chiang; Yufeng Liu; Jan F Prins; Jinze Liu
Journal:  J Comput Biol       Date:  2013-03       Impact factor: 1.479

7.  Cyc17, a meiosis-specific cyclin, is essential for anaphase initiation and chromosome segregation in Tetrahymena thermophila.

Authors:  Guan-Xiong Yan; Huai Dang; Miao Tian; Jing Zhang; Anura Shodhan; Ying-Zhi Ning; Jie Xiong; Wei Miao
Journal:  Cell Cycle       Date:  2016-05-18       Impact factor: 4.534

8.  E2fl1 is a meiosis-specific transcription factor in the protist Tetrahymena thermophila.

Authors:  Jing Zhang; Miao Tian; Guan-Xiong Yan; Anura Shodhan; Wei Miao
Journal:  Cell Cycle       Date:  2016-11-28       Impact factor: 4.534

9.  SparseIso: a novel Bayesian approach to identify alternatively spliced isoforms from RNA-seq data.

Authors:  Xu Shi; Xiao Wang; Tian-Li Wang; Leena Hilakivi-Clarke; Robert Clarke; Jianhua Xuan
Journal:  Bioinformatics       Date:  2018-01-01       Impact factor: 6.937

10.  An Enumerative Combinatorics Model for Fragmentation Patterns in RNA Sequencing Provides Insights into Nonuniformity of the Expected Fragment Starting-Point and Coverage Profile.

Authors:  Celine Prakash; Arndt Von Haeseler
Journal:  J Comput Biol       Date:  2016-09-23       Impact factor: 1.479
