Literature DB >> 28133637

The Selection of Quantification Pipelines for Illumina RNA-seq Data Using a Subsampling Approach.

Po-Yen Wu, May D Wang.   

Abstract

RNA sequencing, or (RNA-seq for short,, is a widely applied technology that for extractings gene and transcript expression from biological samples. Given numerous quantification pipelines for RNA-seq data, one fundamental challenge is to determine identify a pipeline that can produce the most accurate estimate the most accurate gene and/or transcript expression. Exploring all available pipelines requires tremendous extensive computational resources, so. Therefore, we propose to use a subsampling approach that can improve speed up the pipeline evaluation and selection the efficiency process of pipeline performance evaluation for a given RNA-seq dataset. We applied our approach to one simulated and two real RNA-seq datasets and found that expression estimates derived from subsampled data are close surrogates for those derived from original data. In addition, the ranking of quantification pipelines based on the subsampled data was highly correlated concordant with that based on the original data. Therefore, we conclude that subsampling is a valid approach to facilitating efficient quantification pipeline selection using RNA-seq data.

Entities:  

Year:  2016        PMID: 28133637      PMCID: PMC5267345          DOI: 10.1109/BHI.2016.7455839

Source DB:  PubMed          Journal:  IEEE EMBS Int Conf Biomed Health Inform        ISSN: 2641-3590


  17 in total

1.  Systematic comparison of three genomic enrichment methods for massively parallel DNA sequencing.

Authors:  Jamie K Teer; Lori L Bonnycastle; Peter S Chines; Nancy F Hansen; Natsuyo Aoyama; Amy J Swift; Hatice Ozel Abaan; Thomas J Albert; Elliott H Margulies; Eric D Green; Francis S Collins; James C Mullikin; Leslie G Biesecker
Journal:  Genome Res       Date:  2010-09-01       Impact factor: 9.043

2.  Analysis and design of RNA sequencing experiments for identifying isoform regulation.

Authors:  Yarden Katz; Eric T Wang; Edoardo M Airoldi; Christopher B Burge
Journal:  Nat Methods       Date:  2010-11-07       Impact factor: 28.547

3.  Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads.

Authors:  Ernest Turro; Shu-Yi Su; Ângela Gonçalves; Lachlan J M Coin; Sylvia Richardson; Alex Lewin
Journal:  Genome Biol       Date:  2011-02-10       Impact factor: 13.583

4.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

5.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Authors:  Cole Trapnell; Brian A Williams; Geo Pertea; Ali Mortazavi; Gordon Kwan; Marijke J van Baren; Steven L Salzberg; Barbara J Wold; Lior Pachter
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

6.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

7.  Streaming fragment assignment for real-time analysis of sequencing experiments.

Authors:  Adam Roberts; Lior Pachter
Journal:  Nat Methods       Date:  2012-11-18       Impact factor: 28.547

8.  A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium.

Authors: 
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

9.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

10.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

View more
  2 in total

1.  RATEmiRs: the rat atlas of tissue-specific and enriched miRNAs for discerning baseline expression exclusivity of candidate biomarkers.

Authors:  Pierre R Bushel; Florian Caiment; Han Wu; Raegan O'Lone; Frank Day; John Calley; Aaron Smith; Jianying Li; Alison H Harrill
Journal:  RNA Biol       Date:  2020-02-12       Impact factor: 4.652

2.  Online Decentralized Leverage Score Sampling for Streaming Multidimensional Time Series.

Authors:  Rui Xie; Zengyan Wang; Shuyang Bai; Ping Ma; Wenxuan Zhong
Journal:  Proc Mach Learn Res       Date:  2019-04
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.