Literature DB >> 25926345

Polyester: simulating RNA-seq datasets with differential transcript expression.

Alyssa C Frazee1, Andrew E Jaffe2, Ben Langmead3, Jeffrey T Leek1.   

Abstract

MOTIVATION: Statistical methods development for differential expression analysis of RNA sequencing (RNA-seq) requires software tools to assess accuracy and error rate control. Since true differential expression status is often unknown in experimental datasets, artificially constructed datasets must be utilized, either by generating costly spike-in experiments or by simulating RNA-seq data.
RESULTS: Polyester is an R package designed to simulate RNA-seq data, beginning with an experimental design and ending with collections of RNA-seq reads. Its main advantage is the ability to simulate reads indicating isoform-level differential expression across biological replicates for a variety of experimental designs. Data generated by Polyester is a reasonable approximation to real RNA-seq data and standard differential expression workflows can recover differential expression set in the simulation by the user.
AVAILABILITY AND IMPLEMENTATION: Polyester is freely available from Bioconductor (http://bioconductor.org/). CONTACT: jtleek@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2015        PMID: 25926345      PMCID: PMC4635655          DOI: 10.1093/bioinformatics/btv272

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

Review 1.  Design and validation issues in RNA-seq experiments.

Authors:  Zhide Fang; Xiangqin Cui
Journal:  Brief Bioinform       Date:  2011-04-15       Impact factor: 11.622

2.  Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM).

Authors:  Gregory R Grant; Michael H Farkas; Angel D Pizarro; Nicholas F Lahens; Jonathan Schug; Brian P Brunk; Christian J Stoeckert; John B Hogenesch; Eric A Pierce
Journal:  Bioinformatics       Date:  2011-07-19       Impact factor: 6.937

3.  Transcriptome assembly and isoform expression level estimation from biased RNA-Seq reads.

Authors:  Wei Li; Tao Jiang
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

4.  GemSIM: general, error-model based simulator of next-generation sequencing data.

Authors:  Kerensa E McElroy; Fabio Luciani; Torsten Thomas
Journal:  BMC Genomics       Date:  2012-02-15       Impact factor: 3.969

5.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

6.  Single read and paired end mRNA-Seq Illumina libraries from 10 nanograms total RNA.

Authors:  Srikumar Sengupta; Jennifer M Bolin; Victor Ruotti; Bao Kim Nguyen; James A Thomson; Angela L Elwell; Ron Stewart
Journal:  J Vis Exp       Date:  2011-10-27       Impact factor: 1.355

7.  Removing technical variability in RNA-seq data using conditional quantile normalization.

Authors:  Kasper D Hansen; Rafael A Irizarry; Zhijin Wu
Journal:  Biostatistics       Date:  2012-01-27       Impact factor: 5.899

8.  Summarizing and correcting the GC content bias in high-throughput sequencing.

Authors:  Yuval Benjamini; Terence P Speed
Journal:  Nucleic Acids Res       Date:  2012-02-09       Impact factor: 16.971

9.  GC-content normalization for RNA-Seq data.

Authors:  Davide Risso; Katja Schwartz; Gavin Sherlock; Sandrine Dudoit
Journal:  BMC Bioinformatics       Date:  2011-12-17       Impact factor: 3.169

10.  Modelling and simulating generic RNA-Seq experiments with the flux simulator.

Authors:  Thasso Griebel; Benedikt Zacher; Paolo Ribeca; Emanuele Raineri; Vincent Lacroix; Roderic Guigó; Michael Sammeth
Journal:  Nucleic Acids Res       Date:  2012-09-07       Impact factor: 16.971

View more
  102 in total

1.  SeqNet: An R Package for Generating Gene-Gene Networks and Simulating RNA-Seq Data.

Authors:  Tyler Grimes; Somnath Datta
Journal:  J Stat Softw       Date:  2021-07-10       Impact factor: 6.440

2.  IntAPT: integrated assembly of phenotype-specific transcripts from multiple RNA-seq profiles.

Authors:  Xu Shi; Andrew F Neuwald; Xiao Wang; Tian-Li Wang; Leena Hilakivi-Clarke; Robert Clarke; Jianhua Xuan
Journal:  Bioinformatics       Date:  2021-05-05       Impact factor: 6.937

3.  A multi-omics data simulator for complex disease studies and its application to evaluate multi-omics data analysis methods for disease classification.

Authors:  Ren-Hua Chung; Chen-Yu Kang
Journal:  Gigascience       Date:  2019-05-01       Impact factor: 6.524

4.  Smooth quantile normalization.

Authors:  Stephanie C Hicks; Kwame Okrah; Joseph N Paulson; John Quackenbush; Rafael A Irizarry; Héctor Corrada Bravo
Journal:  Biostatistics       Date:  2018-04-01       Impact factor: 5.899

5.  Hierarchical analysis of RNA-seq reads improves the accuracy of allele-specific expression.

Authors:  Narayanan Raghupathy; Kwangbom Choi; Matthew J Vincent; Glen L Beane; Keith S Sheppard; Steven C Munger; Ron Korstanje; Fernando Pardo-Manual de Villena; Gary A Churchill
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

6.  Flexible expressed region analysis for RNA-seq with derfinder.

Authors:  Leonardo Collado-Torres; Abhinav Nellore; Alyssa C Frazee; Christopher Wilks; Michael I Love; Ben Langmead; Rafael A Irizarry; Jeffrey T Leek; Andrew E Jaffe
Journal:  Nucleic Acids Res       Date:  2016-09-29       Impact factor: 16.971

7.  Nonparametric expression analysis using inferential replicate counts.

Authors:  Anqi Zhu; Avi Srivastava; Joseph G Ibrahim; Rob Patro; Michael I Love
Journal:  Nucleic Acids Res       Date:  2019-10-10       Impact factor: 16.971

8.  Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification.

Authors:  Michael I Love; Charlotte Soneson; Rob Patro
Journal:  F1000Res       Date:  2018-06-27

9.  Detecting, Categorizing, and Correcting Coverage Anomalies of RNA-Seq Quantification.

Authors:  Cong Ma; Carl Kingsford
Journal:  Cell Syst       Date:  2019-11-27       Impact factor: 10.304

10.  Splice Expression Variation Analysis (SEVA) for inter-tumor heterogeneity of gene isoform usage in cancer.

Authors:  Bahman Afsari; Theresa Guo; Michael Considine; Liliana Florea; Luciane T Kagohara; Genevieve L Stein-O'Brien; Dylan Kelley; Emily Flam; Kristina D Zambo; Patrick K Ha; Donald Geman; Michael F Ochs; Joseph A Califano; Daria A Gaykalova; Alexander V Favorov; Elana J Fertig
Journal:  Bioinformatics       Date:  2018-06-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.