| Literature DB >> 28984183 |
Yuanhang Liu1,2, Ping Wu3, Jingqi Zhou1,4, Teresa L Johnson-Pais3, Zhao Lai1, Wasim H Chowdhury3, Ronald Rodriguez3, Yidong Chen5,6.
Abstract
BACKGROUND: RNA sequencing (RNA-seq) is a high throughput technology that profiles gene expression in a genome-wide manner. RNA-seq has been mainly used for testing differential expression (DE) of transcripts between two conditions and has recently been used for testing differential alternative polyadenylation (APA). In the past, many algorithms have been developed for detecting differentially expressed genes (DEGs) from RNA-seq experiments, including the one we developed, XBSeq, which paid special attention to the context-specific background noise that is ignored in conventional gene expression quantification and DE analysis of RNA-seq data.Entities:
Keywords: Alternative polyadenylation; Differential expression analysis; RNA-seq; XBSeq; XBSeq2
Mesh:
Year: 2017 PMID: 28984183 PMCID: PMC5629564 DOI: 10.1186/s12859-017-1803-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1ROC curves of different methods under various levels of background noise. ROC curves of DESeq2, edgeR, XBSeq, XBSeq2 under low, intermediate or high level of background noise (a); ROC curves of different methods but only with highly expressed genes (genes above 75% quantile of expression intensity) (b); ROC curves of different methods but only with genes expressed at low levels (genes below 25% quantile of expression intensity) (c); Simulations were carried out 100 times and average AUC were used. Dataset with 3 number of replicates per condition, 10% DEGs with 1.5-fold change was used
Fig. 2False discovery curves different methods under various levels of background noise. False discovery curves of DESeq2, edgeR, XBSeq, XBSeq2 under low, intermediate or high level of background noise (a); False discovery curves of different methods but only with highly expressed genes (genes above 75% quantile of expression intensity) (b); False discovery curves of different methods but only with genes expressed at low levels (genes below 25% quantile of expression intensity) (c); Simulations were carried out 100 times and average number of false discoveries were used. Dataset with 3 number of replicates per condition, 10% DEGs with 1.5-fold change was used
Fig. 3Statistical power of different methods under various levels of background noise. Bar chart of statistical power for DESeq2, edgeR, XBSeq, XBSeq2 under low, intermediate or high level of background noise (a); Bar chart of statistical power for different methods but only with highly expressed genes (genes above 75% quantile of expression intensity) (b); Bar chart of statistical power for different methods but only with genes expressed at low levels (genes below 25% quantile of expression intensity) (c); Simulations were carried out 100 times and average statistical power were used. Dataset with 3 number of replicates per condition, 10% DEGs with 1.5-fold change was used
Fig. 4Benchmark of different methods under low level of background noise. Benchmark of DESeq2, edgeR, XBSeq, XBSeq2 in terms of computation time (a); and total number of computational memory allocated (b). Methods were benchmarked with datasets of 3, 5, or 10 number of replicates per condition, 10% DEGs with 1.5-fold change. Benchmark procedure was carried out under MacBook Pro, 2.7 GHz Intel Core i5, 8 GB 1867 MHz DDR3