| Literature DB >> 27107712 |
Mingxiang Teng1,2,3, Michael I Love1,2, Carrie A Davis4, Sarah Djebali5, Alexander Dobin4, Brenton R Graveley6, Sheng Li7, Christopher E Mason7, Sara Olson6, Dmitri Pervouchine5, Cricket A Sloan8, Xintao Wei6, Lijun Zhan6, Rafael A Irizarry9,10.
Abstract
Obtaining RNA-seq measurements involves a complex data analytical process with a large number of competing algorithms as options. There is much debate about which of these methods provides the best approach. Unfortunately, it is currently difficult to evaluate their performance due in part to a lack of sensitive assessment metrics. We present a series of statistical summaries and plots to evaluate the performance in terms of specificity and sensitivity, available as a R/Bioconductor package ( http://bioconductor.org/packages/rnaseqcomp ). Using two independent datasets, we assessed seven competing pipelines. Performance was generally poor, with two methods clearly underperforming and RSEM slightly outperforming the rest.Entities:
Mesh:
Year: 2016 PMID: 27107712 PMCID: PMC4842274 DOI: 10.1186/s13059-016-0940-1
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Estimated log fold changes stratified by transcript abundance on simulation dataset. One example based on Cufflinks quantification of two samples is shown here. Black points are non-differential transcripts; blue points are differentially expressed transcripts which were simulated to have signals on both samples; red points are differentially expressed transcripts which were simulated to have signals in only one of the samples
Fig. 2Distribution of reported transcript quantifications on one sample of simulation dataset a before and b after rescaling. Seven quantification methods are shown here
Fig. 3Standard deviations of transcript quantifications based on a an experimental dataset (GM12878) and b a simulation dataset (one of the cell lines). Seven quantification methods are shown here
Summarized metrics for analyzed pipelines based on an experimental dataset
| Method | SD low | SD medium | SD high | NE (K = 1) | NN (K = 1) | TxDiff low | TxDiff medium | TxDiff high | deFC low | deFC medium | deFC high | pAUC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cufflinks | 0.62 (0.002) | 0.26 (0.001) | 0.12 (0.000) | 0.08 | 0.70 | 0.31 (0.007) | 0.08 (0.002) | 0.03 (0.001) | 2.65 (0.022) | 2.25 (0.047) | 1.01 (0.024) | 0.77 |
| eXpress | 0.75 (0.003) | 0.37 (0.002) | 0.13 (0.001) | 0.05 | 0.80 | 0.44 (0.008) | 0.05 (0.002) | 0.01 (0.000) | 1.93 (0.026) | 2.56 (0.058) | 1.20 (0.028) | 0.68 |
| Flux Capacitor | 0.62 (0.003) | 0.57 (0.003) | 0.18 (0.001) | 0.10 | 0.73 | 0.42 (0.008) | 0.15 (0.004) | 0.07 (0.003) | 2.62 (0.024) | 2.40 (0.050) | 1.01 (0.025) | 0.75 |
| kallisto | 0.53 (0.002) | 0.24 (0.001) | 0.12 (0.000) | 0.09 | 0.64 | 0.28 (0.007) | 0.08 (0.002) | 0.03 (0.0001 | 2.36 (0.024) | 2.06 (0.045) | 1.03 (0.024) | 0.76 |
| RSEM | 0.54 (0.002) | 0.22 (0.001) | 0.11 (0.000) | 0.06 | 0.73 | 0.39 (0.008) | 0.07 (0.002) | 0.02 (0.001) | 2.72 (0.022) | 2.22 (0.048) | 1.03 (0.026) | 0.78 |
| Sailfish | 0.46 (0.002) | 0.25 (0.001) | 0.13 (0.000) | 0.08 | 0.60 | 0.27 (0.006) | 0.08 (0.002) | 0.04 (0.001) | 2.30 (0.023) | 2.08 (0.044) | 0.97 (0.022) | 0.77 |
| Salmon | 0.46 (0.002) | 0.23 (0.001) | 0.12 (0.000) | 0.08 | 0.65 | 0.29 (0.007) | 0.07 (0.002) | 0.04 (0.001) | 2.30 (0.024) | 2.06 (0.045) | 1.03 (0.022) | 0.77 |
Metrics for single cell lines are averaged for both cell lines, except standard deviation is the square root of average squares. Columns 2–4 shows median standard deviation on three transcript abundance levels; column 5 shows proportions of discordant calls when K = 1; column 6 shows proportions of both non-expressed when K = 1; columns 7–9 show the mean proportion differences of transcripts in genes only having two annotated transcripts based on three transcript abundance levels; columns 10–12 show median log fold changes of true differentially expressed genes based on three abundance levels; column 13 shows standardized partial area under the curve for differential expression of genes. pAUC partial area under the receiver operating characteristic curve
Summarized metrics for analyzed pipelines based on a simulation dataset
| Method | SD low | SD medium | SD high | NE (K = 1) | NN (K = 1) | TxDiff low | TxDiff medium | TxDiff high | deFC low | deFC medium | deFC high | pAUC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cufflinks | 0.73 (0.002) | 0.54 (0.003) | 0.26 (0.001) | 0.090 | 0.657 | 0.34 (0.011) | 0.08 (0.003) | 0.05 (0.003) | 0.53 (0.009) | 0.95 (0.006) | 0.98 (0.003) | 0.61 |
| eXpress | 0.71 (0.003) | 0.67 (0.004) | 0.30 (0.001) | 0.09 | 0.68 | 0.33 (0.009) | 0.07 (0.003) | 0.07 (0.003) | 0.47 (0.011) | 0.87 (0.015) | 0.91 (0.012) | 0.60 |
| Flux Capacitor | 1.03 (0.004) | 1.23 (0.007) | 0.40 (0.002) | 0.15 | 0.63 | 0.46 (0.013) | 0.15 (0.006) | 0.07 (0.004) | 0.39 (0.011) | 0.82 (0.013) | 0.97 (0.009) | 0.52 |
| Kallisto | 0.72 (0.003) | 0.55 (0.003) | 0.27 (0.001) | 0.10 | 0.63 | 0.37 (0.011) | 0.08 (0.004) | 0.05 (0.003) | 0.56 (0.008) | 0.95 (0.006) | 0.98 (0.002) | 0.58 |
| RSEM | 0.65 (0.002) | 0.48 (0.003) | 0.25 (0.001) | 0.08 | 0.69 | 0.43 (0.011) | 0.07 (0.004) | 0.04 (0.003) | 0.58 (0.008) | 0.96 (0.006) | 1.00 (0.003) | 0.65 |
| Sailfish | 0.76 (0.003) | 0.65 (0.004) | 0.30 (0.001) | 0.11 | 0.57 | 0.34 (0.009) | 0.08 (0.004) | 0.05 (0.003) | 0.52 (0.011) | 0.94 (0.011) | 0.96 (0.006) | 0.56 |
| Salmon | 0.64 (0.002) | 0.52 (0.003) | 0.26 (0.001) | 0.09 | 0.67 | 0.35 (0.010) | 0.08 (0.004) | 0.05 (0.003) | 0.54 (0.008) | 0.95 (0.007) | 1.00 (0.003) | 0.61 |
The last four columns are based on differential expression of transcripts. pAUC partial area under the receiver operating characteristic curve
Fig. 4Proportions of discordant expression calls based on a an experimental dataset (GM12878) and b a simulation dataset (one of the cell lines). Seven quantification methods are shown here
Fig. 5Proportion differences of transcript quantifications in genes with only two annotated transcripts based on a an experimental dataset (GM12878) and b a simulation dataset (one of the cell lines). Seven quantification methods are shown
Fig. 6ROC curves indicating performance of quantification methods based on differential expression analysis of a an experimental dataset and b a simulation dataset. Seven quantification methods are shown. FP false positive, TP true positive