| Literature DB >> 29028903 |
Daniel Spies1,2, Peter F Renz1,2, Tobias A Beyer1, Constance Ciaudo1.
Abstract
RNA sequencing (RNA-seq) has become a standard procedure to investigate transcriptional changes between conditions and is routinely used in research and clinics. While standard differential expression (DE) analysis between two conditions has been extensively studied, and improved over the past decades, RNA-seq time course (TC) DE analysis algorithms are still in their early stages. In this study, we compare, for the first time, existing TC RNA-seq tools on an extensive simulation data set and validated the best performing tools on published data. Surprisingly, TC tools were outperformed by the classical pairwise comparison approach on short time series (<8 time points) in terms of overall performance and robustness to noise, mostly because of high number of false positives, with the exception of ImpulseDE2. Overlapping of candidate lists between tools improved this shortcoming, as the majority of false-positive, but not true-positive, candidates were unique for each method. On longer time series, pairwise approach was less efficient on the overall performance compared with splineTC and maSigPro, which did not identify any false-positive candidate.Entities:
Mesh:
Year: 2019 PMID: 29028903 PMCID: PMC6357553 DOI: 10.1093/bib/bbx115
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Properties of available TC analysis tools
| Method | Normalization method | Model | DEG test | Uneven sampling allowed | Isoforms | Clustering | Time | Citation |
|---|---|---|---|---|---|---|---|---|
| DyNB | Variance estimation+ scaling factors on GP | NB | ML | Yes | No | No | Days | [ |
| EBSeq-HMM | Median/quantile | Beta NB+AR-HMM | EB | Yes | Yes | Yes | Minutes | [ |
| FunPat | – | γ/logNorm/Weibull | Bounded Area | No | No | Yes | Seconds | [ |
| ImpulseDE2 | – | NB+impulse model | LLR | Yes | No | No | Minutes | [ |
| lmms | – | lmms | LLR | Yes | No | Yes | Minutes | [ |
| Next maSigPro | – | NB+PR | LLR | No | No | Yes | Minutes | [ |
| nsgp | – | Nonstationary/stati GP | ML by grad | No | No | No | Days | [ |
| splineTC | – | Spline regression | Moderate | No | No | No | Seconds | [ |
| timeSeq | Via edgeR | NBMM | Kullback–Leibler distance ratio | Yes | Exon level | No | Days | [ |
aNB model,
bGP,
cML,
dMCMC,
eAR-HMM,
fempirical Bayesian method,
glog likelihood ratio,
hlinear mixed model splines,
ipolynomial regression,
jgradient descent,
kHastings-Monte-Carlo no U-turn sampling,
lnegative binomial mixed model. If a tool has several normalization methods, the standard method is underlined.
Figure 1The standard experimental design (A) consisted of two TCs having 4 time points with three replicates each. A single value for each gene was sampled from a negative binomial distribution using mean/dispersion value pairs of a biological data set. Time points and replicates were then drawn from the same distribution and expression patterns applied by multiplying with the pattern vector. Of the total 24 patterns that were simulated, each consisted of 50 genes, resulting in the simulation of 1200 DEGs in total (B). As genes were drawn from a negative binomial distribution, each pattern mostly consists of lowly expressed genes and a few highly expressed genes. (C) Other experimental designs were tested by increasing or reducing the library size, replicates or time points (standard parameters in parenthesis).
Test summary statistics of the standard scenario
| Tools | TP | FP | FN | TN | sen | spec | prec | FNR | FDR | acc | F1 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| DyNB | 675 | 525 | 525 | 16 778 | 0.56 | 0.97 | 0.56 | 0.44 | 0.44 | 0.94 | 0.56 |
| EBSeqHMM | 527 | 3889 | 673 | 13 414 | 0.44 | 0.78 | 0.12 | 0.56 | 0.88 | 0.75 | 0.19 |
| FunPat | 470 | 0 | 730 | 17 303 | 0.39 | 1.00 | 1.00 | 0.61 | 0.00 | 0.96 | 0.56 |
| ImpulseDE2 | 980 | 57 | 220 | 17 246 | 0.82 | 1.00 | 0.95 | 0.18 | 0.05 | 0.99 | 0.88 |
| lmms | 836 | 99 | 364 | 17 303 | 0.70 | 0.99 | 0.89 | 0.30 | 0.11 | 0.97 | 0.78 |
| maSigPro | 947 | 699 | 253 | 16 604 | 079 | 0.96 | 0.58 | 0.21 | 0.42 | 0.95 | 0.67 |
| nsgp | 482 | 718 | 718 | 16 585 | 0.40 | 0.96 | 0.40 | 0.60 | 0.60 | 0.92 | 0.40 |
| pairwise | 1014 | 70 | 186 | 17 233 | 0.85 | 1.00 | 0.94 | 0.16 | 0.06 | 0.97 | 0.89 |
| splineTC | 881 | 53 | 319 | 17 250 | 0.73 | 1.00 | 0.94 | 0.27 | 0.06 | 0.98 | 0.83 |
| timeSeq | 801 | 802 | 399 | 16 501 | 0.67 | 0.95 | 0.50 | 0.33 | 0.50 | 0.94 | 0.57 |
FN, false negatives; TN, true negatives; sen(sitivity), correctly identify TP; spec(ificity), correctly identify FP; prec(ision), ratio of correctly identified candidates; FNR (false-negative rate), ratio of falsely refused candidates; FDR, ratio of falsely identified candidates; acc(uracy), ratio of correctly identified TP and FP; F1, weighted harmonic mean of precision and sensitivity.
Figure 2Results of standard simulation scenario. (A) ROC showing the TPR and FPR on the x and y axis, respectively. FDR thresholds of 0.01, 0.05 and 0.1 are indicated by rings on each curve. (B) TPR/FDR curves with ring indicated adjusted P-value thresholds of 0.01, 0.05 and 0.1. (C) AUC fraction (ranging from 0/worst to 1/best) calculated for the ROC on several FDR thresholds. (D) Performance of TC tools on noisy data ranging from 0.05 to 0.2 white noise added to the samples.
Figure 3Results of overlapping candidate lists. (A) Overlaps of true-positive (left) and false-positive (right) candidates of top five tools. (B) ROC curve with computed AUC for overlaps of candidate lists. The number of the overlap indicates the minimum number of lists sharing candidates. (C) ROC curve with computed AUC for top five tools. FPR thresholds of 0.01, 0.05 and 0.1 are indicated by dashed red lines.
Figure 4Experimental design and results of published data on PIP3 signaling perturbations. (A) Experimental design and processing steps of samples. (B) DESeq2 overlaps of DEGs between TCs and T0 for further categorization and GO analysis. (C) GO enrichment for Class A DEGs for each method and the combined approach. The length of the bar depicts the number of enriched genes in each term. Log10 P-value is indicated by color (increasing from colored to gray), and is shown for the first and last term to indicate the range.