| Literature DB >> 30326833 |
Rhonda Bacher1, Ning Leng2, Li-Fang Chu2, Zijian Ni3, James A Thomson2, Christina Kendziorski4, Ron Stewart5.
Abstract
BACKGROUND: High-throughput expression profiling experiments with ordered conditions (e.g. time-course or spatial-course) are becoming more common for studying detailed differentiation processes or spatial patterns. Identifying dynamic changes at both the individual gene and whole transcriptome level can provide important insights about genes, pathways, and critical time points.Entities:
Keywords: Gene expression; R package; RNA-seq; Segmented regression; Shiny; Time-course
Mesh:
Year: 2018 PMID: 30326833 PMCID: PMC6192113 DOI: 10.1186/s12859-018-2405-x
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Trendy framework. The Trendy framework fits multiple segmented regression models to each feature/gene. The optimal model is selected as the one with the smallest BIC. Trendy summarizes the expression pattern of each gene and provides a summary of global dynamics
Fig. 2Simulation results. A set of replicate RNA-seq samples collected at the same time were shuffled and assigned a time-order. The number of top dynamic genes identified by Trendy was determined using two adjusted R2 thresholds of and . Shown in panel a are the number of genes above the cutoffs for all combinations of settings for K,N,and mNS (each combination was simulated 300 times over varying point distributions). Panel b contains the number of genes above each cutoff over three various point distribution scenarios (each box contains 2400 simulations over all other varied parameters)
Results of simulation study for genes having a true simulated trend
| Low variance | High variance | |||
|---|---|---|---|---|
| Average % correct: | K | Trend | K | Trend |
| K = 0 | 100% | 99% | 100% | 94% |
| K = 1 | 97% | 93% | 92% | 86% |
| K = 2 | 95% | 88% | 79% | 72% |
The average percent of genes over all simulations classified correctly in terms of K and the trend direction when the true K is simulated as either 0, 1, or 2 and the within-gene variance is either low or high
Fig. 3Results of Trendy on the Whitfield dataset. Panel a is the breakpoint distribution for the 118 genes having . Orange bars indicate the S phase and black arrows indicate the time of mitosis as shown in Figures 1 and 2 in Whitfield et al., 2002. Panel b contains two genes identified by Trendy with different expression dynamics over the time-course
Fig. 4Results of Trendy on the Axolotl dataset. Panel a contains the breakpoint distribution for all 9535 genes having . The orange bars indicate the times of major transcriptome changes identified in Figure 2 in Jiang and Nelson et al., 2016. Panel b shows two genes identified by Trendy with different expression dynamics over the time-course. The first gene, NSD1, has three estimated breakpoints, while GDF9 has two breakpoints
Fig. 5Comparison to EBSeq-HMM. The reported expression trend for a single simulated gene analyzed using Trendy and EBSeq-HMM is shown. In a Trendy reports two increasing segments separated by a breakpoint between times 7 and 8. In b EBSeq-HMM reports the expression path as “EE-EE-EE-EE-EE-EE-EE-EE-Up”, where ‘EE’ is equivalent to ‘no-change’