| Literature DB >> 23734663 |
Abstract
BACKGROUND: RNA-Seq has become a key technology in transcriptome studies because it can quantify overall expression levels and the degree of alternative splicing for each gene simultaneously. To interpret high-throughout transcriptome profiling data, functional enrichment analysis is critical. However, existing functional analysis methods can only account for differential expression, leaving differential splicing out altogether.Entities:
Mesh:
Year: 2013 PMID: 23734663 PMCID: PMC3622641 DOI: 10.1186/1471-2105-14-S5-S16
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Correlation coefficients of DE and DS scores and experiment statistics of the four data sets.
| Data Set | P-value | Sample Size | Platform | SE/PE | Strand Specific | Read Length | Frag. Size | Tissue | |
|---|---|---|---|---|---|---|---|---|---|
| Artificial | 0.23 | 0 | 10 v 9 | GAII | - | - | - | - | - |
| Cancer | 0.05 | 0 | 20 v 10 | GAII | PE | No | 36 | 200-300 | Prostate |
| BA46 | -0.007 | 0.17 | 20 v 20 | SOLiD v4 | SE | Yes | 50 | 200-300 | Brain (BA46) |
| BA22 | -0.033 | 3.4e-9 | 9 v 9 | GAII | SE | No | 76 | 200-250 | Brain (BA22) |
The number of significant gene sets on the four data sets at FDR 1% with linear combination strategy.
| Dataset | GS | DS | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | DE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Artificial | c2 | 1550 | 982 | 1389 | 1656 | 1773 | 1757 | 1569 | 1247 | 696 | 261 | 64 |
| c5 | 647 | 326 | 479 | 681 | 766 | 747 | 675 | 558 | 354 | 158 | 66 | |
| Cancer | c2 | 4 | 3 | 4 | 10 | 14 | 16 | 16 | 16 | 14 | 12 | 12 |
| c5 | 11 | 11 | 20 | 14 | 13 | 9 | 6 | 2 | 2 | 1 | 1 | |
| BA46 | c2 | 0 | 1 | 1 | 2 | 3 | 3 | 2 | 4 | 3 | 2 | 2 |
| c5 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | 0 | |
| BA22 | c2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| c5 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | |
For the full results, see Supplementary Tables S1-S4.
GS: gene set category; DS: DS-only GSEA; DE: DE-only GSEA; 0.1,...,0.9: weights α.
Figure 1Saturation plots of weights on gene set category c5. Shown are the numbers of unique gene sets detected by different number of weights indicated in x-axis. (a) is for the artificial data sets; (b) cancer; (c) BA46; (d) BA22. From the fewest to the most number of weights, we gradually included the following weights in the order of (1 - DE-only, 0 - DS-only, 0.5, 0.1, 0.9, 0.3, 0.7, 0.2, 0.4, 0.6, 0.8, 0.05, 0.15, 0.25, 0.35, 0.45, 0.55, 0.65, 0.75, 0.85, 0.95). Red color indicates "Linear" combination, blue "RankSp", and black "RankGlb".