| Literature DB >> 32093752 |
Lara Urban1,2, Christian W Remmele1, Marcus Dittrich1,3, Roland F Schwarz4, Tobias Müller5.
Abstract
OBJECTIVE: The biological interpretation of gene expression measurements is a challenging task. While ordination methods are routinely used to identify clusters of samples or co-expressed genes, these methods do not take sample or gene annotations into account. We aim to provide a tool that allows users of all backgrounds to assess and visualize the intrinsic correlation structure of complex annotated gene expression data and discover the covariates that jointly affect expression patterns.Entities:
Keywords: Fourthcorner analysis; High-throughput data; Microarray analysis; Multivariate analysis; Ordination methods; RLQ analysis; RNA-Seq analysis; Transcriptomics; Visualization
Year: 2020 PMID: 32093752 PMCID: PMC7038619 DOI: 10.1186/s13104-020-04946-1
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Speed-up of the fourthcorner analysis implemented in covRNA due to parallelization across multiple cores
| Permutations | 103 | 104 | 105 | 106 | 107 |
|---|---|---|---|---|---|
| 1 Core (time in sec) | 9.1 | 52.9 | 5.3 × 102 | 6.8 × 103 | 6.9 × 104 |
| 10 Cores (time in sec) | 8.5 | 15.7 | 84.7 | 7.8 × 102 | 7.7 × 103 |
| Speed-up | 1.1 | 3.4 | 6.3 | 8.2 | 9.0 |
The fourthcorner analysis is performed on the Bacillus anthracis example dataset on 1 and 10 cores for different numbers of permutations as indicated in the first row. The following rows indicate the required user time in seconds while the last row indicates the relative speed-up of the multi-threading approach. The run time was profiled on a server with 72 Cores (Intel Xeon CPU E5-2699 v3 @ 2.30 GHz) with 512 GB RAM
Fig. 1covRNA’s statistical test is shown to control the type I error rate correctly. A p-value distribution under the null hypothesis of covRNA’s statistical test between sample and gene annotations for n = 1000 permutations is generated. The results of the permutation of one random sample annotation-gene annotation association are shown here. a Histogram of the resulting p-values. b Q–Q plot of the p-values
Fig. 2Visualization of covRNA analysis of microarray data of human dendritic cells infected with A. fumigatus based on the MSigDB hallmark gene set. a Ordination of sample and gene covariates. The lines between the covariates denote significant negative (red) and positive (blue) associations (at a significance level α = 0.05, each condition tested in turn versus the others). Gray covariates are not involved in any significant association. b Results of the association test. Consistently, red, blue and gray colors denote significant negative, positive or no significant associations (at a significance level α = 0.05)