| Literature DB >> 32264950 |
José María Martínez-Otzeta1, Itziar Irigoien1, Basilio Sierra1, Concepción Arenas2.
Abstract
BACKGROUND: Microarray technology provides the expression level of many genes. Nowadays, an important issue is to select a small number of informative differentially expressed genes that provide biological knowledge and may be key elements for a disease. With the increasing volume of data generated by modern biomedical studies, software is required for effective identification of differentially expressed genes. Here, we describe an R package, called ORdensity, that implements a recent methodology (Irigoien and Arenas, 2018) developed in order to identify differentially expressed genes. The benefits of parallel implementation are discussed.Entities:
Keywords: Differentially expressed gene; Multivariate statistics; Outlier; Parallel implementation; Quantile; R package
Mesh:
Year: 2020 PMID: 32264950 PMCID: PMC7137194 DOI: 10.1186/s12859-020-3463-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Visualization of differences for p∈C={0.25,0.5,0.75} for two genes. In the left side, a gene whose expressions in conditions X and Y are not differentially expressed (No DE gene); in the right side, a gene that is differentially expressed in conditions X and Y (DE gene)
Fig. 2Running times for simulated data with different numbers of genes, different numbers of cores and different number of permutations used in the bootstrap procedure. In the vertical axis the type of execution: sequential or the number of processes working
Fig. 3For the simexpr data set, representation of the potential genes based on OR (vertical axis), FP (horizontal axis) and dFP (size of the circle is inversely proportional to its value). Genes identified by the relaxed selection as DEGs are indicated by the symbol “ △”; in red and blue, genes belonging to cluster 1 and cluster 2, respectively
Using the simexpr data set, distribution among the two clusters of the genes identified by ORdensity as DEGs
| Identified DEGs | |||
|---|---|---|---|
| True DE genes | Cluster 1 | Cluster2 | |
| 35 | 3 | 34 (9S/25R) | 1 (0S/1R) |
| 32 | 2 | 31 (2S/29R) | 0 |
| 33 | 1.5 | 18 (1S/17R) | 12 (0S/12R) |
| Not true DE | |||
| 900 | - | 0 | 1 (0S/1R) |
In brackets the number of DEGs selected by strong (S) and relaxed (R) criterion, respectively