| Literature DB >> 25988149 |
Anton A Buzdin1, Alex A Zhavoronkov2, Mikhail B Korzinkin3, Sergey A Roumiantsev4, Alexander M Aliper2, Larisa S Venkova3, Philip Y Smirnov3, Nikolay M Borisov3.
Abstract
The diversity of the installed sequencing and microarray equipment make it increasingly difficult to compare and analyze the gene expression datasets obtained using the different methods. Many applications requiring high-quality and low error rates cannot make use of available data using traditional analytical approaches. Recently, we proposed a new concept of signalome-wide analysis of functional changes in the intracellular pathways termed OncoFinder, a bioinformatic tool for quantitative estimation of the signaling pathway activation (SPA). We also developed methods to compare the gene expression data obtained using multiple platforms and minimizing the error rates by mapping the gene expression data onto the known and custom signaling pathways. This technique for the first time makes it possible to analyze the functional features of intracellular regulation on a mathematical basis. In this study we show that the OncoFinder method significantly reduces the errors introduced by transcriptome-wide experimental techniques. We compared the gene expression data for the same biological samples obtained by both the next generation sequencing (NGS) and microarray methods. For these different techniques we demonstrate that there is virtually no correlation between the gene expression values for all datasets analyzed (R (2) < 0.1). In contrast, when the OncoFinder algorithm is applied to the data we observed clear-cut correlations between the NGS and microarray gene expression datasets. The SPA profiles obtained using NGS and microarray techniques were almost identical for the same biological samples allowing for the platform-agnostic analytical applications. We conclude that this feature of the OncoFinder enables to characterize the functional states of the transcriptomes and interactomes more accurately as before, which makes OncoFinder a method of choice for many applications including genetics, physiology, biomedicine, and molecular diagnostics.Entities:
Keywords: RNA-Seq; correction of errors; gene expression; intracellular signaling pathway activation; microarray hybridization; next generation sequencing; signalome; transcriptome profiling
Year: 2014 PMID: 25988149 PMCID: PMC4428387 DOI: 10.3389/fmolb.2014.00008
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
Transcriptomic data deposited in the GEO database that were used for the current study.
| GSE36244 | HepG2 cells | Treated vs. untreated with benzopyrene | Affymetrix human genome U133 Plus 2.0 arrays and illumina genome analyzer sequencer |
| GSE41588 | HT-29 cells | Treated vs. untreated with 5-aza-deoxy-cytidine | Affymetrix human genome U133 Plus 2.0 arrays and illumina genome analyzer sequencer |
| GSE37765 | Lung adenocarcinoma | Tumor samples vs. matched samples of normal tissue | Agilent 1M CNV arrays and illumina genome analyzer sequencer |
Figure 1Clouds of values obtained using the RNA next-generation sequencing vs. RNA microarray analysis methods. Upper row (A,B): cell replica 1, 24 h after BaP treatment from the HepG2 cells, dataset GSE36244 (van Delft et al., 2012). Middle row (C,D): treatment with 5 μM of 5-Aza and cell replica 1 from the HT-29 cells, dataset GSE41588 (Xu et al., 2013). Lower row (E,F): sample P8 from the lung adenocarcinoma dataset GSE37765 (Kim et al., 2013). Left column (A,C,E): values of decimal logarithmic CNR for each gene. Right column (B,D,F): values of PAS.
Correlation coefficients between values obtained using the RNA microarray analysis and RNA sequencing methods for the HepG2 cells dataset GSE36244 (van Delft et al., .
| GSE36244, 24 h after BaP treatment | Replica 1 | 0.35 | 0.49 |
| Replica 2 | 0.10 | 0.47 | |
| Averaged over 2 samples above | 0.22 | 0.49 | |
| GSE41588, 5 μM of 5-Aza | Replica 1 | 0.16 | 0.89 |
| Replica 2 | 0.049 | 0.88 | |
| Replica 3 | 0.047 | 0.80 | |
| Averaged over 3 samples above | 0.082 | 0.87 | |
| GSE37765 | P1 | 0.18 | 0.79 |
| P3 | 0.098 | 0.75 | |
| P4 | 0.12 | 0.80 | |
| P5 | −0.029 | 0.21 | |
| P8 | 0.043 | 0.80 | |
| Averaged over 5 samples above | 0.068 | 0.77 | |