| Literature DB >> 18315862 |
Hongya Zhao1, Kwok-Leung Chan, Lee-Ming Cheng, Hong Yan.
Abstract
BACKGROUND: Identification of differentially expressed genes is a typical objective when analyzing gene expression data. Recently, Bayesian hierarchical models have become increasingly popular to solve this type of problems. These models show good performance in accommodating noise, variability and low replication of microarray data. However, the correlation between different fluorescent signals measured from a gene spot is ignored, which can diversely affect the data analysis step. In fact, the intensities of the two signals are significantly correlated across samples. The larger the log-transformed intensities are, the smaller the correlation is.Entities:
Mesh:
Year: 2008 PMID: 18315862 PMCID: PMC2259410 DOI: 10.1186/1471-2105-9-S1-S9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Robust locations v.s. scales of gene expression data. The x-axis is the estimation of locations and y-axis is the estimation of scales. The left graph is plotted with the expression data of control groups and the right one is for the Cd toxic treatment (right) groups in Cd toxic microarray experiment.
Figure 2Mean of log-transformed intensities v.s. correlation coefficient. The x-axis is the mean of log-transformed measurements over the replications and y-axis is the correlation coefficient between the pair of measurements within a spot.
Figure 3Effect of sample size on error rates in the multiple testing in our simulation study. Effect of the sample size on error in multiple testing with the GLRT to identify the DE genes under the multivariate hierarchical model. The x-axis is the number of replications and the y-axis is the error rates. The sensitivity is expressed by diamond, specificity by circle, PPV by star and NPV by triangle.
Operating characteristics in simulation study of ρ = 0.1, 0.5 and 0.9.
| Corr. Coef. | Method | Sensitivity | Specificity | PPV | NPV | FDR |
| 0.1 | GLRT | 0.742 | 0.998 | 0.960 | 0.987 | 0.040 |
| LNN | 0.667 | 0.991 | 0.826 | 0.980 | 0.174 | |
| t-test | 0.557 | 0.987 | 0.692 | 0.978 | 0.308 | |
| fold change | 0.309 | 0.999 | 0.968 | 0.966 | 0.032 | |
| 0.5 | GLRT | 0.945 | 0.998 | 0.966 | 0.997 | 0.034 |
| LNN | 0.635 | 0.986 | 0.873 | 0.972 | 0.127 | |
| t-test | 0.516 | 0.968 | 0.435 | 0.977 | 0.565 | |
| fold-change | 0.252 | 1 | 1 | 0.966 | 0 | |
| 0.9 | GLRT | 0.980 | 0.999 | 0.990 | 0.999 | 0.010 |
| LNN | 0.568 | 0.988 | 0.877 | 0.987 | 0.123 | |
| t-test | 0.406 | 0.972 | 0.436 | 0.969 | 0.564 | |
| fold change | 0.139 | 1 | 1 | 0.956 | 0 | |
Number of errors in N multiple test
| # not rejected (negative) | # rejected (positive) | Total | |
| # True H0 (EE) | |||
| # Non-true H0 (DE) | |||
# denotes the number of elements in the set.