| Literature DB >> 19891778 |
Pingzhao Hu1, Celia M T Greenwood, Joseph Beyene.
Abstract
BACKGROUND: Development of efficient analytic methodologies for combining microarray results is a major challenge in gene expression analysis. The widely used effect size models are thought to provide an efficient modeling framework for this purpose, where the measures of association for each study and each gene are combined, weighted by the standard errors. A significant disadvantage of this strategy is that the quality of different data sets may be highly variable, but this information is usually neglected during the integration. Moreover, it is widely known that the estimated standard deviations are probably unstable in the commonly used effect size measures (such as standardized mean difference) when sample sizes in each group are small.Entities:
Mesh:
Year: 2009 PMID: 19891778 PMCID: PMC2784452 DOI: 10.1186/1752-0509-3-106
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Parameters used in simulation of probe-level gene expression profile
| Parameter | Study 1 | Study 2 |
|---|---|---|
| Number of genes | 1000 | 1000 |
| Proportion of expressed genes | 0.5 | 0.5 |
| Proportion of differentially expressed genes | 0.1 | 0.1 |
| Sample size | 25 arrays in groups | 50 arrays in groups |
| Number of probes in each probeset | 11 | 16 |
Main characteristics of the Affymetrix microarray data sets
| Studies | Number of Normal Samples | Number of Prostate Cancer Samples | Chip Type |
|---|---|---|---|
| Singh study | 50 | 52 | Affymetrix |
| Welsh study | 8 | 25 | Affymetrix |
| LaTulippe study | 3 | 23 | Affymetrix |
| Stuart study | 50 | 38 | Affymetrix |
Area under the curves of the four meta-analysis models (s = 0.05)
| Effect Size | WROM | UWROM | WSMD | UWSMD |
|---|---|---|---|---|
| 0.978 | 0.965 | 0.942 | 0.903 | |
| 0.962 | 0.949 | 0.942 | 0.905 | |
| 0.958 | 0.932 | 0.924 | 0.877 | |
Rank of known and validated prostate cancer markers
| Gene Name | LIMMA | WROM | UWROM | WSMD | UWSMD | Source |
|---|---|---|---|---|---|---|
| HEPSIN | 2 | 2 | 2 | 1 | 6 | Welsh et al. (2001) |
| MIC-1 | 145 | 19 | 66 | 110 | 61 | Welsh et al. (2001) |
| FASN | 173 | 15 | 13 | 72 | 231 | Welsh et al. (2001) |
| TACSTD1 | 32 | 6 | 6 | 289 | 413 | Welsh et al. (2001) |
| PSCA | 10344 | 8948 | 8072 | 11622 | 12259 | Tricoli et al. 2004 |
| PSMA | 509 | 508 | 294 | 433 | 220 | Tricoli et al. 2004 |
| TERT | 6636 | 4625 | 7741 | 9945 | 7596 | Tricoli et al. 2004 |
| GSTP1 | 1744 | 99 | 366 | 1508 | 2368 | Tricoli et al. 2004 |
| GRN | 6386 | 1880 | 840 | 2332 | 2336 | Tricoli et al. 2004 |
Figure 1Comparison of meta-analysis and single study analysis. Overlap of top-ranked genes by meta-analysis and single study analysis
Figure 2Prediction accuracy of the SVM model with linear kernel. Prediction accuracy of the SVM models as a function of the number of differentially expressed genes selected by the four meta-analytic procedures, respectively
Figure 3Prediction accuracy of the SVM model with radial kernel. Prediction accuracy of the SVM models as a function of the number of differentially expressed genes selected by the four meta-analytic procedures, respectively