| Literature DB >> 21989042 |
Abstract
BACKGROUND: As the magnitude of the experiment increases, it is common to combine various types of microarrays such as paired and non-paired microarrays from different laboratories or hospitals. Thus, it is important to analyze microarray data together to derive a combined conclusion after accounting for heterogeneity among data sets. One of the main objectives of the microarray experiment is to identify differentially expressed genes among the different experimental groups. We propose the linear mixed effect model for the integrated analysis of the heterogeneous microarray data sets.Entities:
Mesh:
Year: 2011 PMID: 21989042 PMCID: PMC3226253 DOI: 10.1186/1471-2105-12-S5-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Descriptive information for the liver cancer microarray data
| Data set ID | Hospital | Chip type | Number of paired samples | Number of non-paired samples | Total number of samples | ||
|---|---|---|---|---|---|---|---|
| tumor | control | tumor | control | ||||
| D1 | A | 15 | 15 | 1 | 1 | 32 | |
| D2 | B | 23 | 23 | 0 | 0 | 46 | |
| D3 | C | 4 | 4 | 25 | 1 | 34 | |
| D4 | C | 8 | 8 | 4 | 1 | 21 | |
Genes that are identified as differentially expressed when FDR is controlled 1%, 5%, 10%, and 20%, respectively
| FDR | Meta analysis | Two-stage ANOVA | LMe | ||||
|---|---|---|---|---|---|---|---|
| M1 | M2 | M3 | M4 | M5 | |||
| 1% | 57 | 46 | 119 | 184 | 205 | 124 | 37 |
| 5% | 197 | 145 | 214 | 543 | 589 | 375 | 114 |
| 10 % | 303 | 203 | 339 | 879 | 978 | 740 | 181 |
| 20 % | 478 | 336 | 585 | 1500 | 1761 | 1323 | 342 |
Figure 1Number of genes that are identified by meta-analysis, two-stage ANOVA, and LMe model M3 when FDR is controlled by 1%
Common genes detected by meta-analysis, two-stage ANOVA model, and LMe M3 model when FDR is controlled by 1% (9 known genes)
| Unigene ID | Description |
|---|---|
| Hs.82084 | Integrin beta 3 binding protein (beta3-endonexin) (ITGB3BP), mRNA |
| Hs.514 | Cyclin H (CCNH), mRNA |
| Hs.167529 | Cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase), polypeptide 9 (CYP2C9), mRNA |
| Hs.117367 | Solute carrier family 22 (organic cation transporter), member 1 (SLC22A1), mRNA |
| Hs.54900 | Serologically defined colon cancer antigen 1 (SDCCAG1), mRNA |
| Hs.80756 | Betaine-homocysteine methyltransferase (BHMT), mRNA |
| Hs.8765 | RNA helicase-related protein (RNAHP), mRNA |
| Hs.755990 | Haptoglobin (HP), mRNA |
| Hs.35101 | Proline-rich Gla (G-carboxyglutamic acid) polypeptide 2 (PRRG2), mRNA |
Setting for random effects b
| Type | random effects of subject | covariance of random effects |
|---|---|---|
| 1 | ||
| 2 | ||
| 3 | ||
Power and FDR of methods under simulated data of Types 1, 2, and 3 covariance structures when FDR was controlled by 0.05
| Type | Meta analysis | Two-stage ANOVA | LMe | ||||||
|---|---|---|---|---|---|---|---|---|---|
| M1 | M2 | M3 | M4 | M5 | |||||
| 1 | 0 | Power | 0.2087 | 0.1983 | 0.2770 | 0.2073 | 0.2610 | 0.2273 | 0.2240 |
| FDR | 0.0740 | 0.0600 | 0.1150 | 0.0730 | 0.0863 | 0.0746 | 0.0807 | ||
| 0.2 | Power | 0.1863 | 0.1683 | 0.3493 | 0.2570 | 0.2847 | 0.2607 | 0.1903 | |
| FDR | 0.0345 | 0.0307 | 0.0958 | 0.0666 | 0.0707 | 0.0668 | 0.0371 | ||
| 0.4 | Power | 0.1543 | 0.1403 | 0.4783 | 0.3950 | 0.4007 | 0.3953 | 0.1580 | |
| FDR | 0.0170 | 0.0186 | 0.0718 | 0.0558 | 0.0573 | 0.0557 | 0.0104 | ||
| 2 | 0 | Power | 0.1453 | 0.1423 | 0.3650 | 0.3067 | 0.3093 | 0.3073 | 0.1570 |
| FDR | 0.0224 | 0.0251 | 0.0920 | 0.0564 | 0.0569 | 0.0563 | 0.0248 | ||
| 0.2 | Power | 0.1290 | 0.1347 | 0.3373 | 0.2867 | 0.2867 | 0.2867 | 0.1450 | |
| FDR | 0.0203 | 0.0098 | 0.0858 | 0.0591 | 0.591 | 0.0591 | 0.0247 | ||
| 0.4 | Power | 0.1490 | 0.1497 | 0.3700 | 0.3150 | 0.3170 | 0.3150 | 0.1503 | |
| FDR | 0.0283 | 0.0323 | 0.1091 | 0.0680 | 0.0676 | 0.0680 | 0.0363 | ||
| 3 | Power | 0.1517 | 0.0010 | 1.000 | 1.0000 | 1.0000 | 1.0000 | 0.0043 | |
| FDR | 0.0000 | 0.0000 | 0.0712 | 0.0455 | 0.0455 | 0.0455 | 0.0000 | ||