| Literature DB >> 18522715 |
Wing-Cheong Wong1, Marie Loh, Frank Eisenhaber.
Abstract
BACKGROUND: The original spotted array technology with competitive hybridization of two experimental samples and measuring relative expression levels is increasingly displaced by more accurate platforms that allow determining absolute expression values for a single sample (for example, Affymetrix GeneChip and Illumina BeadChip). Unfortunately, cross-platform comparisons show a disappointingly low concordance between lists of regulated genes between the latter two platforms.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18522715 PMCID: PMC2453111 DOI: 10.1186/1745-6150-3-23
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Intensity output of Illumina & Affymetrix across K technical replicates
| Platform | Replicate 1 | Replicate 2 | ⋯ | Replicate K |
| Illumina BeadChip (Raw data) | ⋯ | |||
| Illumina BeadChip (Summary data) | ⋯ | |||
| Affymetrix GeneChip (Raw data) | ⋯ |
Number of TP and FP genes based on P-value ranking.
| 0.01 vs 0 | 0 | 34 | 0 | 34 | 0 |
| 0.03 vs 0.01 | 0 | 34 | 0 | 34 | 0 |
| 0.1 vs 0.03 | 7 | 27 | 9 | 25 | 7 |
| 0.3 vs 0.1 | 14 | 20 | 22 | 12 | 14 |
| 1 vs 0.3 | 30 | 4 | 33 | 1 | 30 |
| 3 vs 1 | 30 | 4 | 34 | 0 | 30 |
| 10 vs 3 | 33 | 1 | 34 | 0 | 33 |
| 30 vs 10 | 34 | 0 | 34 | 0 | 34 |
| 100 vs 30 | 33 | 1 | 34 | 0 | 33 |
| 300 vs 100 | 4 | 30 | 26 | 8 | 4 |
| 1000 vs 300 | 9 | 25 | 16 | 18 | 8 |
The first column indicates the 11 test cases of pair-wise treatment conditions. The second and third columns indicate the number of TP and FP genes respectively for using σas the summary statistic. Similarly, column four and five indicate the results for using σ. The last column indicates the result for the number of common TP genes found by both σand σ.
Figure 1Relationship between . We show the plot of mean estimates of bead intensities μagainst standard deviation in mean estimates σand bead intensities σof 0 pM spike concentration data.
KEGG pathways elucidated from the MT1-MMP data.
| HSA01430 : Cell communication | ✓ | ✓ | ✓ |
| HSA04540 : Gap junction | ✓ | ✓ | ✓ |
| HSA04610 : Complement and coagulation cascades | ✓ | ✓ | ✓ |
| HSA04110 : Cell cycle | ✓ |
The first column indicates the name of the KEGG pathways. Only KEGG pathways with EASE score ≤ 0.05 and count ≥ 5 are included. The second to fourth columns indicate KEGG pathway found by Golubkov V.S. et. al. [7], σand σrespectively. Cutoff criteria of EASE score ≤ 0.05 and count ≥ 5 was applied.
Cell cycle genes in the MT1-MMP data.
| CCNA1 | NM_003914 | - | ≥ 0.05 | 1.12 | 0.00 | Cyclin A1 |
| CDC45L | NM_003504 | - | ≥ 0.05 | 1.05 | 0.00 | CDC45 cell division cycle 45-like |
| CCNB1 | NM_031966 | 1.30 | < 0.05 | 1.44 | 0.00 | Cyclin B1 |
| CCNB2 | NM_004701 | 1.27 | < 0.05 | 1.39 | 0.00 | Cyclin B2 |
| CDC2 | NM_033379 | 1.10 | < 0.05 | 1.20 | 0.00 | cell division cycle 2, G1 to S and G2 to M |
| CDC20 | NM_001255 | 2.02 | < 0.05 | 2.01 | 0.00 | Cell division cycle 20 homolog |
The first, second and last column indicate the gene symbol, the RefSeq ID and the description of each cell cycle gene respectively. The third and fourth columns indicate the logarithm fold change and the P-value of each gene found by Golubkov V.S. et. al. [7], which is analogous to using σas the summary statistic. The fifth and sixth columns indicate the logarithm fold change and the P-value of each gene found by using σas the summary statistic. Cutoff criteria of p ≤ 0.05 and |FC| ≥ 2 were applied. The genes in the upper section were found only by the σstatistic, whereas the genes in the lower section were detected with significance by both statistics.
KEGG pathways elucidated from the human spermatogenesis data.
| HSA00190 : Oxidative phosphorylation | ✓ | ✓ | ✓ |
| HSA00970 : Aminoacyl-tRNA synthetases | ✓ | ✓ | ✓ |
| HSA03010 : Ribosome | ✓ | ✓ | ✓ |
| HSA03050 : Proteosome | ✓ | ✓ | ✓ |
| HSA00010 : Glycolysis/Gluconeogenesis | ✓ | ✓ | |
| HSA00030 : Pentose phosphate pathway | ✓ | ✓ | |
| HSA00193 : ATP synthesis | ✓ | ✓ | |
| HSA00530 : Aminosugars metabolism | ✓ | ✓ | |
| HSA00640 : Propanoate metabolism | ✓ | ✓ | |
| HSA03020 : RNA polymerase | ✓ | ✓ | |
| HSA03060 : Protein export | ✓ | ✓ | |
| HSA04110 : Cell cycle | ✓ | ✓ | |
| MMU03010 : Ribosome | ✓ | ✓ | |
| HSA04120 : Ubiquitin mediated proteolysis | ✓ | ||
| HSA00020 : Citrate cycle (TCA cycle) | ✓ | ||
| HSA00240 : Pyrimidine metabolism | ✓ | ||
| HSA00251 : Glutamate metabolism | ✓ | ||
| HSA00510 : N-glycan biosynthesis | ✓ | ||
| HSA03022 : Basal transcription factors | ✓ | ||
| HSA04520 : Adherens junction | ✓ | ||
| HSA04530 : Tight junction | ✓ | ||
| HSA00620 : Pyruvate metabolism | ✓ | ||
The first column indicates the name of the KEGG pathways. Only KEGG pathways with EASE score ≤ 0.05 and count ≥ 5 are included. The second column indicates KEGG pathways found by Affymetrix. The third and fourth columns indicate the KEGG pathways found by using σand σrespectively. Cutoff criteria of EASE score ≤ 0.05 and count ≥ 5 were applied.
N-glycan biosynthesis genes in human spermatogenesis data.
| B4GALT1 | NM_001497 | -0.45 | 0.499 | -1.08 | 0.000 | UDP-Gal:betaGlcNAc beta 1,4- galactosyltransferase, polypeptide 1 |
| DDOST | NM_005216 | -2.22 | 0.067 | -1.18 | 0.000 | dolichyl-diphosphooligosaccharide-protein glycosyltransferase |
| DHDDS | NM_024887 | 3.60 | 0.049 | 3.62 | 0.000 | dehydrodolichyl diphosphate synthase |
| DPM1 | NM_003859 | -0.65 | 0.320 | -1.06 | 0.000 | dolichyl-phosphate mannosyltransferase polypeptide 1, catalytic subunit |
| GANAB | NM_198334 | 1.45 | 0.045 | 2.12 | 0.000 | glucosidase, alpha; neutral AB |
| MAN1A2 | NM_006699 | -1.27 | 0.019 | -1.28 | 0.000 | mannosidase, alpha, class 1A, member 2 |
| MAN2A2 | NM_006122 | -0.81 | 0.001 | -1.06 | 0.000 | mannosidase, alpha, class 2A, member 2 |
| UGCGL2 | NM_020121 | -0.97 | 0.003 | -1.25 | 0.000 | UDP-glucose ceramide glucosyltransferase-like 2 |
| ALG2 | NM_033087 | -2.18 | 0.000 | -2.20 | 0.000 | asparagine-linked glycosylation 2 homolog (S. cerevisiae, alpha-1,3-mannosyltransferase) |
| ALG5 | NM_013338 | -1.90 | 0.000 | -1.71 | 0.000 | asparagine-linked glycosylation 5 homolog (S. cerevisiae, dolichyl-phosphate beta-glucosyltransferase) |
| ALG8 | NM_024079 | -1.36 | 0.001 | -1.52 | 0.000 | asparagine-linked glycosylation 8 homolog (S. cerevisiae, alpha-1,3-glucosyltransferase) |
| B4GALT2 | NM_003780 | 2.33 | 0.006 | 2.29 | 0.000 | UDP-Gal:betaGlcNAc beta 1,4- galactosyltransferase, polypeptide 2 |
| MAN2A1 | NM_002372 | -1.21 | 0.055 | -1.33 | 0.000 | mannosidase, alpha, class 2A, member 1 |
| MGAT4A | NM_012214 | -1.84 | 0.001 | -1.72 | 0.000 | mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-acetylglucosaminyltransferase, isozyme A |
| OGT | NM_181673 | -2.31 | 0.009 | -2.12 | 0.000 | O-linked N-acetylglucosamine (GlcNAc) transferase (UDP-N-acetylglucosamine:polypeptide-N-acetylglucosaminyl transferase) |
| RPN1 | NM_002950 | -1.46 | 0.001 | -1.51 | 0.000 | ribophorin I |
| RPN2 | NM_002951 | -1.95 | 0.000 | -2.00 | 0.000 | ribophorin II |
The first, second and last column indicate the gene symbol, the RefSeq ID and the description of each N-glycan biosynthesis gene respectively. The third and fourth columns indicate the logarithm fold change and the P-value of each gene found by using σas the summary statistic. The fifth and sixth columns indicate the logarithm fold change and the P-value of each gene found by using σas the summary statistic. The cutoff criteria p ≤ 0.01 and |FC| ≥ 2 were applied. The first 8 genes are found to be statistically significant by σonly, the rest was found with both statistics.
Tight junction genes in human spermatogenesis data.
| ACTG1 | NM_001614 | -0.81 | 0.064 | -1.14 | 0.000 | actin, gamma 1 |
| CLDN1 | NM_021101 | -2.47 | 0.059 | -1.63 | 0.000 | claudin 1 |
| CLDN16 | NM_006580 | -1.20 | 0.019 | -1.55 | 0.000 | claudin 16 |
| CLDN5 | NM_003277 | 1.15 | 0.286 | 1.42 | 0.000 | claudin 5 (transmembrane protein deleted in velocardiofacial syndrome) |
| CLDN6 | NM_021195 | 2.48 | 0.135 | 1.88 | 0.000 | claudin 6 |
| CSNK2A2 | NM_001896 | -1.00 | 0.014 | -1.57 | 0.000 | casein kinase 2, alpha prime polypeptide |
| CTNNA1 | NM_001903 | -0.93 | 0.009 | -1.04 | 0.000 | catenin (cadherin-associated protein), alpha 1, 102 kDa |
| EXOC3 | NM_007277 | -1.26 | 0.019 | -1.43 | 0.000 | exocyst complex component 3 |
| EXOC4 | NM_021807 | -0.78 | 0.012 | -1.04 | 0.000 | exocyst complex component 4 |
| GNAI2 | NM_002070 | 1.02 | 0.110 | 1.52 | 0.000 | guanine nucleotide binding protein (G protein), alpha inhibiting activity polypeptide 2 |
| JAM3 | NM_032801 | -0.86 | 0.005 | -1.07 | 0.000 | junctional adhesion molecule 3 |
| KRAS | NM_004985 | -2.85 | 0.032 | -2.43 | 0.000 | v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog |
| MYH9 | NM_002473 | 1.09 | 0.183 | 1.02 | 0.000 | myosin, heavy chain 9, non-muscle |
| PPP2R2B | NM_181676 | -1.22 | 0.020 | -1.97 | 0.000 | protein phosphatase 2 (formerly 2A), regulatory subunit B, beta isoform |
| PPP2R3A | NM_002718 | 1.42 | 0.013 | 1.50 | 0.000 | protein phosphatase 2 (formerly 2A), regulatory subunit B", alpha |
| RAB13 | NM_002870 | -1.17 | 0.118 | -1.68 | 0.000 | RAB13, member RAS oncogene family |
| TJP1 | NM_175610 | -2.63 | 0.038 | -2.10 | 0.000 | tight junction protein 1 (zona occludens 1) |
| AKT3 | NM_181690 | -1.16 | 0.001 | -1.16 | 0.000 | v-akt murine thymoma viral oncogene homolog 3 (protein kinase B, gamma) |
| CDC42 | NM_044472 | -1.67 | 0.001 | -1.51 | 0.000 | cell division cycle 42 (GTP binding protein, 25 kDa) |
| CLDN11 | NM_005602 | -2.03 | 0.000 | -2.06 | 0.000 | claudin 11 (oligodendrocyte transmembrane protein) |
| CLDN14 | NM_012130 | 2.92 | 0.006 | 2.90 | 0.000 | claudin 14 |
| CSDA | NM_003651 | -2.28 | 0.000 | -2.48 | 0.000 | cold shock domain protein A |
| CSNK2B | NM_001320 | -2.61 | 0.006 | -2.69 | 0.000 | casein kinase 2, beta polypeptide |
| CTNNA2 | NM_004389 | -1.64 | 0.002 | -1.88 | 0.000 | catenin (cadherin-associated protein), alpha 2 |
| CTTN | NM_138565 | -1.91 | 0.008 | -2.05 | 0.000 | cortactin |
| EPB41L3 | NM_012307 | -1.85 | 0.001 | -1.71 | 0.000 | erythrocyte membrane protein band 4.1-like 3 |
| MYH10 | NM_005964 | -1.34 | 0.000 | -1.35 | 0.000 | myosin, heavy chain 10, non-muscle |
| MYL6 | NM_079423 | -1.62 | 0.000 | -1.78 | 0.000 | myosin, light chain 6, alkali, smooth muscle and non-muscle |
| PPP2CA | NM_002715 | -2.69 | 0.002 | -2.40 | 0.000 | protein phosphatase 2 (formerly 2A), catalytic subunit, alpha isoform |
| PPP2CB | NM_004156 | -1.43 | 0.005 | -1.83 | 0.000 | protein phosphatase 2 (formerly 2A), catalytic subunit, beta isoform |
| PPP2R1B | NM_181699 | -2.14 | 0.003 | -2.34 | 0.000 | protein phosphatase 2 (formerly 2A), regulatory subunit A, beta isoform |
| PPP2R1B | NM_181699 | -1.47 | 0.011 | -1.18 | 0.000 | protein phosphatase 2 (formerly 2A), regulatory subunit A, beta isoform |
| PPP2R2A | NM_002717 | -1.53 | 0.002 | -1.86 | 0.000 | protein phosphatase 2 (formerly 2A), regulatory subunit B, alpha isoform |
| PPP2R2B | NM_181677 | 2.51 | 0.007 | 2.29 | 0.000 | protein phosphatase 2 (formerly 2A), regulatory subunit B, beta isoform |
| PRKCH | NM_006255 | -1.20 | 0.002 | -1.03 | 0.000 | protein kinase C, eta |
| PTEN | NM_000314 | -2.30 | 0.004 | -1.61 | 0.000 | phosphatase and tensin homolog (mutated in multiple advanced cancers 1) |
| RHOA | NM_001664 | -1.81 | 0.001 | -1.58 | 0.000 | ras homolog gene family, member A |
| CTNNA3 | NM_013266 | -1.03 | 0.007 | -0.92 | 0.000 | catenin (cadherin-associated protein), alpha 3 |
The first, second and last column indicate the gene symbol, the RefSeq ID and the description of each Tight junction gene respectively. The third and fourth columns indicate the logarithm fold change and the P-value of each gene found by using σas the summary statistic. The fifth and sixth columns indicate the logarithm fold change and the P-value of each gene found by using σas the summary statistic. A cutoff criteria of p ≤ 0.01 and |FC| ≥ 2 was applied. The first 17 genes are found to be statistically significant by σonly. The last row contain a gene excluded by σdue to |FC| ≤ 2 although p ≤ 0.01. The genes upper in the section were found only by the σstatistic, the remainder was found with both approaches except of the last gene entry recovered only by the σstatistic (it was filtered out due the fold change criterion).
Adherens junction genes in human spermatogenesis data.
| ACTG1 | NM_001614 | -0.81 | 0.064 | -1.14 | 0.000 | actin, gamma 1 |
| BAIAP2 | NM_006340 | 1.32 | 0.016 | 1.27 | 0.000 | BAI1-associated protein 2 |
| CREBBP | NM_004380 | -0.98 | 0.002 | -1.11 | 0.000 | CREB binding protein (Rubinstein-Taybi syndrome) |
| CSNK2A2 | NM_001896 | -1.00 | 0.014 | -1.57 | 0.000 | Casein kinase 2, alpha prime polypeptide |
| CTNNA1 | NM_001903 | -0.93 | 0.009 | -1.04 | 0.000 | catenin (cadherin-associated protein), alpha 1, 102 kDa |
| FER | NM_005246 | -1.37 | 0.015 | -1.73 | 0.000 | fer (fps/fes related) tyrosine kinase (phosphoprotein NCP94) |
| IQGAP1 | NM_003870 | -2.66 | 0.026 | -2.27 | 0.000 | IQ motif containing GTPase activating protein 1 |
| MAPK1 | NM_002745 | -1.56 | 0.014 | -1.34 | 0.000 | mitogen-activated protein kinase 1 |
| MAPK3 | NM_002746 | 1.75 | 0.042 | 1.88 | 0.000 | mitogen-activated protein kinase 3 |
| PTPRF | NM_130440 | -0.91 | 0.000 | -1.10 | 0.000 | protein tyrosine phosphatase, receptor type, F |
| SMAD2 | NM_005901 | -1.27 | 0.027 | -1.59 | 0.000 | SMAD family member 2 |
| TCF7 | NM_003202 | 3.13 | 0.045 | 2.90 | 0.000 | transcription factor 7 (T-cell specific, HMG-box) |
| TJP1 | NM_175610 | -2.63 | 0.038 | -2.10 | 0.000 | tight junction protein 1 (zona occludens 1) |
| WASL | NM_003941 | -1.99 | 0.047 | -1.42 | 0.000 | Wiskott-Aldrich syndrome-like |
| ACP1 | NM_004300 | -1.82 | 0.009 | -1.66 | 0.000 | acid phosphatase 1, soluble |
| ACP1 | NM_007099 | -1.77 | 0.000 | -1.70 | 0.000 | acid phosphatase 1, soluble |
| CDC42 | NM_044472 | -1.67 | 0.001 | -1.51 | 0.000 | cell division cycle 42 (GTP binding protein, 25 kDa) |
| CSNK2B | NM_001320 | -2.61 | 0.006 | -2.69 | 0.000 | casein kinase 2, beta polypeptide |
| CTNNA2 | NM_004389 | -1.64 | 0.002 | -1.88 | 0.000 | catenin (cadherin-associated protein), alpha 2 |
| MAP3K7 | NM_145333 | -1.52 | 0.000 | -1.39 | 0.000 | mitogen-activated protein kinase kinase kinase 7 |
| MAPK1 | NM_138957 | -1.19 | 0.005 | -1.23 | 0.000 | mitogen-activated protein kinase 1 |
| MAPK1 | NM_138957 | -1.68 | 0.002 | -2.03 | 0.000 | mitogen-activated protein kinase 1 |
| RHOA | NM_001664 | -1.81 | 0.001 | -1.58 | 0.000 | ras homolog gene family, member A |
| SMAD4 | NM_005359 | -1.02 | 0.000 | -1.07 | 0.000 | SMAD family member 4 |
| SORBS1 | NM_015385 | 1.86 | 0.001 | 1.99 | 0.000 | sorbin and SH3 domain containing 1 |
| WASF3 | NM_006646 | -1.62 | 0.002 | -1.80 | 0.000 | WAS protein family, member 3 |
| CTNNA3 | NM_013266 | -1.03 | 0.007 | -0.92 | 0.000 | catenin (cadherin-associated protein), alpha 3 |
The first, second and last column indicate the gene symbol, the RefSeq ID and the description of each Tight junction gene respectively. The third and fourth columns indicate the logarithm fold change and the P-value of each gene found by using σas the summary statistic. The fifth and sixth columns indicate the logarithm fold change and the P-value of each gene found by using σas the summary statistic. Cutoff criteria of p ≤ 0.01 and |FC| ≥ 2 were applied. The first 14 genes are found to be statistically significant by σonly. The lower part of the table lists genes found by both statistics except for the last row that contains a gene excluded by σdue to |FC| ≤ 2 (although p ≤ 0.01).
Figure 2Venn diagram of gene list overlap. The Venn diagram of the distribution of differentially expressed genes (based on the cutoff criteria p ≤ 0.01 and |FC| ≥ 2) between Affymetrix and Illumina spermatogenesis datasets is presented.