| Literature DB >> 17663766 |
Lei Xu1, Donald Geman, Raimond L Winslow.
Abstract
BACKGROUND: There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17663766 PMCID: PMC1950528 DOI: 10.1186/1471-2105-8-275
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Common genes overrepresented in the meta-signature. The figure shows the relationship between the numbers of genes on two microarray platforms, HuGeneFL and HG-U95A, and the corresponding numbers of genes in the meta-signature of neoplastic transformation [17]. There are 5127 genes common to both platforms, 238 only on HuGeneFL and 3592 only on HG-U95A. The numbers without parentheses are the corresponding numbers of genes in the meta-signature.
Microarray data from Affymetrix HuGeneFL arrays
| Study | Class1 | Size | Class2 | Size |
| Beer_Lung [25] | Normal Lung | 10 | Lung Adenoarcinoma | 86 |
| Dyrskjot_Bladder [26] | Normal Bladder | 4 | Bladder Cancer | 40 |
| Hippo_Gastric [27] | Normal Gastric Tissues | 8 | Gastric Cancer | 22 |
| Hsiao_Normal [28] | Normal Tisues | 59 | ||
| Lancaster_Ovarian [29] | Normal Ovary | 3 | Ovarian Adenocarcinoma | 31 |
| Logsdon_Pancreas [30] | Normal Pancreas | 5 | Pancreatic Adenocarcinoma | 10 |
| Pomeroy_Brain [31] | Normal Cerebellum | 4 | Atypical Teratoid/Rhabdoid Tumors | 10 |
| Primitive neuroectodermal Tumors | 8 | |||
| Malignant Gliomas | 10 | |||
| Medulloblastoma | 10 | |||
| Quade_Myometrium [32] | Normal Myometrium | 4 | Leiomyosarcoma | 14 |
| Ramaswamy_Multi [33] | Normal Prostate | 9 | Prostate Cancer | 10 |
| Normal Uterus | 6 | Uterine Cancer | 10 | |
| Normal Whole brain | 8 | Glioblastoma/Medulloblastoma | 20 | |
| Normal Breast | 5 | Breast Adenocarcinoma | 11 | |
| Normal Lung | 7 | Lung Adenoarcinoma | 11 | |
| Nromal Colon | 11 | Colorectal | 11 | |
| Normal Germinal Center | 6 | Lymphoma | 22 | |
| Normal Bladder | 7 | Bladder Cancer | 11 | |
| Melanoma | 10 | |||
| Peripheral Blood | 5 | Leukemia | 30 | |
| Normal Kidney | 12 | Renal Cell Carcinoma | 11 | |
| Normal Pancreas | 10 | Pancreatic Adenocarcinoma | 11 | |
| Normal Ovary | 4 | Ovarian Carcinoma | 11 | |
| Mesothelioma | 11 | |||
| Rickman_Brain [34] | Normal Temporal Lobe | 6 | Glioma | 45 |
| Welsh_Ovarian [35] | Normal Ovary | 4 | Ovarian Carcinoma | 22 |
| Zhan_Myeloma [36] | Normal Plasma Cell- Bone Marrow | 30 | Multiple Myeloma | 74 |
| Total | Normal Tissues | 227 | Cancer Tissues | 572 |
Microarray data from Affymetrix HG-U95A arrays
| Study | Class1 | Size | Class2 | Size |
| Bhattacharjee_Lung [37] | Normal Lung | 17 | Small Cell Lung Carcinoma | 6 |
| Lung Carcinoid | 20 | |||
| Squamous Cell Lung Carcinoma | 21 | |||
| Cromer_Head-Neck [38] | Normal Uvula | 4 | Head-Neck Squamous Cell Carcinoma | 34 |
| Dehan_Lung [39] | Normal Lung | 9 | Lung Adenocarcinoma | 7 |
| Lung Squamous Cell Carcinoma | 17 | |||
| Lung Adenosquamous | 1 | |||
| Frierson_Salivary [40] | Normal Salivary Gland | 6 | Salivary Carcinoma | 16 |
| Giordano_Adrenal [41] | Normal Adrenal Cortex | 3 | Adrenocortical Carcinoma | 11 |
| Gutmann_Brain [42] | Normal White Matter | 3 | Pilocytic Astrocytoma | 8 |
| Huang_Thyroid [43] | Normal Thyroid | 8 | Thyroid Carcinoma | 8 |
| Shai_Brain [44] | White Matter | 7 | Glioblastoma Multiforme | 35 |
| Stearman_Lung [45] | Normal Lung | 19 | Lung Tumor | 20 |
| Su_Multi [46] | Normal Tissues | 63 | Tumor Tissues | 18 |
| Su_Tumors [5] | Prostate Cancer | 24 | ||
| Bladder/Ureter | 8 | |||
| Breast | 21 | |||
| Colorectal | 21 | |||
| Gastroesophagus | 11 | |||
| Kidney | 10 | |||
| Liver | 6 | |||
| Ovary | 23 | |||
| Pancreas | 6 | |||
| Lung Adenocarcinoma | 12 | |||
| Lung Squamous Cell Carcinoma | 12 | |||
| Welle_Normal [47] | Normal Muscle | 12 | ||
| Yanai_Normal [48] | Normal Tissues | 24 | ||
| Yu_Prostate [49] | Normal Prostae | 16 | Primary Prosate Carcinoma | 35 |
| Total | Normal Tissues | 191 | Cancer Tissues | 411 |
Microarray data from Affymetrix HG-U133A arrays
| Study | Class1 | Size | Class2 | Size |
| Gordon_Lung [50] | Normal Lung | 4 | Malignant Pleural Mesothelioma | 40 |
| Normal Pleura | 5 | |||
| Hoffman_Myometrium [51] | Normal Myometrium | 5 | Uterine Leiomyomas | 5 |
| Lenburg_Kidney [52] | Normal Kidney Tissue | 5 | Renal Cell Carcinoma | 12 |
| Talantov_Skin [53] | Normal Skin | 7 | Melanoma | 45 |
| Wachi_Lung [54] | Normal Lung | 5 | Squamous Lung Cancer | 5 |
| Yoon_Soft_Tissue [55] | Normal Soft Tissue | 15 | Soft Tissue Sarcoma | 39 |
| Total | Normal Tissues | 46 | Cancer Tissues | 146 |
Common cancer signature genes
| Microarray Platform | ||||
| Gene Symbol | Probe Set ID | Gene Symbol | Probe Set ID | |
| HuGeneFL | BOP1 | D50914_at | COX7A1 | M83186_at |
| PON2 | L48513_at | CXCL12 | U19495_s_at | |
| NME1* | X17620_at | ALDH1A1 | M31994_at | |
| CKS2* | X54942_at | SELP | M25322_at | |
| CCT3 | X74801_at | CD36 | Z32765_at | |
| KIAA0101* | D14657_at | CSRP1 | M76378_at | |
| FOXM1 | U74612_at | C9orf61 | L27479_at | |
| MAP3K11 | L32976_at | MYH11 | AF001548_rna1_at | |
| RAB13 | X75593_at | LTC4S | U50136_rna1_at | |
| ARPC1B | AF006084_at | DEFA4 | X65977_at | |
| HMGA1 | L17131_rna1_at | CLEC3B | X64559_at | |
| TYMS | D00596_at | |||
| DNMT1 | X63692_at | |||
| HG-U95A | SOX4* | 33131_at | TEK | 1596_g_at |
| C7orf24 | 41696_at | FXYD1 | 32109_at | |
| POSTN | 1451_s_at | ABCA8 | 35717_at | |
| BAZ1B | 32261_at | CLEC3B | 36569_at | |
| KIAA0101* | 38116_at | CBX7 | 36894_at | |
| RECQL | 34684_at | TNXA///TNXB | 38508_s_at | |
| FAT | 40454_at | SH3BP5 | 38968_at | |
| SIPA1L3 | 37831_at | CA4 | 40739_at | |
| MARCKSL1 | 36174_at | FBXO9 | 38990_at | |
| CKAP4 | 32529_at | COX7A1 | 39031_at | |
| KIF14 | 34563_at | GABARAPL1 | 35785_at | |
| SUB1 | 36171_at | ADH1B | 35730_at | |
| PTGDS | 216_at | |||
* These genes were also identified as common cancer signature genes in Rhode et al. [17].
Figure 2Common cancer signature which can discriminate cancer from normal samples. Some of the training data (Stearman_Lung, Frierson_Salivary, Giordano_Adrenal and Gutmann_Brain) is used to illustrate the gene expression values of the signature genes in the figure. The heatmap is generated by the matrix2png software [24]. For each data set, the expression value for each gene is normalized across the samples to zero mean and one standard deviation (SD) for visualization purposes. Genes with expression levels greater than the mean are colored in red and those below the mean are colored in green. The scale indicates the number of SDs above or below the mean.
Class prediction of the common signature on training data
| Microarray Platform | Study | Number of Normal Samples | Number of Cancer Samples | Accuracy (%) | |
| HuGeneFL | Beer_Lung | 10 | 86 | 95.8 | 8.87E-11 |
| Dyrskjot_Bladder | 4 | 40 | 95.5 | 1.18E-03 | |
| Hippo_Gastric | 8 | 22 | 76.7 | 2.67E-01 | |
| Hsiao_Normal | 59 | 0 | 91.5 | N/A* | |
| Lancaster_Ovarian | 3 | 31 | 91.2 | 1.00 | |
| Logsdon_Pancreas | 5 | 10 | 100 | 3.33E-04 | |
| Pomeroy_Brain | 4 | 38 | 97.6 | 3.48E-04 | |
| Quade_Myometrium | 4 | 14 | 77.8 | 2.29E-02 | |
| Ramaswamy_Multi | 90 | 190 | 77.9 | 1.28E-20 | |
| Rickman_Brain | 6 | 45 | 94.1 | 2.87E-04 | |
| Welsh_Ovarian | 4 | 22 | 100 | 6.69E-05 | |
| Zhan_Myeloma | 30 | 74 | 72.1 | 2.88E-01 | |
| HG-U95A | Bhattacharjee_Lung | 17 | 47 | 90.6 | 6.34E-10 |
| Cromer_Head-Neck | 4 | 34 | 97.4 | 4.74E-04 | |
| Dehan_Lung | 9 | 25 | 85.3 | 3.82E-05 | |
| Frierson_Salivary | 6 | 16 | 95.5 | 9.38E-05 | |
| Giordano_Adrenal | 3 | 11 | 92.9 | 1.10E-02 | |
| Gutmann_Brain | 3 | 8 | 100 | 6.06E-03 | |
| Huang_Thyroid | 8 | 8 | 75 | 5.59E-02 | |
| Shai_Brain | 7 | 35 | 85.7 | 6.36E-05 | |
| Stearman_Lung | 19 | 20 | 89.7 | 1.28E-07 | |
| Su_Multi | 63 | 18 | 81.5 | 1.15E-05 | |
| Su_Tumors | 0 | 154 | 93.5 | N/A | |
| Welle_Normal | 12 | 0 | 100 | N/A | |
| Yanai_Normal | 24 | 0 | 91.7 | N/A | |
| Yu_Prostate | 16 | 35 | 54.9 | 1.83E-02 | |
* For the data sets with samples from only one class, no p-value is available.
Validation of the common signature on independent HG-U133A data
| Study | Number of Normal Samples | Number of Cancer Samples | Accuracy (%) | |
| Gordon_Lung | 9 | 40 | 95.9 | 1.75E-07 |
| Hoffman_Myometrium | 5 | 5 | 80.0 | 8.33E-02 |
| Lenburg_Kidney | 5 | 12 | 76.5 | 1.07E-01 |
| Talantov_Skin | 7 | 45 | 98.1 | 3.44E-07 |
| Wachi_Lung | 5 | 5 | 100 | 3.97E-03 |
| Yoon_Soft_Tissue | 15 | 39 | 96.3 | 6.76E-11 |
Comparison with the Rhodes signature on the same independent data
| Study | Rhodes Signature | Our Signature | ||
| Accuracy (%) | Accuracy (%) | |||
| Gordon_Lung | 91.8 | 3.48E-07 | 95.9 | 1.75E-07 |
| Hoffman_Myometrium | 80.0 | 2.06E-01 | 80.0 | 8.33E-02 |
| Lenburg_Kidney | 76.5 | 1.01E-01 | 76.5 | 1.07E-01 |
| Talantov_Skin | 94.2 | 8.97E-07 | 98.1 | 3.44E-07 |
| Wachi_Lung | 100 | 3.97E-03 | 100 | 3.97E-03 |
| Yoon_Soft_Tissue | 85.2 | 5.67E-8 | 96.3 | 6.76E-11 |