| Literature DB >> 21836821 |
Zhongyu Liu1, Zhiqiang Yao, Chao Li, Yicheng Lu, Chunfang Gao.
Abstract
Diffuse astrocytoma of (WHO grade II) has a tendency to progress spontaneously to anaplastic astrocytoma (WHO grade III) and/or glioblastoma (WHO grade IV). However, the molecular basis of astrocytoma progression is still poorly understood. In current study, an essential initial step toward this goal is the establishment of the taxonomy of tumors on the basis of their gene expression profiles. We have used gene expression profiling, unsupervised (hierarchal cluster (HCL) and principal component analysis (PCA)) and supervised (prediction analysis for microarrays (PAM)) learning methods, to demonstrate the presence of three distinct gene expression signatures of astrocytomas (ACMs), which correspond to diffuse or low-grade astrocytoma (WHO grade II), Anaplastic astrocytoma (WHO grade III) and Glioblastoma multiforme (WHO grade IV). We also demonstrate a 171 gene-based classifier that characterize the distinction between these pathologic/molecular subsets of astrocytomas. These results further define molecular subtypes of astrocytomas and may potentially be used to define potential targets and further refine stratification approaches for therapy. In addition, this study demonstrates that combining gene expression analysis with detailed annotated pathway and gene ontology (GO) category resources was applied to highly enriched normal and tumor population; it can yield an understanding of the critical biological mechanism of astrocytomas.Entities:
Year: 2011 PMID: 21836821 PMCID: PMC3151497 DOI: 10.1155/2011/245137
Source DB: PubMed Journal: Comp Funct Genomics ISSN: 1531-6912
Patients' characteristics.
| No | Array ID | GEO No | Title | Histological diagnosis | Age/Gender | WHO grade |
|---|---|---|---|---|---|---|
| 1 | 190-063A | GSM492652 | Astrocytomas_T2-1-Signal | Diffuse or low-grade astrocytoma | 72, M | II |
| 2 | 0286-29 | GSM492653 | Astrocytomas_T2-2-Signal | Diffuse or low-grade astrocytoma | 32, M | II |
| 3 | 0286-71 | GSM492654 | Astrocytomas_T2-3-Signal | Diffuse or low-grade astrocytoma | 73, M | II |
| 4 | 0286-98 | GSM492655 | Astrocytomas_T2-4-Signal | Diffuse or low-grade astrocytoma | 45, M | II |
| 5 | 0286-86 | GSM492656 | Astrocytomas_T2-5-Signal | Diffuse or low-grade astrocytoma | 65, M | II |
| 6 | 0286-94 | GSM492657 | Astrocytomas_T3-1-Signal | Anaplastic astrocytoma | 18, F | III |
| 7 | 0286-95 | GSM492658 | Astrocytomas_T3-2-Signal | Anaplastic astrocytoma | 64, F | III |
| 8 | 0190-068 | GSM492659 | Astrocytomas_T3-3-Signal | Anaplastic astrocytoma | 28, M | III |
| 9 | 0286-27 | GSM492660 | Astrocytomas_T3-4-Signal | Anaplastic astrocytoma | 45, M | III |
| 10 | 0286-88 | GSM492661 | Astrocytomas_T3-5-Signal | Anaplastic astrocytoma | 54, M | III |
| 11 | 0190-031B | GSM492662 | Astrocytomas_T4-1-Signal | Glioblastoma multiforme (GBM) | 68, M | IV |
| 12 | 0286-83 | GSM492663 | Astrocytomas_T4-2-Signal | Glioblastoma multiforme (GBM) | 62, M | IV |
| 13 | 0286-89 | GSM492664 | Astrocytomas_T4-3-Signal | Glioblastoma multiforme (GBM) | 80, M | IV |
| 14 | 0286-92 | GSM492665 | Astrocytomas_T4-4-Signal | Glioblastoma multiforme (GBM) | 78, F | IV |
| 15 | 0286-90 | GSM492666 | Astrocytomas_T4-5-Signal | Glioblastoma multiforme (GBM) | 69, M | IV |
| 16 | Normal Plus1 | GSM525014 | Normal | Normal brain tissues | Normal | |
| 17 | Normal Plus2 | GSM525015 | Normal | Normal brain tissues | Normal | |
| 18 | Normal Plus3 | GSM525016 | Normal | Normal brain tissues | Normal | |
| 19 | 0190-normalA | GSM492649 | Normal | Pooled normal tissues | Normal |
Figure 1The exhibition of results of HCL and PCA. Hierarchal clustering: the analysis was performed using the Euclidean distance and the average linkage algorithm; principal component analysis with scaling. (a) dendrogram plot: the astrocytomas tissue group and normal-tissue group were separated obviously in the top node; tumor node is composed of two branches, one is just WHO II grade, and another includes WHO III cluster and WHO IV cluster; (b) heatmap plot: each column represents the expression levels for all genes in a particular sample, whereas each row represents the relative expression of a particular gene across all samples. The expression level of any given gene in any given sample (relative to the mean expression level of that gene across all tissue samples) is reported along a color scale in which red represents transcriptional upregulation, green represents downregulation, and the color intensity indicates the magnitude of deviation from the mean; (c) the two main principal components 2D-plot; (d) the three main principal components 3D-plot. In Figures 1(c) and 1(d): different colors stand for different pathological stages, Green, blue, yellow, and red represent normal, II, III, and IV grade, respectively.
Actual and predicted classification.
| Sample labels | Normal plus2 | Normal plus3 | II-86 | II-98 | III-84 | III-95 | IV-90 | IV-92 |
|---|---|---|---|---|---|---|---|---|
| Predicted class labels | Normal | Normal | Stage-2 | Stage-2 | Stage-3 | Stage-3 | Stage-4 | Stage-4 |
| Normal | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| Stage-2 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 |
| Stage-3 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| Stage-4 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
Figure 2Estimated probabilities for the cross-validation (a) and test data (b). Samples are partitioned by the true class (a) and the predicted class (b). All 11 of the training samples and all 8 of the test samples known to be astrocytomas are correctly classified.
The marker genes of three different grades of ACM.
| Gene symbol | II-astro score | III-astro score | IV-astro score | Normal-score | II-mean (log base 2) | III-mean (log base 2) | IV-mean (log base 2) | Normal mean (log base 2) |
|---|---|---|---|---|---|---|---|---|
| EPHB1 | 15.4462 | 0 | 0 | 0 |
| 2.55495 | 2.65409 | 2.72811 |
| LOC732229 | 13.8658 | 0 | 0 | 0 |
| 2.59371 | 2.77408 | 2.26366 |
| C12orf39 | 8.5766 | 0 | 0 | 0 |
| 2.238085 | 2.58682 | 2.48695 |
| MARVELD3 | 7.8886 | 0 | 0 | 0 |
| 2.37729 | 2.86839 | 2.39552 |
| LOC283677 | 4.4758 | 0 | 0 | 0 |
| 2.826663 | 2.71392 | 2.8898 |
| C20orf42 | 3.5577 | 0 | 0 | 0 |
| 2.859789 | 2.4558 | 2.52664 |
| SPTLC3 | 3.2679 | 0 | 0 | 0 |
| 2.255256 | 2.30132 | 2.59188 |
| KCNQ1OT1 | 2.9128 | 0 | 0 | 0 |
| 2.338234 | 2.2464 | 2.26363 |
| FLJ21062 | 2.1167 | 0 | 0 | 0 |
| 2.238085 | 2.48495 | 2.26875 |
| ATP13A5 | 1.9707 | 0 | 0 | 0 |
| 2.238085 | 2.2464 | 2.43345 |
| PDXDC1 | 1.3847 | 0 | 0 | 0 |
| 2.25661 | 2.26636 | 2.26363 |
| CNNM2 | 0.8081 | 0 | 0 | 0 |
| 2.27347 | 2.6133 | 2.54342 |
|
| ||||||||
| ZBTB39 | 0 | 15.0318 | 0 | 0 | 4.173501 |
| 4.6856 | 4.68794 |
| C6orf89 | 0 | 11.2219 | 0 | 0 | 2.621672 |
| 2.91788 | 2.30613 |
| LOC116236 | 0 | 11.0126 | 0 | 0 | 2.254329 |
| 2.72612 | 2.44326 |
| FKHL18 | 0 | 10.2107 | 0 | 0 | 2.258635 |
| 2.2464 | 2.6121 |
| CTRL | 0 | 10.0666 | 0 | 0 | 2.526428 |
| 2.31851 | 2.58541 |
| TIRAP | 0 | 9.5523 | 0 | 0 | 2.324936 |
| 2.2464 | 2.75954 |
| NEUROG1 | 0 | 2.5236 | 0 | 0 | 2.258056 |
| 2.64751 | 2.27475 |
| TMEM44 | 0 | 0.4603 | 0 | 0 | 2.315798 |
| 2.45605 | 2.51802 |
| RPUSD1 | 0 |
| 0 | 0 | 4.034009 |
| 3.84208 | 4.14466 |
| DHRS7 | 0 |
| 0 | 0 | 3.425335 |
| 3.60534 | 4.03533 |
| AKAP7 | 0 |
| 0 | 0 | 4.723465 |
| 5.11056 | 3.7332 |
| PDE3B | 0 |
| 0 | 0 | 5.989233 |
| 6.10718 | 6.31594 |
|
| ||||||||
| C20orf160 | 0 | 0 | 14.9825 | 0 | 2.523126 |
| 4.41487 | 2.27477 |
| PKMYT1 | 0 | 0 | 11.89 | 0 | 2.644079 |
| 3.84824 | 2.26751 |
| TRPV2 | 0 | 0 | 10.5089 | 0 | 2.397686 |
| 4.5048 | 3.31894 |
| LEFTY2 | 0 | 0 | 5.3507 | 0 | 2.231583 |
| 5.56688 | 2.28191 |
| C20orf77 | 0 | 0 | 5.1848 | 0 | 2.333998 |
| 3.48131 | 2.3379 |
| MOBKL2C | 0 | 0 | 3.8936 | 0 | 2.517987 |
| 3.86723 | 2.45653 |
| STAC | 0 | 0 | 2.9965 | 0 | 2.275399 |
| 6.32448 | 2.78522 |
| GGA3 | 0 | 0 | 2.8351 | 0 | 2.295872 |
| 3.46887 | 2.26651 |
| VAV1 | 0 | 0 | 2.1584 | 0 | 2.396392 |
| 4.53627 | 2.34353 |
| TXNRD2 | 0 | 0 | 1.3886 | 0 | 2.582906 |
| 3.38602 | 2.41848 |
| SP100 | 0 | 0 | 0.9134 | 0 | 2.385938 |
| 3.58699 | 2.37374 |
| LCP2 | 0 | 0 | 0.7656 | 0 | 2.268675 |
| 3.62601 | 2.88405 |
| RHEB | 0 | 0 | 0.6002 | 0 | 2.254732 |
| 3.16583 | 2.31595 |
The first column lists the marker genes for different ACM WHO grades; the scores from 2th to 5th columns suggest the estimates of the class probabilities by PAM [4], which are similar to that used in linear discriminant analysis (LDA); the means from 6th to 9th columns represent the average values of log (base 2) gene expression. Bold indicates a marker gene of one WHO grade ACM relative to other WHO grades which are overexpressional; italic indicates those that are underexpressional.
Figure 3MAPK signaling pathway. Red stars marks indicate dysregulated genes.