| Literature DB >> 20209125 |
Brady G Miller1, John A Stamatoyannopoulos.
Abstract
BACKGROUND: Acute myeloid leukemia (AML) is a heterogeneous disease with an overall poor prognosis. Gene expression profiling studies of patients with AML has provided key insights into disease pathogenesis while exposing potential diagnostic and prognostic markers and therapeutic targets. A systematic comparison of the large body of gene expression profiling studies in AML has the potential to test the extensibility of conclusions based on single studies and provide further insights into AML. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2010 PMID: 20209125 PMCID: PMC2830886 DOI: 10.1371/journal.pone.0009466
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Tag-based classification method flowchart.
Acute Myelogenous Leukemia expression profiling studies included in analysis.
| Reference | Platform | Disease | No. of samples | No. of differentially expressed features | No. of differentially expressed mapped features |
|
| Affymetrix HU6000 | AML/ALL | 72 | 100 | 78 |
|
| Custom cDNA 23,040 clones | AML | 76 | 491 | 355 |
|
| Affymetrix U95Av2 | AML | 37 | 150 | 140 |
|
| Affymetrix U95Av2 | AML | 28 | 77 | 75 |
|
| Affymetrix U95Av2 Affymetrix U133A | AML/ALL | 90 | 156 | 147 |
|
| Affymetrix U95Av2 | AML | 54 | 1,910 | 1,753 |
|
| Custom cDNA 39,711 clones | AML | 116 | 1,040 | 855 |
|
| Custom cDNA 42,749 clones | AML | 100 | 436 | 329 |
|
| Custom cDNA 4211 clones | AML | 29 | 170 | 162 |
|
| Affymetrix U133A | AML | 150 | 713 | 682 |
|
| Affymetrix U133A | AML | 293 | 779 | 745 |
|
| DiscoveryChip cDNA 9,039 clones | AML | 55 | 197 | 119 |
|
| Affymetrix U133A | AML | 78 | 554 | 541 |
|
| Affymetrix U133A | AML | 43 | 181 | 167 |
|
| Custom cDNA 41,424 clones | AML | 137 | 178 | 146 |
|
| Affymetrix U133A | AML | 35 | 20 | 19 |
|
| Custom cDNA 4,211 clones | AML | 110 | 250 | 250 |
|
| Affymetrix U133A | AML | 275 | 568 | 555 |
|
| Affymetrix U133plus2.0 | AML | 64 | 314 | 304 |
|
| Affymetrix U95Av2 | AML | 170 | 705 | 674 |
|
| Affymetrix U133A | AML | 5 | 822 | 724 |
|
| Custom cDNA 39,711 clones | AML | 93 | 4,120 | 3,828 |
|
| Affymetrix U133A | AML | 11 | 82 | 82 |
|
| Affymetrix U133A | AML | 93 | 1,188 | 1,095 |
|
| Affymetrix U133plus2.0 | AML | 530 | 608 | 560 |
|
| 10 platforms | 2,744 | 15,809 | 14,385 |
*Included further analysis of data by Verhaak 200522 and Wilson 200626.
Genes most frequently published in AML expression studies.
| Rank | Gene symbol | No. of references | No. of platforms | No. of differentially expressed features | Gene name |
|
|
| 12 | 6 | 32 | homeobox B2 |
|
|
| 12 | 5 | 31 | pre-B-cell leukemia homeobox 3 |
|
|
| 11 | 4 | 35 | homeobox A9 |
|
|
| 11 | 3 | 29 | POU class 4 homeobox 1 |
|
|
| 10 | 5 | 16 | tetraspanin 7 |
|
|
| 10 | 3 | 38 | myosin, heavy chain 11, smooth muscle |
|
|
| 10 | 3 | 34 | runt-related transcription factor 1; translocated to, 1 (cyclin D-related) |
|
|
| 10 | 3 | 21 | thyrotropin-releasing hormone |
|
|
| 10 | 3 | 20 | major histocompatibility complex, class II, DP alpha 1 |
|
|
| 9 | 5 | 32 | homeobox B5 |
|
|
| 9 | 5 | 16 | secreted protein, acidic, cysteine-rich (osteonectin) |
|
|
| 9 | 4 | 34 | homeobox A10 |
|
|
| 9 | 4 | 18 | ribonuclease, RNase A family, 2 (liver, eosinophil-derived neurotoxin) |
|
|
| 9 | 3 | 16 | CD34 molecule |
|
|
| 9 | 3 | 16 | Meis homeobox 1 |
|
|
| 8 | 5 | 23 | runt-related transcription factor 3 |
|
|
| 8 | 5 | 22 | versican proteoglycan |
|
|
| 8 | 4 | 21 | RNA binding protein with multiple splicing |
|
|
| 8 | 4 | 18 | homeobox A4 |
|
|
| 8 | 4 | 16 | meningioma (disrupted in balanced translocation) 1 |
|
|
| 8 | 4 | 11 | preferentially expressed antigen in melanoma |
|
|
| 8 | 3 | 20 | jagged 1 (Alagille syndrome) |
|
|
| 8 | 3 | 18 | integral membrane protein 2A |
|
|
| 8 | 3 | 17 | regulator of G-protein signaling 10 |
|
|
| 8 | 2 | 12 | prostaglandin D2 synthase, hematopoietic |
The genes reported in at least eight independent studies are presented here. In order of preference, the genes are ranked by the number of independent studies, the number of unique platforms, and the total number of differentially expressed features.
*Gene symbol is not approved by HUGO Gene Nomenclature Committee.
Number of genes and independent publications with selected prognostic categories.
| Tag (total genes) (total references) | No. genes in 1 study | No. genes in 2 studies | No. genes in 3 studies | No. genes in 4 studies | No. genes in 5 studies | No. genes in 6 studies | No. genes in 7 studies | No. genes in 8 studies | |
|
| all genes | 1559 | 138 | 27 | 3 | ||||
| up-regulated | 586 | 28 | 2 | ||||||
| down-regulated | 943 | 55 | 10 | 1 | |||||
|
| all genes | 1484 | 134 | 18 | 1 | 1 | |||
| up-regulated | 925 | 62 | 9 | 1 | |||||
| down-regulated | 528 | 24 | 2 | ||||||
|
| all genes | 978 | 147 | 32 | 11 | 1 | |||
| up-regulated | 541 | 42 | 18 | 7 | 1 | ||||
| down-regulated | 436 | 96 | 13 | 2 | |||||
|
| all genes | 188 | 25 | 9 | 7 | 1 | |||
| up-regulated | 115 | 17 | 5 | 6 | |||||
| down-regulated | 56 | 8 | 4 | 1 | 1 | ||||
|
| all genes | 1197 | 88 | 23 | 7 | 6 | 1 | ||
| up-regulated | 533 | 44 | 12 | 7 | 2 | 1 | |||
| down-regulated | 285 | 15 | 4 | 1 | |||||
|
| all genes | 1057 | 92 | 23 | 15 | 4 | 1 | 2 | 1 |
| up-regulated | 253 | 32 | 13 | 4 | 1 | 2 | 1 | ||
| down-regulated | 552 | 27 | 5 | 2 | |||||
|
| all genes | 459 | 19 | 4 | |||||
| up-regulated | 65 | 1 | |||||||
| down-regulated | 45 | 3 | |||||||
|
| all genes | 224 | 10 | 1 | |||||
| up-regulated | 135 | 2 | |||||||
| down-regulated | 67 | ||||||||
|
| all genes | 480 | 36 | 3 | |||||
| up-regulated | 173 | 1 | |||||||
| down-regulated | 206 | 6 |
The AML prognosis and subtype identification tags reported in greater than 3 independent studies are shown with the number of genes listed by number of independent studies and differential expression direction. Identification tag descriptions can be found in Table S1. Note that the following tags are abbreviated: poor prog is poor prognosis, good prog is good prognosis, NPM1 mut is NPM1 mutation, and normal cyto is normal cytogenetics.
Figure 2Hierarchical cluster analyses.
Strict up-regulation is green and strict down-regulation is red, while light blue represents no reported specific direction. Identification tag descriptions can be found in Table S1. (A) Hierarchical cluster analysis of the 3998 differentially expressed genes (x-axis) of AML prognostic categories (y-axis). For illustration purposes, we notated and manually separated 5 major clusters. Cluster 1 includes aneuploid, abnormal cytogenetics, CD34+CD38+ AML fraction, high centrosome aberrations and poor prognosis. Cluster 2 includes FAB-M4, FAB-M5, inv(16) and monocytic. Cluster 3 includes a large group of heterogenous identification tags. Cluster 4 identifies FLT3-TKD, euploid, FAB-M7, CEBPA silenced, and NRAS-PM. Cluster 5 includes FLT3 mutation, FLT3-ITD, normal cytogenetics and NPM1 mutation. (B) Hierarchical cluster analysis of the 541 differential GO categories (x-axis) of AML prognostic categories (y-axis). For illustration purposes, we notated and manually separated 6 major clusters. Cluster 1 includes NPM1 mutation, good prognosis and normal cytogenetics. Cluster 2 includes NRAS-PM and MLL fusion gene. Cluster 3 includes inv(16), high centrosome aberrations, abnormal cytogenetics, 11q23, aneuploid, CEBPA silenced, FAB-M7, and poor prognosis. Cluster 4 includes FLT3 mutation, FLT3-ITD and t(11;19). Cluster 5 includes CD34+CD38+ AML fraction, CBF, FAB-M4, FAB-M5, monocytic, and normal patient controls. Cluster 6 includes a large group of heterogenous identification tags.
Top ranked up-regulated genes associated with poor prognosis.
| Rank | Gene symbol | No. of specific references | Total no. of references | Total no. of platforms | Total no. of differentially expressed features | Gene name |
|
|
| 3 | 5 | 4 | 19 | B-cell CLL/lymphoma 11A (zinc finger protein) |
|
|
| 3 | 5 | 4 | 11 | thromboxane A synthase 1 (platelet, cytochrome P450, family 5, subfamily A) |
|
|
| 2 | 9 | 5 | 32 | homeobox B5 |
|
|
| 2 | 9 | 4 | 34 | homeobox A10 |
|
|
| 2 | 9 | 3 | 16 | CD34 molecule |
|
|
| 2 | 8 | 4 | 21 | RNA binding protein with multiple splicing |
|
|
| 2 | 8 | 4 | 18 | homeobox A4 |
|
|
| 2 | 8 | 4 | 16 | meningioma (disrupted in balanced translocation) 1 |
|
|
| 2 | 6 | 3 | 12 | guanine nucleotide binding protein (G protein), alpha inhibiting, activity polypeptide 1 |
|
|
| 2 | 5 | 4 | 21 | src kinase associated phosphoprotein 2 |
|
|
| 2 | 5 | 4 | 9 | minichromosome maintenance complex component 3 |
|
|
| 2 | 5 | 3 | 8 | CAP-GLY domain containing linker protein 2 |
|
|
| 2 | 5 | 3 | 8 | death-associated protein kinase 1 |
|
|
| 2 | 4 | 4 | 8 | guanylate cyclase 1, soluble, alpha 3 |
|
|
| 2 | 4 | 3 | 11 | angiopoietin 1 |
|
|
| 2 | 4 | 3 | 6 | methylenetetrahydrofolate dehydrogenase (NADP+ dependent) 1, methenyltetrahydrofolate cyclohydrolase, formyltetrahydrofolate synthetase |
|
|
| 2 | 3 | 3 | 14 | microtubule-associated protein 7 |
|
|
| 2 | 3 | 3 | 11 | UDP-glucose ceramide glucosyltransferase-like 2 |
|
|
| 2 | 3 | 3 | 6 | SH2B adaptor protein 3 |
|
|
| 2 | 3 | 3 | 5 | fms-related tyrosine kinase 3 |
In order of preference, the genes are ranked by the number of poor prognosis related independent studies, the total number of independent studies, the total number of unique platforms, and the total number of features. Genes that were also associated with good prognosis with the same expression direction are not shown.
Top ranked down-regulated genes associated with poor prognosis.
| Rank | Gene symbol | No. of specific references | Total no. of references | Total no. of platforms | Total no. of differentially expressed features | Gene name |
|
|
| 4 | 4 | 3 | 22 | echinoderm microtubule associated protein like 4 |
|
|
| 3 | 6 | 2 | 9 | complement component 3a receptor 1 |
|
|
| 3 | 5 | 4 | 26 | phosphatidylinositol 3-kinase-related protein kinase |
|
|
| 3 | 5 | 4 | 15 | forkhead box O1 |
|
|
| 3 | 4 | 3 | 18 | interleukin 6 signal transducer (gp130, oncostatin M receptor) |
|
|
| 3 | 4 | 3 | 12 | UDP-glucose ceramide glucosyltransferase |
|
|
| 3 | 4 | 2 | 12 | adipose differentiation-related protein |
|
|
| 3 | 4 | 2 | 8 | azurocidin 1 (cationic antimicrobial protein 37) |
|
|
| 3 | 3 | 2 | 14 | sorting nexin 9 |
|
|
| 3 | 3 | 2 | 8 | phosphoinositide-3-kinase, regulatory subunit 4, p150 |
|
|
| 3 | 3 | 2 | 6 | sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3F |
|
|
| 2 | 8 | 3 | 20 | jagged 1 (Alagille syndrome) |
|
|
| 2 | 7 | 4 | 21 | CD3d molecule, delta (CD3-TCR complex) |
|
|
| 2 | 6 | 5 | 14 | solute carrier family 7 (cationic amino acid transporter, y+ system), member 7 |
|
|
| 2 | 6 | 4 | 13 | endonuclease domain containing 1 |
|
|
| 2 | 6 | 4 | 11 | glycophorin C (Gerbich blood group) |
|
|
| 2 | 6 | 3 | 17 | interferon stimulated exonuclease gene 20 kDa |
|
|
| 2 | 5 | 4 | 13 | Ezrin |
|
|
| 2 | 5 | 4 | 12 | Agrin |
|
|
| 2 | 5 | 3 | 9 | neuropilin 1 |
In order of preference, the genes are ranked by the number of poor prognosis related independent studies, the total number of independent studies, the total number of unique platforms, and the total number of features. Genes that were also associated with good prognosis with the same expression direction are not shown.
*Gene symbol is not approved by HUGO Gene Nomenclature Committee.
Figure 3Functional category comparisons.
(A) Significantly over-represented functional gene ontology (GO) categories of interest in up-regulated and down-regulated genes found in poor prognosis and good prognosis are compared; the comprehensive functional gene ontology listings can be found in Table S5, Table S6, Table S7, and Table S8. (B) Significantly over-represented functional gene ontology (GO) categories of interest in up-regulated and down-regulated genes found in AML with NPM1 mutation, t(15;17), t(8;21) and inv(16) are compared; the comprehensive functional gene ontology listings can be found in Table S13, Table S14, Table S15, Table S16, Table S17, Table S18, Table S19, Table S20. Corrected p-value is the Bonferroni multiple hypothesis.