Literature DB >> 25476127

Integrated differential transcriptome maps of Acute Megakaryoblastic Leukemia (AMKL) in children with or without Down Syndrome (DS).

Maria Chiara Pelleri1, Allison Piovesan2, Maria Caracausi3, Anna Concetta Berardi4, Lorenza Vitale5, Pierluigi Strippoli6,7.   

Abstract

BACKGROUND: The incidence of Acute Megakaryoblastic Leukemia (AMKL) is 500-fold higher in children with Down Syndrome (DS) compared with non-DS children, but the relevance of trisomy 21 as a specific background of AMKL in DS is still an open issue. Several Authors have determined gene expression profiles by microarray analysis in DS and/or non-DS AMKL. Due to the rarity of AMKL, these studies were typically limited to a small group of samples.
METHODS: We generated integrated quantitative transcriptome maps by systematic meta-analysis from any available gene expression profile dataset related to AMKL in pediatric age. This task has been accomplished using a tool recently described by us for the generation and the analysis of quantitative transcriptome maps, TRAM (Transcriptome Mapper), which allows effective integration of data obtained from different experimenters, experimental platforms and data sources. This allowed us to explore gene expression changes involved in transition from normal megakaryocytes (MK, n=19) to DS (n=43) or non-DS (n=45) AMKL blasts, including the analysis of Transient Myeloproliferative Disorder (TMD, n=20), a pre-leukemia condition.
RESULTS: We propose a biological model of the transcriptome depicting progressive changes from MK to TMD and then to DS AMKL. The data indicate the repression of genes involved in MK differentiation, in particular the cluster on chromosome 4 including PF4 (platelet factor 4) and PPBP (pro-platelet basic protein); the gene for the mitogen-activated protein kinase MAP3K10 and the thrombopoietin receptor gene MPL. Moreover, comparing both DS and non-DS AMKL with MK, we identified three potential clinical markers of progression to AMKL: TMEM241 (transmembrane protein 241) was the most over-expressed single gene, while APOC2 (apolipoprotein C-II) and ZNF587B (zinc finger protein 587B) appear to be the most discriminant markers of progression, specifically to DS AMKL. Finally, the chromosome 21 (chr21) genes resulted to be the most over-expressed in DS and non-DS AMKL, as well as in TMD, pointing out a key role of chr21 genes in differentiating AMKL from MK.
CONCLUSIONS: Our study presents an integrated original model of the DS AMLK transcriptome, providing the identification of genes relevant for its pathophysiology which can potentially be new clinical markers.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25476127      PMCID: PMC4304173          DOI: 10.1186/s12920-014-0063-z

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


Background

Trisomy for human chromosome 21 (chr21) is the most frequent live-born aneuploidy and is the cause of Down syndrome (DS), whose main symptoms include intellectual disability, cardiovascular defects and craniofacial dysmorphisms [1]. The DS phenotype is thought to be associated with an altered expression of the genes located on chr21 [2-7]. Basic research on DS is now rapidly accelerating, and there is the possibility that the results will be beneficial for individuals with DS [8]. Several studies have shown that individuals with DS have a specific cancer risk pattern, or tumor profile: their risk of developing leukemia and testicular cancer is much higher than age-matched controls, while women with DS almost never develop breast cancer [9,10]. In particular, children with DS show an increased prevalence of acute leukemia, both lymphoid (ALL) and myeloid (AML), with relative risk ranging from 10 to 20 times higher than the normal population [11,12]. In nearly half of the cases, these childhood leukemias are classified as megakaryoblastic leukemia (AMKL), a relatively rare subtype of AML also known as AML M7, according to French–American–British (FAB) classification, whose incidence increases by 500-fold in children with DS by the age of 4 years as compared to the chromosomally normal population (reviewed in [13]). This observation strongly suggests that trisomy 21 directly contributes to the neoplastic transformation of hematopoietic cells, in particular in the megakaryocyte lineage cells. Interestingly, acute leukemia cells harboring megakaryocyte markers and presenting in subjects without DS may show trisomy 21 [14]. We also described a cell line derived from blast cells of a patient with type M2 AML which has trisomy 21 and megakaryocyte features [15]. More recently, mutations of the gene encoding for the transcription factor GATA1 have been shown to cooperate with trisomy 21 in initiating megakaryoblastic proliferation in nearly all DS AMKL cases while they are absent in non-DS AMKL [13,16]. GATA1 mutations in DS cells give rise to a short, truncated form of GATA1 (GATA1s) transcription factor that, in this form, is not able to establish normal interactions with other gene regulators [17]. Transient myeloproliferative disorder (TMD) is a clonal pre-leukemia condition, occurring in 10% of children with DS during the neonatal period, presenting at a median age of 3-7 days with accumulation of immature megakaryoblasts [13]. TMD cases usually resolve spontaneosuly, but DS AMKL may develop within 1-4 years in 20-30% of these children. AMKL may develop in non-DS children, usually at an higher age in comparison to DS subjects (median 8 vs. 1.8 years, respectively) and in absence of a trisomy 21 background. Cytogenetic abormalities described in non-DS AMKL cells include trisomy 8 and 1 and monosomy 7 [13]. An open issue is the relevance of trisomy 21 as a specific background for the higher incidence of AMKL in DS. A few previous studies have used gene expression profiling by microarray analysis in order to identify specific transcriptome alterations in DS and/or non-DS AMKL, as well as in TMD [17-24]. Due to the rarity of AMKL, these works often analyze a small number of cases, using a variety of experimental platforms. Results were consequently affected by a small grade of comparability. One of the first goals of this work was to perform a systematic meta-analysis using any available gene expression profile dataset related to AMKL in pediatric age in order to produce a differential transcriptome map between DS and non-DS related AMKL. This task has been accomplished using a tool recently described by us for the generation and the analysis of quantitative transcriptome maps, TRAM (Transcriptome Mapper) [25], which allows effective integration of data obtained from different experimenters, experimental platforms and data sources. The comparison of 43 DS AMKL samples with 45 non-DS AMKL samples represents the largest study on the subject, highlighting the relevance of trisomy 21 in the development of AMKL in comparison with AMKL originating from non-trisomic cells. Results show significant over- or under-expression of distinct chromosomal segments and of single key genes in the whole genome, as well as on chr21, adding new knowledge compared with that produced by the single works from which the data were originally obtained. In addition, each considered type of leukemia was compared with the expression profile of TMD cells and normal human megakaryoblast/megakaryocyte cells (MK), allowing the building of a model for the disorder in differentiation process that lead to DS and non-DS AMKL. Comparisons with cord blood-derived MK cells (CB MK) have also been performed, due to the fact that leukemias in infants or young children originate from fetal hematopoietic cells [17,18,26,27] and the progenitor cells (fetal/neonatal MKP) are present in the cord blood (CB) [28,29]. For each cell type investigated, reference expression data for about 17,000-26,000 mapped sequences have been generated and validated through a sample comparison with known data. The biological and clinical significance of these data is discussed.

Methods

Literature search

A systematic biomedical literature search was performed up to January 2013 in order to identify articles related to global gene expression profile experiments in AMKL patients (DS AMKL, non-DS AMKL and TMD conditions). A general search using the commonly used acronym "AMKL" retrieved 157 articles. The MeSH term "Leukemia, Megakaryoblastic, Acute" was also used for a PubMed search in the expression: "Leukemia, Megakaryoblastic, Acute"[Mesh] AND ("Gene Expression Profiling"[MeSH] OR "Oligonucleotide Array Sequence Analysis"[Mesh] OR "Microarray Analysis"[Mesh] OR microarray* OR "Expression profile" OR SAGE).

Database search

Gene Expression Omnibus (GEO) [30] functional genomics repository was searched for: (AMKL[All Fields] OR (AML[All Fields] AND M7[All Fields])) AND "Homo sapiens"[Organism]. A more general search using the expression "Down Syndrome"[MeSH] AND "Homo sapiens" [Organism] was also used. ArrayExpress database [31] of functional genomics experiments was searched for the terms: ''AMKL'', "Megakaryoblastic", "AML M7". In order to obtain gene expression profile datasets for normal human MK cells, in addition to the 9 used in the original description of the TRAM software [25], we searched GEO for the expression ("Megakaryocytes" [Mesh] OR Megakaryoblast*) AND "Homo sapiens" [ORGANISM]. The ArrayExpress database was searched for the expressions "Megakaryocyte", "Megakaryocytic", "Megakaryoblast", "MK". The searches were performed up to January 2013.

Dataset selection

The inclusion criteria of datasets in the analysis were: availability of the raw or pre-processed data; pediatric age of the subject from whom the sample was obtained; diagnosis of DS or non-DS AMKL or TMD. Exclusion criteria were: exon arrays (hampering data elaboration by TRAM due to exceedingly high number of data rows) or platforms using probes split into several distinct arrays for each sample (hampering intra-sample normalization); lack of identifiers corresponding to those found in the GEO sample records (GSM) or ArrayExpress sample records; platforms assaying an atypical number of genes (i.e. <5.000 or >60.000); cell line derived data; specific subtype of non-DS AMKL, e.g. t(1;22); trisomy 21 in non-DS AMKL samples. Normal MK samples were considered for the analysis when fulfilling these criteria: late MK colonies (10-14 days) or MK sorted cells, obtained from peripheral blood (PB), bone marrow (BM) or cord blood (CB). MK cultured for less than 10 days or Colony Forming Unit-Megakaryocytic (CFU-MK) were excluded. In order to obtain a quantitative transcriptome map, values from each dataset were linearized when provided as logarithms. In some cases we used raw files (e.g. File CEL) to be converted into pre-processed data, using the software "Alt Analyze" [32].

TRAM (transcriptome mapper) analysis

TRAM (Transcriptome Mapper) software [25] allows the import of gene expression data recorded in the NCBI (National Center for Biotechnology Information) GEO and EBI (European Bioinformatics Institute) ArrayExpress databases in tab-delimited text format. It also allows the integration of all data by decoding probe set identifiers to gene symbols via UniGene data parsing [33], normalizing data from multiple platforms using intra-sample and inter-sample normalization (scaled quantile normalization) [34], creating graphical representation of gene expression profile through two ways, "Map" and "Cluster" mode, and determining the statistical significance of results. Moreover, TRAM allows to compare two biological conditions identifying critical genomic regions and genes with significant differential expressions. We created a directory (folder) for each condition, containing all the sample datasets related to the same source and selected for the study: DS AMKL (pool 'A'); non-DS AMKL (pool 'B'); TMD (pool 'C'); normal MK (pool 'D'); normal CB MK (pool 'E'). We ran the whole set of analyses permitted by TRAM (in both "Map" and "Cluster" mode, although we focused on the "Map" mode) using default parameters as described [25]. We used an updated version of TRAM including enhanced resolution of gene identifiers and updated UniGene and Entrez Gene databases (TRAM 1.1, June 2013), in comparison with the original 2011 version [25]. When the gene location cytoband was not available in the Gene database [35], it was manually derived from UCSC Genome Browser [36]. TRAM is freely available at http://apollo11.isto.unibo.it/software. Briefly, gene expression values were assigned to individual loci via UniGene, intra-sample normalized as percentage of the mean value and inter-sample normalized by scaled quantile. The value for each locus, in each biological condition, is the mean value of all available values for that locus. The genome wide gene expression median value was used in order to determine percentiles of expression for each gene. Using the "Map" mode graphical representation we searched for over/under-expressed genome segments, which have a window size of 500,000 bp and a shift of 250,000 bp. The expression value for each genomic segment is the mean of the expression values of the loci included in that segment. A segment is defined over/under-expressed if it has an expression value which is significantly different between two conditions analyzed, and contains at least 3 individually over/under-expressed genes, e.g. genes which have expression values within the highest and the lowest 2.5th percentile. Significance of the over/under-expression for single genes was determined by running TRAM in "Map" mode with a segment window of 12,500 bp. This window size corresponds to about a quarter of the mean length of a gene, so the significant over/under-expression of a segment almost always corresponds with that of a single gene. A segment or a gene was considered to be statistically significantly over- or under-expressed for q < 0.05, where q is the p-value obtained by the method of hypergeometric distribution [25] and corrected for multiple comparison. When the segment window contains more than one gene, the significance is maintained if the expression value of the over/under-expressed gene prevails over the others. For the creation of the maps, TRAM software does not consider probes where the expression values is not available, assuming that an expression level has not been measured. Furthermore, it gives 95% of the minimum positive value present in a sample to those expression values equal to or lower than "0", in order to obtain meaningful numbers when we need to obtain a ratio between values in pool 'A' and pool 'B'. Assuming that in these cases an expression level is too low to be detected under the experimental conditions used, this transformation is useful to highlight differential gene expression. Finally, we considered the most over- or under-expressed genes among the genes associated with at least 5 data points. At chromosomal level, we calculated (in the TRAM "chr" table) the median expression ratio for all the genes located in the same chromosome.

Other analysis

FuncAssociate analysis [37] was used to obtain Gene Ontology attributes in order to functionally characterize large sets of genes derived from the TRAM analysis.

Results

A general search using the acronym ''AMKL'' retrieved 157 articles, 6 of them describe gene expression profiling experiments [17,18,20-23]. No additional pertinent item was retrieved using the expression described in the Methods section and including the MeSH Term "Leukemia, Megakaryoblastic, Acute". The Gene Expression Omnibus (GEO) [30] search allowed the retrievement of three additional works describing data possibly useful for meta-analysis [19,24,38]. The lack of inclusion of these works in the literature search was due to failure of using the ''AMKL'' acronym and assigning the MeSH Term "Leukemia, Megakaryoblastic, Acute" during the PubMed indexing process (the more general term "Leukemia, Myeloid, Acute" was used). The more general search using the expression "Down Syndrome"[MeSH] AND "Homo sapiens"[Organism] allowed the addition of one further work [39]. This work analyzed several types of AML samples and did not explicitly mention AMKL or AML M7 in both PubMed and GEO databases. No further pertinent works related to AMKL were identified by ArrayExpress database [31] search. Several datasets for normal MK cells global gene expression profile fulfilling the selection criteria were obtained from the works [40] (GEO, 7 samples) and [41] (ArrayExpress, 1 sample), in addition to the 4 sample series identified in the first report of the TRAM software [25] and obtained from different works [42-45], for a total of 19 datasets related to human normal MK cells.

Dataset building

Of the 10 works related to DS or non-DS AMKL retrieved as above described, 7 were considered for the meta-analysis (Table 1). It was not possible to obtain raw data from the Authors of [20], while the only sample of AML M7 described in [38] was related to "Leukemic Stem Cells" cell type and the two AML M7 reported by Tomasson et al. [39] were obtained from elderly patients. Raw data from [23] were kindly provided by Drs. Jeffrey Taub and Yubin Ge.
Table 1

Main features of the samples used in TRAM analyses

Study ID Sample ID Sample type Platform Microarray Spots References
Pool 'A' (n=43) - DS AMKL (25,955 mapped loci following analysis by TRAM)
A1…A3 (n=3)GSM491372…4BM Sorted leukemic blastsGPL570Affymetrix U133 Plus 2.054,675[18]
A4…A25 (n=22)GSM94245, GSM94272…92BM or PBGPL96Affymetrix U133A22,283[22]
A26…A31 (n=6)GSM417985…90BM or PB Sorted leukemic blastsGPL570Affymetrix U133 Plus 2.054,675[17]
A32…A38 (n=7)E-MEXP-72*BMMC or PBMCA-AFFY-33.adf.txt*Affymetrix U133A22,283[21]
A39…A43 (n=5)/BMMC or PBMCGPL96Affymetrix U133A22,283[23]
Pool 'B' (n=45) - non-DS AMKL (26,045 mapped loci following analysis by TRAM)
B1-B2 (n=2)GSM491370-1BM Sorted leukemic blastsGPL570Affymetrix U133 Plus 2.054,675[18]
B3…B23 (n=21)GSM94221-4-5, GSM94227…32, GSM94234-5-7-8, GSM94240-2-3-8, GSM94256-9, GSM94261-2BM or PBGPL96Affymetrix U133A22,283[22]
B24…B28 (n=5)GSM39832-5, GSM39842-4, GSM39863PBMC or BMMCGPL8300Affymetrix U95 Version 212,625[19]
B29…B35 (n=7)GSM361502…4, GSM361506…8, GSM361510BM or PBGPL96Affymetrix U133A22,283[24]
B36…B40 (n=5)GSM417991…5BM or PB Sorted leukemic blastsGPL570Affymetrix U133 Plus 2.054,675[18]
B41…B45 (n=5)/BMMC or PBMCGPL96Affymetrix U133A22,283[23]
Pool 'C' (n=20) - TMD (25,955 mapped loci following analysis by TRAM)
C1…C3 (n=3)GSM491375…7BM Sorted leukemic blastsGPL570Affymetrix U133 Plus 2.054,675[18]
C4…C11 (n=8)GSM94293…9, GSM94300BM or PBGPL96Affymetrix U133A22,283[22]
C12…C20 (n=9)E-MEXP-72*BMMC or PBMCA-AFFY-33.adf.txt*Affymetrix U133A22,283[21]
Pool 'D' (n=19) - MK (26,372 mapped loci following analysis by TRAM)
D1-D2 (n=2)GSM321577-8MK (BM) (subj = pool)GPL96Affymetrix U133A22,283[45]
D3…D6 (n=4)GSM112277-8, GSM112291-2MK (PB) (subj = 1, rep. 1)GPL887Agilent 1A22,575[44]
D7-D8 (n=2)GSM15648, GSM8649MK (BM) (subj = 6)GPL96Affymetrix U133A22,283[42]
D9…D11 (n=3)GSM88014-22-34MK (PB) (subj = 1)GPL887Agilent 1A22,575[43]
D12…D18 (n=7)GSM609746…52CBGPL4685Affymetrix HT-HG_U133A22,944[40]
D19 (n=1)E-MEXP-2146*CBA-AFFY-44.adf.txtAffymetrix U133 Plus 2.054,675[41]
Pool 'E' = D12…D19 (n=8) - CB MK (25,577 mapped loci following analysis by TRAM)

Samples selected for the meta-analysis of gene expression profiles in DS AMKL (pool 'A'), non-DS AMKL (pool 'B'), TMD (pool 'C'), and megakaryocytic cells (pool 'D' and 'E'). All Sample IDs and Platforms IDs are related to GEO database, other than codes marked with * (ArrayExpress database). Sample type: BM, bone marrow; PB, peripheral blood; BMMC, bone marrow mononuclear cells; PBMC, peripheral blood mononuclear cells; MK, megakaryocytic/megakaryoblast cells, obtained by in vitro differentiation of CD34+ cells; CD34+, undifferentiated CD34+ cells; BM, CB or PB: CD34+ cells derived from bone marrow, cord blood or peripheral blood, respectively. subj = number of subjects from which the sample was derived (in some cases, where subj = pool, the exact number of subjects included in a pool was not available). rep. = biological replicate. Microarray: U133A: Affymetrix Human Genome U133A Array; 1A: Agilent-012097 Human 1A Microarray (V2) G4110B; 22k A: Agilent Human oligo 22k A; HG-Focus: Affymetrix Human HG-Focus Target Array.

Details about Sample identifiers and main sample features are listed in Additional file 1 (available at: http://apollo11.isto.unibo.it/suppl).

Main features of the samples used in TRAM analyses Samples selected for the meta-analysis of gene expression profiles in DS AMKL (pool 'A'), non-DS AMKL (pool 'B'), TMD (pool 'C'), and megakaryocytic cells (pool 'D' and 'E'). All Sample IDs and Platforms IDs are related to GEO database, other than codes marked with * (ArrayExpress database). Sample type: BM, bone marrow; PB, peripheral blood; BMMC, bone marrow mononuclear cells; PBMC, peripheral blood mononuclear cells; MK, megakaryocytic/megakaryoblast cells, obtained by in vitro differentiation of CD34+ cells; CD34+, undifferentiated CD34+ cells; BM, CB or PB: CD34+ cells derived from bone marrow, cord blood or peripheral blood, respectively. subj = number of subjects from which the sample was derived (in some cases, where subj = pool, the exact number of subjects included in a pool was not available). rep. = biological replicate. Microarray: U133A: Affymetrix Human Genome U133A Array; 1A: Agilent-012097 Human 1A Microarray (V2) G4110B; 22k A: Agilent Human oligo 22k A; HG-Focus: Affymetrix Human HG-Focus Target Array. Details about Sample identifiers and main sample features are listed in Additional file 1 (available at: http://apollo11.isto.unibo.it/suppl). At the end, DS AMKL sample pool 'A' included 43 datasets, while non-DS AMKL sample pool 'B' was composed of 45 datasets. A TMD dataset pool 'C' was constructed starting from 20 samples described in some of the DS AMKL related articles [18,21,22]. Age and sex data were available for 29 out of 43 DS AMKL patients (mean age: 20 months; 11 males and 18 females), for 26 out of 45 non-DS AMKL patients (mean age: 19 months; 19 males and 7 females) and for 9 out of 20 TMD patients (mean age: 8 days; 7 males and 2 females). GATA1 mutations giving rise to GATA1s were present in all DS AMKL and TMD samples, and not in non-DS AMKL samples, considering all samples for which this information was provided. Sample identifiers and main sample features are listed in Table 1 and Additional file 1 (available at: http://apollo11.isto.unibo.it/suppl). Two pools were constructed from the normal MK related dataset selected: pool 'D' included all available MK samples, while pool 'E' was a subset including only CB-derived MK cells (Table 1 and Additional file 1).

Transcriptome differential maps

Datasets were loaded into TRAM and analyzed obtaining 8 transcriptome maps: DS AMKL (pool 'A') vs. non-DS AMKL (pool 'B'); DS AMKL (pool 'A') vs. normal MK (pool 'D'); non-DS AMKL (pool 'B') vs. normal MK (pool 'D'); DS AMKL (pool 'A') vs. normal CB MK (pool 'E') cells; non-DS AMKL (pool 'B') vs. normal CB MK (pool 'E'); DS AMKL (pool 'A') vs. TMD (pool 'C'); TMD (pool 'C') vs. normal MK (pool 'D'); TMD (pool 'C') vs. normal CB MK (pool 'E'). For each comparison between two cell types by TRAM, we describe below or in the corresponding Figures or Tables the total of data points analyzed for each cell type, i.e. gene expression values for all human mapped loci following intra- and inter-sample normalization [25]; the number of loci for which the comparison between the two conditions was possible due to the presence of values for those loci in both sample pools considered; the number and the gene content of each genomic segment containing at least three over- or under-expressed genes and found to be statistically significantly over- or under-expressed in the comparison between the two tissues. Each genomic segment was identified among the 12,373 segments generated using the default window of 500,000 bp with a sliding window of 250,000 bp and following removal of overlapping segments with similar gene content. When the results were reported for the over/under-expressed single genes, we considered only the genes associated at at least 5 data points. The description of the gene name corresponding to all gene symbols cited here in the text, Figures or Tables is given in the Additional file 2. We performed a PubMed search for the most relevant over- or under-expressed genes using gene symbol or gene description along with MeSH terms related to MK or MK progenitor cells, thrombopoiesis, AMKL, platelets. Detailed results for each map are provided below, and are also available at: http://apollo11.isto.unibo.it/suppl. The absolute (not differential) expression values and maps for each cell type (not compared to another cell type) are also available in the complete sets of results at http://apollo11.isto.unibo.it/suppl, but are not discussed here because they include typical housekeeping genes whose over-expression is no longer evident when compensated by the corresponding housekeeping genes in the compared cell type.

Transcriptome map comparison of DS AMKL vs. non-DS AMKL

We first analyzed regional differential expression of pool 'A' (43 DS AMKL samples) versus pool 'B' (45 non-DS AMKL samples) (Table 1). A total of 1,061,761 data points from the pool 'A' and 1,084,700 data points from the pool 'B' were included in the analysis. An 'A'/'B' ratio value was determinable for 25,954 loci having values both in 'A' and 'B' pools (Additional file 3). The main results are shown in Figure 1. Results obtained by the analysis included 3 significantly non-overlapped over-expressed segments (Table 2a). The highest expression ratio between DS AMKL and non-DS AMKL (3.27) was observed in a segment on chromosome 15 (15q21.2), including the known gene HDC (encoding for histidine decarboxylase, which converts L-histidine to histamine). The second segment with the highest expression was located on chromosome 4 (4q31.1) and contained over-expressed genes such as GYPE, GYPB and GYPA, encoding for glycophorin E, B and A (MNS blood group) respectively. The third over-expressed segment spans the cluster of apolipoprotein encoding genes on chromosome 19 (19q13.2).
Figure 1

Main results of DS AMKL vs. MK, DS AMKL vs. non-DS AMK, non-DS AMKL vs. MK comparisons. For each comparison the number of loci analyzed, the most over- or under-expressed segments and single genes, and the highest and lowest median expression ratios for all the genes located in the same chromosome are indicated.

Table 2

Genomic segments significantly over- or under-expressed

a) DS AMKL vs. non-DS AMKL
Chr and location a Segment start b Segment end b 'A'/'B' ratio q-value Genes in the segment c
Over-expressed segments
chr15 15q21.250,250,00150,750,0003.270.00233 ATP8B4- SLC27A2+ HDC+ GABPB1+ FLJ10038+ GABPB1-AS1+ Hs.656448+ Hs.660869+ d USP8+
chr4 4q31.1144,750,001145,250,0002.210.00024 GYPE+ Hs.658686- GYPB + GYPA +
chr19 19q13.245,000,00145,500,0001.580.00461 ZNF180- PVR+ CEACAM19- BCL3+ CBLC+ BCAM+ PVRL2- Hs.666142- TOMM40+ APOE+ APOC1 + APOC4+ APOC2 + CLPTM1+
b) DS AMKL vs. MK
Chr and location Segment start Segment end 'A'/'D' ratio q-value Genes in the segment
Over-expressed segments
chr3 p26.23,500,0014,000,00020.810.00006 Hs.241414+ Hs.587205+ LRRN1 + Hs.128128+
chr1 1q23166,000,001166,500,00018.070.00004 Hs.22930+ FAM78B+ Hs.662048+
chr15 15q26.296,750,00197,250,00014.730.00001 Hs.677040+ Hs.661950+ NR2F2-AS1 + NR2F2- Hs.592015+
Under-expressed segments
chr4 4q12-q2174,750,00175,250,0000.450.00079 PF4 - PPBP - CXCL5- CXCL3 - CXCL2- MTHFD2L- Hs.662627- EREG-
chr4 4q32.1156,250,001156,750,0000.130.00012 MAP9- GUCY1A3 - Hs.612374- GUCY1B3 -
c) Non-DS AMKL vs. MK
Chr and location Segment start Segment end 'B'/'D' ratio q-value Genes in the segment
Over-expressed segments
chr15 15q26.296,750,00197,250,00022.30<0.00001 Hs.677040+ Hs.661950+ NR2F2-AS1 + NR2F2- Hs.592015+
Under-expressed segments
chr4 4q13-q2174,500,00175,000,0000.390.00009 IL8- CXCL6- PF4V1 - CXCL1- PF4 - PPBP - CXCL5- CXCL3 - CXCL2-

Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. non-DS AMKL (pool 'B'); b) DS AMKL (pool 'A') vs. MK (pool 'D'); c) non-DS AMKL (pool 'B') vs. MK (pool 'D'). Analysis was performed using default parameters (see Methods section). Segments are sorted by decreasing 'A'/'B' ratio in a), 'A'/'D' ratio in b), 'B'/'D' ratio in c). In the "Map" mode, TRAM displays UniGene EST clusters (with the prefix "Hs." in the case of Homo sapiens) only if they have an expression value. Some segments are not shown for simplicity because they are over-lapping with those highlighted in one of the listed regions. The complete results for these models are available as on line additional material.

aChr: chromosome. The segment location cytoband was derived from that of the first mapped gene within the segment.

bSegment Start/End: chromosomal coordinates for each segment.

cBold and '+': over-expressed gene; bold and '-': under-expressed gene; '+' or '-': gene expression value higher or lower than the median value, respectively.

dThis UniGene cluster contains EST with at least one Alu sequence, according to Repeat Masker [46]. For this reason, it can not be excluded that its over-expression is related to unspecific hybridization by Alu-containg probes.

Main results of DS AMKL vs. MK, DS AMKL vs. non-DS AMK, non-DS AMKL vs. MK comparisons. For each comparison the number of loci analyzed, the most over- or under-expressed segments and single genes, and the highest and lowest median expression ratios for all the genes located in the same chromosome are indicated. Genomic segments significantly over- or under-expressed Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. non-DS AMKL (pool 'B'); b) DS AMKL (pool 'A') vs. MK (pool 'D'); c) non-DS AMKL (pool 'B') vs. MK (pool 'D'). Analysis was performed using default parameters (see Methods section). Segments are sorted by decreasing 'A'/'B' ratio in a), 'A'/'D' ratio in b), 'B'/'D' ratio in c). In the "Map" mode, TRAM displays UniGene EST clusters (with the prefix "Hs." in the case of Homo sapiens) only if they have an expression value. Some segments are not shown for simplicity because they are over-lapping with those highlighted in one of the listed regions. The complete results for these models are available as on line additional material. aChr: chromosome. The segment location cytoband was derived from that of the first mapped gene within the segment. bSegment Start/End: chromosomal coordinates for each segment. cBold and '+': over-expressed gene; bold and '-': under-expressed gene; '+' or '-': gene expression value higher or lower than the median value, respectively. dThis UniGene cluster contains EST with at least one Alu sequence, according to Repeat Masker [46]. For this reason, it can not be excluded that its over-expression is related to unspecific hybridization by Alu-containg probes. At single gene level, a fold increase higher than 5 was observed in all of the first 20 genes with the greatest expression ratios of DS AMKL vs. non-DS AMKL samples (Table 3a and Additional file 3). In particular, a 24-fold increase was observed for SLITRK6 gene, encoding for a membrane protein to date described as similar to receptor for BDNF (brain-derived neurotrophic factor) and predominantly expressed in neural tissues. Among the genes with the lower 'A'/'B' expression ratios a 169.5-fold decrease was observed for a UniGene EST cluster, Hs.355689.
Table 3

List of the five most over- or under-expressed genes (all significantly, with q < 0.05)

a) DS AMKL vs. non-DS AMKL
Gene Value 'A' Value 'B' 'A'/'B' ratio Location Data points SD SD
'A' 'B' 'A' 'B'
Over-expressed genes
SLITRK6 465.519.523.86013q31.12721113.079.6
HDC 1,119.465.517.09715q21-q22364598.8249.4
ZNF587B 615.039.315.64319q13.43a 4340427.632.2
SOSTDC1 111.78.612.9577p21.14345176.5108.4
LOC100287628*1,381.3107.712.82016p13.29780.256.8
Under-expressed genes
Hs.587427*19.8407.80.0497p15.2181428.9353.3
SPINK2 13.1285.10.0464q12364581.8115.0
Hs.602709*15.8370.20.04311q13.2a 9733.1256.9
RPS12 20.7519.10.0406q23.24350113.7213.0
Hs.355689*3.7624.40.00618q11.2a 9747.3168.4
b) DS AMKL vs. MK
Gene Value 'A' Value 'D' 'A'/'D' ratio Location Data points SD SD
'A' 'D' 'A' 'D'
Over-expressed genes
TMEM241 1,036.617.658.88518q11.218857.7115.8
CMBL 592.411.750.7595p15.2278142.030.1
IFI27 467.811.640.47414q32368290.027.6
CSRNP1 640.616.838.0233p229841.3138.3
PTGS2 2,205.059.237.2761q25.2 q25.36320156.4131.6
APOC2* b 431.711.737.04419q13.2542881.441.6
SLITRK6 b 465.513.335.05713q31.12710113.095.6
Under-expressed genes
MAP3K10 5.42,352.30.00219q13.2361947.4135.1
DRD4 5.02,327.60.00211p15.5361951.3135.2
DLGAP3 4.82,297.90.0021p35.3-p34.19740.837.6
LBX1 2.71,358.50.00210q24361946.4152.6
TSPAN10* 7.94,674.70.00217q25.39855.046.0
c) Non-DS AMKL vs. MK
Gene Value 'B' Value 'D' 'B'/'D' ratio Location Data points SD SD
'B' 'D' 'B' 'D'
Over-expressed genes
TMEM241 2,754.117.6156.44318q11.214889.9115.8
CDYL2 1,613.513.1122.99416q23.278128.969.8
MPV17L 1,857.824.476.28016p13.1178119.565.0
ATP6V0D2 836.711.970.4888q21.3a 2110169.482.9
CMBL 757.911.764.9385p15.2218176.230.1
Under-expressed genes
ILDR1 13.74,333.80.0033q13.3314998.463.3
HIST3H3 3.91,313.00.0031q42401978.0137.9
TSPAN10* 12.14,674.70.00317q25.37888.146.0
MAP3K10 3.92,352.30.00219q13.2451940.1135.1
LBX1 2.01,358.50.00110q24451985.7152.6

Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. non-DS AMKL (pool 'B'); b) DS AMKL (pool 'A') vs. MK (pool 'D'); c) non-DS AMKL (pool 'B') vs. MK (pool 'D'). Value: mean gene expression value normalized across all the pool samples; data points: number of spots associated to an expression value for the locus; SD: standard deviation for the expression value expressed as percentage of the mean. Full results available as additional material (see text). *The segment window contains more than one gene, but the significance is assumed to be maintained because the expression value of this over- or under-expressed gene prevails over the others.

aCytoband not available in Gene was derived from the UCSC Genome Browser [36].

bThis gene, exceeding the limit of five genes for each list, has been shown for its relevance in the Discussion, being recurrent in other comparisons.

List of the five most over- or under-expressed genes (all significantly, with q < 0.05) Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. non-DS AMKL (pool 'B'); b) DS AMKL (pool 'A') vs. MK (pool 'D'); c) non-DS AMKL (pool 'B') vs. MK (pool 'D'). Value: mean gene expression value normalized across all the pool samples; data points: number of spots associated to an expression value for the locus; SD: standard deviation for the expression value expressed as percentage of the mean. Full results available as additional material (see text). *The segment window contains more than one gene, but the significance is assumed to be maintained because the expression value of this over- or under-expressed gene prevails over the others. aCytoband not available in Gene was derived from the UCSC Genome Browser [36]. bThis gene, exceeding the limit of five genes for each list, has been shown for its relevance in the Discussion, being recurrent in other comparisons. At chromosomal level, we calculated (in the TRAM "Chr" table) the median 'A'/'B' expression ratio for all the genes located in the same chromosome. The highest ratios were near to 1 (0.93 for chr22, 0.92 for chrX and chr21, 0.91 for chr19); other values were in the range from 0.90 (chr17 and chr12) to 0.76 (chrY). We performed two additional transcriptome maps to investigate specifically sex-biased gene expression patterns (data not shown, results may be regenerated by the user by excluding/including or reimporting samples on the basis of data provided in Additional file 1): in particular, we compared male (pool 'A.1', n=11) vs. female DS AMKL cells (pool 'A.2', n=18). These datasets are derived from the samples for which the knowledge about the sex of the sample donor was available. The results showed a significant statistical correlation of data between male and female gene expression data (r=0.99, p-value<0.0001), showing a large overlap of results between the two transcriptome maps, with the exception of single genes with a well known sex-biased expression pattern. For example, XIST, which is specifically activated in female cells to start the X-inactivation process, turns out to be the most differentially expressed gene between female (value=402.60) and male (value=12.20) DS AMKL cells (ratio=33).

Transcriptome map comparison of DS AMKL or non-DS AMKL vs. normal MK

Regional differential expression of pool 'A' (43 DS AMKL samples) or pool 'B' (45 non-DS AMKL samples) versus pool 'D' (19 normal MK cell samples, 411,381 data points) (Table 1) was investigated. An 'A'/'D' ratio value was determinable for 25,800 loci (Additional file 4). The main results are shown in Figure 1. For what DS AMKL samples are concerned, results included 5 significantly differentially expressed segments in DS AMKL cells, 3 over- and 2 under-expressed (Table 2b). The highest expression ratio (20.81) between DS AMKL cells and normal MK was observed in the segment at coordinates 3,500,001-4,000,000 on chromosome 3, including the known gene LRRN1, encoding for a type I transmembrane protein. The second segment with highest expression ratio (18.07) was located on chromosome 1 (1q23) and contained FAM78B (family with sequence similarity 78, member B). The third segment was on chromosome 15 and included NR2F2-AS1, a non-coding RNA. The first significantly under-expressed segment (4q32.1) includes genes encoding for subunits of soluble guanylate cyclase (GUCY1A3 and GUCY1B3), while the second spans the cluster of MK specific genes located on chromosome 4 (4q12-q21). At single gene level, a fold increase higher than 18 was observed in all of the first 20 genes with the greatest expression ratios of DS AMKL vs. MK samples (Table 3b and Additional file 4). In particular, a 59-fold increase was observed for TMEM241, encoding a transmembrane protein of unknown function. Among the genes with the lowest 'A'/'D' expression ratio a 589.7-fold decrease was observed for TSPAN10, encoding for tetraspanin 10. At chromosomal level, the highest ratio was observed for chr21 (1.75), the lowest (chr17), other values were in the range from 1.68 (chrY) to a value of 1.23 for chr17. Regarding the non-DS AMKL samples, results obtained by default analysis and derived from 'B'/'D' expression ratio for 25,819 loci included one significantly over- and one significantly under-expressed segment (Table 2c). The highest expression ratio (22.30) between non-DS AMKL and normal MK was observed in the same segment on chr15, significantly over-expressed also in the DS AMKL transcriptome map. This segment was the only significantly over-expressed one in this comparison. Similarly, the only significantly under-expressed segment includes the cluster of MK specific genes on chromosome 4 also found to be under-expressed in DS AMKL samples (Table 2b). At single gene level, a fold increase higher than 31 was observed in all of the first 20 genes with the greatest expression ratios of non-DS AMKL vs. MK samples (Table 3c and Additional file 5). In particular, a 156-fold increase was observed for TMEM241. Overall, there was a remarkable overlap between the most over- (TMEM241, CMBL, ZNF445, SPRR4) and under-expressed (PF4V1, FLJ22184, FSIP2, PPP1R3B, HIST3H3, PIF1, SPSB4, ILDR1, MAP3K10, DRD4, LBX1, TSPAN10) genes in DS and in non-DS AMKL samples (Additional files 4 and 5). At chromosomal level, looking at the median 'B'/'D' expression ratio for the genes located in the same chromosome, the highest ratio was observed for chr21 (1.77) and chrY (1.73), followed by chr13 (1.71) and chr10 (1.59); other values were in the range from 1.58 (chr20), to a value of 1.24 (chrX).

Transcriptome map comparison of DS AMKL or non DS AMKL vs. normal CB MK

Regional differential expression of pool 'A' (43 DS AMKL samples) or pool 'B' (45 non-DS AMKL samples) versus pool 'E' (8 normal cord blood (CB)-derived MK cell samples, 191,798 data points) (Table 1) was investigated. The main results are shown in Figure 2.
Figure 2

Main results of DS AMKL vs. CB MK and non-DS AMKL vs. CB MK comparisons. For each comparison the number of loci analyzed, the most over- or under-expressed segments and single genes, and the highest and lowest median expression ratios for all the genes located in the same chromosome are indicated.

Main results of DS AMKL vs. CB MK and non-DS AMKL vs. CB MK comparisons. For each comparison the number of loci analyzed, the most over- or under-expressed segments and single genes, and the highest and lowest median expression ratios for all the genes located in the same chromosome are indicated. For what DS AMKL samples are concerned, results derived from 'A'/'E' expression ratio for 25,540 loci (Additional file 6) included 1 significantly over- and 3 under-expressed segments in DS AMKL cells (Table 4a). A remarkable expression ratio (80.36) between DS AMKL cells and normal CB MK was observed for a segment on chromosome 3 (3q22.1), including collagen-encoding COL6A5 and COL6A6 known loci. The three significantly under-expressed segments included the region on chromosome 4 (4q12-q21) with PF4, PPBP and CXCL3 loci implied in MK differentiation.
Table 4

Genomic segments significantly over- or under-expressed

a) DS AMKL vs. normal CB MK
Chr and location a Segment start b Segment end b 'A'/'E' ratio q-value Genes in the segment c
Over-expressed segments
chr3 3q22.1130,000,001130,500,00080.360.00015 COL6A5 + COL6A6 + Hs.596709+ Hs.596805+ PIK3R4-
Under-expressed segments
chr7 7q11.2377,000,00177,500,0000.480.00117 GSAP - PTPN12 - LOC101059910- RSBN1L-AS1- RSBN1L- Hs.594486- Hs.720279- TMEM60- PHTF2-
chr4 4q12-q2174,750,00175,250,0000.370.00119 PF4 - PPBP - CXCL5- CXCL3 - CXCL2- MTHFD2L- Hs.662627- EREG-
chr4 4q32.1156,250,001156,750,0000.120.00000 MAP9 - GUCY1A3 - Hs.612374 - GUCY1B3 -
b) Non-DS AMKL vs. normal CB MK
Chr and location Segment start Segment end 'B'/'E' ratio q-value Genes in the segment
Over-expressed segments
chr3 3q22.1130,000,001130,500,000116.130.00030 COL6A5 + COL6A6 + Hs.596709+ Hs.596805+ PIK3R4-
chr11 11p156,750,0017,250,00050.980.00117 OR6A2+ OR10A5 + OR10A4 + ZNF215- ZNF214- Hs.445849+ NLRP14+ RBMXL2- LOC100506238+
Under-expressed segments
chr4 4q12-q2174,750,00175,250,0000.420.00003 PF4 - PPBP - CXCL5 - CXCL3 - CXCL2- MTHFD2L- Hs.662627- EREG-

Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. normal CB MK (pool 'E'); b) non-DS AMKL (pool 'B') vs. normal CB MK (pool 'E'). Analysis was performed using default parameters (see Methods section). Segments are sorted by increasing 'A'/'E' ratio in a), 'B'/'E' ratio in b). In the "Map" mode, TRAM displays UniGene EST clusters (with the prefix "Hs." in the case of H. sapiens) only if they have an expression value. Some segments are not shown for simplicity because they are over-lapping with those highlighted in one of the listed regions. The complete results for this model are available as on line additional material.

aChr: chromosome. The segment location cytoband was derived from that of the first mapped gene within the segment.

bSegment Start/End: chromosomal coordinates for each segment.

cBold and '+': over-expressed gene; bold and '-': under-expressed gene; '+' or '-': gene expression value higher or lower than the median value, respectively.

Genomic segments significantly over- or under-expressed Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. normal CB MK (pool 'E'); b) non-DS AMKL (pool 'B') vs. normal CB MK (pool 'E'). Analysis was performed using default parameters (see Methods section). Segments are sorted by increasing 'A'/'E' ratio in a), 'B'/'E' ratio in b). In the "Map" mode, TRAM displays UniGene EST clusters (with the prefix "Hs." in the case of H. sapiens) only if they have an expression value. Some segments are not shown for simplicity because they are over-lapping with those highlighted in one of the listed regions. The complete results for this model are available as on line additional material. aChr: chromosome. The segment location cytoband was derived from that of the first mapped gene within the segment. bSegment Start/End: chromosomal coordinates for each segment. cBold and '+': over-expressed gene; bold and '-': under-expressed gene; '+' or '-': gene expression value higher or lower than the median value, respectively. At single gene level, a fold increase higher than 15.7 was observed in all of the first 20 genes with the greatest expression ratios of DS AMKL cells vs. CB MK (Table 5a and Additional file 6). In particular, a 45-fold increase was observed for the tyrosine phosphatase receptor gene (PTPRO), known to be involved in megakaryocytopoiesis.
Table 5

List of the five most over- or under-expressed genes (all significantly, with q <0.05)

a) DS AMKL vs. normal CB MK
Gene Value 'A' Value 'E' Ratio 'A'/'E' Location Data points SD SD
'A' 'E' 'A' 'E'
Over-expressed genes
PTPRO 679.715.145.06312p13-p12819133.060.8
APOC2 a 431.79.943.47719q13.2541081.443.4
IFI27 467.811.540.74214q3236890.057.1
HDC 1,119.434.232.71615q21-q2236898.8168.1
VIPR2 ab 346.213.525.5827q36.311523334.849.3
Under-expressed genes
CDKL1 4.6174.30.02714q21.345965.399.0
ASAP2 19.1734.10.0262p2436895.467.3
GNG11 18.91,042.30.0187q21368119.781.2
RPS12 20.71,157.20.0186q23.24315113.7112.4
OLFM4 8.9499.30.01813q14.336783.9121.5
b) Non-DS AMKL vs. normal CB MK
Gene Value 'B' Value 'E' Ratio 'B'/'E' Location Data points SD SD
'B' 'E' 'B' 'E'
Over-expressed genes
PTPRO b 510.315.133.83112p13-p12979137.860.8
CES3 b 287.412.223.51116q22.1479335.873.8
SCD5 ab 544.727.219.9924q21.226118290.373.9
TOP3B ab 150.27.519.92222q11.226014431.819.7
MT1E ab 237.111.919.89916q1345892.527.3
Under-expressed genes
Hs.23729a 2.385.10.0271p13.2c 408121.089.5
PDE6C 2.288.20.02510q24458112.9105.9
SPTLC3 4.8203.70.02320p12.147986.5110.7
SPATA1 7.0356.20.0201p22.3407127.477.9
OLFM4 9.7499.30.02013q14.3457105.1121.5

Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. normal CB MK (pool 'E'); b) non-DS AMKL (pool 'B') vs. normal CB MK (pool 'E'). Value: mean gene expression value normalized across all the pool samples; data points: number of spots associated to an expression value for the locus; 'SD': standard deviation for the expression value expressed as percentage of the mean. Full results available as additional material (see text).

aThe segment window contains more than one gene, but the significance is assumed to be maintained because the expression value of this over- or under-expressed gene prevails over the others.

bAccording to the criteria detailed in Methods section, this gene is one of the five most over-expressed ones, but the value is not statistically significant due to the presence of a lot of loci associated with less than 5 data points in the CB MK integrated dataset.

cCytoband not available in Gene was derived from the UCSC Genome Browser [36].

List of the five most over- or under-expressed genes (all significantly, with q <0.05) Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. normal CB MK (pool 'E'); b) non-DS AMKL (pool 'B') vs. normal CB MK (pool 'E'). Value: mean gene expression value normalized across all the pool samples; data points: number of spots associated to an expression value for the locus; 'SD': standard deviation for the expression value expressed as percentage of the mean. Full results available as additional material (see text). aThe segment window contains more than one gene, but the significance is assumed to be maintained because the expression value of this over- or under-expressed gene prevails over the others. bAccording to the criteria detailed in Methods section, this gene is one of the five most over-expressed ones, but the value is not statistically significant due to the presence of a lot of loci associated with less than 5 data points in the CB MK integrated dataset. cCytoband not available in Gene was derived from the UCSC Genome Browser [36]. At chromosomal level, the highest ratio was observed for chr21 (2.08), followed by chrY (1.89); other values were in the range from 1.84 (chr22) to 1.37 (chrX). Regarding the non-DS AMKL samples, results derived from 'B'/'E' expression ratio (for 25,546 loci) included 2 significantly over- and 1 under-expressed segments in non-DS AMKL (Table 4b). A remarkable expression ratio between non-DS AMKL and normal CB MK (116.13) was observed in the same segment on chromosome 3 (3q22.1), including collagen-encoding COL6A5 and COL6A6 known loci that was observed in DS AMKL samples. The second segment was specific of non-DS AMKL samples and included the two olfactive receptor genes OR10A5 and OR10A4. The only significantly under-expressed segment included the region on chromosome 4 (4q12-q21) highly enriched in MK-specific loci (PF4, PPBP, CXCL5 and CXCL3) as in the case of DS AMKL samples, and was extended to PF4V1 locus. At single gene level, a fold increase higher than 14.7 was observed in all of the first 20 genes with the greatest expression ratios of non-DS AMKL vs. CB MK samples (Table 5b and Additional file 7). In particular, a 33.8-fold increase was observed for PTPRO, encoding a tyrosine phosphatase receptor. Overall, there was some overlapping between the most over- and under-expressed genes in DS and in non-DS AMKL samples (Additional file 6 and Additional file 7). At chromosomal level, regarding the median 'B'/'E' expression ratio for the genes located in the same chromosome, the highest ratio was observed for chrY (2.51), followed by chr21 (2.19); other values were in the range from 2.03 (chr20) to 1.46 (chrX).

Transcriptome map comparison of DS AMKL vs. TMD

Regional differential expression of pool 'A' (43 DS AMKL samples) versus pool 'C' (20 TMD samples, 398,162 data points) (Table 1) was investigated. The main results are shown in Figure 3.
Figure 3

Main results of DS AMKL vs. TMD, TMD vs. MK, TMD vs. CB MK comparisons. For each comparison the number of loci analyzed, the most over- or under-expressed segments and single genes, and the highest and lowest median expression ratios for all the genes located in the same chromosome are indicated.

Main results of DS AMKL vs. TMD, TMD vs. MK, TMD vs. CB MK comparisons. For each comparison the number of loci analyzed, the most over- or under-expressed segments and single genes, and the highest and lowest median expression ratios for all the genes located in the same chromosome are indicated. Results obtained by default analysis and derived from 'A'/'C' expression ratio for 25,955 loci (Additional file 8) included 2 significantly over-expressed segments in DS AMKL (Table 6a). The highest expression ratio (2.20) between DS AMKL and TMD was observed in a segment on chromosome 2 (2q31.3), including the known gene ITGA4, encoding an alpha 4 chain of integrin protein and CERKL, a gene responsible for retinitis pigmentosa and involved in the protection of cells from apoptosis induced by oxidative stress [47]. The second segment with the highest expression ratio (1.40) was located on chromosome 8 (8q21.3) and contained the NECAB1 and OTUD6B genes, encoding for neuronal Ca(2+)-binding protein and the deubiquitinating enzyme, respectively.
Table 6

Genomic segments significantly over- or under-expressed

a) DS AMKL vs. TMD
Chr a and location Segment start b Segment end b 'A'/'C' ratio q-value Genes in the segment c
Over-expressed segments
chr2 2q31.3182,000,001182,500,0002.200.00045 ITGA4 + Hs.660611- Hs.658786+ CERKL+ Hs.72981+
chr8 8q21.391,750,00192,250,0001.400.00051 NECAB1 + Hs.743640+ LOC100127983+ TMEM55A+ LOC100506365- OTUD6B+ LRRC69+
b) TMD vs. normal MK
Chr and location Segment start Segment end 'C'/'D' ratio q-value Genes in the segment
Over-expressed segments
chr3 3p26.23,500,0014,000,00028.000.00006 Hs.241414+ d Hs.587205+ LRRN1 + Hs.128128+
chr15 15q26.296,750,00197,250,00019.860.00000 Hs.677040+ d Hs.661950+ d NR2F2-AS1 + NR2F2- Hs.592015+
c) TMD vs. normal CB MK
Chr and location Segment start Segment end 'C'/'E' ratio q-value Genes in the segment
chr33q22.1130,000,001130,500,00075.970.00015 COL6A5 + COL6A6 + Hs.596709+ Hs.596805+ PIK3R4-

Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. TMD (pool 'C'); b) TMD (pool 'C') vs. normal MK (pool 'D'); c) TMD (pool 'C') vs. normal CB MK (pool 'E'). Analysis was performed using default parameters (see Methods section). Segments are sorted by increasing 'A'/'C' ratio in a), 'C'/'D' ratio in b) and 'C'/'E' ratio in c). In the "Map" mode, TRAM displays UniGene EST clusters (with the prefix "Hs." in the case of H. sapiens) only if they have an expression value. Some segments are not shown for simplicity because they are over-lapping with those highlighted in one of the listed regions. The complete results for this model are available as additional material on line.

aChr: chromosome. The segment location cytoband was derived from that of the first mapped gene within the segment.

bSegment Start/End: chromosomal coordinates for each segment.

cBold and '+': over-expressed gene; bold and '-': under-expressed gene; '+' or '-': gene expression value higher or lower than the median value, respectively.

dThis UniGene cluster contains EST with at least one Alu sequence, according to Repeat Masker [46]. For this reason, it can not be excluded that its over-expression is related to non-specific hybridization by Alu-containg probes.

Genomic segments significantly over- or under-expressed Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. TMD (pool 'C'); b) TMD (pool 'C') vs. normal MK (pool 'D'); c) TMD (pool 'C') vs. normal CB MK (pool 'E'). Analysis was performed using default parameters (see Methods section). Segments are sorted by increasing 'A'/'C' ratio in a), 'C'/'D' ratio in b) and 'C'/'E' ratio in c). In the "Map" mode, TRAM displays UniGene EST clusters (with the prefix "Hs." in the case of H. sapiens) only if they have an expression value. Some segments are not shown for simplicity because they are over-lapping with those highlighted in one of the listed regions. The complete results for this model are available as additional material on line. aChr: chromosome. The segment location cytoband was derived from that of the first mapped gene within the segment. bSegment Start/End: chromosomal coordinates for each segment. cBold and '+': over-expressed gene; bold and '-': under-expressed gene; '+' or '-': gene expression value higher or lower than the median value, respectively. dThis UniGene cluster contains EST with at least one Alu sequence, according to Repeat Masker [46]. For this reason, it can not be excluded that its over-expression is related to non-specific hybridization by Alu-containg probes. At single gene level, a fold increase ranged from 15.5 to 3.5 for the first 20 genes with the greatest expression ratios of DS AMKL vs. TMD samples (Table 7a and Additional file 8). The highest fold increases were observed for the ZNF587B (15.5) and IFI27 (13.4) genes, encoding for a zinc finger protein and the interferon alpha-inducible protein 27, respectively. The lowest 'A'/'C' expression ratios were observed for KIAA2022 (10-fold decrease) and SLFNL1 (5-fold decrease) genes.
Table 7

List of the five genes most over- or under-expressed (all significantly, with q <0.05)

a) DS AMKL vs. TMD
Gene Value 'A' Value 'C' Ratio 'A'/'C' Location Data points SD SD
'A' 'C' 'A' 'C'
Over-expressed genes
ZNF587B 615.039.715.49819q13.43c 4320427.620.9
IFI27 467.834.813.43114q32361190.091.4
SLITRK6 465.538.712.03613q31.1279113.084.5
BST2 a 117.614.87.96719p13.13611260.098.8
ZNF521 143.019.67.30918q11.2186113.558.6
Under-expressed genes
ALS2CR12 4.215.80.2682q33.118680.3103.2
GBP5 11.643.80.2661p22.218675.7110.6
GHRH 14.767.10.21920q11.2432097.7257.0
SLFNL1 a 18.992.90.2031p34.218690.3122.1
KIAA2022 24.3297.90.081Xq13.327952.2260.9
b) TMD vs. normal MK
Gene Value 'C' Value 'D' Ratio 'C'/'D' Location Data points SD SD
'C' 'D' 'C' 'D'
Over-expressed genes
TMEM241 2,224.417.6126.35618q11.26857.1115.8
CMBL 1,063.511.791.1265p15.298168.830.1
PTGS2 2,256.459.238.1451q25.2-q25.32020147.6131.6
CGA 360.610.036.2386q12-q211419159.727.3
TMEFF2 755.122.533.6052q32.399190.344.3
Under-expressed genes
ILDR1 18.34,333.80.0043q13.336958.963.3
DRD4 9.32,327.60.00411p15.5111952.3135.2
HIST3H3 5.21,313.00.0041q42111936.2137.9
LBX1 4.31,358.50.00310q24111926.6152.6
MAP3K10 7.42,352.30.00319q13.2111928.8135.1
c) TMD vs. normal CB MK
Gene Value 'C' Value 'E' Ratio 'C'/'E' Location Data points SD SD
'C' 'E' 'C' 'E'
Over-expressed genes
CGA 360.68.044.8696q12-q21148159.739.9
PTPRO b 523.115.134.68012p13-p12259116.860.8
CLCA1 b 319.29.932.3801p22.3208116.717.5
APOC2 ab 301.69.930.37219q13.2171066.643.4
HDC ab 918.034.226.83015q21-q2211859.9168.1
Under-expressed genes
CDKL1 5.9174.30.03414q21.314982.099.0
GNG11 34.31,042.30.0337q2111880.181.2
RHOBTB1 7.7241.30.03210q21.220849.797.4
RPS12 34.61,157.20.0306q23.2201577.4112.4
OLFM4 10.0499.30.02013q14.311756.0121.5

Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. TMD (pool 'C'); b) TMD (pool 'C') vs. normal MK (pool 'D'); c) TMD (pool 'C') vs. normal CB MK (pool 'E'). Value: mean gene expression value normalized across all the pool samples; data points: number of spots associated to an expression value for the locus; 'SD': standard deviation for the expression value expressed as percentage of the mean. Full results available as additional material (see text).

aThe segment window contains more than one gene, but the significance is assumed to be maintained because the expression value of this over- or under-expressed gene prevails over the others.

bAccording to the criteria detailed in Methods section, this gene is one of the five most over-expressed ones, but the value is not statistically significant due to the presence of a lot of loci associated with less than 5 data points in the CB MK integrated dataset.

cCytoband not available in Gene was derived from the UCSC Genome Browser [36].

List of the five genes most over- or under-expressed (all significantly, with q <0.05) Data refer to the following comparisons: a) DS AMKL (pool 'A') vs. TMD (pool 'C'); b) TMD (pool 'C') vs. normal MK (pool 'D'); c) TMD (pool 'C') vs. normal CB MK (pool 'E'). Value: mean gene expression value normalized across all the pool samples; data points: number of spots associated to an expression value for the locus; 'SD': standard deviation for the expression value expressed as percentage of the mean. Full results available as additional material (see text). aThe segment window contains more than one gene, but the significance is assumed to be maintained because the expression value of this over- or under-expressed gene prevails over the others. bAccording to the criteria detailed in Methods section, this gene is one of the five most over-expressed ones, but the value is not statistically significant due to the presence of a lot of loci associated with less than 5 data points in the CB MK integrated dataset. cCytoband not available in Gene was derived from the UCSC Genome Browser [36]. At chromosomal level, the highest ratio was observed for chr14 (0.87, followed by 0.86 for chr5, chr21, chr8, chr12 and chr16), the lowest for chrY (0.74).

Transcriptome map comparison of TMD vs. normal MK or CB MK cells

Regional differential expression of pool 'C' (20 TMD samples) versus pool 'D' (19 MK samples) or 'E' (8 CB MK samples) (Table 1) was investigated. The main results are shown in Figure 3. For what MK samples are concerned, results obtained by default analysis and derived from 'C'/'D' expression ratios for 25,800 loci (Additional file 9) included 2 significantly over-expressed segments in TMD cells (Table 6b). The highest expression ratio (28.0) between TMD and normal MK was observed in a segment on chromosome 3 including the known gene LRRN1, already observed as over-expressed in comparison of DS AMKL vs. normal MK (Table 2b). The second segment with the highest expression ratio (19.9) was located on chromosome 15, and contained the locus NR2F2-AS1, encoding for an antisense mRNA, already observed as over-expressed in comparison of DS AMKL and non-DS AMKL vs. normal MK (Table 2b and 2c). At single gene level, the fold increase was higher than 16.5 for the first 20 genes with the greatest expression ratios (Table 7b and Additional file 9), with the highest fold increases for TMEM241 (126.4) and the cysteine hydrolase gene (CMBL) (91.1). The lowest 'C'/'D' expression ratios were observed for a member of the serine/threonine kinase family (MAP3K10) and the homeobox gene (LBX1) (both with 333-fold decrease). At chromosomal level, the highest ratio was observed for chrY (2.1) and chr21 (1.93), followed by chr13 (1.71), the lowest for chrX (1.33). As far as the comparison of TMD with CB MK samples is concerned, results derived from the 'C'/'E' expression ratio for 25,540 loci (Additional file 10) included only 1 significantly over-expressed segment in TMD cells (Table 6c). The segment with a significant high expression ratio (76.0) between TMD and normal CB MK cells was on chromosome 3 (3q22.1), including the known genes COL6A5 and COL6A6 and already observed as over-expressed in DS as well in non-DS AMKL samples in comparison with CB MK samples. At single gene level, a fold increase ranged from 44.9 to 15.1 for the first 20 genes with the greatest expression ratios (Table 7c and Additional file 10). The highest fold increases were observed for CGA (44.9), encoding for the alpha chain of the glycoprotein hormones and PTPRO (34.7), as already observed in non-DS AMKL vs. CB MK comparison. The lowest 'C'/'E' expression ratio was observed for OLFM4 (50-fold decrease), encoding olfactomedin 4, an antiapoptotic factor that promotes tumor growth. At chromosomal level, the highest ratio was observed for chr21 (2.40), followed by chrY (2.37) and chr20 (2.13), the lowest for chrX (1.53).

Comparison with previously published data

As a result of the analysis above described, a reference integrated map for the expression of about 26,000 mapped sequences (~75% known genes and ~25% expression sequence tags - ESTs) was de facto generated for five cell types (DS AMKL cells, non-DS AMKL cells, TMD cells, MK and CB MK). This gave us the opportunity to compare our data with the expression values of specific known genes from previously published works about the considered cell types. Following analysis of the main literature about AMKL, we selected 38 genes of interest and have tabulated their expression values desumed from our 8 differential maps, comparing these values to the ones previously described in different experimental settings (Table 8).
Table 8

Comparison with previously published data

Gene Location References 'A' /' B' 'A' /'D' 'B' /'D' 'A '/' E' 'B' /' E' 'A' /' C' 'C' / 'D' 'C '/' E'
MK markers
BST2 19p13.11[23] 0.53 5.3610.077.2113.567.970.670.91
GATA1 Xp11.23[13,22,48] 3.39 1.45 0.43 1.970.58 1.19 1.22 1.66
GP1BA 17p13.2[43]0.46 0.07 0.14 0.25 0.53 0.810.080.30
HBG1 11p15.5[22] 2.36 1.400.591.070.450.781.791.37
HBG2 11p15.5[22] 2.47 1.310.531.060.430.711.841.48
ITGA2B 17q21.32[23]0.82 0.63 0.771.521.841.050.601.44
ITGB3 17q21.32[23]0.75 0.08 0.10 0.10 0.13 0.650.120.15
PF4 4q12-q21[25]0.90 0.06 0.06 0.11 0.12 0.440.130.25
PPBP 4q12-q13[25]1.45 0.10 0.07 0.13 0.09 0.280.340.44
TAL1 1p32[23] 0.85 0.19 0.22 0.16 0.19 1.050.180.16
Chr21 genes
ADAMTS1 21q21.2[49]6.19 7.02 1.14 7.14 1.151.82 3.85 3.91
BACH1 21q22.11[22] 1.86 1.440.771.040.561.550.930.67
DYRK1A 21q22.13[50] 1.61 0.870.540.630.39 1.14 0.770.55
ERG 21q22.3[18,22] 0.68 3.475.123.104.571.422.442.18
ETS2 21q22.2[18,22] 0.93 1.111.201.391.490.731.521.90
GABPA 21q21.3[18,22] 1.39 2.451.761.521.101.551.580.98
RUNX1 21q22.3[18,22] 0.71 1.081.510.841.181.061.010.80
SOD1 21q22.11[51] 1.88 1.520.812.781.481.521.001.84
SON 21q22.11[22] 1.28 1.391.081.030.801.440.960.71
Other genes
APOC1 19q13.2[20,22,23] 4.37 10.042.3016.263.72 1.79 5.639.11
APOC2 19q13.2[22] 4.36 37.048.4943.489.961.4325.8830.37
APOE 19q13.2[22,23] 2.18 3.571.643.241.481.562.302.08
CDA 1p36.2-p35[23] 0.39 0.401.030.882.240.880.461.00
DICER1 14q32.13[18]0.800.410.510.270.340.990.410.27
FCER1A 1q23[22,23] 6.32 4.280.688.861.401.323.256.72
GYPA 4q31.21[23] 3.40 1.560.460.890.261.191.310.75
GYPB 4q31.21[23] 2.81 1.280.460.890.311.271.010.70
GYPE 4q31.1[23] 2.07 1.630.791.710.821.720.950.99
HDC 15q21-q22[22,23] 17.10 9.950.5832.721.911.228.1626.83
IGF1R 15q26.3[17]0.76 1.18 1.55 0.941.241.210.980.78
ITGAL 16p11.2[52]0.79 0.21 0.26 0.09 0.11 1.02 0.20 0.09
KIT 4q11-q12[22] 1.66 3.342.013.081.851.222.742.53
MPL 1p34[53]1.11 0.31 0.28 0.220.201.23 0.25 0.18
MTOR 1p36.22[17]0.87 1.57 1.82 1.942.240.891.762.17
MYCN 2p24.3[21]0.351.002.870.772.19 0.34 2.992.28
PRAME 22q11.22[21]1.6011.597.2415.619.76 4.18 2.773.73
SMOX 20p13[23] 1.37 0.760.551.020.750.790.961.30
ST18 8q11.23[18]0.900.740.820.740.820.780.960.95

Expression values of genes known for their role in MK differentiation and DS or non-DS AMKL development in each of the eight comparisons: DS AMKL (pool 'A') vs. non-DS AMKL (pool 'B'); DS AMKL (pool 'A') vs. normal MK (pool 'D'); non-DS AMKL (pool 'B') vs. normal MK (pool 'D'); DS AMKL (pool 'A') vs. normal CB MK (pool 'E') cells; non-DS AMKL (pool 'B') vs. normal CB MK (pool 'E'); DS AMKL (pool 'A') vs. TMD (pool 'C'); TMD (pool 'C') vs. normal MK (pool 'D'); TMD (pool 'C') vs. normal CB MK (pool 'E'). Data extracted from the full tables showing expression values and their ratio for about 17-26,000 loci for each comparison. In bold: expression ratio values are consistent with data available in the literature (see references indicated). The other values are reported for completeness. Descriptions of the gene symbols are given in Additional file 2.

Comparison with previously published data Expression values of genes known for their role in MK differentiation and DS or non-DS AMKL development in each of the eight comparisons: DS AMKL (pool 'A') vs. non-DS AMKL (pool 'B'); DS AMKL (pool 'A') vs. normal MK (pool 'D'); non-DS AMKL (pool 'B') vs. normal MK (pool 'D'); DS AMKL (pool 'A') vs. normal CB MK (pool 'E') cells; non-DS AMKL (pool 'B') vs. normal CB MK (pool 'E'); DS AMKL (pool 'A') vs. TMD (pool 'C'); TMD (pool 'C') vs. normal MK (pool 'D'); TMD (pool 'C') vs. normal CB MK (pool 'E'). Data extracted from the full tables showing expression values and their ratio for about 17-26,000 loci for each comparison. In bold: expression ratio values are consistent with data available in the literature (see references indicated). The other values are reported for completeness. Descriptions of the gene symbols are given in Additional file 2. The wide agreement of expression ratio values for specific genes between our data, generated by systematic meta-analysis of hundred of thousands of gene expression values from any gene expression profile available, and the data obtained by different marker-specific methods in published quantitative studies, is relevant for the validation of our maps that may so be used for exploring any other expression ratio in the considered biological conditions.

Discussion

We have presented here a comprehensive analysis of transcriptome in human DS AMKL cells. Integration of data from different sources, including data obtained from different Authors using a variety of platforms, was made possible by a recent approach described by us for creation and analysis of transcriptome maps [25]. While most approaches are aimed to separate gene expression profiles related to the same biological source in subclasses, the TRAM tool provides means to integrate and summarize a pool of samples of the same biological origin leading to a global picture of gene expression for that condition. Moreover, TRAM identifies critical genomic regions and genes with significant differential expressions between two biological conditions. Several Authors have determined gene expression profiles for DS or non-DS AMKL samples or have explicitly compared these two leukemic conditions. However, due to the rarity of the M7 subtype of leukemia and the need to limit the analysis to pediatric age because DS AMKL occurs almost exclusively in children, these studies were typically limited to small group of samples. In addition, most platforms used in the microarray studies are affected by omissions or errors in mapping a certain percentage of probes to specific loci in the genome. In our analysis, the use of a new version of the TRAM software (TRAM 1.1) allowed us to map thousands of previously uncharacterized microarray probes and to avoid the errors in probe assignment to human loci often present in the data supplied by the manufacturer along with the platforms. Our data are derived from systematic integration of data from multiple sources at locus level (up-to-date rigorous assignment of each microarray probe to a specific human locus/transcript/EST cluster), map level (up-to-date fine mapping of each transcript on the genome map) and expression value level (assignment of a reference value to each locus in each cell type following an intra- and inter-sample normalization pipeline exploiting both parametric and non-parametric calculations). The combination of many gene expression profile datasets from different sources poses the problem of the batch effect, i.e. the systematic differences between batches (groups) of samples in microarray experiments due to purely technical reasons. However, the intrinsic resistance of the TRAM approach to the batch effect has been discussed previously [25], and it is indirectly confirmed by the clear biological meaning of the differential expression highlighted by the tool when comparison with previous direct key experimental knowledge is possible in several different types of tissues and organs [25,34,54]. A systematic comparison of AMKL originated from trisomy 21 cells versus non-trisomy 21 cells should highlight specific mechanisms [55] related to the presence of an extra copy of chr21 in DS children developing AMKL. Moreover, we presented a comparison with normal MK cells that has never been performed in other analyses about AMKL. Our global quantitative models of the transcriptome in the AMKL cells could also be useful to test hypotheses for correlations between any parameter associated to the condition (e.g., specific mutations or phenotype aspects) and specific changes in gene expression. Our results, obtained in an integrated and open setting without any a priori assumption, show several previously unidentified aspects regarding specificity of AMKL originated by trisomy 21 cells. First, there are only a few genomic regions significantly over- or under-expressed when comparing DS versus non-DS AMKL samples. This finding suggests that transcriptome maps of these two conditions are similar while on the other hand allows to focus to a small set of regions that appears to be critical in order to differentiate these disease conditions. Relevant differences regarding genes were reported in Table 2a (genomic segments) and Table 3a (single genes), with potential implications for the identification of diagnostic or therapeutic targets. There are three main regions over-expressed in DS AMKL vs. non-DS AMKL (Table 2a). The first one (15q21.2) contains HDC gene, whose mRNA is translated in the enzyme converting L-histidine to histamine produced by only a few cell types [56]; HDC mRNA increase has been shown to be associated to basophilic rather than to MK differentiation of pluripotent hematopoietic cells [57]. These observations led to the discovery of a skewing toward a potential basophilic differentiation for DS AMKL not highlighted in the original works from which the data were derived. Supporting this hypothesis, FCER1A mRNA, encoding the alpha subunit of the high-affinity IgE receptor, the initiator of the allergic response and strongly typical of basophilic differentiation, is 6.3 times over-expressed (the 8th most over-expressed known gene) in DS vs. non-DS AMKL, reinforcing the notion that in DS AMKL but not in non-DS AMKL the leukemic dedifferentiation involved the possibility of redirection toward basophilic differentiation. Remarkably, in [58] is demonstrated by an electron microscopy analysis that AMKL blast cells from children with DS may contain basophil-like granules which were almost totally absent in blasts from children with non-DS AMKL or adults with AMKL, so that our data allow to the visualization of the molecular correlation at the level of the whole transcriptome of a morphological feature observed more than 20 years ago. Two other genomic regions are over-expressed in DS vs. non-DS AMKL blasts. The first is the region of glycophorins genes (GYPE, GYPA, GYPB) on chr4, erythroid surface markers [59]: this reinforces the concept of a disturbance of multilineage myeloid hematopoiesis in DS AMKL and has been observed by flow cytometry in the non-neoplastic hematopoiesis itself in trisomy 21 [60]. The other is the region of apolipoproteins genes (APOC1, APOC2, APOE) on chr19 that has been described as a signature of progression from TMD to DS AMKL in a gene expression profile ([20], it was not possible to include this in our analysis), further underlining its specificity for DS vs. non-DS MK blast cells. When grouping expression values by chromosome, the chromosome with the greatest global RNA output was chr21 in both TMD and DS AMKL vs. normal CB MK; we observed the same result in both DS and non-DS AMKL vs. normal MK comparisons. These data suggest that over-expression of chr21 genes is a key factor in AMKL development. In particular, ADAMTS1, encoding a protease known to inhibit angiogenesis [61] is the most over-expressed chr21 gene in DS vs. non-DS AMKL comparison. It is interesting to notice that this gene has been correlated to pediatric leukemias (ALL and DS AML) [49,62] but its exact role in leukemogenesis is still to be discovered. Our quantitative approach summarizing all values for each locus may clearly highlight mean expression ratio near to 1.5:1 for several chr21 genes when comparing DS AMKL vs. non-DS AMKL. This observation is consistent with the presence of an additional copy of the considered genes in trisomic cells. For example, GABPA expression presents a ratio close to 1.5:1 expected in trisomy of chr21, as well as other chr21 genes (DYRK1A, SON, BACH1) (Table 8). Even relatively small but significant differences (around 1.5-fold) in expression of numerous genes likely produce an aggregate effect, as observed in [63], where the same genes seem to be candidates to explain the impact of trisomy 21 in hematopoiesis abnormalities. For this reason it could be interesting to start from significant and robust meta-analysis data to plan functional approaches in the future. Moreover, our data are consistent with the previous observation that RUNX1, ERG and ETS2 oncogenes, although located on chr21, are not over-expressed in DS vs. non-DS AMKL [18,22]. It has also been recently demonstrated that they are not located on a chr21 duplicated minimal region in two cases of AML of M0 subtype (FAB classification) [64]. As regard oncogenes, if chr21 oncogenes cited above appear not to be over-expressed in DS AMKL, TRIB1 (chr8) may be a novel important oncogene for DS AMKL and its mutation is an earlier genetic event in leukemogenesis [65]. In particular, it has been shown that a mutation of TRIB1 [65], a myeloid oncogene whose protein product is able to enhance ERK phosphorylation and to promote degradation of C/EBP family transcription factors, is a gain-of-function mutation remaining in leukocytes of the remission stage in which GATA1 mutation disappeared. Our results show a mean value of 1.40 for the human TRIB1 expression ratio between DS and non-DS AMKL samples, with a higher over-expression observed when comparing leukemic samples to normal MK (ratio 3.15 for DS AMKL and 2.26 for non-DS AMKL). Although the expression of chr21 genes as a whole, or of some individual chr21 genes, may be coherent with the 3:2 gene expression ratio model in the comparison between trisomic and euploid cells, we note that discrepancies from this ideal model seen in our data for some of the comparisons we have made (Figures 1, 2 and 3) may be ascribed to the complexity of gene regulation in the aneuploid state [66,67], to individual variability [68] as well as to the general dysregulation typical of the neoplastic state for which specific cell types we analyze is concerned. Additional biologically relevant findings came from comparison of each type of megakaryoblastic leukemic condition with normal MK cells. Due to the role of the microenvironment in the hemopoiesis, including hemopoiesis in DS [69], it is expected that DS MK would present growth alterations due to trisomy 21 in both hemopoietic and microenvironment cells. From this point of view, DS MK cells would be the ideal control for the progression of a trisomic cell toward TMD and AMKL, however no gene expression profile dataset was available for this cell type. We propose here a biological model of the transcriptome depicting progressive changes from normal MK to TMD and then to DS AMKL, able to underline both shared and unique transcriptome map patterns for DS and non-DS variants of AMKL (Figure 4).
Figure 4

Biological model of the transcriptome depicting progressive changes from MK to TMD then to DS AMKL. Downward pointing arrow indicates the repression of genes involved in MK differentiation; upward pointing arrow indicates the over-expression of potential molecular markers of progression to AMKL. Value: mean gene expression value normalized across all the pool samples. §Observed both in DS and non-DS AMKL.

Biological model of the transcriptome depicting progressive changes from MK to TMD then to DS AMKL. Downward pointing arrow indicates the repression of genes involved in MK differentiation; upward pointing arrow indicates the over-expression of potential molecular markers of progression to AMKL. Value: mean gene expression value normalized across all the pool samples. §Observed both in DS and non-DS AMKL. Noteworthy, the genomic segment on chr4 known to contain a cluster of genes highly specific for MK differentiation [25,70], was the highest significantly under-expressed segment in both DS and non-DS AMKL in comparison with normal MK. In particular, the more strongly under-expressed region 4q12-q21 contains a cluster of genes, including PF4 (encoding for the platelet factor 4, a main component of platelet alpha granules) [71-73], and PPBP (encoding for beta-thromboglobulin) [70], that are the most up-regulated transcripts in the megakaryocytic differentiation from CD34+ hematopoietic progenitors [25]. This finding highlights a common final outcome of the block of MK cells differentiation in both DS and non-DS AMKL. It should be underlined that this result came from systematic ab initio analysis of more than 12,000 segments on the human genome including about 26,000 mapped loci, thus highlighting that this region critical for the MK differentiation is actually the more repressed in absolute when comparing transcriptome maps of AMKL (DS or non-DS) and normal MK cells. In addition, the most under-expressed gene in TMD blasts when compared to normal MK cells is MAP3K10, encoding an activity of mitogen-activated protein kinase kinase kinase (MAPKKK). It is known that the mitogen-activated protein kinase (MAPK) pathway is involved in and is sufficient for megakaryocytic differentiation [74,75]. MAPK activity is present in several tens of human proteins, and we have identified the member MAP3K10 as the critically repressed gene in the block of MK differentiation in the development of leukemia with MK features in that it appears down-regulated 300-fold in TMD cells and 500-fold in both DS and non-DS AMKL compared with normal MK samples (Tables 3 and 7). Finally, transcript for MPL, the receptor of thrombopoietin which is the primary regulator of normal thrombopoiesis (the formation of platelets) [53,73,76], is decreased by ~70% in either TMD, DS and non-DS AMKL cells vs. normal MK cells. An exceedingly high over-expression of the gene located on chromosome 18 and encoding for the uncharacterized membrane protein TMEM241 has been found in both DS (59-fold) and non-DS (156-fold) AMKL cells vs. normal MK. Although this probe was not present in all considered experimental platforms, its extreme differential expression makes it a candidate for further studies as a marker of progression from normal MK to AMKL blasts, also due to its 126-fold over-expression in TMD vs. MK cells. Moreover, we identified several signatures of progression specifically to DS AMKL. Remarkably, segments and genes up- or down-regulated in TMD in comparison with normal MK cells were highly similar to those specifically found in DS AMKL, underlining striking similarities between TMD and DS AMKL at the level of the whole transcriptome (already noted in [21], in their smaller set). On the other hand, a direct comparison between TMD and DS AMKL shows specific potential markers of progression to DS AMKL. As cited above, apolipoproteins genes (APOC1, APOC2, APOE) have been described as a signature of progression from TMD to DS AMKL [20] and it is interesting to notice that APOC2 is among the 20 most expressed genes in the comparison between TMD and MK (25.88-fold increase), showing a progressive increase of expression from normal MK to TMD and then to DS AMKL. In our analysis, ZNF587B appears to be the most discriminant marker between TMD and DS AMKL. Again, this observation offers the opportunity to integrate and discuss single genes and pathways previously described as abnormally expressed in DS or non-DS AMKL (Table 8). For example, the PRAME gene, encoding for a tumor antigen [21] was identified as a specific marker for DS AMKL blasts (n=7), with no expression in TMD (n=9). While our meta-analysis on PRAME expression data points (36 for DS AMKL and 11 for TMD) confirmed a clear over-expression of PRAME in DS AMKL (4.2-fold increase, Table 8), it was not the most discriminant marker, that was exactly ZNF587B, while PRAME was the 33rd out of 25,955 transcripts ordered by decreasing DS AMKL vs. TMD expression ratio (Additional file 8). Finally, since leukemias in infants or young children originate from fetal hematopoietic cells [17,18,26,27] and the progenitors (fetal/neonatal MKP) are present in the cord blood [28,29], comparisons with CB MK cells have been also performed. Data from DS and non-DS AMKL vs. CB MK comparisons confirmed the repression of the clusters of genes expressed in MK. The over-expression of a region with collagen genes emerged both in DS and non-DS AMKL as well as that of the single gene PTPRO (Table 5a and 5b), encoding for a tyrosine phosphatase receptor known to be involved in megakaryocytopoiesis and whose mRNA targeting by antisense oligonucleotides results in inhibited MK progenitor proliferation [77]. On the other hand, the difference between DS and non-DS AMKL vs. CB MK is shown by the over-expression of the two olfactive receptor genes OR10A5 and OR10A4 (Table 4) only in non-DS derived cells. The analysis of enrichment in specific gene functions using the tool FuncAssociate, for the 100 most over- or under-expressed genes in the comparison of DS vs. non-DS AMKL and of both of them vs. MK cells gave no significant results, other than a significant enrichment in genes involved in sequence-specific DNA-binding in non-DS AMKL vs. MK cells (data not shown), highlighting the relevance of remodeling the transcription factor network in leukemia.

Conclusions

Our results provide a systematic meta-analysis using any available gene expression profile dataset related to AMKL in pediatric age. These allow to identify more general trends and to produce a highly coherent view of the transcriptome depicting progressive changes from MK to TMD and then to DS AMKL. We believe that the originality of our results is due to several concurrent original features of the TRAM 1.1 platform. Advantages and relevant differences are: integration of the largest possible number of samples; integrated analysis of the largest possible number of genes (the integration of different platforms led us to assess expression ratio for about 26,000 loci, quantitating almost 4,000 genes in addition to the widely used platform U133A when used alone); absence of a priori filtering (in several works, this led to actual analysis of less than 50% of the genes present on the experimental platform); characterization at regional/map level in the study of gene expression (usually absent in the works from which data were obtained), relevant with regard to the study of an aneuploidy such as trisomy 21. These results provide a new integrated model of the whole human transcriptome in DS and non DS AMKL, TMD and normal human MK cells, providing hints about pathophysiology of AMKL and also being useful to highlight possible clinical markers.
  73 in total

1.  Gene expression profiling of normal and malignant CD34-derived megakaryocytic cells.

Authors:  Elena Tenedini; Maria Elena Fagioli; Nicola Vianelli; Pier Luigi Tazzari; Francesca Ricci; Enrico Tagliafico; Paolo Ricci; Luigi Gugliotta; Giovanni Martinelli; Sante Tura; Michele Baccarani; Sergio Ferrari; Lucia Catani
Journal:  Blood       Date:  2004-07-22       Impact factor: 22.113

2.  Activation of the mitogen-activated protein kinase pathway is involved in and sufficient for megakaryocytic differentiation of CMK cells.

Authors:  A S Melemed; J W Ryder; T A Vik
Journal:  Blood       Date:  1997-11-01       Impact factor: 22.113

3.  The receptor protein tyrosine phosphatase, PTP-RO, is upregulated during megakaryocyte differentiation and Is associated with the c-Kit receptor.

Authors:  Y Taniguchi; R London; K Schinkmann; S Jiang; H Avraham
Journal:  Blood       Date:  1999-07-15       Impact factor: 22.113

4.  Modulation of histidine decarboxylase activity and cytokine synthesis in human leukemic cell lines: relationship with basophilic and/or megakaryocytic differentiation.

Authors:  M Dy; M Pacilio; A Arnould; F Machavoine; P Mayeux; O Hermine; M Bodger; E Schneider
Journal:  Exp Hematol       Date:  1999-08       Impact factor: 3.084

5.  Expression of chromosome 21-localized genes in acute myeloid leukemia: differences between Down syndrome and non-Down syndrome blast cells and relationship to in vitro sensitivity to cytosine arabinoside and daunorubicin.

Authors:  J W Taub; X Huang; L H Matherly; M L Stout; S A Buck; G V Massey; D L Becton; M N Chang; H J Weinstein; Y Ravindranath
Journal:  Blood       Date:  1999-08-15       Impact factor: 22.113

6.  A role for the MEK/MAPK pathway in PMA-induced cell cycle arrest: modulation of megakaryocytic differentiation of K562 cells.

Authors:  R Herrera; S Hubbell; S Decker; L Petruzzelli
Journal:  Exp Cell Res       Date:  1998-02-01       Impact factor: 3.905

Review 7.  Transient leukemia in newborns with Down syndrome.

Authors:  Gita V Massey
Journal:  Pediatr Blood Cancer       Date:  2005-01       Impact factor: 3.167

8.  Identification of a gene expression signature associated with pediatric AML prognosis.

Authors:  Tomohito Yagi; Akira Morimoto; Mariko Eguchi; Shigeyoshi Hibi; Masahiro Sako; Eiichi Ishii; Shuki Mizutani; Shinsaku Imashuku; Misao Ohki; Hitoshi Ichikawa
Journal:  Blood       Date:  2003-05-08       Impact factor: 22.113

9.  Trisomy 21 enhances human fetal erythro-megakaryocytic development.

Authors:  Stella T Chou; Joanna B Opalinska; Yu Yao; Myriam A Fernandes; Anna Kalota; John S J Brooks; John K Choi; Alan M Gewirtz; Gwenn-ael Danet-Desnoyers; Richard L Nemiroff; Mitchell J Weiss
Journal:  Blood       Date:  2008-09-23       Impact factor: 22.113

10.  An erythroid and megakaryocytic common precursor cell line (B1647) expressing both c-mpl and erythropoietin receptor (Epo-R) proliferates and modifies globin chain synthesis in response to megakaryocyte growth and development factor (MGDF) but not to erythropoietin (Epo).

Authors:  L Bonsi; A Grossi; P Strippoli; F Tumietto; R Tonelli; A M Vannucchi; A Ronchi; S Ottolenghi; G Visconti; G C Avanzi; L Pegoraro; G P Bagnara
Journal:  Br J Haematol       Date:  1997-09       Impact factor: 6.998

View more
  10 in total

1.  High Expression of DC-STAMP Gene Predicts Adverse Outcomes in AML.

Authors:  Qian Liang; Lele Zhang; Wenjun Wang; Jingyu Zhao; Qiaoli Li; Hong Pan; Zhen Gao; Liwei Fang; Jun Shi
Journal:  Front Genet       Date:  2022-04-27       Impact factor: 4.772

2.  Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

Authors:  Maria Caracausi; Allison Piovesan; Francesca Antonaros; Pierluigi Strippoli; Lorenza Vitale; Maria Chiara Pelleri
Journal:  Mol Med Rep       Date:  2017-07-06       Impact factor: 2.952

3.  A molecular view of the normal human thyroid structure and function reconstructed from its reference transcriptome map.

Authors:  Lorenza Vitale; Allison Piovesan; Francesca Antonaros; Pierluigi Strippoli; Maria Chiara Pelleri; Maria Caracausi
Journal:  BMC Genomics       Date:  2017-09-18       Impact factor: 3.969

4.  The Novel Zinc Finger Protein 587B Gene, ZNF587B, Regulates Cell Proliferation and Metastasis in Ovarian Cancer Cells in vivo and in vitro.

Authors:  Feiyue Zeng; Yingzi Liu; Yujie Liu; Qianying Ouyang; Zeen Sun; Jieqiong Tan; Weihua Huang; Jie Liu; Zhaoqian Liu; Honghao Zhou
Journal:  Cancer Manag Res       Date:  2020-06-26       Impact factor: 3.989

5.  Dysregulation of NIPBL leads to impaired RUNX1 expression and haematopoietic defects.

Authors:  Mara Mazzola; Alex Pezzotta; Grazia Fazio; Alessandra Rigamonti; Erica Bresciani; Germano Gaudenzi; Maria Chiara Pelleri; Claudia Saitta; Luca Ferrari; Matteo Parma; Monica Fumagalli; Andrea Biondi; Giovanni Cazzaniga; Anna Marozzi; Anna Pistocchi
Journal:  J Cell Mol Med       Date:  2020-04-23       Impact factor: 5.310

6.  Sex-Specific Transcriptome Differences in Substantia Nigra Tissue: A Meta-Analysis of Parkinson's Disease Data.

Authors:  Elisa Mariani; Lorenza Lombardini; Federica Facchin; Fabrizio Pizzetti; Flavia Frabetti; Andrea Tarozzi; Raffaella Casadei
Journal:  Genes (Basel)       Date:  2018-05-25       Impact factor: 4.096

7.  Partial trisomy 21 map: Ten cases further supporting the highly restricted Down syndrome critical region (HR-DSCR) on human chromosome 21.

Authors:  Maria Chiara Pelleri; Elena Cicchini; Michael B Petersen; Lisbeth Tranebjaerg; Teresa Mattina; Pamela Magini; Francesca Antonaros; Maria Caracausi; Lorenza Vitale; Chiara Locatelli; Marco Seri; Pierluigi Strippoli; Allison Piovesan; Guido Cocchi
Journal:  Mol Genet Genomic Med       Date:  2019-06-25       Impact factor: 2.183

8.  Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank.

Authors:  Allison Piovesan; Maria Caracausi; Marco Ricci; Pierluigi Strippoli; Lorenza Vitale; Maria Chiara Pelleri
Journal:  DNA Res       Date:  2015-11-17       Impact factor: 4.458

9.  Meta-Analysis of Parkinson's Disease Transcriptome Data Using TRAM Software: Whole Substantia Nigra Tissue and Single Dopamine Neuron Differential Gene Expression.

Authors:  Elisa Mariani; Flavia Frabetti; Andrea Tarozzi; Maria Chiara Pelleri; Fabrizio Pizzetti; Raffaella Casadei
Journal:  PLoS One       Date:  2016-09-09       Impact factor: 3.240

10.  Integrated Quantitative Transcriptome Maps of Human Trisomy 21 Tissues and Cells.

Authors:  Maria Chiara Pelleri; Chiara Cattani; Lorenza Vitale; Francesca Antonaros; Pierluigi Strippoli; Chiara Locatelli; Guido Cocchi; Allison Piovesan; Maria Caracausi
Journal:  Front Genet       Date:  2018-04-24       Impact factor: 4.599

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.