Literature DB >> 26507857

BloodSpot: a database of gene expression profiles and transcriptional programs for healthy and malignant haematopoiesis.

Frederik Otzen Bagger¹, Damir Sasivarevic², Sina Hadi Sohi², Linea Gøricke Laursen¹, Sachin Pundhir¹, Casper Kaae Sønderby³, Ole Winther⁴, Nicolas Rapin⁵, Bo T Porse⁶.

Abstract

Research on human and murine haematopoiesis has resulted in a vast number of gene-expression data sets that can potentially answer questions regarding normal and aberrant blood formation. To researchers and clinicians with limited bioinformatics experience, these data have remained available, yet largely inaccessible. Current databases provide information about gene-expression but fail to answer key questions regarding co-regulation, genetic programs or effect on patient survival. To address these shortcomings, we present BloodSpot (www.bloodspot.eu), which includes and greatly extends our previously released database HemaExplorer, a database of gene expression profiles from FACS sorted healthy and malignant haematopoietic cells. A revised interactive interface simultaneously provides a plot of gene expression along with a Kaplan-Meier analysis and a hierarchical tree depicting the relationship between different cell types in the database. The database now includes 23 high-quality curated data sets relevant to normal and malignant blood formation and, in addition, we have assembled and built a unique integrated data set, BloodPool. Bloodpool contains more than 2000 samples assembled from six independent studies on acute myeloid leukemia. Furthermore, we have devised a robust sample integration procedure that allows for sensitive comparison of user-supplied patient samples in a well-defined haematopoietic cellular space.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2015 PMID： 26507857 PMCID： PMC4702803 DOI： 10.1093/nar/gkv1101

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

A decade of intense studies of the genetic programs underlying normal and malignant haematopoiesis has resulted in a number of gene-expression data sets, which can potentially help answer questions concerning the molecular mechanisms governing normal haematopoiesis and how these are de-regulated in cancer. To researchers and clinicians with limited bioinformatics experience, these data have been available through online databases in the form of raw or semi-processed files but remained largely inaccessible for analysis, let alone comparison with user-supplied in-house data. Recently, a number of web interfaces have been generated to facilitate single gene queries of in-house data (ImmGen Gene Skyline (1), Gene-expression Atlas (2), Leukemia Gene Atlas (3) and Differentiation Map (2)) or curated, compiled and processed data sets (HemaExplorer (3), Gene Expression Commons (4), A HeamAtlas (5), BloodChIP (6), BloodExpress (7) and CODEX (8)). These tools provide information on the expression of single genes, but fail to answer the main questions as to whether these genes influence patient survival or if genes or pathways are regulated in similar or inverse patterns. We have previously published a comprehensive database of mRNA microarray samples from FACS sorted healthy and leukemic bone marrow samples (3) which has proven a useful and popular resource for researchers working within the areas of cellular differentiation, haematopoiesis and leukaemia. Here, we present a complete overhaul and significantly expanded version of the original database, with a new and interactive interface, all freely available online. The new database redefines current approaches to explorative data integration, presentation and visualisation of gene-expression in the haematopoietic system. Consequently, all these improvements called for a new name: BloodSpot. The core function of BloodSpot is to provide an expression plot of genes in healthy and cancerous haematopoietic cells at specific differentiation stages. To present these haematopoietic gene profiles, we have developed a novel visualization chart that simply integrates the benefits of strip-charts and violin plots. The server accepts either a unique gene name (gene alias) or a gene signature name from the MSigDB database. Of note, an auto-complete mechanism helps finding the right names for genes and gene signatures. To contextualise the haematopoietic gene expression profile, two additional levels of visualisation are available: an interactive hierarchical tree that shows the relationship between the samples displayed and a Kaplan–Meier plot based on a high-quality Acute Myeloid Leukemia (AML) data set (9). Additionally, we added a large body of curated data sets to the database, which users can query seamlessly. Significantly, we provide a new integrated data set of samples from AML patients along with FACS sorted samples from healthy individuals. This new integrated data set provides the most detailed picture of the gene expression landscape in healthy and malignant haematopoiesis to date. Finally, the database provides the possibility of comparing user-supplied leukaemia samples to healthy cells. The platform is freely available, and requires no login, at: www.bloodspot.eu

DATA CONTENT UPDATES

Available data sets

BloodSpot is a database of mRNA expression in healthy and malignant haematopoiesis and includes data from both humans and mice. The database is sub-divided into several data sets that are each accessible for browsing through the new interface. Data sets are organised by organism of origin and disease status. The data sets are organised as follows: first, human healthy haematopoietic cells, then human leukaemia and finally healthy mouse haematopoietic cells. BloodSpot contains the data sets from our previous HemaExplorer (3) as well as new published data sets, all manually processed as described in Rapin et al. (10). All data sets available in BloodSpot were generated using oligonucleotide microarray chips, except for one mouse data set that was generated using RNA sequencing technology. For completeness, the database also includes the content of other online databases that we deem relevant for the study of haematopoiesis in the framework of BloodSpot. These external databases include the Differentiation Map (DMAP) (2) and the Immunological Genome project (ImmGen) (1). In total the platform encompasses more than 5000 samples (see Tables 1–3). All data sets were controlled for quality, appropriately normalised and adjusted for batch effects when necessary (11,12).

Table 1.

Data sets for normal hematopoiesis

Data set	Organism	Source	Sample numbers	Cell types	Reference
Normal hematopoiesis with AMLs	Human	GSE42519	34	HSC, MPP, CMP, MEP, GMP, early PM, late PM, MY, MM, BC, PMN	Rapin et al. (20)
Normal hematopoiesis (HemaExplorer)	Human	GSE17054	2	HSC	Majeti et al. (21)
Normal hematopoiesis (HemaExplorer)	Human	GSE19599	4	GMP, MEP	Andersson et al. (22)
Normal hematopoiesis (HemaExplorer)	Human	GSE11864	2	Monocytes	Hu et al. (23)
Normal hematopoiesis (HemaExplorer)	Human	E-MEXP-1242	2	Monocytes	Wildenberg et al. (24)
Normal hematopoiesis (DMAP)	Human	GSE24759	211	Normal Hematopoiesis	Novershtern et al. (2)
Mouse normal hematopoietic system	Mouse	GSE14833, GSE6506	67	Normal Hematopoiesis	Di Tullio et al. (25), Chambers et al. (26)
ImmGen data sets	Mouse	GSE15907	>700	Normal Hematopoiesis	Ref (1,27–29)

Table 3.

Data set overview

Data set	Features	Samples	Normalisation method
Leukemia MILE study	67191	2095	1
Normal human hematopoiesis with AMLs	67191	296	1,7
Immgen Key populations	47273	256	2
AML versus normal	67191	252	3
AML TCGA data set	67191	244	1
AML TCGA data set versus normal	67191	244	3
AML Normal Karyotype	54675	234	1
AML Normal Karyotype versus normal	67191	234	3
Normal human hematopoiesis (DMAP)	35459	211	4
Immgen abT cells	47273	190	2
Immgen Dentritic cells	47273	151	2
Immgen MFs Monocytes Neutrophils	47273	114	2
Immgen B cells	47273	103	2
Normal human hematopoiesis (HemaExplorer)	57270	77	5
Immgen gdT cells	47273	76	2
Immgen Stem and progenitor cells	47273	76	2
Mouse normal hematopoietic system	57613	67	4
Immgen Activated T cells	47273	55	2
Immgen NK cells	47273	47	2
Immgen Stromal cells	47273	39	2
Mouse normal (RNA seq)	45426	52	6
BloodPool	67191	2120	1,7
BloodPool versus normal	67191	2076	3,7

Normalisation method legend:

1 Each cancer sample is normalised together with a set of samples from sorted normal myeloid populations. All samples where normalised using RMA. Comparison of gene expression values is not possible with other data sets in Bloodspot.

2 All samples from the ImmGen data sets were normalised together with RMA. Samples were subsequently attributed to the different data sets in BloodSpot. This means that comparison of gene expression values is possible across all ImmGen data sets.

3 The data are normalised according to Rapin et al. Briefly, each cancer sample is normalised together with a set of samples from sorted normal myeloid populations. Next, using a PCA-based method, the 5 closest normal samples from the cancer sample are averaged and this computed normal sample are next compared to the cancer sample allowing for computation of gen expression fold changes. See Supplementary Methods and Rapin et al. (10).

4 All sampleswhere

normalised using RMA. Comparison of gene expression values is not possible with other datasets in Bloodspot.

See our previous work (Bagger et al. (3)).

6 The data were processed using the bcbio nextgen RNA-seq pipeline. Count data were subsequently processed with DESeq2's variance stabilising transformation method.

7 The data was batch corrected using ComBat, taking study number as batch.

Normalisation method legend: 1 Each cancer sample is normalised together with a set of samples from sorted normal myeloid populations. All samples where normalised using RMA. Comparison of gene expression values is not possible with other data sets in Bloodspot. 2 All samples from the ImmGen data sets were normalised together with RMA. Samples were subsequently attributed to the different data sets in BloodSpot. This means that comparison of gene expression values is possible across all ImmGen data sets. 3 The data are normalised according to Rapin et al. Briefly, each cancer sample is normalised together with a set of samples from sorted normal myeloid populations. Next, using a PCA-based method, the 5 closest normal samples from the cancer sample are averaged and this computed normal sample are next compared to the cancer sample allowing for computation of gen expression fold changes. See Supplementary Methods and Rapin et al. (10). 4 All sampleswhere normalised using RMA. Comparison of gene expression values is not possible with other datasets in Bloodspot. 5 See our previous work (Bagger et al. (3)). 6 The data were processed using the bcbio nextgen RNA-seq pipeline. Count data were subsequently processed with DESeq2's variance stabilising transformation method. 7 The data was batch corrected using ComBat, taking study number as batch.

BloodPool

One new feature of BloodSpot is BloodPool, an aggregated and integrated data set grouping the results of multiple studies focusing on AML. By means of our batch correction methods this data set can be used to study gene expression (programs) in AML in comparison with healthy corresponding cells (see Figure 1). Using the computational method developed in Rapin et al. (10), we have also computed gene expression fold changes relative to their nearest normal counterparts for all AML profiles in BloodPool. BloodPool is available for browsing within BloodSpot and can be selected as any of the other available data sets.

Figure 1.

Principal component analysis (PCA) plot of BloodPool samples. (A) before batch correction, (B) after batch correction. Batches are coloured by study of origin.

MSigDB and CMAP gene signatures integration

We collected all gene signatures available from the Molecular Signatures Database (MSigDB) (13) (version 4.0) (http://www.broadinstitute.org/gsea/msigdb/) and computed, for each signature, the mean expression values for all samples in all data sets. These mean values summarise the expression of a signature for each sample. Connectivity map (CMAP) (13) signatures were generated with the rank matrix provided by the database. For each combination of compound and concentration, we reported the top and bottom 500 genes and produced gene signatures. The data displayed in BloodSpot represent the mean value of all genes in a given signature.

Data normalisation

All data were normalised and batch corrected to eliminate potential lab batch effects. For this we performed Robust Multi-array Average (RMA) (14) normalisation of all microarray .CEL data files partitioned by origin, and next applied ComBat (http://jlab.byu.edu/ComBat/) (12) an empirical Bayes method implemented in the R language. The batches were defined to be the study name/number, while the covariates was assigned to the relevant cell type. The resulting integrated gene expression databases can be visualised directly or compared to external samples provided by the user. See Tables 1–3 for an overview of the data presented in BloodSpot and the normalisation procedure used. All AML data sets available in BloodSpot are normalised according to Rapin et al. (10) and further batch corrected using ComBat when necessary. This processing schema ensures that the samples are normalised in the context of normal haematopoiesis and according to state of the art batch correction methods, regardless of the origin of the data. For RNA-seq data, we used the Blue Collar Bioinformatics RNA-seq pipeline (mapping on mm10 mouse genome with TopHat version 2 (15), (https://bcbio-nextgen.readthedocs.org/)) to obtain normalised count data from raw fastq files from Lara-Astiaso et al. (16). We report count data processed using the variance stabilising transformation method from the DESeq2 package (17).

Abbreviations and sample annotations

Abbreviations for all cell types can be found below the plot by clicking the ‘Abbreviations’ link. Typically, the user can find more detailed information about each cell type such as a longer, more informative name, and for healthy cells data sets the immunophenotype, when available. Links to the raw unprocessed data can also be found here.

Available genes

The server is restricted to genes found in our database of Affymetrix Human 133U plus 2, Affymetrix Human 133UA and Affymetrix Human 133UB chips for human, and GeneChip Mouse Genome 430 2.0 and Affymetrix Mouse Gene 1.0 ST Arrays for mouse. For the RNA-seq data set UCSC annotation for the mm10 genome was used. In order to handle gene aliases, a dictionary of gene aliases was constructed from NCBI ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/ and The HUGO Gene Nomenclature Committee (HGNC) www.genenames.org. Ambiguous gene aliases were not included when constructing the dictionary. The alias conversion is only used when the query is not an official gene symbol or probe name. The end result allows for greater flexibility regarding gene names input and faster browsing.

FUNCTIONALITY UPDATES

Both the back-end and the front-end have been completely redesigned for interactive usage and speed of execution. The interface is built with a range of new functionalities, with a focus on simplicity of use (see Figure 2).

Figure 2.

BloodSpot interface details. After a gene alias is submitted to display its expression pattern, any of the top three panels can be clicked to magnify content. The three panels show, from left to right, a survival plot based on a high-quality AML data set displaying a full Kaplan–Meier analysis for any query gene or gene signature, an improved jitter strip chart of gene-expression plot that draws from bar plots and violin plots and an interactive hierarchical tree that shows the relationship between the samples displayed and allows changing the focus of the display. The Select Population button allows the user to select which populations to display. The Gene Correlations button shows in a table how much other genes or gene signatures correlate with the displayed gene. It is possible to click on the genes in the table to display their expression profile. The Print as PDF button allows the user to export the current plot in PDF format. The T-Test button allows you to perform significance test between pairs of populations (legend is as follows: NS: non significant; *P < 0.05; **P < 0.01; ***P < 0.001). The Export Data as Text button allows you to export the raw data as text (CSV format). The Upload your own sample button allows for the upload of an Affymetrix HU133 plus 2.0 .CEL file and for viewing it in the context of normal haematopoiesis. The drop down menu in the upper right corner of the main plot can be used to select a probe representing the gene of interest; by default, the probe with the highest intensity is chosen. At the bottom of the main plot, a list of abbreviations is available that includes immunophenotypes when applicable.

Unified input

BloodSpot takes a single gene name (or unambiguous gene alias) or gene signature name as query. Users can search for keywords such as ‘carcinomas’ or ‘cell cycle’ and will be provided with a list of matching gene signature names. When relevant, it is possible to select which probe-set to display from the list in the upper right corner of the main plot. By default, the probe with the overall highest intensity is at the top of the list. The option ‘Max probe’ will use the one probe with the highest intensity within each population.

Default plot

When visiting the interface the plot at the centre of the screen in the default view. This representation is a novel improved jitter strip chart of gene expression, a swift novel visualisation plot that draws from bar plots and violin plots where the jitter is controlled by the density of samples and normalised over all the columns in the chart. Thus the width of the data cloud shows how many samples have similar values (see Figure 3A and a comparison to existing data plot types in Supplementary Figure S1). For more details on this visualisation method please see (Sidiropoulos, N., Sohi, S.H., Rapin, N. and Bagger, F.O. (2015) SinaPlot: an enhanced chart for simple and truthful representation of single observations over multiple classes. bioRxiv, http://dx.doi.org/10.1101/028191). Both an R-package and a webserver have been implemented for those interested in make use of this plot type that we have named SinaPlot.

Figure 3.

Main plots from BloodSpot for MEIS1. (A) Default view in BloodSpot. The plot is a novel improved jitter strip chart of gene expression that draws from bar plots and violin plots where the jitter is controlled by the density of samples and normalised over all the columns in the chart. (B) Survival plot based on a high-quality AML data set from The Cancer Genome Atlas (TCGA). It displays a full Kaplan–Meier analysis of survival. The survival plots are only available for human data sets, sharing probes with the microarray platform used by the TCGA. (C) Interactive hierarchical tree that shows the relationship between the samples displayed. Hovering over the nodes provides the full names of cell populations. Nodes can be clicked to collapse a branch of the tree—this will also update the default plot in the middle and remove the same populations there. The colour in the nodes represents the median expression of the queried gene. To accentuate the display in the trees, node size is also proportional to gene expression. Trees are based on literature (hierarchical differentiation), or overall sample correlation (correlation of samples). (D) Example table of genes and gene signatures correlating with MEIS1 expression in the default data set. This table appears when the user clicks on the ‘correlation’ button.

Survival plot

The chart shown to the left of the BloodSpot interface is a survival plot based on a high-quality AML data set from The Cancer Genome Atlas (TCGA). It displays a full Kaplan–Meier analysis of survival. The survival plots are only available for human data sets, sharing probes with the microarray platform used by the TCGA (Affymetrix U133 Plus 2) (see Figure 3B).

Tree plot

The chart shown to the right of the BloodSpot interface is an interactive hierarchical tree that shows the relationship between the samples displayed and allows changing the focus of the display. It is possible to mouse over the nodes to get the full name for long names. Nodes can be clicked to collapse a branch of the tree—this will also update the default plot in the middle and remove the same populations there (see Figure 3C).

Correlation of genes and gene signatures

For each gene and signature in every data set, we report the top correlating genes or signatures. Taking the haematopoietic fingerprint (e.g. the expression value of one gene over all haematopoietic cells) of all probe-sets and gene signatures in a given data set, we calculated the correlation matrix (Pearson) and present the highest positive and negative correlating genes/signatures. This feature allows for investigation of new associations between putative co-regulated genes or signatures that exhibit similar or inverse expression patterns over the course of haematopoiesis (see Figure 3D).

Other built-in tools

Cell populations may be removed from the graphs using the ‘Select population’ button. The current plot displayed can be exported as PDF in publication-ready quality using the ‘Print as PDF’ button. The ‘T-Test’ button can be used to add the results from a students t-test for significance between pairs of populations to the plot. The legend is as following: NS: non-significant; *P < 0.05; **P < 0.01; ***P < 0.001. The significance marks relies on t statistics for unequal sample sizes but assuming equal variance and the critical values are compared with a two-tailed probability. Finally, raw data can be exported as CSV using the ‘Export Data as Text’ button.

Upload sample

By clicking the ‘Upload sample’ button it is possible to analyse user-supplied samples produced on the Affymetrix U133 plus 2 platform. Significantly, doing so allows for the comparison of any myeloid microarray data to normal human haematopoiesis. The resulting analysis is then displayed in a private session in the framework of BloodSpot along with a principal component analysis that shows the location of the uploaded sample in the hematopoietic sample space. The analysis is anonymous and requires no login. The resulting data set, including the uploaded sample, can then be queried along with the default data sets in a private session. All names and array information are stripped from the uploaded file before creating the database for the user session. Hence, the uploaded sample in the private session will appear simply as S_1 in all charts. The private sessions and uploaded data are deleted every day at GMT 1.30 pm.

EXAMPLES OF USE OF BLOODSPOT

To demonstrate the use of BloodSpot, we provide in the following section an example relying on data and analysis provided by the database. MEIS1 is part of a transcriptional program required for the maintenance of MLL-rearranged AML (18). The expression of this gene is therefore often up-regulated in MLL leukaemias. Using Bloodspot, we investigated the expression pattern of MEIS1, and found it to be expressed at high levels in stem cells with decreasing expression as the cells differentiate (Figure 3A and C). Using the correlation function, we find that MEIS1 expression also correlates with the expression patterns of a number of Homeobox genes, including HOXA3, HOXA9 and HOXA10 which are also typically expressed early during haematopoiesis (19) (Figure 3D). Switching to the BloodPool data set, MEIS1 is found to be up-regulated in MLL leukaemias (Figure 4). Although the P-value in the survival plot does not reach statistical significance (0.055; see Figure 3B), the influence of MEIS1 expression in leukemic patients may be of potential relevance.

Figure 4.

MEIS1 expression relative to the nearest normal counterpart in different AML subtypes, including MLL-rearranged AML.

DISCUSSION

Here we have presented a web-based database that allows for browsing of haematopoietic gene-expression fingerprints in human, murine and malignant haematopoiesis in a large number of high-quality data set containing several hematopoietic cell types. The tool facilitates the easy assessment of gene-expression data and how this links to patient survival, investigation of gene-expression signatures, as well as analysis of user generated data and export of data and figures. Focusing on simplicity, BloodSpot has features that allow clinicians or biologists to quickly retrieve relevant information on the expression of specific genes/pathways, and further explore co-regulated patterns of gene-expression as well as impact on patient survival. Our statistical framework supports the upload of user-generated patient data for integration and comparison with our database of healthy cells. This will allow assessment of the origin of the blast population in AML patients as well as assessment of well known and novel genetic markers in the context of normal haematopoiesis, both of which could be important for stratification of difficult patient cases. We have also integrated the largest pool of AML patient microarray samples to date and have computed gene expression fold changes for these profiles, thanks to our cancer versus normal method previously described in (10) and curation and labelling of external data followed by ComBat (12). In conclusion, we have curated and populated a database and developed an analysis platform, which will allow researchers as well as clinicians to access and analyse gene expression data related to both normal and malignant haematopoiesis. We believe that the database should be of interest to all researchers and clinicians interested in haematopoiesis, leukaemia, basic immunology and gene expression in developmental systems. Additional to information on gene-expression BloodSpot addresses two key questions, namely, how gene-expression patterns of single genes impact on patient survival, and which other genes display similar expression patterns in the haematopoietic system. Thus the platform will help broaden the basis on which to generate hypotheses about potential therapeutic targets and expand the understanding of co-regulated genes and pathways, to support experimental findings from animal model systems.

AVAILABILITY

Bloodspot is accessible at www.bloodspot.eu

Table 2.

Data sets for leukemic patients

Data set	Organism	Source	Patient numbers	Cell types	Reference
AML Normal Karyotype data sets	Human AML	GSE15434	251	NK-AML, WBM	Kohlman et al. (28)
AML TCGA data sets	Human AML	TCGA	183	Various genetic aberrations, including t(8;21), inv(16), t(15;17), t(11q23), complex karyotype, WBM	TCGA (9)
Leukemia MILE study	Human AML, ALL, CML, CLL and MDS	GSE13159	2096	AML, ALL and preleukemic stages.	Haferlach et al. (29,30)
AML versus normal	Human AML	GSE6891, GSE13159	91	NK-AML, WBM	de Jonge et al. (31,32)
			251
Bloodpool	Human AML	GSE13159, GSE15434, TCGA, GSE61804, GSE14468	2076	Mainly AML, ALL and preleukemic stages.	all references above

34 in total

1. Gene expression profiling in AML with normal karyotype can predict mutations for molecular markers and allows novel insights into perturbed biological pathways.

Authors: A Kohlmann; L Bullinger; C Thiede; M Schaich; S Schnittger; K Döhner; M Dugas; H-U Klein; H Döhner; G Ehninger; T Haferlach
Journal: Leukemia Date: 2010-04-29 Impact factor: 11.528

2. Hematopoietic fingerprints: an expression database of stem cells and their progeny.

Authors: Stuart M Chambers; Nathan C Boles; Kuan-Yin K Lin; Megan P Tierney; Teresa V Bowman; Steven B Bradfute; Alice J Chen; Akil A Merchant; Olga Sirin; David C Weksberg; Mehveen G Merchant; C Joseph Fisk; Chad A Shaw; Margaret A Goodell
Journal: Cell Stem Cell Date: 2007-11 Impact factor: 24.633

3. TGIF1 is a negative regulator of MLL-rearranged acute myeloid leukemia.

Authors: A Willer; J S Jakobsen; E Ohlsson; N Rapin; J Waage; M Billing; L Bullinger; S Karlsson; B T Porse
Journal: Leukemia Date: 2014-10-28 Impact factor: 11.528

4. CCAAT/enhancer binding protein alpha (C/EBP(alpha))-induced transdifferentiation of pre-B cells into macrophages involves no overt retrodifferentiation.

Authors: Alessandro Di Tullio; Thien Phong Vu Manh; Alexis Schubert; Giancarlo Castellano; Robert Månsson; Thomas Graf
Journal: Proc Natl Acad Sci U S A Date: 2011-10-03 Impact factor: 11.205

5. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors: Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal: Proc Natl Acad Sci U S A Date: 2005-09-30 Impact factor: 11.205

6. Clinical utility of microarray-based gene expression profiling in the diagnosis and subclassification of leukemia: report from the International Microarray Innovations in Leukemia Study Group.

Authors: Torsten Haferlach; Alexander Kohlmann; Lothar Wieczorek; Giuseppe Basso; Geertruy Te Kronnie; Marie-Christine Béné; John De Vos; Jesus M Hernández; Wolf-Karsten Hofmann; Ken I Mills; Amanda Gilkes; Sabina Chiaretti; Sheila A Shurtleff; Thomas J Kipps; Laura Z Rassenti; Allen E Yeoh; Peter R Papenhausen; Wei-Min Liu; P Mickey Williams; Robin Foà
Journal: J Clin Oncol Date: 2010-04-20 Impact factor: 44.544

7. Transcriptional profiling of stroma from inflamed and resting lymph nodes defines immunological hallmarks.

Authors: Deepali Malhotra; Anne L Fletcher; Jillian Astarita; Veronika Lukacs-Kornek; Prakriti Tayalia; Santiago F Gonzalez; Kutlu G Elpek; Sook Kyung Chang; Konstantin Knoblich; Martin E Hemler; Michael B Brenner; Michael C Carroll; David J Mooney; Shannon J Turley
Journal: Nat Immunol Date: 2012-04-01 Impact factor: 25.606

8. CODEX: a next-generation sequencing experiment database for the haematopoietic and embryonic stem cell communities.

Authors: Manuel Sánchez-Castillo; David Ruau; Adam C Wilkinson; Felicia S L Ng; Rebecca Hannah; Evangelia Diamanti; Patrick Lombard; Nicola K Wilson; Berthold Gottgens
Journal: Nucleic Acids Res Date: 2014-09-30 Impact factor: 19.160

9. Deciphering the transcriptional network of the dendritic cell lineage.

Authors: Jennifer C Miller; Brian D Brown; Tal Shay; Emmanuel L Gautier; Vladimir Jojic; Ariella Cohain; Gaurav Pandey; Marylene Leboeuf; Kutlu G Elpek; Julie Helft; Daigo Hashimoto; Andrew Chow; Jeremy Price; Melanie Greter; Milena Bogunovic; Angelique Bellemare-Pelletier; Paul S Frenette; Gwendalyn J Randolph; Shannon J Turley; Miriam Merad
Journal: Nat Immunol Date: 2012-07-15 Impact factor: 25.606

10. BloodChIP: a database of comparative genome-wide transcription factor binding profiles in human blood cells.

Authors: Diego Chacon; Dominik Beck; Dilmi Perera; Jason W H Wong; John E Pimanda
Journal: Nucleic Acids Res Date: 2013-10-31 Impact factor: 16.971

123 in total

1. The cell polarity determinant CDC42 controls division symmetry to block leukemia cell differentiation.

Authors: Benjamin Mizukawa; Eric O'Brien; Daniel C Moreira; Mark Wunderlich; Cindy L Hochstetler; Xin Duan; Wei Liu; Emily Orr; H Leighton Grimes; James C Mulloy; Yi Zheng
Journal: Blood Date: 2017-08-04 Impact factor: 22.113

2. RNA binding protein MSI2 positively regulates FLT3 expression in myeloid leukemia.

Authors: Ayuna Hattori; Daniel McSkimming; Natarajan Kannan; Takahiro Ito
Journal: Leuk Res Date: 2017-01-11 Impact factor: 3.156

3. Neonatal expression of RNA-binding protein IGF2BP3 regulates the human fetal-adult megakaryocyte transition.

Authors: Kamaleldin E Elagib; Chih-Huan Lu; Goar Mosoyan; Shadi Khalil; Ewelina Zasadzińska; Daniel R Foltz; Peter Balogh; Alejandro A Gru; Deborah A Fuchs; Lisa M Rimsza; Els Verhoeyen; Miriam Sansó; Robert P Fisher; Camelia Iancu-Rubin; Adam N Goldfarb
Journal: J Clin Invest Date: 2017-05-08 Impact factor: 14.808

4. SLAMF7 is critical for phagocytosis of haematopoietic tumour cells via Mac-1 integrin.

Authors: Jun Chen; Ming-Chao Zhong; Huaijian Guo; Dominique Davidson; Sabrin Mishel; Yan Lu; Inmoo Rhee; Luis-Alberto Pérez-Quintero; Shaohua Zhang; Mario-Ernesto Cruz-Munoz; Ning Wu; Donald C Vinh; Meenal Sinha; Virginie Calderon; Clifford A Lowell; Jayne S Danska; André Veillette
Journal: Nature Date: 2017-04-19 Impact factor: 49.962

5. Integrating Proteomics and Transcriptomics for Systematic Combinatorial Chimeric Antigen Receptor Therapy of AML.

Authors: Fabiana Perna; Samuel H Berman; Rajesh K Soni; Jorge Mansilla-Soto; Justin Eyquem; Mohamad Hamieh; Ronald C Hendrickson; Cameron W Brennan; Michel Sadelain
Journal: Cancer Cell Date: 2017-10-09 Impact factor: 31.743

6. Oncogenic role and therapeutic targeting of ABL-class and JAK-STAT activating kinase alterations in Ph-like ALL.

Authors: Kathryn G Roberts; Yung-Li Yang; Debbie Payne-Turner; Wenwei Lin; Jacob K Files; Kirsten Dickerson; Zhaohui Gu; Jack Taunton; Laura J Janke; Taosheng Chen; Mignon L Loh; Stephen P Hunger; Charles G Mullighan
Journal: Blood Adv Date: 2017-08-30

7. A specialized pathway for erythroid iron delivery through lysosomal trafficking of transferrin receptor 2.

Authors: Shadi Khalil; Maja Holy; Stephen Grado; Robert Fleming; Ryo Kurita; Yukio Nakamura; Adam Goldfarb
Journal: Blood Adv Date: 2017-06-27

8. The Tetraspanin CD53 Regulates Early B Cell Development by Promoting IL-7R Signaling.

Authors: Zev J Greenberg; Darlene A Monlish; Rachel L Bartnett; Yihu Yang; Guomin Shen; Weikai Li; Jeffrey J Bednarski; Laura G Schuettpelz
Journal: J Immunol Date: 2019-11-20 Impact factor: 5.422

9. High expression of ABCG2 induced by EZH2 disruption has pivotal roles in MDS pathogenesis.

Authors: K C Kawabata; Y Hayashi; D Inoue; H Meguro; H Sakurai; T Fukuyama; Y Tanaka; S Asada; T Fukushima; R Nagase; R Takeda; Y Harada; J Kitaura; S Goyama; H Harada; H Aburatani; T Kitamura
Journal: Leukemia Date: 2017-07-19 Impact factor: 11.528

10. Activation of the Intracellular Pattern Recognition Receptor NOD2 Promotes Acute Myeloid Leukemia (AML) Cell Apoptosis and Provides a Survival Advantage in an Animal Model of AML.

Authors: Nathaniel J Buteyn; Ramasamy Santhanam; Giovanna Merchand-Reyes; Rakesh A Murugesan; Gino M Dettorre; John C Byrd; Anasuya Sarkar; Sumithira Vasu; Bethany L Mundy-Bosse; Jonathan P Butchar; Susheela Tridandapani
Journal: J Immunol Date: 2020-02-24 Impact factor: 5.422