| Literature DB >> 30395284 |
Jarny Choi1,2, Tracey M Baldwin1, Mae Wong3, Jessica E Bolden1,4, Kirsten A Fairfax1,4, Erin C Lucas1, Rebecca Cole1, Christine Biben1,4, Clare Morgan1,4, Kerry A Ramsay1, Ashley P Ng4,5, Maria Kauppi4,5, Lynn M Corcoran4,6, Wei Shi7,8, Nick Wilson3, Michael J Wilson3, Warren S Alexander4,5, Douglas J Hilton1,4, Carolyn A de Graaf1,4.
Abstract
During haematopoiesis, haematopoietic stem cells differentiate into restricted potential progenitors before maturing into the many lineages required for oxygen transport, wound healing and immune response. We have updated Haemopedia, a database of gene-expression profiles from a broad spectrum of haematopoietic cells, to include RNA-seq gene-expression data from both mice and humans. The Haemopedia RNA-seq data set covers a wide range of lineages and progenitors, with 57 mouse blood cell types (flow sorted populations from healthy mice) and 12 human blood cell types. This data set has been made accessible for exploration and analysis, to researchers and clinicians with limited bioinformatics experience, on our online portal Haemosphere: https://www.haemosphere.org. Haemosphere also includes nine other publicly available high-quality data sets relevant to haematopoiesis. We have added the ability to compare gene expression across data sets and species by curating data sets with shared lineage designations or to view expression gene vs gene, with all plots available for download by the user.Entities:
Year: 2019 PMID: 30395284 PMCID: PMC6324085 DOI: 10.1093/nar/gky1020
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Summary of RNA-seq data sets included in the repository
| Data set | Species | #Samp | MP | RP | Er | Mk | Ms | Ba | Eo | Nt | Mc | DC | B | T | IL | Ref |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Haemopedia-RNAseq-Mouse | Mouse | 129 | 3 | 11 | 6 | 1 | 1 | 3 | 6 | 2 | 5 | 2 | 8 | 8 | 1 | |
| Haemopedia-RNAseq-Human | Human | 42 | 1 | 1 | 2 | 3 | 2 | 2 | 1 | |||||||
| ImmGen-RNASeq | Mouse | 157 | 6 | 1 | 3 | 4 | 3 | 18 | 32 | 11 | ( | |||||
| Linsley | Human | 114 | 1 | 1 | 1 | 2 | 1 | ( |
#Samp = number of samples in data set. The lineage columns give the number of different cell types of that lineage included in the data set. MP = multipotential progenitor, RP = restricted potential progenitor, Er = erythrocyte lineage, Mk = megakaryocyte lineage, Ms = mast cell lineage, Ba = basophil lineage, Eo = eosinophil lineage, Nt = neutrophil lineage, Mc = macrophage lineage, DC = dendritic cell lineage, B = B cell lineage, T = T cell lineage, IL = innate lymphocyte lineage, Ref = publication reference.
Figure 1.Multi-dimensional scaling plot. Multi-dimensional scaling (MDS) plots of the (A) Haemopedia mouse RNA-seq and (B) Haemopedia human RNA-seq data sets. Plots were produced by Haemosphere, using the Euclidean distance between the 500 most variable genes in each data set, with the dimension reduction performed with principal coordinates analysis. Each dot represents a sample and they are coloured by lineage as shown in the key.
Figure 2.Multi-species plot. Example multi-species graphs from Haemosphere comparing Cd19 expression in the Haemopedia mouse RNA-seq data set to its orthologue CD19 in the Haemopedia human RNA-seq data set. Cell types have been assigned to lineages, and plotted with shared colours according to the colour scheme in Figure 1. Some related lineages, such as granulocytes, have been grouped together to simplify the view. On the website, hovering over each bar reveals the cell type name and clicking on the data set name takes the user to the gene's expression plot in that data set to display more detail.
Figure 3.Gene versus gene plot. Example of gene versus gene plot from Haemosphere using the Haemopedia mouse RNA-seq data set. The expression of the transcription factor Spi1 is plotted against Csf1r expression. Each dot represents average expression for a different cell type and is coloured according to lineage. The number of cell types in a lineage and the Pearson correlation within the lineage is shown in brackets in the legend.
Figure 4.Heatmap of genes with high expression in neutrophils. This heatmap was generated from haemosphere.org using the Haemopedia mouse RNA-seq data set and shows the 25 most enriched genes in the neutrophil lineage as selected by Haemosphere's ‘High Expression Search’ function. Squares are coloured by row based z score.