| Literature DB >> 29204516 |
B Ogan Mancarci1,2,3, Lilah Toker2,3, Shreejoy J Tripathy2,3, Brenna Li2,3, Brad Rocco4,5, Etienne Sibille4,5, Paul Pavlidis2,3.
Abstract
Establishing the molecular diversity of cell types is crucial for the study of the nervous system. We compiled a cross-laboratory database of mouse brain cell type-specific transcriptomes from 36 major cell types from across the mammalian brain using rigorously curated published data from pooled cell type microarray and single-cell RNA-sequencing (RNA-seq) studies. We used these data to identify cell type-specific marker genes, discovering a substantial number of novel markers, many of which we validated using computational and experimental approaches. We further demonstrate that summarized expression of marker gene sets (MGSs) in bulk tissue data can be used to estimate the relative cell type abundance across samples. To facilitate use of this expanding resource, we provide a user-friendly web interface at www.neuroexpresso.org.Entities:
Keywords: RNA sequencing; cell type; gene expression; marker gene; microarray
Mesh:
Year: 2017 PMID: 29204516 PMCID: PMC5707795 DOI: 10.1523/ENEURO.0212-17.2017
Source DB: PubMed Journal: eNeuro ISSN: 2373-2822
Figure 1.Mouse brain cell type-specific expression database compiled from publicly available datasets. , Workflow of the study. Cell type-specific expression profiles are collected from publicly available datasets and personal communications. Acquired samples are grouped based on cell type and brain region. Marker genes are selected per brain region for all cell types. Marker genes are biologically and computationally validated and used in estimation of cell type proportions. , Brain region hierarchy used in the study. Samples included in a brain region based on the region they were extracted from. For instance, dopaminergic cells isolated from the midbrain were included when selecting marker genes in the context of brainstem and whole brain, and microglia extracted from whole brain isolates were added to all brain regions.
Cell types in NeuroExpresso database
| Astrocyte | 9/1* | 94** | GSE9566 ( |
| Oligodendrocyte | 25/1* | 22** | GSE48369, ( |
| Microglia | 3/1* | 131** | GSE29949 ( |
| FS Basket (G42) | 13/5* | 18 | GSE17806 ( |
| Martinotti (GIN) | 3/1* | 15 | GSE2882 ( |
| VIPReln (G30) | 6/1* | 33 | GSE2882 ( |
| Pan-pyramidal*** | 9/17 * | 35 | See below |
| Pyramidal cortico-thalamic | 3/2* | 2 | GSE2882 ( |
| Pyramidal Glt25d2 | 3/2* | 3 | GSE35758 ( |
| Pyramidal S100a10 | 3/4* | 2 | GSE35751 ( |
| Layer 2 3 Pyra | 2* | 3 | GSE71585 ( |
| Layer 4 Pyra | 3* | 5 | GSE71585 ( |
| Layer 6a Pyra | 2* | 6 | GSE71585 ( |
| Layer 6b Pyra | 2* | 9 | GSE71585 ( |
| OPs | 1* | 184 | GSE71585 ( |
| Endothelial | 2* | 178 | GSE71585 ( |
| Forebrain cholinergic | 3 | 90 | GSE13379 ( |
| Forebrain cholinergic | 3 | 45 | GSE13379 ( |
| Medium spiny neurons | 39 | 74 | GSE13379 ( |
| Glutamatergic | 3 | 10 | GSE2882 ( |
| Pyramidal Thy1 Amyg | 12 | 21 | GSE2882 ( |
| DentateGranule | 3 | 17 | GSE11147 ( |
| GabaSSTReln | 3 | 54 | GSE2882 ( |
| Pyramidal Thy1 Hipp | 12 | 17 | GSE2882 ( |
| Ependymal | 2 | 50 | GSE18765 ( |
| GabaReln | 3 | 53 | GSE2882 ( |
| Hypocretinergic | 4 | 35 | GSE38668 ( |
| Thalamus cholinergic | 3 | 40 | GSE43164 ( |
| Midbrain cholinergic | 3 | 34 | GSE13379 ( |
| Serotonergic | 3 | 18 | GSE36068 ( |
| Dopaminergic | 30 | 58** | No accession **** ( |
| Noradrenergic | 9 | 133 | GSE8720 ( |
| Basket | 16 | 6 | GSE13379 ( |
| Bergmann | 3 | 52 | GSE13379 ( |
| Cerebellar granule cells | 3 | 11 | GSE13379 ( |
| Golgi | 3 | 26 | GSE13379 ( |
| Purkinje | 44 | 43 | GSE13379 ( |
| Spinal cord cholinergic | 3 | 124 | GSE13379 ( |
Sample count, number of samples that representing the cell type; gene count, number of marker genes detected for cell type; *, the number of clusters from RNA-seq data; **, marker genes for these cell types are identified in multiple regions displayed yet only the number of the genes that are found in the region specified on the table is shown for the sake of conservation of space. Astrocytes, microglia, and oligodendrocyte markers are identified in the context of all other brain regions (except cerebellum for astrocytes) and dopaminergic markers are also identified for midbrain; ***, pan-pyramidal is a merged cell type composed of all pyramidal samples; ****, data obtained directly from authors.
Matching single-cell RNA sequencing data from tasic to well-defined cell types
| Astrocyte | Astro Gja1 | Direct match | Astrocyte |
| Microglia | Micro Ctss | Direct match | Microglia |
| Oligodendrocyte | Oligo Opalin | Direct match | Oligodendrocyte |
| FS Basket (G42) | Pvalb Gpx3, Pvalb Rspo2, Pvalb Wt1, Pvalb Obox3, Pvalb Cpne5 | Definition: fast spiking pval positive interneurons | FS Basket (G42) |
| Martinotti (GIN) | Sst Cbln4 | Direct match | Martinotti (GIN) |
| VIPReln (G30) | Vip Sncg | Unique Vip and Sncg expression, high Sncg expression in microarray cell type | VIPReln (G30) |
| Pyramidal Glt25d2 | L5b Tph2, L5b Cdh13 | Definition: Glt25d2 positive Fam84b positive | Pyramidal Glt25d2 |
| Pyramidal S100a10 | L5a Hsd11b1, L5a Batf3, L5a Tcerg1l, L5a Pde1c | Definition: S100a10 expressing cells from layer 5a | Pyramidal S100a10 |
| Pyramidal CrtThalamic | L6a Car12, L6a Syt17 | Direct match | Pyramidal CrtThalamic |
| — | Endo Myl9, Endo Tbc1d4 | New cell type | Endothelial |
| — | OPC Pdgfra | New cell type | OPCs |
| — | L4 Ctxn3, L4 Scnn1a, L4 Arf5 | New cell type | Layer 4 Pyra |
| — | L2 Ngb, L2/3 Ptgs2 | New cell type | Layer 2 3 Pyra |
| — | L6a Mgp, L6a Sla | New cell type | Layer 6a Pyra |
| — | L6b Serpinb11, L6b Rgs12 | New cell type | Layer 6b Pyra |
List of molecular cell types identified by Tasic et al. (2016) and their corresponding cell types in NeuroExpresso. Matching method column defines how the matching was performed. Direct matches are one to one matching between the definition provided by Tasic et al. (2016) for the molecular cell types and definition provided by microarray samples. For “definition” matches, description of the cell type in the original source is used to find molecular cell types that fit the definition. VIPReln, Vip Sncg matching was done based on unique Sncg expression in VIPReln cells in the microarray data. New cell types are well defined cell types that have no counterpart in microarray data.
Figure 7.MGPs reveal cell type-specific changes in whole tissue data. , Estimation of cell type profiles for cortical cells in frontal cortex and white matter. Values are normalized to be between 0 and 1. (***p < 0.001). , left, Oligodendrocyte MGPs in Stanley C cohort. Right, Morphology-based oligodendrocyte counts of Stanley C cohort. Figure adapted from Uranova et al. (2004). , Estimations of dopaminergic cell MGPs in substantia nigra of controls and PD patients. Values are relative and are normalized to be between 0 and 1 and are not reflective of absolute proportions (**p < 0.01, ***p < 0.001).
Figure 2.The NeuroExpresso.org web application. The application allows easy visualization of gene expression across cell types in brain regions. Depicted is the expression of cell types from neocortex region. Alternatively, cell types can be grouped based on their primary neurotransmitter or the purification type. The application can be reached at www.neuroexpresso.org.
Figure 3.Marker genes are selected for mouse brain cell types and used to estimate cell type profiles. , Expression of top marker genes selected for cell cortical cell types in cell types represented by RNA-seq (left) and microarray (right) data in NeuroExpresso. Expression levels were normalized per gene to be between 0 and 1 for each dataset. , Expression of Fam114a1 in neocortex in microarray (top) and RNA-seq (bottom) datasets. Fam114a1 is a proposed fast spiking basket cell marker. It was not selected as a marker in this study due to its high expression in oligodendrocytes and S100a10 expressing pyramidal cells that were both absent from the original study.
Figure 4.Validation of candidate markers using the ABA. , ISH images from the ABA. Rightmost panels show the location of the image in the brain according to the Allen Brain mouse reference atlas. Panels on the left show the ISH image and normalized expression level of known and novel dentate gyrus granule cell (upper panels) and Purkinje cell (lower panels) markers. , Validation status of marker genes detected for Purkinje and dentate gyrus granule cells. Figures used for validation and validation statuses of individual marker genes can be found in Extended Data (Extended Data Fig. 4-1,2,3,4).
Figure 5.Single-plane image of mouse sensorimotor cortex labeled for Pvalb, Slc32a1, and Cox6a2 mRNAs and counterstained with NeuroTrace. Arrows indicate Cox6a2+ neurons. Scale bar: 10 µm.
Coexpression of cortical MGSs in single-cell RNA-seq data
| Cell types | Gene count | Gene count | ||
|---|---|---|---|---|
| Endothelial | 180 | 157 | ||
| Astrocyte | 282 | 239 | ||
| Microglia | 248 | 201 | ||
| Oligodendrocyte | 156 | 201 | ||
| OPCs | 0.831 | 193 | 0.999 | 203 |
| FS Basket (G42) | 26 | 26 | ||
| Martinotti (GIN) | 21 | 20 | ||
| VIPReln (G30) | 43 | 36 | ||
| Pyramidal | 34 | 27 | ||
Statistics were calculated by Wilcoxon rank-sum test.
Figure 6.NeuroExpresso reveals novel gene expression patterns. , Expression of cholinergic, GABAergic, and glutamatergic markers in cholinergic cells from forebrain and thalamus. Forebrain cholinergic neurons express GABAergic markers while thalamus (hubenular) cholinergic neurons express glutamatergic markers. , left, Expression of Ddc in oligodendrocyte samples from Cahoy et al. (2008), Doyle et al. (2008), and Fomchenko et al. (2011) datasets and in comparison to dopaminergic cells and other (nonoligodendrocyte) cell types from the neocortex in the microarray dataset. In all three datasets, expression of Ddc in oligodendrocytes is comparable to expression in dopaminergic cells and is higher than in any of the other cortical cells. Oligodendrocyte samples show higher than background levels of expression across datasets. Right, Ddc expression in oligodendrocytes, OPCs, and other cell types from Tasic et al. (2016) single-cell dataset. , Bimodal gene expression in two dopaminergic cell isolates by different labs. Genes shown are labeled as marker genes in the context of midbrain if the two cell isolates are labeled as different cell types.
Summaries of statistical analyses
| Frontal cortex ( | White matter ( | Group comparison | ||||||
|---|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | W | ||||
| Endothelial | 0.265 | 0.117 | 0.64 | 0.112 | 42 | |||
| Astrocyte | 0.401 | 0.135 | 0.757 | 0.101 | 136 | |||
| Microglia | 0.179 | 0.092 | 0.708 | 0.135 | 4 | |||
| Oligodendrocyte | 0.226 | 0.107 | 0.815 | 0.087 | 2 | |||
| Olig. precursors | 0.215 | 0.123 | 0.817 | 0.078 | 0 | |||
| FS Basket (G42) | 0.865 | 0.081 | 0.27 | 0.115 | 7744 | |||
| VIPReln (G30) | 0.792 | 0.102 | 0.288 | 0.142 | 7718 | |||
| Pyramidal | 0.877 | 0.062 | 0.212 | 0.112 | 7744 | |||
|
| ||||||||
| Mean | SD | W (vs control) | ||||||
| Schizophrenia ( | 0.598 | 0.129 | 75 | 0.013 | ||||
| Bipolar ( | 0.334 | 0.242 | 102 | |||||
| Depression ( | 0.386 | 0.13 | 89 | |||||
| Control ( | 0.78 | 0.146 | NA | NA | ||||
|
| ||||||||
|
| ||||||||
|
| ||||||||
| PD | Control | Group comparison | ||||||
| Mean | SD | Mean | SD | W | ||||
| Lesnick | 0.26 | 0.179 | 16 | 0.578 | 0.263 | 9 | 119 | 0.007 |
| Moran lateral | 0.174 | 0.135 | 9 | 0.665 | 0.246 | 7 | 60 | 0.001 |
| Moran medial | 0.305 | 0.191 | 15 | 0.799 | 0.191 | 8 | 115 | |
| Zhang | 0.201 | 0.101 | 10 | 0.489 | 0.287 | 18 | 148 | 0.004 |
All statistics were calculated by Wilcoxon rank-sum test.