| Literature DB >> 33883145 |
Youjin Lee1,2,3,4, Derek Bogdanoff5,6, Yutong Wang7,8, George C Hartoularos9,10,11, Jonathan M Woo12,2,3,4, Cody T Mowery12,2,3,4,13,14, Hunter M Nisonoff8, David S Lee3,10,11, Yang Sun3,10,11, James Lee15, Sadaf Mehdizadeh2, Joshua Cantlon16, Eric Shifrut12,2,3,4, David N Ngyuen12,2,3,4,17, Theodore L Roth12,2,3,13,14, Yun S Song18,19,20, Alexander Marson1,2,3,4,17,20,21,22,11,23, Eric D Chow24,6, Chun Jimmie Ye25,20,22,11,23,26,27.
Abstract
Single-cell RNA sequencing (scRNA-seq) of tissues has revealed remarkable heterogeneity of cell types and states but does not provide information on the spatial organization of cells. To better understand how individual cells function within an anatomical space, we developed XYZeq, a workflow that encodes spatial metadata into scRNA-seq libraries. We used XYZeq to profile mouse tumor models to capture spatially barcoded transcriptomes from tens of thousands of cells. Analyses of these data revealed the spatial distribution of distinct cell types and a cell migration-associated transcriptomic program in tumor-associated mesenchymal stem cells (MSCs). Furthermore, we identify localized expression of tumor suppressor genes by MSCs that vary with proximity to the tumor core. We demonstrate that XYZeq can be used to map the transcriptome and spatial localization of individual cells in situ to reveal how cell composition and cell states can be affected by location within complex pathological tissue.Entities:
Mesh:
Year: 2021 PMID: 33883145 PMCID: PMC8059935 DOI: 10.1126/sciadv.abg4755
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.957
Fig. 1XYZeq enables single-cell and spatial transcriptome profiling simultaneously.
(A) Schematic of the XYZeq workflow. (B) Schematic of XYZeq sequencing library structure. P5 and P7, Illumina adaptors; bp, base pairs; R1 and R2, annealing sites for Illumina sequencing primers. (C) Schematic representation of the mixed-species cell gradient pattern printed on the chip with 11 unique cell proportion ratios (see Methods for specific cell proportion ratios). (D) Scatterplot of mouse (x axis) and human (y axis) UMI counts detected from a mixture of HEK293T and NIH 3T3 cells after computational decontamination. Blue refers to human cells (n = 4182), red refers to mouse cells (n = 2220), and gray refers to collisions (n = 45). (E) Proportion of HEK293T (blue) cells, NIH 3T3 (red) cells, or collisions (gray) detected by XYZeq for each column of the microwell array.
Fig. 2Spatially resolved single-cell transcriptomes captured from tissue.
(A) Scatterplot of mouse (x axis) and human (y axis) UMI counts detected from liver/tumor tissues (n = 4) at 500 UMI cutoff after decontamination processing. Blue refers to human cells (n = 2657), red refers to mouse cells (n = 5707), and gray refers to collisions (n = 382). (B) Violin plots showing the number of detected UMIs (left) and genes (right) per mouse (red) and human (blue) cell. Median UMI counts for human cells: 1596; mouse cells: 1009. Median gene counts for human cells: 629; mouse cells: 456 across all liver/tumor slices. (C) H&E-stained image of the liver/tumor tissue slice. Tumor region, dark purple with yellow dotted outlines; liver region, pink. Scale bar, 2 mm. (D) Visualization of human (blue) and mouse (red) cell distribution on the XYZeq array overlayed on the H&E-stained slice.
Fig. 3Frequency and spatial mapping of single-cell clusters from tissue.
(A) tSNE visualization of the cell types identified from liver/tumor tissue. A total of 6623 total cells were plotted. (B) Heatmap of scaled marker gene expression and hierarchical clustering of genes that define each cell type from liver/tumor tissue. Reference for color bar in (A). (C) Correlations of pseudobulk expression values for matching cell types between XYZeq and 10x Genomics Chromium. (D) Spatial localization of hepatocytes, MC38, and myeloid cells overlaid on a bright-field image of tissue. Yellow dotted outline indicates tumor regions. (E) Pie chart of cell type composition for each XYZeq well from a representative liver/tumor tissue slice (top) and bar chart illustrating combined cell type composition across all four slices of liver/tumor tissue, which tracks with proximity to the tumor (bottom) (see Methods for proximity score). (F) Pairplot showing the frequency of hepatocytes, MC38, and myeloid cells in each well. Scatterplots show the colocalization of two cell types in each well. Histograms show the distribution of number of cells (x axis) per well (y axis) for each cell type. Pearson correlation (r) and P values are annotated.
Fig. 4Expression of gene modules in space that track with cellular composition.
(A) Projection of average expression of hepatocyte-enriched module (LM14) in tSNE space. Each dot is a cell and colored by the average expression of top contributing module genes (Materials and Methods). (B) Spatial expression of hepatocyte-enriched module (LM14). Each spatial well is colored by the average expression of the top contributing module genes weighted by the number of cells per well. Wells are binarized into high (above weighted average) versus low (all other nonzero expression). Yellow dotted outlines indicate tumor regions. (C) Heatmap representing the number of overlapping genes between each pair of modules in liver/tumor and spleen/tumor. Each row is an LM, and each column is an SM. (D) tSNE projection of XYZeq scRNA-seq data colored by annotated cell types in liver/tumor (top left) and spleen/tumor (bottom left) and mean gene expression of the top overlapping modules between liver/tumor (top row) and spleen/tumor (bottom row). Tumor response modules correspond to LM5 and SM12, and immune regulation modules correspond to LM19 and SM7. ECs, endothelial cells. Spatial projection visualizes the mean expression of the tumor response modules (E) corresponding to LM5 and SM12 and the immune regulation modules (F) corresponding to LM19 and SM7. Each well in (E) and (F) is colored by the average gene expression of each module weighted by the number of cells per well (high versus low), and yellow dotted outline indicates tumor regions. Wells are binarized into high (above weighted average) versus low (all other nonzero expression).
Fig. 5Differential gene expression within MSCs associated with their spatial proximity to tumor.
(A) Average expression of the cell migration modules (LM10 and SM17) in tSNE space. Each dot is a cell colored by its mean expression of the top module genes between corresponding liver/tumor and spleen/tumor modules. (B) XYZeq array colored by the tumor proximity score. Values near 1 (yellow) indicate regions rich in tumor, values near 0 (purple) indicate regions rich in nontumor cells, and wells capturing the border between the two tissue types take on values around 0.5 (blue/green). (C) MSCs colored by the cell-specific proximity score in tSNE space. (D) Row-clustered heatmap showing the scaled, mean gene expression in MSCs of genes enriched in three spatial regions (intratumor, boundary, and intratissue) along the one-dimensional proximity score. For spleen/tumor, statistically significant genes enriched in the tumor and nontumor regions are highlighted. (E) Log expression (y axis) of Csmd1 (left) and Tshz2 (right) along the proximity score (x axis). Each dot corresponds to one MSC cell, and the regression line is fitted using the negative binomial distribution (Materials and Methods). (F) Projection in space of mean expression of Csmd1 (left) and Tshz2 (right) in MSCs. Yellow dotted outline indicates tumor region.