Literature DB >> 32573704

Transcriptional output, cell-type densities, and normalization in spatial transcriptomics.

Manuel Saiselet1, Joël Rodrigues-Vitória1, Adrien Tourneur1, Ligia Craciun2, Alex Spinette2, Denis Larsimont2, Guy Andry3, Joakim Lundeberg4,5, Carine Maenhaut1, Vincent Detours1.   

Abstract

Entities:  

Year:  2020        PMID: 32573704      PMCID: PMC7883818          DOI: 10.1093/jmcb/mjaa028

Source DB:  PubMed          Journal:  J Mol Cell Biol        ISSN: 1759-4685            Impact factor:   6.216


× No keyword cloud information.
Dear Editor, Spatial transcriptomics (ST) makes it possible to perform RNA-seq at hundreds of precisely located spots on the surface of a histological slice (Ståhl et al., 2016). Since mRNA diffusion is minimal during tissues permeabilization and mRNA capture, the transcriptome of each spot is thought to aggregate the transcriptomes of the cells it contains. The number of cells within a spot and their transcriptional output depend on their type, state, and overall local morphology. ST shares some limitations with single-cell RNA-seq, including high dropout rate. So far, ST studies have relied on preprocessing pipelines inspired by single-cell RNA-seq studies (Ståhl et al., 2016; Asp et al., 2017; Giacomello et al., 2017; Berglund et al., 2018; Lundmark et al., 2018). These include normalization of gene-wise read counts in a cell/spot by the total number of reads collected from that cell/spot. But the number of reads obtained from a spot could reflect its cellular content or technical variation in RNA capture and amplification. Thus, whether read count normalization is warranted in the context of ST remains an open question. We addressed it by quantifying the cellular content of individual spots from image analysis and by comparing it with read counts. A BRAF V600E-mutated papillary thyroid cancer was profiled with ST (Supplementary material). Pathology review of the H&E image revealed five major types of morphologies (Figure 1A). This qualitative approach was complemented by whole-slide machine learning-based localization of nuclei, and their classification within three categories (Figure 1B): epithelial cells, fibroblasts, and ‘other cells’, which mostly contain immune cells. Among 86111 detected nuclei (Figure 1C), 31% were located within ST spots. The mean number of cells per spots varied from 0 to 197 (median 67).
Figure 1

Variation of total read counts is related to morphology and number of cells of different types. (A) Five types of morphological regions were determined from pathology. The transcriptome was measured in each spot with ST. (B) The nuclei on the H&E image were segmented and classified with a machine learning-based algorithm (Supplementary material). (C) Nuclei counts. (D) Distributions of the number of cells per ST spot. (E) Total read count per spot. (F) Multivariate analysis of association between cell numbers and the log2 of total read count per spot. (G) Each point represents a ST spot with same color code as in A. ρ denotes the Spearman’s correlation. (H and I) Expression of TG (H) and VIM (I) without normalization (left), with adjustment for total read counts (center), and with DCA normalization (right). Boxplots represent the expression of spots in the regions shown in A (same color code). (J and K) Points represent spots (same color code as before) with the absolute number (J) and cell-type proportion (K) of epithelial cells (x-axis) and fibroblasts (y-axis). The large negative correlation stems from the low number of ‘others’ in C. (L and M) Expressions of TG and VIM are compared using raw counts or auto-encoder-based normalization. (N) Distribution of gene–gene correlation across spots for raw counts and normalized data.

Variation of total read counts is related to morphology and number of cells of different types. (A) Five types of morphological regions were determined from pathology. The transcriptome was measured in each spot with ST. (B) The nuclei on the H&E image were segmented and classified with a machine learning-based algorithm (Supplementary material). (C) Nuclei counts. (D) Distributions of the number of cells per ST spot. (E) Total read count per spot. (F) Multivariate analysis of association between cell numbers and the log2 of total read count per spot. (G) Each point represents a ST spot with same color code as in A. ρ denotes the Spearman’s correlation. (H and I) Expression of TG (H) and VIM (I) without normalization (left), with adjustment for total read counts (center), and with DCA normalization (right). Boxplots represent the expression of spots in the regions shown in A (same color code). (J and K) Points represent spots (same color code as before) with the absolute number (J) and cell-type proportion (K) of epithelial cells (x-axis) and fibroblasts (y-axis). The large negative correlation stems from the low number of ‘others’ in C. (L and M) Expressions of TG and VIM are compared using raw counts or auto-encoder-based normalization. (N) Distribution of gene–gene correlation across spots for raw counts and normalized data. Spot-wise read coverage varied from 356 to 8749 across the slide (Figure 1E), a 25-fold variation. It was associated with the number of cells of all types in a multivariate analysis (Figure 1G), particularly epithelial cells (Figure 1F). As expected, total read count per spots was highest in dense epithelial areas and lowest in low cell density fibrotic zones (Figure 1E and G). We concluded that total read counts per spot reflects relevant quantitative and qualitative features of tissue morphology. To assess the effect of normalization, we compared raw read counts with raw counts normalized for total counts and with scale-normalized expression estimates generated by the Deep Count Autoencoder (DCA; Eraslan et al., 2019), a neural network-based algorithm developed in the context of single-cell RNA-seq. Figure 1H and I show expression of thyroglobulin (TG, a thyroid differentiation marker) and vimentin (VIM, a mesenchymal intermediate filament), respectively. Raw counts and normalized expression of TG all closely followed epithelial density and total counts (compare Figure 1H to Figure 1A and E, and see also Supplementary Figure S1). VIM raw counts were substantial in the epithelial areas but also in the cellular fibrosis and immune foci. Normalization, however, revealed a dramatically different picture, particularly for DCA: while remaining high in cellular fibrosis and immune foci, VIM expression was lower in epithelium (Figure 1I; Supplementary Figure S1). Thus, normalization affected the spatial expression pattern of VIM, but not TG. The absolute numbers of epithelial cells and fibroblasts per spot were weakly associated (Figure 1J), while their relative proportions, i.e. their number divided by the total number of cells within a spot, were massively anti-correlated (Figure 1K). The raw counts of TG and VIM are positively correlated (Figure 1L), while their normalized values are negatively correlated (Figure 1M). The positive correlation between TG and VIM raw counts (Figure 1L) suggests that the tumoral epithelium expresses VIM and could undergo an epithelial–mesenchymal transition. This transition has been reported in BRAF V600E-mutated tumors (Knauf et al., 2011) such as this one. VIM is also expressed in primary cultures of normal thyrocytes treated with epidermal growth factor, which inhibits differentiation, but not of thyrocytes treated with thyroid-stimulating hormone, which promotes thyroid differentiation—while both are mitogenic (Coclet et al., 1991). Alternatively, VIM expression by fibroblasts could be promoted by nearby epithelial cells. Overall, raw counts seemed more related to the number of cells of a given cell type, while normalized expression captured cell types’ relative proportions. To rule out possible artifacts related to TG and VIM in this particular tissue slice, we reproduced the above analysis (i) to the thyroid-stimulating hormone receptor (TSHR) and collagen III α1 (COL3A1) in another slice of the same thyroid cancer (Supplementary Figure S2) and (ii) to the estrogen receptor (ESR1) and VIM in a publicly available breast cancer slice profiled on the recent Visium platform (10x Genomics; Supplementary Figure S3). Taken together, these controls support the generality of the effect of normalization on the relation between epithelial differentiation and mesenchymal markers and their relevance to Visium slides, which have a 4-fold higher resolution than the first generation of ST slides shown in Figure 1. To gain insights on the global effect of normalization, we calculated the distribution of gene−gene correlations across spots for all three expression metrics (Figure 1N). Raw counts were positively correlated for most pairs of genes. By contrast, normalized expression correlations were centered on 0. This implies that genes tend to show a similar expression pattern that reflects total transcriptional output when raw counts are considered, while normalized expression highlights contrasts between genes. We showed that the variation of total read counts is largely determined by local cell density in ST data. Thus, total counts per spot are biologically informative and do not necessarily need to be normalized out. Importantly, normalization and denoising are technically independent operations. For example, DCA estimates read count scale factors, but users are free to use its scale-adjusted or un-adjusted outputs (Eraslan et al., 2019). Our study shows that both options are valid but address different purposes. Raw read counts inform about the absolute density of cell types, while normalized expression informs about their relative proportions. It is remarkable that normalized expression better detects specific morphologies such as pure epithelium and cellular fibrosis (see boxplots in Figure 1H and I, and compare the DCA panels in Figure 1H and I with the pathology annotation in Figure 1A), while raw counts do reflect actual cell-type local densities and may highlight atypical expression patterns, as exemplified here for VIM in regions of high epithelial density. The resolution of commercially available ST will eventually reach sub-cellular resolution (Vickovic et al., 2019). Given that some cells, e.g. cancer cells, produce more RNA than others (Lovén et al., 2012), it begs the question of to what extent our argument also applies at single-cell level. Cell-level phenotypic information measured independently of transcription must be available together with matched cell transcriptomes in order to unambiguously address this question.

[ Click here for additional data file.
  10 in total

1.  Intermediate filaments in normal thyrocytes: modulation of vimentin expression in primary cultures.

Authors:  J Coclet; F Lamy; F Rickaert; J E Dumont; P P Roger
Journal:  Mol Cell Endocrinol       Date:  1991-04       Impact factor: 4.102

2.  Spatially resolved transcriptome profiling in model plant species.

Authors:  Stefania Giacomello; Fredrik Salmén; Barbara K Terebieniec; Sanja Vickovic; José Fernandez Navarro; Andrey Alexeyenko; Johan Reimegård; Lauren S McKee; Chanaka Mannapperuma; Vincent Bulone; Patrik L Ståhl; Jens F Sundström; Nathaniel R Street; Joakim Lundeberg
Journal:  Nat Plants       Date:  2017-05-08       Impact factor: 15.793

3.  Visualization and analysis of gene expression in tissue sections by spatial transcriptomics.

Authors:  Patrik L Ståhl; Fredrik Salmén; Sanja Vickovic; Anna Lundmark; José Fernández Navarro; Jens Magnusson; Stefania Giacomello; Michaela Asp; Jakub O Westholm; Mikael Huss; Annelie Mollbrink; Sten Linnarsson; Simone Codeluppi; Åke Borg; Fredrik Pontén; Paul Igor Costea; Pelin Sahlén; Jan Mulder; Olaf Bergmann; Joakim Lundeberg; Jonas Frisén
Journal:  Science       Date:  2016-07-01       Impact factor: 47.728

Review 4.  Revisiting global gene expression analysis.

Authors:  Jakob Lovén; David A Orlando; Alla A Sigova; Charles Y Lin; Peter B Rahl; Christopher B Burge; David L Levens; Tong Ihn Lee; Richard A Young
Journal:  Cell       Date:  2012-10-26       Impact factor: 41.582

5.  Progression of BRAF-induced thyroid cancer is associated with epithelial-mesenchymal transition requiring concomitant MAP kinase and TGFβ signaling.

Authors:  J A Knauf; M A Sartor; M Medvedovic; E Lundsmith; M Ryder; M Salzano; Y E Nikiforov; T J Giordano; R A Ghossein; J A Fagin
Journal:  Oncogene       Date:  2011-03-07       Impact factor: 9.867

6.  Spatial maps of prostate cancer transcriptomes reveal an unexplored landscape of heterogeneity.

Authors:  Emelie Berglund; Jonas Maaskola; Niklas Schultz; Stefanie Friedrich; Maja Marklund; Joseph Bergenstråhle; Firas Tarish; Anna Tanoglidi; Sanja Vickovic; Ludvig Larsson; Fredrik Salmén; Christoph Ogris; Karolina Wallenborg; Jens Lagergren; Patrik Ståhl; Erik Sonnhammer; Thomas Helleday; Joakim Lundeberg
Journal:  Nat Commun       Date:  2018-06-20       Impact factor: 14.919

7.  Gene expression profiling of periodontitis-affected gingival tissue by spatial transcriptomics.

Authors:  Anna Lundmark; Natalija Gerasimcik; Tove Båge; Anders Jemt; Annelie Mollbrink; Fredrik Salmén; Joakim Lundeberg; Tülay Yucel-Lindberg
Journal:  Sci Rep       Date:  2018-06-19       Impact factor: 4.379

8.  Single-cell RNA-seq denoising using a deep count autoencoder.

Authors:  Gökcen Eraslan; Lukas M Simon; Maria Mircea; Nikola S Mueller; Fabian J Theis
Journal:  Nat Commun       Date:  2019-01-23       Impact factor: 14.919

9.  Spatial detection of fetal marker genes expressed at low level in adult human heart tissue.

Authors:  Michaela Asp; Fredrik Salmén; Patrik L Ståhl; Sanja Vickovic; Ulrika Felldin; Marie Löfling; José Fernandez Navarro; Jonas Maaskola; Maria J Eriksson; Bengt Persson; Matthias Corbascio; Hans Persson; Cecilia Linde; Joakim Lundeberg
Journal:  Sci Rep       Date:  2017-10-11       Impact factor: 4.379

10.  High-definition spatial transcriptomics for in situ tissue profiling.

Authors:  Gökcen Eraslan; Fredrik Salmén; Johanna Klughammer; Linnea Stenbeck; Sanja Vickovic; Denis Schapiro; Tarmo Äijö; Richard Bonneau; Ludvig Bergenstråhle; José Fernandéz Navarro; Joshua Gould; Gabriel K Griffin; Åke Borg; Mostafa Ronaghi; Jonas Frisén; Joakim Lundeberg; Aviv Regev; Patrik L Ståhl
Journal:  Nat Methods       Date:  2019-09-09       Impact factor: 28.547

  10 in total
  9 in total

1.  scFeatures: multi-view representations of single-cell and spatial data for disease outcome prediction.

Authors:  Yue Cao; Yingxin Lin; Ellis Patrick; Pengyi Yang; Jean Yee Hwa Yang
Journal:  Bioinformatics       Date:  2022-10-14       Impact factor: 6.931

Review 2.  Next Generation Imaging Techniques to Define Immune Topographies in Solid Tumors.

Authors:  Violena Pietrobon; Alessandra Cesano; Francesco Marincola; Jakob Nikolas Kather
Journal:  Front Immunol       Date:  2021-01-27       Impact factor: 7.561

Review 3.  Hypoxia and the phenomenon of immune exclusion.

Authors:  Violena Pietrobon; Francesco M Marincola
Journal:  J Transl Med       Date:  2021-01-06       Impact factor: 5.531

Review 4.  Analysis and Visualization of Spatial Transcriptomic Data.

Authors:  Boxiang Liu; Yanjun Li; Liang Zhang
Journal:  Front Genet       Date:  2022-01-27       Impact factor: 4.599

Review 5.  An introduction to spatial transcriptomics for biomedical research.

Authors:  Cameron G Williams; Hyun Jae Lee; Takahiro Asatsuma; Roser Vento-Tormo; Ashraful Haque
Journal:  Genome Med       Date:  2022-06-27       Impact factor: 15.266

Review 6.  Spatial omics: Navigating to the golden era of cancer research.

Authors:  Yingcheng Wu; Yifei Cheng; Xiangdong Wang; Jia Fan; Qiang Gao
Journal:  Clin Transl Med       Date:  2022-01

Review 7.  Dynamic Cancer Cell Heterogeneity: Diagnostic and Therapeutic Implications.

Authors:  Valerie Jacquemin; Mathieu Antoine; Geneviève Dom; Vincent Detours; Carine Maenhaut; Jacques E Dumont
Journal:  Cancers (Basel)       Date:  2022-01-07       Impact factor: 6.639

8.  Characterizing spatial gene expression heterogeneity in spatially resolved single-cell transcriptomic data with nonuniform cellular densities.

Authors:  Brendan F Miller; Dhananjay Bambah-Mukku; Catherine Dulac; Xiaowei Zhuang; Jean Fan
Journal:  Genome Res       Date:  2021-05-25       Impact factor: 9.043

Review 9.  Single-Cell RNA Sequencing with Spatial Transcriptomics of Cancer Tissues.

Authors:  Rashid Ahmed; Tariq Zaman; Farhan Chowdhury; Fatima Mraiche; Muhammad Tariq; Irfan S Ahmad; Anwarul Hasan
Journal:  Int J Mol Sci       Date:  2022-03-11       Impact factor: 5.923

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.