| Literature DB >> 31654804 |
Michele Bortolomeazzi1, Mohamed Reda Keddar1, Francesca D Ciccarelli2, Lorena Benedetti3.
Abstract
Interactions between cancer cells and non-cancer cells composing the tumour microenvironment play a primary role in determining cancer progression and shaping the response to therapy. The qualitative and quantitative characterisation of the different cell populations in the tumour microenvironment is therefore crucial to understand its role in cancer. In recent years, many experimental and computational approaches have been developed to identify the cell populations composing heterogeneous tissue samples, such as cancer. In this review, we describe the state-of-the-art approaches for the quantification of non-cancer cells from bulk and single-cell cancer transcriptomic data, with a focus on immune cells. We illustrate the main features of these approaches and highlight their applications for the analysis of the tumour microenvironment in solid cancers. We also discuss techniques that are complementary and alternative to RNA sequencing, particularly focusing on approaches that can provide spatial information on the distribution of the cells within the tumour in addition to their qualitative and quantitative measurements. This article is part of a Special Issue entitled: Transcriptional Profiles and Regulatory Gene Networks edited by Dr. Federico Manuel Giorgi and Dr. Shaun Mahony.Entities:
Keywords: Cell-specific signatures; Deconvolution; GSEA; Gene expression profiles; RNA-seq; Tumour microenvironment
Mesh:
Substances:
Year: 2019 PMID: 31654804 PMCID: PMC7346884 DOI: 10.1016/j.bbagrm.2019.194445
Source DB: PubMed Journal: Biochim Biophys Acta Gene Regul Mech ISSN: 1874-9399 Impact factor: 4.490
Fig. 1Workflows of bulk and scRNA-seq experiments. (A) Bulk RNA-seq of solid tumours is based on four steps: RNA extraction from the cancer tissue, rRNA depletion, RNA fragmentation, and cDNA library synthesis for sequencing. (B) scRNA-seq from solid tumour samples requires single cell isolation either through FACS or microfluidics-based methods or laser capture microdissection. cDNA libraries from individual cells are then synthesised and sequenced. (C) Analytical approaches for the quantification of gene expression for bulk RNA-seq and scRNA-seq data. After pre-processing, the reads are aligned to the reference transcriptome or genome. Reads mapping to the exons are counted and normalised to generate gene expression profiles. FACS = fluorescence-activated cell sorting.
Examples of approaches for the quantification of tumour-infiltrating cells from bulk transcriptomic data. For each approach, we report the underlying mathematical method, the type of expression data used to derive the signatures, the total number of marker genes included in the signatures and the final number of non-cancer cell populations considered. Only methods that implement their own reference signatures and that have been applied to the analysis of cancer samples are reported. ssGSEA = single sample gene set enrichment analysis, GSVA = gene set variation analysis.
| Approach | Computational method | Source of expression data | Marker genes (n) | Cell populations (n) |
|---|---|---|---|---|
| Angelova et al. [ | ssGSEA | Microarray | 812 | 31 |
| Charoentong et al. [ | ssGSEA | Microarray | 782 | 28 |
| ConsensusTME [ | ssGSEA | Microarray, bulk RNA-seq | Cancer type specific | 18 |
| xCell [ | ssGSEA and spillover compensation | Microarray, bulk RNA-seq | 10,808 | 64 |
| Tamborero et al. [ | Scoring (GSVA) | Microarray, bulk RNA-seq | 401 | 16 |
| MCP-counter [ | Log-transformed geometric mean of expression | Microarray | 522 | 10 |
| Danaher et al. [ | Log-transformed geometric mean of expression | Microarray, bulk RNA-seq | 60 | 14 |
| ImSig [ | Arithmetic mean of expression | Microarray, bulk RNA-seq | 318 | 7 |
| CIBERSORT [ | Deconvolution, nu support vector regression | Microarray | 547 | 22 |
| TIMER [ | Deconvolution, constrained least square fitting | Microarray | Cancer type specific | 6 |
| EPIC [ | Deconvolution, constrained least square fitting | scRNA-seq | 118 | 10 |
| quanTIseq [ | Deconvolution, constrained least square fitting | Bulk RNA-seq | 153 | 10 |
Fig. 2Computational framework to derive reference signatures. (A) Gene expression data of purified cell populations and marker genes are collected from gene expression databases and/or the literature. (B) They are then normalised to derive cell type-specific transcriptional profiles. (C) Profiles are used to derive cell type-specific reference marker genes through differential expression and correlation analyses. (D) Alternatively, the transcriptional profiles can be aggregated to generate reference expression profile matrices. GEO = Gene Expression Omnibus database, IRIS = Immune Response In Silico database, DC = dendritic cells.
Fig. 3Single-cell RNA-seq for the identification of TME cell populations. (A) Clustered scRNA-seq profiles of cancer samples are annotated according to the expression of known marker genes. (B) Cell populations can be directly identified from the annotated clusters, and visualised after dimensionality reduction. (C) Alternatively, the annotated clusters can also be used to derive high-resolution reference profile matrices. DC = dendritic cells, tSNE = t-distributed Stochastic Neighbour Embedding.
Non-transcriptomic approaches for the quantification of tumour-infiltrating cells. For each method, we report its detection technology and compatibility with FFPE samples, the type and the maximum number of measurable markers, and its throughput per run. For techniques providing spatial information, we also report their spatial resolution. FFPE = formalin-fixed paraffin-embedded, IHC = immunohistochemistry, IF = immunofluorescence, cyTOF = cytometry by time-of-flight, IMC = imaging mass cytometry, MIBI = multiplexed ion beam imaging, DSP = digital spatial profiling.
| Method | Technology | FFPE | Markers | Throughput per run | Spatial resolution |
|---|---|---|---|---|---|
| Multiplex IHC [ | Chromogenic-antibodies | Y | <12 proteins | ~500 mm2/run | <1 μm |
| Multiplex IF [ | Fluorescent-antibodies | Y | <50 proteins | ~500 mm2 | <1 μm |
| Flow Cytometry [ | Fluorescent-antibodies | Y | <28 proteins | ~107 cells | N |
| cyTOF [ | Mass spectrometry | Y | <40 proteins | ~107 cells | N |
| IMC [ | Mass spectrometry | Y | <40 proteins | ~ 1 mm2 | 1 μm |
| MIBI [ | Mass spectrometry | Y | <50 proteins | ~ 1 mm2 | 0.2 μm |
| Spatial transcriptomics [ | DNA probes Bulk DNA-seq | N | >1500 genes | 1007 spots/slide | 200 μm |
| NanoString DSP [ | DNA probes | Y | <40 proteins | 600 μm2 | 10 μm |
| REAP-seq [ | DNA probes | N | ~1500 genes | ~4000 cells | N |
| Abseq [ | DNA probes | N | >600 genes | >10,000 cells | N |
| CITE-seq [ | DNA probes | N | ~1500 genes | ~10,000 cells | N |