| Literature DB >> 35168703 |
Jongwon Lee1, Minsu Yoo2, Jungmin Choi3.
Abstract
Single-cell RNA sequencing (scRNA-seq) has greatly advanced our understanding of cellular heterogeneity by profiling individual cell transcriptomes. However, cell dissociation from the tissue structure causes a loss of spatial information, which hinders the identification of intercellular communication networks and global transcriptional patterns present in the tissue architecture. To overcome this limitation, novel transcriptomic platforms that preserve spatial information have been actively developed. Significant achievements in imaging technologies have enabled in situ targeted transcriptomic profiling in single cells at singlemolecule resolution. In addition, technologies based on mRNA capture followed by sequencing have made possible profiling of the genome-wide transcriptome at the 55-100 μm resolution. Unfortunately, neither imaging-based technology nor capturebased method elucidates a complete picture of the spatial transcriptome in a tissue. Therefore, addressing specific biological questions requires balancing experimental throughput and spatial resolution, mandating the efforts to develop computational algorithms that are pivotal to circumvent technology-specific limitations. In this review, we focus on the current state-of-the-art spatially resolved transcriptomic technologies, describe their applications in a variety of biological domains, and explore recent discoveries demonstrating their enormous potential in biomedical research. We further highlight novel integrative computational methodologies with other data modalities that provide a framework to derive biological insight into heterogeneous and complex tissue organization. [BMB Reports 2022; 55(3): 113-124].Entities:
Mesh:
Year: 2022 PMID: 35168703 PMCID: PMC8972138
Source DB: PubMed Journal: BMB Rep ISSN: 1976-6696 Impact factor: 4.778
Fig. 1Overview of two main categories of spatially resolved transcriptomic techniques: image-based (Left) and capture-based (Right) methods. Imaging-based methods detect target genes by sequencing directly into a fixed tissue section (ISS) or hybridizing a complementary fluorescent probe (ISH). Both image-based methods are suitable for analyzing subcellular transcript patterns of a cell. Capture-based methods are divided into three sub-categories: directly obtaining a specific region of interest from a tissue section using laser capture microdissection (LCM), customized slides, or bead arrays to capture mRNAs by oligonucleotide-based spatial barcodes followed by NGS. Combining computational strategies enables the comprehensive mapping of cell types in spatially resolved transcriptomic data with a specific spatial resolution by each method.
Spatially resolved transcriptomic techniques
| Techniques | Features | Target genes | Application | Programming language | Reference |
|---|---|---|---|---|---|
| Image-based spatially resolved transcriptomics: ISH | |||||
| smFISH | Short DNA probes complementary to mRNA targets trigger chain reactions | 39 probes (smHCR) | Zebra fish, mouse brain | MATLAB | Shah |
| osmFISH | Binding of 20 nucleotide-long fluorescently labeled DNA probes | 33 marker genes | Mouse brain | Python | Codeluppi |
| MERFISH | Using chemical cleavage instead of photobleaching to remove fluorescent signals | 130 genes in up to 100,000 cells | Cultured U-2 OS cells | MATLAB | Moffitt |
| MERFISH-based analysis platform | Targeting a set of 155 genes | Mouse hypothalamic preoptic region | MATLAB | Moffitt | |
| seqFISH+ | Enables visualization of the subcellular localization | 10,000 genes in single cells | NIH/3T3, mouse brain | MATLAB | Eng |
| SABER | Additional signal amplification or applying serial imaging with DNA-Exchange | 18,000 probes targeting a 3.9-Mb region | Mouse retinal tissue | MATLAB & Python | Kishi |
| Split-FISH | Alternative approach to reduce off-target background fluorescence by integrating split-probe strategy with multiplexed FISH | 317 genes in single cell | Mouse brain, liver, kidney, ovary | Python | Goh |
| Image-based spatially resolved transcriptomics: ISS | |||||
| STARmap | Integrated with hydrogel-tissue chemistry and targeted signal amplification | 160 to 1,020 genes simultaneously | Mouse brain | Python | Wang |
| INSTA-seq | Sequences two bases simultaneously from both ends of the cDNA fragments | Up to 443,304 UMIs in total | R | Fürth | |
| HybISS | New barcoding system via sequence-by-hybridization chemistry | 119 genes for PLP design | Mouse visual cortex, human brain | MATLAB | Gyllborg |
| pciSeq | Bayesian algorithm derived from scRNA-seq clusters data | Designed 755 probes for 99 genes | Mouse CA1 interneuron | Python | Qian |
| ExSeq | cDNA amplicons are eluted from the sample and re-sequenced | Up to 3,039 genes with untargeted approach | Mouse brain, mouse visual cortex, human breast cancer | MATLAB | Alon |
| Capture-based spatially resolved transcriptomics: LCM | |||||
| exome- capture RNA- sequencing | Optimized standard protocol for hematoxylin and eosin (H&E) staining | Whole exome | 7 tumor samples of TNBC | MATLAB | Romanens |
| immuno-LCM- RNAseq | RNA quality was significantly improved using modified protocol | Up to 60 cells were demonstrated to be sufficient quality | Mouse small intestine | Python | Zhang |
| PIC | Photo-irradiated cells were suppressed cDNA amplification | 8,000 genes were detected with 7 × 104 unique read counts | Mouse embryo | R | Honda |
| Capture-based spatially resolvedtranscriptomics: Oligonucleotide-based spatial barcode on slide | |||||
| ST | Arrayed reverse transcription primers with unique positional barcodes | Up to 200 million oligonucleotides in each of 1007 features | Mouse brain, human breast cancer | R | Ståhl |
| Salmén | |||||
| Multimodal analysis | Combined single-cell RNA sequencing with ST | Median depth of 1,629 UMIs/spot and 967 genes/spot | Human cSCC | MATLAB & R | Ji |
| ST | Combined ST and ISS | Mean 31,283 UMIs and 6,578 unique genes per TD | AD mouse model, mouse and human brain | Python | Chen |
| Capture-based spatially resolvedtranscriptomics: Oligonucleotide-based spatial barcode on bead array | |||||
| Slide-seq | DNA-barcoded beads with known positions ( | 1.5 million beads, of which 770,000 can be analyzed | Mouse cerebellum and hippocampus | MATLAB & R | Rodriques |
| HDST | Barcoded poly(d)T oligonucleotides into 2-μm wells with a randomly ordered bead array-based | 2,893,865 individual barcoded beads | Mouse brain, primary breast cancer | Python | Vickovic |
| Slide-seq V2 | Improvements in library generation, bead synthesis and array indexing | Mean 45,772 UMIs in 110 μm diameter area | Mouse hippocampus | MATLAB & R & Python | Stickels |
| Seq-Scope | Based on a solid-phase amplification using an Illumina sequencing platform | Up to 5.88-19.7 genes were identified per HDMI pixel | Human liver and colon | Python | Cho |
| Stereo-seq | Combined DNA nanoball pattern arrays and tissue RNA capture | Up to 133,776 UMIs per 100 μm diameter | Mouse brain | Not identified yet | Chen |
Integration of spatially resolved transcriptomic data with other methods
| Techniques | Features | Input data | Application | Programming language | Reference |
|---|---|---|---|---|---|
| Combination with scRNA-seq | |||||
| pciSeq | Bayesian algorithm derived from scRNA-seq clusters data with ISS | Designed 755 probes for 99 genes | Mouse CA1 interneuron | Python | Qian |
| seqFISH | Computing the ratio of the performance and prediction scores with scRNA-seq data | Each cell contained avg 196 mRNA from 93.2 genes | Embryo development in brain and gut | R | Lohoff |
| Multiple spatial transcriptomics | Unbiased approach with additional in situ hybridization using RNAscope and multi-molecule ISS | Mean 31,283 UMIs and 6,578 unique genes per tissue domain | AD mouse model, mouse and human brain | Python | Chen |
| Slide-seq | NMFreg that reconstructs expression of each cell type signatures defined by scRNA-seq | 1.5 million beads, of which 770,000 could be analyzed | Mouse cerebellum and hippocampus | MATLAB & R | Rodriques |
| Integrating microarray-base d ST and MIA | Enrichment analysis that two-tailed Student’s t-test were used to compare expression of those marker genes | 2,500-3,300 UMIs and 1,400-1,700 unique expressed genes per single cell | Pancreatic ductal adeno-carcinoma | R | Moncada |
| Deep learning-based spatial information | |||||
| DEEPsc | Deep-learning network was trained with spatial position feature vectors as simulated scRNA-seq data | Started with top 3,000 highly variable genes | MATLAB | Maseda | |
| BayesSpace | Bayesian statistical method that uses the information from spatial neighborhoods to achieve super-resolution images | 10X Genomics Visium data, does not require independent single-cell data or marker gene preselection | Brain, melanoma, invasive ductal carcinoma, ovarian adeno-carcinoma | R | Zhao |
| SPICEMIX | Enhances the NMF of gene expression with a graphical representation of the spatial relationship of cells | 2,470 genes in 523 cells (seqFISH+), 930 cells and 1,020 genes (STARmap) | Mouse primary visual cortex | Python | Chidester |
| SpaOTsc | Infer the spatial distance between every pair of cells by computing the optimal transport distance | 851-15,413 cells and 10,495-45,789 genes (scRNA-seq), 64-1,549 spatial positions and 47-1,020 genes | Zebrafish embryo, | Python | Cang |
| Deconvolution of spatially resolved transcriptomics | |||||
| RCTD | Statistical model assumed to be Poisson distributed and maximum-likelihood estimation (MLE) used to infer the cell types | Slide-seq and 10X Genomics Visium data | Mouse brain | R | Cable |
| SPOTlight | NMF along with non-negative least squares (NNLS) model with both the basis and coefficient matrices with cell type marker genes | 41,986 cells were merged to identify a total of 10,623 immune cells | Mouse brain, pancreatic adeno-carcinoma | R | Elosua-Bayes |
| SpatialDWLS | Dampened weighted least squares (DWLS) model with cell-type specific gene signatures from a public scRNA-seq dataset as a reference | 10,000 genes in 523 cells (seqFISH+) | Mouse brain, human heart | R | Dong |
| DestVI | Bayesian model for multi-resolution deconvolution of cell types using Variational Inference | Pair of ST and scRNA-seq from same tissue | Murine lymph node, mouse tumor model | Python | Lopez |
| Cell type inference via image-based machine learning | |||||
| ST-Net | Deep learning algorithm that combines ST and histology images to predict the target gene expression of each spot | 30,612 spots in 68 breast tissue sections | Breast cancer | Python | He |
| HisToGene | Employs a modified Vision Transformer model for gene expression prediction from histology images | 9,612 spots and 785 genes in breast cancer tissue | Breast cancer | PyTorch | Pang |
| stLearn | Deep neural network model to predict hotspots where cell-cell interactions are more likely to occur | Feature vectors from H&E images of the tissue section | Mouse brain, human brain, breast cancer | Python | Pham |
| SpaCell | Normalized count data and H&E staining images were trained with convolutional neural network | Tissue morphology and spatial gene expression data | Prostate cancer, amyotrophic lateral sclerosis | Python | Tan |
| CoSTA | Clustering by Gaussian mixture model (GMM) and weight updating as commonly performed in training neural networks | Image-type matrix of MERFISH and Slide-seq data | Mouse brain, brain Injury | Python | Xu |
| STUtility | NMF to decompose ST data and identification and extraction of neighbouring capture-spots | 10x Genomics Visium data | Mouse brain, breast cancer tissue, lymph node, rheumatoid arthritis | R | Bergenstråhle |
| ISST | Image-based | 12 sections from the mouse olfactory bulb | Mouse olfactory bulb, human breast cancer | Python | Bergenstråhle |
| Cellular protein information | |||||
| CITE-seq | Oligonucleotide-labeled antibodies are used to integrate cellular protein and transcriptome measurements | Common immune subpopulation markers (CD8a, CD3e, CD19, CD56, CD16, CD11c and CD14) | Human HeLa, mouse 4T1 cell, immune subpopulation | R | Stoeckius |
| IMC | Epitope-based imaging methods that employ a mass spectrometer for readout to infer RNA-to-protein correlations | Detected three mRNA simultaneously (HER2, CK19 and CXCL10) | Breast cancer | MATLAB & R & Python | Schulz |
| nanoPOTS | Unique proteins were identified via combination of LCM and ultrasensitive nanoLC-MS/MS | > 2,000 proteins with 100 μm spatial resolution | Mouse luminal epithelial cell, stromal cell, glandular epithelial cell | R | Piehowski |
| Spatial ATAC-seq | |||||
| sciMAP-ATAC | Spatially resolved, single-cell profiling of chromatin states from a single tissue punch | Mean 12,052 - 30,212 passing reads per cell | Mouse and human brain, cerebral ischemia model system | R | Thornton |
| Spatial-ATAC-seq | DNA barcode solutions were introduced to the tissue surface using an array of microchannels | 36,303-100,786 unique fragments per pixel | Mouse embryos, human tonsil tissue | R | Deng |