| Literature DB >> 35036053 |
Ke Li1, Congcong Yan1, Chenghao Li1, Lu Chen1, Jingting Zhao1, Zicheng Zhang1, Siqi Bao1, Jie Sun1, Meng Zhou1.
Abstract
Recent advances in spatially resolved transcriptomics (SRT) have revolutionized biological and medical research and enabled unprecedented insight into the functional organization and cell communication of tissues and organs in situ. Identifying and elucidating gene spatial expression variation (SE analysis) is fundamental to elucidate the SRT landscape. There is an urgent need for public repositories and computational techniques of SRT data in SE analysis alongside technological breakthroughs and large-scale data generation. Increasing efforts to use in silico techniques in SE analysis have been made. However, these attempts are widely scattered among a large number of studies that are not easily accessible or comprehensible by both medical and life scientists. This study provides a survey and a summary of public resources on SE analysis in SRT studies. An updated systematic overview of state-of-the-art computational approaches and tools currently available in SE analysis are presented herein, emphasizing recent advances. Finally, the present study explores the future perspectives and challenges of in silico techniques in SE analysis. This study guides medical and life scientists to look for dedicated resources and more competent tools for characterizing spatial patterns of gene expression.Entities:
Keywords: SE analysis; single-cell RNA sequencing; spatially resolved transcriptomes; spatially variable genes
Year: 2021 PMID: 35036053 PMCID: PMC8728308 DOI: 10.1016/j.omtn.2021.12.009
Source DB: PubMed Journal: Mol Ther Nucleic Acids ISSN: 2162-2531 Impact factor: 8.886
Overview of resources and databases for spatially resolved transcriptomics
| Database | Description | URL |
|---|---|---|
| SpatialDB | a database for spatially resolved transcriptomic datasets | |
| Single Cell Portal | a comprehensive database for single-cell and SRT studies |
Overview of computational tools and methods for identification of spatially variable genes
| Method | Description | Platform | URL | Reference |
|---|---|---|---|---|
| Statistical-modeling-based methods | ||||
| trendsceek | based on marked point processes | R | Edsgard et al. | |
| SpatialDE | based on Gaussian process regression | Python | Svensson et al. | |
| SPARK | based on spatial generalized linear mixed model with multiple spatial kernels | R | Sun et al. | |
| SPARK-X | based on the non-parametric model | R | Zhu et al. | |
| GPcounts | based on GP regression using negative binomial likelihood functions | Python | BinTayyash et al. | |
| BayesSpace | based on the Bayesian statistical model | R | Zhao et al. | |
| Machine-learning-based methods | ||||
| RayleighSelection | based on the extension of the graph Laplacian method | R | Govek et al. | |
| SOMDE | based on a self-organizing map neural network | Python | Hao et al. | |
| SPADE | based on convolutional neural network | Python/R | Bae et al. | |
| Spatial-grid-based methods | ||||
| singleCellHaystack | based on the grid and grid points and binary gene expression values | R | Vandenbon et al. | |
| HMRF | based on spatial genes and neighborhood network to detect spatial domains | Python | Zhu et al. | |
| Meringue | based on Delaunay triangulation and spatial auto-correlation statistic | R | Miller et al. | |
| BinSpect | based on Delaunay triangulation and statistical enrichment test | R | Dries et al. | |
Figure 1Schematic workflow of statistical-modeling-based strategies
Cell coordinates and gene expression profiles are input to represent the spatial distribution of gene expression. Then, spatial coordinates and gene expression values are modeled. Finally, significantly spatially variable genes are obtained by calculating statistical indicators.
Figure 2Schematic workflow of machine-learning-based strategies
Spectral-based methods first construct a nearest neighbor graph according to the input data and then calculate the Laplacian matrix and score. Graph and combinatorial Laplacian scores are used to identify the spatially variable genes. Neural-network-based methods process the input data through a graph convolutional neural network or SOM to identify spatially variable genes. SOM, self-organizing map.
Figure 3Schematic workflow of spatial-grid-based methods
Spatial-grid-based methods acquire cell coordinates and gene expression profiles as the input and divide the space into grids and encode spatial relationships or infer the distribution of cells, then they apply subsequent steps such as binarizing the cells’ spatial adjacent relationships or gene expression levels to identify spatially variable genes.