| Literature DB >> 35937994 |
Etienne Boileau1,2,3, Xue Li2,3, Isabel S Naarmann-de Vries1,2,3, Christian Becker4, Ramona Casper4, Janine Altmüller5, Florian Leuschner2,3, Christoph Dieterich1,2,3.
Abstract
We introduce Single-cell Nanopore Spatial Transcriptomics (scNaST), a software suite to facilitate the analysis of spatial gene expression from second- and third-generation sequencing, allowing to generate a full-length near-single-cell transcriptional landscape of the tissue microenvironment. Taking advantage of the Visium Spatial platform, we adapted a strategy recently developed to assign barcodes to long-read single-cell sequencing data for spatial capture technology. Here, we demonstrate our workflow using four short axis sections of the mouse heart following myocardial infarction. We constructed a de novo transcriptome using long-read data, and successfully assigned 19,794 transcript isoforms in total, including clinically-relevant, but yet uncharacterized modes of transcription, such as intron retention or antisense overlapping transcription. We showed a higher transcriptome complexity in the healthy regions, and identified intron retention as a mode of transcription associated with the infarct area. Our data revealed a clear regional isoform switching among differentially used transcripts for genes involved in cardiac muscle contraction and tissue morphogenesis. Molecular signatures involved in cardiac remodeling integrated with morphological context may support the development of new therapeutics towards the treatment of heart failure and the reduction of cardiac complications.Entities:
Keywords: myocardial infarction; oxford nanopore technology; single-cell RNA sequencing; spatial transcriptomics; visium spatial
Year: 2022 PMID: 35937994 PMCID: PMC9354982 DOI: 10.3389/fgene.2022.912572
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1scNaST methodology. (A) Schematic of the scNaST workflow using a hybrid sequencing approach on Nanopore and Illumina platforms to assign spatial barcodes to long-read sequencing. (B) Nanopore sequencing saturation showing the number of splice sites detected at various levels of subsampling. A curve that reaches a plateau before getting to 100% data suggest that all known junctions in the library have been detected. The curve shows the mean ± SE of four samples. (C) Normalized transcript coverage for Nanopore and Illumina. The curves show the mean ± SE of four samples. (D) Reads assigned by scNapBar at each step of the workflow. The bars show the mean ± SE of four samples.
FIGURE 2Defining morphological regions after MI. (A) Dot plot showing the expresson of selected markers associated with the expresson of CM = cardiomyocytes, EC = endothelial cells, MFB = myofibroblasts, IM = immune cells, or with fibrosis and inflammation, based on the short-read Illumina data. (B) Annotation of mouse heart regions after MI via short-read clustering, transfered to the Nanopore data. Scatter plot in spatial coordinates of the anatomical regions (left) and UMAP representation of the Nanopore data using the region annotation from short-read clustering (right). Colors in the spatial scatter plot are matching those of the UMAP. (C) Neighbors enrichment analysis in one heart axis section. The heatmap shows the enrichment score on spatial proximity between the different anatomical regions. Spots belonging to two different regions that are close together will have a high score, and vice-versa. (D) Cluster co-occurrence in spatial dimensions in one heart axis section. Line plot showing the conditional probability of observing a given region conditioned on the presence of the infarct region, computed across increasing radii size around each spots. Distance units are given in pixels of the Visium source image. (E) Barplot showing the frequency distribution of the number of isoforms per gene, either stemming from the assembly, or found in the final data after quality control filtering. The median length of transcripts is indicated in each bar for each category. (F) Average number of isoform per gene detected among markers of each morphological region. Significance was measured using a Mann-Whitney U-test (*** = 0.001, **** = 0.0001)
FIGURE 3Characterizing the isoform diversity after MI. (A) Barplot showing how full-length transcripts obtained with scNaST compare to the existing mouse annotation. Labels Complete (=), Multi-exon (j), Containment (k), Unknown (u), Exonic (x), Intron(i), Overlap (o), Retained (m, n), and Contains (y) are explained in https://ccb.jhu.edu/software/stringtie/gffcompare.shtml. (B) Over-representation analysis of genes harboring novel transcripts with intron retention (IR). From top to bottom, biological processes, molecular function, and hallmark gene sets from the Molecular Signatures Database (MSigDB). (C) Exonic structure of the different Actc1 isoforms, including novel isoforms identified by scNaST with IR and exonic antisense overlap. Coverage (log scale) is shown for each region with a different scale. (D) Actc1 isoform contributions to total Actc1 expression in the different heart regions. (E) Per spot correlation observed between spatial deconvolution of cell types and Actc1 isoforms.
FIGURE 4Regional isoform switching after MI. (A) Dot plot showing the top five most significant isoform-switching genes per contrast across all comparisons between the remote, border, and infarct areas, using a stage-wise testing procedure at an overall false discovery rate (OFDR) of 0.05. The top genes were restricted to the spatially variable genes. (B) Heatmap representing the number of isoform-switching genes identified between any two regions. I = infarct, BZ = border zone, RZ = remote zone. (C) Spatial scatter plot showing Pdlim5 isoform expression in one heart axis section. (D) Pdlim5 isoforms track. (E) Zoom of D to show coverage of both isoforms across regions.