| Literature DB >> 34799559 |
Jianke Lv1,2,3,4,5, Qianqian Shi1,2,3,4,5, Yunwei Han1,2,3,4,5, Weidong Li1,2,3,4,5, Hanjiao Liu1,2,3,4,5, Jingyue Zhang1,2,3,4,5, Chen Niu1,2,3,4,5, Guangshen Gao1,2,3,4,5, Yiru Fu1,2,3,4,5, Renyong Zhi1,2,3,4,5, Kailiang Wu1,2,3,4,5, Shuai Li1,2,3,4,5, Feng Gu6,7,8,9,10, Li Fu11,12,13,14,15.
Abstract
Invasive micropapillary carcinoma (IMPC) is a special histological subtype of breast cancer, featured with extremely high rates of lymphovascular invasion and lymph node metastasis. Based on a previous series of studies, our team proposed the hypothesis of "clustered metastasis of IMPC tumor cells". However, the transcriptomics characteristics underlying its metastasis are unknown, especially in spatial transcriptomics (ST). In this paper, we perform ST sequencing on four freshly frozen IMPC samples. We draw the transcriptomic maps of IMPC for the first time and reveal its extensive heterogeneity, associated with metabolic reprogramming. We also find that IMPC subpopulations with abnormal metabolism are arranged in different spatial areas, and higher levels of lipid metabolism are observed in all IMPC hierarchical clusters. Moreover, we find that the stromal regions show varieties of gene expression programs, and this difference depends on their distance from IMPC regions. Furthermore, a total of seven IMPC hierarchical clusters of four samples share a common higher expression level of the SREBF1 gene. Immunohistochemistry results further show that high SREBF1 protein expression is associated with lymph node metastasis and poor survival in IMPC patients. Together, these findings provide a valuable resource for exploring the inter- and intra-tumoral heterogeneity of IMPC and identify a new marker, SREBF1, which may facilitate accurate diagnosis and treatment of this disease.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34799559 PMCID: PMC8605000 DOI: 10.1038/s41419-021-04380-6
Source DB: PubMed Journal: Cell Death Dis Impact factor: 8.469
Fig. 1The workflow for spatial transcriptomics (ST) and hierarchical clustering results.
Attached a 10 µm thick tissue to the slide for ST sequencing, performed bioinformatics analysis on the sequencing data, and displayed the hierarchical clustering results with t-SNE, heatmap, and spatial graphs.
Fig. 2Tumor morphology and hierarchical clustering results.
A H&E-stained tissue images of sample 1 with marked IMPC (red), IDC-NOS and IMPC-like (black), and stroma (yellow) tissue regions. B Hierarchical clustering of the spatial features in sample 1. Each cluster was assigned a color. The cluster 0 (red) and cluster 2 (green) represent interstitial regions. The cluster 1 (yellow) represents IDC-NOS region. The cluster 3 (wathet) is IMPC-like region. Clusters 4 (blue) and 5 (pink) are the IMPC area. C t-SNE color visualization of hierarchical clustering profile in sample 1. D H&E-stained tissue images of sample 4 with marked IMPC (red), IDC-NOS (black), and stroma (yellow) tissue regions. E Hierarchical clustering of the spatial features in sample 4. Each cluster was assigned a color. The cluster 0 (red) and cluster 3 (cyan) represent IDC-NOS regions. The cluster 1 (brown) and cluster 6 (pink) are stromal regions. The cluster 2 (green), 4 (blue), and 5 (purple) are the IMPC area. F t-SNE color visualization of hierarchical clustering profile in sample 4. G, H Heatmap plots of sample 1 and sample 4. t-SNE, t-distributed statistical neighbor embedding.
Fig. 3Enrichment, transcription factor analysis and PPI of IMPC cluster in four samples.
A Highlight spatial hierarchical plot, GO enrichment (biological process, BP), KEGG enrichment and transcription factor analysis among top 50 gene in each IMPC cluster. GO, Gene Ontology. KEGG, Kyoto Encyclopedia of Genes and Genomes. The 30 terms with the lowest p.adjust values of the enrichment results were selected to draw the enrichment plot. The horizontal axis is gene and the vertical axis is term. The color represents the gene’s logFC value. p.adjust: use BH for multiple hypothesis testing, adjusted P value. p.adjust < 0.05 is a significant difference. Network diagram of highly expressed genes and transcription factors, the blue dots represent differential genes, and the purple dots represent transcription factors. The larger the node, the more nodes interacted to it. The arrows represent the enrichment terms related to metabolism, tumor immune response, and some important signaling pathways. B The t-SNE diagram of the integrated sequencing data. The four samples were completely distinguished. C, D The protein–protein interaction (PPI) plots of top 50 highly expressed genes in cluster 5 of sample 1 and cluster 2 of sample 4. The circles are nodes, and the size represents the size of the degree.
Fig. 4The spatial distribution of stromal regions affects the gene expression of IMPC regions in sample 1.
A The spatial distribution position of two stromal clusters (cluster 0 and 2). The stroma (green cluster) is far from IMPC regions (blue and pink clusters). Another stroma (red cluster) is near IMPC regions. B–E The expression level of IGKC, IGHG4, IGHG3, and IGHA1 genes on the spatial plots. F, G The enrichment results of cluster 0 highly expressed top 50 genes on GO (BP) and KEGG. H, I The enrichment results of cluster 2 highly expressed top 50 genes on GO (BP), and KEGG. The arrows represent that these terms were linked to immune response, cellular interaction, complement pathway, and oxidative phosphorylation, etc.
Fig. 5SREBF1 was a common highly expressed gene in the IMPC clusters of four samples.
A The Venn plot showed that SREBF1 was a common high expression gene in all IMPC clusters of the 4 samples. B SREBF1 was highly expressed in IMPC clusters of each sample in spatial plots. C Boxplot of SREBF1 expression level in IMPC clusters versus IDC-NOS clusters. Bar represents median, and boxplot represents quartiles; scale in log10(RPM). Student’s t-test for comparison between two groups in each sample. The difference was significant in sample 1 (P < 2e-16), sample 3 (P = 2.1e-08) and sample 4 (P < 2e-16). D The correlation analysis between SREBF1 and FASN using TCGA database. The Pearson correlation coefficient was used, two-sided. P < 0.05 was considered to indicate statistical significance. E SREBF1 was more highly expressed in IMPC tumor tissues using RT-qPCR. Bar represents median, and boxplot represents quartiles. Student’s t-test for comparison. The two groups were significantly different (P = 0.013).
Fig. 6Clinical outcomes associated with the expression of the SREBF1 and FASN proteins in 82 patients with IMPC and 80 with IDC-NOS.
A Images of immunohistochemical staining for SREBF1 and FASN in patients with IMPC (n = 82) and IDC-NOS (n = 80). The upper panel shows high expression of SREBF1 and FASN in patients with IMPC; the lower panel shows low expression of the SREBF1 and FASN proteins in patients with IDC-NOS, x400 magnification. B The association of SREBF1 and FASN protein expression with disease-free survival (DFS) and overall survival (OS). The light panel shows SREBF1, and the right panel shows FASN, with patients with IMPC stratified by high and low protein expression. Log-rank test, two-sided. The P values for DFS of SREBF1 and FASN were 0.022, and 0.009, respectively; the P values for OS of SREBF1 and FASN were 0.044 and 0.015, respectively. P < 0.05 was considered to indicate statistical significance.