Core Tip: In this project, we identified five genes that were overexpressed in diffuse large B-cell lymphoma (DLBCL), and the five gene signatures were closely associated with the development of DLBCL. More importantly, the five genes were positively correlated with stromal cell infiltration, especially in cancer-associated fibroblasts. Taken together, these genes might be novel therapeutic targets for DLBCL.
INTRODUCTION
Diffuse large B-cell lymphoma (DLBCL) is a common type of non-Hodgkin lymphoma, which accounts for about 30% of non-Hodgkin lymphomapatients[1]. DLBCL is a highly malignant tumor; the survival time for those untreated patients is only several months[2]. The current treatments for DLBCL are mainly dependent on the clinical stages. The limited stage contains stages 1 and 2, the treatment guideline for which suggests the combination of systemic chemoimmunotherapy, including rituximab, cyclophosphamide, doxorubicin, vincristine and prednisone (R-CHOP), and the involved-field radiation therapy[3,4]. However, to some degree, involved-field radiation therapy is not applicable to patients with advanced stages. As the first-line treatment for DLBCL, many patients could benefit from R-CHOP. However, there are still 30%-40% of patients expressing resistance and recurrence[5-7]. Therefore, screening for novel targets involved in the development of DLBCL could prove an encouraging work for precision medicine.Considering the risks of drug resistance and recurrence in the treatment of DLBCL, this study aimed to reveal the significant genes in the development of DLBCL. The discovery of novel molecules could provide further understanding about the regulatory mechanism of DLBCL, finding therapeutic drug targets, and further understanding of the tumor microenvironment in the regulation of drug resistance and tumor recurrence[8,9]. However, we are still unclear about the tumor microenvironment in the development of DLBCL, especially in the regulation of stromal and immune infiltration.In this study, overexpressed genes in DLBCL were identified, and the key hub genes were confirmed for further analysis. We identified alpha-2-macroglobulin (A2M), fibronectin (FN1), cathepsin B (CTSB), matrix metallopeptidase 9 (MMP9) and secreted protein acidic and cysteine-rich (SPARC) as gene signatures, which were closely associated with the development of DLBCL. More interestingly, we confirmed the significantly positive correlation between the five gene signatures and stromal score in tissue samples of DLBCL. In detail, the five gene signatures could predict the infiltration level of cancer-associated fibroblasts (CAFs). Finally, the top two genes, FN1 and SPARC, showed a co-expression profile in DLBCL, which might be encouraged as novel therapeutic targets for DLBCL.
MATERIALS AND METHODS
Gene expression omnibus data acquisition and sorting
The GSE60 dataset was employed for the expression profiling assay on the DLBCL samples[10]. The expression profile was obtained from the Gene Expression Omnibus, and treated with log transformation. The Benjamini and Hochberg (false discovery rate) was applied to adjust the P value. The heatmap of differentially expressed genes (DEGs), principal components analysis plot, and correlation coefficient for all samples were produced by the Image GP tool (http://www.ehbio.com/ImageGP/index.php/).
Protein-protein interaction (PPI) and identification of hub genes
The top 20 DEGs were subjected to analysis through the STRING database (version 11.0)[11], and the multiple protein manners were applied, including the active interaction sources of text mining, experiments, databases, co-expression, neighborhood, gene fusion, and co-occurrence. The minimally required interaction score was set as medium confidence (0.40). The node genes were considered as the hub genes, and the PPI enrichment value expresses statistical difference.
Target gene expression analysis in DLBCL
The hub gene expression in tumor cells and normal cells was assessed in the GEPIA database[12]. DLBCL cells from 47 patients were included in the tumor group, and the Genotype-Tissue Expression[13] and normal lymphocytes were included in the normal group (n = 337). The expression profiling was investigated by the log2 (TPM + 1) method.
Gene alteration and clinical pathological analysis
The genomic alteration analysis of indicated genes was conducted in the cBioportal for cancer genomics (https://www.cbioportal.org/)[14]. The DLBCL (The Cancer Gene Atlas [TCGA], PanCancer Atlas) was selected, and the genomic alternation included missense mutation, amplification, deep deletion, and messenger ribonucleic acid-high. The correlation analysis between the indicated genes and the different stages of DLBCL was conducted in the UALCAN portal in the TCGA database. The patients’ information about the disease stage was obtained from the UALCAN[15].
Single sample gene set enrichment analysis
The correlation analysis between indicated genes and tumor purity and infiltration levels of immune/stromal cells in the tumor tissues was conducted by the Estimation of Stromal and Immune cells in Malignant Tumor tissues using Expression data (ESTIMATE)[16], which is a tool to predict the tumor purity and the infiltration levels of immune or stromal cells in tumor tissues. The ESTIMATE score represents tumor purity, and the stromal score correlates to the presence of stroma in the tumor tissues. For further analysis, CAFs, one kind of stromal cells, were employed for the correlation analysis with the MCP-Counter method[17] and tumor immune dysfunction and exclusion (TIDE)[18].
Statistical analysis
The data analyses were conducted with GraphPad Prism version 8.0 (GraphPad Software incorporated, La Jolla, CA, United States). The data were shown as the mean ± SD. Student’s t-test was applied to compare the statistical difference between groups. Correlation analysis was performed with the Spearman test. P < 0.05 was considered statistically significant.
RESULTS
DEGs in DLBCL
A total of 61 case samples, which involved 31 cases of normal cells and 30 cases of tumor cells were subjected to determine the DEGs. The top 20 overexpressed genes in DLBCL are shown in Figure 1A. Two samples in the normal group were significantly different from the other samples. The principal components analysis on 31 cases of normal cells and 30 cases of DLBCL tumor cells was conducted and shown in Figure 1B, and results were consistent with those shown in Figure 1A. Furthermore, the gene expression pattern of all samples was analyzed by the Spearman correlation matrix for the evaluation of the correlation coefficient, and the results showed that these DEGs have a certain co-expression manner.
Figure 1
The top 20 differentially expressed genes in diffuse large B-cell lymphoma. A: A total of 31 cases in the normal lymphocytes group and 30 cases in the diffuse large B-cell lymphoma group were subjected to analysis of the differentially expressed genes; B: The principal components analysis plot for 31 normal lymphocyte samples and 30 diffuse large B-cell lymphoma samples; C: The Spearman correlation matrix analysis of all samples was conducted by pairwise correlation coefficient analysis. A2M: Alpha-2-macroglobulin; CTSB: Cathepsin B; FN1: Fibronectin; MMP9: Matrix metallopeptidase 9; PC: Principal components; SPARC: Secreted protein acidic and cysteine-rich.
The top 20 differentially expressed genes in diffuse large B-cell lymphoma. A: A total of 31 cases in the normal lymphocytes group and 30 cases in the diffuse large B-cell lymphoma group were subjected to analysis of the differentially expressed genes; B: The principal components analysis plot for 31 normal lymphocyte samples and 30 diffuse large B-cell lymphoma samples; C: The Spearman correlation matrix analysis of all samples was conducted by pairwise correlation coefficient analysis. A2M: Alpha-2-macroglobulin; CTSB: Cathepsin B; FN1: Fibronectin; MMP9: Matrix metallopeptidase 9; PC: Principal components; SPARC: Secreted protein acidic and cysteine-rich.
Identification of hub genes and validation of these overexpressed genes in DLBCL
The top 20 DEGs were subjected to STRING for PPI analysis, and screening the hub genes in the regulation of DLBCL. As Figure 2A reveals, A2M, FN1, SPARC, CTSB, and MMP9 were identified as the hub genes. We further analyzed the differential expression of the hub genes in DLBCL in TCGA-DLBCL dataset. The results were similar to those shown in Figure 1 of the GSE60 dataset (Figure 2B-F), and these five hub genes were overexpressed in the DLBCL. Furthermore, the difference of the five genes in germinal center B-like or activated B-like DLBCL was also conducted, as presented in Supplementary Figure 1, and the results showed no significant difference between the two subtypes of DLBCL. Therefore, the five genes were deemed general regulators in DLBCL, and might be important mediators in the regulation of DLBCL.
Figure 2
The five hub genes were confirmed as overexpressed in diffuse large B-cell lymphoma. A: The protein-protein interaction was confirmed in STRING, and the hub genes were obtained; B-F: The expressions of alpha-2-macroglobulin (A2M) (B), fibronectin (FN1) (C), secreted protein acidic and cysteine-rich (SPARC) (D), cathepsin B (CTSB) (E), and matrix metallopeptidase 9 (MMP9) (F) were subjected to GEPIA to analyze the expression in diffuse large B-cell lymphoma and normal lymphocytes within The Cancer Gene Atlas dataset. aP < 0.001 vs the diffuse large B-cell lymphoma tumor samples.
The five hub genes were confirmed as overexpressed in diffuse large B-cell lymphoma. A: The protein-protein interaction was confirmed in STRING, and the hub genes were obtained; B-F: The expressions of alpha-2-macroglobulin (A2M) (B), fibronectin (FN1) (C), secreted protein acidic and cysteine-rich (SPARC) (D), cathepsin B (CTSB) (E), and matrix metallopeptidase 9 (MMP9) (F) were subjected to GEPIA to analyze the expression in diffuse large B-cell lymphoma and normal lymphocytes within The Cancer Gene Atlas dataset. aP < 0.001 vs the diffuse large B-cell lymphoma tumor samples.
The five hub genes had a certain association with stage of DLBCL
As above mentioned, the five hub genes were identified and confirmed as overexpressed in the DLBCL. Further analysis showed that the percentages of gene alteration among the five hub genes were 5% (A2M), 8% (FN1), 5% (CTSB), 2.7% (MMP9) and 5% (SPARC), respectively (Figure 3A), suggesting the significantly different alteration of the five hub genes, which promotes the overall understanding of the potential association between the hub genes and development of DLBCL. As Figure 3B-F shows, with increased tumor stage, the A2M, FN1, CTSB, MMP9, and SPARC expression becomes higher than that in the early stage (Stage 1). Through the analysis of the five genes associated with the development of DLBCL, we found that the FN1 expression was increased in a stage-dependent manner (Figure 3C). To further analyze the importance of FN1 in the development of DLBCL, and the correlation between FN1 expression and international prognosis index (IPI) score was tested. In detail, the DLBCL samples were divided into three groups according to the IPI score value 0, 1-2, 3-4 respectively, and the result showed that the FN1 expression showed no significant difference in three groups with different IPI scores (Supplementary Figure 2), suggesting that FN1 was an independent factor compared with the common IPI score system.
Figure 3
The alteration of hub genes had a potential association with the development of diffuse large B-cell lymphoma. A: The gene alterations were conducted in the cBioportal and included missense mutation, amplification, deep deletion, and messenger ribonucleic acid high; B-F: The expressions of alpha-2-macroglobulin (A2M) (B), cathepsin B (CTSB) (C), fibronectin (FN1) (D), matrix metallopeptidase 9 (MMP9) (E), secreted protein acidic and cysteine-rich (SPARC) (F) were subjected to GEPIA to analyze the gene expression in different stages of DLBCL. bP < 0.05 vs stage 1 group.
The alteration of hub genes had a potential association with the development of diffuse large B-cell lymphoma. A: The gene alterations were conducted in the cBioportal and included missense mutation, amplification, deep deletion, and messenger ribonucleic acid high; B-F: The expressions of alpha-2-macroglobulin (A2M) (B), cathepsin B (CTSB) (C), fibronectin (FN1) (D), matrix metallopeptidase 9 (MMP9) (E), secreted protein acidic and cysteine-rich (SPARC) (F) were subjected to GEPIA to analyze the gene expression in different stages of DLBCL. bP < 0.05 vs stage 1 group.
The five gene signatures were positively correlated with tumor purity in DLBCL
The above results showed the overexpression of these five genes in DLBCL compared with the normal lymphocytes group (Figure 2B-F), and the expression level had a positive correlation with the tumor stage (Figure 3B-F), suggesting the significance of these five genes in the development of DLBCL. To further confirm the findings, tumor purity analysis with the five gene signatures was conducted by the ESTIMATE score. As Figure 4A-E shows, the tumor purity score was positively correlated with the levels of A2M, CTSB, FN1, MMP9, and SPARC gene expression. Furthermore, CD19 is a specific marker for DLBCL; thus, the gene correlation analysis between CD19 and the five gene signatures was conducted and showed a positive correlation between the five gene signatures and the tumor purity (Figure 4F). All of these results showed that the five gene signatures were positively correlated with the tumor purity, suggesting the close association between the five gene signatures and DLBCL development.
Figure 4
Overexpression of five hub genes was confirmed in diffuse large B-cell lymphoma. A-E: Correlation analyses between alpha-2-macroglobulin (A2M) (A), cathepsin B (CTSB) (B), fibronectin (FN1) (C), matrix metallopeptidase 9 (MMP9) (D), secreted protein acidic and cysteine-rich (SPARC) (E) expression, and tumor purity were conducted by the Estimation of Stromal and Immune cells in Malignant Tumor tissues using Expression data Score (ESTIMATE) method; F: Expression correlation analysis between CD19 and the five hub genes was conducted in GEPIA. P < 0.05 was considered as statistically different.
Overexpression of five hub genes was confirmed in diffuse large B-cell lymphoma. A-E: Correlation analyses between alpha-2-macroglobulin (A2M) (A), cathepsin B (CTSB) (B), fibronectin (FN1) (C), matrix metallopeptidase 9 (MMP9) (D), secreted protein acidic and cysteine-rich (SPARC) (E) expression, and tumor purity were conducted by the Estimation of Stromal and Immune cells in Malignant Tumor tissues using Expression data Score (ESTIMATE) method; F: Expression correlation analysis between CD19 and the five hub genes was conducted in GEPIA. P < 0.05 was considered as statistically different.
Close association between the five gene signatures and stromal tumor microenvironment
The close correlation between the five hub genes and the tumor purity in the tumor microenvironment was shown above (Figure 4), and the stromal infiltration analysis is an important method to express detailed information regarding the tumor microenvironment. As Figure 5A-E shows, the correlation value between A2M, CTSB, FN1, MMP9, SPARC and the stromal score was 0.528 (P = 0.000143), 0.618 (P = 4.75E - 06), 0.584 (P = 1.9E - 05), 0.381 (P = 0.0079), and 0.715 (P = 6.37E - 08), respectively. The significant positive correlation revealed that the five hub genes might play an important role in the regulation of stromal cells in the tumor microenvironment.
Figure 5
Stromal infiltration analysis of the five hub genes. A-E: Alpha-2-macroglobulin (A2M) (A), cathepsin B (CTSB) (B), fibronectin (FN1) (C), matrix metallopeptidase 9 (MMP9) (D), and secreted protein acidic and cysteine-rich (SPARC) (E) expression levels were confirmed by correlation analysis with the stromal score. P < 0.05 was considered as statistically different.
Stromal infiltration analysis of the five hub genes. A-E: Alpha-2-macroglobulin (A2M) (A), cathepsin B (CTSB) (B), fibronectin (FN1) (C), matrix metallopeptidase 9 (MMP9) (D), and secreted protein acidic and cysteine-rich (SPARC) (E) expression levels were confirmed by correlation analysis with the stromal score. P < 0.05 was considered as statistically different.
Close association between the hub genes and CAFs’ infiltration in DLBCL
As is indicated in Figure 5, the significantly positive correlation between the five gene signatures and the stromal score suggests a close relationship between the five indicated genes and the tumor microenvironment. To further understand the detailed relationship between the five genes and the stromal infiltration among the tumor microenvironment, and the importance of CAFs in the tumor stromal analysis, we conducted correlation analysis between the five genes’ expression and the CAFs’ infiltration. The results by the MCP-Counter method revealed that the expression of CTSB, FN1, MMP9, and SPARC were positively correlated with the CAFs’ infiltration, having correlation coefficients of 0.201 (P = 0.202), 0.408 (P = 7.32E - 03), 0.803 (P = 1.63E - 10), 0.384 (P = 0.012) and 0.853 (P = 7.82E - 13), respectively (Figure 6B-F). However, A2M showed no significant correlation with the infiltration of CAFs (Figure 6A). More importantly, the results were confirmed with the TIDE portal. The results were consistent with the results from the MCP-Counter method, and the expression of CTSB, FN1, MMP9, and SPARC showed a significant positive correlation with the infiltration levels of CAFs (Figure 7B-E). The uncertain positive correlation between A2M and the degree of CAFs’ infiltration was also confirmed with the TIDE portal. From the above results, we confirmed that the two top genes were FN1 and SPARC. FN1 is a glycoprotein expressing gene, encoding an important component of the extracellular matrix, which could interact with the integrin receptor[19,20]. SPARC is a protein-coding gene rich in cysteine and could be expressed in fibroblasts, osteoblasts, chondrocytes, epithelial cells, and platelets[21]. Previous studies have shown a significant role of the two proteins, FN1 and SPARC, in the regulation of various physiological and pathological processes, including tissue reconstruction, cell migration, and morphogenesis[22-24]. In this study, we identified that the two genes were closely involved in regulation of the DLBCL tumor microenvironment.
Figure 6
The five hub genes were closely associated with infiltration level of cancer-associated fibroblasts with the MCP-Counter method. A-E: The alpha-2-macroglobulin (A2M) (A), cathepsin B (CTSB) (B), fibronectin (FN1) (C), matrix metallopeptidase 9 (MMP9) (D), and secreted protein acidic and cysteine-rich (SPARC) (E) gene expressions were evaluated for correlation with infiltration levels of cancer-associated fibroblasts (CAFs) in diffuse large B-cell lymphoma, and the correlation analysis was conducted with the Spearman method. P < 0.05 was considered as statistically different.
Figure 7
Validation of the positive correlation between the five genes and the infiltration level of cancer-associated fibroblasts. A-E: Alpha-2-macroglobulin (A2M) (A), cathepsin B (CTSB) (B), fibronectin (FN1) (C), matrix metallopeptidase 9 (MMP9) (D), and secreted protein acidic and cysteine-rich (SPARC) (E) were subjected to analysis for correlation with the infiltration level of cancer-associated fibroblasts (CAFs) by the tumor immune dysfunction and exclusion (TIDE) portal. P < 0.05 was considered as statistically different.
The five hub genes were closely associated with infiltration level of cancer-associated fibroblasts with the MCP-Counter method. A-E: The alpha-2-macroglobulin (A2M) (A), cathepsin B (CTSB) (B), fibronectin (FN1) (C), matrix metallopeptidase 9 (MMP9) (D), and secreted protein acidic and cysteine-rich (SPARC) (E) gene expressions were evaluated for correlation with infiltration levels of cancer-associated fibroblasts (CAFs) in diffuse large B-cell lymphoma, and the correlation analysis was conducted with the Spearman method. P < 0.05 was considered as statistically different.Validation of the positive correlation between the five genes and the infiltration level of cancer-associated fibroblasts. A-E: Alpha-2-macroglobulin (A2M) (A), cathepsin B (CTSB) (B), fibronectin (FN1) (C), matrix metallopeptidase 9 (MMP9) (D), and secreted protein acidic and cysteine-rich (SPARC) (E) were subjected to analysis for correlation with the infiltration level of cancer-associated fibroblasts (CAFs) by the tumor immune dysfunction and exclusion (TIDE) portal. P < 0.05 was considered as statistically different.
Co-expression pattern between FN1 and SPARC
As Figure 6 and 7 show, FN1 and SPARC were the top two molecules correlated with the CAFs’ infiltration, suggesting that the two genes might play a role as pro-oncogenes in the regulation of DLBCL. To further study the potential association between FN1 and SPARC, we conducted the gene expression correlation analysis. As is shown in Figure 8A, FN1 was remarkably correlated with the SPARC expression, with the correlation value of up to 0.81 (P = 3.14E - 12). Besides, similar results were confirmed by the correlation analysis between the FN1 protein level and the SPARC messenger ribonucleic acid level (r = 0.64, P = 5.31E – 5; Figure 8B). These data suggested that FN1 was closely associated with SPARC, and the two genes might be in a co-expression pattern in the DLBCL condition.
Figure 8
The top two genes, fibronectin and secreted protein acidic and cysteine-rich, were expressed in a remarkably correlated manner. A: A total of 48 diffuse large B-cell lymphoma samples were applied to TIMER to determine the expression correlation between fibronectin (FN1) and secreted protein acidic and cysteine-rich (SPARC); B: The FN1 protein levels were positively correlated with the SPARC messenger ribonucleic acid levels. The correlation analysis was conducted by the Spearman method, and P < 0.05 was considered as statistically different.
The top two genes, fibronectin and secreted protein acidic and cysteine-rich, were expressed in a remarkably correlated manner. A: A total of 48 diffuse large B-cell lymphoma samples were applied to TIMER to determine the expression correlation between fibronectin (FN1) and secreted protein acidic and cysteine-rich (SPARC); B: The FN1 protein levels were positively correlated with the SPARC messenger ribonucleic acid levels. The correlation analysis was conducted by the Spearman method, and P < 0.05 was considered as statistically different.
DISCUSSION
DLBCL is a highly heterogeneous tumor; clinical studies showed that DLBCL patients have multiple subtypes and different responses to the treatment of R-CHOP, which is widely recognized by clinicians[5,25]. As a classical example, DLBCL patients with c-Myc gene translocation have poor prognosis after R-CHOP therapy. Therefore, these resistant patients should turn to other treatments, such as chimeric antigen receptor T cells therapy, immunomodulators (immune-inhibitors or immune-agonists), and hematopoietic stem cell transplantation[26-29]. From this aspect, the prognostic prediction analysis based on the patient’s individual gene-mapping information could significantly improve the outcome of precision therapy. However, the identification of prognostic biomarkers for DLBCL has been seldom reported. Though DLBCL is a highly heterogeneous tumor, the stromal infiltration level in DLBCL is unclear. Thus, this study aimed to identify gene signatures for DLBCL and to evaluate the potential significance of these genes in the development of DLBCL.In this study, we first identified the top 20 overexpressed genes in the DLBCL, and found a correlation coefficient pattern among them (Figure 1), suggesting that they might be involved in the DLBCL and play similar roles in the development of DLBCL. Based on the significant correlation co-efficient pattern of these overexpressed genes, this study was designed to identify the hub genes using the STRING database. With this portal, we analyzed the PPI and obtained five gene signatures (including A2M, CTSB, FN1, MMP9, and SPARC) (Figure 2A). The overexpression of the five genes was confirmed in another DLBCL dataset (Figure 2B-F). Besides, this study identified that these five genes showed certain percentages of genomic alteration, which was 5% (A2M), 8% (FN1), 5% (CTSB), 2.7% (MMP9) and 5% (SPARC), respectively (Figure 3A). The positive correlation between the five genes and disease stages revealed that the gene signature might be involved in the development of DLBCL and might perform as a predictor for the disease progression (Figure 3B-F). The hypothesis was further confirmed by the ESTIMATE analysis, which suggested that the significantly positive correlation between the five genes’ expression and the tumor purity in the DLBCL tumor tissues also reflected the close association between the five gene signatures and DLBCL (Figure 4). More importantly, this study evaluated the correlation between the five genes and stromal score (Figure 5). The results suggested that these five genes were closely involved in the regulation of the extracellular matrix, the most important factor for tumor microenvironment, the abnormal regulation of which plays key roles in tumor progression. Considering the significance of CAFs in the extracellular matrix of some cancers[30,31], we hypothesized that these five genes might be closely associated with CAFs’ infiltration. Figure 6 and 7 show the confirmation of our hypothesis, with the five gene signatures being significantly associated with CAFs’ infiltration. Therefore, based on the inducing effect of CAFs in tumor recurrence and metastasis, the five gene signatures might act as a predictor of CAFs-associated tumor metastasis and recurrence. The detailed analysis revealed that FN1 and SPARC were the most important among the five gene signatures; the remarkable co-expression manner suggested that the two molecules might interact with each other (Figure 8). However, the experimental confirmation of the FN1 and SPARC protein interaction was not included in this study. Moreover, this study was based exclusively on the gene expression level rather than that of the encoded protein. For further study, these points should be attached to much importance, and it could encourage rapid diagnosis for DLBCL patients and provide guidelines for clinical treatment.
CONCLUSION
Our study found five gene signatures associated with the stromal infiltration, which might provide opportunities to better understand the significance of the tumor microenvironment in DLBCL.
ARTICLE HIGHLIGHTS
Research background
Diffuse large B-cell lymphoma (DLBCL) is a lymphoma with high mortality rates. Even though some therapeutic strategies are applied in clinical practice, the prognoses of DLBCL patients remain unsatisfactory. Therefore, the screening of novel therapeutic targets or prognostic biomarkers could be an important work for DLBCL therapy, which could contribute to the improvement of treatment regimens.
Research motivation
This study aimed to identify the novel biomarkers of DLBCL, and analyze the prognostic value of these biomarkers.
Research objectives
This study addressed the question of the novel biomarkers and potential mechanism involved in the development of DLBCL.
Research methods
The differentially expressed genes (DEGs) of DLBCL were examined with the GSE60 dataset, and these DEGs were applied to the STRING tool to conduct protein-protein interaction (PPI) analysis. The key hub genes based on PPI analysis were then applied to the GEPIA portal to analyze the expression level in DLBCL. The gene alteration level and the correlation between fibronectin protein level and secreted protein acidic and cysteine-rich messenger ribonucleic acid expression was analyzed in cBioportal. Moreover, the expression level of the hub genes in different stages were investigated in the UALCAN portal. The gene correlation analysis was conducted in GEPIA. The TIMER portal was used to evaluate the correlation between the gene expression and tumor purity, infiltrated stromal cells and infiltrated level of cancer-associated fibroblasts.
Research results
The top 20 DEGs in DLBCL were obtained, and the hub genes (A2M, CTSB, FN1, MMP9, and SPARC) were identified based on DEGs through PPI analysis. The five hub genes were overexpressed in DLBCL, and gene alteration was also confirmed in cBioportal, including messenger ribonucleic acid high amplification and missense mutation. Furthermore, the five hub genes had a positive correlation with the tumor stage. Besides, the positive correlation between the five hub genes levels and the tumor purity was also confirmed by the overexpression of the five hub genes in DLBCL. More interestingly, there was a significant correlation between the five hub genes’ expression level and the stromal infiltration score, especially in the correlation analysis with cancer-associated fibroblasts’ infiltration level.
Research conclusions
A five hub gene signatures were identified in DLBCL, and the overexpression of these five genes were closely associated with the progression of DLBCL. The mechanism evaluation showed positive correlation between the five genes’ expression levels and infiltrated levels of stromal cells, especially for the cancer-associated fibroblasts. In summary, the five gene signatures have potential values as novel therapeutic targets or biomarkers for DLBCL.
Research perspectives
In this project, we identified five gene signatures in DLBCL and that the overexpression of the five genes is closely associated with the disease development, suggesting that the five gene signatures might be novel therapeutic targets for DLBCL, especially in the regulation of cancer-associated fibroblasts. In our subsequent work, the detailed mechanism underlying the regulation of the five genes in the tumor microenvironment will be addressed, which could promote the further understanding of these five gene signatures in DLBCL.
Authors: Ruth Pettengell; Monika Długosz-Danecka; David Andorsky; David Belada; Pencho Georgiev; Donald Quick; Jack W Singer; Simran B Singh; Athanasios Pallis; Anton Egorov; Gilles Salles Journal: Br J Haematol Date: 2019-12-27 Impact factor: 6.998
Authors: Yu Ri Kim; Sun Och Yoon; Soo-Jeong Kim; June-Won Cheong; Haerim Chung; Jung Yeon Lee; Ji Eun Jang; Yundeok Kim; Woo-Ick Yang; Yoo Hong Min; Jin Seok Kim Journal: Ann Hematol Date: 2020-05-10 Impact factor: 3.673
Authors: Stephen D Smith; Brian G Till; Mazyar S Shadman; Ryan C Lynch; Andrew J Cowan; Qian V Wu; Jenna Voutsinas; Heather A Rasmussen; Katherine Blue; Chaitra S Ujjani; Andrei Shustov; Ryan D Cassaday; Jonathan R Fromm; Ajay K Gopal Journal: Br J Haematol Date: 2020-02-06 Impact factor: 6.998
Authors: Viola Poeschel; Gerhard Held; Marita Ziepert; Mathias Witzens-Harig; Harald Holte; Lorenz Thurner; Peter Borchmann; Andreas Viardot; Martin Soekler; Ulrich Keller; Christian Schmidt; Lorenz Truemper; Rolf Mahlberg; Reinhard Marks; Heinz-Gert Hoeffkes; Bernd Metzner; Judith Dierlamm; Norbert Frickhofen; Mathias Haenel; Andreas Neubauer; Michael Kneba; Francesco Merli; Alessandra Tucci; Peter de Nully Brown; Massimo Federico; Eva Lengfelder; Alice di Rocco; Ralf Trappe; Andreas Rosenwald; Christian Berdel; Martin Maisenhoelder; Ofer Shpilberg; Josif Amam; Konstantinos Christofyllakis; Frank Hartmann; Niels Murawski; Stephan Stilgenbauer; Maike Nickelsen; Gerald Wulf; Bertram Glass; Norbert Schmitz; Bettina Altmann; Markus Loeffler; Michael Pfreundschuh Journal: Lancet Date: 2019-12-21 Impact factor: 79.321
Authors: Peng Jiang; Shengqing Gu; Deng Pan; Jingxin Fu; Avinash Sahu; Xihao Hu; Ziyi Li; Nicole Traugh; Xia Bu; Bo Li; Jun Liu; Gordon J Freeman; Myles A Brown; Kai W Wucherpfennig; X Shirley Liu Journal: Nat Med Date: 2018-08-20 Impact factor: 53.440