Literature DB >> 33126329

Integrated TCGA and GEO analysis showed that SMAD7 is an independent prognostic factor for lung adenocarcinoma.

Zhou-Tong Dai1, Jun Wang1, Kai Zhao2, Yuan Xiang3, Jia Peng Li1, Hui-Min Zhang1, Zi-Tan Peng1,4, Xing Hua Liao1.   

Abstract

The lack of effective markers leads to missed optimal treatment times, resulting in poorer prognosis in most cancers. Drosophila mothers against decapentaplegic protein (SMAD) family members are important cytokines in the transforming growth factor-beta family. They jointly regulate the processes of cell growth, differentiation, and apoptosis. However, the expression of SMAD family genes in pan-cancers and their impact on prognosis have not been elucidated. Perl software and R software were used to perform expression analysis and survival curve analysis on the data collected by TCGA, GTEx, and GEO, and the potential regulatory pathways were determined through gene ontology enrichment and kyoto encyclopedia of genes and genomes enrichment analysis. It was found that SMAD7 and SMAD9 expression decreased in lung adenocarcinoma (LUAD), and their expression was positively correlated with survival time. Additionally, SMAD7 could be used as an independent prognostic factor for LUAD. In general, SMAD7 and SMAD9 can be used as prognostic markers of LUAD. Further, SMAD7 is expected to become a therapeutic target for LUAD.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 33126329      PMCID: PMC7598801          DOI: 10.1097/MD.0000000000022861

Source DB:  PubMed          Journal:  Medicine (Baltimore)        ISSN: 0025-7974            Impact factor:   1.817


Introduction

Transforming growth factor-beta (TGF-β) is distributed in various systems of the human body and is an extremely important type of growth factor that regulates the process of cell differentiation and maturation. It regulates a series of states such as cell migration, growth, differentiation, and apoptosis.[ Early studies have shown that drosophila mothers against decapentaplegic protein (SMAD) can be directly activated by TGF-β induced cell membrane receptors to form transcription complexes, which further control the transcription of target genes in the nucleus of transcription. Therefore, SMAD protein not only becomes an essential part of the TGF-β signaling pathways but also regulates cell function together.[ In recent years, it is related to the occurrence and development of various diseases, especially multiple malignant tumors.[ Currently, 8 types of SMAD proteins are encoded in the human genome. SMAD1, SMAD5, SMAD9 (SMAD8) belong to the receptor substrates of anti-mullerian-hormone and bone morphogenetic protein in the TGF-β family,[ SMAD2, and SMAD3 are receptor substrates of activin, TGF-β, and Nodal pathways. SMAD4 assists all R-SMADs. SMAD6 and SMAD7 are inhibitory SMAD proteins.[ Recent studies have shown that SMAD1 promotes colorectal cancer cell migration.[ Meanwhile, in lung cancer, blocking SMAD2 and SMAD4 can block the function of TGF-β.[ SMAD3 can participate in the epithelial-mesenchymal transition (EMT) process of cervical cancer through Long noncoding RNA OIP5-AS1. The expression level of SMAD4 is positively correlated with the survival rate of colon cancer, and the lack of SMAD4 leads to a poor prognosis.[ Down-regulation of SMAD5 can inhibit nasopharyngeal carcinoma cell proliferation, invasion, and migration.[ The up-regulation of SMAD6 can promote the appreciation of liver cancer cells.[ In addition, according to different results, how SAMD7 regulates cancer cell proliferation and migration is still controversial.[ The expression of SMAD9 is also closely related to the risk of essential hypertension. However, the SMAD family's expression and function in some cancers are still unclear, and there are fewer reports about prognosis, especially in lung cancer. At present, the use of large databases and regulatory networks has been widely accepted in biology, such as the prognostic characteristics of melanoma by transcriptome analysis,[ and the development of new prognostic models in liver cancer.[ In this study, a meta-analysis of the expression and prognostic value of SMAD family members in cancer using multiple databases. We explore the expression, prognosis, clinical features, and possible regulatory pathways of SMAD family members in patients with lung adenocarcinoma (LUAD), and provided a theoretical basis for future studies of SMAD family members in LUAD.

Materials and methods

Expression and survival curve of SMAD family genes in cancer

In order to analyze the expression of SMAD family genes in cancer, survival data and expressions of clinical from GTEx, TCGA, and Oncomine databases were summarized to verify the expression of each SMAD family gene in cancer. Oncomine database is a gene chip-based database and integrated data extraction platform. In this database, you can set the conditions for filtering and extracting data according to your own needs (http://www.oncomine.org). In this study, we set the screening conditions as: “Analysis type: cancer vs normal analysis. P-value: .05. Threshold (fold change): 2. Threshold (gene rank): Top 10%.” Meanwhile, The gene expression profling interactive analysis (GEPIA) tool was used to analyze the clinical data of the GTEx and TCGA databases (http://gepia.cancer-pku.cn) and to compare the expression differences of the SMAD family in pan-cancers with their control of normal tissues and its survival.[ In addition, the Oncomine database was used to analyze the different expression of SMAD families in each lung cancer subtype.

Analysis of clinical characteristics

The expression data of all genes in LUAD and their corresponding clinical information were extracted from the TCGA database. R software with limma was used to average the repeated data of each expression. In addition, Perl software was used to summarize the information into a matrix. The Wilcox Test was used on a single gene to examine the relationship with clinical characteristics.

Analysis of differential genes

Log (x+1) processing was performed on data of the extracted TCGA clinical, and the samples were divided into high expression groups and low expression groups according to the median expression of the SMAD7 and SMAD9. R software with limma, pheatmap, and ggplot2 packages was used to filter the original data of the target gene in LUAD and normalize and screen out the differential genes between the 2 groups. The result were displayed using volcanic plots and heatmaps. Differential gene screening criteria: | log FC | ≥2, P. adjust < .05. The clustering method was Euclidean distance.

Analysis GO and KEGG

The cluster profiler package of R software was used to perform gene ontology (GO) functional analysis and kyoto encyclopedia of genes and genomes (KEGG) pathway analysis on the differential genes screened above, and the differences were screened using P. adjust < .05 as the threshold.[P. adjust <.05 is the main enrichment function and pathway of screening differential genes for the conditional threshold.

Analysis of protein interaction network analysis

The selected differential genes were introduced into the search tool for the retrieval of interacting genes. It is an online analysis website for protein-protein interaction (PPI) (https://string-db.org/).[ The results were imported into Cytoscape software,[ and key protein expression modules and key node genes were screened.

Analysis of GEO

The download data of GSE43767 from the database of GEO chips in NCBI (http://www.ncbi.nlm.nih.gov/geo/),[ which includes 113 samples and 29 samples of therapeutic or spontaneous abortion, 15 normal samples and 69 LUAD samples. R software with the limma and beeswarm packages were used to process the obtained data and draw different expression heatmaps and volcano plots.

Analysis of gene set enrichment

Analysis of TCGA clinical data was used by gene set enrichment analysis (GSEA) with version of 4.0.3.[ According to the expression levels of SMAD7 and SMAD9, they were divided into 2 groups: high expression group and low expression group. The effect of their expression level on the gene set of various biological pathways was analyzed by GSEA. The gene set obtained from the MsigDB database of the GSEA website was used as the reference gene set, and the P-value was calculated 1000 times per analysis cycle according to the weighted method.

Analysis of independent prognostic factor

R software with survival and survminer packages were used to analyze TCGA clinical data. Both univariate analysis and multivariate analysis were COX proportional hazard regression models.

Ethical statement

All the data of this paper was obtained from the open-access database, we did not get these data from patients or animals directly, nor intervene these patients. So the ethical approval was not necessary.

Result

Expression of SMAD protein family in pancreatic cancer

The expression of SMAD protein family members in human cancers at the mRNA level was analyzed by using the Oncomine online database. Analysis of expression differences between cancer and normal tissues according to the selected criteria, the results showed that there were 442, 458, 453, 459, 459, 448, 456, 389 independent studies in the database involving expressions from SMAD1 to SMAD9 (Fig. 1). Interestingly, with the exception of a few SMAD genes that have increased expression in several specific cancers, SMAD protein family members have decreased expression in most cancers. In detail, SMAD1 expression increased in brain and central nervous system (CNS) cancer and lymphoma, the expression of SMAD5 increased in brain and CNS cancer, colorectal cancer and kidney cancer, SMAD6 expression increased in esophageal cancer, and SMAD9 expression increased in brain and CNS cancer. However, in the other cancer data, as shown in Table 1, except for testicular cancer, most members of the SMAD protein family have decreased expression, and there is no significant expression difference in other types of cancer.
Figure 1

Analysis of the mRNA expression of drosophila mothers against decapentaplegic protein family genes (cancer tissue vs normal tissue) from the Oncomine database. The Oncomine database was used to analyze. The number of statistically significant datasets from data filtered by the screening criteria. Red represents increased expression, blue represents decreased expression, and each group of underground numbers represents the total number of studies.

Table 1

Expression of drosophila mothers against decapentaplegic protein famliy in other cancers in Oncomine.

Analysis of the mRNA expression of drosophila mothers against decapentaplegic protein family genes (cancer tissue vs normal tissue) from the Oncomine database. The Oncomine database was used to analyze. The number of statistically significant datasets from data filtered by the screening criteria. Red represents increased expression, blue represents decreased expression, and each group of underground numbers represents the total number of studies. Expression of drosophila mothers against decapentaplegic protein famliy in other cancers in Oncomine. In order to further determine the expression difference of SMAD protein family between cancer and normal tissues, the TCGA and GTEx database was used to jointly analyze the expression difference of SMAD protein family in 31 cancers, and a heatmap was drawn (Fig. 2). The expression of SMAD family genes were used t test to calculate the P value of tumor tissue and normal tissue (Figs. S1–S8). And found the intersection of the results of TCGA and GTEx and the results of Oncomine database. Compared with normal tissues, it was found that the expressions of SMAD1, SMAD4, SMAD5, and SMAD7 were significantly different in brain lower grade glioma. In invasive breast carcinoma, SMAD9 expression was significantly different. There are significant differences in the expression of SMAD1 in acute myeloid leukemia, significant differences in the expression of SAMD6, SMAD7, and SMAD9 in LUAD, and the expression of SMAD1 and SMAD7 in lymphoid neoplasm diffuse large B-cell lymphoma. There are significant differences in the expression of SMAD6 in prostate adenocarcinoma, and significant differences in the expression of SMAD1 and SMAD7 in testicular germ cell tumors.
Figure 2

Analysis of drosophila mothers against decapentaplegic protein family gene expression of GTEx and TCGA databases. Heatmaps show the logFC value of drosophila mothers against decapentaplegic protein family members in all types of cancer. The color label indicates the different logFC values.

Analysis of drosophila mothers against decapentaplegic protein family gene expression of GTEx and TCGA databases. Heatmaps show the logFC value of drosophila mothers against decapentaplegic protein family members in all types of cancer. The color label indicates the different logFC values.

Prognostic analysis of SMAD protein family

To determine the prognostic values of the genes selected, Kaplan–Meier survival analysis was conducted on the genes selected above based on the clinical information in the TCGA database. In LUAD, SMAD6 (log-rank P = .65, P (HR) = .66) cannot show an obvious correlation with overall survival (Fig. 3). Similarly, in all other cancers that have been analyzed, the differential genes for other SMAD protein families also showed the same negative results as SMAD6. However, in LUAD, both SMAD7 (log-rank P = .0099, p (HR) = 0.01) and SMAD9 (log-rank P = .0017, P (HR) = .0019) shown in figure showed positive results (Fig. 3). The prognosis of SMAD7 and SMAD9 high expression groups were significantly better than that of low expression groups.
Figure 3

Prognostic value of drosophila mothers against decapentaplegic protein family in lung adenocarcinoma. Survival curves were drawn in K-M method, and survival curve was tested by COX and log-rank.

Prognostic value of drosophila mothers against decapentaplegic protein family in lung adenocarcinoma. Survival curves were drawn in K-M method, and survival curve was tested by COX and log-rank.

Clinical features of SMAD7 and SMAD9

In order to evaluate the clinical characteristics of SMAD7 and SMAD9, we extracted the expression data of SMAD7 and SMAD9 in TCGA in different types of lung cancer. they showed the same results as that from the combined analysis of TCGA and GTEx (Fig. 4A). In LUAD, the expressions of SMAD7 (P = 1.76 × 10−12) and SMAD9 P = 1.64 × 10−12) were reduced compared to normal tissues. However, as shown in figure (Fig. 4B.C.D), after analyzing their stage, gender, age, and expression, it was found that the expression of SMAD7 has nothing to do with the stage, gender, and age. In SMAD9 (Fig. 4E.F.G), although there are differences between Stage1 and Stage3 (P = 1.07 × 10−2), there is no continuous difference. Thus, the expression of SMAD9 is independent of the stages. On the other hand, SMAD9 expression was slightly higher in women than that in men (P = 2.07 × 10−2). In addition, the expression of SMAD9 is higher in young patients, but it is worth noting due to the insufficient sample size of young patients (n = 12).
Figure 4

Clinical characteristics of SMAD7 (drosophila mothers against decapentaplegic protein) and SMAD. (A) Box plots indicate the mRNA expression levels of SMAD7 and SMAD9 in different types of lung cancer in the TCGA database. The values are normalized by log2. 0 = Normal (810); 1: acinar lung adenocarcinoma (LUAD) (6); 2: LUAD (261); 3: LUAD, mixed subtype (67); 4: lung clear cell adenocarcinoma (2); 5: lung mucinous adenocarcinoma (6); 6: micropapillary LUAD (3); 7: mucinous bronchioloalveolar carcinoma (3); 8:non-mucinous bronchioloalveolar carcinoma (9); 9: papillary LUAD (10); 10:solid LUAD (1); 11:squamous cell lung carcinoma (348); 12:squamous cell lung carcinoma, basaloid variant (8); 13:squamous cell lung carcinoma, papillary variant (2); 14:squamous cell lung carcinoma, small cell variant (1). (b) Expression of SMAD7 in LUAD at different stages. (C) Expression of SMAD7 in LUAD of different genders. (D) Expression of SMAD7 in LUAD at different ages. (E) Expression of SMAD9 in LUAD at different stages. (F) SMAD9 expression in LUAD of different genders. (G) SMAD9 expression in LUAD at different ages. The value is normalized by log2 (TPM + 1) for log-scale, ∗ is the significant differences (P<.05) after t test.

Clinical characteristics of SMAD7 (drosophila mothers against decapentaplegic protein) and SMAD. (A) Box plots indicate the mRNA expression levels of SMAD7 and SMAD9 in different types of lung cancer in the TCGA database. The values are normalized by log2. 0 = Normal (810); 1: acinar lung adenocarcinoma (LUAD) (6); 2: LUAD (261); 3: LUAD, mixed subtype (67); 4: lung clear cell adenocarcinoma (2); 5: lung mucinous adenocarcinoma (6); 6: micropapillary LUAD (3); 7: mucinous bronchioloalveolar carcinoma (3); 8:non-mucinous bronchioloalveolar carcinoma (9); 9: papillary LUAD (10); 10:solid LUAD (1); 11:squamous cell lung carcinoma (348); 12:squamous cell lung carcinoma, basaloid variant (8); 13:squamous cell lung carcinoma, papillary variant (2); 14:squamous cell lung carcinoma, small cell variant (1). (b) Expression of SMAD7 in LUAD at different stages. (C) Expression of SMAD7 in LUAD of different genders. (D) Expression of SMAD7 in LUAD at different ages. (E) Expression of SMAD9 in LUAD at different stages. (F) SMAD9 expression in LUAD of different genders. (G) SMAD9 expression in LUAD at different ages. The value is normalized by log2 (TPM + 1) for log-scale, ∗ is the significant differences (P<.05) after t test.

Differentially expressed genes (DEGs) in TCGA

The data of LUAD in TCGA were divided into 2 groups of high expression and low expression according to the target gene median, and the DEG was used to screen the gene expression data between the 2 groups with limma in R software. According to the grouping result, a total of 12 DEGs of SMAD7 and 57 DEGs of SMAD9 were identified from the TCGA database (Fig. 5A.B).
Figure 5

Heatmap and volcano plot of differential gene expression of drosophila mothers against decapentaplegic protein 7 and drosophila mothers against decapentaplegic protein 9. (A) Heatmap of Differentially expressed genes screened based on clinical data from TCGA database. (B) Volcano plot of the differentially expressed genes. Red indicates genes that are up-regulated and green indicates genes that are down-regulated.

Heatmap and volcano plot of differential gene expression of drosophila mothers against decapentaplegic protein 7 and drosophila mothers against decapentaplegic protein 9. (A) Heatmap of Differentially expressed genes screened based on clinical data from TCGA database. (B) Volcano plot of the differentially expressed genes. Red indicates genes that are up-regulated and green indicates genes that are down-regulated.

GO and KEGG enrichment analysis of differential genes

Cluster Profiler, org.Hs.eg.db, richplot, and ggplot2 packages in R software were employed to analyze the functions of DEGs in LUAD. The results show that in GO enrichment, SMAD7 mainly participates in the function of the regulation of blood vessels. SMAD9 is mainly involved in the function of zymogen activation (Fig. 6A). In the KEGG enrichment, SMAD7 is mainly involved in functions such as protein digestion and absorption, and SMAD9 is mainly involved in functions such as the NOD-like receptor signaling pathway (Fig. 6B).
Figure 6

gene ontology enrichment analysis and kyoto encyclopedia of genes and genomes enrichment analysis of differentially expressed genes. (A) Drosophila mothers against decapentaplegic protein 7. (B) Drosophila mothers against decapentaplegic protein 9. The abscissa is the Rich factor. A larger value indicates a greater degree of enrichment. The ordinate is a pathway term with a higher degree of enrichment. The p value is the p value that has undergone multiple checks. The redder the color, the smaller the p value, indicating that the enrichment is more obvious. The size of the dot indicates the number of differential genes in the term, and the larger the dot, the more genes there are. BP = biological processes, CC = cellular components, MF = molecular functions.

gene ontology enrichment analysis and kyoto encyclopedia of genes and genomes enrichment analysis of differentially expressed genes. (A) Drosophila mothers against decapentaplegic protein 7. (B) Drosophila mothers against decapentaplegic protein 9. The abscissa is the Rich factor. A larger value indicates a greater degree of enrichment. The ordinate is a pathway term with a higher degree of enrichment. The p value is the p value that has undergone multiple checks. The redder the color, the smaller the p value, indicating that the enrichment is more obvious. The size of the dot indicates the number of differential genes in the term, and the larger the dot, the more genes there are. BP = biological processes, CC = cellular components, MF = molecular functions.

GSEA of the TCGA

Although DEGs have been used for GO and KEGG enrichment, it only screened for differential expressions and did not involve the degree and direction of differential gene expressions. Therefore, the function of SMAD7 and SMAD9 in LUAD was further analyzed by using GESA. In the GO and KEGG enrichment analysis results, SMAD7 positive mainly regulates processes such as cellular respiration and RNA degradation, but SMAD7 negative mainly regulates processes such as regulation of cellular response and leukocyte transendothelial migration (Fig. 7A.B). SMAD9 positive mainly regulates the processes such as chronic inflammatory response and galactose metabolism, while SMAD9 negative mainly regulates the processes such as lung alveolus development and GNRH signaling pathway (Fig. 8A.B). These enrichment analysis results can better help us understand how SMAD7 and SMAD9 participate in the regulation of LUAD.
Figure 7

gene ontology and kyoto encyclopedia of genes and genomes enrichments analyses of drosophila mothers against decapentaplegic protein 7 by using gene set enrichment analysis. (A) Top 5 positive enrichment scores. (B) Top 5 negative enrichment scores.

Figure 8

gene ontology and kyoto encyclopedia of genes and genomes enrichments of genes by drosophila mothers against decapentaplegic protein 9 analyzed using gene set enrichment analysis. (A) Top 5 positive enrichment scores. (B) Top 5 negative enrichment scores.

gene ontology and kyoto encyclopedia of genes and genomes enrichments analyses of drosophila mothers against decapentaplegic protein 7 by using gene set enrichment analysis. (A) Top 5 positive enrichment scores. (B) Top 5 negative enrichment scores. gene ontology and kyoto encyclopedia of genes and genomes enrichments of genes by drosophila mothers against decapentaplegic protein 9 analyzed using gene set enrichment analysis. (A) Top 5 positive enrichment scores. (B) Top 5 negative enrichment scores.

The network of DEGs PPI

The PPI network helps us to explore further the molecular mechanism of SMAD7 and SMAD9 in LUAD. The search tool for the retrieval of interacting genes network tool was used to analyze the identified DEGs. After hiding the disconnected nodes in the network for SMAD7 (Fig. 9A), the PPI network of the DEGs consisted of 12 nodes and 15 edges. The top 5 of predicted functional partners are FGG, COL1A2, F2, SERPINC1, and LOX. Mainly through platelet aggregation, common pathway of fibrin clot formation (FDR = 1.05 × 10−10), integrin cell surface interactions (FDR = 1.14 × 10−9), extracellular matrix organization (FDR = 2.58 × 10−9) and GRB2: SOS provides linkage to MAPK signaling for Integrins (FDR = 4.79 × 10−6), which regulates the occurrence and development of LUAD. For SMAD9 (Fig. 9B), the PPI network of the DEGs consisted of 54 nodes and 238 edges. The top 5 of predicted functional partners are FGB, HGF, F2, TRAF2, and OASL, which mainly involved in the immune system (FDR = 1.05 × 10−7), interferon alpha/beta signaling (FDR = 2.22 × 10−7), cytokine signaling in immune system (FDR = 5.45 × 10−7) and interferon signaling (FDR = 6.62 × 10−6), the finding showed that SMAD7 and SMAD9 are mainly regulated the occurrence and development of LUAD.
Figure 9

Differentially expressed genes (DEG's) protein-protein interaction (PPI) network. (A) PPI network between DEGs of drosophila mothers against decapentaplegic protein 7. (B) PPI network between DEGs of drosophila mothers against decapentaplegic protein 9. All disconnected nodes are hidden.

Differentially expressed genes (DEG's) protein-protein interaction (PPI) network. (A) PPI network between DEGs of drosophila mothers against decapentaplegic protein 7. (B) PPI network between DEGs of drosophila mothers against decapentaplegic protein 9. All disconnected nodes are hidden.

Identification of the expression of SMAD7 and SMAD9 in GSE43767

GSE43767 is a microarray study based on normal and LUAD patients. It includes data from 15 normal lung tissue samples and lung tissue from 69 LUAD patients. The samples were analyzed using the limma package in R software. The results showed (Fig. 10) that the expression of SMAD7 and SMAD9 was significantly reduced in cancer patients.
Figure 10

Expressions of drosophila mothers against decapentaplegic protein 7 and drosophila mothers against decapentaplegic protein 9 of the GSE43767 dataset in the GEO database. The ordinate is the expression value after taking log (x + 1) for the data result.

Expressions of drosophila mothers against decapentaplegic protein 7 and drosophila mothers against decapentaplegic protein 9 of the GSE43767 dataset in the GEO database. The ordinate is the expression value after taking log (x + 1) for the data result.

SMAD7 is an independent prognostic factor for LUAD

The clinical information of LUAD patients was extracted from the TCGA database, and some clinical information samples with some missing data were deleted, and the survival and survminer packages in R software were employed to analyze the data using the COX risk ratio model in with the expression of SMAD7 and SMAD9. It was shown (Table 2) that in SMAD7, both the results obtained based on univariate Cox regression analysis (P = 1.42 × 10−2) or multivariate cox regression analysis (P = 4.88 × 10−4) are statistically significant (Fig. 11A), indicating that SMAD7 can be used as an independent prognostic factor for LUAD. Unfortunately, SMAD9 cannot be used as an independent prognostic factor (P = .086) (Fig. 11B).
Table 2

Analysis of independent prognostic factor.

Figure 11

Multi-factor COX analysis forest map of drosophila mothers against decapentaplegic protein 7 and drosophila mothers against decapentaplegic protein 9.

Analysis of independent prognostic factor. Multi-factor COX analysis forest map of drosophila mothers against decapentaplegic protein 7 and drosophila mothers against decapentaplegic protein 9.

Discussion

At present, lung cancer is one of the first high-incidence malignancies among all tumors.[ Histopathologically, it can be divided into non-small cell lung cancer and small cell lung cancer. Epidemiological statistics show that clinically, non-small cell lung cancer accounts for the vast majority. Non-small cell lung cancer can be divided into squamous cell carcinoma, adenocarcinoma, and large cell lung cancer and other types.[ LUAD is a primary epithelial tumor of the lung, which mostly originates from the bronchial mucosal epithelium or alveolar epithelium. In recent years, the incidence of LUAD has continued to rise and has now become the most common type of lung cancer worldwide.[ Unlike lung squamous cell carcinoma, which mostly manifests as central lung cancer, most LUADs occur in the peripheral parts of the lung, and there are no obvious clinical symptoms in the early stage, leading to a poor prognosis. Timely detection and operation can effectively improve patients with the LUAD survival rate.[ The SMAD signaling pathway is a crucial pathway for the TGF-β transcription factor family to regulate cell proliferation, differentiation, metabolic migration, localization, and apoptosis.[ As an essential family of transcription factors, studying SMAD family genes can better allow us to understand the occurrence and metastasis of cancer, and thus develop new therapeutic approaches. Although many pathways have been reported, the pathways in some cancers have not been elucidated. In this study, GTEx, TCGA, and GEO databases were used to analyze the expression of SMAD family in different cancers, and systematically to compare the mRNA expression differences of SMAD family genes in normal tissues and cancer tissues. At the same time, based on the screening results, the expression profile of the SMAD family in LUAD was systematically revealed. These results show that the SMAD family plays an essential role in the development of LUAD. So far, there have been many studies on SMAD1 expression. In hepatocellular carcinoma, the expression of SMAD1/SMAD5/SMAD8 is reduced compared to normal tissues, but they are not potential biomarkers for hepatocellular carcinoma.[ In prostate cancer, the expression of SMAD1 is increased.[ However, there are no reports of SMAD1 on cancer prognosis. Similarly, SMAD2 is also involved in the regulation of many cancers. According to a study in 2018, overexpression of ATF4 can affect the survival rate of triple-negative breast cancer patients through SMAD2.[ Moreover, there is strong evidence that silencing SMAD2 can inhibit TGF-β function.[ However, a previous study showed that the absence of SMAD2 could lead to reduced differentiation and increased EMT levels, leading to tumor metastasis.[ At the same time, in patients with non-small cell lung cancer, high expression of p-SMAD2 is predictive of poor clinical survival.[ Like SMAD2, SMAD3 is considered a tumor suppressor. In lung cancer, the deletion of SMAD3 can inhibit the pathway of tumor growth through TGF-β, thereby promoting tumorigenicity.[ Moreover, in lung cancer, the expression of SMAD3 increased. Although the number of samples studied in Oncomine was insufficient, it was consistent with our findings. It shows that our research has potential value.[ Multiple reports have shown that SMAD4 is a tumor suppressor gene. For example, in pancreatic cancer, SMAD4 expression is reduced. The same is true for colorectal cancer.[ Moreover, special studies have shown that high expression of SMAD4 can prolong the survival time of patients. Although SMAD4 cannot be used as an independent prognostic factor, it can be used as an independent prognostic factor in combination with p-SMAD2.[ For SMAD5, it has been reported that miR-145 can promote the migration and migration of esophageal cancer cells by inhibiting theexpression of SMAD5.[ Fstl1 can promote glioma growth through SMAD1/SMAD5/SMAD8.[ At the same time, SMAD5 expression increases in prostate cancer, which is closely related to postoperative survival.[ For the SMAD6 gene, high expression of it can promote the development of glioma.[ Similarly, high expression of SMAD7 can lead to a poor prognosis for acute myeloid leukemia.[ Unfortunately, the high expression of SMAD9 increases the EMT risk of oral squamous cell carcinoma.[ These findings reveal that the SMAD family plays an important role in the development of cancer. In this study, SMAD7 and SMAD9 were significantly reduced in lung cancer. Moreover, the expression of SMAD7 and SMAD9 is significantly correlated with the prognosis survival of patients. Through enrichment analysis, it was found that SMAD7 and SMAD9 mainly regulate the occurrence and development of lung cancer through the regulation of cellular response and GNRH signaling pathway pathways. At the same time, SMAD7 can be used as an independent prognostic factor to provide earlier detection and use of new treatments for lung cancer. Taken together, these results suggest that SMAD7 and SMAD9 may be markers and new therapeutic targets for LUAD. Correspondingly, these results need to be further verified by specific experiments.

Author contributions

Xing-Hua Liao designed research; Xing-Hua Liao, Yuan Xiang, and Zhou-Tong Dai performed research, Jia Peng Li, Hui-Min Zhang, Zhou-Tong Dai, and Jun Wang analyzed data; and Xing-Hua Liao, Hui-Min Zhang wrote the paper. Kai Zhao and Zhou-Tong Dai. participated in the revision process of the manuscript. All authors read and approved the final manuscript. Data curation: Zhou-Tong Dai, Jun Wang. Formal analysis: Yuan Xiang. Funding acquisition: Xing-Hua Liao. Investigation: Zhou-Tong Dai, Yuan Xiang. Methodology: Zhou-Tong Dai, Yuan Xiang. Project administration: Xing-Hua Liao. Resources: Jia Peng Li. Software: Jun Wang, Hui-Min Zhang. Supervision: Jia Peng Li. Validation: Jia Peng Li. Visualization: Jia Peng Li, Xing-Hua Liao. Writing – original draft: Zhou-Tong Dai. Writing – review & editing: Zhou-Tong Dai, Kai Zhao, Hui-Min Zhang, Zi Tan Peng.
  43 in total

1.  MicroRNA-145 promotes esophageal cancer cells proliferation and metastasis by targeting SMAD5.

Authors:  Qiong Zhang; Huaiyong Gan; Wenqing Song; Damin Chai; Shiwu Wu
Journal:  Scand J Gastroenterol       Date:  2018-05-31       Impact factor: 2.423

2.  Reduced Expression of SMAD4 Is Associated with Poor Survival in Colon Cancer.

Authors:  Pu Yan; Dirk Klingbiel; Zenia Saridaki; Paola Ceppa; Monica Curto; Thomas Alexander McKee; Arnaud Roth; Sabine Tejpar; Mauro Delorenzi; Fredrik T Bosman; Roberto Fiocca
Journal:  Clin Cancer Res       Date:  2016-02-09       Impact factor: 12.531

3.  Activating Transcription Factor 4 Modulates TGFβ-Induced Aggressiveness in Triple-Negative Breast Cancer via SMAD2/3/4 and mTORC2 Signaling.

Authors:  Adrián González-González; Esperanza Muñoz-Muela; Juan A Marchal; Francisca E Cara; Maria P Molina; Marina Cruz-Lozano; Gema Jiménez; Akanksha Verma; Alberto Ramírez; Wei Qian; Wen Chen; Anthony J Kozielski; Olivier Elemento; María D Martín-Salvago; Rafael J Luque; Carmen Rosa-Garrido; David Landeira; María Quintana-Romero; Roberto R Rosato; Maria A García; Cesar L Ramirez-Tortosa; Hanna Kim; Cristian Rodriguez-Aguayo; Gabriel Lopez-Berestein; Anil K Sood; Jose A Lorente; Pedro Sánchez-Rovira; Jenny C Chang; Sergio Granados-Principal
Journal:  Clin Cancer Res       Date:  2018-07-16       Impact factor: 12.531

4.  PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes.

Authors:  Vamsi K Mootha; Cecilia M Lindgren; Karl-Fredrik Eriksson; Aravind Subramanian; Smita Sihag; Joseph Lehar; Pere Puigserver; Emma Carlsson; Martin Ridderstråle; Esa Laurila; Nicholas Houstis; Mark J Daly; Nick Patterson; Jill P Mesirov; Todd R Golub; Pablo Tamayo; Bruce Spiegelman; Eric S Lander; Joel N Hirschhorn; David Altshuler; Leif C Groop
Journal:  Nat Genet       Date:  2003-07       Impact factor: 38.330

5.  NCBI GEO: archive for functional genomics data sets--update.

Authors:  Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

6.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

7.  Smad1 promotes colorectal cancer cell migration through Ajuba transactivation.

Authors:  Daming Yang; Tieying Hou; Lei Li; Yimin Chu; Fengli Zhou; Ying Xu; Xinyu Hou; Huan Song; Kai Zhu; Zhaoyuan Hou; Haixia Peng; Hao Jia
Journal:  Oncotarget       Date:  2017-11-30

8.  Hepatic SMARCA4 predicts HCC recurrence and promotes tumour cell proliferation by regulating SMAD6 expression.

Authors:  Zhiao Chen; Xinyuan Lu; Deshui Jia; Ying Jing; Di Chen; Qifeng Wang; Fangyu Zhao; Jinjun Li; Ming Yao; Wenming Cong; Xianghuo He
Journal:  Cell Death Dis       Date:  2018-01-19       Impact factor: 8.469

9.  Distinct lung cancer subtypes associate to distinct drivers of tumor progression.

Authors:  Valeria Relli; Marco Trerotola; Emanuela Guerra; Saverio Alberti
Journal:  Oncotarget       Date:  2018-10-30

10.  MiR-199a-3p suppresses proliferation and invasion of prostate cancer cells by targeting Smad1.

Authors:  Feng Qu; Jinyu Zheng; Weidong Gan; Huibo Lian; Hua He; Wuping Li; Tian Yuan; Yaling Yang; Xiaogong Li; Changwei Ji; Xiang Yan; Linfeng Xu; Hongqian Guo
Journal:  Oncotarget       Date:  2017-04-18
View more
  2 in total

1.  A 4-Gene Signature Associated With Recurrence in Low- and Intermediate-Risk Endometrial Cancer.

Authors:  Diocésio Alves Pinto de Andrade; Luciane Sussuchi da Silva; Ana Carolina Laus; Marcos Alves de Lima; Gustavo Nóriz Berardinelli; Vinicius Duval da Silva; Graziela de Macedo Matsushita; Murilo Bonatelli; Aline Larissa Virginio da Silva; Adriane Feijó Evangelista; Jesus Paula Carvalho; Rui Manuel Reis; Ricardo Dos Reis
Journal:  Front Oncol       Date:  2021-08-17       Impact factor: 6.244

2.  Signature based on RNA-binding protein-related genes for predicting prognosis and guiding therapy in non-small cell lung cancer.

Authors:  Ti-Wei Miao; Fang-Ying Chen; Long-Yi Du; Wei Xiao; Juan-Juan Fu
Journal:  Front Genet       Date:  2022-09-02       Impact factor: 4.772

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.