Junxiu Sheng1,2, Jinyao Zhao2, Qiuhong Xu2, Linlin Wang2, Wenjing Zhang2, Yang Zhang1. 1. Department of Oncology, The Second Affiliated Hospital of Dalian Medical University, Dalian, Liaoning 116023, P.R. China. 2. Institute of Cancer Stem Cell, Cancer Center, Dalian Medical University, Dalian, Liaoning 116044, P.R. China.
Abstract
Colorectal cancer is the third most common type of cancer and the fourth leading cause of cancer-associated mortality worldwide. Serine/arginine-rich splicing factor 1 (SRSF1) is a well-characterized oncogenic factor that promotes tumorigenesis by controlling a number of alternative splicing events. However, there is limited network analysis, from a global aspect, to study the effect of SRSF1 on colorectal cancer. In the present study, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of available gene regulation data from The Cancer Genome Atlas database revealed the enriched functions and signaling pathways of SRSF1. Subsequently, Oncomine analysis was performed, which demonstrated that SRSF1 was upregulated in a number of types of colon cancer. From overlapping the analysis of 2,678 SRSF1-related genes and 3,625 colorectal cancer genes in GeneCards, 468 genes were identified as SRSF1-related colorectal cancer genes. The GO results revealed that these overlapped genes were primarily enriched in metabolic processes, response to DNA damage, regulation of the cell cycle and a number of additional biological processes. KEGG pathway analysis revealed that SRSF1-related colorectal cancer genes were associated with the cell cycle, deregulated signaling pathways associated with cancer progression and colorectal cancer signaling pathways. In addition, the Search Tool for the Retrieval of Interacting Genes/Proteins database and Cytoscape analysis demonstrated that 468 SRSF1-related colorectal cancer genes exhibit potential interaction networks in which these genes were enriched in DNA metabolic processes, cell cycle regulation and regulation of apoptosis. The results of the present study suggested that SRSF1 exhibited an increased degree of interaction with key molecules, including NUF2 NDC80 kinetochore complex component, kinesin family member 2C, structural maintenance of chromosomes 3, ATM serine/threonine kinase, BRCA1 DNA repair associated, protein kinase DNA-activated catalytic polypeptide, heat shock protein 90 alpha family class A member 1, ras homolog family member A, and phosphatase and tensin homolog. Collectively, the bioinformatics analysis of the present study indicated that SRSF1 may have key functions in the progression and development of colorectal cancer.
Colorectal cancer is the third most common type of cancer and the fourth leading cause of cancer-associated mortality worldwide. Serine/arginine-rich splicing factor 1 (SRSF1) is a well-characterized oncogenic factor that promotes tumorigenesis by controlling a number of alternative splicing events. However, there is limited network analysis, from a global aspect, to study the effect of SRSF1 on colorectal cancer. In the present study, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of available gene regulation data from The Cancer Genome Atlas database revealed the enriched functions and signaling pathways of SRSF1. Subsequently, Oncomine analysis was performed, which demonstrated that SRSF1 was upregulated in a number of types of colon cancer. From overlapping the analysis of 2,678 SRSF1-related genes and 3,625 colorectal cancer genes in GeneCards, 468 genes were identified as SRSF1-related colorectal cancer genes. The GO results revealed that these overlapped genes were primarily enriched in metabolic processes, response to DNA damage, regulation of the cell cycle and a number of additional biological processes. KEGG pathway analysis revealed that SRSF1-related colorectal cancer genes were associated with the cell cycle, deregulated signaling pathways associated with cancer progression and colorectal cancer signaling pathways. In addition, the Search Tool for the Retrieval of Interacting Genes/Proteins database and Cytoscape analysis demonstrated that 468 SRSF1-related colorectal cancer genes exhibit potential interaction networks in which these genes were enriched in DNA metabolic processes, cell cycle regulation and regulation of apoptosis. The results of the present study suggested that SRSF1 exhibited an increased degree of interaction with key molecules, including NUF2NDC80 kinetochore complex component, kinesin family member 2C, structural maintenance of chromosomes 3, ATMserine/threonine kinase, BRCA1 DNA repair associated, protein kinase DNA-activated catalytic polypeptide, heat shock protein 90 alpha family class A member 1, ras homolog family member A, and phosphatase and tensin homolog. Collectively, the bioinformatics analysis of the present study indicated that SRSF1 may have key functions in the progression and development of colorectal cancer.
Serine/arginine splicing factor 1 (SRSF1, also known as SF2/ASF) is a member of the SR protein family of splicing regulators. SRSF1 regulates alternative splicing through recognizing and binding the exonic splicing enhancers (ESEs) (1). In addition, SRSF1 is involved in the regulation of mRNA stability (2), mRNA export (3), nonsense-mediated mRNA decay (4), translation (5–7) and microRNA processing (8). It has been demonstrated that SRSF1 is a well-characterized oncogenic factor that promotes tumorigenesis by controlling a number of alternative splicing (AS) events (9). A previous study has identified that SRSF1 promotes the skipping of RON proto-oncogene (RON) exon 11 to generate RONΔ11, which stimulates cell migration and invasion (10). The expression level of SRSF1 is upregulated in humanbreast cancer, and SRSF1 overexpression promotes the transformation of mammary cells by regulating a number of AS events (11). Additionally, the overexpression of SRSF1 has been identified in colon, thyroid, small intestine, kidney, lung, liver and pancreas tumors (12).Colorectal cancer, as the third most common type of cancer and the fourth leading cause of cancer-associated mortality, affects >1,000,000 people and causes >500,000 mortalities every year worldwide (13). Histological types of colon cancer include adenocarcinoma, scirrhous tumors and neuroendocrine (14).Treatments for patients with colon cancer include combinations of surgery, radiation therapy, chemotherapy and targeted therapy (13). Cancer of the colon may be cured with surgery when localized to the bowel; however, cancer that has spread or cancer recurrence following surgery are typically incurable (13). Thus, the development of effective biomarkers and specific therapeutic targets is required.SRSF1 serves a key function in the tumorigenesis of lung and breast cancer (9,11,15,16), but there is limited information demonstrating the association between SRSF1 and colorectal cancer. SRSF1 may regulate alternative splicing of tumor-related Ras-related C3 botulinum toxin substrate 1b (Rac1b) by promoting the inclusion of alternative exon 3b in colorectal cells (17). Furthermore, alternative splicing of solute carrier family member 39 member 14 in colorectal cancer was identified to be regulated in the Wnt signaling pathway, and this was hypothesized to occur via the regulation of SRSF protein kinase 1 and SRSF1 (18). However, there is a limited amount of network analysis from a global aspect to study the effect of SRSF1 on colorectal cancer.In the present study, the enriched functions and signaling pathways of SRSF1 were investigated using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of available gene regulation data. Additionally, Oncomine analysis was performed, which revealed that SRSF1 was upregulated in a number of types of colon cancer. Furthermore, the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database and Cytoscape analysis demonstrated that SRSF1-related colorectal cancer genes have potential interaction networks in which these genes were enriched in DNA metabolic processes, cell cycle regulation and regulation of apoptosis.
Materials and methods
The expression levels of SRSF1 in a number of types of colon cancer were obtained from the Oncomine dataset (https://www.oncomine.org/resource/login.html). The immunohistochemical staining results of SRSF1 in colorectal cancer and normal tissues were selected from the Human Protein Atlas (http://www.proteinatlas.org). cBioPortal (http://www.cbioportal.org/) was applied to identify the SRSF1 co-expressed genes. Briefly, the threshold selected was the absolute value of Pearson correlation coefficient >0.4. Furthermore, Venny 2.1 (BioinfoGP, CNB-CSIC, http://bioinfogp.cnb.csic.es/tools/venny/index.html) was utilized to overlap the identified gene set with a colon cancer-related gene list, screened using GeneCards. The resulting overlapped genes were defined as SRSF1-related genes in colon cancer. The GO and KEGG analysis were conducted with all genes on Database for Annotation, Visualization and Integrated Discovery platform (http://david.abcc.ncifcrf.gov/summary.jsp). Fisher exact P-values were calculated using SPSS 19.0 (IBM Corp., Armonk, NY, USA) for each enriched functional category and pathway, and enriched functional categories and pathways with P<0.05 were presented. The online tool STRING (http://string-db.org) was used to identify the potential interaction networks of protein products of these genes. The Cytoscape 2.8.3 software (Institute of Systems Biology, Seattle, WA, USA) was further utilized to construct interaction networks and sub-networks among these SRSF1-associated gene products in colon cancer. The gene regulation networks and protein interactions were visualized by using Cytoscape 2.8.3 software.
Results
Network of SRSF1-related genes
The co-expression network of SRSF1-regulated genes, from the colorectal cancer gene expression profiling database, was established using cBioPortal by setting the Pearson correlation efficient at >0.4. A total of 2,678 genes were identified to be SRSF1-related. Protein-protein interaction networks of 913/2,678 genes were constructed using the STRING database analysis (Fig. 1).
Figure 1.
Interaction network of SRSF1-related genes. The interactions of 2,678 SRSF1-related genes were analyzed using the Search Tool for the Retrieval of Interacting Genes/Proteins database. The resulting complicated multicentric interaction network contained 913 genes. SRSF1, serine/arginine-rich splicing factor 1.
GO analysis for the SRSF1-related proteins revealed that SRSF1-regulated cancers genes were markedly enriched in a number of functions, including regulation of the cell cycle, DNA metabolic processes, RNA processing and DNA replication (Fig. 2A). The results indicated that SRSF1 is involved in cancer cell proliferation. In addition, KEGG pathway analysis revealed that the SRSF1-related genes were markedly enriched in cell cycle, spliceosome, ubiquitin-mediated proteolysis, RNA degradation, nucleotide excision repair, DNA replication and p53 signaling pathway (Fig. 2B). Notably, SRSF1-regulated genes participated in the progression of colorectal cancer, suggesting that SRSF1 may affect colorectal cancer development.
Figure 2.
Analysis of SRSF1-related genes. (A) Gene Ontology analysis (biological process) of 913 SRSF1-related genes is presented. Fisher exact P-values were calculated for each enriched functional category. Enriched functional categories with P<0.05 are presented. (B) Kyoto Encyclopedia of Genes and Genomes pathway of SRSF1-related genes. SRSF1, serine/arginine-rich splicing factor 1; TGF-β, transforming growth factor β.
SRSF1 serves a key function in colorectal cancer progression
The association between SRSF1 and colorectal cancer was further analyzed. Using Oncomine analysis, the expression level of SRSF1 was determined in colon cancer. It was identified that SRSF1 was upregulated in a number of types of colon cancer, including cecum adenocarcinoma, colon adenocarcinoma and colon mucinous adenocarcinoma (Fig. 3A). In addition, the immunohistochemistry staining results of SRSF1, obtained from the Human Protein Atlas database (http://www.proteinatlas.org) (19), were analyzed, which revealed that SRSF1 was upregulated in colorectal cancer, compared with normal tissues (Fig. 3B) (http://www.proteinatlas.org/ENSG00000136450-SRSF1/cancer/tissue/colorectal+cancer). Taken together, SRSF1 may be involved in colorectal cancer progression.
Figure 3.
Expression level of SRSF1 in a number of types of colon cancer. (A) The mRNA expression level results of SRSF1 in a number of types of colon cancer were obtained from the Oncomine dataset. The median, upper and lower quartiles were plotted, and the whiskers indicate the data range. The points that are >1.5× the interquartile range are marked outliers. (B) Representative immunohistochemical staining of SRSF1 (HPA061301, Sigma-Aldrich; Merck KGaA, Darmstadt, Germany) in two samples of colorectal cancer and normal tissues obtained from Human Protein Atlas. SRSF1, serine/arginine-rich splicing factor 1.
Overlapping analysis of 2,678 SRSF1-related genes and 3,625 genes defined as colorectal cancer genes in GeneCards identified 468 genes as SRSF1-related colorectal cancer genes (Fig. 4A). Subsequently, GO and KEGG pathway analysis was performed to classify the 468 overlapped genes. The GO results demonstrated that the overlapped genes were enriched in a number of biological processes, including metabolic processes, response to DNA damage and cell cycle regulation (Fig. 4B). Furthermore, KEGG pathway analysis revealed that SRSF1-related colorectal cancer genes were involved in the regulation of the cell cycle, deregulated signaling pathways associated with cancer progression and the colorectal cancer signaling pathway (Fig. 4C).
Figure 4.
Interaction network of SRSF1-related genes and colon cancer associated genes. (A) Overlapping analysis of 2,678 SRSF1-related genes (identified using The Cancer Genome Atlas) and 3,625 colon cancer genes (identified using GeneCards). (B) Gene Ontology analysis (biological process) of 468 SRSF1-related colon cancer genes. Fisher exact P-values were plotted for each enriched functional category. Enriched functional categories with P<0.05 were shown. (C) Kyoto Encyclopedia of Genes and Genomes pathway of SRSF1-related colon cancer genes. Fisher exact P-values were plotted for each enriched pathway. Enriched functional pathways with P<0.05 were shown. SRSF1, serine/arginine-rich splicing factor 1; ErbB, Epidermal growth factor receptor; TGF, transforming growth factor; NOD, nucleotide-binding oligomerization domain; MAPK, mitogen-activated protein kinase; mTOR, mechanistic target of rapamycin; VEGF, vascular endothelial growth factor.
Networks of SRSF1-related colorectal cancer genes
The STRING database and Cytoscape analysis was conducted on SRSF1-related colorectal cancer genes to reveal potential interaction networks. GO demonstrated that SRSF1-associated colorectal cancer genes were enriched in the functions of DNA metabolic processes, cell cycle regulation and regulation of apoptosis (Fig. 5).
Figure 5.
Interactions of SRSF1-related colon cancer genes were analyzed using the Search Tool for the Retrieval of Interacting Genes/Proteins database and Cytoscape. SRSF1, serine/arginine-rich splicing factor 1.
A sub-network of SRSF1-related colorectal cancer genes was identified to be enriched in cell cycle regulation. NUF2NDC80 kinetochore complex component (NUF2), kinesin family member 2c (KIF2C), structural maintenance of chromosome 3 (SMC3), MAD2 mitotic arrest deficient-like 1 (MAD2L1) and ZWILCH kinetochore protein (ZWILCH) genes were identified to exhibit a high degree of interaction in colorectal cancer (Fig. 6A). In the sub-network of SRSF1-related colorectal cancer genes that were involved in DNA metabolic processes, ATMserine/threonine kinase (ATM), BRCA1 DNA repair associated (BRCA1), protein kinase DNA activated catalytic polypeptide (PRKDC), proliferating cancer nuclear antigen (PCNA), topoisomerase (DNA) 1 (TOP1) and MRE11 homolog double strain break repair nuclease (MRE11A) genes revealed a high connectivity value (Fig. 6B). Furthermore, heat shock protein 90 alpha family class A member 1 (HSP90AA1), Ras homolog family member A (RHOA), phosphatase and tensin homolog (PTEN) and ribosomal protein S6 kinase B1 (RPS6KB1) exhibited a high degree of interaction in another sub-network that regulated apoptosis (Fig. 6C).
Figure 6.
Sub-networks of SRSF1-related colon cancer genes. (A) Network of SRSF1-regulated genes involved in cell cycle regulation. (B) Network of SRSF1-regulated cancer-associated genes that serve roles in DNA metabolic processes. (C) Network of SRSF1-regulated genes involved in apoptosis regulation. SRSF1, serine/arginine-rich splicing factor 1.
Discussion
Serine/Arginine-rich (SR) proteins are a family of RNA-binding proteins with a serine/arginine rich domain. SR proteins regulate alternative splicing of a number of genes that are involved in cell cycle regulation, cell proliferation, apoptosis, epithelial-mesenchymal transition and drug resistance (10,12,20–22). As a member of SR proteins, SRSF1 (SF2/ASF) functions as a proto-oncogene that is generally upregulated in various types of cancer (12,23). In addition, SRSF1 may promote tumorigenesis by controlling the alternative splicing of a number of cancer-related genes, including RPS6KB1 the tumor suppressor bridging integrator 1, the tyrosine kinase receptor for macrophage-stimulating protein, cyclin D1, and vascular endothelial growth factor (24,25).In the present study, bioinformatics analysis was performed systematically to determine the effect of SRSF1 on colorectal cancer. Gene expression data were obtained from The Cancer Genome Atlas database. Subsequently, potential SRSF1-regulated genes in colorectal cancer were obtained using cBioPortal online tools, followed by GO, KEGG and protein interactome analyses. Oncomine analysis was performed which revealed that SRSF1 is upregulated in a number of types of colon cancer. A total of 468 genes were identified as SRSF1-related colorectal cancer genes by the overlapping analysis of 2,678 SRSF1-related genes and 3,625 colorectal cancer genes (identified using GeneCards). The GO results suggested that these SRSF1-associated colorectal cancer genes were primarily enriched in metabolic processes, response to DNA damage and regulation of the cell cycle. In addition, KEGG pathway analysis revealed that SRSF1-related colorectal cancer genes were closely associated with cell cycle, deregulated signaling pathways associated with cancer progression and colorectal cancer signaling pathways. STRING database and Cytoscape analysis demonstrated that 468 SRSF1-related colorectal cancer genes exhibited potential interaction networks with functions of DNA metabolic process, cell cycle regulation and regulation of apoptosis. Notably, a panel of genes (NUF2, KIF2C, SMC3, MAD2L1 and ZWILCH) responsible for cell cycle regulation were identified to be highly associated with the expression level of SRSF1. NUF2 is a component of NDC80 kinetochore complex and is known to be critical for stable spindle microtubule-kinetochore attachment (26). The expression level of NUF2 has been identified to be upregulated and associated with poor prognosis in patients with colorectal cancer (27). Previous studies have revealed that NUF2 serves a function in a number of types of humancancer including lung, colorectal, gastric, prostate, urinary bladder, renal carcinoma and ovarian cancer (28–33). KIF2C (Kinesin family member 2C, also known as the mitotic centromere-associated kinesin, MACK) is the best characterized member of the kinesin-13 family, which is critical in the regulation of microtubule dynamics. KIF2C has been identified to be aberrantly regulated in cancer cells, which is associated with increased malignance, invasiveness, metastasis, drug resistance, chromosomal instability and remodeling of the microtubule cytoskeleton (34,35).ATM, BRCA1, PRKDC, PCNA, TOP1 and MRE11A genes, involved in DNA metabolic processes, exhibit a high degree of interaction with SRSF1. ATM is an important cell cycle checkpoint kinase and a key regulator of the DNA double-strand-break response (36). ATM may regulate a number of downstream proteins, including p53, BRCA1, checkpoint kinase 2, checkpoint clamp loader component RAD17, cell cycle checkpoint control protein RAD9 and Nibrin. ATM and serine/threonine-protein kinase ATR, which are hypothesized to be regulators of cell cycle checkpoint signaling pathways (36).Genes that regulate apoptosis, including HSP90AA1, RHOA, PTEN and RPS6KB1, were highly associated with the expression level of SRSF1. RHOA is a member of Rho GTPases, which are involved in major aspects of cancer development, including cell proliferation, apoptosis, cell polarity, adhesion, migration and invasion (37). Overexpression of RHOA is associated with invasion and poor prognosis in colorectal cancer (38).The results of the present study, which was conducted using bioinformatics analysis, indicated that SRSF1 may have key functions in the progression and development of colorectal cancer by interaction networks. Additional studies may verify this potential regulatory network by performing molecular biology experiments.
Authors: Mathias Uhlén; Linn Fagerberg; Björn M Hallström; Cecilia Lindskog; Per Oksvold; Adil Mardinoglu; Åsa Sivertsson; Caroline Kampf; Evelina Sjöstedt; Anna Asplund; IngMarie Olsson; Karolina Edlund; Emma Lundberg; Sanjay Navani; Cristina Al-Khalili Szigyarto; Jacob Odeberg; Dijana Djureinovic; Jenny Ottosson Takanen; Sophia Hober; Tove Alm; Per-Henrik Edqvist; Holger Berling; Hanna Tegel; Jan Mulder; Johan Rockberg; Peter Nilsson; Jochen M Schwenk; Marica Hamsten; Kalle von Feilitzen; Mattias Forsberg; Lukas Persson; Fredric Johansson; Martin Zwahlen; Gunnar von Heijne; Jens Nielsen; Fredrik Pontén Journal: Science Date: 2015-01-23 Impact factor: 47.728
Authors: Olga Anczuków; Martin Akerman; Antoine Cléry; Jie Wu; Chen Shen; Nitin H Shirole; Amanda Raimer; Shuying Sun; Mads A Jensen; Yimin Hua; Frédéric H-T Allain; Adrian R Krainer Journal: Mol Cell Date: 2015-10-01 Impact factor: 17.970
Authors: Kasper Thorsen; Francisco Mansilla; Troels Schepeler; Bodil Øster; Mads H Rasmussen; Lars Dyrskjøt; Rotem Karni; Martin Akerman; Adrian R Krainer; Søren Laurberg; Claus L Andersen; Torben F Ørntoft Journal: Mol Cell Proteomics Date: 2010-10-11 Impact factor: 5.911
Authors: Rotem Karni; Elisa de Stanchina; Scott W Lowe; Rahul Sinha; David Mu; Adrian R Krainer Journal: Nat Struct Mol Biol Date: 2007-02-18 Impact factor: 15.369
Authors: Geetika Sethi; Harsh B Pathak; Hong Zhang; Yan Zhou; Margret B Einarson; Vinod Vathipadiekal; Sumedha Gunewardena; Michael J Birrer; Andrew K Godwin Journal: PLoS One Date: 2012-10-09 Impact factor: 3.240
Authors: Olga Anczuków; Avi Z Rosenberg; Martin Akerman; Shipra Das; Lixing Zhan; Rotem Karni; Senthil K Muthuswamy; Adrian R Krainer Journal: Nat Struct Mol Biol Date: 2012-01-15 Impact factor: 15.369
Authors: Joanna R Sinnakannu; Kian Leong Lee; Shanshan Cheng; Jia Li; Mengge Yu; Siew Peng Tan; Clara Chong Hui Ong; Huihua Li; Hein Than; Olga Anczuków-Camarda; Adrian R Krainer; Xavier Roca; Steven G Rozen; Jabed Iqbal; Henry Yang; Charles Chuah; Sin Tiong Ong Journal: Leukemia Date: 2020-02-12 Impact factor: 11.528