Zhenqing Sun1, Wei Xia2, Yali Lyu3, Yanan Song2, Min Wang3, Ruirui Zhang3, Guode Sui4, Zhenlu Li4, Li Song4, Changliang Wu4, Choong-Chin Liew5,6,7, Lei Yu3, Guang Cheng1, Changming Cheng3. 1. Department of Gastrointestinal Surgery, The Affiliated Hospital of Qingdao University, Qingdao, Shandong 266000, P.R. China. 2. Department of Nuclear Medicine, The Seventh People's Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai 200137, P.R. China. 3. R&D Department, Huaxia Bangfu Technology Incorporated, Beijing 100000, P.R. China. 4. Department of General Surgery, The Affiliated Hospital of Qingdao University, Qingdao, Shandong 266000, P.R. China. 5. Golden Health Diagnostics Inc., Yan Cheng, Jiangsu 224000, P.R. China. 6. Department of Clinical Pathology and Laboratory Medicine, University of Toronto, Toronto, ON M5S 1A8, Canada. 7. Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA.
Cancer development and progression is recognized as a multi-step process involving the disruption of the immune-mediated homeostatic balance that characterizes healthy tissues (1). In homeostasis, the resident immune system cells act like sentinels to safeguard tissue and organ integrity (2). However, the immune system and inflammation also serve a role in tumorigenesis (3). This was first documented in the 19th century when Dr. Rudolf Virchow observed the presence of leukocytes within tumors, whose function has since been elucidated (4,5). During oncogenesis, the immune system serves a multi-faceted role in regulating cancer development from pathogenesis to treatment. Although the immune system can suppress factors involved in the initiation and progression of cancer, immune cells can also promote proliferation, infiltration and metastasis of cancer (6). Different immune responses and cell types are involved in the formation of the tumor microenvironment, including macrophages, neutrophils, mast cells, myeloid derived suppressor cells, dendritic cells, natural killer cells of the innate immune response, and the T and B lymphocytes of the adaptive immune response (7).A number of studies have suggested that immune cells serve a crucial role in regulating colorectal cancer (CRC) tumorigenesis. CRC involves multiple strategies to evade and suppress immune system processes, including immunosurveillance, immunoediting, antitumor immune response and conditioning of the tumor microenvironment (8–10). Immune cells have also been identified as prognostic indicators in CRC (9). For example, CRC is characterized by numerous protumorigenic inflammatory responses that selectively inhibit antitumor immune responses and promote tumor development (11). By selectively inhibiting the activation of antitumor cells, such as tumor infiltrating lymphocytes (TIL), and activating suppressor T cells, such as myeloid-derived suppressor cells and regulatory T cells, immune cells lead to immune evasion in CRC, affecting its progression (12–14). Given their important role in pathogenesis and clinical outcome, immune response cells are regarded as an independent predictor for CRC recurrence and outcome (15). An ‘immune score’ (16) based on TIL location is, for example, used to assess disease free and overall survival (OS), as well as the risk of relapse and metastasis in CRC (17).Immunotherapy involves the use of components of the immune system to treat patients with cancer (11,14,18,19). The main immunotherapy strategies include: Cancer vaccines and immune stimulatory cytokines, which augment the antitumor immune response; and the use of checkpoint inhibitors, such as the anticytotoxic T lymphocyte-associated antigen 4 antibody, to inhibit immune response suppression (20). Immunotherapy has been investigated previously in CRC (21); however, further investigation is required to elucidate the association between immune factors and CRC.Peripheral blood is the main component of human physiological homeostasis. Blood connects the entire biological system, and immune cells in the blood constitute specific immunity, which is the third line of immune defense (22). Thus, blood cells recognize subtle changes occurring in the body in association with injury or disease, reflect integrated physiological responses to injury and induce specific gene expression alterations (9,23). For these reasons, according to the Sentinel Principle (24) peripheral blood transcriptome profiling dynamically reflects system-wide biology (25,26). Peripheral blood transcriptome technology has been applied in the diagnosis of various non-hematological disorders, including various types of cancer (27–36).In the present study, the peripheral blood transcriptome derived from patients with CRC was analyzed in order to develop immune-related gene expression profiles, and to identify CRC-specific immune genes as potential CRC diagnostic tools. The biological functions of these genes were then characterized with the aim of identifying new immune response-related aspects of CRC pathogenesis, thereby investigating potential immunotherapy techniques for CRC.
Materials and methods
Ethics
The present study was approved by The Ethics Committees of the Affiliated Hospital of Qingdao University (Qingdao, China) and The Seventh People's Hospital of Shanghai University of Traditional Chinese Medicine (Shanghai, China). Sample acquisition was performed between October 2018 and August 2019 at The Affiliated Hospital of Qingdao University and The Seventh People's Hospital of Shanghai. A total of 121 participants were enrolled, including 59 patients with CRC and 62 healthy patients. Written informed consent was provided by all participants prior to the study start.
Study population
Blood samples from 59 patients with CRC were collected before they had undergone any form of treatment, including radio- and chemotherapy or surgery. The patients were selected from 74 volunteers who donated blood before routine colonoscopy and who were subsequently diagnosed with CRC after pathological examination. The pathologists were independent and not involved in the present study. Healthy control samples consisted of 62 blood samples from subjects with no pathology. Tables I and II present the patient demographics and clinical characteristics.
Table I.
Demographics of healthy controls and patients with colorectal cancer.
Patient demographics
Healthy controls
Patients with colorectal cancer
P-value
Sex, n
Male
38
38
Female
24
21
Age, years
Min
42
28
Max
76
89
Mean ± standard deviation
51.6±5.5
63.0±11.4
1.39×10−10
Year groups, %
21–30
0.0
1.7
31–40
0.0
3.4
41–50
46.8
10.2
51–60
48.4
18.6
61–70
3.2
37.3
71–80
1.6
25.4
81–90
0.0
3.4
Total patients, n
62
59
Patients with CRC were significantly older than healthy controls with P-value of 1.39×10−10 according to analysis of variance.
Table II.
Clinicopathological characteristics of patients with colorectal cancer.
Characteristic
Patients, n
Tumor location
Left colon
14
Right colon
7
Rectum
34
Unknown
4
Tumor differentiation
Well
2
Well-moderate
1
Moderate
42
Moderate-Poor
4
Poor
4
Unknown
6
Pathological Tumor-Node-Metastasis stage
I
7
II
22
III
27
IV
1
Unknown
2
Basic and clinicopathological characteristics
A total of 121 blood samples were collected, including 62 healthy controls and 59 samples from patients with CRC. Patients with CRC were significantly older compared with the healthy controls (P<0.01) according to analysis of variance (F-test). The age of the controls ranged from 42–76 years, whereas the patients with CRC ranged from 28–89 years of age. Detailed information is presented in Table I.The clinicopathological characteristics of the patients with CRC are presented in Table II, including tumor location, differentiation and pathological Tumor-Mode-Metastasis (pTNM) stage (37). The majority of the CRC tissue was located in the rectum, followed by the left colon. The main tumor differentiation type was moderate, which accounted for ~71.2% (42/59). The pTNMs were mainly stages II and III (22/59 and 27/59).
Blood collection, RNA isolation and RNA quality control
Peripheral whole blood (2.5 ml) was collected in PaxGene Blood RNA tubes (PreAnalytiX GmbH; Qiagen). Total RNA was then isolated using the PaxGene Blood RNA kit (PreAnalytiX GmbH; Qiagen) following the manufacturer's protocol. RNA quality was assessed using a 2100 Bioanalyzer RNA 6000 Nano Chips (Agilent Technologies, Inc.), according to the manufacturer's protocol. All samples for microarray analysis met the following quality criteria: RNA Integrity number ≥7.0 and 28S:18S ribosomal RNA ≥1.0. RNA quantity was determined using a NanoDrop 1000 UV–Vis spectrophotometer (Thermo Fisher Scientific, Inc.).
Microarray hybridization
Whole blood gene expression profiles from the 121 blood samples (59 patients with CRC and 62 controls) was analyzed by microarray hybridization using the Gene Profiling Array cGMP U133 P2 (cat. no. 901411), in accordance with the manufacturer's protocol (Affymetrix; Thermo Fisher Scientific, Inc.). In brief, 200 ng of purified total RNA is transcribed into cDNA by reverse transcription, then labeled and hybridized against the microarray, according to the manufacturer's protocol. A total of 200 ng of each RNA sample was used for cDNA synthesis and hybridization using the accessory reagents of the Affymetrix microarray, according to the manufacturer's protocol. The gene expression profiles of the RNA samples were then processed using Affymetrix Expression Console software (version 1.4.1; Affymetrix; Thermo Fisher Scientific, Inc.) and normalized using the MAS5 normalization method (38) that uses a scaling factor to adjust the global trimmed mean signal intensity value to 500 for each array.
Microarray data and pre-processing
To identify candidate genes for CRC, probe sets were selected from 54,675 available sets in the Affymetrix Gene Profiling cGMP U133 P2 microarray. The following criteria were utilized: i) The probe sets could be detected reliably (‘present’ call) in all samples; and ii) the probe sets were present within the microarray quality control (MAQC) list for the Affymetrix U133 P2 microarray, as reported by the MAQC Consortium (39) Immune system-related genes downloaded from the Reactome database on December 10, 2019 (https://reactome.org/) were utilized to identify the relevant immune-related genes (40). All immune-related gene expression microarray data analyzed in the present study are included in Table SI and have been uploaded to the Gene Expression Omnibus; accession number GSE164191. The microarray data were log transformed to conform to a Gaussian distribution. The total data were divided into a training set and a test set in accordance with 7:3 proportional scales.
Identification of CRC-specific immune genes
To identify CRC-specific immune genes, feature selection techniques were used that do not alter the original representation of the variables but select an optimal subset from them. To select the candidate genes efficiently and rapidly from the vast set of gene expression signals, a one-way ANOVA F-value was calculated to find differentially expressed genes by comparing the expression of genes between the CRC patient group and the healthy volunteer group following post hoc Tukey's test to determine significant differences between the groups. Overall, 583 features were identified that were P<0.05 according to the F-test.This method considers each feature separately, thereby ignoring feature dependencies that may lead to poor classification performance (41). To correct for this, ElasticNet regression analysis (42) was used, which takes advantage of L1 and L2 regularization to select stronger features for CRC detection. Finally, five candidate genes with highest predictive accuracy were selected for the classification of CRC and healthy controls.
Model selection and performance evaluation
A logistic regression algorithm was used to construct a predictive model based on the five candidate genes as described in our previous report (24) To differentiate the CRC group from the healthy control group, area under the receiver operating characteristic curve (ROC AUC), sensitivity, specificity and accuracy were estimated in both the training and test groups.
Protein-protein networks and functional enrichment analyses
The proteins that interacted with the five candidate biomarkers were extracted from the STRING database (https://string-db.org/), with a confidence ≥0.7 (43). Gene-annotation enrichment analysis using the clusterProfiler R package (http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html; version 3.12) was performed on signature genes and their associated proteins (44) and the protein-protein interaction network with the final biomarkers is presented in Fig. 3.
Figure 3.
Protein-protein interaction network of the five candidate genes. Interaction map of five transcriptomic gene biomarkers (hexagons) and their interacting proteins, using the confidence cut-off ≥0.7. The node size represents the size of the degree.
Reactome pathways were identified with a strict cut-off of P<0.05 that was corrected using the Benjamini and Hochberg method (45) with a false discovery rate of <0.05. Finally, protein-protein interaction and gene networks were constructed, and the biomarkers were identified using Cytoscape software (https://cytoscape.org/) (46).
Kaplan-Meier (KM) survival analysis for candidate genes based on the log-rank test
A KM analysis was performed for the five candidate genes to characterize the association between gene expression and corresponding clinical outcome on the basis of mRNA datasets of 165 rectum adenocarcinoma included in KM plot database (http://www.kmplot.com/). With the mRNA dataset of rectum adenocarcinoma collected in the KM database, these 165 patients were divided into high and low expression groups by using each percentile of gene expression between the lower and upper quartiles of expression with auto-selected best cutoff point according to the previous reports (47,48). All possible cutoff values between the lower and upper quartiles were computed and the best performing threshold was used as a cutoff. The ‘survival’ R package v2.38 (http://CRAN.R-project.org/package=survival/) was utilized to calculate log-rank P values, hazard ratios (HR) and 95% confidence intervals (CI).
Statistical analysis
A one-way ANOVA test (F-test) with Tukey's test was performed to determine the age difference between the CRC patient group and the healthy control group. An F-test following a Tukey's test was performed to find the genes differentially expressed between the CRC patient group and the healthy control group. ElasticNet regression and logistic regression analysis were performed to identify the candidate biomarkers and construct the predictive model for CRC diagnosis by comparing expression profiles between CRC and healthy control groups.
Results
Peripheral blood gene expression profiling
Genome-wide gene expression profiling was applied to the peripheral blood samples obtained from the 59 patients with CRC and the 62 healthy controls. Genome-wide expression profiles generated from Affymetrix GeneChip U133Plus2.0 were analyzed and associated between the CRC and controls. The probe sets could be detected reliably (‘present’ call) in all the samples present within the MAQC list for the Affymetrix U133 P2 microarray, as reported by the MAQC Consortium, and were also included in the immune response-relevant transcriptome signatures. A final five immune-related genes were identified to reliably distinguish CRC from the controls, including: Protein phosphatase 3 regulatory subunit Bα (PPP3R1), amyloid β precursor protein (APP), cathepsin H (CTSH), proteasome activator subunit 4 (PSME4) and DEAD-Box Helicase 3 X-Linked (DDX3X). The corresponding gene symbols, titles of the final five probe sets and fold changes are listed in Table III.
Table III.
Immune-related signatures for distinguishing colorectal cancer from controls.
Probe Set ID
Gene title
Gene symbol
Fold change
Regulation
204506_at
Protein phosphatase 3 regulatory subunit Bα
PPP3R1
2.04
Up
214953_s_at
Amyloid β (A4) precursor protein
APP
1.84
Up
202295_s_at
Cathepsin H
CTSH
1.23
Up
212220_at
Proteasome activator subunit 4
PSME4
−1.41
Down
212514_x_at
DEAD (Asp-Glu-Ala-Asp) box helicase 3 X-linked
DDX3X
−1.45
Down
Model construction and performance evaluation
A predictive logistic regression model for discriminating CRC from controls was constructed based on the five immune-related candidate genes identified. Fig. 1 presents the hierarchical cluster diagrams that demonstrate the performance of the five genes for the 121 samples. The five-gene panel constructed by logistic regression in Fig. 1B more clearly shows the clustering of CRC samples compared with the control samples.
Figure 1.
Heat map of candidate genes and a hierarchical cluster diagram. (A) Performance of five immune-related biomarkers combined by the principle of hierarchical cluster. (B) Five-gene panel constructed by logistic regression for clustering the 121 samples, including 59 colorectal cancer samples and 62 healthy control samples. PPP3R1, protein phosphatase 3 regulatory subunit Bα; DDX3X, DEAD-Box Helicase 3 X-Linked; CTSH, cathepsin H; APP, amyloid β precursor protein; PSME4, proteasome activator subunit 4.
To build the predictive model, the data were divided into training and test sets in proportions of 7:3. The training set model contained a total of 84 samples, including 41 CRC samples and 43 control samples. The model performance was then evaluated in the test set, which contained 37 samples, including 18 CRC samples and 19 control samples. The performances of the training set and the test set are presented in Tables IV and V.
Table IV.
Predicted positive/negative performances of the training and test set data.
Set
Positive, n
Negative, n
Total, n
Training set
CRC
41
0
41
Control
0
43
43
Test set
CRC
15
3
18
Control
1
18
19
CRC, colorectal cancer.
Table V.
Performance evaluation in the training and test sets.
Set
Sensitivity, %
Specificity, %
Accuracy, %
ROC AUC
Training set
100.0
100.0
100.0
1.00
Test set
83.3
94.7
89.2
0.96
ROC AUC, area under the receiver operating characteristic curve.
In terms of specificity and accuracy, both the training and test sets performed well. Sensitivity, specificity and accuracy in the training set were all 100%, and were 83.3, 94.7 and 89.2% in the test set, respectively. Furthermore, three of the 18 CRC samples in the test set were predicted as negative results, and one of the 19 healthy control samples in the test set was predicted as a positive result. These false-negative and false-positive results require further study in a larger cohort.The five-gene panel also exhibited a higher ROC AUC when compared with any single gene in both the training set and test set, as presented in Fig. 2A and B. Based on the five gene panel and logistic regression algorithm, the predictive model performed well in separating CRC and healthy controls in both the training set and the test set, as the box-whisker plot illustrates in Fig. 2C. These results suggested this five-gene panel exhibited good performance for CRC discrimination and might be a potential biomarker for CRC diagnosis.
Figure 2.
Model evaluation with ROC curve and box-whisker plot. (A) ROC curve for candidate genes in the training set. (B) ROC curve for candidate genes in the test set. (C) Box-whisker plot to display the LogReg scores of colorectal cancer and controls in the training set and test set. ROC, receiver operating characteristic; LogReg, logistic regression; AUC, area under the curve; PPP3R1, protein phosphatase 3 regulatory subunit Bα; APP, amyloid β precursor protein; CTSH, cathepsin H; PSME4, proteasome activator subunit 4; DDX3X, DEAD-Box Helicase 3 X-Linked.
Protein networks and immunofunctional enrichment analysis
To enrich the signaling pathways to reveal the biological processes underlying these five genes and their involvement in CRC pathogenesis, we assessed the five individual genes (proteins) through protein-protein interactions. This analysis produced a network including 142 proteins that link these five genes (proteins) together, as shown in Fig. 3. Subsequently, canonical pathway analysis found that these genes (proteins) associate to multiple immune-functions and thus might play important roles in the interaction between the immune system and colorectal cancer pathogenesis.The five genes selected were functionally categorized based on Reactome annotation terms and pathways identified with a strict cut-off of adjusted P<0.05, corrected with the Benjamini-Hochberg method. A total of 152 pathways consisting of these five CRC-specific genes were identified, and the top nine immune-related pathways with the highest P-adjusted values were selected for further analysis. As presented in Fig. 4A, these immune-related pathways were categorized into three groups: Adaptive immune response, innate immune response and cytokine signaling of the immune system. The adaptive immune response group included ‘downstream signaling events of B cell receptor (BCR)’, ‘downstream T cell receptor (TCR) signaling’, ‘cross-presentation of soluble exogenous antigens (endosomes)’ and ‘major histocompatibility complex (MHC) class II antigen presentation’. The innate immune response group consisted of ‘CLEC7A (Dectin-1) signaling’, ‘Fc epsilon receptor (FCERI) signaling’, ‘Toll-like receptor cascade’ and ‘neutrophil degranulation’. ‘Interleukin-1 signaling’ was the only pathway identified from the group of cytokine signaling genes in the immune system. The interactions between the enriched immune-related pathways and the related candidate genes of each pathway are indicated in Fig. 4B. These results indicated that these five CRC-specific genes are mainly associated with ‘immune responses’, suggesting a close relationship between immune system variations and the pathogenesis of colorectal carcinoma.
Figure 4.
Immuno-functional enrichment analysis of the five candidate genes and their interacting proteins. (A) Top nine significantly enriched immune-related pathways. (B) The relationship between the top nine enriched immune-related pathways and their associated genes. Red, upregulated candidate genes; green, downregulated candidate genes; light orange, interacting proteins of the candidate genes. MHC, major histocompatibility complex; TCR, T cell receptor.
An immune analysis of the five CRC-specific immune genes was also summarized as per their effect on immune response, as presented in Table VI. It was determined that three of the five candidate genes (PSME4, PPP3R1 and CTSH) were involved in both the adaptive and innate immune response; whilst APP was associated with the innate immune response and cytokine signaling in the immune system, and DDX3X participated only in the innate immune response. The gene involved in the highest number of immune response categories was PSME4, which was involved in five types of immune responses, including: ‘Signaling by the BCR’, ‘TCR signaling’, ‘class I MHC mediated antigen processing and presentation’, ‘C-type lectin receptors (CLRs)’ and ‘FCERI signaling’.
Table VI.
Immune analysis of candidate genes.
Immune response category
Common name
NS Probe ID
Gene class
Adaptive immune response
Innate immune response
Cytokine signaling in the immune system
Synonyms/previous symbols
PSME4
NM_014614
Immune response
Signaling by the BCR, TCR signaling, Class I MHC mediated antigen processing and presentation
CLRs, FCERI
–
PA200, KIAA0077
PPP3R1
NM_000945
Immune response
Signaling by the BCR
CLRs, FCERI
–
CALNB1, CNB, CNB1
APP
NM_000484
Immune response/cytokines
–
Toll-like receptor cascades
Present
AD1
CTSH
NM_004390
Immune response
MHC class II antigen presentation
Neutrophil degranulation
–
CPSB
DDX3X
NM_024005
Immune response
–
Neutrophil degranulation
–
DDX3
BCR, B cell receptor; TCR, T cell receptor; MHC, major histocompatibility complex; CLRs C-type lectin receptors; FCERI, Fc epsilon receptor signaling; PPP3R1, protein phosphatase 3 regulatory subunit Bα; APP, amyloid β precursor protein; CTSH, cathepsin H; PSME4, proteasome activator subunit 4; DDX3X, DEAD-Box Helicase 3 X-Linked.
Survival analysis of candidate genes
The KM survival curve is able to assess the effect of any gene or gene combination on survival for various types of cancer, using >30,000 samples measured using gene chips or RNA-sequencing (49). As there are no well-defined gene expression profiles of CRC in the KM plot database, a KM survival analysis was performed for the five CRC-specific immune genes to visualize the association between gene expression and clinical outcome based on the mRNA datasets of 165 rectum adenocarcinoma that collected in the KM plot database (http://www.kmplot.com/). The CRC OS rates associated with the five genes are presented in Fig. 5. Of the five genes, only PPP3R1 demonstrated prognostic power for rectum adenocarcinoma (P=0.019). High expression of PPP3R1 indicated poorer survival rate (Fig. 5A), consistent with our finding that PPP3R1 was expressed at higher levels in CRC when compared with controls. The other four candidate genes showed no prognostic power for rectum adenocarcinoma.
Figure 5.
Survival analysis based on the mRNA dataset of 165 patients with rectum adenocarcinoma collected by Kaplan-Meier database. Patients were subdivided into high and low expression groups by using the percentiles of each mRNA expression level between the lower and upper quartiles of expression as cut-off point for five candidate immune-related signatures associated with survival. (A) PPP3R1, (B) APP, (C) CSTH, (D PSME or (E) DDX3X. Red and black lines indicate high- and low-expression level groups, respectively. HR, hazard ratio; PPP3R1, protein phosphatase 3 regulatory subunit Bα; APP, amyloid β precursor protein; CTSH, cathepsin H; PSME4, proteasome activator subunit 4; DDX3X, DEAD-Box Helicase 3 X-Linked.
Discussion
The present study compared the peripheral blood transcriptomes of patients with CRC with those of healthy control patients to identify CRC-specific immune genes. In doing so, five candidate genes were selected and used to construct a predictive model for CRC through a process of feature selection using logistic regression. The predictive model exhibited strong statistical power for distinguishing CRC from controls, with an accuracy of 100.0% in the training set and 89.2% in the independent test set. The immunofunctional enrichment analysis revealed that the genes were associated with the adaptive immune response, the innate immune response and cytokine signaling. From the KM datasets, one of the five candidate genes (PPP3R1) demonstrated good prognostic performance in the OS analysis of 165 patients with rectum adenocarcinoma using the KM plot database (http://www.kmplot.com/). Considering the similarities in pathogenesis between CRC and rectum adenocarcinoma, it was reasonable to hypothesize that PPP3R1 may exhibit a similar prognosis to CRC. These preliminary results are promising; however, further research with larger cohorts and long-term follow-up is required to validate the results.In current clinical practice, CRC screening and diagnosis relies mainly on the fecal occult blood test (FOBT), colonoscopy and carcinoembryonic antigen (CEA) detection (50). However, each method has disadvantages: The sensitivity of FOBT and CEA is limited, and whilst colonoscopy is the gold standard for CRC diagnosis, the bowel preparation required and occasional severe complications that occur limit its application (51). Furthermore, clinical stratification, treatment and prognosis of CRC depends on tumor location and TNM staging; however, treatment outcomes vary, and remain unsatisfactory, suggesting that these indicators do not provide optimal prognostic information (52).There is increasing evidence that the pathogenesis, progression, treatment response and prognosis of CRC are all significantly influenced by a complex interplay between cancer cells and the immune system, particularly by the tumor microenvironment (10,53,54). The immune cells in peripheral blood constitute a third line of immune defense; thus, the peripheral blood transcriptome could reflect the overall immune characteristics of all types of cancer, including CRC (22). Investigating novel immune-related gene expression signatures using peripheral blood transcriptome profiling will provide new strategies for the diagnosis, treatment and prognosis of CRC in the future.The present study identified five CRC-specific immune genes (PPP3R1, APP, CTSH, PSME4 and DDX3X). Of these, PPP3R1, APP and CTSH were upregulated in the blood samples from patients with CRC compared with healthy control samples, whereas PSME4 and DDX3X were downregulated. The KM survival analysis showed that of the five genes, only PPP3R1 was closely associated with the clinical prognosis of CRC, with PPP3R1-upregulation indicating poor survival. This finding suggests that the survival of CRC patients is associated with the patients' immune system status and that PPP3R1 might serve as a biomarker for predicting CRC patient prognosis.PPP3R1, also named calcineurin B, is one of the regulatory subunits of calcineurin (CaN). CaN is a calcium-dependent, calmodulin stimulated serine/threonine protein phosphatase under the control of Ca2+/calmodulin (55). CaN is a heterodimer, which includes a catalytic α subunit and a Ca2+ binding regulatory β subunit (56,57). The CaN catalytic subunit gene family consists of three members (serine/threonine-protein phosphatase 2B catalytic subunits α, β and γ) (58). Lakshmikuttyamma et al (59) revealed that CaN expression is closely associated with the development of colon carcinoma, as indicated by an increased level of CaN phosphatase activity and higher levels of protein expression in colorectal adenocarcinomas. The main CaN signaling pathways in CRC are regulated via nuclear factor of activated T cells 5 (NFAT) and CaN-NFAT, which serve critical roles in mediating cellular activation of T cell immune responses (60). Adaptive immune responses are an essential aspect of tumor-host interactions in CRC, and the progression from pre-cancerous (adenomatous) colon lesions to malignant CRC involves a complex pathway associated with activated T lymphocytes (61). A previous study demonstrated that CaN and NFAT are constitutively expressed by intestinal epithelial cells and that these genes promote CRC development (62). In early CRC, CaN is activated by microbiota derived toll-like receptor ligands, and CaN and NFAT promote oncogenesis via modulation of tumor stem cells in an NFAT-dependent manner (63,64).To summarize, upregulated CaN (through PPP3R1) mediates cellular activation and the immune response in T cells, reflecting tumor-host interactions and playing an essential role in the oncogenic processes involved in CRC development. Therefore, this gene could be considered a characteristic feature of CRC and of potential importance for CRC detection. Consistent with the aforementioned previous studies, the present study demonstrated that PPP3R1 was significantly increased in the CRC group, and survival analysis also indicated that high levels of PPP3R1 were associated with a poor prognosis.APP is a membrane-bound protein ubiquitously expressed in a variety of cell types and is also found in neurite plaques of Alzheimer's disease (AD) as a precursor protein of β-amyloid (65). To the best of our knowledge, the majority of research has focused on the role of APP in AD; however, its biological functions in non-neural cells and tumors remain unknown (66). Meng et al (67) demonstrated, both in vitro and in vivo, that APP is involved in the proliferation of humancolon carcinoma cells. They also postulated that APP plays a crucial role in the cellular proliferation and survival of non-neural cells, including colon carcinoma cells. Another study reported that both CRC and pancreatic adenocarcinoma upregulated APP, and that patients with these diseases whose tumors exhibit upregulated APP, present with a poor prognosis and a short survival time (68). Consistent with these reports, in the present study, APP was also upregulated in the CRC group, and it was suggested that APP-upregulation may serve as a potent CRC diagnostic marker. In addition, APP-downregulation may prove to be a novel molecular target for adjuvant and neoadjuvant pharmacological treatment options.There is notable evidence of an inverse link between AD and cancer. For example, a previous study suggested that AD was longitudinally associated with a decreased risk of cancer, and a history of cancer was associated with a decreased risk of AD in the group aged ≥65 years Caucasian adults (69). Whether CRC specifically is associated with AD risk or with other neurodegenerative disorders requires further investigation.CTSH is a lysosomal glycoprotein and a member of the cysteine proteinase family. Together with cathepsins B and L, CTSH belongs to the peptidase C1 protein family, and can act both as an aminopeptidase and as an endopeptidase (70). Various types of cysteine proteinase have critical roles in MHC class II immune responses, apoptosis and activation of growth factors and hormones (71–74). In 1985, a study observed that pre-operative serum levels of C-reactive protein (CRP) were inversely correlated with the activity of cathepsin H and collagenase, and that levels of these peptidases were raised in rectal and sigmoid tumors (75). This study found that CTSH activity and protein patterns reflect both cancer stage and site. CTSH-specific activity is significantly increased in CRC, and there is a distinct pattern of gene expression during CRC progression. These findings suggest that CTSH may be particularly useful in defining Dukes' B and C stage (76) cancer and in distinguishing subsets of cancer types at a given site (77). Another study indicated that CTSH levels were significantly increased in the serum of patients with CRC, and that higher levels are associated with a poor prognosis (78). A different study reported that intestinal epithelial cells contain abundant constitutive levels of the cathepsin proteases. These function in human leukocyte antigen class II mediated antigen presentation to CD4(+) T lymphocytes in the presence of the proinflammatory cytokine γ-IFN (79). Consistent with these reports, CTSH was increased in the serum of patients with CRC in the present study, and was also shown to be involved in MHC class II antigen presentation.DDX3X is a subfamily of the DEAD-box helicase (DDX), which is the largest RNA helicase family and which regulates RNA biogenesis by unwinding short RNA duplexes (60). The DDX3X subfamily performs numerous nuclear functions and plays a role in the regulation of translation (80,81). DDX3X and DDX3 have dual roles in different types of cancer; acting either as oncogenes or as tumor suppressor genes (82).In CRC, the function of DDX3 remains controversial. Some studies hypothesize that DDX3 acts as a tumor suppressive gene with significant prognostic predictive power in CRC, and have found that a low level of DDX3 indicates poor prognosis and that downregulation promotes metastasis (83). In other research, DDX3 was found to have the opposite effect, acting as an oncogenic gene in CRC, with upregulation of DDX3 correlated with the β-catenin/Wnt signaling pathway (84). Inhibition of DDX3 with the small molecule inhibitor RK-33, which binds to the ATP-binding site of DDX3, could inhibit Wnt signaling, and this strategy may indicate a promising therapy in a subset of patients with CRC (85). In the present study, DDX3X was a tumor suppressor gene that was downregulated in the CRC group when compared with the healthy controls. Understanding the definite role of DDX3X in CRC requires further investigation.The most downregulated gene in the present study, PSME4, also named Proteasome Activator PA200, is a heat/armadillo repeat protein. It binds to the ends of core or 20S proteasomes, specifically recognizes acetylated histones and promotes ATP and ubiquitin-independent degradation of core histones during spermatogenesis and the DNA damage response (86,87). To the best of our knowledge, there are currently only a few reports on the role of PSME4 in cancer, and only one report on the regulation of proteasome activator PA200 on tumor cell (HeLa cervical carcinoma and B16.F10murinemelanoma cell) responsiveness to glutamine and resistance to ionizing radiation (88). The present study is, to the best of our knowledge, the first to suggest that PSME4 has a role as a tumor suppressor gene in CRC.In the present study, peripheral blood transcriptome profiling analysis identified five immune-system related genes that could discriminate CRC samples from healthy controls. There were more patients with left-sided CRC than with right-sided CRC; however, we previously identified that there are no differences in blood RNA biomarkers between left- and right-side CRC (31).Using these five CRC-specific immune genes, the performance of a predictive model was constructed and evaluated. Functional enrichment analysis indicated that the five biomarkers were mainly involved in the following pathways: ‘Signaling by the BCR’, ‘TCR signaling’, ‘class I MHC mediated antigen processing and presentation’, ‘CLRs’, ‘FCERI signaling’, ‘toll-like receptor cascades’, ‘signaling by interleukins’, ‘MHC class II antigen presentation’ and ‘neutrophil degranulation’. These nine immune signaling pathways are associated with the adaptive immune response, the innate immune response and with cytokine signaling. Survival analysis of the five candidate genes indicated that upregulation of PPP3R1 predicted a poor survival rate in patients with CRC, and that the other four candidate genes showed no significant prognostic power for CRC.As a case control report, there are some limitations to the present study. Firstly, the sample size was relatively small and different genes or more genes with better discriminatory power may be identified among a larger independent cohort of patients; second, a peripheral blood transcriptome analysis could reflect some aspects of immune status rather than global alterations; and third, the nature of the mechanisms driving these immune-related transcriptomic biomarkers in peripheral blood is not yet clear, and the biological functions of some biomarkers require further study. For example, an in vitro investigation of the biological functions of genes would be helpful in elucidating mechanisms of carcinogenesis in CRC, and such a study will be carefully considered for future work.Furthermore, it has been reported that microsatellite instability (MSI) is detected in 15% of all CRC cases (89). MSI is a hypermutable phenotype caused by the loss of DNA mismatch repair activity, which has been observed in different types of cancer cells and thus could be a potential biomarker for cancer detection (90). This could be a further potential limitation. However, this present study of blood-based biomarkers focused on gene expression profiles of blood cells instead of CRC cells. Investigating the consistency between CRC cell MSI levels and blood cell gene expression profiles in CRC is required in the future.The relationship between bowel microbiota and immune cells also requires further investigation. Microbiota populations in the human large bowel usually exist in a symbiotic relationship with the host, and there is an increasing amount of evidence to suggest that the intestinal microbiota plays an important role in the development of CRC (91). For example, microbiotic imbalances may expose the colon to different metabolic and inflammatory stimuli (92).In conclusion, the present study established a peripheral blood analytic methodology as a promising technology for the diagnosis of CRC. The results also provide more information about the immune system-related pathogenic mechanisms involved in CRC. In addition, these findings may provide clues to potential novel immunotherapy targets for CRC.
Authors: Leming Shi; Laura H Reid; Wendell D Jones; Richard Shippy; Janet A Warrington; Shawn C Baker; Patrick J Collins; Francoise de Longueville; Ernest S Kawasaki; Kathleen Y Lee; Yuling Luo; Yongming Andrew Sun; James C Willey; Robert A Setterquist; Gavin M Fischer; Weida Tong; Yvonne P Dragan; David J Dix; Felix W Frueh; Frederico M Goodsaid; Damir Herman; Roderick V Jensen; Charles D Johnson; Edward K Lobenhofer; Raj K Puri; Uwe Schrf; Jean Thierry-Mieg; Charles Wang; Mike Wilson; Paul K Wolber; Lu Zhang; Shashi Amur; Wenjun Bao; Catalin C Barbacioru; Anne Bergstrom Lucas; Vincent Bertholet; Cecilie Boysen; Bud Bromley; Donna Brown; Alan Brunner; Roger Canales; Xiaoxi Megan Cao; Thomas A Cebula; James J Chen; Jing Cheng; Tzu-Ming Chu; Eugene Chudin; John Corson; J Christopher Corton; Lisa J Croner; Christopher Davies; Timothy S Davison; Glenda Delenstarr; Xutao Deng; David Dorris; Aron C Eklund; Xiao-hui Fan; Hong Fang; Stephanie Fulmer-Smentek; James C Fuscoe; Kathryn Gallagher; Weigong Ge; Lei Guo; Xu Guo; Janet Hager; Paul K Haje; Jing Han; Tao Han; Heather C Harbottle; Stephen C Harris; Eli Hatchwell; Craig A Hauser; Susan Hester; Huixiao Hong; Patrick Hurban; Scott A Jackson; Hanlee Ji; Charles R Knight; Winston P Kuo; J Eugene LeClerc; Shawn Levy; Quan-Zhen Li; Chunmei Liu; Ying Liu; Michael J Lombardi; Yunqing Ma; Scott R Magnuson; Botoul Maqsodi; Tim McDaniel; Nan Mei; Ola Myklebost; Baitang Ning; Natalia Novoradovskaya; Michael S Orr; Terry W Osborn; Adam Papallo; Tucker A Patterson; Roger G Perkins; Elizabeth H Peters; Ron Peterson; Kenneth L Philips; P Scott Pine; Lajos Pusztai; Feng Qian; Hongzu Ren; Mitch Rosen; Barry A Rosenzweig; Raymond R Samaha; Mark Schena; Gary P Schroth; Svetlana Shchegrova; Dave D Smith; Frank Staedtler; Zhenqiang Su; Hongmei Sun; Zoltan Szallasi; Zivana Tezak; Danielle Thierry-Mieg; Karol L Thompson; Irina Tikhonova; Yaron Turpaz; Beena Vallanat; Christophe Van; Stephen J Walker; Sue Jane Wang; Yonghong Wang; Russ Wolfinger; Alex Wong; Jie Wu; Chunlin Xiao; Qian Xie; Jun Xu; Wen Yang; Liang Zhang; Sheng Zhong; Yaping Zong; William Slikker Journal: Nat Biotechnol Date: 2006-09 Impact factor: 54.908
Authors: Ilona Kryczek; Ke Wu; Ende Zhao; Shuang Wei; Linhua Vatan; Wojciech Szeliga; Emina Huang; Joel Greenson; Alfred Chang; Jacek Roliński; Piotr Radwan; Jingyuan Fang; Guobin Wang; Weiping Zou Journal: J Immunol Date: 2011-02-28 Impact factor: 5.422
Authors: Adel M Zaatar; Chun Ren Lim; Chin Wei Bong; Michelle Mei Lin Lee; Jian Jiek Ooi; David Suria; Rakesh Raman; Samuel Chao; Hengxuan Yang; Soon Bin Neoh; Choong-Chin Liew Journal: J Exp Clin Cancer Res Date: 2012-09-17