Eun-A Ko1, Young-Won Kim2, Donghee Lee2, Jeongyoon Choi2, Seongtae Kim2, Yelim Seo2, Hyoweon Bang2, Jung-Ha Kim3, Jae-Hong Ko2. 1. Department of Physiology and Cell Biology, University of Nevada, Reno School of Medicine, Reno, Nevada 89557, USA. 2. Department of Physiology, College of Medicine, Chung-Ang University, Seoul 06974, Korea. 3. Department of Family Medicine, Chung-Ang University Hospital, College of Medicine, Chung-Ang University, Seoul 06973, Korea.
Abstract
Lung cancer is the most common cause of cancer deaths worldwide and several molecular signatures have been developed to predict survival in lung cancer. Increasing evidence suggests that proliferation and migration to promote tumor growth are associated with dysregulated ion channel expression. In this study, by analyzing high-throughput gene expression data, we identify the differentially expressed K+ channel genes in lung cancer. In total, we prioritize ten dysregulated K+ channel genes (5 up-regulated and 5 down-regulated genes, which were designated as K-10) in lung tumor tissue compared with normal tissue. A risk scoring system combined with the K-10 signature accurately predicts clinical outcome in lung cancer, which is independent of standard clinical and pathological prognostic factors including patient age, lymph node involvement, tumor size, and tumor grade. We further indicate that the K-10 potentially predicts clinical outcome in breast and colon cancers. Molecular signature discovered through K+ gene expression profiling may serve as a novel biomarker to assess the risk in lung cancer.
Lung cancer is the most common cause of cancer deaths worldwide and several molecular signatures have been developed to predict survival in lung cancer. Increasing evidence suggests that proliferation and migration to promote tumor growth are associated with dysregulated ion channel expression. In this study, by analyzing high-throughput gene expression data, we identify the differentially expressed K+ channel genes in lung cancer. In total, we prioritize ten dysregulated K+ channel genes (5 up-regulated and 5 down-regulated genes, which were designated as K-10) in lung tumor tissue compared with normal tissue. A risk scoring system combined with the K-10 signature accurately predicts clinical outcome in lung cancer, which is independent of standard clinical and pathological prognostic factors including patient age, lymph node involvement, tumor size, and tumor grade. We further indicate that the K-10 potentially predicts clinical outcome in breast and colon cancers. Molecular signature discovered through K+ gene expression profiling may serve as a novel biomarker to assess the risk in lung cancer.
Ion channels are ion-permeable protein complexes in the lipid membranes. They establish and control the voltage gradient across biological membranes by gating the flow of specific small ions down their electrochemical gradient, which play important roles in diverse cellular processes including cardiac, skeletal, and smooth muscle contraction, epithelial transport of nutrients and ions, T-cell activation and pancreatic beta-cell insulin release, hormonal secretion, osmotic regulation of blood pressure and so on [12345]. Previous studies have shown that ion channels are involved in the development of various human diseases including cancers [67891011]. The roles of ion channels, especially potassium channels, in the growth of tumor cells have been well documented [121314151617]. Blocking channel activity affects the growth of certain tumors both in vitro and in vivo [131819].Among all the voltage-gated ion channels, potassium channels (K+ channels) have attracted most of the attentions in regard to the relationship with humancancers because K+ channels constitute the largest heterogeneous family, such as voltage-gated K+ (Kv) channels, calcium-activated K+ (KCa) channels, inwardly rectifying K+ (Kir) channels, and two-pore K+ (K2p) channels, with more than eighty genes encoding membrane proteins that control membrane potential [20]. More importantly, K+ channels play a pivotal role in the proliferation, migration, and invasion of various tumor cells and have been considered as new targets for designing cancer treatment strategies [2021]. For example, Kv channels have been found to be involved in the proliferation of uterine and breast cancer cells [2223]. Similarly, the activation of KCa channels has been shown to drive tumor cell proliferation in astrocytoma and glioma [2425]. Actually, increasing evidences indicate that, instead of a single subfamily of K+ channel being present in tumor cells from tissues such as colon, prostate, and breast, a variety of K+ channels are found in these tissues [17]. For example, the expression of Kv, KCa, and K2p channels has been reported in prostate cancer cells [262728].Lung cancer is the most common cause of cancer-related death in human worldwide [2930]. Several molecular signatures have been developed to predict survival in lung cancer [313233343536]. However, little is known regarding the role of K+ channels in promoting tumor growth and its influence on the prognosis of lung cancer compared with other cancers. In this study, we analyzed high-throughput gene expression data to identify K+ channel genes implicated in lung cancer. A K+ channel gene signature was sought that differentiates lung tumor tissues from normal ones. This molecular signature predicted clinical prognosis in humanlung cancer effectively, which is independent of, but cooperative with, standard clinical and pathological prognostic factors including patient age, lymph node involvement, tumor size, and tumor grade. We also indicated that this gene signature potentially may be used to predict outcome in humanbreast and colon cancers.
METHODS
K+ channel genes
The definition of ion channel gene for human was obtained from IUPHAR-DB [37]. In total, we collected eighty-eight K+ channel genes, including 47 Kv channels, 12 KCa channels, 15 Kir channels, and 14 K2p channels (Supplementary Table 1).
Gene expression data
Two independent gene expression datasets on lung cancer (GSE10072 and GSE19804) were downloaded from the Gene Expression Omnibus (GEO) database. These two datasets were chosen based on the availability of both normal and tumor tissue specimens, as well as having a large number of samples available. We used these two datasets to filter out the differentially expressed genes between normal and tumor tissues.Besides the two GEO datasets we mentioned above, we also downloaded the gene expression datasets representing human lung, breast, and colon cancers from publicly available repositories. These datasets were chosen based on the large number of samples, the availability of clinical outcome data, and the diversity of tumor types. For each tumor type, training and validation cohorts were constructed. For lung cancer, we obtained three datasets (n = 359) which were available from a single study [31]. Two datasets were combined as training cohort (n = 257) and the other one was used as validation cohort (n = 102). The dataset for breast cancer (n = 295) was available from previous study [38]. Lastly, for colon cancer, we downloaded the dataset (n = 232) from GEO database (GSE17538). In breast and colon cancer, the tumor samples were randomly separated into two parts (1/2 for training and 1/2 for validation). Clinical characteristics of the patients used for training and validation are listed in Supplementary Table 2.
Statistical analysis
For the datasets GSE10072 and GSE19804, two-tailed t-test was used to identify the genes that differentially expressed between normal and tumor tissues. Genes with adjusted p-value < 0.01 after Bonferroni procedure were deemed dysregulated. The common differentially expressed genes shared by the two datasets were saved for further analysis.Next, we tested the power of the differentially expressed genes in predicting the clinical outcome in lung, breast, and colon cancers. For each cancer training/validation dataset, we normalized the gene expression level into the scale of [−1, 1] by probability of expression (POE) algorithm [3940] implemented in the metaArray library of the R Statistical Package (R Foundation for Statistical Computing, Vienna, Austria) [41]. Based on the gene expression and clinical outcome data from the training dataset, we can assign a Wald statistic generated by Cox proportional-hazard regression to each gene as a weight. A risk score was calculated for each patient using a linear combination of weighted gene expression as below:Here, s is the risk score of the patient; n is the number of differentially expressed genes; w denotes the weight of gene i; e denotes the expression level of gene i; and µ and τ are the mean and standard deviation of the gene expression values for gene i across all samples, respectively. Patients were then divided into high-score and low-score groups with the median of the risk score as the threshold value. A high score indicated a poor outcome. The weight of each gene was fixed, based on the training groups, and then tested in the validation groups [34].
RESULTS
Differentially expressed K+ channel genes in lung cancer
Two independent lung cancer datasets (GSE10072 and GSE19804) were obtained from GEO database. Both datasets contain the gene expression data from both normal and tumor lung tissues. For the dataset GSE10072, all the subjects (59 normal and 58 tumor tissues) were collected in the Lombardy region of Italy [42]. Affymetrix Human Genome U133A Array was used to measure the gene expression levels in this dataset. In contrast, the subjects in the dataset GSE19804 (60 normal and 60 tumor tissues) were from Taiwan [43]. All the subjects in this cohort are female. The gene expression levels were assayed by Affymetrix Human Genome U133 Plus 2.0 Array (Affymetrix, Santa Clara, CA, USA).In the dataset GSE10072, all the K+ channel genes were represented by 97 probesets. Among these probesets, 7 probe sets representing 6 genes were found be to up-regulated in tumor tissue while 12 probesets coding for 9 genes were down-regulated in tumor tissue, at the specified significance level (Bonferroni-adjusted p < 0.01) (Supplementary Table 3). Most of these results can be reproduced in the dataset GSE19804. 5 out of 6 up-regulated K+ channel genes in the dataset GSE10072 were validated in the dataset GSE19804 (Fig. 1 and Supplementary Fig. 1). Similarly, 5 out of 9 down-regulated genes in the dataset GSE10072 were confirmed by the dataset GSE19804 (Fig. 1 and Supplementary Fig. 1). Therefore, we designated the ten common dysregulated genes (5 up-regulated and 5 down-regulated in tumor tissue) shared by both datasets as K-10 signature (Table 1). This K-10 signature accurately distinguished tumor tissue specimens from normal ones in both Italian and Taiwanese cohorts (Fig. 2).
Fig. 1
Distribution of expression level for the K-10 signature in the Italian cohort (GSE10072).
The 10 dysregulated genes were verified in the Taiwanese cohort (GSE19804), among which 5 genes (6 probe sets, the upper row) were up-regulated while 5 genes (7 probe sets, the lower row) were down-regulated in tumor tissue. Each panel was labelled with the corresponding gene symbol and probeset ID. Y-axis: log transformed expression level. The solid line depicts the mean of expression value for normal or tumor tissue. The dot line indicates the overall mean of expression value across all individual samples.
Table 1
K-10 signature
Fig. 2
Heatmap of expression level for the K-10 signature.
(A) Heatmap of expression level for the K-10 signature in the Italian cohort (GSE10072). 6 probe sets represent 5 up-regulated genes and 7 probe sets represent 5 down-regulated genes. Each row was labelled with the corresponding gene symbol and probe set ID. “++”: tumor tissue; “−”: normal tissue. (B) Heatmap of expression level for the K-10 signature in the Taiwanese cohort (GSE19804). 15 probe sets encoding 5 up-regulated and 5 down-regulated genes were displayed. Each row was labelled with the corresponding gene symbol and probe set ID. “++”: tumor tissue; “−”: normal tissue.
Predicting clinical outcome in lung cancer by K-10 signature
We hypothesized that K-10 signature is predictive of tumor outcome in lung cancerpatients. We constructed a risk scoring system that combined gene expression of K-10 with risk for death in the training dataset. High-score patients were defined as those having a risk score greater than or equal to the group median score. In independent validation cohorts, we tested the ability of the risk score to stratify patients into prognostic groups. We performed Kaplan-Meier survival analysis comparing the high-score and low-score groups and determined statistical significance by log-rank tests. The K-10 signature was able to identify patients with poor overall survival in lung cancer in both the training (p < 0.001) and testing (p = 0.033) cohorts (Fig. 3). This association between K-10 risk score and survival was also confirmed by univariate Cox proportional hazard analysis of overall survival. High-score patients had an increased risk for death of 1.78-fold in the training cohort (p < 0.001) and 2.08-fold in the testing cohort (p = 0.036).
Fig. 3
Kaplan-Meier curves for the patients from the training and validation cohorts.
The expression of K-10 signature predicts poor clinical outcome in lung cancer. High-score patients were defined as those having a K-10 risk score greater than or equal to the group median score. p-value were calculated by log-rank tests for the differences in overall survival between the high-score and low-score groups.
Independence of K-10 from other clinicopathologic factors
We investigated the performance of the K-10 signature in comparison with clinicopathologic variables associated with prognosis in lung cancer. A multivariate Cox analysis of overall survival indicated that K-10 status remained a significant covariate in relation to the standard clinicopathologic factors in lung cancer, including patient age, lymph node status, tumor size, and tumor grade (Table 2). We next stratified the patients according to the factors significant on multivariate analysis, i.e., age, lymph node status, and tumor size, respectively. For patients with age < 65 and ≥ 65, the high-score patients had a significantly increased risk for death of 1.31-fold (p = 0.013) and 1.30-fold (p = 0.006), respectively. For patients with and without lymph node involvement, the high-score patients had a significantly increased risk for death of 1.31-fold (p = 0.005) and 1.31-fold (p = 0.014), respectively. For patients with tumor size < T3 and ≥ T3, the high-score patients had a significantly increased risk for death of 1.25-fold (p = 0.005) and 1.41-fold (p = 0.066), respectively. Kaplan-Meier survival analysis also demonstrated a significantly reduced overall survival for high-score patients in each subset grouped by age, lymph node status, and tumor size (Fig. 4). Taken together, these results suggest that the expression of K-10 signature is associated with clinical outcomes and is an independent prognostic factor.
Table 2
Cox proportional hazard analysis of overall survival for the lung cancer patients
Fig. 4
Kaplan-Meier survival curves of the lung cancer patients grouped by clinicopathologic factors.
K-10 is independent from other clinicopathologic factors in lung cancer. (A) Patients were stratified by age; (B) Patients were stratified by lymph node status; and (C) Patients were stratified by tumor size. High-score patients were defined as those having a K-10 risk score greater than or equal to the group median score. p-value were calculated by log-rank tests for the differences in overall survival between the high-score and low-score groups.
Predicting survival in other cancers by K-10 signature
We further tested the performance of K-10 in predicting clinical outcome in breast and colon cancers. A risk score based on the expression of K-10 signature was assigned to each patient with either breast or colon cancer. Kaplan-Meier survival analysis indicated that the K-10 signature was also able to identify patients with poor overall survival in breast and colon cancers in both the training (p = 0.004 for breast cancer and p = 0.016 for colon cancer) and testing (p = 0.047 in breast cancer and p = 0.007 in colon cancer) cohorts (Fig. 5). Univariate Cox proportional hazard regression on overall survival demonstrated that, in the training cohorts, high-score patients had an increased risk for death of 2.59-fold in breast cancer (p = 0.005) and 2.29-fold in colon cancer (p = 0.019). In the validation cohorts, high-score patients had an increased risk for death of 1.97-fold in breast cancer (p = 0.052) and 2.05-fold in colon cancer (p = 0.009).
Fig. 5
Kaplan-Meier curves for the patients from the training and validation cohorts in breast and colon cancers.
The expression-of K-10 signature predicts poor clinical outcome in both (A) breast and (B) colon cancers. High-score patients were defined as those having a K-10 risk score greater than or equal to the group median score. p-values were calculated by log-rank tests for the differences in overall survival between the high-score and low-score groups.
We also conducted multivariate analyses using standard clinical and pathological factors available for the breast and colon cancer datasets. Cox proportional hazards regression indicated that K-10 status was still a significant covariate in colon cancer but not in breast cancer (Supplementary Table 4). In breast cancer with poor prognosis (age < 40, tumor size ≥ T2, tumor grade ≥ 2, or ER-negative), we further stratified the patients by using the K-10 risk score. Kaplan-Meier survival analysis demonstrated a significantly reduced overall survival for the high-score patients when compared to the low-score patients except for the ER-negative group (Supplementary Fig. 2). A similar analysis for the colon cancerpatients with poor prognosis (age ≥ 60 or with stage 3, 4) also demonstrated an increased risk for death in the high-score patients (Supplementary Fig. 3).
DISCUSSION
The major aim of this work is to identify the differentially expressed K+ channel genes in lung cancer. Leveraging microarray expression data in two independent cohorts, we identified a gene signature comprised of K-10 for differentiating lung tumor tissue from normal ones. K-10 signature shows a considerable power to predict clinical outcome in humanlung cancer. Multivariate analysis indicates that K-10 is independent of, but cooperative with, standard clinical and pathological prognostic factors including patient age, lymph node involvement, tumor size, and tumor grade. Further analysis suggests that K-10 is also potential to serve as a novel biomarker to assess the risk in breast and colon cancers.The differentially expressed K+ channel genes were first derived from the Italian lung cancer cohort. At this step, we in total identified 15 dysregulated genes between normal and tumor tissue. We next validated these genes in the Taiwanese female cohort. Despite racial and gender difference, we found 10 common dysregulated genes shared by the two groups. Heatmap of expression level of these genes demonstrates that K-10 signature clearly separates the tumor samples from the normal ones in both datasets, which suggests that K-10 signature is not likely to be racial and gender specific.These findings suggest that K+ channel is a mediator of tumor growth and progression. K+ channels participate in the regulation of membrane potential and electrochemical driving force for Ca2+ entry and both mechanisms are associated with mitotic cell cycling, proliferation, cell volume and development of cancer [444546]. Previous studies have described that alteration in expression and function of diverse types of K+ channels, including KCa, Kv, and K2p, in colon, breast, and pancreatic cancers [474849]. Kv channels are widely implicated in the development of different tumors [5051]. Specifically, we found that KCNA5, KCNAB1, and KCNAB2 were downregulated in lung tumor tissue. KCa channels are gated by calcium and contribute to cell proliferation by change of intracellular calcium during tumor cell migration [4852]. Based on our K-10 signature, both KCNMB3 and KCNN4 among KCa channels were upregulated in lung tumor tissue. K2p channels are background channels that open at rest and affect membrane potential, which also play a critical role in both cell apoptosis and tumorigenesis [4953]. We also found that altered expression of K2p channels in lung tumor tissue, in particular, KCNK1, KCNK12 , and KCNK5, were overexpressed in tumor tissue, while KCNK3 was down-regulated.The mutation of channels is also critical factor for functional dysregulation of K+ channels. For example, insertion/deletion of 12 nucleotides in the 5' untranslated region (5'UTR) of KCNK1 was identified in long QT syndromepatients [54]. The missense variants in KCNK5 have been observed among patients with Balkan-onset nephropathy [55]. Lorenz et al. [56] reported an allele association between the KCNMB3delA750- mutant and idiopathic systemic epilepsy syndrome. Genomic DNA analysis revealed a nonsense mutation in KCNA5 in patients with idiopathic atrial fibrillation, and two mutations, G182R and E211D, localized to the KCNA5, were identified in idiopathic pulmonary arterial hypertension (IPAH) patients [5758]. Moreover, homozygous mutations of KCNK3 have been identified in PAH patients [59]. The finding in our study, however, the mutations of K+ channels are not expected to correspond to the K-10 signature genes.Molecular signatures discovered through gene expression profiling have been shown to add prognostic value to clinical and pathological findings in several humancancers [313233343536]. There is also accumulated evidence indicating that differentially expression of K+ channels have correlated with tumorigenesis in lung, breast and colon cancer pathology. Previous reports suggesting that K2p channel may be novel molecular-markers with differential expression related to breast cancer. For example, KCNK5, KCNK9, and KCNK12 were highly expressed, and KCNK6 and KCNK15 were down-regulated in breast cancer [60]. The dysregulation of KCNA5 has also identified in colorectal cancer [51] and breast cancer [61]. Similar to our results, increased KCNN4 expression was identified with aggressive features of lung carcinoma [62]. However, four genes among our K-10 signature, KCNMB3, KCNAB1, KCNAB2 , and KCNJ8 , have been less researched, with only few studies related to cancer [63].In this study, we analyzed the gene expression profiles in humanlung cancer and developed gene signature to predict cancer. However, the additional molecular studies are required to validate K-10 signature in humanlung cancer. Identifying prognostic variables that work cooperatively with known factors may improve the identification of patients at higher risk for relapse and death.
Authors: M Bloch; J Ousingsawat; R Simon; P Schraml; T C Gasser; M J Mihatsch; K Kunzelmann; L Bubendorf Journal: Oncogene Date: 2006-12-04 Impact factor: 9.867
Authors: Kerby Shedden; Jeremy M G Taylor; Steven A Enkemann; Ming-Sound Tsao; Timothy J Yeatman; William L Gerald; Steven Eschrich; Igor Jurisica; Thomas J Giordano; David E Misek; Andrew C Chang; Chang Qi Zhu; Daniel Strumpf; Samir Hanash; Frances A Shepherd; Keyue Ding; Lesley Seymour; Katsuhiko Naoki; Nathan Pennell; Barbara Weir; Roel Verhaak; Christine Ladd-Acosta; Todd Golub; Michael Gruidl; Anupama Sharma; Janos Szoke; Maureen Zakowski; Valerie Rusch; Mark Kris; Agnes Viale; Noriko Motoi; William Travis; Barbara Conley; Venkatraman E Seshan; Matthew Meyerson; Rork Kuick; Kevin K Dobbin; Tracy Lively; James W Jacobson; David G Beer Journal: Nat Med Date: 2008-07-20 Impact factor: 53.440
Authors: Núria Comes; Joanna Bielanska; Albert Vallejo-Gracia; Antonio Serrano-Albarrás; Laura Marruecos; Diana Gómez; Concepció Soler; Enric Condom; Santiago Ramón Y Cajal; Javier Hernández-Losa; Joan C Ferreres; Antonio Felipe Journal: Front Physiol Date: 2013-10-10 Impact factor: 4.566