| Literature DB >> 35954424 |
Alexsandro Dos Santos1,2,3, Geneviève Ouellete1,2, Caroline Diorio2,4, Sabine Elowe2,3,5, Francine Durocher1,2.
Abstract
Loss of mitotic regulation is commonly observed in cancer and is a major cause of whole-chromosome aneuploidy. The identification of genes that play a role in the proper progression of mitosis can help us to understand the development and evolution of this disease. Here, we generated a list of proteins implicated in mitosis that we used to probe a patient-derived breast cancer (BC) continuum gene-expression dataset generated by our group by human transcriptome analysis of breast lesions of varying aggressiveness (from normal to invasive). We identified cytoskeleton-associated protein 2 (CKAP2) as an important mitotic regulator in invasive BC. The results showed that CKAP2 is overexpressed in invasive BC tumors when compared with normal tissues, and highly expressed in all BC subtypes. Higher expression of CKAP2 is also related to a worse prognosis in overall survival and relapse-free survival in estrogen receptor (ER)-positive and human epidermal growth factor receptor type 2 (HER2)-negative BC patients. Knockdown of CKAP2 in SKBR3 cells impaired cell proliferation and cell migration and reduced aggregate formation in a 3D culture. Our results show the important role of CKAP2 in BC tumorigenesis, and its potential utility as a prognostic marker in BC.Entities:
Keywords: CKAP2; breast cancer; mitosis; prognostic biomarker; tumorigenesis
Year: 2022 PMID: 35954424 PMCID: PMC9367390 DOI: 10.3390/cancers14153759
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Enrichment analysis of mitotically relevant genes. (A) Flowchart of study design. HTA: human transcriptome array. p: p value. FC: fold change. PRECOG: Prediction of Clinical Outcomes from Genomic Profiles. (B) Venn diagram showing the 24 differentially expressed genes in each subgroup of BC progression compared to normal breast lesions. (C) Enrichment analysis of differentially expressed genes (DEGs) in BC using Metascape. Bar graph of enriched terms across input gene list, colored by p-values.
The 24 mitotically relevant genes related to prognosis in breast cancer identified in this study (genes highlighted in green or gray are downregulated or upregulated, respectively). 1 TCGA-BRCA dataset (https://www.cbioportal.org/, accessed on 26 January 2022); 2 UALCAN (http://ualcan.path.uab.edu/, accessed on 21 November 2021); 3 GEPIA (http://gepia.cancer-pku.cn/, accessed on 28 October 2021); 4 HTA data from Kothari and colleagues (2018); 5 PRECOG (https://precog.stanford.edu/, accessed on 1 June 2021); 6 KMplot (https://kmplot.com/, accessed on 1 October 2021). ADH: atypical ductal hyperplasia; DCIS: ductal carcinoma in situ; IDC: invasive ductal carcinoma; BLCA: bladder urothelial carcinoma; BRCA: breast invasive carcinoma; CESC: cervical squamous cell carcinoma and endocervical adenocarcinoma; LIHC: liver hepatocellular carcinoma; LUAD: lung adenocarcinoma.
| Gene Symbol | Gene Name | Expression in TCGA Cancers Other than BC 1,2,3 | Expression in Breast Invasive Carcinoma (Compared to Normal)—TCGA 1,2,3 | Expression (HTA Tissue Compared to Normal 4) | Fold Change 4 | PRECOG 5 | Kmplot 6 | |||
|---|---|---|---|---|---|---|---|---|---|---|
| ADH | DCIS | IDC | ||||||||
| 1 |
| Insulin-like growth factor 1 | underexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | underexpressed | underexpressed in all stages | −1.87 | −2.02 | −2.34 | −4.7 | low expression is a bad prognosis (RFS, OS and DMSF) |
| 2 |
| Endothelin 3 | underexpressed in BRCA, CESC, LUAD, etc. | underexpressed | underexpressed in IDC | −1.03 | −1.23 | −1.53 | −3.42 | low expression is a bad prognosis (RFS) |
| 3 |
| DDB1- and CUL4-associated factor 13 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.11 | 1.19 | 1.96 | 3.38 | high expression is a bad prognosis (RFS and DMSF) |
| 4 |
| Cytoskeleton-associated protein 2 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.09 | −1.03 | 1.67 | 3.72 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 5 |
| Proliferating cell nuclear antigen | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.02 | 1.09 | 1.6 | 3.91 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 6 |
| Epithelial cell transforming 2 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.02 | 1.06 | 1.56 | 4.24 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 7 |
| Ezrin | overexpressed in BRCA, CESC, LIHC, etc. | overexpressed | overexpressed in DCIS | 1 | 1.63 | 1.16 | 4.58 | high expression is a bad prognosis (RFS) |
| 8 |
| Cyclin-dependent kinase 1 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.14 | 1.26 | 1.79 | 4.6 | high expression is a bad prognosis (DMSF) |
| 9 |
| Chaperonin-containing TCP1 subunit 5 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.03 | 1.18 | 1.53 | 5.31 | high expression is a bad prognosis (RFS and DMSF) |
| 10 |
| Abnormal spindle microtubule assembly | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.14 | 1.31 | 1.85 | 6.14 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 11 |
| DNA topoisomerase II alpha | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.07 | 1.23 | 3.85 | 6.52 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 12 |
| Anillin actin-binding protein | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.07 | 1.22 | 2.57 | 6.67 | high expression is a bad prognosis (RFS) |
| 13 |
| Polo-like kinase 1 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.02 | 1.08 | 1.61 | 6.81 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 14 |
| Centromere protein F | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.18 | 1.05 | 2.71 | 6.84 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 15 |
| Cyclin A2 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.14 | 1.12 | 1.88 | 7.08 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 16 |
| Kinesin family member 11 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.06 | 1.08 | 2.1 | 7.19 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 17 |
| Denticleless E3 ubiquitin protein ligase homolog | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.04 | 1.18 | 1.64 | 7.38 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 18 |
| Cyclin B1 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1 | 1.14 | 1.57 | 7.5 | high expression is a bad prognosis (RFS and OS) |
| 19 |
| Kinesin family member 23 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.03 | −1.02 | 1.56 | 7.99 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 20 |
| Marker Of Proliferation Ki-67 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.03 | 1.28 | 2.66 | 8.19 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 21 |
| Nucleolar and spindle-associated protein 1 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | 1.06 | 1.36 | 2.18 | 8.83 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 22 |
| TPX2 microtubule nucleation factor | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.08 | 1.17 | 1.91 | 9.04 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 23 |
| Forkhead box M1 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.06 | 1.06 | 1.52 | 9.81 | high expression is a bad prognosis (RFS, OS and DMSF) |
| 24 |
| Cyclin B2 | overexpressed in BLCA, BRCA, CESC, LIHC, LUAD, etc. | overexpressed | overexpressed in IDC | −1.02 | 1.15 | 1.67 | 10.44 | high expression is a bad prognosis (RFS, OS and DMSF) |
Top 4 clusters with their representative enriched terms (one per cluster) from cellular components enrichment analysis. “Count” is the number of genes in the user-provided lists with membership in the given ontology term. “%” is the percentage of all of the user-provided genes that are found in the given ontology term (only input genes with at least one ontology term annotation are included in the calculation). “Log10(P)” is the p-value in log base 10. “Log10(q)” is the multi-test adjusted p-value in log base 10.
| GO | Description | Count | % | Log10(p) | Log10(q) | Genes |
|---|---|---|---|---|---|---|
| CORUM:310 | Cell-cycle kinase complex CDC2 | 4 | 16.67 | −5.54 | −1.84 | |
| GO:0000940 | Condensed chromosome outer kinetochore | 3 | 12.5 | −2.5 | 0 | |
| GO:0072686 | Mitotic spindle | 8 | 33.33 | −2.19 | 0 | |
| GO:0044297 | Cell body | 3 | 12.5 | −1.34 | 0 |
Top 5 clusters with their representative enriched terms (one per cluster) from pathway enrichment analysis. “Count” is the number of genes in the user-provided lists with membership in the given ontology term. “%” is the percentage of all of the user-provided genes that are found in the given ontology term (only input genes with at least one ontology term annotation are included in the calculation). “Log10(P)” is the p-value in log base 10. “Log10(q)” is the multi-test adjusted p-value in log base 10.
| GO | Description | Count | % | Log10(p) | Log10(q) | Genes |
|---|---|---|---|---|---|---|
| R-HSA-156711 | Polo-like-kinase-mediated events | 5 | 20.83 | −7.07 | −2.84 | |
| R-HSA-69273 | Cyclin-A/B1/B2-associated events during G2/M transition | 6 | 25 | −6.85 | −2.84 | |
| M5893 | HALLMARK MITOTIC SPINDLE | 12 | 50 | −6.12 | −2.42 | |
| WP2361 | Gastric cancer network 1 | 4 | 16.67 | −3.96 | −0.94 | |
| GO:0030866 | Cortical actin cytoskeleton organization | 3 | 12.5 | −3.14 | −0.4 |
Top 5 clusters with their representative enriched terms (one per cluster) cellular functions enrichment analysis. “Count” is the number of genes in the user-provided lists with membership in the given ontology term. “%” is the percentage of all of the user-provided genes that are found in the given ontology term (only input genes with at least one ontology term annotation are included in the calculation). “Log10(P)” is the p-value in log base 10. “Log10(q)” is the multi-test adjusted p-value in log base 10.
| GO | Description | Count | % | Log10(p) | Log10(q) | Genes |
|---|---|---|---|---|---|---|
| M00693 | Cell-cycle-G2/M transition | 4 | 16.67 | −5.07 | −1.7 | |
| GO:0019901 | Protein kinase binding | 9 | 37.5 | −3.12 | 0 | |
| GO:0008022 | Protein C-terminus binding | 5 | 20.83 | −2.71 | 0 | |
| GO:0015631 | Tubulin binding | 8 | 33.33 | −1.64 | 0 | |
| GO:0050839 | Cell-adhesion molecule binding | 4 | 16.67 | −1.45 | 0 |
Figure 2Validation of the data from HTA by qPCR and CKAP2 mutation analysis. (A) Four genes were selected and validated by RT-qPCR in different breast cancer cell lines. GAPDH and HPRT1 were used as the house-keeping genes to normalize mRNA-based expression data using the 2−ΔΔCT method. (B) Bar chart of CKAP2 mutation in pan-cancers from cBioPortal. The red bars indicate gene amplifications, blue bars are homozygous deletions, green bars are non-synonymous mutations, gray bars indicate multiple alterations. (C) Lollipop plot showing the distribution of CKAP2 mutations across the coding protein from all cancers and BC. The y-axis represents the number of mutations. The x-axis represents the amino acid numbers from the domain start and stop positions. (D) Correlation plots illustrating the relationship between mRNA expression levels (RSEM) and putative copy number changes (Genomic Identification of Significant Targets in Cancer-GISTIC2) for the CKAP2 gene. Deep deletion, homozygously deleted; Shallow deletion, heterozygously deleted; Diploid, two alleles present; Gain, low-level gene amplification event; Amplification, high-level gene amplification event. (E) Oncoprint of genomic alterations found in CKAP2 in TCGA-BC. Red bars indicate gene amplifications, blue bars are deep deletions, green bars are missense mutations, gray bars indicate truncating mutations. LumA: luminal A; LumB: luminal B, HER2+: human epidermal growth factor receptor 2 positive; TN: triple negative breast cancer; VUS: variance of unknow significance; CKAP2_C: cytoskeleton-associated protein 2 C-terminus.
Figure 3Human CKAP2 expression levels in different tumor types determined using Oncomine, Tumor Immune Estimation Resource (TIMER2.0), UALCAN (University of Alabama at Birmingham Cancer data analysis Portal), and TNMPLOT. (A) mRNA expression levels of CKAP2 in 20 cancer types from Oncomine. Numbers in red and blue cells represent dataset numbers in which levels of CKAP2 are statistically increased or decreased, respectively (p < 0.05, a fold-change > 1.5, and gene rank top 10%). (B) CKAP2 expression in different cancers was analyzed by TIMER2.0. (* p < 0.05; ** p < 0.01; *** p < 0.001). (C–E) mRNA expression of CKAP2 in BC from UALCAN. Boxplots showing CKAP2 mRNA levels in healthy controls versus individuals with BC (C), BC subclasses (D), and based on BC stages (E) (* p < 0.05; *** p < 0.001). (F–H) CKAP2 protein expression analysis using data from Clinical Proteomic Tumor Analysis Consortium (CPTAC). Boxplots showing CKAP2 protein levels in healthy controls versus individuals with BC (F), BC subclasses (G), and based on BC stages (H). (** p < 0.01; *** p < 0.001). (I,J) CKAP2 expression from paired tumor and adjacent normal tissues using TNMPLOT. (I) Violin plot showing CKAP2 expression from RNA-seq data. (J) Violin plot showing CKAP2 expression from gene chip data. BC: breast cancer; HER2+: human epidermal growth factor receptor 2 positive; TNBC: triple negative breast cancer. Data with p < 0.05 were considered statistically significant.
Figure 4Correlation of CKAP2 expression with clinicopathological parameters and patient survival. (A) Relationship between CKAP2 mRNA expression and clinicopathological parameters (such as age nodal status, receptor status, p53 status, and triple-negative status) of BC generated from bc-GenExMiner. N: number of cases. (B) The prognostic value of CKAP2 in BC patients was plotted from KMplotter. CKAP2 expression and prognosis in BC patients were analyzed for relapse-free survival (RFS) and overall survival (OS). ER: estrogen receptor status; HER2: human epidermal growth factor receptor; HR hazard ratio. Data with p < 0.05 were considered statistically significant.
Figure 5Highly correlated genes (Spearman’s correlation r ≥ 0.5) co-expressed with CKAP2 in TCGA-BRCA cohort. (A) Bar graph of enriched terms across input gene list, colored by p-values, using Metascape. Upper graph: cellular components. Lower graph: pathways. (B) Correlation between CKAP2 and 8 selected genes from the top 30 highly correlated genes (r ≥ 0.5), which are also present in the list of 24 mitotically relevant genes. (C) Venn diagram showing the overlap of ChIP-seq datasets for E2F1, MYB2L, and FOXM1 with the list of 24 mitotic genes. This analysis showed that all transcription factors collectively bind to the promoters of 12 mitotic genes (including CKAP2). Eleven genes were highly co-expressed with CKAP2 and upregulated in IDC (highlighted in red). Data with p < 0.05 were considered statistically significant. p-value (p).
Top 30 genes with highest Spearman’s correlation coefficient (r ≥ 0.5) predicted to co-express with CKAP2. Genes with an asterisk are also present in the list of 24 mitotically relevant genes. The q-value is derived from Benjamini–Hochberg FDR correction procedure.
| Gene Symbol | Cytoband | Spearman’s Correlation | |||
|---|---|---|---|---|---|
| 1 |
| 13q21.2 | 0.739389854 | 1.27 × 10−172 | 2.54 × 10−168 |
| 2 |
| 13q21.33 | 0.731464545 | 3.78 × 10−167 | 3.78 × 10−163 |
| 3 |
| 2q14.1 | 0.689512954 | 3.87 × 10−141 | 2.58 × 10−137 |
| 4 |
| 13q13.1 | 0.687867544 | 3.29 × 10−140 | 1.65 × 10−136 |
| 5 |
| 1q31.3 | 0.679105968 | 2.32 × 10−135 | 9.29 × 10−132 |
| 6 |
| 15q15.1 | 0.674458017 | 7.39 × 10−133 | 2.47 × 10−129 |
| 7 |
| 15q13.3 | 0.669955242 | 1.78 × 10−130 | 5.10 × 10−127 |
| 8 |
| 2q13 | 0.66822039 | 1.44 × 10−129 | 3.60 × 10−126 |
| 9 |
| 13q13.2 | 0.667922988 | 2.05 × 10−129 | 4.57 × 10−126 |
| 10 |
| 10q23.33 | 0.66342863 | 4.26 × 10−127 | 8.54 × 10−124 |
| 11 |
| 3q26.31 | 0.661963278 | 2.38 × 10−126 | 4.33 × 10−123 |
| 12 |
| 2q33.1 | 0.661022354 | 7.14 × 10−126 | 1.19 × 10−122 |
| 13 |
| 10q26.2 | 0.660220461 | 1.82 × 10−125 | 2.80 × 10−122 |
| 14 |
| 14q22.3 | 0.659368342 | 4.89 × 10−125 | 6.99 × 10−122 |
| 15 |
| 15q15.1 | 0.658250777 | 1.78 × 10−124 | 2.38 × 10−121 |
| 16 |
| 15q23 | 0.657140181 | 6.39 × 10−124 | 8.00 × 10−121 |
| 17 |
| 1q32.1 | 0.65321807 | 5.59 × 10−122 | 6.59 × 10−119 |
| 18 |
| 7p14.2 | 0.651426238 | 4.22 × 10−121 | 4.70 × 10−118 |
| 19 |
| 3q13.13 | 0.65101644 | 6.69 × 10−121 | 7.05 × 10−118 |
| 20 |
| 17q22 | 0.649728789 | 2.83 × 10−120 | 2.83 × 10−117 |
| 21 |
| 3p24.3 | 0.646315057 | 1.25 × 10−118 | 1.19 × 10−115 |
| 22 |
| 13q12.11 | 0.644899156 | 5.93 × 10−118 | 5.40 × 10−115 |
| 23 |
| 6q14.1 | 0.643244495 | 3.62 × 10−117 | 3.15 × 10−114 |
| 24 |
| 12q13.12 | 0.642394775 | 9.13 × 10−117 | 7.62 × 10−114 |
| 25 |
| 12q23.1 | 0.642202897 | 1.12 × 10−116 | 9.01 × 10−114 |
| 26 |
| 4q27 | 0.640527147 | 6.90 × 10−116 | 5.31 × 10−113 |
| 27 |
| 1p31.3 | 0.64047746 | 7.28 × 10−116 | 5.40 × 10−113 |
| 28 |
| 1q41 | 0.640325562 | 8.57 × 10−116 | 6.13 × 10−113 |
| 29 |
| 3p21.31 | 0.638416389 | 6.67 × 10−115 | 4.60 × 10−112 |
| 30 |
| 4q24 | 0.635192431 | 2.06 × 10−113 | 1.38 × 10−110 |
Figure 6The TIMER analysis results. (A) Relationships between CKAP2 expression and immune infiltration levels of B cell, CD8+ T cells, CD4+ T cells, dendritic cells, macrophages, and neutrophils in breast cancer according to TIMER2.0. (B) The correlation between somatic copy number alterations (SCAN) and abundance of immune infiltrates of CKAP2, including deep deletion, shallow deletion, diploid/normal, low-level gain, and high amplification.
Figure 7CKAP2 expression in MCF10A series and other invasive ductal carcinoma (IDC) cell lines (MCF7, BT474, SKBR3, MDA-MB-231, MDA-MB-468) by immunofluorescence. (A–C) Cells were stained with CKAP2 (green), centrin (red), and Hoechst (blue). The CKAP2 immunofluorescence intensity was measured in ≥60 cells per cell line. (A) Representative images of mitotic cells from MCF10A series (MCF10A, AT1, DCIS, and CA1). Superplots show the quantification of CKAP2 intensity. (B) Representative images of mitotic cells from other IDC cell lines (MCF7, BT474, SKBR3, MDA-MB-231, MDA-MB-468). Superplots show the quantification of CKAP2 intensity. (C) Representative images of interphasic cells from other IDC cell lines (MCF7, BT474, SKBR3, MDA-MB-231, MDA-MB-468). Superplots show the quantification of CKAP2 intensity. (D) Immunoblot showing CKAP2 protein levels in MCF10A and CA1 and other IDC cell lines (MCF7, BT474, SKBR3, MDA-MB-231, MDA-MB-468). Tubulin: loading control. LumA: luminal A, lumB: luminal B; HER2: human epidermal growth factor receptor 2 positive; TNBC: triple negative breast cancer.
Figure 8Effect of CKAP2 knockdown in breast cancer. (A) Immunoblot showing CKAP2 protein levels in parental SKBR3 cells and SKBR3 cells that stably express a scrambled vector (control) and three different shRNAs (sh1, sh2, and sh3). Tubulin and GAPDH: loading control. (B) Growth curve of CKAP2 cell lines (scrambled, sh1, sh2, and sh3). Data are shown as mean ± SEM. p < 0.05 were considered statistically significant. n = 3 biological replicates. (C) Cell migration (wound healing) assay in CKAP2 cell lines (scrambled, sh1, sh2, ad sh3) using IBIDI inserts. Representative images are shown from three independent experiments. Graphs show the percentage of wound closure ± SEM normalized relative to time 0. (D) 3D spheroid formation assay in CKAP2 cell lines (scrambled, sh1, sh2, ad sh3). Representative images are shown from three independent experiments. Graph shows the spheroid volume ± SEM normalized by control at day 2. All statistical tests of comparative data were performed using one-way (B,C) or two-way (D) ANOVA for differential comparison between more than two groups. Data with p < 0.05 were considered statistically significant. Scale bar: 500 µm.