| Literature DB >> 35547758 |
Cristiana Gaiteiro1,2,3,4,5, Janine Soares1,2,3,4,6, Marta Relvas-Santos1,2,3,4,7,8,9, Andreia Peixoto1,2,3,7,8, Dylan Ferreira1,2,3,4,5,7,8, Paula Paulo2,3,10, Andreia Brandão2,3,10, Elisabete Fernandes1,2,3,11, Rita Azevedo12, Carlos Palmeira1,2,3,11,13, Rui Freitas1,2,3,4,7,8, Andreia Miranda1,2,3,14, Hugo Osório7,14,15, Jesús Prieto5, Luís Lima1,2,3, André M N Silva9,16, Lúcio Lara Santos1,2,3,4,11,16,17, José Alexandre Ferreira1,2,3,4,16.
Abstract
Rationale: Bladder cancer (BC) management demands the introduction of novel molecular targets for precision medicine. Cell surface glycoprotein CD44 has been widely studied as a potential biomarker of BC aggressiveness and cancer stem cells. However, significant alternative splicing and multiple glycosylation generate a myriad of glycoproteoforms with potentially distinct functional roles. The lack of tools for precise molecular characterization has led to conflicting results, delaying clinical applications. Addressing these limitations, we have interrogated the transcriptome and glycoproteome of a large BC patient cohort for splicing signatures.Entities:
Keywords: CD44; bladder cancer; glycomics; glycoproteogenomics; proteogenomics
Mesh:
Substances:
Year: 2022 PMID: 35547758 PMCID: PMC9065180 DOI: 10.7150/thno.67409
Source DB: PubMed Journal: Theranostics ISSN: 1838-7640 Impact factor: 11.600
Figure 1Expression of CD44 and cancer relevant exon-exon junctions in the healthy urothelium and bladder tumors. CD44 gene expression significantly decreased in bladder tumors in comparison to the healthy urothelium. Furthermore, the nature of the variants changed with the severity of disease, with non-muscle invasive bladder cancer (NMIBC), showing higher expression of e5-v2 and e5-v3 generally found in lengthier isoforms (CD44v2-10; CD44v3-10) and a significantly lower abundance of e5-e15 characteristic of CD44s/st in comparison to muscle-invasive bladder cancer (MIBC). MIBC presented an opposite pattern, supporting enrichment for shorter CD44s isoform. The CD44sol isoform was also evaluated but its expression was vestigial and not linked to any tumor type, as such it was not represented in the panel A. B) TCGA analysis of 413 MIBC cases confirmed the association of CD44s with more aggressive late-stage disease. Briefly, CD44s mRNA was elevated in T3/4 tumors in comparison to T2 tumors, supporting its association with invasion, confirming the observations from our patient's dataset. C) Elevated CD44v10 and CD44s mRNA significantly associate with worst prognosis in MIBC. The Kaplan-Meier curves highlight a clear link between the expression of shorter CD44 isoforms and worst prognosis in bladder cancer. Also, despite its low expression, patients with tumors presenting high CD44sol mRNA presented better prognosis. D) CD44s is expressed by a subgroup of basal-like tumors enriched for genes defining mesenchymal traits. In general, CD44sol associated with more differentiated and less aggressive luminal tumors, whereas the other forms of CD44 related with basal phenotypes, frequently less cohesive and poorly differentiated lesions. However, CD44s and CD44v10 were characteristic of a subgroup of basal tumors enriched for ZEB1/2, CDH2, TWIST1, SNAI1 that define mesenchymal phenotypes. Collectively, these observations link shorter CD44 isoforms to bladder cancer invasion and poor prognosis. E) Total CD44 and isoforms expressions in relevant healthy cells and organs. The CD44 gene presents a heterogeneous expression pattern in healthy tissues, and multiple isoforms may coexist in the same organ. Notably, CD44s was expressed by most of the studied tissues, showing significantly high mRNA levels in white blood cells. The gastrointestinal and colorectal tracts as well as the testicle also present high CD44s, however, with low total CD44 levels. Collectively, this result shows that CD44s is not a cancer specific signature. The results correspond to the mean and standard deviation for three independent experiments. Triplicate measurements were conducted for each experiment. P values are presented for one-way ANOVA, nonparametric Wilcoxon, and Kruskal-Wallis tests.
Figure 2CD44-Tn and STn glycoproteoforms present high cancer specificity. A) Tn and STn antigens are not expressed in the healthy urothelium, are elevated in cancer, and increased in invasive tumors. The STn antigen, defined by affinity for the anti-tag-72 antibody [B72.3+CC49], is not expressed in the healthy urothelium and is elevated in cancer, being significantly overexpressed in muscle invasive bladder cancer (MIBC). The Tn antigen, based on VVA lectin immunoaffinity, is also not expressed in the healthy urothelium and is elevated in a subset of patients, specially at more advanced stages. B) CD44s colocalizes with Tn and STn antigens in MIBC. CD44 was diffusively expressed across the tumor section without a defined pattern. The STn antigen was observed both in superficial and invasive layers and the Tn antigen was found in scattered niches without a defined expression pattern. Immunohistochemistry also showed the co-localization of CD44 with Tn and STn positive areas in CD44shigh tumors, exhibiting low amounts of other isoforms (according to RT-PCR, data not shown). The healthy urothelium expressed high amounts of CD44 and the Tn antigen is present in the cytoplasm of upper stratum umbrella cells while the STn antigen was not detected. C) In situ proximity ligation assays (PLA) supports CD44s-STn glycoproteoforms in MIBC and its presence within tumor invasive fronts. In situ proximity ligation assay showed close spatial proximity between CD44 and STn in the same cells, strongly supporting CD44-STn glycoproteoforms in tumors. This phenotype was mostly observed in invasive fronts of CD44shigh tumors and was not detected in the healthy urothelium, suggesting cancer-specificity. D) Double staining immunofluorescence supports CD44s-Tn glycoproteoforms in bladder cancer. CD44shigh tumors presented niches of cells co-expressing CD44 and Tn antigen, strongly suggesting CD44-Tn glycoproteoforms. This was not observed in the healthy urothelium. E) Glycosylation with Tn and STn antigens provides cancer specificity to CD44. CD44 was abundantly expressed in all studied healthy tissues. STn expression was either absent or low in secretions and cells facing the lumen of the respiratory, gastrointestinal, and colorectal tracts. Immunohistochemistry suggested STn and CD44 co-localization in the stomach, appendix, small intestine, colon, gallbladder, and white blood cells in MALT. PLA did not confirm these hypotheses, suggesting cancer specificity of CD44-STn glycoproteoforms. In healthy tissues, the Tn was restricted to the cytoplasm of goblet cells in the intestinal tract, Leydig cells in testicular tissue, pancreatic acini, hepatocytes, mucinous cells of the gastric epithelium, alveolar macrophages, and gallbladder epithelium. CD44 and Tn antigen co-expression was suggested in pancreatic tissue, testicle, and gallbladder, which was not confirmed by double staining immunofluorescence.
Figure 3Highly invasive T24 bladder cancer cells express high levels of CD44s. A) Capacity to invade Matrigel in vitro for RT4, 5637 and T24 cells. Grades I/II cell lines (RT4 and 5637) are significantly less capable of invading Matrigel in vitro. B) CD44 expression is higher for grade II (5637) and III (T24) cells. Flow cytometry analysis showed significantly higher CD44 levels in 5637 and T24 compared to RT4 cells. C) Western blots show different CD44 expression patterns according to cell grade, with T24 cells showing mostly shorter proteoforms. WB confirmed the overexpression of shorter CD44 proteoforms (at approximately 75 and 50 kDa) in T24 cells, and the presence of heavier proteoforms in the other cell lines (above 150 kDa). D) Characterization of CD44 isoforms by RT-PCR showing isoforms shortening with cell lines aggressiveness and the marked CD44shigh phenotype of T24 cells. CD44 gene expression was significantly higher in T24 in comparison to RT4 and 5637 cells, in agreement with protein analysis. RT-PCR also revealed increasing mRNA shortening with cells aggressiveness. Accordingly, RT4 showed higher e5-v2 and e5-v3 transcripts in comparison to the other cell lines, 5637 predominantly presented e5-v8, whereas T24 cells mainly expressed e5-e15 characteristic of CD44s/st. E) RNAseq confirmed the marked difference between T24 and 5637 cells, the first expressing shorter CD44 mRNAs in opposition to lengthier mRNAs in 5637 cells. The top image is a DESeq2's plot for visualization of alternative splicing events. The plot shows the coverage of constitutive and variable exons for 5637 (in red) and T24 (in blue) cells. The bottom part of the image shows the gene structure and depicts differentially expressed exons (purple lines linked to the x-axis of the coverage plot). The main differences are found in the variable region spanning v2-v10, which was mostly missing in T24 cells. Predicted transcripts are shown in the bottom. Asterisks highlight those showing transcripts higher probability of generating proteins and, therefore, used for glycoprotegenomics. Legend for panel E with Uniprot and Ensemble codes: 1- P16070-12 (CD44-201); 2- E9PKC6 (CD44-222); 3- P16070-4 (CD44-206); 4- CD44-227 (no Uniprot code); 5- P16070-1 (CD44-208); 6- H0YD13 (CD44-224); 7- Q86UZ1 (CD44-207); 8- P16070-10 (CD44-209); 9- P16070-19 (CD44-203); 10- P16070-11 (CD44-210); 11- P16070-18 (CD44-205); 12- H0Y5E4 (CD44-211); 13- H0YDW7 (CD44-221); 14- H0Y2P0 (CD44-204); 15- CD44-232 (no Uniprot code); 16- H0YD17 (CD44-228); 17-H0YD90 (CD44-234); 18- CD44-213 (no Uniprot code); 19- H0YEU1 (CD44-231); 20- H0YEV3 (CD44-219); 21- H0YES0 (CD44-212). The results for A-D correspond to the mean and standard deviation for three independent experiments. Triplicate measurements were conducted for each experiment. P values are presented for one-way ANOVA tests.
Figure 4Glycoproteogenomics, building on RNAseq-customized databases and glycomics for protein annotation, enables CD44 proteoforms identification in bladder cancer cell lines and tumors. A) Glycomics characterization of 5637 and T24 cells showed fucosylated and sialylated T antigens as main glycospecies. MALDI-MS analysis of permethylated benzyl-GalNAc glycans revealed [M+Na]+ main ions for fucosylated and sialylated T antigens in both cell lines. However, 5637 predominantly expressed mono-sialylated T antigens whereas di-sialylated structures were more abundant in T24 cells. Moreover, 5637 cells presented higher abundance and diversity of glycoforms extended beyond core 2 with different degrees of sialylation and fucosylation. B) SDS-PAGE gels and western blots for CD44 IPs highlighting the nature of the isoforms identified by nano-LC-M/MS in glycoproteogenomics settings. Briefly, CD44 immunoprecipitated from membrane extracts was separated by gradient SDS-PAGE and bands were excised and analyzed by nanoLC-HCD/CID-MS/MS using RNAseq-customized databases and glycomics data for isoforms annotation. The nature of the isoforms translated by reporter ions for constitute-variable junctions identified in each band and cell line was highlighted. CD44 heterogeneity was evident for the two cell lines. CD44s was found to be the main isoform in T24 cells. C) nano-LC-MS and HCD-MS/MS spectra for reporter ions v2-v3, e5-v3, e5-e15 (CD44s/st), and e2(partial)-e3(partial) CD44sol in T24 cells. For reporter ions we show the MS isotopic envelope, the corresponding HCD product ion spectra highlighting GalNAc oxonium ions, GalNAc cross-ring fragments, other glycan fragments, and y- and b-series peptide backbone ions that support protein annotation. The predicted glycopeptide sequence, including the nature of the glycans and glycosites annotation (whenever possible) was also presented. D) Glycosites identified for v2-v3, e5-v3 and e5-e15 reporter glycopeptides and corresponding glycans for 5637 and T24 cells. A higher number of glycosites were identified at the e5-e15 junction for T24 in comparison to 5637 cells. On the other hand, more glycosites were identified at the v2-v3 and e5-v3 junctions in 5637 cells. E) nano-LC-MS and HCD-MS/MS spectra for CD44 immunoprecipitated from CD44shigh tumors and areas of CD44-STn co-expression. SDS-PAGE and western blots showed a pattern similar to T24 cells, characterized by bands bellow 75 kDa. An HCD product ion spectrum for an CD44s-Tn specific glycopeptide sequence is presented. Identified and possible glycosites are identified in grey. The symbol * corresponds to amino acid modifications: C - carbamidomethyl and T - 2-amino-3-ketobutyric acid.
Peptides corresponding to CD44 exon and exon-exon junctions identified by SDS-PAGE-nanoLC-MS/MS.
| Mw (kDa) | 5637 | T24 | ||
|---|---|---|---|---|
| Exons and exon-exon junctions1 | Assignment2 | Exon and exon-exon junctions1 | Assignment2 | |
| >200-250 | e1-e2; e2-e3; e3-e4; e4-e5; v2-v3, v3-v4; v4-v5; v6 (partial); v10-e15; e19; CD44sol3 | CD44v2-10 (46%) | No ID | - |
| 200-150 | No ID | - | No ID | - |
| 150-100 | e1-e2; e2-e3; e3-e4; e4-e5; v2-v3, v3-v4; v4-v5; v6 (partial); v10-e15; e19; e5-e15 | CD44v2-10 (49%) | CD44sol3 | CD44sol (96%) |
| 100-75 | e1-e2; e2-e3; e3-e4; e4-e5; v2-v3; v3-v4; v6; v10-e15; e5-e15; e19, CD44sol3 | CD44v2-10 (43%) | e1-e2; e2-e3; e3-e4; e4-e5; v2-v3, v3-v4; v4-v5; v7 (partial); v10-e15; e19; e5-e15, CD44sol3 | CD44v2-10 (48%) |
| 75-50 | No ID | - | e1-e2; e2-e3; e3-e4; e5-v3; v2-v3, v3-v4, v4-v5; v10-e15, e19, e5-e15; CD44sol3 | CD44v2-10 (47%) |
| 50-37 | No ID | - | e1-e2, e2-e3, e3-e4, e4-e5, e5-e15 | CD44s (59%) |
| 37-25 | e1-e2; e2-e3; e3-e4; e4-e5; v2-v3; e5-v3; v3-v4; v10-e15, e19; e5-e15, CD44sol | CD44v2-10 (49%) | e1-e2; e2-e3; e3-e4; e4-e5; v2-v3, v3-v4; v4-v5; v6 (partial); v7 (partial); v10-e15; e19; e5-e15 | CD44v2-10 (49%) |
| 25-20 | No ID | - | No ID | - |
| 20-15 | No ID | - | No ID | - |
| 15-10 | No ID | - | No ID | - |
1Given the high coverage and significantly different structure, we use the general designation to highlight CD44sol;
2Protein coverages considers the complete variant deduced from e5-variable region junctions, without considering exon-exon jumps;
3CD44sol was used as a general designation for the following exon-exon junctions: v2(partial)-v3(partial), v3(partial)-v4(partial), v4(partial)-v5(partial), v5(partial)-v6(partial), v6(partial)-v10(partial), v10(partial)-e16(partial), whereas partial stands for incomplete exon sequences.
Figure 5CD44 silencing inhibits proliferation, invasion, and activation of relevant proteins in MAPK/ERK, and PI3K/AKT pathways in T24 CD44s and B) CD44 siRNA promotes a massive decrease in CD44 gene expression and complete abrogation of CD44 protein levels in 5637 and T24 cell lines. According to A, CD44 gene expression after siRNA is decreased by approximately 80% and 96% in 5637 and T24 cells, respectively, in comparison to controls, which translated into significantly decreased protein expression (exemplified in Figure B for T24 cells). C) CD44 is a promoter of proliferation in 5637 and T24 cells. Proliferation was reduced by more than 50% after CD44 siRNA silencing in both cell lines compared to controls. D) CD44 silencing decreases the invasive capacity of CD44shigh T24 cells and promoted invasion in 5637 cells. The number of T24 invading cells decreased approximately 1.7-fold after CD44 knock-down comparing to controls. On the other hand, the number of cells invading matrigel in vitro was 1.7-fold higher for 5637 cells under the same stimuli. E) Analysis of the phosphorylation status of 49 RTK and F) Effect of CD44 silencing in phosphorylation of 37 proteins activated by different kinases. CD44s seems to be a modulator and a co-receptor of different RTKs, including EGFR, Insulin, and IGF-1 receptors, as well as transcription factors such CREB, involved in tumor proliferation and invasion. The results correspond to mean and standard deviation for three independent experiments. Triplicate measurements were conducted for each experiment. P values are presented for Unpaired T tests.
Figure 6Glycoengineered T24 cells express CD44s-Tn/STn glycoproteoforms in mimicry of human tumors. A) Immunofluorescence studies on glycoengineered T24 bladder cancer cells demonstrated the overexpression of immature short-chain O-glycans together with CD44 at the cell surface of the cells. Immunoassays show high levels of sialylated T antigens and no Tn and STn antigens in wild type cells. T24 C1GALT1 KO lacked extended glycosylation translated by the presence of sialylated T antigens and presented high Tn levels. ST6GALNAC1 KI induced an overexpression of STn. These glycans were co-localized with CD44 at the cells surface. B) CD44 gene expression and the nature of the variants do not change with alterations in glycosylation induced by C1GALT1 KO and ST6GALNAC1 KI, except for CD44s, which increased in Tn-expressing cells. C) Western blots show a different CD44 expression pattern for T24 glycoengineered cells in comparison to controls. The main molecular band of CD44 at 75kDa shows a slight decrease, possibly due to glycans complexity reduction. A new band at 37kDa emerged, suggesting that glycans truncation may impact on CD44 molecular weight. D) MS and HCD-MS/MS for a CD44s specific glycopeptide (correspoponding to the e5-e15 junction) from T24 C1GALT1 KO cells evidencing short-chain O-glycosylation. The product ion spectrum shows a typical GalNAc fragmentation pattern, characterized by the presence of an evident ion at m/z 144.066 and a high 144.066/138.055 oxonium ions ratio, as well as the presence of the others HexNAc oxonium ions (m/z 126.055,168.066, 186.076 and 204.087). The spectrum also shows major fragment glycopeptides with GalNAc losses. The results correspond to the mean and standard deviation for three independent experiments. Triplicate measurements were conducted for each experiment. P values are presented for two-way ANOVA and Unpaired T tests.
Figure 7CD44 glycosylation in CD44s-enriched cells shapes its functional contribution to proliferation and invasion. A) O-glycans shortening by C1GALT1 KO and C1GALT1 KO/ST6GALNAC1 KI does not change the proliferation of T24 cells. T24 cells overexpressing Tn or STn antigens maintain proliferation rates compared to controls. B) Increased STn antigen at the cell surface, driven by C1GALT1 KO and ST6GALNAC1 KI, enhances the invasive capacity of T24 cells. T24 cells overexpressing the Tn antigen do not change their capacity to invade Matrigel in comparison to the control in vitro. On the other hand, STn overexpression leads to a significant increase in invasion. C) CD44 silencing promotes a notable decrease in CD44 gene expression in glycoengineered T24 cells expressing Tn and STn. CD44 mRNA expression after siRNA is significantly decreased by approximately 95% and 91% in C1GALT1 KO and C1GALT1 KO/ST6GALNAC1 KI T24 cells, respectively, in comparison to controls. D) CD44-Tn and STn glycoproteoforms do not play a role in T24 cells proliferation. After CD44s inhibition, C1GALT1 KO and C1GALT1 KO/ST6GALNAC1 KI T24 cells maintained their proliferation in relation to controls, showing that CD44-Tn/STn glycoproteoforms do not influence cell proliferation. E) CD44-STn but not Tn glycoproteoforms drive invasion. After CD44s silencing, C1GALT1 KO T24 cells maintained their capacity to invade Matrigel in vitro, whereas C1GALT1 KO/ST6GALNAC1 KI cells showed decreased invasion. Similar effects were observed in wild-type cells but not in Tn-expressing cells, suggesting that sialylation may have a role in tumor invasion. F) Influence of STn-overexpression on the response of tyrosine kinases receptors to CD44 silencing. We show an activation of RTK proteins such EGFR and Insulin receptor in the absence of CD44 for C1GALT1 KO/ST6GALNAC1 KI cells, opposite to the effect observed in wild-type cells. G) CD44-STn glycoproteoforms seem to promote the activation of relevant transcription factors involved in cell proliferation and invasion. In the absence of CD44, a significant downregulation of STAT3 S727 and STAT2 phosphorylation was observed in C1GALT1 KO/ST6GALNAC1 KI cells. Results correspond to the mean and standard deviation for three independent experiments. Triplicate measurements were conducted for each experiment. P values are presented for Unpaired T tests.