Literature DB >> 29983670

CD44 glycoprotein in cancer: a molecular conundrum hampering clinical applications.

Rita Azevedo1,2, Cristiana Gaiteiro1,2,3, Andreia Peixoto1,2,4, Marta Relvas-Santos1, Luís Lima1,4, Lúcio Lara Santos1,2,5,6, José Alexandre Ferreira1,2,4,5,7,8.   

Abstract

CD44 is a heavily glycosylated membrane receptor playing a key role in cell adhesion, signal transduction and cytoskeleton remodelling. It is also one of the most studied glycoproteins in cancer, frequently explored for stem cell identification, and associated with chemoresistance and metastasis. However, CD44 is a general designation for a large family of splicing variants exhibiting different degrees of glycosylation and, potentially, functionally distinct roles. Moreover, structural diversity associated with ambiguous nomenclature has delayed clinical developments. Herein, we attempt to comprehensively address these aspects and systematize CD44 nomenclature, setting milestones for biomarker discovery. In addition, we support that CD44 may be an important source of cancer neoantigens, most likely resulting from altered splicing and/or glycosylation. The discovery of potentially targetable CD44 (glyco)isoforms will require the combination of glycomics with proteogenomics approaches, exploring customized protein sequence databases generated using genomics and transcriptomics. Nevertheless, the necessary high-throughput analytical and bioinformatics tools are now available to address CD44 role in health and disease.

Entities:  

Keywords:  CD44; CD44 isoforms; Cancer biomarkers; Glycosylation; Nomenclature

Year:  2018        PMID: 29983670      PMCID: PMC6020424          DOI: 10.1186/s12014-018-9198-9

Source DB:  PubMed          Journal:  Clin Proteomics        ISSN: 1542-6416            Impact factor:   3.988


The transmembrane glycoprotein receptor CD44 plays a key role in cell adhesion to the extracellular matrix and interacts with growth factors and several extracellular ligands, including hyaluronic acid, collagen, osteopontin and many metalloproteinases to drive signal transduction and cytoskeleton rearrangements [1]. By interacting with co-factors and adaptor proteins, CD44 has been further implicated in lymphocyte homing, haematopoiesis, cell migration and adhesion, tumour invasion and metastasis [1]. Several isoforms of CD44 can be generated through the insertion of alternative exons at the variable region in a process regulated at both tissue and cellular levels. While ubiquitously expressed in healthy adult and foetal tissues, the molecular plasticity of alternatively spliced CD44 accounts for diversified functional roles. However, the intricate correlation between CD44 isoforms and underlying biological functions is yet to be fully disclosed. It has been long described that malignant transformation and progression are accompanied by a deregulation of CD44 splicing mechanisms, comprehensively addressed in recent reviews [2, 3]. The events leading to CD44 isoform molecular remodelling have direct implications in several cancer hallmarks and appear to vary according to the type of lesion, supporting the existence of disease-specific molecular fingerprints and potentially targetable biomarkers [4]. Not surprisingly, CD44 has been a hot topic in cancer research, frequently associated with more aggressive phenotypes and widely explored for cancer stem-cell identification [5]. Particular focus has been set on narrowing CD44 screening to its cancer-associated isoforms envisaging the necessary sensitivity and specificity for clinical applications. However, the lack of protocols for its full isoform discrimination at the protein level, and the existence of many functionally distinct isoforms poses a major drawback. Currently, these hurdles can only be partially circumvented by targeting CD44 transcripts using variant-specific probes or by emerging RNAseq approaches. Analytical difficulties are aggravated by vast post-translational modifications, with emphasis on the very high glycosylation density of variable regions. Moreover, glycans often present a non-templated and context-dependent nature, with several glycoforms coexisting for the same protein on a given biological milieu, further increasing CD44 molecular and functional diversity (Table 1). This poses a major challenge for identification by conventional immunoassays as well as high-throughput proteomics, which has delayed the definition of CD44 isoforms in health and disease.
Table 1

Proposed CD44 nomenclature for experimentally observed isoforms, its correspondence with UniProt and NCBI databases and predicted N- and O-glycosylation sites

Proposed nomenclatureUniProtNCBIPredicted glycosylation sites
N-glycosylationaO-glycosylationbTotal
CD44v2-10Isoform 1Isoform 18146154
CD44v3-10Isoform 4Isoform 28133141
CD44v8-10Isoform 10Isoform 377986
CD44v10Isoform 11Isoform 675663
CD44sIsoform 12Isoform 463238
CD44stIsoform 15Isoform 863238
CD44s-exon 15Isoform 18Isoform 762329
CD44solubleIsoform 19Isoform 5235

aN-glycosylation sites predicted using NetNGlyc server 1.0 (http://www.cbs.dtu.dk/services/NetNGlyc/)

bO-glycosylation sites predicted using NetOGlyc server 4.0 (http://www.cbs.dtu.dk/services/NetOGlyc/)

Proposed CD44 nomenclature for experimentally observed isoforms, its correspondence with UniProt and NCBI databases and predicted N- and O-glycosylation sites aN-glycosylation sites predicted using NetNGlyc server 1.0 (http://www.cbs.dtu.dk/services/NetNGlyc/) bO-glycosylation sites predicted using NetOGlyc server 4.0 (http://www.cbs.dtu.dk/services/NetOGlyc/) The absence of nomenclature standardization also emerges as a key issue, making inter-study comparisons and clinical translation almost impossible. Showcasing some examples, CD44H and CD44E terminologies arise from the first observations in hematopoietic and epithelial cells; gp116 and gp85 distinguish glycoforms by molecular weights (116 and 85 kDa, respectively); CD44 hematopoietic cell E-/L-selectin ligand (HCELL) refers to CD44 isoforms expressed in hematopoietic and cancer cells showing elevated sialofucosylated glycans content and high affinity for E-/L-selectin ligands [6, 7]. All these designations fail to provide clear insights on the molecular nature of the isoforms. Notwithstanding, some studies adopt a nomenclature based on commercial names of the used monoclonal antibodies, highlighting the targeted variable exon, being CD44v3, CD44v6, CD44v9 amongst the most associated with cancer, including chemoresistance and prognosis [8, 9]. Moreover, these studies often disregard that the analysis of a specific variable exon can result in the detection of all isoforms containing it instead of one particular protein [10]. In addition, most variable regions may be significantly hindered by dense glycosylation, biasing detection. On the other hand, reports exploring the transcriptome have emerged as a powerful tool for determining CD44 isoforms diversity; however, few have provided the necessary validation at the protein level due to above mentioned analytical difficulties. Another important source of nomenclature ambiguity results from the direct translation of results from Mus musculus studies to Homo sapiens disregarding that mice CD44 gene presents an extra variable exon (v1) not present in humans. Further contributing to ambivalence, UniProt and NCBI protein databases adopt different designations for the same isoform (illustrated in Table 1). Altogether these aspects impact negatively on our understanding of the biological and clinical relevance of CD44 isoforms in cancer and other diseases, urging nomenclature standardization. Facing these challenges, we have conducted a comprehensive in silico analysis of CD44 isoforms through NCBI and UniProt databases, using BLAST and TMHMM Server v2.0 tools. The human CD44 gene is located on the short arm of chromosome 11 [GRCh38.p7, NC_000011.10 (35138870-35232402)] and its precursor mRNA consists of 19 exons. Namely, exons 1–16 encode the extracellular domain, exon 17 encodes the highly conserved transmembrane domain, and exons 18 and 19 encode the cytoplasmic domain. In silico analysis in NCBI has predicted over twenty-one possible mRNA transcripts derived from the alternative splicing of exons 6–14 and 18, eleven of which were also found in UniProt. However, only eight have been experimentally confirmed (detailed in Table 1 and Fig. 1), specifically six isoforms with variable extracellular domain extensions, one isoform with a truncated cytoplasmic tail, and one isoform truncated at the extracellular domain. Here we attempt to standardize the nomenclature for the above described isoforms through a logical terminology. Following pre-existing designations, we propose that human CD44 isoforms originated by alternative splicing of the variable region should highlight the included exons. Conversely, isoforms lacking the variable region should present a nominal designation. Figure 1 constitutes a schematic representation of the experimentally determined human CD44 isoforms, and the proposed nomenclature is described below:
Fig. 1

Schematic representation of experimentally confirmed human CD44 pre-mRNA and respective isoforms. Blue filled boxes represent constant region exons, while white filled boxes represent exons of the variable region present in the designated CD44 isoform. Dark blue filled boxes with reduced box size represent truncated exons from the constant region. The blue line represents missing exon(s). Exon 18, filled black, contains an early 3’UTR and only makes part of CD44st isoform

Schematic representation of experimentally confirmed human CD44 pre-mRNA and respective isoforms. Blue filled boxes represent constant region exons, while white filled boxes represent exons of the variable region present in the designated CD44 isoform. Dark blue filled boxes with reduced box size represent truncated exons from the constant region. The blue line represents missing exon(s). Exon 18, filled black, contains an early 3’UTR and only makes part of CD44st isoform CD44v2-10 The canonical CD44 isoform includes a peptide sequence encoded by exons 6–14, while splicing out exon 18. This isoform has a predicted molecular weight of 82 kDa but presents extensive glycosylation, thereby arising to approximately 250 kDa or higher. CD44v3-10 Also known as epican, this isoform results from the retention of exons 7–14 (v3–v10) and splicing out of exon 18. Its unmodified form has 77 kDa, resulting in an up to 200 kDa glycoprotein after post-translational modifications. CD44v8-10 Also known as CD44E or CD44R1, this isoform is originated through retention of exons 12–14 (v8–v10) and splicing out of exon 18. It has 53 kDa in its unaltered form, while reaching 130 kDa in its glycosylated form. CD44v10 Also known as gp116 or CD44R2, this isoform retains the variable exon 14 (v10), while splicing out exon 18. The unglycosylated form has approximately 47 kDa whereas the glycosylated forms have been observed at 120 kDa. CD44s Also known as CD44H or gp85, the standard form of CD44 splices out all variable exons and exon 18. Originally with 39 kDa, the subsequent post-translational addition of N-linked and O-linked oligosaccharides gives rise to a 85–90 kDa glycoprotein. CD44st Also known as short-tail or tail-less, is a 32 kDa isoform splicing out the variable region and exon 19, while retaining exon 18. Importantly, exon 18 contains a stop codon that originates a truncated cytoplasmic tail, consequently leading to the loss of intracellular protein domains and signalling motifs necessary for interaction with cytoskeletal components. CD44s-exon15 A CD44s homolog of 37 kDa lacking the peptide sequence encoded by exon 15. CD44sol This CD44 soluble isoform only retains exons 1–4, while presenting truncated forms of the exons 3 and 4. The modification of the two later exons leads to a smaller extracellular domain as well as to the loss of transmembrane and cytoplasmic domains. This 16 kDa isoform is often shed to bodily fluids through matrix metalloprotease activity. In summary, the wide array of structurally similar CD44 isoforms, associated with dense glycosylation and inadvertent lack of nomenclature consensus has posed a significant challenge for inferring on CD44 role in cancer. These aspects have biased many previous conclusions, provided several conflicting data, and significantly delayed clinical development. Most studies disregard CD44 glycosylated domains, which sometimes more than double the molecular weight of the isoforms, decisively modulating biophysical, biochemical and functional properties of the receptor (predicted glycosylation sites detailed in Table 1). Glycosylation also raises a tremendous challenge for CD44 mapping based on conventional proteomics, urging the introduction of glycan-targeted approaches. As such, more comprehensive strategies will certainly require the integration of glycomics/glycoproteomics with emerging proteogenomics, exploring customized protein sequence databases generated using genomics and transcriptomics. This approach is also expected to pinpoint relevant cancer neoantigens for driving targeted therapeutics and immunotherapy development. Nevertheless, we augment that the necessary technologies are now available for addressing CD44 molecular and functional diversity in health and disease, ultimately providing targetable biomarkers for oncology.
  10 in total

Review 1.  CD44: from adhesion molecules to signalling regulators.

Authors:  Helmut Ponta; Larry Sherman; Peter A Herrlich
Journal:  Nat Rev Mol Cell Biol       Date:  2003-01       Impact factor: 94.444

Review 2.  Regulation of alternative splicing of CD44 in cancer.

Authors:  Lubomir Prochazka; Radek Tesarik; Jaroslav Turanek
Journal:  Cell Signal       Date:  2014-07-13       Impact factor: 4.315

3.  A requirement for the CD44 cytoplasmic domain for hyaluronan binding, pericellular matrix assembly, and receptor-mediated endocytosis in COS-7 cells.

Authors:  Hong Jiang; Richard S Peterson; Weihua Wang; Eckart Bartnik; Cheryl B Knudson; Warren Knudson
Journal:  J Biol Chem       Date:  2002-01-15       Impact factor: 5.157

Review 4.  CD44: More than a mere stem cell marker.

Authors:  I Morath; T N Hartmann; V Orian-Rousseau
Journal:  Int J Biochem Cell Biol       Date:  2016-09-15       Impact factor: 5.085

Review 5.  CD44 and HCELL: preventing hematogenous metastasis at step 1.

Authors:  Pieter P Jacobs; Robert Sackstein
Journal:  FEBS Lett       Date:  2011-08-05       Impact factor: 4.124

Review 6.  Redox regulation in stem-like cancer cells by CD44 variant isoforms.

Authors:  O Nagano; S Okazaki; H Saya
Journal:  Oncogene       Date:  2013-01-21       Impact factor: 9.867

7.  CD44 isoforms are heterogeneously expressed in breast cancer and correlate with tumor subtypes and cancer stem cell markers.

Authors:  Eleonor Olsson; Gabriella Honeth; Pär-Ola Bendahl; Lao H Saal; Sofia Gruvberger-Saal; Markus Ringnér; Johan Vallon-Christersson; Göran Jönsson; Karolina Holm; Kristina Lövgren; Mårten Fernö; Dorthe Grabau; Ake Borg; Cecilia Hegardt
Journal:  BMC Cancer       Date:  2011-09-29       Impact factor: 4.430

8.  CD44 standard and CD44v10 isoform expression on leukemia cells distinctly influences niche embedding of hematopoietic stem cells.

Authors:  Ulrike Erb; Amelie Pajip Megaptche; Xiaoyu Gu; Markus W Büchler; Margot Zöller
Journal:  J Hematol Oncol       Date:  2014-03-31       Impact factor: 17.388

Review 9.  The Importance of CD44 as a Stem Cell Biomarker and Therapeutic Target in Cancer.

Authors:  Ranjeeta Thapa; George D Wilson
Journal:  Stem Cells Int       Date:  2016-04-21       Impact factor: 5.443

Review 10.  CD44: A Multifunctional Cell Surface Adhesion Receptor Is a Regulator of Progression and Metastasis of Cancer Cells.

Authors:  Linda T Senbanjo; Meenakshi A Chellaiah
Journal:  Front Cell Dev Biol       Date:  2017-03-07
  10 in total
  15 in total

Review 1.  Targeting cancer stem cell pathways for cancer therapy.

Authors:  Liqun Yang; Pengfei Shi; Gaichao Zhao; Jie Xu; Wen Peng; Jiayi Zhang; Guanghui Zhang; Xiaowen Wang; Zhen Dong; Fei Chen; Hongjuan Cui
Journal:  Signal Transduct Target Ther       Date:  2020-02-07

2.  Expression of CD44 Isoforms in Tumor Samples and Cell Lines of Human Colorectal Cancer.

Authors:  V O Novosad; I S Polikanova; E A Tonevitsky; D V Maltseva
Journal:  Bull Exp Biol Med       Date:  2022-05-27       Impact factor: 0.804

Review 3.  Role of CD44 isoforms in epithelial-mesenchymal plasticity and metastasis.

Authors:  Mark Primeaux; Saiprasad Gowrikumar; Punita Dhawan
Journal:  Clin Exp Metastasis       Date:  2022-01-12       Impact factor: 5.150

4.  Extracellular Domains I and II of cell-surface glycoprotein CD44 mediate its trans-homophilic dimerization and tumor cluster aggregation.

Authors:  Madoka Kawaguchi; Nurmaa Dashzeveg; Yue Cao; Yuzhi Jia; Xia Liu; Yang Shen; Huiping Liu
Journal:  J Biol Chem       Date:  2020-01-22       Impact factor: 5.157

5.  Target Score-A Proteomics Data Selection Tool Applied to Esophageal Cancer Identifies GLUT1-Sialyl Tn Glycoforms as Biomarkers of Cancer Aggressiveness.

Authors:  Sofia Cotton; Dylan Ferreira; Janine Soares; Andreia Peixoto; Marta Relvas-Santos; Rita Azevedo; Paulina Piairo; Lorena Diéguez; Carlos Palmeira; Luís Lima; André M N Silva; Lúcio Lara Santos; José Alexandre Ferreira
Journal:  Int J Mol Sci       Date:  2021-02-07       Impact factor: 5.923

Review 6.  Targeting the "Sweet Side" of Tumor with Glycan-Binding Molecules Conjugated-Nanoparticles: Implications in Cancer Therapy and Diagnosis.

Authors:  Nora Bloise; Mohammad Okkeh; Elisa Restivo; Cristina Della Pina; Livia Visai
Journal:  Nanomaterials (Basel)       Date:  2021-01-22       Impact factor: 5.076

7.  Estradiol-mediated inhibition of Sp1 decreases miR-3194-5p expression to enhance CD44 expression during lung cancer progression.

Authors:  Ming-Jer Young; Yung-Ching Chen; Shao-An Wang; Hui-Ping Chang; Wen-Bin Yang; Chia-Chi Lee; Chia-Yu Liu; Yau-Lin Tseng; Yi-Ching Wang; H Sunny Sun; Wen-Chang Chang; Jan-Jong Hung
Journal:  J Biomed Sci       Date:  2022-01-17       Impact factor: 8.410

8.  Glycoproteomics identifies HOMER3 as a potentially targetable biomarker triggered by hypoxia and glucose deprivation in bladder cancer.

Authors:  Andreia Peixoto; Dylan Ferreira; Rita Azevedo; Rui Freitas; Elisabete Fernandes; Marta Relvas-Santos; Cristiana Gaiteiro; Janine Soares; Sofia Cotton; Beatriz Teixeira; Paula Paulo; Luís Lima; Carlos Palmeira; Gabriela Martins; Maria José Oliveira; André M N Silva; Lúcio Lara Santos; José Alexandre Ferreira
Journal:  J Exp Clin Cancer Res       Date:  2021-06-09

9.  Preclinical Evaluation of an Engineered Single-Chain Fragment Variable-Fragment Crystallizable Targeting Human CD44.

Authors:  Philipp Diebolder; Cedric Mpoy; Jalen Scott; Truc T Huynh; Ryan Fields; Dirk Spitzer; Nilantha Bandara; Buck E Rogers
Journal:  J Nucl Med       Date:  2020-06-08       Impact factor: 11.082

Review 10.  Targeting cancer stem cell pathways for cancer therapy.

Authors:  Liqun Yang; Pengfei Shi; Gaichao Zhao; Jie Xu; Wen Peng; Jiayi Zhang; Guanghui Zhang; Xiaowen Wang; Zhen Dong; Fei Chen; Hongjuan Cui
Journal:  Signal Transduct Target Ther       Date:  2020-02-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.