Literature DB >> 32468071

Biomarkers of tumor invasiveness in proteomics (Review).

Daniel L Pouliquen1, Alice Boissard2, Olivier Coqueret1, Catherine Guette2.   

Abstract

Over the past two decades, quantitative proteomics has emerged as an important tool for deciphering the complex molecular events involved in cancers. The number of references involving studies on the cancer metastatic process has doubled since 2010, while the last 5 years have seen the development of novel technologies combining deep proteome coverage capabilities with quantitative consistency and accuracy. To highlight key findings within this huge amount of information, the present review identified a list of tumor invasive biomarkers based on both the literature and data collected on a biocollection of experimental cell lines, tumor models of increasing invasiveness and tumor samples from patients with colorectal or breast cancer. Crossing these different data sources led to 76 proteins of interest out of 1,245 mentioned in the literature. Information on these proteins can potentially be translated into clinical prospects, since they represent potential targets for the development and evaluation of innovative therapies, alone or in combination. Herein, a systematical review of the biology of each of these proteins, including their specific subcellular/extracellular or multiple localizations is presented. Finally, as an important advantage of quantitative proteomics is the ability to provide data on all these molecules simultaneously in cell pellets, body fluids or paraffin‑embedded sections of tumors/invaded tissues, the significance of some of their interconnections is discussed.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32468071      PMCID: PMC7307599          DOI: 10.3892/ijo.2020.5075

Source DB:  PubMed          Journal:  Int J Oncol        ISSN: 1019-6439            Impact factor:   5.650


1. Introduction

The development of molecular characterization of tumors two decades ago led to a revolution in cancer research and therapeutic approaches. In the case of breast cancer, 'molecular portraits' provided by the quantitative analysis of gene expression patterns (1) have helped define different intrinsic subtypes while multiparameter molecular tests such as the PAM-50 provide useful prognostic information (2). However, despite the usefulness of mRNA-based classifications, integrated proteogenomic analyses of breast or colon tumors have revealed that protein abundance cannot be reliably predicted from DNA- or RNA-level measurements as they are only modestly correlated. In particular, proteomics identify colorectal cancer subtypes similar to those detectable by transcriptome profiles, but also reveal features not detectable in transcript profiles, probably as a result of post-transcriptional regulation (3). Further evidence of this unique potential of proteomics-based subtyping was recently provided in the study by Johansson et al who identified protein products mapping to non-coding genomic regions, potentially leading to a new class of tumor-specific immunotherapeutic targets (4). High-throughput proteomics is still an underdeveloped field compared to transcriptomics (4) and its contribution to oncology has probably not yet been fully realized. However, in two decades, quantitative proteomics has rapidly evolved both technologically (5) and strategically, allowing researchers to explore the complexity of protein interaction networks in a wide variety of situations, but also to formulate new hypotheses to be further functionally tested (6). To identify tumor biomarkers to assist in individualizing treatments for certain types of cancer, many complementary technologies have also been developed (7). As an example, for the majority of the common causes of cancer-related mortality worldwide, such as lung cancer, these technologies have led to the identification of predictive biomarkers of drug resistance, candidate biomarkers for diagnosis and prognostic biomarkers (8). To deal with the limited success of 'targeted therapies', quantitative proteomics, together with other major technological and conceptual developments, has reinforced the search for characteristic features of the adhesive-migratory phenotype of malignant cells (9). Some other examples of important contributions include the study of RNA-protein complexes (10), stem cell plasticity (11), chromatin remodeling (12) and more recently, the regulation of mitochondrial function and dynamics (13), shed microvesicles biology (14) or mechanisms of radioresistance (15). The advent of data-independent acquisition methods (DIA) (16), such as sequential window acquisition of all theoretical mass spectra (SWATH-MS) that combines deep proteome coverage capabilities with quantitative consistency and accuracy (17), has further extended this potential. In particular, SWATH-MS does not rely on precursor intensity for fragmentation selection, but rather analyzes all precursors in specified mass ranges (18). This overcomes the limitations of previous data-dependent acquisition (DDA) methods in mass spectrometry related to the large dynamic concentration range observed in body fluids, and the fact that less abundant proteins were usually masked by highly abundant proteins. With regard to relative quantification, strong correlations have been found between protein abundances and functional activities of a set of drug metabolizing enzymes in human liver microsomes (19). Additionally, although mass spectrometry is not commonly used in clinical laboratories, the validation of cancer biomarkers found by this technology compared with western blot analysis or ELISA has already been established (20). It was recently demonstrated that proteomes and transcrip-tomes were better associated in highly proliferative tumors than in lowly proliferative tumors (4). Herein, data on cancer invasiveness are reviewed with the aim of highlighting key findings within the huge amount of information available in the literature. The present review focuses in particular on a list of 76 proteins of interest, selected after crossing with SWATH-MS data collected by our team on experimental models and human tumor samples.

2. Data collection methods

The procedure used to identify the list of proteins of interest is summarized in Fig. 1. First, an initial search on the PubMed database was performed on March 7, 2019, with the keywords 'quantitative proteomics', 'cancer' and 'invasiveness'. In total, 93 studies with full text in the English language were analyzed, published between 2005 and 2019. A file of 1,245 proteins mentioned in comparative analyses between tumor cell lines of different invasiveness, tumor cell lines versus normal cell lines, tumors versus normal tissues, invasive or not, from 42 relevant articles was established. The number of citations of each protein in these articles was then recorded, and the 130 potential candidates for which quantitative changes were documented at least in three different studies, were listed for further examination.
Figure 1

Methodology used for the identification of the 76 main invasiveness biomarkers.

Subsequently, this list of 130 candidates was crossed with experimental data collected on rat malignant mesotheliomas (MMs) differing by their invasiveness (21), and then with clinical data from cohorts of patients with colon adenocarcinoma (22) or breast cancer (23). As a result of technological improvements that occurred between these two studies, the number of specific biomarkers identified in breast cancer was higher than that in colon cancer. Although our team is primarily focused on breast cancer, the authors wished to extend the present review to other types of malignant tumors originating in two different tissues in order to validate the most robust and generalizable biomarkers. This led to a final list of 76 upregulated or downregulated proteins common to the three sources of data, which represent potential tumor invasive biomarkers. The present review is based on i) articles selected from the procedure illustrated in Fig. 1; and ii) the screening of literature for each individual protein listed in Table I with the following keywords 'name of the protein' and 'cancer', without or with 'invasiveness'. Finally, some articles combining two or more of these protein names with these keywords were also analyzed.
Table I

List of biomarkers of invasiveness identified from quantitative proteomic studies.

CodeProtein namePubMed citations *Exp. 1 rat data #Exp. 2 rat data §Clinical data
1433E14-3-3 protein epsilon3x
1433Z14-3-3 protein zeta/delta3
ACLYATP-citrate synthase3
ACTN4Actinin-46x
AL1A1Retinal dehydrogenase 16x
ANXA1Annexin A19x
ANXA2Annexin A26x
ANXA6Annexin A65
APMAPAdipocyte plasma membrane-associated protein3x
ATPAATP synthase subunit alpha, mitochondrial4
ATPDATP synthase subunit delta, mitochondrial4x
BGH3Transforming growth factor-beta-induced protein ig-h35
BIPEndoplasmic reticulum chaperone BIP4x
CADH1Cadherin-14x
CALRCalreticulin5
CALXCalnexin5x
CATDCathepsin D9
CD166CD166 antigen5x
CLUSClusterin6x
CO1A2Collagen alpha-2(I) chain4
COF1Cofilin-16
COIA1Collagen alpha-1(XVIII) chain4
COX5ACytochrome c oxidase subunit 5A, mitochondrial4
DPYL3Dihydropyrimidinase-related protein 34x
ECM1Extracellular matrix protein 15x
EZRIEzrin4
FASFatty acid synthase3
FETUAAlpha-2-HS-glycoprotein4x
FIBGFibrinogen gamma chain3
FINCFibronectin5
FLNAFilamin-A5
FSCN1Fascin3x
GELSGelsolin8x
GRP75Stress-70 protein, mitochondrial5x
GSTP1Glutathione-S-transferase P8x
H4Histone H43x
HS90AHeat shock protein HSP 90-alpha3x
HSPB1Heat shock protein beta-16
ITGB1Integrin beta15x
K1C10Keratin, type I cytoskeletal 103x
K1C18Keratin, type I cytoskeletal 184
K2C8Keratin, type II cytoskeletal 84
KPYMPyruvate kinase PKM3
LEG1Galectin-16
LEG3Galectin-34
LMNAPrelamin-A/C3x
LUMLumican4
MMP272 kDa type IV collagenase6x
MMP9Matrix metalloproteinase-94x
MYH9Myosin-93x
NID1Nidogen-13x
NPMNucleophosmin6
PAI1Plasminogen activator inhibitor 16x
PCNAProliferating cell nuclear antigen3x
PDIA4Protein disulfide-isomerase A43
PDIA6Protein disulfide-isomerase A63
PHBProhibitin3x
PHB2Prohibitin-23
POSTNPeriostin7
PRDX1Peroxiredoxin-16
PRDX2Peroxiredoxin-23x
PROF1Profilin-16
RANGTP-binding nuclear protein Ran3
S10A4Protein S100-A43x
S10A6Protein S100-A64
SERPHSerpin H14
SSBPSingle-stranded DNA-binding protein, mitochondrial3x
STMN1Stathmin5x
TAGLTransgelin3
TENATenascin5
TGM2Protein-glutamine gamma-glutamyltransferase 26
TPISTriosephosphate isomerase4
TPM2Tropomyosin beta chain4x
TSP1Thombospondin-17
VIMEVimentin8
VINCVinculin4

Common list between the 130 proteins with ≥3 citations (PubMed, keywords: 'quantitative proteomics + cancer + invasiveness'), and experimental rat tumors data (shown below with the

or § symbols and in Fig. 1). #, Shared in common with the 137 proteins listed in supplemental Table S2 in a previous study (21) (common list between each invasive tumors M5-T1, F5-T1 or F4-T2 vs. non-invasive M5-T2).

Shared in common with the list of 457 proteins exhibiting significant changes (P<0.05, Marker View statistical test), two most invasive tumors (M5-T1 and F5-T1) vs. non-invasive (M5-T2) (21).

Shared in common with the list of proteins exhibiting significant changes (P<0.05) from two clinical studies on patients with colorectal adenocarcinoma (□) (22), or breast cancer (x) (23), or from both cohorts (■).

3. Extracellular matrix (ECM)

A number of the proteins in the list of potential biomarkers of invasiveness are localized in the ECM, either specifically or in parallel to other intracellular localizations, which are summarized in Fig. 2.
Figure 2

Biomarkers distributed in the extracellular matrix (ECM) and/or other cell compartments.

S100 proteins

Communications between stromal cells and cancer cells represent a key parameter of invasiveness. The S100 family of proteins in particular is involved in the formation and maintenance of a pro-inflammatory environment. Among the 21 members of this family present in humans, all regulated by Ca2+ binding, 11 are described in the literature in relation to the subject of the present review (S100A4, S100A6, S100A7, S100A8, S100A9, S100A10, S100A11, S100A13, S100A14, S100A16 and S100P). S100A4 and S100A6 are the most extensively documented; however, S100A8, which forms heterodimers with S100A9 [the overexpression of which is very useful for the identification of circulating tumor cells (24)], is also involved in the regulation of inflammatory responses and pre-metastatic niche formation, together with S100A4 (25,26). The heterodimeric protein, named calprotectin, is released by activated granulocytes, and functions in a cytokine-like manner by triggering signaling pathways involved in inflammatory processes. A recent review revealed the complex functionality of this molecule, emphasizing the fact that its function depends on its concentration and location inside or outside the cell (27). Of note, S100A9 and S100A4 exhibit common molecular interactions, including heterodimerization which require levels of Zn2+ that are found in the extracellular space, but not intracellularly (28). The contribution of S100A4, also known as metastasin, to tumor progression and metastasis in various tumors has been well documented over the past two decades, as recently reviewed (29). In normal and benign lesions, S100A4 is restricted to a few stromal fibroblasts and inflammatory cells (30). By contrast, both tumor cells and stromal cells secrete the protein in malignant tumors (25,30). Moreover, in the context of a pre-metastatic niche (31), the production of S100A4 serves as a link between inflammation and tumor metastasis and is indicative of a poor prognosis (32). S100A4 is also involved in the epithelial-mesenchymal transition (EMT) and is particularly highly expressed in the peripheral leading edge of breast cancer (26) and non-small cell lung cancer (33). S100A4 is related to tumor characteristics associated with a poor prognosis. The combined gain of S100A4 and the loss of membrane E-cadherin in cervical cancer tend to confirm its link with an unfavorable prognosis (34), in good agreement with previous studies by the authros on experimental malignant mesothelioma (21,35). S100A6, also known as calcyclin, is overexpressed in the majority of cancers, and its involvement in tumor cell motility has been widely documented (36). In metastatic tumor tissues, S100A6 expression has also been found to be higher than that in non-metastatic tissues (37). However, the molecular mechanisms underpinning the ability of this protein to regulate cell motility are not yet completely understood, as the down- or upregulation of S100A6 expression has been shown to lead to increased or decreased migration, respectively, in the particular case of osteosarcoma (38). Intracellularly and in vitro, its role in cytoskeletal reorganization and EMT has been widely investigated, although to date, several questions remain unanswered as regards its function/regulation in vivo, such as the confirmation of direct interaction with the cyto-skeleton and the determination of the primary cause(s) of its increased expression (36). Finally, S100A6 can be secreted or released by some cell types and the observation that serum levels are elevated in patients with gastric cancer, which is associated with lymph node metastasis (39), raises the question of the extracellular effects of this protein (36).

Annexins and galectins

Although functionally unrelated, these two families of proteins for which extracellular localization has been well documented, both share evidence of secretion via direct translocation and using a vesicle-based pathway, instead of the conventional endoplasmic reticulum/Golgi network in the context of cancer (40).

Annexins

Annexins are all Ca2+-regulated, phospholipid binding proteins involved in membrane domain organization and membrane trafficking (41); however, they have functionally distinct roles inside and outside cells. Annexin A6 is the largest member of the family, containing two annexin domains connected by a linker, and is distributed differently from Annexins A1 and A2 (42). Annexin A6 was initially demonstrated to be mainly located at the plasma membrane and endosomal compartment, although an increasing number of studies have presented evidence for its extracellular role in cell migration (43). Similar to Annexin A2, Annexin A6 is secreted via the exosomal pathway, and is predominantly cell surface-associated in invasive breast cancer cells (44). Keklikoglou et al recently demonstrated that extracellular vesicles enriched in Annexin A6 were released by mouse mammary cancer cells when treated with chemotherapy to facilitate the establishment of lung metastasis (45). Annexin A6 stabilizes activated EGFR on the cell surface for persistent downstream signaling (46). Nevertheless, Annexin A6 expression resumed in these invasive cancer cells following prolonged treatment with tyrosine kinase inhibitors, in relation to the recycling of two integrin receptors of fibronectin, and to accumulation of cholesterol in late endosomes (47,48). In humans, the contribution of Annexin A6 to increased pancreatic ductal adenocarcinoma has also been reported to depend on tumor cell-mediated uptake of an ANXA6/LRP1/TSP1 complex produced by cancer-associated fibroblasts (CAFs) (49). Finally, apart from reorganization of membrane lipids, Annexin A6 also forms reversible, membrane-cytoskeleton complexes through interactions with actin and α-actinin, and facilitates localized recruitment of signaling proteins by binding to a number of them (43). In particular, a recent study documented its role as a scaffold for Ras proteins (50). Although Annexins A1, A2 and A6 are all present in endosomal compartments, the extracellular functions of Annexins A1 and A2 differ in several respects: Annexin A1 is involved in the regulation of inflammation, apoptosis, leukocyte trafficking and inhibition of neutrophil and monocyte extravasation (51), while Annexin A2 is a co-receptor for tissue plasminogen and participates in neoangiogenesis (41). Early studies have demonstrated that Annexin A1 regulates leukocyte migratory events through its function as an agonist of n-formylpeptide receptors (FPRs), initiating a cascade of signaling events (52). Further investigations have indicated that the binding of Annexin A1 may lead to both pro- and anti-inflammatory effects depending on the type of ligand, its expression being either increased or decreased in different types of cancers(53). In the particular case of mammary tumorigenesis, studies have revealed that the expression and functional roles of Annexin A1 are controversial (54,55), suggesting an association which may be much more complex than initially thought (56). This is probably due to its diverse actions on the many cell types of the tumor and also to the great complexity of the network of ANXA1-regulated proteins (54). The contribution of hypoxia to the combined upregulation of this protein and S100A4 has recently been described (57). In addition, a link between MMP9 and these two proteins has been reported (58), mirroring previous findings (59). Annexin A2 is a multifunctional protein found at various cellular locations. In addition to being present in soluble form in the cytoplasm or associated with the actin cytoskeleton, and both the intra- and extracellular sides of the plasma membrane, this crucial protein is subjected to complex regulation via ligand binding and post-translational modifications (60). Although it is overexpressed in numerous types of cancer, both its upregulation and downregulation have been suggested as prognostic biomarkers (61). The overexpression of Annexin A2 in glioblastoma is associated with tumor aggressiveness and patient survival, and with a mesenchymal phenotype (62). These observations are coherent with the involvement of EMT in ovarian and colorectal cancer inva-siveness (63,64). The role of EMT in pancreatic cancer has also been reported to depend on the combined expression of tenascin C and Annexin A2 (65). Other studies in the same field have described links between Annexin A2 expression and S100A4 or S100A6 as prognostic biomarkers for invasive types of urothelial carcinoma (66) or gastric cancer (67). For invasive breast cancers characterized by increased glycolysis and carbonyl stress, the production of advanced glycation end products (AGEs) and AGE-modified proteins leads to alterations, affecting several proteins in parallel to the increase in Annexin A2, including fibrinogen gamma chain and prohibitin present in the list of biomarkers in the present review (68). Notably, a proteomic analysis of pleural effusion from lung adenocarcinoma patients identified seven proteins not previously reported in plasma, including Annexin A2, as well as another protein from our list, BGH3 (69). Finally, an investigation of the interaction of ovarian cancer and peritoneal cells identified several ECM proteins including Annexin A2, and highlighted links with BGH3, PAI1, fibronectin and periostin (70).

Galectins

Since the pioneering work dedicated to the comparative expression of galectins 1 and 3 in advanced cancers (71), revealing in particular their contribution to the stimulation of glioblastoma cell migration (72), interest in these proteins has continued to grow over the past two decades. Galectin 1 belongs to the first group of this family, characterized by the presence of one conserved carbohydrate-recognition domain (CRD). It represents an interesting target for the development of cancer therapies (73). Previous studies have confirmed the pioneering discovery that galectin 1 overexpression is involved in cancer invasiveness. By 2016, the role of galectin 1 in cancer progression had been clearly demonstrated, although the mechanisms underlying its different actions were not well understood (74). To overcome this gap in knowledge, numerous studies have been conducted over the past few years. For example, Bhat et al described the role of nuclear localization, suggesting that differential glycosylation at the level of tissue microanatomy regulates this parameter in breast carcinoma (75). Shen et al demonstrated the mechanisms through which galectin 1 mediates the activity of matrix metalloproteinase (MMP)9 through the Ras-Rac1-MEKK4-JNK-AP1 signaling pathway in urinary bladder urothelial carcinoma cell invasion (76). In addition, the mechanisms through which galectin 1 induces EMT have been investigated in detail, including its secretion by cancer-associated fibroblasts and its binding to integrin β1 (ITGB1) (77) or the non-canonical hedgehog pathway (78) in gastric cancer, and the activation of an αvβ3-integrin/FAK/PI3K/AKT signaling pathway in hepatocellular carcinoma (79). Finally, additional findings by Qian et al revealed that galectin 1 induces the secretion of stromal cell-derived factor-1, thus modulating stromal pancreatic stellate cells in the context of pancreatic cancer metastasis (80), and plays an important role in immune escape of gingival squamous cell carcinoma through induction of T cell apoptosis (81). However, galectin 3 has incited twice as much interest as galectin 1 in the context of cancer invasiveness, and a seven-fold increase was observed in the number of PubMed references on galectin 3 between the years 1999 and 2014. The wealth of data collected during this period has led to several reviews of excellent quality. Galectin 3 expression is closely involved in tumor cell transformation, migration, invasion and metastasis in a wide variety of cancers (82). The value of this protein as a prognostic biomarker for gastric cancer was confirmed in 2018 by the observation that its overexpression is associated with shorter overall survival in all patients (83). Galectin 3 orchestrates different cell events in the tumor microenvironment, suppressing immune surveillance by killing T cells and interfering with NK cell function (84). Upon secretion, galectin 3 can oligomerize, playing a homeostatic role in tumors by favoring either the exit of tumor cells from a stressed environment or the entry of endothelial and immune cells into the tumor organoid (85). Finally, two other biomarkers involved in cell-matrix interactions and present in the list of invasive biomarkers in the present review, fibronectin and S100A4, were found together with galectin 1 and galectin 3, in a list of 61 statistically significant differentially expressed genes associated with ERBB2 overexpression in breast cancers, and their clinical relevance was demonstrated at the protein level (86).

Cathepsin D and transglutaminase 2 (TGM2)

Another type of secreted protein affecting both cancer cells and stromal cells is cathepsin D (CATD), a lysosomal cysteine and aspartic proteinase, which is generally overexpressed in aggressive cancers and is associated with a poor prognosis (87). In the case of breast cancer, Derocq et al demonstrated that CATD secreted in the extracellular environment triggers fibroblast outgrowth by binding to the β-chain of the LDL receptor-related protein-1 (LRP1), a scavenger receptor mediating the endocy-tosis of various extracellular ligands and involved in signal transduction and gene transcription (88). The multiple roles of CATD show increasing promise for the development of novel anticancer agents (89). The ubiquitous TGM2 belongs to a family of enzymes that catalyze the formation of covalent bonds between a free ε-amino group of a lysine and the γ-carboxyl group of a gluta-mine. Although localized in different cellular compartments, it can also be exported from the cell, then interacting with and/or cross-linking numerous components of the ECM where together with nuclear factor (NF)-κB it influences cellular sensitivity to genotoxic agents and activates EMT (90). TGM2 is overexpressed in various types of cancer, where it remodels and stabilizes the ECM in association with MMP2 and MMP9, and high levels of TGM2 are associated with lower survival rates (91).

Collagen type I and gelatinases

Among the 28 types of collagens that represent the most abundant proteins of the extracellular matrix, two are present in our list of invasive biomarkers, both of which assemble into higher-order supra-molecular structures such as fibrils (92). Almost ten years ago, Garamszegi et al reported that collagen type I, among other ECM molecules, induces Smad2 activation in human breast cancer cells, suggesting that cell-matrix communication was more complex than previously thought (93). The interest for this molecule has grown during over the past decade through the discovery that this major fibrillary component of the stroma induces the dedifferentiation of epithelial cells and the disruption of the E-cadherin adhesion complex (94). In parallel, the overexpression of this type of collagen has been shown to be associated with tumor development in gastric cancer (95) and medulloblastoma (96). Subsequently, a detailed proteomic analysis of breast tissue collagens has confirmed that fibrillary collagens, in particular, are increased in tumor tissue compared to matched normal tissue, in agreement with findings by other authors (97). A link with the presence of bone marrow-derived fibrocyte-like cells, defined as α-1 type I collagen-positive cells producing FGF2, has also been reported in human malignant mesothelioma (98). More recently, Rong et al demonstrated that the collagen type I alpha 2 chain (COL1A2) gene modulated cell motility through interaction with the cytoskeleton (99). Cancer cells share with immune cells the ability to penetrate the dense network of the BM. To cross this barrier, they secrete different categories of proteases, in particular MMPs (also known as matrixins) (92), among which MMP2 and MMP9 (gelatinases a and b, respectively) expression has been shown to be associated with invasiveness (100). MMP9 has the particularity of degrading both type I and type IV collagens, and for this reason it facilitates invasion across the BM (101) in association with Annexin A1 (59), resulting in a poor prognosis (102). The association of both MMP9 and MMP2 with tumor invasiveness was further confirmed for a number of cancer types (103). A link with S100A4 was soon established in the context of breast cancer invasiveness (104). These investigations demonstrated that exposure to interleukin (IL)-1β induced EMT in MCF-7 cells, which could then respond to chemokine CXCL12 (101), ultimately leading to the expression of S100A4 and increased secretion of MMP2 and MMP9 (105). Matsuura et al contributed further insight into the mechanistic process by demonstrating that S100A4 binds to Smad3, an important mediator of transforming growth factor (TGF)-β signaling, in a Ca2+-dependent manner, which ultimately induces MMP9 expression (106).

Fibronectins

Fibronectin is a major stromal protein associated with tumors. It is a part of the four matrix components that are common to breast cancer progression and mammary gland involution (107). Its contribution to the mechanisms through which a tumor cell undergoes metastasis to a predetermined location was initially documented by Kaplan et al in 2005, who demonstrated that bone marrow-derived hematopoietic progenitors that express the vascular endothelial growth factor receptor 1 (VEGFR1) form cellular clusters that upregulate fibronectin, providing a niche for incoming tumor cells (108). Other factors have been identified more recently, including a platelet ADP receptor that recruits VEGFR1+ cells in the lung, fibroblasts that secrete both fibronectin and S100A4, or exosomes produced by pancreatic ductal adenocarcinoma cells and taken up by Kupffer cells, leading to upregulation of fibronectin production by hepatic stellate cells (31). Another advancement in the understanding of the metastatic process was the discovery that several ECM proteins, including fibro-nectin are processed via the plasminogen-plasmin pathway as a result of interactions between ovarian cancer cells and peritoneal cells (70). Interactions between fibronectin and integrins, in particular α5β1, could provide interesting prospects for the treatment of stroma-rich tumors. Studies on the parameters that regulate such interactions have highlighted the modulating role played by tenascin C (109). Finally, fibro-nectin plays other important roles, revealed by silencing its gene which leads to the inhibition of cell proliferation and the promotion of cell senescence and apoptosis via the regulation of the PI3K/AKT signaling pathway (110).

Proteoglycans

Proteoglycans, which consist of a core protein and glycosaminoglycan side-chains, are a large family of proteins involved in interactions of cells with the ECM. Two of these, ECM1 and LUM, appear in the list of invasive biomarkers in the present review. Extracellular matrix protein 1 (ECM1) is significantly elevated in a number of epithelial tumors, giving rise to metastases, and its high expression around blood vessels has been suggested to play a role in angiogenesis (111). Interest for this protein grew significantly after a 2008 study classifying tumors on the basis of the expression of ECM components in relation to different clinical outcomes (112). Subsequently, Lal et al established this protein as a novel prognostic biomarker for poor long-term survival in 134 women diagnosed with invasive breast cancer (113), and the link with EMT was documented by two independent studies on breast cancer and hepatocellular carcinoma (114,115). The silencing of ECM1 in two triple-negative breast cancer cell lines has revealed that it regulates actin cytoskeletal architecture and decreases the expression of the prometastatic protein S100A4, suggesting that it is the primary effector of the changes observed in morphology, migration, invasion and adhesion (116). Finally, ECM1 facilitates the expression of genes associated with EMT by activating the ITGB4/FAK/glycogen synthase kinase 3β signaling pathway (117). In contrast to ECM1, lumican (LUM) initially exhibited antitumor activity (118,119). However, in 2012, Radwanska et al revealed that the role of lumican in cancer cell motility was more complex than previously considered, through the involvement of gelsolin, a protein with multiple effects on the cellular pool of actin (120). The role of lumican in cancer appears to vary according to the type of tumor, with a restrictive role in prostate cancer invasion (121) and a role in colorectal adenoma-to-carcinoma progression (122). Its complex role has been investigated in detail in several studies, revealing in particular that glioblastoma and neuroblastoma cancer stem-like cells promote the activation of large quantities of lumican and decorin in the context of temozolomide resistance (123), while lumican deficiency promotes a pro-angiogenic tumor phenotype in melanoma (124).

Matricellular proteins

In addition to the galectins discussed above, two other proteins belonging to this category are present in the list of invasive biomarkers in the present review, periostin and thrombospondin-1 (TSP1), both of which have been reported to represent secreted matrix molecules decorating ECM fibers in the process of tissue-specific restricted guidance of cancer invasion (125). Periostin (also known as POST) interacts with multiple cell-surface receptors, particularly integrins, promoting cancer cell survival, EMT, invasion and metastasis (126). In a large cohort of 300 patients with breast cancer, Kim et al demonstrated that a high epithelial periostin expression was more frequently observed in the distant metastatic relapse-positive group, compared with the negative group and was associated with a reduced overall survival (127). In relation to its ability to induce EMT, Mino et al reported that periostin expression was markedly higher in an undifferentiated intrahepatic cholangiocarcinoma cell line compared with a moderately differentiated one (128). Similar contradictory findings were initially reported for TSP1 in relation to cancer progression, as this protein presents both stimulatory and inhibitory effects (129). An immunohisto-chemistry study of 80 cases of intraductal papillary-mucinous neoplasms of the pancreas revealed an association between TSP1 and tumor invasiveness: Patients in the strongly positive group exhibited a significantly poorer prognosis compared with the negative group (130). Of note, Firlej et al described the capacity of TSP1 to increase both hypoxia and cell migration which appear to be linked (131). In another type of cancer for which hypoxia is a hallmark, the increased aggressiveness of pancreatic ductal adenocarcinoma is dependent on the formation of Annexin A6/LDL receptor-related protein 1/TSP1 complexes by cancer-associated fibroblasts (CAFs) and their uptake by tumor cells (49). Platelet-secreted TSP1 contributes to colon cancer invasiveness by promoting the signal regulation of MMP-9 via the p38MAPK pathway (132). Finally, Joshi et al investigated the mechanisms through which bone marrow-derived mesenchymal stromal cells and prostate cancer cells interact, demonstrating that the bioactive principle responsible for this chemotaxis in co-culture was present in a high-molecular weight fraction containing TSP1, and the formation of complexes of this protein with fragments of fibronectin function as matrikines (133).

TGF-β-induced protein ig-h3

BGH3 (TGFBI, also known as βig-h3) is involved in cell-collagen interactions by binding to types I, II and IV collagen. It has been described as a promoter or a suppressor of cancer growth as its effect is apparently highly cell type-dependent. BGH3 is upregulated in a number of tumor types, and Lebdai et al demonstrated that this overexpression in aggressive clear cell renal cell carcinoma was associated with the stage, size, grade and necrosis (SSIGN) score, as well as with outcomes (134). In melanoma, Nummela et al also reported that BGH3 impaired the adhesion of melanoma cells to collagen type I, fibronectin and laminin, thus confirming its role as an important regulator of invasive growth (135). Additionally, Klamer et al revealed that the upregulation of BGH3 in hematopoietic stem and progenitor cells may help loosen the adhesive contacts with the bone marrow niche from which they originate, thus making them susceptible for polarization and subsequent egress (136). Finally, quantitative changes in BGH3 were also related to the parallel evolution of other ECM proteins associated with inva-siveness, such as fibronectin, periostin and Annexin A2 (70), enzymes involved in redox regulation of the cell, such as peroxiredoxins (137), or S100A4 in the context of EMT (138).

CD166 antigen

CD166 [also known as activated leukocyte cell adhesion molecule (ALCAM)], is a member of the immunoglobulin superfamily. It was first described as a novel actor in invasive growth and control of matrix metalloproteinase activity (139). A first review of its interest in cancer highlighted the existence of a marked heterogeneity of expression in different tumors, with two additional levels of complexity due to the fact that its expression is dependent on the stage of tumor development and on RNA and protein levels in breast cancer tissues (140). Weidle et al subsequently confirmed and detailed the context-dependent prognostic impact of CD166 expression in cancers (141). The recent finding by von Lersner et al that CD166 promotes malignant behavior through the regulation of its availability (dynamic turnover of the protein at the cell surface) rather than its specific activity, provides a key explanation of its heterogeneous expression within malignant diseases (142).

Nidogen-1 (NID1)

Basement membranes (BMs) consist mainly of collagen type IV, laminins and glycoproteins, including nidogens and heparin sulfate proteoglycans (143). The main component of nidogens, NID1, binds to laminin and acts as a bridge to the collagen network to complete the core basement membrane scaffold (144). However, in studies on the matrisome, which includes not only the structural components of the BM, but also proteins that it interacts with, or modifies, the ECM, NID1was additionally found to be expressed by the stroma (145). Among the numerous studies documenting the implication of NID1 in cancers, Zhou et al demonstrated that the expression of NID1 in ovarian cancer cells revealed an EMT phenotype characterized by the enhancement of mobility, invasiveness and cisplatin resistance (146), while Pedrola et al observed a significant increase in NID1 expression in the invasion front of endometrial tumors compared to their paired superficial zone (147).

4. Plasma proteins

Plasminogen activator inhibitor 1 (PAI1)

PAI1 is a glycoprotein synthesized by various normal cells and a large number of different tumor cells. It belongs to the serine protease inhibitor super family (SERPIN), which probably represents the most important component of the plasminogen activator system (148). A high tumor level of this protein has been associated with a poor patient prognosis (149). This association was confirmed in renal cell carcinoma (150) and breast cancer; higher concentrations of PAI1 have been shown to be associated with an aggressive phenotype and a poor prognosis, with a positive correlation has been found with MMP9 (151). Rhone et al further analyzed the clinicopathological determinants of patients with invasive breast cancer and found significantly higher PAI1 concentrations in patients with ductal carcinoma compared to those with lobular carcinoma, suggesting that a high PAI1 expression predisposes to a pro-coagulant environment expressed by simultaneous activation of coagulation and fibrinolysis suppression (152). Finally, PAI1 is now considered to be a prognostic factor, particularly in breast cancer, and Li et al recently reviewed the numerous tumor promoting factors involved in the modulation of PAI1 activity (153).

Alpha-2-HS-glycoprotein

Alpha-2-HS-glycoprotein [also known as fetuin-A (FETUA)] is a serum glycoprotein involved in the adhesion of tumor cells, functioning as a chemoattractant in breast cancer progression (154), and interacting synergistically with CXCL12 at low concentrations (155). FETUA is endocytosed by tumor cells, enhancing the secretion of exosomes to the extracellular milieu that ultimately promotes cell spreading and adhesion, a process that also requires Annexin A2 and A6 (44,46,50,156). Combined with ECM1, the diagnostic potential of this protein has recently been confirmed in another cancer type, non-small cell lung cancer (157).

Fibrinogen gamma chain (FGG)

Interest for this protein in cancer emerged through the discovery of its presence in a list of seven molecules associated with the formation of AGEs in tumors exhibiting increased glycolysis and carbonyl stress (68). Together with other downstream thrombin procoagulant targets, fibrinogen has long been investigated as a promoter of tumor cell metastatic potential, which was confirmed through the experimental demonstration that tumor growth was markedly impeded in fibrinogen-deficient mice (158). Additional evidence was provided by Honda et al, who found a protein complex containing FGG in plasma from patients with advanced ovarian cancer (159), and by the report of its association with gastric cancer (160). Finally, in a study investigating the clinicopathological significance of this protein in the process of migration and invasion of hepatocellular carcinoma cells, a higher FGG expression was significantly associated with a higher recurrence rate and a shorter survival through EMT signaling by regulating the expression levels of Slug and zinc finger E-box-binding homeobox 1 (ZEB1) (161).

5. Membrane proteins

A number of proteins are known to be localized in the membrane but are not restricted to it (Fig. 3). The first category includes ITB1 and APMAP. The second category, which corresponds to proteins additionally localized in Golgi and endosomes (CADH1), ER (PDIA6), or cytoplasm (HS90A), will be discussed in the third paragraph of this chapter.
Figure 3

Biomarkers distributed in the cell membrane, cytosol and endoplasmic reticulum (ER).

ITGB1

More than 10 molecules are known to bind to the cytoplasmic tail of this protein, some of them acting as binding platforms for cytoskeletal and signaling molecules; however, the mechanisms through which integrin signaling is induced in the intracellular space have not yet been elucidated (162). To date, the associations between ITGB1 and the clinical features of patients with cancer are still unclear and a number of studies have reported contradictory conclusions depending on the type of cancer (163,164).

Adipocyte plasma membrane-associated protein (APMAP)

Recently, APMAP, discovered in 2001 (165) during investigations on the course of adipocyte differentiation, was found to interact with two extracellular collagen cross-linking matrix proteins, lysyl oxidase-like 1 and 3, suggesting that it may represent a novel regulator of extracellular matrix components (166). A crucial step was the discovery that this protein was increased in sera from a large cohort of patients with prostate cancer compared to a control group (167). A few months ago, these investigations finally led Jiang et al to reveal a novel crucial functional role of cholesterol, which was shown to induce EMT in prostate cancer metastasis through the accumulation of APMAP in cholesterol-induced lipid rafts (168).

Membrane proteins with multiple localizations

The keystone position of S100A4 in the acquisition of invasiveness by cancer cells can be illustrated by its inverse correlation with E-cadherin expression in different invasive phenotypes of oral squamous cell carcinoma (169). In melanoma cells, this inhibition of E-cadherin expression, which leads to EMT and increased invasiveness, has been shown to be associated with the upregulation of Annexin A1, which can be reversed by the use of small interfering RNAs (170). Overall, E-cadherin functional loss has been associated with a poor prognosis and survival in various types of cancer (171). Notably, Yu et al recently reported an inverse correlation between E-cadherin and peroxiredoxin 1 expression, and demonstrated that these two combined parameters were associated with EMT, poor differentiation, deeper invasion and an advanced TNM stage of gastric cancer (172).

6. Cytoplasm/cytosol

Potential biomarkers of invasiveness present in the cytosol/cytoplasm are summarized in Fig. 3.

Metabolic enzymes

Glycolysis dysregulation is a main hallmark of cancer cells and has been the subject of extensive investigations (173) since the research by Warburg a century ago (174). These studies have emphasized the key role of enzymes in the adaptation to hypoxia, including triose phosphate isomerase (TPI) which catalyzes the interconversion of dihydroxyacetone phosphate and glyceraldehyde-3-phosphate, playing an important role in the development of many types of cancers (175). Apart from its metabolic function, Lincet and Icard demonstrated that this enzyme also participated in cell cycle activation (176), while its transcriptional regulation involves microRNAs (miRNAs/miRs)-22/28 (177). A growing interest in fatty acid synthase (FAS) emerged in oncology in the mid-2000s, when this lipogenic enzyme was found to confer growth and survival advantages to cancer cells rather than functioning as an anabolic energy-storage pathway (178). FAS upregulation has been shown to be associated with a poor prognosis in a variety of cancers, although the underlying mechanisms are yet not completely understood. Its localization in the nucleus in a subset of prostate cancer cells has been found to be associated with the Gleason grade (179). Subsequently, Wang et al revealed that the knockdown of this enzyme in human colorectal cancer cell lines attenuated the activation of the Wnt signaling pathway and metastasis; a positive correlation was observed in patients between FAS expression and Wnt signal biomarker gene expression (180). These investigations opened up interesting therapeutic prospects against prostate (181) and breast cancer cells (182). Another key enzyme involved in the biosynthesis of fatty acids is ATP-citrate lyase (ACLY), which is upregulated or activated in several types of cancer (183), particularly in gastric cancer, where its regulation involves miR-133b (184). These observations have led to the development of ACLY inhibitors attracting interest as promising anticancer agents (185), with citrate levels monitored as an indicator of cancer aggressiveness and/or as a biomarker for response to therapy (186).

Peroxiredoxins

A number of cancer cells are characterized by an increase in reactive oxygen species (ROS) and a dysregulation of enzymes involved in the redox-regulating proteins, in particular peroxiredoxins, which catalyze the peroxide reduction of H2O2, organic hydroperoxides and peroxynitrite (187). The over-simplification of the role of 'antioxidants', which until now was attributed to these enzymes has recently been questioned (188), as the genetic disruption of their expression in mice has been shown to lead to an increased incidence of neoplasia, consistent with a probable role in the protection of genomic integrity (189). Moreover, although their overexpression has been mostly reported in various malignant tumors, the suppression of 2-Cys peroxiredoxins has also been found in certain metastatic cancers (190), raising questions as to their complex functions related to tumor cell invasiveness, with important implications for therapies (191). In the particular case of peroxiredoxin 1, a major member of the family present mainly in the cytosol, the recent study by Kim et al demonstrated that this protein has RNA-binding properties, binding to a specific subset of small nucleolar RNAs (snoRNAs) and regulating these molecules at the post-transcriptional level (192).

Pyruvate kinase PKM

PKM, which catalyzes the final step in glycolysis, consists of four isoforms in mammals, two of which, PKM1 and PKM2, are encoded by the PKM gene through alternative splicing of mutually exclusive exons (193). Although PKM2. but not PKM1 was initially considered to favor cancer cell proliferation, the exclusive role of PKM2 in tumorigenesis has recently been challenged. In the particular case of liver tumorigenesis, various PKM1/PKM2 ratios and pyruvate kinase activities can sustain the glucose catabolism required for the process (194). Moreover, although PKM2 has been suggested to be the predominant isoform in cancer cells, providing a basis for the development of novel therapeutic strategies (195), mass spectrometry-based proteomic analyses have demonstrated that PKM2 can be detected in both cancer and normal cells (196). Nevertheless, deeper investigations evaluating the mechanisms through which the two isoforms regulate the invasiveness of pancreatic ductal adenocarcinoma have revealed that both regulate cell migration and invasion in vitro, but only PKM2 overexpression the promotes metastasis of cancer cells in vivo (197).

Aldehyde dehydrogenases

This superfamily consists of 19 proteins displaying mainly catalytic functions involved in detoxification, and their role in cancer has been widely emphasized over the past decade (198), particularly in relation to stem cells and resistance to chemotherapy (199). The overexpression of the first member of this family, ALDH1A1, also known as retinal dehydrogenase 1, is generally associated with poor outcomes (200-202).

7. Cytoskeleton

Among the 76 upregulated or downregulated proteins in the list of potential tumor invasive biomarkers in the present review, seven belong to the cytoskeleton, while another seven interact with the membrane or the nucleus (Fig. 4).
Figure 4

Cytoskeletal biomarkers and their associations with other cell compartments.

Cytoskeletal proteins

Transgelin is a protein which affects the dynamics of the actin cytoskeleton through the stabilization of actin filaments. This biomarker is involved in a number of cancer-related processes and was initially described as a tumor suppressor. The transgelin level has been shown to be decreased in tumor cells relative to cells of healthy tissues. However, it can be highly expressed by the reactive tumor stroma and its re-expression in tumor cells in the more advanced stages of cancer can support migration and metastasis (203). In support of this observation, transgelin positivity has been associated with more aggressive tumors, a high Ki-67 index and low estrogen and progesterone receptor expression levels in of breast cancer (204). Fascin is another protein regulating cytoskeletal structures which coordinates motility and invasion and promotes filopodia formation in carcinoma cells. Although absent from most normal epithelia, its expression has been associated with metastasis in colorectal and gastric cancers (205). Stathmin is a major microtubule-destabilizing protein which promotes microtubule depolymerization and mediates the effects of p27Kip1, an inhibitor of cyclin-dependent kinase complexes (206). This action is obtained either through the sequestration of free tubulin dimers or directly by the induction of the microtubule-catastrophe. Thus, stathmin is an important target of the main regulator of the M phase and offers interesting prospects for anti-metastatic therapies (207). The value of stathmin as a prognostic biomarker has been confirmed by the observation that its overexpression is associated with tumor cell differentiation, lymph node invasion and a high TNM stage (208). Profilin-1 (PROF1) is an actin-monomer binding protein ubiquitously expressed in all cell types. It regulates actin dynamics and cell motility and plays an important role in the migration of cancer cells (209). In addition to its sequestering function on actin monomers, PROF1 promotes the assembly of globular-actin monomers (G-actin) into filamentous-actin (F-actin) (210), interacts with certain membrane lipids, and is also involved in regulating the expression of several cancer stem cell genes (211). PROF1 has been shown to be downregulated in different types of cancer, including breast, pancreatic and hepatocellular carcinoma, and it is associated with aggressive clinicopathological characteristics and a poor prognosis (212). Filamin A (FLNA) was the first actin filament cross-linking protein identified in non-muscle cells and the contribution of its structure and functions to cell migration and adhesion has already been reviewed (213). Differences in subcellular localizations and effects on cancer development have led to the conclusions that an association exists between high cytoplasmic levels and invasive cancers, whereas the localization of an active form to the nucleus and its interaction with transcription factors is linked to a decrease in invasiveness (214). Consequently, as a promising prospect, drugs that can transpose FLNA from the cytoplasm to nucleus are currently under development (215). Myosin 9 has recently attracted the attention of oncology researchers, as it has been found that this protein, classified as a cytokine involved in cytoskeletal reorganization and coded by a suppressor gene, plays an important role in the formation of cellular pseudopodia and is closely related to the progression and a poor prognosis of the majority of solid tumors (216). Finally, the tropomyosin isoform Tpm2.1 is considered to be another tumor suppressor, regulating the sensitivity to apoptosis beyond anoikis (217). Indeed, both the mRNA expression and protein levels of this molecule have been shown to be significantly decreased in colorectal cancer compared with paired adjacent normal tissue (218). In breast cancer, the downregulated expression of this protein is due to its promoter methylation, induced by hypoxia, leading to cell invasiveness, poor prognosis and chemoresistance (219). In another independent study, Shin et al found that the loss of Tpm2.1 increased the efficiency of migration of breast cancer cells out of the spheroids on different coated extracellular matrices (220). Finally, Mitchell et al completed these investigations by demonstrating the mechanisms through which the loss of this high molecular weight tropomyosin induced glioblastoma cell spreading and elongation in soft 3D hydrogels, recapitulating the biomechanical architecture of the brain (221).

Membrane-cytoskeleton interactions

According to Uniprot, among the cytoskeletal proteins in the list of potential biomarkers of invasiveness in the present review, three co-localize in the membrane: Cofilin-1, ezrin and vinculin. Cofilin-1 belongs to the actin-depolymerizing family of proteins, which is essential for the dynamic changes in the actin cytoskeleton associated with the reorganization of cellular shape during the acquisition of invasiveness (222). A number of investigations have led to the conclusion that cofilin-1 expression increases in relation to cell cycle progression, migration, intravasation and the invasion of cancer cells. Gasparski et al revealed how the maturation of invadopodia, these actin-rich structures present in invasive cancer cells which degrade the surrounding ECM to facilitate invasion, was related to the downregulation of integrin β3 expression, leading to an increase in cofilin activity (223). However, biphasic effects between the cofilin level and locomotory rate have also been observed, suggesting that the process will proceed via a complex dose- and time-dependent manner, a partially documented complex mechanism that still needs to be clarified (224). Ezrin, which belongs to the ERM family, interacts with membrane proteins by organizing membrane-cytoskel-eton-associated complexes, thus creating specialized membrane domains, and also promotes tumor metastasis (225). The role of ezrin in the mechanism of the activation of the Wnt-β-catenin signaling pathway in the context of colorectal cancer has been well documented (226). However, although ezrin is clearly associated with a poor prognosis and metastasis in different cancer types, Cihan pointed out that contradictory results remain as regards the association between ezrin expression and clinicopathological features or prognostic parameters, suggesting that this field of research requires further investigations before evaluating ezrin-based therapies (227). The third protein of interest, vinculin, couples the ECM to the acto-myosin cytoskeleton via β-integrins and paxillin, and thus acts as a mechano-coupling and mechano-regulating protein (228). As an orchestrator of mechanical signaling events, its deregulation has important consequences on cell adhesion, contractility, motility and growth, all of which are crucial in the process of cancer metastasis (229). A positive association between estrogen receptor alpha and vinculin expression has also been demonstrated in breast cancer cells (230).

Cytoskeleton-nucleus interactions

According to Uniprot, among cytoskeletal proteins in the list of potential biomarkers of invasiveness in the present review, four of these interact with the nucleus: Nucleophosmin (NPM), heat shock protein β1 (HspB1), actinin-4 (ACTN4) and vimentin. NPM, an ubiquitous phosphoprotein belonging to the nucleoplasmin family of chaperones, is mainly localized in the nucleolus, a proportion of which continuously shuttles between the nucleus and the cytoplasm (231). Andersen et al initially identified this protein in a mass-spectrometry-based proteomic analysis of human centrosomes in the interphase of the cell cycle (232). NPM is involved in numerous pathways, including mRNA transport, chromatin remodeling and genome stability; however, Box et al demonstrated that its multifunctional role within the cell also included DNA repair pathways and the regulation of apoptosis in cancers (233). Of note, Werner et al reported the discovery of a translocation of the anaplastic lymphoma kinase gene (ALK) with the promoter region and a proximal domain of the NPM gene (NPM1) on chromosome 5q35, yielding a chimeric protein that modulates numerous genes involved in the evasion of the antitumor immune response, protection from hypoxia, angiogenesis, DNA repair, and cell migration and invasiveness (234). HspB1 (also known as Hsp-27) is a chaperone that regulates a number of fundamental cellular processes, and whose structural organization presents dynamic and complex rearrangements in response to changes in the cellular environment. Its sophisticated anti-apoptotic role acts both upstream and downstream of the mitochondria, the former effect occurring by alterations in F-actin or nucleus architecture integrity (235). Xie et al recently discovered a novel interaction between HspB1 and ezrin: The knockdown of HspB1 resulted in a decreased phosphorylation at ezrin Thr567, thus markedly suppressing the ability of ezrin to bind to the actin cytoskeleton, leading to the migration of esophageal squamous cell carcinoma cells (236). ACTN4 is a non-muscle isoform of α-actinin initially found concentrated in the cytoplasm of breast cancer cells migrating and located at the edge of cell clusters (239). Subsequently, Hayashida et al revealed that the β-catenin and actinin-4 complex was highly concentrated in actin-rich protrusions at the peripheries of cell clusters, and that their colocalization in the nucleus, repressing E-cadherin expression, induced cancer invasion (238). Thomas and Robinson also demonstrated that although the two α-actinin isoforms 1 and 4 share regulatory mechanisms, actinin-4 exhibits a unique mechanosensory regulation, which warrants further, more detailed investigation (239). In parallel, Yamaguchi et al observed that the overexpression of actinin-4, but not that of actinin-1, significantly promoted the formation of invadopodia by carcinoma cells (240). The study by Keeling et al clearly demonstrated the mechansims through which cytoskeletal organization regulated nuclear state, revealing that stress fibers and intermediate filaments modulated the mechanical properties of the nucleus and also chromatin condensation, and that actin and vimentin regulated chromatin condensation in an opposite manner (241). In a review on the role of vimentin in cell migration, Battaglia et al discussed how recent advances in microscopic techniques, combined with computational image analysis, have helped reveal the mechanisms through which vimentin promotes directed cell migration by coordinating the dynamics of actin filaments and microtubules (242).

Cytoskeleton-ECM interactions

Actin remodeling in cancer cells may be the result of the inactivation of several important actin-binding proteins such as gelsolin (243). Interest for this protein has increased as it is not only found in the cytoplasm, but also in the extracellular environment, providing future prospects for studies on its prognostic potential (244).

8. Endoplasmic reticulum

The endoplasmic reticulum (ER) is directly concerned with four potential biomarkers of invasiveness in the list in the present review, calnexin (CALX), BIP, serpin H1 and PDIA4 (Fig. 3), but also with four additional proteins, CALR, CLUS, TGM2 and CATD, which can be found in multiple cell compartments (Fig. 2). Calnexin is a resident chaperone of the ER which was initially identified as an important protein involved in the reduction or suppression of antigen presentation by the major histocompatibility complex (MHC) on tumor cells (245). In a very interesting study aimed at identifying proteins preferentially expressed in a poor prognosis group of patients with lung adenocarcinoma relative to a good prognosis group (exhibiting no recurrence), Okayama et al reported that calnexin was preferentially expressed in the former, and this result was further confirmed in a cell-culture model (246). Ryan et al subsequently confirmed the prognostic significance of CALX in colorectal cancer (247). Together with calreticulin, calnexin serves as a molecular chaperone, which prevents the aggregation and export of incompletely folded proteins from the ER, a mechanism involved in the metastatic progression of tumors (248). As CALX can escape from the ER and be transported to the plasma membrane or released outside the cell, its impact on the human immune system was recently investigated by Chen et al, who reported that its upregulation was associated with the inhibition of T-cell infiltration in tumor tissues (249). The binding immunoglobulin protein (BIP) is an ER-lumenal polypeptide chain binding protein, which belongs to the heat shock protein 70 family and interacts with numerous partners. Evidence of its role in various types of cancer began to emerge a decade ago (250). BIP is an essential factor of the translocation machinery for protein import into the ER; it regulates Ca2+ homeostasis in the ER, facilitates ER-associated protein degradation, and can initiate the unfolded protein response, inducing autophagy and crosstalking with the apoptosis machinery to assist in the cell survival decision (251). Among the numerous contributions of BIP to cancers found in the literature, Herroon et al interestingly found a link between hemeoxygenase (HO), an inducible enzyme involved in the resistance of cells against oxidative stress, whose overexpression is associated with aggressiveness, and the upregulation of BIP (252). The impact BIP on the success of anticancer therapies has been described by Chen et al, who reported that the inhibition of the hexosamine biosynthesis pathway resulted in the downregulation of BIP, which exacerbated cisplatin-induced non-small-cell lung cancer apoptosis (253). Serpin H1 (Hsp47) specifically binds to procollagen as a resident protein of the ER, and dissociates from it in the cis-Golgi to allow fibril formation (254). A recent review of studies on this protein in the context of cancer has revealed that it plays a role in numerous steps of collagen synthesis, promoting tumor angiogenesis, growth, migration and metastatic capacity (255). Protein disulfide isomerases (PDIs), which correct the arrangement of disulfide bonds in the ER through reductase, oxidase and isomerase functions, are implicated in the development of certain types of cancer, leading to the development of PDI inhibitors as potential novel anticancer therapies (256). Among the 21 members of this family documented so far in mammals, four of these, including PDIA6 and PDIA4, are upregulated in a variety of tumor cells. A mechanistic study demonstrated the mechanisms through which PDIA4 negatively regulates tumor cell death by inhibiting degradation and the activation of procaspases 3 and 7 via their mutual interaction (257). Calreticulin (CALR), belonging to the damage-associated molecular patterns (DAMPs), is involved in the immunogenicity of cell death, but also represents a major predictor of a better prognosis in various types of cancer (258). However, it also plays an additional role as a pro-tumorigenic multifunctional ER protein with variable distribution as it has been shown to promote the progression of pancreatic cancer cells via the integrin/EGFR-ERK/MAPK pathway (259). The increased complexity of the functions of this pleiotropic protein was recently illustrated by the discovery that CALR functions outside the ER where its translocation to the cell membrane serves as an 'eat me' signal, promoting a silenced immune response (efferocytosis), while its effects on cytokine production are dependent on its conformation (260). Clusterin is a highly glycosylated protein initially described as a cytoprotective chaperone-like molecule controlling cell-cell and cell-matrix interactions including adhesion (138). Subsequently, a link between this protein and TSP1 was established in the regulation of MMP9 when tumors cells interact with platelets in colonic cancer invasion (132). Clusterin has been shown to facilitate metastasis in hepato-cellular carcinoma through the formation of complexes with eukaryotic translation initiation factor 3 subunit (EIF3I) and the activation of the Akt pathway, promoting the expression of MMP13 (261). Additionally, Shapiro et al reported that CLUS was overexpressed in metastatic human colorectal cancer cells in association with the presence of stem cells (262), while Liu et al reported that the overexpression of clusterin promoted the invasiveness of clear cell renal carcinoma cells, an effect mediated by S100A4 (263).

9. Mitochondrion and nucleus

Four proteins in the list of biomarkers of invasiveness in the present review are described as macromolecules localized both in the mitochondria and nucleus, while five are restricted to the mitochondria and another five are restricted to the nucleus (Fig. 5).
Figure 5

Mitochondrial and nuclear biomarkers and their associations with other cell compartments.

Proteins shared by the two subcellular structures

The first protein in this category is the π isoform of glutathione-S-transferase P1 (GSTP1). It belongs to a subgroup of the GST family initially involved in cellular protection against free radical and carcinogenic compounds (264). However, other functions have been discovered for this protein, including the maintenance of cellular redox homeostasis, the downregulation of which has been associated with a poor prognosis (265). 14-3-3 proteins (their names are derived from the elution profile on HPLC) are recognition structures at or near DNA replication. They are cell cycle-regulated, being maximal at the G1-S phase and minimal at the G0-G1 phase (266). This family consists mainly of seven isoforms present in mammals whose dysregulated expression contributes to tumorigenesis in different types of cancer. In particular, the downregulation of the ε isoform (1433E) has been associated with lung (267) and gastric tumorigenesis (268). Although an increase or decrease may occur according to cancer type, leading to some confusion, high levels of the ε isoform have been shown to predict a poor two-year overall survival of a group of chemotherapy-resistant compared with chemotherapy-sensitive patients (269). Prohibitins are important intercellular communicators between the nucleus and mitochondria, and their many functions are highly dependent on their localization (270). Both prohibitins localize to the inner membrane, functioning as mitochondrial chaperones, although they are also present in the plasma membrane and nucleus where they act in membrane signaling and independently as transcriptional repressors of target genes, respectively (271-273). As regards the implication of PHB2 in transcription, Zhou et al recently described its substantial localization in the nucleolus, where it maintains nucleolar morphology, while promoting tumor proliferation and probably repressing differentiation in rhabdomyosarcoma cells (273). Mechanistically, PHB2 has also been reported to promote prostate cancer cells by inhibiting the expression of AKT serine/threonine kinase 2 (274). Finally, Yan et al reported that PHB2 also mediates mitophagy (275). GRP75 (also known as Hsp70 or mortalin) plays a major role in the import and refolding of mitochondrial proteins, representing a potential serum biomarker of high prognostic value for patients with colorectal cancer (276). Additionally, Cruz et al demonstrated that this protein is a candidate biomarker of drug-resistant disease in ovarian cancer cell lines and tissues (277), while Niu et al described its involvement in modulating oncogenic Dbl-driven endocytosis (278).

Mitochondrial proteins

Among the 16 different subunits composing the ATP synthase, two were initially reported to be overexpressed in cancers and to be associated with histological grade (279), the α-subunit, a major component of the catalytic F1 head, and the d-subunit, a major component of the F0 membranous domain (280). Cytochrome c oxidase subunit 5A (COX5A) was identified in a list of 16 mitochondrial differentially expressed proteins involved in mitochondrial oxidative stress that contribute to nasopharyngeal carcinoma metastasis (281). The increased abundance of this mitochondria-bound protein in tumorous areas and its role in the migration and invasion of non-small cell lung carcinoma cells was confirmed by Chen et al, who demonstrated that its knockdown by siRNA decreased the migration and invasion of the tumor cells (282). Single-stranded DNA binding protein (SSBP) is another specific mitochondrial protein that maintains the structural stability of the mitochondrial genome by binding to single-stranded mtDNA. This protein regulates mitochondrial function and metabolism, and its level correlates with cancer cell aggressiveness. Thus, novel treatment strategies aimed at its downregulation have been proposed to increase the accumulation of ROS, decrease key glycolytic enzymes and finally, enhance the radiosensitivity of lung cancer cells (283). The role of this protein in the regulation of the base excision repair pathway (284) and in the protection against DNA damage events (285) has recently been investigated.

Nuclear proteins

In addition to Ki-67, proliferating cell nuclear antigen (PCNA) has been used in immunohistochemistry experiments, demonstrating that these two proteins are associated with each other, and with tumor grade and stage (286). Its value as an independent predictor of histological grade, recurrence rate and prognosis were subsequently confirmed in a number of studies on gastric cancer (287), hepatocellular carcinoma (288) and non-small cell lung cancer (289), highlighting in particular its interest, in association with p53, for the characterization of the invasive front of carcinomas (290). The development of chromatin immunoprecipitation (ChIP) has been crucial to the study of protein-DNA interactions, leading in particular to the elucidation of the role of the A family of type V intermediate filaments (LMNA) (291) in maintaining the positional stability of DNA repair foci in mammalian nuclei (292). As the principal component of the lamina, the meshwork of proteins at the nucleoplasmic side of the inner nuclear membrane, lamins provide mechanical steadiness to the cell nucleus by protecting it from mechanical forces (293). Kim et al described the mechansims through which lamin A/C mediates the formation of the perinuclear apical actin cables to protect the nuclear structural integrity (294). In complement to that study, Taheri et al also demonstrated the role of lamin A in determining the viscoelasticity of the chromatin network (295). Notably, Zuo et al recently reported that differences in lamin A/C expression patterns between high and low Gleason scores in prostate cancer tissues was not associated with LMNA mutations, but rather with EMT or MET processes (296). The past few years have seen a growing interest in the study of keratin-cancer associations, with some important recent reviews on this subject (297,298). In particular, certain researchers have focused on cytokeratins 8 (K2C8) and 18 (K1C18), which allow enrichment in circulating tumor cells (299). The fact that K2C8 constitutes an important part of the cytoskeleton and is involved in the migration, invasion and metastasis of small-cell lung carcinoma cells led Erlandsson et al to develop a novel treatment protocol for this type of cancer based on the use of an anti-keratin 8 antibody (300). A high expression of K1, K8 and K18 has been associated with a poor survival and a higher risk of recurrence (301), suggesting that these keratins function as sensors of changing epithelia (297). Finally, the K8/K18 pair has been demonstrated to modulate α6β4 integrin-mediated signaling with an impact on cancer progression (302). Perturbations of chromatin remodeling complexes have been well-documented in malignant progression, in particular when EMT is involved (303). In these studies, H4 can undergo post-translational modifications, as it belongs to the four types of core histones forming the octamer units of nucleosome core particles (304). Together with H3, H4 is also subject to a wide variation in the abundance of acetylation of its lysine residues during the reprogramming of somatic cells (fibroblasts) into induced pluripotent stem cells (305).

10. Miscellaneous

The following two proteins have been suggested to be localized in the cytoplasm, although to date, there is no further documentation in the literature. Another isoform of the 14-3-3 family of proteins mentioned above, 1433Z, which binds to several different enzymes and may reduce apoptosis, has been associated with induction of tumorigenesis in mice (306). Notably, the proteomic characterization of the tumor-promoting rearrangements of the lungs in a model of metastatic breast cancer in the mouse revealed that 1433Z was included in a small list of proteins differentially expressed at a stage corresponding to secretion of tumor-derived factors (307). Dihydropyrimidinase-like 3 (DPYL3), which interacts witih ezrin, was first identified as a biomarker differentially expressed in several pancreatic ductal adenocarcinoma cell lines originating from liver metastasis in contrast to others originating from lymph node metastasis and the primary tumor (308). Of note, recent findings by Matsunuma et al revealed that the dysregulation of this protein was specific to a subset of triple-negative breast cancers characterized by low expression of claudins and E-cadherin and high levels of mesenchymal biomarkers (309), while Yang et al demonstrated that its inhibition promoted the metastasis of lung cancer (310).

11. Extension to and links with other proteins of interest not included in the list

The stringent methodology used in the present review to select biomarkers of interest led to the exclusion of a number of important candidates from the list. To extend the approach of the present review to additional molecules reported in the recent literature and which are involved in the cancer metastatic process, below, three examples are provided that may offer new perspectives. A first example is represented by S100B, another member of the S100 family discussed above in the section entitled 'S100 proteins'. This protein has been a subject of cancer research since 1983 and since then, 377 references have been published, with a continuous increased observed from 1995. The reason for this interest is based on the observation that S100B improves the early diagnosis, staging and prognosis of malignant melanoma (311,312), and that its high expression promotes self-renewal and tumorigenicity in ovarian cancer stem cells (313). Moreover, the serum S100B level has been shown to be independently associated with a poor outcome of patients with metastatic breast cancer (314), while treatment with S100B inhibitors blocks glioma growth through the alteration of the polarization and trafficking of tumor-associated myeloid-derived cells (315). Notably, although S100B was not detected in the data described in the present review, a recent study based on an integrative analysis of transcriptomic and proteomic data of MCF7 cells submitted to acid adaptation reported high expression levels of both S100B and S100A6 in the course of EMT process affecting breast cancer cells (316). Additionally, proteomics and microarray data from breast cancer patients have revealed a shorter long-term survival in two subsets of patients with a combined high expression of S100B, kallikrein and S100A7 or S100A14 - S100A16 (317). In association with S100B (318), and Annexin A2 (please also see section above entitled 'Annexins and galectins'), a second example is provided by the transmembrane receptor for advanced glycation end products (RAGE). This protein, a member of the immunoglobulin superfamily known since 1949, has been the subject of growing interest in cancer research since 2001. It is a receptor for HMGA1, increasing the migration and invasion of triple-negative breast cancer cells (319). Other RAGE ligands, such as HMGB1 (320), or S100P, S100A8 and S100A9 (321), enhance its expression, which may further lead to increased proliferation, migration and metastasis. The role of RAGE in cancer initiation and progression is now well recognized and underpins therapeutic strategies aimed at reducing AGE levels (322). The complex molecular functions of RAGE-ligand signaling arrays appear to be associated with all the hallmarks of cancer(323,324). Of note, although RAGE was not detected in either our experimental or clinical proteomic data, an increasing abundance of HMGA1 was previously found within the three rat models of malignant mesothelioma exhibiting increasing levels of invasiveness (21), a feature which is in agreement with the association described between RAGE and this nuclear protein (319). The third example is represented by the bZIP transcription factor, nuclear factor erythroid 2-related factor 2 (NRF2), initially highlighted in studies on chemopreventive agents during investigations of the benefit conferred by consumption of fruits and vegetables on the reduced incidence of cancer (325). Interest for this protein in cancerology grew considerably, beginning in 2004, and then exponentially since 2010, reaching as many as 555 references in 2019. Recent reviews have highlighted the importance of its modulation for cancer chemoprevention and therapy (326), and the crucial role of the disruption of KEAP1 binding to NRF2 (327). To date, the NRF2 pathway represents a driver of cancer progression, metastasis and resistance to therapy (328). To provide mechanical insight into the implications of this pathway in lung cancer metastasis, Lignitto et al dissected the molecular events regulating the simultaneous loss of KEAP1 and stabilization of the transcriptional regulator Bach1 (329). These authors demonstrated that free heme promoted the physical interaction between Bach1 and Fbxo22, a substrate receptor of the CRL1 complex, by inducing Ho1, a heme-catabolizing enzyme (329). Subsequently, Wiel et al revealed that the reduction of free heme by long-term supplementation with antioxidants also stimulated metastasis by stabilizing Bach1, leading to increased glycolysis rates (330). Finally, an interesting point with regard to NRF2 is the increased abundance in CD44 antigen found in a most aggressive model of rat MM (M5-T1), compared to the two less invasive ones (F4-T2 and F5-T1) (21). This suggests a link with HMGA1 mentioned above (320), and illustrates the importance of the CD44-NRF2 axis described in breast cancer stem cell-like cells (331).

12. Conclusions

The aim of the present review was to establish a list of potential biomarkers of cancer invasiveness at the crossroads between literature data and experimental and clinical data, which may provide the groundwork for both basic science and translational studies. Although a number of other proteins remain outside the focus of this review, the main point is that this list represents quantitative changes which are common to different cancer types and locations. In the field of basic science, a first question concerns a number of biomarkers that are increased or decreased in most/all situations and which could therefore represent an important tool with which to understand the biological system considered as a whole (at the cancer cell or tumor scale). This approach may assist in the detection of defects within the network, the stoichiometry of components and their connectivity. Another question is why some of these biomarkers exhibit various quantitative patterns of change (increase or decrease) according to the different situations considered. On the translational side, the use of combined biomarkers could contribute to the diagnosis and prognosis of certain types of aggressive cancers, for example malignant mesothelioma, for which improvements are urgently required. This tool could also help to define and evaluate more accurate therapeutic strategies. Finally, a question which is beyond the scope of this review, is how it would be possible to prioritize these different biomarkers in a given context. This will certainly offer interesting prospects in this fascinating field of research.
  329 in total

1.  Activated leukocyte cell adhesion molecule (ALCAM/CD166/MEMD), a novel actor in invasive growth, controls matrix metalloproteinase activity.

Authors:  Pim C Lunter; Jeroen W J van Kilsdonk; Hanneke van Beek; Ine M H A Cornelissen; Mieke Bergers; Peter H G M Willems; Goos N P van Muijen; Guido W M Swart
Journal:  Cancer Res       Date:  2005-10-01       Impact factor: 12.701

2.  Loss of keratins 8 and 18 leads to alterations in α6β4-integrin-mediated signalling and decreased neoplastic progression in an oral-tumour-derived cell line.

Authors:  Hunain Alam; Samrat T Kundu; Sorab N Dalal; Milind M Vaidya
Journal:  J Cell Sci       Date:  2011-05-24       Impact factor: 5.285

Review 3.  Structure and Mechanisms of F-Type ATP Synthases.

Authors:  Werner Kühlbrandt
Journal:  Annu Rev Biochem       Date:  2019-03-22       Impact factor: 23.643

Review 4.  Chromatin structures condensed by linker histones.

Authors:  Bing-Rui Zhou; Yawen Bai
Journal:  Essays Biochem       Date:  2019-04-23       Impact factor: 8.000

Review 5.  Potential role of Anxa1 in cancer.

Authors:  Chunmei Guo; Shuqing Liu; Ming-Zhong Sun
Journal:  Future Oncol       Date:  2013-11       Impact factor: 3.404

6.  Invasiveness and MMP expression in pancreatic carcinoma.

Authors:  X Yang; E D Staren; J M Howard; T Iwamura; J E Bartsch; H E Appert
Journal:  J Surg Res       Date:  2001-06-01       Impact factor: 2.192

7.  Mutant p53 promotes tumor progression and metastasis by the endoplasmic reticulum UDPase ENTPD5.

Authors:  Fotini Vogiatzi; Dominique T Brandt; Jean Schneikert; Jeannette Fuchs; Katharina Grikscheit; Michael Wanzel; Evangelos Pavlakis; Joël P Charles; Oleg Timofeev; Andrea Nist; Marco Mernberger; Eva J Kantelhardt; Udo Siebolts; Frank Bartel; Ralf Jacob; Ariane Rath; Roland Moll; Robert Grosse; Thorsten Stiewe
Journal:  Proc Natl Acad Sci U S A       Date:  2016-12-12       Impact factor: 11.205

8.  Cervical Cancer Cell Line Secretome Highlights the Roles of Transforming Growth Factor-Beta-Induced Protein ig-h3, Peroxiredoxin-2, and NRF2 on Cervical Carcinogenesis.

Authors:  Georgia Kontostathi; Jerome Zoidakis; Manousos Makridakis; Vasiliki Lygirou; George Mermelekas; Theofilos Papadopoulos; Konstantinos Vougas; Alexios Vlamis-Gardikas; Peter Drakakis; Dimitrios Loutradis; Antonia Vlahou; Nicholas P Anagnou; Kalliopi I Pappa
Journal:  Biomed Res Int       Date:  2017-02-02       Impact factor: 3.411

9.  The role of peroxiredoxins in cancer.

Authors:  Arianna Nicolussi; Sonia D'Inzeo; Carlo Capalbo; Giuseppe Giannini; Anna Coppa
Journal:  Mol Clin Oncol       Date:  2017-01-10

10.  Nuclear lamin A/C harnesses the perinuclear apical actin cables to protect nuclear morphology.

Authors:  Jeong-Ki Kim; Arghavan Louhghalam; Geonhui Lee; Benjamin W Schafer; Denis Wirtz; Dong-Hwee Kim
Journal:  Nat Commun       Date:  2017-12-14       Impact factor: 14.919

View more
  4 in total

1.  Unveiling a Hidden Biomarker of Inflammation and Tumor Progression: The 65 kDa Isoform of MMP-9 New Horizons for Therapy.

Authors:  Rocco Rossano; Marilena Larocca; Margherita Macellaro; Domenico Bilancia; Paolo Riccio
Journal:  Curr Issues Mol Biol       Date:  2021-12-25       Impact factor: 2.976

2.  Molecular Alterations Associated with Acquired Drug Resistance during Combined Treatment with Encorafenib and Binimetinib in Melanoma Cell Lines.

Authors:  Vikas Patel; István Szász; Viktória Koroknai; Tímea Kiss; Margit Balázs
Journal:  Cancers (Basel)       Date:  2021-12-01       Impact factor: 6.639

3.  ANXA2 is correlated with the molecular features and clinical prognosis of glioma, and acts as a potential marker of immunosuppression.

Authors:  Kaiming Ma; Xin Chen; Weihai Liu; Yang Yang; Suhua Chen; Jianjun Sun; Changcheng Ma; Tao Wang; Jun Yang
Journal:  Sci Rep       Date:  2021-10-21       Impact factor: 4.379

Review 4.  Curcuminoids as Modulators of EMT in Invasive Cancers: A Review of Molecular Targets With the Contribution of Malignant Mesothelioma Studies.

Authors:  Daniel L Pouliquen; Alice Boissard; Cécile Henry; Olivier Coqueret; Catherine Guette
Journal:  Front Pharmacol       Date:  2022-07-08       Impact factor: 5.988

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.