Ioannis Kyrou1, Emmanouil Karteris2, Periklis Katopodis2, Rachel Kerslake2, Julie Davies2, Harpal S Randeva1, Kamaljit Chatha3, Marcia Hall2, Demetrios A Spandidos4, Vladimir Anikin2, Andreas Polychronis5, Jan L Robertus6. 1. Warwickshire Institute for the Study of Diabetes, Endocrinology and Metabolism (WISDEM), University Hospitals Coventry and Warwickshire NHS Trust, Coventry CV2 2DX, UK. 2. Department of Life Sciences, College of Health, Medicine and Life Sciences, Brunel University London, Uxbridge UB8 3PH, UK. 3. Warwick Medical School, University of Warwick, Coventry CV4 7AL, UK. 4. Laboratory of Clinical Virology, Medical School, University of Crete, 71409 Heraklion, Greece. 5. Mount Vernon Cancer Centre, Middlesex HA6 2RN, UK. 6. National Heart and Lung Institute, Imperial College London, London SW3 6LY, UK.
Abstract
Severe acute respiratory syndrome (SARS) coronavirus‑2 (SARS‑CoV‑2), the causative viral agent for the ongoing COVID‑19 pandemic, enters its host cells primarily via the binding of the SARS‑CoV‑2 spike (S) proteins to the angiotensin‑converting enzyme 2 (ACE2). A number of other cell entry mediators have also been identified, including neuropilin‑1 (NRP1) and transmembrane protease serine 2 (TMPRSS2). More recently, it has been demonstrated that transmembrane protease serine 4 (TMPRSS4) along with TMPRSS2 activate the SARS‑CoV‑2 S proteins, and enhance the viral infection of human small intestinal enterocytes. To date, a systematic analysis of TMPRSS4 in health and disease is lacking. In the present study, using in silico tools, the gene expression and genetic alteration of TMPRSS4 were analysed across numerous tumours and compared to controls. The observations were also expanded to the level of the central nervous system (CNS). The findings revealed that TMPRSS4 was overexpressed in 11 types of cancer, including lung adenocarcinoma, lung squamous cell carcinoma, cervical squamous cell carcinoma, thyroid carcinoma, ovarian cancer, cancer of the rectum, pancreatic cancer, colon and stomach adenocarcinoma, uterine carcinosarcoma and uterine corpus endometrial carcinoma, whilst it was significantly downregulated in kidney carcinomas, acute myeloid leukaemia, skin cutaneous melanoma and testicular germ cell tumours. Finally, a high TMPRSS4 expression was documented in the olfactory tubercle, paraolfactory gyrus and frontal operculum, all brain regions which are associated with the sense of smell and taste. Collectively, these data suggest that TMPRSS4 may play a role in COVID‑19 symptomatology as another SARS‑CoV‑2 host cell entry mediator responsible for the tropism of this coronavirus both in the periphery and the CNS.
Severe acute respiratory syndrome (SARS) coronavirus‑2 (SARS‑CoV‑2), the causative viral agent for the ongoing COVID‑19 pandemic, enters its host cells primarily via the binding of the SARS‑CoV‑2 spike (S) proteins to the angiotensin‑converting enzyme 2 (ACE2). A number of other cell entry mediators have also been identified, including neuropilin‑1 (NRP1) and transmembrane protease serine 2 (TMPRSS2). More recently, it has been demonstrated that transmembrane protease serine 4 (TMPRSS4) along with TMPRSS2 activate the SARS‑CoV‑2 S proteins, and enhance the viral infection of human small intestinal enterocytes. To date, a systematic analysis of TMPRSS4 in health and disease is lacking. In the present study, using in silico tools, the gene expression and genetic alteration of TMPRSS4 were analysed across numerous tumours and compared to controls. The observations were also expanded to the level of the central nervous system (CNS). The findings revealed that TMPRSS4 was overexpressed in 11 types of cancer, including lung adenocarcinoma, lung squamous cell carcinoma, cervical squamous cell carcinoma, thyroid carcinoma, ovarian cancer, cancer of the rectum, pancreatic cancer, colon and stomach adenocarcinoma, uterine carcinosarcoma and uterine corpus endometrial carcinoma, whilst it was significantly downregulated in kidney carcinomas, acute myeloid leukaemia, skin cutaneous melanoma and testicular germ cell tumours. Finally, a high TMPRSS4 expression was documented in the olfactory tubercle, paraolfactory gyrus and frontal operculum, all brain regions which are associated with the sense of smell and taste. Collectively, these data suggest that TMPRSS4 may play a role in COVID‑19 symptomatology as another SARS‑CoV‑2 host cell entry mediator responsible for the tropism of this coronavirus both in the periphery and the CNS.
Severe acute respiratory syndrome (SARS) coronavirus-2 (SARS-CoV-2) is the causative viral agent for the ongoing COVID-19 pandemic, with >75 million infected individuals and >2.38 million deaths worldwide, up to the writing of the present study (February 14, 2021) (1,2). During the writing of the present manuscript, a glimpse of hope appeared on the horizon with vaccinations being developed and administered globally (3-7).It is now well-established that the entry of SARS-CoV-2 into cells is facilitated by its spike (S) proteins, mainly through binding to the angiotensin-converting enzyme 2 (ACE2) (8,9). Moreover, the SARS-CoV-2 S proteins are primed/activated by the transmembrane protease serine 2 (TMPRSS2), which appears to also play a key role in this viral infection (8,10,11). As such, there is increasing interest in identifying additional molecular mediators that may also facilitate the SARS-CoV-2 infection of host cells and promote further adverse COVID-19 symptoms. Accordingly, neuropilin-1 (NRP1) has been identified as a novel cellular mediator implicated in the SARS-CoV-2 infection (12-14), whilst similar evidence has also more recently emerged for TMPRSS4, another transmembrane protease. Indeed, TMPRSS4 - along with TMPRSS2 - appear to activate the SARS-CoV-2 S proteins, and enhance subsequent viral infection of human small intestinal enterocytes (15). These data from Zang et al, provide evidence that the intestine is an additional target organ for SARS-CoV-2 (15). Moreover, a recent study demonstrated that both ACE2 and TMPRSS2 genes were abundantly expressed in enterocytes of the lower gastrointestinal (GI) tract, whilst also exhibiting co-expression with TMPRSS4, particularly in the small intestine (16). Of note, whereas COVID-19 presentation usually includes fever, cough, pulmonary/respiratory symptoms, and loss of taste and smell (17,18), an increasing number of studies has also reported GI-related symptoms (e.g., diarrhoea, nausea and vomiting) in patients with COVID-19 (19-21). Indeed, a proportion of these patients with extra-pulmonary symptoms do not exhibit other respiratory symptoms (22).Apart from respiratory and GI-related symptoms, it is now evident that patients with COVID-19 may experience a wide repertoire of neurological symptoms and complications (23-28), suggesting that SARS-CoV-2 can also attack the central nervous system (CNS), potentially via additional cell entry mediators, including NRP1 (14,29). Indeed, in silico analysis indicated that certain key mediators which facilitate the entry of SARS-CoV-2 into host cells, including ACE2, Cathepsin L (CTSL), TMPRSS2 and TMPRSS4, are expressed in the CNS, with the two transmembrane proteases being highly expressed in neurons (30).Finally, increasing evidence suggests that cancer constitutes an additional risk factor for severe COVID-19infection, with hospitalised patients with COVID-19 (particularly males and those receiving treatment with chemotherapy) exhibiting a poor prognosis and being associated with a high case-fatality rate (31). The authors of the present study, as well as other researchers have documented the differential expression of SARS-CoV-2 infection mediators in malignant states (11,32). TMPRSS2 in particular, was previously found to be significantly upregulated in rectum adenocarcinoma, as well as in prostate, breast and lung cancer (11). Notably, compared to never-smokers, a similar ACE2 and TMPRSS2 expression, but a higher TMPRSS4 expression, has been found in the human bronchial epithelial cells of current smokers when compared to never-smokers (33).To date, there is a paucity of published data about the expression profile of TMPRSS4 in cancer. Therefore, present study curated pan-cancer data in terms of TMPRSS4 gene expression using in silico approaches. The TMPRSS4 expression profile is also further expanded by providing more evidence regarding the presence of this transmembrane protease in the CNS and GI track.
Data and methods
Bioinformatics analysis
The expression analysis of TMPRSS4 was validated through the Genotype-Tissue Expression (GTEx, www.gtexportal.org) and GEPIA (gepia.cancer-pku.cn). Information regarding the TCGA cohort pan-cancer data was acquired through cBioPortal (www.cbioportal.org/) and single cell analysis was done by using the Single Cell Portal (singlecell.broadinstitute.org/single_cell). Datasets accessed for pan-cancer analysis were the following: ACC, adrenocortical carcinoma; BLCA, bladder urothelial carcinoma; BRCA, breast invasive carcinoma; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; CHOL, cholangiocarcinoma; COAD, colon adenocarcinoma; DLBC, lymphoid neoplasm diffuse large B cell lymphoma; ESCA, oesophageal carcinoma; GBM, glioblastoma multiforme; HNSC, head and neck squamous cell carcinoma; KICH, kidney chromophobe; KIRC, kidney renal clear cell carcinoma; KIRP, kidney renal papillary cell carcinoma; LAML, acute myeloid leukaemia; LGG, brain lower grade glioma; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; MESO, mesothelioma; OV, ovarian serous cystadenocarcinoma; PAAD, pancreatic adenocarcinoma; PCPG, pheochromocytoma and paraganglioma; PRAD, prostate adenocarcinoma; READ, rectum adenocarcinoma; SARC, sarcoma; SKCM, skin cutaneous melanoma; STAD, stomach adenocarcinoma; TGCT, testicular germ cell tumours; THCA, thyroid carcinoma; THYM, thymoma; UCEC, uterine corpus endometrial carcinoma; UCS, uterine carcinosarcoma; and UVM, uveal melanoma. The visualisation of TMPRSS4 expression in the human brain was performed using the Allen brain atlas with 6 human donors, as assessed by microarray and presented as a heatmap (34). These data were acquired from a publicly available source which underwent all appropriate approvals by the Human Investigation Committees and Institutional Ethics Committees of each institute from which samples were obtained.
Statistical analysis
The method used for differential analysis in the present study was one-way ANOVA, using disease state (Tumour or Normal) as variables for calculating differential expression: Gene expression ~disease state. The expression data are first log2(TPM+1) transformed for differential analysis and the log2FC is defined as median (Tumor) – median (Normal). Genes with higher |log2FC| values and lower q values than pre-set thresholds are considered differentially expressed genes. For more information please visit: http://gepia.cancer-pku.cn/help.html.
Results
Using the TCGA and GTEX datasets, it was demonstrated that, compared to the normal control, TMPRSS4 was upregulated in 11 cancer datasets (i.e., CESC, COAD, LUAD, LUSC, OV, PAAD, READ, STAD, THCA, UCEC and UCS) (Fig. 1), while it was significantly downregulated in 6 datasets (i.e., KICH, KIRC, KIRP, LAML, SKCM and TGCT), with three of the latter consisting kidney tumours (i.e., KICK, KIRC and KIRP) (Fig. 2).
Figure 1
From the 33 TCGA cancer datasets, 11 exhibited a higher TMPRSS4 expression in tumour samples. TMPRSS4 was highly expressed in lung cancers (LUAD and LUSC), gynecological cancers (CESC, OV, UCEC and UCS), pancreatic carcinoma (PAAD), colon adenocarcinoma (COAD), rectum carcinoma (READ), stomach adenocarcinoma (STAD) and thyroid cancer (THCA). T, tumour; N, normal; TPM, transcripts per million; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; OV, ovarian serous cystadenocarcinoma; UCEC, uterine corpus endometrial carcinoma; UCS, uterine carcinosarcoma.
Figure 2
Out of the 33 TCGA cancer datasets, TMPRSS4 was significantly downregulated in 6 datasets (i.e., KICH, KIRC, KIRP, LAML, SKCM and TGCT), with three of these consisting kidney tumours (i.e., KICK, KIRC and KIRP) compared to normal tissues. KICH, kidney chromophobe; KIRC, kidney renal clear cell carcinoma; KIRP, kidney renal papillary cell carcinoma; LAML, acute myeloid leukaemia; SKCM, skin cutaneous melanoma; TGCT, testicular germ cell tumours; T, tumour; N, normal.
Furthermore, using the cBioportal pan-cancer panel, the present study identified specific cancer types with a number of TMPRSS4 amplifications in LGG, LUAD, STAD, deep deletions in BRCA, HNSC, and shallow deletions in BLCA, BRCA, CESC, ESCA, LUAD, LUSC, OV and TGCT. Furthermore, there were two datasets for thyroid cancers with a number of patients presenting diploid and not mutated versions of the gene, namely for THYM and THCA. Of note, in the majority of the studied cancers, the majority of the patients had deletions and partly some gains and amplifications (Fig. 3).
Figure 3
cBioportal analysis of the 32 TCGA datasets with information about the gene dysrégulation. In BLCA, LUAD and LUSC there are a number of shallow deletions, while in BRCA and HNSC, high expression and more deep deletions are observed. In gynaecological cancers (CESC, OV, UCEC and UCS), there is high expression of the TMPRSS4 and a few patients present Gains and Missense 'variants of uncertain significant' (VUS). BLCA, bladder urothelial carcinoma; BRCA, breast invasive carcinoma; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; HNSC, head and neck squamous cell carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; OV, ovarian serous cystadenocarcinoma; UCEC, uterine corpus endometrial carcinoma; UCS, uterine carcinosarcoma.
As aforementioned, there is emerging evidence suggesting abundant SARS-CoV-2 infection of the GI tract in severe COVID-19 cases (35). Herein, these observations were expanded upon using single cell analysis from 51 cell subsets (366,650 cells in total) in the colon mucosa of 18 ulcerative colitis and 12 healthy individuals using the Single Cell Portal (36). Using T-distributed Stochastic Neighbour Embedding (tSNE) - a machine learning algorithm for visualization - distinct cell sub-populations express TMPRSS4 (Fig. 4). Notably, a high TMPRSS4 expression was noted in immature enterocytes, BEST4-expressing enterocytes and enterocytes, as presented in Fig. 4C.
Figure 4
Single cell analysis of colon mucosa, using the Single Cell Portal, revealed widespread expression of TMPRSS4. (A) Single-cell transcriptomics data are displayed as a spectral tSNE (T-distributed Stochastic Neighbour Embedding) plot of 366,650 cells, annotated according to known cell types; (B) sub-populations of cell types which are enriched for TMPRSS4 expression, with expression intensity demonstrated by a heat map; (C) single cell analysis using the Single Cell Portal, enriched for population subtypes, represented as violin plots. Notably, high TMPRSS4 expression was noted in immature enterocytes, BEST4-expressing enterocytes and enterocytes. TA1, TA2; transit-amplifying (TA) cells.
Finally, TMPRSS4 expression was also observed in various brain regions, based on a human microarray data set of 6 human brains (Fig. 5). These data were acquired from a publicly available source which underwent all appropriate approvals by the Human Investigation Committees and Institutional Ethics Committees of each institute from which samples were obtained.
Figure 5
Heatmap expression of TMPRSS4 in the brains of 6 human donors, as assessed by microarray (Allen brain atlas). Global brain expression of TMPRSS4, as detected by microarray probes, with expression of cerebral areas indicated on the hemispheres and expression in brain nuclei indicated as dots. Heatmaps of expression are displayed below individual brains and organised by anterior to posterior regions with the frontal lobe (Fl), hippocampal formation (HiF), occipital lobe (Ol), parietal lobe (Pl), temporal lobe (Tl), striatum (Str), dorsal thalamus (dT), cerebral cortex (cbcx) and myelencephalon (MY) marked on the heatmap.
Fig. 5 presents these data as heatmaps of log2 expression values, showing that TMPRSS4 is expressed throughout the human brain. Of note, anosmia represents one of the common first symptoms of SARS-CoV-19 infection (37,38), and although there is global brain expression of TMPRSS4, a high TMPRSS4 expression is noted in brain regions important for the sense of smell and taste, namely in the olfactory tubercle (Log2 expression, 2.1±1), paraolfactory gyrus (Log2 expression, 1.3±0.3) and frontal operculum (Log2 expression, 1.5±0.5). These Log2 expression values are provided by the Allen Brain Atlas. Previous research has also demonstrated that TMPRSS4 exhibited the highest expression in neurons, with the neuronal rich cerebral cortex exhibiting a high TMPRSS4 expression (39). TMPRSS4 protein expression is 1.5 protein-transcripts per million (pTPM) in this region with the highest expression localised to the neuronal cells (ProteinAtlas_ENSG00000137648).
Discussion
In the present study, comprehensive evidence is presented regarding the peripheral tissue and CNS distribution of a new potential mediator of SARS-CoV-2 infection, namely TMPRSS4. Of note, it was demonstrated that TMPRSS4 is overexpressed in lung cancer; a condition that predisposes to severe COVID-19 (33,40,41). The current findings are in agreement with those of a previous study demonstrating the co-expression of TMPRSS4 with other key SARS-CoV-2 cell entry mediators (i.e., ACE2, ADAM17 and TMPRSS2) in bronchial epithelial cells from never smokers and current smokers (33). Notably, that study also demonstrated that the TMPRSS4 levels were elevated in smokers, suggesting that this may be an additional risk factor for SARS-CoV-2 infection (33). In the lung cancer cohorts investigated in the present study, a marginal difference was note in LUAD with a marked downregulation of TMPRSS4 in current smokers, whilst in patients with LUSC an increase in TMPRSS4 was noted in smokers compared to non-smokers (Fig. S1A and B). This is suggestive of potential tissue-specific effects, given that patients with LUAD have poorer prognosis than those with LUSC (42).Another recent study also demonstrated a high expression of TMPRSS4 in the human endometrium, which increased with age, particularly in the early phases of the cycle (43). This is of increasing importance given that SARS-CoV-2 infectivity increases with age (44). The present study expanded on these observations and provided evidence of TMPRSS4 expression in other gynaecological tissues, demonstrating that TMPRSS4 is expressed in the cervix, vagina, fallopian tubes, ovaries, breast and uterus, with the first two being the primary organs in terms of a high expression (Fig. S1D). Moreover, it was demonstrated that TMPRSS4 was significantly upregulated in gynaecological malignancies, namely CESC, OV, UCEC and UCS. As age appears to play a role in the risk of COVID-19, the present study further investigated the expression of TMPRSS4 in the above-mentioned malignancies with age. In this respect, TMPRSS4 expression increased with age only in the case of UCEC (Fig. S1C). One of the major limitations of the present study was that in silico data were accessed, that did not allow us to perform statistical analysis with a post hoc test. Another limitation is the absence of survival analysis data. Future studies are required to determine whether TMPRSS4 can be of prognostic value in terms of overall- and/or progression free survival.Recently, two different groups have described the involvement of TMPRSS4 as a SARS-CoV-2 entry mediator in the GI (15,16). In particular, ACE2, TMPRSS2 and TMPRSS4 have been shown to be co-expressed primarily in the small intestine (16). In another study, through a series of elegant experiments, it was shown that, apart from TMPRSS2, TMPRSS4 also enhanced SARS-CoV-2 infectivity in gut epithelial cells (15), suggesting that a leaky gut may allow SARS-CoV-2 to spread to other organs, including the liver. Future studies are required to focus on the role of these key cell entry mediators in GI tract organs, particularly in relation to extra-pulmonary manifestations of severe COVID-19.Finally, it was demonstrated that TMPRSS4 is present in a number of anterior and posterior regions of the brain. This corroborates previous finding indicating expression in the cerebral cortex, hippocampus and caudate (45). Of note, other SARS-CoV-2 cell entry mediators (e.g., ACE2, TMPRSS2, NRP1 and CTSL) are also expressed in the CNS, raising the possibility of a synergy between multiple such proteins which may further drive the SARS-CoV-2 neurotropism (29,30,46).In conclusion, the present study documents widespread TMPRSS4 protein expression in the CNS and GI tract, and provides further evidence supporting the potential involvement of TMPRSS4 in facilitating adverse COVID-19 outcomes in patients with certain cancers. Collectively, these data suggest that TMPRSS4 may be implicated in the symptomatology/complications of COVID-19, acting as another SARS-CoV-2 cell entry mediator responsible for the tropism of this novel coronavirus in the periphery and the CNS.
Authors: Spyros Darmanis; Steven A Sloan; Ye Zhang; Martin Enge; Christine Caneda; Lawrence M Shuer; Melanie G Hayden Gephart; Ben A Barres; Stephen R Quake Journal: Proc Natl Acad Sci U S A Date: 2015-05-18 Impact factor: 11.205
Authors: Merryn Voysey; Sue Ann Costa Clemens; Shabir A Madhi; Lily Y Weckx; Pedro M Folegatti; Parvinder K Aley; Brian Angus; Vicky L Baillie; Shaun L Barnabas; Qasim E Bhorat; Sagida Bibi; Carmen Briner; Paola Cicconi; Andrea M Collins; Rachel Colin-Jones; Clare L Cutland; Thomas C Darton; Keertan Dheda; Christopher J A Duncan; Katherine R W Emary; Katie J Ewer; Lee Fairlie; Saul N Faust; Shuo Feng; Daniela M Ferreira; Adam Finn; Anna L Goodman; Catherine M Green; Christopher A Green; Paul T Heath; Catherine Hill; Helen Hill; Ian Hirsch; Susanne H C Hodgson; Alane Izu; Susan Jackson; Daniel Jenkin; Carina C D Joe; Simon Kerridge; Anthonet Koen; Gaurav Kwatra; Rajeka Lazarus; Alison M Lawrie; Alice Lelliott; Vincenzo Libri; Patrick J Lillie; Raburn Mallory; Ana V A Mendes; Eveline P Milan; Angela M Minassian; Alastair McGregor; Hazel Morrison; Yama F Mujadidi; Anusha Nana; Peter J O'Reilly; Sherman D Padayachee; Ana Pittella; Emma Plested; Katrina M Pollock; Maheshi N Ramasamy; Sarah Rhead; Alexandre V Schwarzbold; Nisha Singh; Andrew Smith; Rinn Song; Matthew D Snape; Eduardo Sprinz; Rebecca K Sutherland; Richard Tarrant; Emma C Thomson; M Estée Török; Mark Toshner; David P J Turner; Johan Vekemans; Tonya L Villafana; Marion E E Watson; Christopher J Williams; Alexander D Douglas; Adrian V S Hill; Teresa Lambe; Sarah C Gilbert; Andrew J Pollard Journal: Lancet Date: 2020-12-08 Impact factor: 79.321
Authors: Ruochen Zang; Maria Florencia Gomez Castro; Broc T McCune; Qiru Zeng; Paul W Rothlauf; Naomi M Sonnek; Zhuoming Liu; Kevin F Brulois; Xin Wang; Harry B Greenberg; Michael S Diamond; Matthew A Ciorba; Sean P J Whelan; Siyuan Ding Journal: Sci Immunol Date: 2020-05-13
Authors: Periklis Katopodis; Emmanouil Karteris; Harpal S Randeva; Demetrios A Spandidos; Sayeh Saravi; Ioannis Kyrou Journal: Int J Mol Med Date: 2021-12-22 Impact factor: 4.101
Authors: Ioannis Kyrou; Emmanouil Karteris; Rachel Kerslake; Harpal S Randeva; Danny Jonigk; Christopher Werlein; Jan L Robertus; Periklis Katopodis; Petra Jasker; Demetrios A Spandidos Journal: Mol Med Rep Date: 2022-02-25 Impact factor: 2.952
Authors: Samar Rezq; Alexandra M Huffman; Jelina Basnet; Licy L Yanes Cardozo; Damian G Romero Journal: Int J Mol Sci Date: 2021-09-09 Impact factor: 5.923