Literature DB >> 31411736

Molecular characterization of colorectal adenomas reveals POFUT1 as a candidate driver of tumor progression.

Malgorzata A Komor1,2, Meike de Wit1, Jose van den Berg1, Sanne R Martens de Kemp1,2, Pien M Delis-van Diemen1, Anne S Bolijn1, Marianne Tijssen1, Tim Schelfhorst2, Sander R Piersma2, Davide Chiasserini2, Joyce Sanders1, Christian Rausch1, Youri Hoogstrate3, Andrew P Stubbs4, Mark de Jong5, Guido Jenster3, Beatriz Carvalho1, Gerrit A Meijer1, Connie R Jimenez2, Remond J A Fijneman1.   

Abstract

Removal of colorectal adenomas is an effective strategy to reduce colorectal cancer (CRC) mortality rates. However, as only a minority of adenomas progress to cancer, such strategies may lead to overtreatment. The present study aimed to characterize adenomas by in-depth molecular profiling, to obtain insights into altered biology associated with the colorectal adenoma-to-carcinoma progression. We obtained low-coverage whole genome sequencing, RNA sequencing and tandem mass spectrometry data for 30 CRCs, 30 adenomas and 18 normal adjacent colon samples. These data were used for DNA copy number aberrations profiling, differential expression, gene set enrichment and gene-dosage effect analysis. Protein expression was independently validated by immunohistochemistry on tissue microarrays and in patient-derived colorectal adenoma organoids. Stroma percentage was determined by digital image analysis of tissue sections. Twenty-four out of 30 adenomas could be unambiguously classified as high risk (n = 9) or low risk (n = 15) of progressing to cancer, based on DNA copy number profiles. Biological processes more prevalent in high-risk than low-risk adenomas were related to proliferation, tumor microenvironment and Notch, Wnt, PI3K/AKT/mTOR and Hedgehog signaling, while metabolic processes and protein secretion were enriched in low-risk adenomas. DNA copy number driven gene-dosage effect in high-risk adenomas and cancers was observed for POFUT1, RPRD1B and EIF6. Increased POFUT1 expression in high-risk adenomas was validated in tissue samples and organoids. High POFUT1 expression was also associated with Notch signaling enrichment and with decreased goblet cells differentiation. In-depth molecular characterization of colorectal adenomas revealed POFUT1 and Notch signaling as potential drivers of tumor progression.
© 2019 The Authors. International Journal of Cancer published by John Wiley & Sons Ltd on behalf of UICC.

Entities:  

Keywords:  POFUT1; adenoma-to-carcinoma progression; colorectal adenoma

Mesh:

Substances:

Year:  2019        PMID: 31411736      PMCID: PMC7027554          DOI: 10.1002/ijc.32627

Source DB:  PubMed          Journal:  Int J Cancer        ISSN: 0020-7136            Impact factor:   7.396


cancer‐associated event colorectal cancer eukaryotic translation initiation factor 6 gene set enrichment analysis high‐risk adenoma immunohistochemistry liquid chromatography tandem mass spectrometry low‐risk adenoma microsatellite instability/instable microsatellite‐stable protein O‐fucosyltransferase 1 RNA sequencing regulation of nuclear pre‐mRNA domain containing 1B tissue microarray whole genome sequencing

Introduction

Colorectal adenomas are benign precursor lesions of colorectal cancer (CRC) that arise from normal epithelium.1 The prevalence of adenomas in the large intestine is much higher than the incidence of cancer,2, 3 implying that the majority of adenomas will never progress to CRC.4 In clinical practice, adenomas detected during colonoscopy are completely removed, and consequently the natural history of disease is disrupted. Based on the prevalence of focal cancer in endoscopically removed adenomas, it is estimated that only 5% of adenomas will eventually progress to CRC.5, 6 Currently, adenomas larger than 1 cm and/or with a villous component and/or with high‐grade dysplasia are referred to as “advanced adenomas” and are considered to be clinically relevant precursors of CRC. However, incidence studies of both advanced adenomas and CRCs suggest that these features alone are not precise predictors of the malignant progression.7 Cancer is caused by molecular alterations in DNA, thereby affecting gene expression at RNA and protein level. The “advanced adenoma” definition neglects molecular changes that accompany adenoma‐to‐carcinoma progression. In multiple cancer types, the progression of dysplastic epithelial premalignant lesions, like colorectal adenomas, has been associated with acquisition of genomic instability.8, 9 This often concerns chromosomal instability, which affects about 85% of CRCs.10 Studies on chromosomal instability in colorectal adenomas and cancers led to identification of nonrandom chromosomal aberrations and potential CRC driver events, which play a major role in adenoma‐to‐carcinoma progression.11, 12, 13, 14, 15, 16, 17, 18 Seven chromosomal copy number aberrations have been identified as colorectal cancer‐associated events (CAEs); gains of chromosomal arms 8q, 13q and 20q and losses of chromosomal arms 8p, 15q, 17p and 18q. With the accuracy of 78%, the presence of at least two of these CAEs enabled distinction of an adenoma with a focus of cancer from a nonmalignant adenoma.11 Therefore, adenomas with at least two out of the seven CAEs are marked as high risk of progressing to malignancy, further referred to as high‐risk adenomas (HRAs).11 We recently observed that only 23–36% of advanced adenomas classify as HRAs based on their DNA copy number profile.19 The aim of the present study was to characterize adenomas at low and high risk of progressing to cancer by molecular profiling at DNA, RNA and protein level, allowing to examine the biological processes in which these adenomas differ and to discover putative drivers of early colorectal tumor development.

Materials and Methods

Tissue data

Fresh frozen tissue material from 30 CRCs, 30 adenomas and 18 normal colorectal mucosa samples was collected at the Department of Pathology of the Amsterdam University Medical Center (VUmc) in Amsterdam, as described previously.20 Collection, storage and use of tissue and patient data were performed in compliance with the “Code for Proper Secondary Use of Human Tissue in the Netherlands” (https://www.federa.org/). All normal samples were adjacent to colorectal neoplasia; four normal colon samples were adjacent to adenomas and cancers, six to colorectal adenomas and eight to CRC. All normal samples were obtained from the furthest point from colorectal neoplasia within the surgically resected material and judged as 100% normal by an expert pathologist. In our study all adenomas were larger than 1 cm in size to allow sampling of fresh frozen material for research purposes from tissues that were collected for routine diagnostics. Therefore, all of the adenomas used in our study were “advanced adenomas.” For each sample, one tissue piece was cut into serial sections that were alternatingly used for DNA, RNA and protein isolation in the order DNA–RNA‐protein‐(…)‐DNA–RNA‐protein, to obtain the most comparable molecular profiles on DNA, RNA and protein level.

Genomics data

Low‐coverage whole genome sequencing (WGS) data for the adenomas and RNA sequencing (RNA‐seq) data for colorectal adenomas and cancers were obtained in our previous study.20 For the normal adjacent colon sample collection, DNA and RNA isolation, low‐coverage WGS and RNA‐seq was performed as previously described for adenomas and cancers.20 Raw sequencing data were made available through the European Genome‐Phenome Archive (https://ega-archive.org/, EGAS00001002854). DNA copy number aberration identification in CRCs and normal adjacent colon samples was performed as described previously for the adenomas.20

Mass spectrometry proteomics data

Sample preparation for liquid chromatography tandem mass spectrometry proteomics (LC–MS/MS) was performed as previously described,21 with some modifications (Supplementary Materials and Methods). Mass spectrometry was performed on a Q Exactive‐HF mass spectrometer (Thermo Fisher, Bremen, Germany) using a data independent acquisition mass spectrometry protocol. The data independent acquisition mass spectrometry method consisted of a MS1 scan from 400 to 1,000 m/z at 15,000 resolution (AGC target of 3 × 106 and 50 ms injection time). For MS2, 24 variable size DIA segments were acquired at 30,000 resolution (AGC target 3 × 106 and auto for injection time). The data independent acquisition mass spectrometry method included 20 windows of 20 m/z, 2 × 40 m/z and 2 × 60 m/z. Collision energy was set at 28%. The spectra were recorded in centroid mode. The default charge state for the MS2 was set to 3.

RNA‐seq data analysis

RNA‐seq data preprocessing was performed as described previously,20 now using human genome build hg19 (USCS RefSeq hg19, gencode v19 annotation). RNA‐seq data were subjected to differential expression analysis, cellular decomposition (ESTIMATE22 algorithm), gene set enrichment analysis (GSEA)23 and gene‐dosage effect analysis (Supplementary Materials and Methods).

Proteomics data analysis

An in‐house spectral library was established using LC–MS/MS data derived from CRCs, colorectal adenomas and normal adjacent colon samples (manuscript in preparation), which was used in Spectronaut24 to identify mass spectra. Protein groups were identified, quality control was performed and protein expression data was subjected to differential expression analysis, GSEA23 and gene‐dosage effect analysis (Supplementary Materials and Methods).

Quantification of tumor‐stroma and goblet cells

Fresh‐frozen tissue sections taken “before” and “after” the tissue sections used for DNA, RNA and protein isolation were stained with hematoxylin and eosin, and scanned using Aperio AT2 Scanner (Leica Biosystems Imaging, Amsterdam, The Netherlands). The digital images were used for stroma and goblet cells quantification (Supplementary Materials and Methods).

Immunohistochemical staining of tissue microarrays and patient‐derived colorectal adenoma organoids

Candidate drivers of adenoma‐to‐carcinoma progression were selected for immunohistochemical (IHC) validation of protein expression in colorectal tissues using tissue microarrays (TMAs), and in cultures of epithelial cells using sections of patient‐derived colorectal adenoma organoids. Candidates were selected using the following criteria: higher expression in HRAs when compared to low‐risk adenomas (LRAs); and higher intensity in CRCs when compared to normal colon according to the Human Protein Atlas (http://www.proteinatlas.org).25 See Supplementary Materials and Methods for details on IHC and patient‐derived organoids.

Results

Molecular characterization of LRA and HRA

With the aim to characterize colorectal adenomas in the context of colorectal tumor progression, we have performed low‐coverage WGS, genome‐wide RNA‐seq and tandem mass spectrometry proteomics (LC–MS/MS) on 30 colorectal adenomas,20 30 CRCs and 18 adjacent normal colon tissues (see Fig. 1 for an overview of the analyses applied in the entire study and Supplementary Table S2 for clinical information on the samples). Using low‐coverage WGS we determined DNA copy number aberrations in the samples. Within the adenomas, nine HRAs were identified based on the presence of at least two CAEs. To obtain a robust representation of LRAs, only microsatellite‐stable (MSS) lesions that carried none of the CAEs were included. Two adenomas were microsatellite‐instable (MSI), two adenomas carried only one CAE, and for two adenomas the calling of CAEs remained inconclusive,20 leaving 15 MSS adenomas with no CAEs that were classified as LRAs (Supplementary Fig. S1 and Table S3). No significant associations were observed for risk of progression and pathological adenoma features like size, grade of dysplasia or histology (Table S4). CRCs showed the well‐known nonrandom pattern of chromosomal instability with CAEs being the most frequent, next to gain of chromosome 7 and loss of chromosome 14 (Fig. S1). As six CRCs had previously been identified as MSI,20 the DNA copy number frequency for MSI CRCs and MSS CRCs were examined separately, revealing less chromosomal aberrations in MSI CRCs (Fig. S1). No chromosomal aberrations were observed in the normal adjacent colon samples (Fig. S1).
Figure 1

Fresh‐frozen tissue fragments of colorectal cancers (n = 30), colorectal adenomas (n = 30) and normal adjacent colon samples (n = 18) were used for low‐coverage WGS, RNA‐seq, tandem mass spectrometry proteomics and histology analysis. DNA copy number aberration identification and HRA and LRA stratification was performed using the low‐coverage WGS data. RNA‐seq and proteomics data were used for differential gene/protein expression analysis and GSEA. Additionally, single sample GSEA and ESTIMATE algorithm, which calculate the enrichment of stromal and immune gene signatures, were used on the RNA expression data set. Stroma quantification was performed on sections originating from the same tissue fragments as used for the molecular profiling data to validate the results of the expression analysis. Stroma percentage was compared between HRA and LRA and correlated with the stromal score of the ESTIMATE algorithm. Next, DNA copy number driven gene‐dosage effect analysis was performed. Ninety‐two and ten genes were identified to correlate in terms of DNA copy number, RNA and protein expression in CRCs and adenomas, respectively. Three genes, POFUT1, RPRD1B and EIF6, were overlapping between adenomas and cancers and were observed to be amplified and overexpressed in HRAs and CRCs. Validation of POFUT1 and RPRD1B by immunohistochemical staining was performed in TMAs of the formalin‐fixed, paraffin‐embedded tissue pieces and for POFUT1 also in full sections of patient‐derived adenoma organoids. Additionally, goblet cell quantification was performed on the sections of colorectal adenomas and association with POFUT1 expression and risk of progression was identified. [Color figure can be viewed at http://wileyonlinelibrary.com]

Fresh‐frozen tissue fragments of colorectal cancers (n = 30), colorectal adenomas (n = 30) and normal adjacent colon samples (n = 18) were used for low‐coverage WGS, RNA‐seq, tandem mass spectrometry proteomics and histology analysis. DNA copy number aberration identification and HRA and LRA stratification was performed using the low‐coverage WGS data. RNA‐seq and proteomics data were used for differential gene/protein expression analysis and GSEA. Additionally, single sample GSEA and ESTIMATE algorithm, which calculate the enrichment of stromal and immune gene signatures, were used on the RNA expression data set. Stroma quantification was performed on sections originating from the same tissue fragments as used for the molecular profiling data to validate the results of the expression analysis. Stroma percentage was compared between HRA and LRA and correlated with the stromal score of the ESTIMATE algorithm. Next, DNA copy number driven gene‐dosage effect analysis was performed. Ninety‐two and ten genes were identified to correlate in terms of DNA copy number, RNA and protein expression in CRCs and adenomas, respectively. Three genes, POFUT1, RPRD1B and EIF6, were overlapping between adenomas and cancers and were observed to be amplified and overexpressed in HRAs and CRCs. Validation of POFUT1 and RPRD1B by immunohistochemical staining was performed in TMAs of the formalin‐fixed, paraffin‐embedded tissue pieces and for POFUT1 also in full sections of patient‐derived adenoma organoids. Additionally, goblet cell quantification was performed on the sections of colorectal adenomas and association with POFUT1 expression and risk of progression was identified. [Color figure can be viewed at http://wileyonlinelibrary.com] To explore the biological processes playing a role in colorectal tumor progression, the tissue samples were analyzed by RNA‐seq and LC–MS/MS. Mass spectrometry analysis lead to identification of 5,080 protein groups in the whole data set and 4,903 in the group of HRAs and LRAs (false discovery rate ≤0.01). Among the adenomas, one HRA was identified as an outlier due to low protein group number and highly differing expression profile from the rest of the adenoma samples (Fig. S2) and was excluded from further proteomic analyses. Dimensionality reduction of the RNA and protein expression data allowed to clearly discern adenomas from CRCs and normal adjacent colon tissues (Figs. S3 and S3c) while HRAs and LRAs were indistinguishable (Figs. S3). Differential gene expression analysis between the HRAs and LRAs revealed 298 genes with higher and 125 genes with lower expression in HRAs (Table S5). Differential protein expression analysis revealed 78 proteins with higher and 86 with lower expression in HRAs (Table S6). Fourteen genes were differentially expressed on both RNA and protein level, with 9 genes higher and 5 lower expressed in HRAs (Table S7). To gain further insights into the global differences between the adenomas, we performed GSEA with hallmark gene signatures (molecular signature database26) on lists of genes and proteins ranked according to differences in the expression between HRAs and LRAs (Fig. 2). Processes that were more prominent in HRAs on RNA and protein level were related to proliferation, immune response and stroma development. Additionally, a number of signaling pathways were enriched in HRAs either only on the RNA (KRAS‐signaling up, Hedgehog‐, WNT‐, IL2‐STAT5‐, NOTCH‐signaling’ or protein level (PI3K/AKT/mTOR‐, mTORC1‐signaling). The processes more prominent in LRAs compared to HRAs were identified on the protein level and included “protein secretion” and the metabolic gene sets (Fig. 2).
Figure 2

Gene set enrichment analysis results in the differential analysis between HRA and LRA, on RNA and protein level, as measured by RNA‐seq and mass spectrometry proteomics. Genes or proteins were ranked based on their fold change and p‐value, with genes/proteins significantly overexpressed in HRAs on top of the list. GSEA was performed on the ranked list using hallmark gene sets. Gene sets enriched in HRAs are marked red, and gene sets enriched in LRAs are marked blue. The size of the dot reflects the significance of the enrichment (false discovery rate ≤0.15). For a subset of the signaling pathways, like Hedgehog, Wnt and Notch, GSEA on the protein level could not be determined since the number of proteins from these gene sets identified by LC–MS/MS was too small. [Color figure can be viewed at http://wileyonlinelibrary.com]

Gene set enrichment analysis results in the differential analysis between HRA and LRA, on RNA and protein level, as measured by RNA‐seq and mass spectrometry proteomics. Genes or proteins were ranked based on their fold change and p‐value, with genes/proteins significantly overexpressed in HRAs on top of the list. GSEA was performed on the ranked list using hallmark gene sets. Gene sets enriched in HRAs are marked red, and gene sets enriched in LRAs are marked blue. The size of the dot reflects the significance of the enrichment (false discovery rate ≤0.15). For a subset of the signaling pathways, like Hedgehog, Wnt and Notch, GSEA on the protein level could not be determined since the number of proteins from these gene sets identified by LC–MS/MS was too small. [Color figure can be viewed at http://wileyonlinelibrary.com] To put the GSEA group‐level differences between HRAs and LRAs in context of progression toward CRC, we performed single‐sample GSEA on RNA level in adenomas and cancers using the hallmark gene sets (Fig. S4). Seven gene sets were significantly differential between HRAs and LRAs (p ≤ 0.05, Fig. 3). In six cases, the single‐sample GSEA score increased through colorectal tumor progression, with the lowest score in LRAs and the highest in CRCs. These include “Notch‐” and “Hedgehog‐signaling” together with immune‐ and stroma‐related gene sets, like “epithelial‐mesenchymal transition.” For “heme metabolism,” the single‐sample GSEA score decreased through colorectal tumor progression (Fig. 3).
Figure 3

Single sample gene set enrichment scores represented per sample type; LRAs, HRAs and CRCs. Gene sets with significant differences in enrichment scores between HRA and LRA (p ≤ 0.05) were selected for this figure. [Color figure can be viewed at http://wileyonlinelibrary.com]

Single sample gene set enrichment scores represented per sample type; LRAs, HRAs and CRCs. Gene sets with significant differences in enrichment scores between HRA and LRA (p ≤ 0.05) were selected for this figure. [Color figure can be viewed at http://wileyonlinelibrary.com]

Characterization of LRA and HRA tumor microenvironment

As GSEA revealed increased stroma and immune processes in HRAs, we examined the differences in tumor microenvironment between HRAs and LRAs. By applying the ESTIMATE algorithm22 on RNA expression data, enrichment scores for stromal and immune signatures were calculated in each sample reflecting the expression of stroma‐ and immune‐related genes (Fig. S5). A significant increase of stromal score was identified in HRAs when compared to LRAs (p = 0.012). An even more significant increase was observed between MSS cancers and HRAs (p = 5.7e−5). In terms of the immune score, even though a gradual increase from LRAs through HRAs to MSS cancers was identified, the differences between the groups were insignificant (p = 0.096 and 0.98, respectively). MSI cancers had significantly higher immune score than MSS cancer (p = 0.021, Fig. S5). To morphologically confirm the differences in the amounts of stroma between the HRAs and LRAs, we performed stroma quantification on hematoxylin and eosin‐stained slides by digital image analysis (Fig. 4 a). One sample could not be analyzed due to excessive tissue folds. The amount of stroma in HRAs (median = 40.89) was significantly higher than in LRAs (median = 27.20, p = 0.002, Fig. 4 b). Stroma percentage calculated by image analysis also positively correlated with the ESTIMATE stromal score from the RNA expression analysis (Fig. 4 c). This indicates that the expression differences between HRAs and LRAs in stromal and immune pathways are associated with the morphological differences in the amount of stroma in the tissue samples.
Figure 4

Stroma quantification on hematoxylin and eosin‐stained slides. (a) Representative image of assigning class to area on the slide; stroma, epithelium or lumen. Each class was quantified by calculating the size of its area. (b) Significant difference in stroma percentage between HRA and LRA, as calculated by the image analysis. (c) Significant positive correlation identified between stroma percentage measured by image analysis and ESTIMATE stromal score. [Color figure can be viewed at http://wileyonlinelibrary.com]

Stroma quantification on hematoxylin and eosin‐stained slides. (a) Representative image of assigning class to area on the slide; stroma, epithelium or lumen. Each class was quantified by calculating the size of its area. (b) Significant difference in stroma percentage between HRA and LRA, as calculated by the image analysis. (c) Significant positive correlation identified between stroma percentage measured by image analysis and ESTIMATE stromal score. [Color figure can be viewed at http://wileyonlinelibrary.com]

Candidate drivers of adenoma‐to‐carcinoma progression

Next to identification of differences in tumor microenvironment, we investigated DNA copy number driven gene‐dosage effect to reveal changes between HRAs and LRAs driven by the aberrations in the epithelial cells (Fig. 1). Pairwise correlation analysis was performed between DNA copy number, RNA and protein expression for colorectal adenomas and CRCs. In the cancers, 92 genes were positively correlated among the data types (Fig. S6 and Table S8). Chromosome 20 was associated with the largest global expression changes on RNA and protein level with 28 genes (~30%), including HNF4A, TOMM34 and RPRD1B, which were previously described to be gained and overexpressed in CRC cell lines and tissues.27, 28 Gene‐dosage effect was also identified for DIS3, which is located on chromosome 13 and often gained in CRC.27, 29 Other genomic regions with the highest number of perturbed genes considered almost all chromosomes involved in the CAEs. In the adenomas, positive and significant correlations between DNA copy numbers, RNA and protein expression were identified for 10 genes (Fig. S6 and Table S9). As HRAs are characterized by presence of CAEs, potential drivers of early colorectal tumor progression are expected to reside on the CAE‐defined chromosomes. Gene‐dosage effect was identified for two genes from chromosome arm 8p; however, these genes were associated both with gains and losses in the HRA group (Fig. S1) and consequently, higher and lower gene and protein expression when compared to LRAs. For the genes located on the CAE‐related chromosome 20, POFUT1, RPRD1B and EIF6, gene‐dosage effect was associated with only gains (Fig. S1) and overexpression in HRAs when compared to LRAs (Fig. 5). We performed gene‐level overlap analysis between gene‐dosage effects in CRCs and in adenomas to identify genes prominent for both HRAs and CRCs. The analysis revealed POFUT1, RPRD1B and EIF6, implying that the gain of chromosome arm 20q and expression of these three genes play an important role in both HRAs and CRCs. For all of these three genes DNA copy number, RNA and protein expression increased gradually from normal adjacent colon, through LRAs and HRAs to CRCs (Fig. 5). POFUT1, RPRD1B and EIF6 reside on neighboring cytogenetic bands—20q11.21, 20q11.23 and 20q11.22, respectively. Moreover, significant positive correlations were identified between these genes on DNA, RNA and protein level, suggesting their coamplification and coexpression (Fig. S7).
Figure 5

Proteogenomic representation of the potential drivers of colorectal tumors. DNA copy number, RNA and Protein expression (as measured by mass spectrometry proteomics) were plotted for EIF6 (a), POFUT1 (b) and RPRD1B (c) for each sample among different stages of colorectal tumor development: normal adjacent colon, LRAs, HRAs and CRCs. Correlating, gradual increase in DNA copy number and RNA and Protein expression was observed for each of these three genes. [Color figure can be viewed at http://wileyonlinelibrary.com]

Proteogenomic representation of the potential drivers of colorectal tumors. DNA copy number, RNA and Protein expression (as measured by mass spectrometry proteomics) were plotted for EIF6 (a), POFUT1 (b) and RPRD1B (c) for each sample among different stages of colorectal tumor development: normal adjacent colon, LRAs, HRAs and CRCs. Correlating, gradual increase in DNA copy number and RNA and Protein expression was observed for each of these three genes. [Color figure can be viewed at http://wileyonlinelibrary.com] To validate gene‐dosage effect of POFUT1, EIF6 and RPRD1B in colorectal tumors, we evaluated the relation between DNA copy numbers, RNA and protein expression of these genes in The Cancer Genome Atlas (TCGA) Provisional CRC data set.30, 31 Gene‐dosage effect was confirmed for each of these three genes in this data set on both RNA (n = 382) and protein level (n = 90), as gene and protein expression was higher when the DNA copy of the gene was gained or amplified (Figs. S8–S10).

Validation of increased POFUT1 expression in HRAs

To verify whether protein expression of POFUT1, RPRD1B and EIF6 is increased in CRCs and HRAs compared to LRAs and normal colon tissue, we aimed to evaluate their expression by immunohistochemistry (IHC) using TMAs obtained from the same samples as were used for the molecular profiling. Data in the Human Protein Atlas25 indicated that the expression of EIF6, as measured by IHC, is already high in normal colon tissue, leaving little room to detect increased EIF6 protein expression in adenomas and CRCs. Therefore, TMAs were stained for POFUT1 and RPRD1B, while EIF6 was discarded from IHC analysis. Within the TMA cores of colorectal tissues, RPRD1B was observed mainly in the nuclei of epithelial cells (Fig. S8), the staining confirmed increasing protein expression of RPRD1B in HRAs and CRCs as observed in the molecular profiling data (Fig. 5 c). Nevertheless, several LRAs and normal adjacent colon samples exhibited high intensity of RPRD1B staining (Fig. S11 and Table S10). Therefore, the difference in RPRD1B expression measured by IHC between LRAs and HRAs was not significant (p = 0.197; Table S10). Comparisons of CRCs with HRAs to LRAs and of CRCs with HRAs to LRAs with normal colon samples yielded significant differences (p = 0.017 and 0.003, respectively; Table S10). POFUT1 immunohistochemical staining was predominantly observed in the cytoplasm of epithelial cells, the staining showed gradual increase of POFUT1 expression through different stages of colorectal tumor progression (Figs. 6 a and 6 b), thereby verifying the molecular profiling data (Fig. 5 b). High levels of POFUT1 expression measured by IHC were more frequent in HRAs compared to LRAs, in HRAs and cancers compared to LRAs and in HRAs and cancers compared to LRAs and normal adjacent colon (Tables S11). POFUT1 expression was also significantly associated with grade of dysplasia (Table S11). Interestingly, POFUT1 expression was lower in MSI than in MSS cancers on both RNA and protein level (Figs. 5 b and 6), suggesting its specific role for chromosomal instability tumors. Previously, depletion of POFUT1 was shown to play a role in differentiation of the proliferative epithelial cells into goblet cells through inactivation of Notch signaling.32 Therefore, we quantified the amount of goblet cells in the adenomas using hematoxylin and eosin‐stained sections to examine this finding in the context of risk of progression. No association of the amount of goblet cells with dysplasia or other pathological features was identified (Table S11). Lower amounts of goblet cells were significantly associated with high POFUT1 expression (p = 0.017; Table S11) and high risk of progression (p = 0.007; Table S11), implying that also in our study POFUT1 is linked to goblet cell differentiation and indicating its role in early colorectal tumor development.
Figure 6

Immunohistochemical staining of POFUT1 in colorectal tissues and patient‐derived organoids. (a) Representative POFUT1 staining in different tissue sample type. Top left: normal adjacent colon; top right: LRA; bottom left: HRA; bottom right: CRC. (b) POFUT1 expression as measured by a product of epithelial cytoplasmic staining intensity (negative = 0, weak = 1, moderate = 2 or strong = 3) and percentage of the cells stained positively (0–100%) was plotted for each tissue sample among different stages of colorectal tumor development. See Table S11 for group comparisons and statistical testing. (c) Representative images of POFUT1 staining in LRA organoid (top) and HRA organoid (bottom). (d) POFUT1 expression in epithelial cytoplasm plotted in HRA and LRA organoids, as measured by a product of epithelial cytoplasmic staining intensity (negative = 0, weak = 1, moderate = 2 or strong = 3) and percentage of the cells stained positively (0–100%). See Table S13 for group comparisons and statistical testing. [Color figure can be viewed at http://wileyonlinelibrary.com]

Immunohistochemical staining of POFUT1 in colorectal tissues and patient‐derived organoids. (a) Representative POFUT1 staining in different tissue sample type. Top left: normal adjacent colon; top right: LRA; bottom left: HRA; bottom right: CRC. (b) POFUT1 expression as measured by a product of epithelial cytoplasmic staining intensity (negative = 0, weak = 1, moderate = 2 or strong = 3) and percentage of the cells stained positively (0–100%) was plotted for each tissue sample among different stages of colorectal tumor development. See Table S11 for group comparisons and statistical testing. (c) Representative images of POFUT1 staining in LRA organoid (top) and HRA organoid (bottom). (d) POFUT1 expression in epithelial cytoplasm plotted in HRA and LRA organoids, as measured by a product of epithelial cytoplasmic staining intensity (negative = 0, weak = 1, moderate = 2 or strong = 3) and percentage of the cells stained positively (0–100%). See Table S13 for group comparisons and statistical testing. [Color figure can be viewed at http://wileyonlinelibrary.com] To further corroborate the role of POFUT1 in the pathogenesis of CRC in an independent series, expression of POFUT1 was investigated in a cohort of patient‐derived colorectal adenoma organoids. First, we performed low‐coverage WGS and based on the presence of two or more CAEs revealed 8 HRA and 15 LRA organoids in the series (Table S12). Next, IHC staining of the organoids was performed to evaluate POFUT1 expression in the neoplastic cells. Also in the organoids, POFUT1 was mainly observed in the cytoplasm and high POFUT1 expression was associated with HRAs (p = 0.008; Table S13 and Figs. 6 c and 6 d), confirming its potential role in early colorectal tumor development.

Discussion

Studying the natural history of colorectal adenomas, including progression to cancer, is challenging because adenomas are removed when detected during colonoscopy. Yet, there is a need for better understanding of the biology of adenomas that progress to CRC. We set out to molecularly characterize adenomas at high risk of progressing to CRC and to identify putative drivers of this process. POFUT1 was found to be amplified and overexpressed in HRAs and CRCs when compared to LRAs and adjacent normal colon epithelium. POFUT1 overexpression was successfully validated by immunohistochemical staining on TMAs and in patient‐derived colorectal adenoma organoids, indicating that POFUT1 plays a role in colorectal adenoma‐to‐carcinoma progression. Additionally, high POFUT1 expression and high risk of progression to cancer were associated with a decrease in goblet cell differentiation. The novelty of the current study is multi‐omics analysis of colorectal adenomas at high and low risk of progressing to cancer, in the context of CRCs and normal adjacent colon samples. Comprehensive analysis of high throughput DNA, RNA and protein profiling data of the same samples has not been performed yet for colorectal adenomas, while it did provide additional insights in CRC.27, 28 On RNA and/or protein level, the enrichment of gene sets and pathways were identified to be increasing through different stages of colorectal tumor development, from normal colon, through LRA and HRA to CRC. These included pathways known to play a role in or accompany colorectal carcinogenesis like Hedgehog, Notch, KRAS, PI3K/AKT/mTOR or Wnt signaling, proliferation, epithelial‐mesenchymal transition or immune activation.33 This suggests that a lot of processes inherent to cancer are already more active in HRAs compared to LRAs. Conversely, gene sets enriched in LRAs when compared to HRAs, like protein secretion, fatty‐acid or heme metabolism, decreased in CRC, consistent with previous observations.34 Fourteen genes were identified to be differentially expressed between HRAs and LRAs on both RNA and protein level. Among upregulated genes/proteins in HRAs, genes of both epithelial and stromal origins were found. This included HNF4A, a transcriptional activator of epithelial differentiation35 that is located on chromosomal arm 20q, previously shown to be amplified and activated in the majority of CRCs28 and studied as a prognostic biomarker for this disease.36 An unexpected result was the overexpression of multiple tumor microenvironment‐related genes/proteins in HRAs, including collagens, fibronectin, vimentin, immunoglobulins or calprotectin. While a broad range of stroma proportion has been reported in CRC,37 this is far less evident in adenomas. It has been shown that stromal genes can be expressed by epithelial cells, which typically occurs in association with invasion, a phenomenon referred to as epithelial‐mesenchymal transition.35 Nevertheless, by definition, stroma invasion is a process characteristic to cancer and not yet occurring in adenomas. We have performed stroma quantification by image analysis on adenoma tissue sections originating from the same tissue fragments that were used for molecular profiling, and observed a significant increase in stroma percentage in HRAs compared LRAs. Our data indicate that differential expression of the stroma genes between HRAs and LRAs is likely due to the differences in the stroma proportion. Even though significant, the variation in the amount of stroma in the adenomas is certainly not as big as in CRCs.37 To identify putative drivers of adenoma‐to‐carcinoma progression from the epithelial cells, we examined DNA‐driven aberrations in the colorectal tumors. Combining DNA and RNA data to study gene‐dosage effect has been performed in CRC18; however, only for a limited number of potential candidates functional assays confirmed their oncogenic potential.14, 29, 38 Addition of the protein layer provides insight into which chromosomal aberrations lead to functional consequences.28 Despite the high depth of the proteomics measurement in the present study with over 5,000 protein groups detected in total, adding the protein layer can be also limiting, in terms of the number of proteins measured overall and subsequently considered in the analysis. In our study, gene‐dosage effect analysis in CRCs led to the identification of 92 genes, a subset of which has previously been described, including HNF4A,28 TOMM34,28 DIS3 29 or RPRD1B.27 In the adenomas, the CAE‐driven gene‐dosage effect analysis yielded potential drivers of colorectal tumor progression that are already amplified and overexpressed in HRAs—POFUT1, RPRD1B and EIF6. The three genes are located on neighboring cytobands of chromosome arm 20q, which is the most frequently amplified chromosomal arm in CRC.18, 28 POFUT1 is a fucosylation factor that activates Notch through addition of fucose groups,39 a process required for the canonical Notch signaling.32, 40 In our study, POFUT1 was amplified and overexpressed while Notch signaling was enriched in HRAs and CRCs, when compared to LRAs. High expression of POFUT1 in HRAs and CRCs was validated using immunohistochemical staining of TMAs and adenoma‐derived organoids. Recently, POFUT1 overexpression was shown to have oncogenic activity in CRC through activation of NOTCH1 signaling, and consequently affecting proliferation, invasion and migration.41 Additionally, depletion of POFUT1 or Notch signaling was shown to be associated with converting proliferative cells into goblet cells.32, 42 Indeed, in the present study, low numbers of goblet cells were significantly associated with high‐risk status and high POFUT1 expression in adenomas, indicating that in HRAs POFUT1 and Notch signaling play a role in increased proliferation and decreased differentiation. Altogether this suggests that POFUT1 through the Notch signaling pathway is a putative driver of adenoma‐to‐carcinoma progression. Further functional studies on adenoma preclinical models are needed to confirm this hypothesis. RPRD1B is overexpressed in many tumor types and has been shown to have an oncogenic activity by regulating the transcription of cyclin D143 and other Wnt targets,44 consistent with the significant enrichment of Wnt signaling in HRAs demonstrated by GSEA in the present study. RPRD1B was proven to accelerate tumorigenesis by promoting cell proliferation and invasion.43, 44 Altogether, this suggests that RPRD1B may play a role in colorectal tumor progression through enhanced Wnt signaling. Although the TMA IHC analyses did not validate differences in RPRD1B expression levels between LRA and HRA, its predominant staining of neoplastic cells combined with the molecular profiling data suggest that RPRD1B should also be considered as a putative driver of colorectal tumor development. EIF6 is a translation initiation factor that plays a role in ribosome complex formation and protein synthesis downstream of PI3K/Akt/mTOR signaling pathway.45, 46 It is overexpressed in multiple tumor types,47, 48 including CRC, where expression of EIF6 has been shown to increase from normal colon, through adenoma to CRC.49 Functional studies on EIF6 suggest its oncogenic activity through increasing cancer cell motility and invasion.50, 51 The fact that we identified significant enrichment of PI3K/Akt/mTOR signaling in HRAs when compared to LRAs, suggests that EIF6 and PI3K/Akt/mTOR signaling play a role in adenoma‐to‐carcinoma progression. Additionally, the transcription of EIF6 has been shown to be regulated by NOTCH1,51 consistent with Notch signaling enrichment in HRAs and CRCs. Individuals with a history of colorectal neoplasia carry an increased risk of developing CRC in the future and therefore are enrolled in the colonoscopy‐based surveillance programs.52 As removal of nonmalignant precursor lesions during colonoscopy is an approach to decrease CRC incidence and mortality rates,53 currently, detection of advanced adenoma is an indication to shorten the interval for the follow‐up surveillance colonoscopy.52 The high prevalence of advanced adenomas in an elderly population leads to a substantial burden on endoscopic capacity.52 Moreover, given that not all advanced adenomas eventually progress to cancer, frequent surveillance colonoscopies in patients with these lesions lead to overdiagnosis and overtreatment.4 In our study, we have shown that HRAs, in contrast to LRAs, in a number of aspects resemble CRCs on molecular level, while they represent only approximately 30% of the advanced adenomas.19 Introduction of a more specific definition of adenomas associated with risk of future CRC development may significantly improve the CRC surveillance programs and reduce patient burden. Additional studies are still needed to evaluate if patients with HRAs indeed have higher CRC incidence and mortality rate compared to patients with advanced adenomas, and whether POFUT1 can be used as biomarker to identify HRAs in the surveillance setting. In our study, we performed multi‐omics characterization of colorectal adenomas in the context of colorectal tumor development. We focused on conventional chromosomal instability adenomas, the most prevalent precursors of CRC,10 as MSI adenomas are relatively rare with a prevalence of only 3%.54 MSI CRCs were included in our analyses, which frequently differed from MSS CRCs in terms of gene expression and GSEA, confirming the distinct etiology of MSS and MSI CRCs. POFUT1, RPRD1B and EIF6 were identified as putative drivers of adenoma‐to‐carcinoma progression. In light of what is known about the roles these genes play in carcinogenesis, our results imply that the transition from LRAs to HRAs involves the interplay of Wnt, Notch and PI3K/AKT/mTOR signaling pathways. As such, our study shows that biological processes inherent to CRC are already more active in HRAs than in LRAs. Moreover, our study emphasizes the key role that specific DNA copy number alterations play in progression from premalignancy to cancer, indicating that in comparison to the generally used morphology‐based concept of “advanced adenoma,” the molecular CAE‐based concept of HRA is a more specific marker to define risk of progressing to CRC. Appendix S1: Supplementary Tables Click here for additional data file. Appendix S2: Supplementary Figures Click here for additional data file. Appendix S3: Supplementary Material Click here for additional data file.
  54 in total

Review 1.  A genetic model for colorectal tumorigenesis.

Authors:  E R Fearon; B Vogelstein
Journal:  Cell       Date:  1990-06-01       Impact factor: 41.582

Review 2.  Lessons from hereditary colorectal cancer.

Authors:  K W Kinzler; B Vogelstein
Journal:  Cell       Date:  1996-10-18       Impact factor: 41.582

3.  Colorectal Cancer Cell Line Proteomes Are Representative of Primary Tumors and Predict Drug Sensitivity.

Authors:  Jing Wang; Dmitri Mouradov; Xiaojing Wang; Robert N Jorissen; Matthew C Chambers; Lisa J Zimmerman; Suhas Vasaikar; Christopher G Love; Shan Li; Kym Lowes; Karl-Johan Leuchowius; Helene Jousset; Janet Weinstock; Christopher Yau; John Mariadason; Zhiao Shi; Yuguang Ban; Xi Chen; Robert J C Coffey; Robbert J C Slebos; Antony W Burgess; Daniel C Liebler; Bing Zhang; Oliver M Sieber
Journal:  Gastroenterology       Date:  2017-06-16       Impact factor: 22.682

4.  TPX2 and AURKA promote 20q amplicon-driven colorectal adenoma to carcinoma progression.

Authors:  Anke H Sillars-Hardebol; Beatriz Carvalho; Marianne Tijssen; Jeroen A M Beliën; Meike de Wit; Pien M Delis-van Diemen; Fredrik Pontén; Mark A van de Wiel; Remond J A Fijneman; Gerrit A Meijer
Journal:  Gut       Date:  2011-12-29       Impact factor: 23.059

Review 5.  Crosstalks between translation and metabolism in cancer.

Authors:  Stefano Biffo; Nicola Manfrini; Sara Ricciardi
Journal:  Curr Opin Genet Dev       Date:  2017-11-15       Impact factor: 5.578

6.  Gene-dosage dependent overexpression at the 13q amplicon identifies DIS3 as candidate oncogene in colorectal cancer progression.

Authors:  Florence L M de Groen; Oscar Krijgsman; Marianne Tijssen; Lianne E M Vriend; Bauke Ylstra; Erik Hooijberg; Gerrit A Meijer; Renske D M Steenbergen; Beatriz Carvalho
Journal:  Genes Chromosomes Cancer       Date:  2014-01-29       Impact factor: 5.006

7.  Overexpression of p27BBP in head and neck carcinomas and their lymph node metastases.

Authors:  Paola Rosso; Giorgio Cortesina; Francesca Sanvito; Alessandra Donadini; Barbara Di Benedetto; Stefano Biffo; Pier Carlo Marchisio
Journal:  Head Neck       Date:  2004-05       Impact factor: 3.147

8.  Colorectal adenoma to carcinoma progression follows multiple pathways of chromosomal instability.

Authors:  Mario Hermsen; Cindy Postma; Jan Baak; Marjan Weiss; Anna Rapallo; Andrea Sciutto; Guido Roemen; Jan-Willem Arends; Richard Williams; Walter Giaretti; Anton De Goeij; Gerrit Meijer
Journal:  Gastroenterology       Date:  2002-10       Impact factor: 22.682

9.  Extending the limits of quantitative proteome profiling with data-independent acquisition and application to acetaminophen-treated three-dimensional liver microtissues.

Authors:  Roland Bruderer; Oliver M Bernhardt; Tejas Gandhi; Saša M Miladinović; Lin-Yang Cheng; Simon Messner; Tobias Ehrenberger; Vito Zanotelli; Yulia Butscheid; Claudia Escher; Olga Vitek; Oliver Rinner; Lukas Reiter
Journal:  Mol Cell Proteomics       Date:  2015-02-27       Impact factor: 5.911

Review 10.  Colonoscopy Reduces Colorectal Cancer Incidence and Mortality in Patients With Non-Malignant Findings: A Meta-Analysis.

Authors:  Jun Pan; Lei Xin; Yi-Fei Ma; Liang-Hao Hu; Zhao-Shen Li
Journal:  Am J Gastroenterol       Date:  2016-01-12       Impact factor: 10.864

View more
  13 in total

1.  Other Types of Glycosylation.

Authors:  Yohei Tsukamoto; Hideyuki Takeuchi
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 3.650

Review 2.  Glycans that regulate Notch signaling in the intestine.

Authors:  Mohd Nauman; Pamela Stanley
Journal:  Biochem Soc Trans       Date:  2022-04-29       Impact factor: 4.919

3.  CREPT is required for murine stem cell maintenance during intestinal regeneration.

Authors:  Liu Yang; Haiyan Yang; Yunxiang Chu; Yunhao Song; Lidan Ding; Bingtao Zhu; Wanli Zhai; Xuning Wang; Yanshen Kuang; Fangli Ren; Baoqing Jia; Wei Wu; Xiongjun Ye; Yinyin Wang; Zhijie Chang
Journal:  Nat Commun       Date:  2021-01-11       Impact factor: 14.919

4.  CREPT/RPRD1B promotes tumorigenesis through STAT3-driven gene transcription in a p300-dependent manner.

Authors:  Wanli Zhai; Xiongjun Ye; Yinyin Wang; Yarui Feng; Ying Wang; Yuting Lin; Lidan Ding; Liu Yang; Xuning Wang; Yanshen Kuang; Xinyuan Fu; Y Eugene Chin; Baoqing Jia; Bingtao Zhu; Fangli Ren; Zhijie Chang
Journal:  Br J Cancer       Date:  2021-02-03       Impact factor: 7.640

5.  Fusion transcripts and their genomic breakpoints in polyadenylated and ribosomal RNA-minus RNA sequencing data.

Authors:  Youri Hoogstrate; Malgorzata A Komor; René Böttcher; Job van Riet; Harmen J G van de Werken; Stef van Lieshout; Ralf Hoffmann; Evert van den Broek; Anne S Bolijn; Natasja Dits; Daoud Sie; David van der Meer; Floor Pepers; Chris H Bangma; Geert J L H van Leenders; Marcel Smid; Pim J French; John W M Martens; Wilbert van Workum; Peter J van der Spek; Bart Janssen; Eric Caldenhoven; Christian Rausch; Mark de Jong; Andrew P Stubbs; Gerrit A Meijer; Remond J A Fijneman; Guido W Jenster
Journal:  Gigascience       Date:  2021-12-09       Impact factor: 6.524

6.  A panel of DNA methylation markers for the classification of consensus molecular subtypes 2 and 3 in patients with colorectal cancer.

Authors:  Inge van den Berg; Marcel Smid; Robert R J Coebergh van den Braak; Mark A van de Wiel; Carolien H M van Deurzen; Vanja de Weerd; John W M Martens; Jan N M IJzermans; Saskia M Wilting
Journal:  Mol Oncol       Date:  2021-09-30       Impact factor: 6.603

7.  Cytoplasmic eIF6 promotes OSCC malignant behavior through AKT pathway.

Authors:  Zechen Zhao; Weiming Chu; Yang Zheng; Chao Wang; Yuemei Yang; Teng Xu; Xueming Yang; Wei Zhang; Xu Ding; Gang Li; Hongchuang Zhang; Junbo Zhou; Jinhai Ye; Heming Wu; Xiaomeng Song; Yunong Wu
Journal:  Cell Commun Signal       Date:  2021-12-18       Impact factor: 5.712

8.  CRISPRs in the human genome are differentially expressed between malignant and normal adjacent to tumor tissue.

Authors:  Job van Riet; Chinmoy Saha; Nikolaos Strepis; Rutger W W Brouwer; Elena S Martens-Uzunova; Wesley S van de Geer; Sigrid M A Swagemakers; Andrew Stubbs; Yassir Halimi; Sanne Voogd; Arif Mohammad Tanmoy; Malgorzata A Komor; Youri Hoogstrate; Bart Janssen; Remond J A Fijneman; Yashar S Niknafs; Arul M Chinnaiyan; Wilfred F J van IJcken; Peter J van der Spek; Guido Jenster; Rogier Louwen
Journal:  Commun Biol       Date:  2022-04-08

9.  Genome-Wide Methylation Profiling of lncRNAs Reveals a Novel Progression-Related and Prognostic Marker for Colorectal Cancer.

Authors:  Shujuan Lin; Simeng Gu; Sangni Qian; Yaxin Liu; Jinghao Sheng; Qilong Li; Jinhua Yang; Xiaojiang Ying; Zhenjun Li; Mengling Tang; Jianbing Wang; Kun Chen; Mingjuan Jin
Journal:  Front Oncol       Date:  2022-01-20       Impact factor: 6.244

10.  RNA-sequencing identification and validation of genes differentially expressed in high-risk adenoma, advanced colorectal cancer, and normal controls.

Authors:  Namjoo Kim; Jeong-An Gim; Beom Jae Lee; Byung Il Choi; Seung Bin Park; Hee Sook Yoon; Sang Hee Kang; Seung Han Kim; Moon Kyung Joo; Jong-Jae Park; Chungyeul Kim; Han-Kyeom Kim
Journal:  Funct Integr Genomics       Date:  2021-07-17       Impact factor: 3.410

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.