The role of somatic copy number alterations (SCNAs) that occur in colorectal tumors is poorly understood. SCNAs are correlated with corresponding gene expression changes that may contribute to neoplastic progression. Thus, we examined SCNAs and the expression of messenger RNAs (mRNAs) located at corresponding loci in colorectal neoplasia, a progression model of human neoplasm. We used 42 colorectal neoplastic samples, including adenomas, intramucosal cancers (IMC) and invasive colorectal cancers (CRC) that were microsatellite stable (MSS) using a genome-wide SNP array and gene expression array (first cohort). In addition, validation analyses were examined (37 colorectal neoplasias). None of the mRNAs with a corresponding SCNA was found in the adenomas. However, three mRNAs, including ARFGEF2 at 20q13.13, N4BP2L2 at 13q13.1 and OLFM4 at 13q14.3 with a copy number (CN) gain at the corresponding locus were upregulated in IMCs of the first cohort. Moreover, upregulated expression of ARFGEF2 and OLFM4 was upregulated in the validation analysis. Finally, 28 mRNAs with gains of corresponding loci were pooled in invasive CRC of the first cohort. The mRNAs, including ACSS2 (20q11.22), DDX27 (20q13.13), MAPRE1 (20q11.21), OSBPL2 (20q11.22) and PHF20 (20q11.22-q11.23) with CN gains of the corresponding loci were identified in 28 mRNAs. Four of these mRNAs (DDX27, MAPRE1, OSBPL2 and PHF20) were upregulated in the invasive CRC in the validation analysis. We conclude that specific 13q and 22q CN gains with gene expression changes in the corresponding loci may play an important role in IMC cells' progression into invasive CRC.
The role of somatic copy number alterations (SCNAs) that occur in colorectal tumors is poorly understood. SCNAs are correlated with corresponding gene expression changes that may contribute to neoplastic progression. Thus, we examined SCNAs and the expression of messenger RNAs (mRNAs) located at corresponding loci in colorectal neoplasia, a progression model of human neoplasm. We used 42 colorectal neoplastic samples, including adenomas, intramucosal cancers (IMC) and invasive colorectal cancers (CRC) that were microsatellite stable (MSS) using a genome-wide SNP array and gene expression array (first cohort). In addition, validation analyses were examined (37 colorectal neoplasias). None of the mRNAs with a corresponding SCNA was found in the adenomas. However, three mRNAs, including ARFGEF2 at 20q13.13, N4BP2L2 at 13q13.1 and OLFM4 at 13q14.3 with a copy number (CN) gain at the corresponding locus were upregulated in IMCs of the first cohort. Moreover, upregulated expression of ARFGEF2 and OLFM4 was upregulated in the validation analysis. Finally, 28 mRNAs with gains of corresponding loci were pooled in invasive CRC of the first cohort. The mRNAs, including ACSS2 (20q11.22), DDX27 (20q13.13), MAPRE1 (20q11.21), OSBPL2 (20q11.22) and PHF20 (20q11.22-q11.23) with CN gains of the corresponding loci were identified in 28 mRNAs. Four of these mRNAs (DDX27, MAPRE1, OSBPL2 and PHF20) were upregulated in the invasive CRC in the validation analysis. We conclude that specific 13q and 22q CN gains with gene expression changes in the corresponding loci may play an important role in IMC cells' progression into invasive CRC.
copy numbercopy neutral loss of heterozygositycolorectal cancermessenger RNAmicrosatellite instabilitymicrosatellite stablesomatic copy number alteration
INTRODUCTION
Colorectal cancer (CRC) is one of the most common malignant neoplasms throughout the world.
Recent advances in detection and chemotherapeutic/biological agent‐based therapies have dramatically increased the survival rate of CRC patients.
Such advances in clinical medicine are based on noteworthy progress in basic cancer research, particularly molecular mechanism in colorectal pathogenesis. Despite such advances, CRC remains an uncontrollable disease.Cancer cells are characterized by cytogenetic alterations that can be used to define specific disease entities and their prognostic markers.
,
Previous studies showed that there are two molecular CRC subtypes, including chromosomal (CIN, or microsatellite stable MSS) and those with microsatellite instabilities (MIN; MSI). CIN is observed in 80% to 90% of sporadic CRCs; this alteration is closely associated with an accelerated rate of gains or losses of whole or partial portions of chromosomes that are detected as somatic copy number alterations (SCNA) that are identified using an array‐based analysis.
,
,
,
MIN is estimated to occur in approximately 10% to 20% of sporadic CRCs. An alternative molecular pathway of sporadic CRC is characterized by instability in DNA microsatellite sequences.
,
,
,Comprehensive CINs in CRC have been investigated by many researchers using either comparative genomic hybridization (CGH) or array‐based SNP arrays (SNP array).
,
,
These approaches have contributed to the detection of many chromosomal aberrations, including gains and losses, helping to evaluate complex disease progression.
SNP arrays may be suitable for gathering more detailed information regarding the possible underlying mechanism of chromosomal abnormalities and to evaluate genetic characteristics. In recent studies, it has been used in place of CGH arrays.
,
Accumulated chromosomal alterations involving key regulatory genes are essential genetic events in the initiation and progression of colorectal tumorigenesis.
,
,
Many alterations, such as SCNA can disrupt or change proper gene function.
Although there is wide agreement that chromosomal aberrations may involve genes that possess important function in tumor development,
,
,
such an assumption may not be necessarily true in chromosomal aberrations occurring in human neoplasia. Traditional theory suggests that tumor suppressor genes are inactivated by a loss of another chromosomal locus and that oncogenes are enhanced by gene amplification.
Thus, SCNA may be assumed to be accompanied by corresponding genetic alteration in colorectal tumors.
However, whether such functional abnormality is always apparent remains unknown, given that whether a change in SCNA does induce actual expression has not been identified in previous studies.
,
This issue might be further complicated by the finding that many chromosomal aberrations span large chromosomal regions that involve multiple genes.
,We set out to determine whether SCNAs occurring in tumor cells were involved with the altered expression of genes in affected chromosomal regions. To approach this problem, we focused on colorectal tumors, including adenoma, intramucosal cancer (IMC), and invasive CRC with an MSS phenotype that evolves through a multistep accumulation of molecular alterations, including genetic and epigenetic abnormalities.
MATERIALS AND METHODS
Patients
This study used 42 tumor samples and normal colonic mucosa that were obtained from 42 enrolled patients with 15 colorectal adenomas (including tubular and tubulovillous adenomas), eight patients with IMC and 19 patients with CRC that invaded into the muscular layer without metastasis. Adenoma and IMC were diagnosed according to the modified World Health Organization (WHO) 2019 criteria.
In addition, the adenomas that we examined included low (LGA) and high‐grade adenomas (HGA). Clinicopathological findings were recorded according to the General Rules for Management of the Japanese Colorectal Cancer Association.
Finally, 37 colorectal tumors including 15 adenomas, eight IMCs and 14 invasive CRCs were examined for validation analysis (second cohort). The clinicopathological findings are shown in Table 1.
TABLE 1
Clinicopathological findings of the colonic neoplasms in sampled cohorts
Clinicopathological findings of the colonic neoplasms in sampled cohortsAbbreviations: A, ascending colon; C, cecum; D, descending colon; MDA, moderately differentiated adenocarcinoma; MUC, mucinous adenocarcinoma; R, rectum; S, sigmoid colon; T, transverse colon.This study was approved by the local ethics committee of Iwate Medical University (approval number MH2020‐066), and all patients provided informed consent.
Crypt isolation method
Crypt isolation from the tumors and normal mucosa (distal site of the colon for surgical specimens) was performed as described elsewhere.
Tumor glands (adenomatous and intramucosal carcinomatous glands) were obtained from suspected target lesions as determined by magnification, whereas cancer glands in invasive CRC were taken from the invasive front. Briefly, fresh tissues were minced with a razor into small pieces and incubated at 37°C for 30 minutes in calcium‐ and magnesium‐free Hanks' balanced salt solution (CMF) containing 30 mM EDTA. The isolated crypts were immediately fixed in 70% ethanol and stored at 4°C until used for DNA/RNA extractions. The fixed isolated crypts were observed under a dissecting microscope (SZ60; Olympus, Tokyo, Japan). Some isolated crypts were routinely processed for histopathological analysis to confirm the histological nature of the isolated glands. Contamination, such as interstitial cells, was not evident in any of the 42 samples used for array‐based analysis and the 37 samples used for the validation test.
DNA extraction
For each patient, DNA was extracted from isolated tumors and normal glands using classical phenol‐chloroform extraction.
Analysis of MSI
The MSI status was determined using a consensus panel of five reference microsatellite markers (BAT25, BAT26, D2S123, D3S546 and D17S250) using a previously described method.
When no marker was altered, the tumors were defined as MSS. When only one marker was altered, the tumors were defined as low MSI. When two or more markers were altered, the tumors were defined as high MSI.
RNA EXTRACTION
Isolation of total RNA from cancer cells was performed using RNeasy Mini kits (Qiagen, Valencia, CA) in accord with the manufacturer's instructions. The nucleic acid concentration was determined using a Nanodrop1000 spectrophotometer (Thermo Fisher Scientific; Waltham, MA), and the RNA purity was verified using 1.5% denaturing agarose gels.
SNP array analysis
The Cytoscan HD (Affymetrix, UK) platform was used in the present study. This array contains more than 1.9 million non‐polymorphic markers and over 740 000 SNP markers with an average intragenic marker spacing of 880 bps and intergenic marker spacing of 1737 bps. These platforms are composed of microarrays containing non‐polymorphic probes for CNVs (copy number variations) from coding and noncoding regions of the human genome as well as polymorphic SNP probes. All procedures were performed as instructed by the manufacturer. The hybridized slides bearing DNA marked with biotin, were analyzed with a GeneChip Scanner 3000 7G (Affymetrix) and Chromosome Analysis Suite (ChAS) Software (Affymetrix). Detailed information for definition of abnormalities was previously described.
Classification of copy number alteration
In the present study, we classified SCNAs into five subtypes, including gain, loss of heterozygosity (LOH), copy neutral LOH (CN‐LOH), mosaic and mixed types, as previously reported.
Whereas LOH was considered a gross chromosomal change that results in loss of the entire gene and the surrounding region, gain was defined as a gross chromosomal change that was caused by a gain of the entire gene and the surrounding region. CN‐LOH was defined as an occurrence of LOH in the absence of the allelic loss (copy number [CN] ≧ 2). A mosaic pattern was defined as a mixture of normal and abnormal cells with SCNAs. Finally, a mixed pattern was a mixture of >2 SCNA patterns within one locus, such as LOH and LOH mosaic, or gain and LOH or gain and gain mosaic.In the present study, a mosaic LOH was considered to be a LOH, whereas mosaic gain was regarded as a gain. In addition, a mixed pattern was classified into dominant patterns, such as LOH>gain, LOH; gain>LOH, gain.
CLARIOM S HUMAN ARRAY AND GENE EXPRESSION ANALYSIS
For each array experiment, 500 ng of total RNA was used for labeling using the Clariom S Human Array (Thermo Fisher Scientific). This array contains 21 453 messenger RNAs (mRNAs). Probe labeling, chip hybridization and scanning were performed according to the manufacturer's instructions. A Probe Set (gene‐exon) was considered expressed if ≥50% of the samples were detected above background (DABG) values below the DABG threshold (DABG <0.05). The array data were generated using the Transcriptome Analysis Console (TAC version 4.0.1.36) and analyzed with the Affymetrix Chromosome Analysis Suite v.4.1 (ChAS) (Affymetrix Inc.,).To validate microarray results, quantitative reverse transcription polymerase chain reaction (qRT‐PCR) was performed. One microgram of total RNA was reverse transcribed to first‐strand cDNA with the Qiagen cDNA Synthesis Kit (Qiagen), and this cDNA was subsequently used as a template for quantitative PCR with gene‐specific primers. The ubiquitous β‐actin gene served as a control for constitutive gene expression. qRT‐PCR reactions were performed using Power SYBR Green PCR Master Mix (Thermo Fisher Scientific) on a CFX96 Touch Real‐Time PCR Detection System (Bio‐Rad, Hercules, CA). Relative expression levels (2−Δ) were calculated according to the Livak and Schmittgen method.
Expression levels of each gene were compared with the expression level of actin.
Statistical analysis
The differences between SCNAs with an upregulated or downregulated corresponding gene expression level (including gain and non‐gain) were assessed with a Fisher's exact test. In addition, the differences in the frequencies of genotype (ex ARFGEF2/CN gain) between adenoma, IMC and invasive CRC were also assessed with a Fisher's exact test with an adjusted Bonferroni correction. If statistically significant differences among the three lesion types were detected, comparisons between two groups were performed using a Fisher's exact test. Differences in SCNA number including gain, LOH and CN‐LOH among the groups were evaluated using the Kruskal‐Wallis H test in Stat Mate‐III. A P‐value <.05 was accepted as significant. If statistical differences among the three lesion types were detected, comparisons between two groups were performed using the same method (Kruskal‐Wallis H test). Final statistical significance was determined after two group comparison. As a result, a P‐value <.05 was accepted as significant.
Workflow (experimental design)
The workflow used in the present study is summarized in Figure 1.
FIGURE 1
Workflow of experimental design. First, somatic copy number alteration (SCNA) was examined using a Cytoscan HD. Second, messenger RNAs (mRNA) expression was investigated according to the following criteria: fold‐change in expression (<−1.5 or >1.5) and P < .05 using a Clariom S Human Array, compared with that of normal gland. Third, SCNAs with altered gene expression were pooled in adenoma, intramucosal cancers (IMC) and colorectal cancer (CRC). Finally, the association of selected mRNAs with corresponding gene expression was validated
Workflow of experimental design. First, somatic copy number alteration (SCNA) was examined using a Cytoscan HD. Second, messenger RNAs (mRNA) expression was investigated according to the following criteria: fold‐change in expression (<−1.5 or >1.5) and P < .05 using a Clariom S Human Array, compared with that of normal gland. Third, SCNAs with altered gene expression were pooled in adenoma, intramucosal cancers (IMC) and colorectal cancer (CRC). Finally, the association of selected mRNAs with corresponding gene expression was validated
RESULTS
Microsatellite analysis of colorectal adenoma, IMC and invasive CRC
All tumors were classified into an MSS phenotype according to criteria previously reported.
mRNA expression profiling in colorectal tumors
We sought to identify potential mRNA biomarkers in 42 colorectal neoplasias, including 15 adenomas, eight IMCs and 19 invasive CRCs. We used global mRNA expression profiling in the analysis of the 42 colorectal neoplasias and compared them with normal gland samples as follows. First, we examined the difference in the expression levels of mRNA between isolated normal and neoplastic glands using a t test (P value) with an adjusted Benjamini‐Hochberg method/False discovery rate (FDR) correction. Second, such mRNAs were selected and grouped. As a result, we identified 31 differentially (upregulated) mRNAs in adenomas (Table 2). Using a similar method, three upregulated and five downregulated mRNAs were grouped in Table 3. Third, 29 mRNAs were groped in invasive CRC (14 upregulated and 15 downregulated; Table 5). Finally, individual data pooled in colorectal adenoma, IMC and invasive CRC are shown in Table S1
(
t test with FDR correction).
TABLE 2
Associated expression of mRNA with copy number alteration in conventional adenoma
Official symbol
Location
Up‐regulated case (n = 15) (%)
Pattern of copy number alteration (n = 15) (%)
P‐value
Gain
Non‐gain
MIR3654
7q33
10 (66.7)
4 (26.7)
11 (73.3)
.5604
OLFM4
13q14.3
7 (46.7)
3 (20)
12 (80)
.0769
GNAS
20q13.32
9 (60)
2 (13.3)
13 (86.7)
1.000
RPS21
20q13.33
8 (53.3)
2 (13.3)
13 (86.7)
1.000
RPL41
12q13.2
10 (66.7)
1 (6.7)
14 (93.3)
1.000
CEACAM5
19q13.2
9 (60)
1 (6.7)
14 (93.3)
1.000
PIGR
1q32.1
11 (73.3)
0 (0)
15 (100)
1.000
TMSB10
2p11.2
8 (53.3)
0 (0)
15 (100)
1.000
SPTBN1
2p16.2
8 (53.3)
0 (0)
15 (100)
1.000
EPCAM
2p21
8 (53.3)
0 (0)
15 (100)
1.000
ITM2C
2q37.1
8 (53.3)
0 (0)
15 (100)
1.000
CAST
5q15
12 (80)
0 (0)
15 (100)
1.000
SLC12A2
5q23.3
7 (46.7)
0 (0)
15 (100)
1.000
RACK1
5q35.3
8 (53.3)
0 (0)
15 (100)
1.000
SNORD95
5q35.3
8 (53.3)
0 (0)
15 (100)
1.000
SNORD96A
5q35.3
8 (53.3)
0 (0)
15 (100)
1.000
H4C3
6p22.2
9 (60)
0 (0)
15 (100)
1.000
EEF1A1
6q13
11 (73.3)
0 (0)
15 (100)
1.000
EEF1G
11q12.3
10 (66.7)
0 (0)
15 (100)
1.000
RPS3
11q13.4
10 (66.7)
0 (0)
15 (100)
1.000
SNORD15A
11q13.4
10 (66.7)
0 (0)
15 (100)
1.000
HSPA8
11q24.1
13 (86.7)
0 (0)
15 (100)
1.000
SNORD14C
11q24.1
13 (86.7)
0 (0)
15 (100)
1.000
SNORD14D
11q24.1
13 (86.7)
0 (0)
15 (100)
1.000
RPS28
19p13.2
10 (66.7)
0 (0)
15 (100)
1.000
CSTB
21q22.3
8 (53.3)
0 (0)
15 (100)
1.000
RPL3
22q13.1
12 (80)
0 (0)
15 (100)
1.000
SNORD43
22q13.1
12 (80)
0 (0)
15 (100)
1.000
SNORD83B
22q13.1
12 (80)
0 (0)
15 (100)
1.000
MT‐CO1
MT
9 (60)
0 (0)
15 (100)
1.000
MT‐CO2
MT
8 (53.3)
0 (0)
15 (100)
1.000
Abbreviations: CN‐LOH, copy neutral loss of heterozygosity; LOH, loss of heterozygosity.
TABLE 3
Associated increased expression level of mRNA with copy number alteration in intramucosal cancer
Official symbol
Location
Up‐regulated case (n = 8) (%)
Pattern of copy number alteration (n = 8) (%)
P‐value
Gain
Non‐gain
N4BP2L2
13q13.1
6 (75)
6 (75)
2 (25)
.0357
OLFM4
13q14.3
6 (75)
6 (75)
2 (25)
.0357
ARFGEF2
20q13.13
6 (75)
6 (75)
2 (25)
.0357
Abbreviations: CN‐LOH, copy neutral loss of heterozygosity; LOH, loss of heterozygosity.
Associated expression of mRNA with copy number alteration in conventional adenomaAbbreviations: CN‐LOH, copy neutral loss of heterozygosity; LOH, loss of heterozygosity.Associated increased expression level of mRNA with copy number alteration in intramucosal cancerAbbreviations: CN‐LOH, copy neutral loss of heterozygosity; LOH, loss of heterozygosity.Associated decreased expression level of mRNA with copy number alteration in intramucosal cancerAbbreviations: CN‐LOH, Copy neutral loss of heterozygosity; LOH, loss of heterozygosity.
SCNAs in conventional adenoma, IMC and CRC
In CRC, the mean total number of chromosomal aberrations per patient was 516, with an average of 315 gains (range: 30‐644), 136 LOHs (range: 4‐480) and 66 copy‐neutral LOHs (range: 12‐185). In conventional adenoma, the mean total number of chromosomal aberrations per patient was 75, with an average of 35 gains (range: 1‐103), 18 LOHs (range: 10‐33) and 22 copy‐neutral LOHs (range: 8‐50). Finally, in IMC, the mean total number of chromosomal aberrations per patient was 337, with an average of 246 gains (range: 14‐641), 68 LOHs (rang: 29‐181) and 23 copy‐neutral LOHs (range: 4‐43). There were significant differences in the median values in the total numbers of SCNAs, gains and LOHs between adenoma and IMC or invasive CRC (P < .01). Statistically significant differences in the median number of CN‐LOH between invasive CRC and adenoma or IMC were found (Figure 2).
FIGURE 2
Total number of somatic copy number alterations (SCNAs), gains, loss of heterozygosity (LOHs) and copy neutral loss of heterozygosity (CN‐LOHs) in colorectal adenoma, intramucosal cancers (IMC) and colorectal cancer (CRC)
Total number of somatic copy number alterations (SCNAs), gains, loss of heterozygosity (LOHs) and copy neutral loss of heterozygosity (CN‐LOHs) in colorectal adenoma, intramucosal cancers (IMC) and colorectal cancer (CRC)Regions of SCNAs, including gain, LOH and copy neutral LOH detected in more than 30% of cases were pooled in adenoma, IMC and invasive CRC. These SCNAs are listed in Table S2. In addition, the ideograms of adenoma, IMC and invasive CRC are depicted in Figure S1. Finally, a systematic analysis of CN gains and losses and altered regulation of corresponding mRNAs is shown in Table S3.
Integrated genome and transcriptome analysis
To identify the correlation of SCNA status with corresponding gene expression, we utilized a statistical approach. First, each mRNA and corresponding chromosomal location (eg, DDX27 located at 20q13.13) was explored in our three categories (colorectal adenoma, IMC and invasive CRC). Second, we examined the association of the corresponding chromosomal location with SCNA, including gain and non‐gain in adenoma, IMC and invasive CRC (eg, SCNA at 20q13.13 was a CN gain). In addition, we determined whether the SCNA pattern was correlated with a change in the corresponding gene expression level (upregulated or downregulated) using a Fisher's exact test (Tables 2, 3, 4, 5, 6). Finally, we showed that different patterns of SCNAs could be demonstrated, using CN levels and gene expression levels (Figure 3).
TABLE 4
Associated decreased expression level of mRNA with copy number alteration in intramucosal cancer
Official symbol
Location
Down‐regulated case (n = 8) (%)
Pattern of copy number alteration (n = 9) (%)
P‐value
LOH
Non‐LOH (gain)
FABP1
2p11.2
5 (62.5)
0 (0)
8 (100)
1.000
GSKIP
14q32.2
4 (50)
2 (25)
6 (75)
.4286
SELENOH
11q12.1
3 (37.5)
0 (0)
8 (100)
1.000
TMX2
11q12.1
3 (37.5)
1 (12.5)
7 (87.5)
1.000
XKRY2
Yq11.222
3 (37.5)
1 (12.5)
7 (87.5)
.375
Abbreviations: CN‐LOH, Copy neutral loss of heterozygosity; LOH, loss of heterozygosity.
TABLE 5
Associated increased expression level of mRNA with copy number alteration in CRC
Official symbol
Location
Up‐regulated case (n = 19) (%)
Pattern of copy number alteration (n = 19) (%)
P‐value
Gain
Non‐gain
DDX27
20q13.13
18 (94.7)
18 (94.7)
1 (5.3)
.0326
MAPRE1
20q11.21
15 (79)
17 (89.5)
2 (10.5)
.0351
ACSS2
20q11.22
15 (79)
17 (89.5)
2 (10.5)
.0351
PHF20
20q11.22‐q11.23
16 (84.2)
17 (89.5)
2 (10.5)
.0175
OSBPL2
20q13.33
15 (79)
17 (89.5)
2 (10.5)
.0351
DHX35
20q11.23‐q12
10 (52.6)
17 (89.5)
2 (10.5)
1.000
SEMG1
20q13.12
9 (47.4)
17 (89.5)
2 (10.5)
.4737
ZFP64
20q13.2
12 (63.2)
17 (89.5)
2 (10.5)
.1228
CAB39L
13q14.2
9 (47.4)
16 (84.2)
3 (15.8)
.2105
DACH1
13q21.33
12 (63.2)
16 (84.2)
3 (15.8)
.5232
TPX2
20q11.21
12 (63.2)
16 (84.2)
3 (15.8)
.5232
DEFB126
20p13
12 (63.2)
12 (63.2)
7 (36.8)
.3261
ODF1
8q22.3
11 (57.9)
11 (57.9)
8 (42.1)
.1809
VIRMA
8q22.1
11 (57.9)
0 (0)
19 (100)
1.000
Abbreviations: CN‐LOH, Copy neutral loss of heterozygosity; LOH, loss of heterozygosity.
TABLE 6
Associated decreased expression level of mRNA with copy number alteration in CRC
Official symbol
Location
Down‐regulated case (n = 19) (%)
Pattern of copy number alteration (n = 19) (%)
P‐value
LOH
Non‐LOH
CNDP2
18q22.3
11 (57.9)
12 (63.2)
7 (36.8)
.0063
TSPAN1
1p34.1
15 (79)
6 (31.6)
13 (68.4)
1.000
SH3BGRL3
1p36.11
8 (42.1)
6 (31.6)
13 (68.4)
1.000
B2M
15q21.1
11 (57.9)
6 (31.6)
13 (68.4)
.3189
PIGR
1q32.1
16 (84.2)
3 (15.8)
16 (84.2)
1.000
MS4A12
11q12.2
12 (63.2)
3 (15.8)
16 (84.2)
1.000
MUC2
11p15.5
16 (84.2)
2 (10.5)
17 (89.5)
.2982
SLC26A3
7q22.3‐q31.1
13 (68.4)
1 (5.3)
18 (94.7)
.3158
FABP1
2p11.2
17 (89.5)
1 (5.3)
18 (94.7)
.1053
CEACAM7
19q13.2
16 (84.2)
1 (5.3)
18 (94.7)
1.000
LGALS4
19q13.2
14 (73.7)
1 (5.3)
18 (94.7)
1.000
ITM2C
2q37.1
18 (94.7)
1 (5.3)
18 (94.7)
1.000
CD63
12q13.2
11 (57.9)
1 (5.3)
18 (94.7)
.4211
CA1
8q21.2
19 (100)
0 (0)
19 (100)
1.000
ZG16
16p11.2
16 (84.2)
0 (0)
19 (100)
1.000
Abbreviations: CN‐LOH, Copy neutral loss of heterozygosity; LOH, loss of heterozygosity.
FIGURE 3
Expression levels of messenger RNAs (mRNAs) pooled in array‐based tests in colorectal adenoma, intramucosal cancer and colorectal cancer. Different patterns of somatic copy number alteration (SCNAs) are marked by different colors connecting copy number levels and gene expression levels. Red, gain; Blue, loss of heterozygosity (LOH), green, copy neutral LOH and black, no alteration
Expression levels of messenger RNAs (mRNAs) pooled in array‐based tests in colorectal adenoma, intramucosal cancer and colorectal cancer. Different patterns of somatic copy number alteration (SCNAs) are marked by different colors connecting copy number levels and gene expression levels. Red, gain; Blue, loss of heterozygosity (LOH), green, copy neutral LOH and black, no alterationAnalysis of the SCNA data and mRNA expression data revealed that none of the SCNAs had a correspondingly expressed gene in adenoma (Table 2) (Fisher's exact test). In IMC, we observed eight SCNAs (gains) with altered expression of genes, and three were correlated with upregulated expression of genes (Fisher's exact test). They included the following: ARFGEF2 (assay ID: Hs01573093_m1, ADP Ribosylation Factor Guanine Nucleotide Exchange Factor 2 at 20q13.13), N4BP2L2 (assay ID: Hs00263840_m1, neural precursor cell expressed developmentally down‐regulated protein 4 binding protein 2 Like 2 at 13q1.1) and OLFM4 (olfactomedin; assay ID: Hs00197437_m1 at 13q14.3) (Table 3). Next, we assessed the correlation between SCNA and the corresponding expression of mRNA in invasive CRC (Fisher's exact test). As a result, six mRNAs were identified out of 29 mRNAs in which a statistical change in expression level was found in invasive CRC, compared with that of normal gland (Fisher's exact test). They included the following six genes: ACSS2 (assay ID: Hs01120914_m1; Acyl‐coenzyme A synthetase short‐chain family member 2, 20q11.22); DDX27 (assay ID: Hs00916726_g1; DEAD‐Box Helicase 27, 20q13.13); MAPRE1 (assay ID: Hs01121100_g1; Microtubule Associated Protein RP/EB Family Member 1, 20q11.21); OSBPL2 (assay ID: Hs00375881_m1; oxysterol binding protein like 2, 20q11.22); PHF20 (PHD (assay ID: Hs00916778_m1; Plant Homeodomain Finger Protein 20, 20q11.22‐q11.23) (Table 5); and CNDP2 (Cytosolic non‐specific dipeptidase 2) However, expression of CNDP2 could not be examined due to the limited availability of RNA from isolated neoplastic glands. To summarize, SCNA gains correlated with gene expression in array‐based data.Associated increased expression level of mRNA with copy number alteration in CRCAbbreviations: CN‐LOH, Copy neutral loss of heterozygosity; LOH, loss of heterozygosity.Associated decreased expression level of mRNA with copy number alteration in CRCAbbreviations: CN‐LOH, Copy neutral loss of heterozygosity; LOH, loss of heterozygosity.Comparison of specific mRNAs with corresponding SCNA gains between conventional adenoma, intramucosal cancer and CRC in array‐based dataAbbreviations: CN, copy number; CRC, colorectal cancer; SCNA, somatic copy number alteration.*Bonferroni corrected P‐value = .0009; **Bonferroni corrected P‐value = .0003; ***Bonferroni corrected P‐value <.0001.We defined CN gain with corresponding gene expression, including ARFGEF2, N4BP2L2, OLFM, ACSS2, DDX27, MARRE1, OSBPL2 and PHF204 as a specific genotype (eg, termed upregulated ARFGEF2/CN gain, etc.). There were significant differences in the frequencies of the specific genotypes among the three lesions (adenoma, IMC and invasive CRC) using a Fisher's exact test analysis with Bonferroni correction. We found significant differences in the frequencies of three specific genotypes, including upregulated ARFGEF2/CN gain, upregulated N4BP2L2/CN gain and upregulated OLFM4/CN gain in comparisons of adenoma vs IMC, adenoma vs invasive CRC and IMC vs invasive CRC. In addition, the frequencies of upregulated ACSS2/CN gain, DDX27/CN gain, MARRE1/CN gain, OSBPL2/CN gain and PHF20/CN gain genotypes were statistically higher in invasive CRC than in adenoma or IMC using the same statistical method. The results are depicted in Table 7.
TABLE 7
Comparison of specific mRNAs with corresponding SCNA gains between conventional adenoma, intramucosal cancer and CRC in array‐based data
None of the SCNAs had a correspondingly expressed gene in adenomas in the second cohort. In the validation cohort for IMC, the median values of the expression levels of ARFGEF2 and OLFM4 with CN gains of the corresponding loci (20q13.13 and 13q14.3, respectively) were upregulated. However, there was no association between the expression level of N4BPP2L2 and CN gain of 13q13.1 (Figure 4A). In addition, the median values of 4 mRNAs (DDX27 (20q13.13), MAPRE1 (20q11.21), OSBPL2 (20q11.22) and PHF20 (20q11.22‐q11.23) were upregulated in invasive CRC. However, upregulation of ACSS2 (20q11.22)) was not observed in invasive CRC (Figure 4B). Finally, SCNAs of IMC and invasive CRC used for the second cohort (validation cohort) were examined using the same method used in the first cohort. SCNAs of 20q13.13 and 13q14.3 with expression of corresponding genes including ARFGEF2 and OLFM4 were found in the second cohort (using a one‐sided t test). In addition, SCNAs of 20q13.13, 20q11.21, 20q11.22 and 21q11.22‐q11.23, which included expression of genes DDX27, MAPRE1, OSBPL2 and PHF20, were observed in the second cohort (using a one‐sided t test) (Table S4).
FIGURE 4
Expression levels of validated messenger RNAs (mRNAs) in intramucosal cancers (IMC) and invasive colorectal cancer. Different patterns of somatic copy number alteration (SCNAs) are marked by different colors connecting copy number levels and gene expression levels. A, Expression levels of mRNAs for ARFGF2, N4BP2L2 and OLFM4 in IMC. B, Expression level of messenger RNAs of ACSS2, DDX27, MAPRE1, OSBPL2 and PHF20 in colorectal cancer with a microsatellite stable phenotype. Red, gain; blue and black, no alteration
Expression levels of validated messenger RNAs (mRNAs) in intramucosal cancers (IMC) and invasive colorectal cancer. Different patterns of somatic copy number alteration (SCNAs) are marked by different colors connecting copy number levels and gene expression levels. A, Expression levels of mRNAs for ARFGF2, N4BP2L2 and OLFM4 in IMC. B, Expression level of messenger RNAs of ACSS2, DDX27, MAPRE1, OSBPL2 and PHF20 in colorectal cancer with a microsatellite stable phenotype. Red, gain; blue and black, no alterationTaken together, two genotypes (upregulated ARFGEF2/CN gain and OLFM4/CN gain) were retained in IMC. Moreover, four genotypes, including upregulated DDX27/CN gain, MAPRE1/CN gain, OSBPL2/CN gain and PHF20/CN gain were sustained in invasive CRC (Table S4).
DISCUSSION
Our crypt isolation method enabled us to obtain pure tumor tissues and normal glands.
,
We used an array‐based method that is necessary to evaluate comprehensive genome‐wide alterations of a given tumor.
,
Therefore, crypt isolation could certify the origin of DNA or RNA extracted from epithelial tumor cells. This finding is important for the identification of the association between SCNA and expression of mRNA using human tumor tissue that is composed of both tumor cells and interstitial cells. However, different handling of different tissue specimens might also explain the differences in mRNA expression profiles. Moreover, the expression profile of mRNA may depend on sampling sites and differences between organs. Thus, the properties of isolated tumor glands could be influenced by such differences. In our experience, however, the crypt isolation method could be simply and reproducibly performed in any organ and sample site.The most common changes in CRC are gains at 7p, 7q, 13q, 20p, 20q, Xp and Xq and losses at 8p, 17p, 18p and 18q.
Gains in chr8, chr10 and 21q were found more frequently in primary tumors, and gains in chr14, chr3p and chr18q were related to metastatic tumors.
,
Another study identified deletions from chr1p, 17p and 18q and gains in chr7 and 13q in metastatic CRC. Whereas those associations deserve further validation in larger populations,
those findings are consistent with our data. What our study emphasizes in particular is that SCNA by itself does not play a critical role in colorectal carcinogenesis. Rather, a specific genotype (eg, a CN gain with high expression of the corresponding gene) may contribute to neoplastic progression of colorectal neoplasia. This finding is consistent with the hypothesis that SCNA‐affected genes are closely associated with signaling pathways, pathological features, and therapeutic targets in CRC.
This understanding may facilitate the development of more effective molecular approaches and the evaluation of colorectal carcinogenesis when it is viewed as a heterogeneous disease.Recent studies revealed that colorectal adenoma has a low level of SCNAs, compared with carcinomas.
,
By contrast, not only advanced adenomas but also early adenomas have SCNAs comparable to that of carcinomas as shown in a previous report.
Whether such SCNAs are correlated with the expression of corresponding mRNAs has remained unclear. However, we did not find the correlated expression of mRNAs with corresponding SCNAs. This finding is interesting because the expression of mRNA does not depend on SCNA in adenoma, suggesting that mechanisms other than SCNA, such as epigenetic alteration or unknown enhancers, play a major role in changing the expression of mRNA in adenoma cells. Collectively, the data suggest that SCNAs without functional expression of mRNA cannot progress from adenomatous cells into carcinomatous cells.With regard to the IMCs that we examined here (interpreted as high‐grade colorectal adenomas in the West), it would be beneficial to identify them early in progression
,
because they are intermediates between adenomas and invasive CRCs. In the present study, the number of mRNAs that showed altered (upregulated/downregulated) expression of mRNA compared with those in normal crypts was lower in IMC than in adenoma. However, upregulated expression of mRNAs with CN gain, that is, ARFGEF2 (20q13.13) and OLFM4 (13q14.3) was commonly found in both the first and second cohorts. This finding suggests that those genes located at 20q13.13 and 13q14.3, respectively, are upregulated by CN gains in IMCs and may play important roles in the development of IMC. ARFGEF2, which is involved in Golgi transport, is closely associated with intracellular vesicular trafficking and is involved in the activation of ARFs by accelerating the replacement of bound GDP with GTP.
,
,
In addition, it may be responsible for guanine‐nucleotide exchange activity and also inhibition of brefeldin A, which inhibits protein transport from the endoplasmic reticulum to the golgi complex.
However, the genetic role of this gene in the development of CRC remains unknown. In addition, OLFM4 is selectively expressed in inflamed colonic epithelium and is an anti‐apoptotic factor that promotes tumor growth and is an extracellular matrix glycoprotein that facilitates cell adhesion.
,
High levels of expression of OLFM4 are found in CRC, suggesting that OLFM4 is a candidate diagnostic marker of CRC.
,
Moreover, this protein is a stem cell marker and potential biomarker of early CRC that involves IMC, supporting our finding.
,
We suggest that the current finding is novel in that the two genotypes, including upregulated ARFGEF2/CN gain and upregulated OLFM4/CN gain, may be responsible for the adenoma to cancer progression.Genes with causal roles in cancers are often found in amplified genomic regions, whereas tumor suppressors may be embedded in deleted genomic segments.
,
,
In many cases, high‐level amplification or homozygous deletion may be necessary for the onset of neoplastic growth or invasion due to the requirement for activation of oncogenes or inactivation of tumor suppressor genes.
In the present study, we selected five mRNAs that were correlated with gains of their corresponding genes as identified in both first and second (validation) cohorts. Among them, four mRNAs including DDX27 (20q13.13), MAPRE1 (20q11.21), OSBPL2 (20q11.22) and PHF20 (20q11.22‐q11.23) were ultimately upregulated in invasive CRC. DDX27, which is associated with stem cell activity, was previously reported to be upregulated in CRC.
,
This is an interesting finding in that stem cell activity may play an important role in carcinogenesis. A recent study has shown that DDX27 promotes CRC growth and metastasis.
Furthermore, high expression of DDX27 predicts a poorer prognosis of CRC patients.
Upregulation of MAPRE1, which was associated with a gain (20q11.21), was found in gastric cancer.
,
Furthermore, overexpression of MAPRE1 was also observed in CRC.
Therefore, this molecule may be useful as a differential marker of colorectal adenoma from CRC. In addition, high expression of OSBPL2 was associated with a gain at this gene locus. OSBPL2 is associated with the p53/SREBF2/NFYA signal pathway.
OSBPL2 might be induced via this pathway and serve as a cancer‐induced molecule. Finally, expression of PHF20 which is associated with the p53 signal pathway has not been reported in CRC, but might be associated with its development. We suggest that four specific genotypes, including DDX27/CN gain, MAPRE1/CN gain, OSBPL2/CN gain and PHF20/CN gain, may be major molecular events in the progression to CRC.There are some limitations to this study. First, heterogeneous expression of mRNAs might be essential to identify the roles of those that are highly expressed in CRC.
In the present study, we used a single sample from the invasive front of CRC. Therefore, heterogeneous expression of mRNAs could not be examined. The same would apply to heterogeneous occurrence of SCNA within the same tumor. Heterogeneity of expression of mRNAs and occurrence of SCNAs will be investigated in the near future. Second, the number of study participants may have been too small to identify the association of mRNA expression levels with SCNAs. Larger studies, such as that made possible by the Cancer Genome Alas (TCGA) and consensus molecular subtype (CMS) have already been performed in CRC.
,
,
However, we believe that the results of the present study contain new data.
,
,
For example, crypt isolation is a useful method to obtain target cells for molecular analysis and that method was utilized here as it permits the purification of neoplastic cells. In contrast, that approach was not utilized in TCGA. This difference is essential in understanding the differences between our data and those in TCGA. Moreover, early colorectal lesions, including colorectal adenoma and intramucosal cancer were target lesions in the present study. This point is crucial in explaining the unique aspects of the present study. Thus, we suggest that the current study provides new insight into the evaluation of colorectal carcinogenesis based on the “adenoma‐carcinoma sequence.” Finally, we could not make a meaningful association between SCNA and corresponding altered gene expression levels between LGA and HGA or HGA and IMC due to the small number of HGA cases. Larger studies will be needed to identify that association.In conclusion, our results suggest that SCNAs with altered expression of corresponding cancer‐related genes could enhance the progression of cancer cells into a more pathologic state. This finding supports a hypothesis in which SCNAs with upregulated/downregulated expression of corresponding genes in cancer cells may be required for the progression of CRC. Here, specific genotypes that were closely associated with neoplastic progression occurring in IMC were different from those of invasive CRC. This finding suggests that clonal exchange might occur during neoplastic progression from intramucosal cancer to invasive CRC.
CONFLICT OF INTEREST
The authors declare no potential conflict of interest.
AUTHOR CONTRIBUTIONS
Tamotsu Sugai, who is first author and the corresponding author, contributed to the preparation of the manuscript, including all aspects of the data collection and analysis. Mitsumasa Osakabe and Ryo Sugimoto performed all data collection and statistical analyses. Makoto Eizuka and Yoshihito Tanaka performed the collection of isolated tumors and normal glands. Naoki Yanagawa supported data preparation. Koki Otsuka, Akira Sasaki and Takayuki Matsumoto provided clinical support during the preparation of the manuscript. Hiromu Suzuki assisted the molecular analyses.Figure S1 Ideogram of copy number alterations in 3 lesions: adenoma, IMC and invasive CRC in cohort 1. Chromosomes are ordered from 1 to 22. The colored horizontal lines represent the frequencies of gains, LOHs and CNLOHs. Lines on the left indicate losses (red, copy neutral LOH; gray, LOH), and those on the right (green) indicate gains.Click here for additional data file.Table S1 Table 1‐a Significant differential expression of each mRNA in conventional adenomaTable S1‐b Significant differential expression of each mRNA in intramucosal cancerTable S1‐c Significant differential expression of each mRNA in CRCTable S2 Frequent CNA regions in conventional adenoma, intramucosal cancer and CRC with an MSS phenotypeTable S3‐a Association of expression of mRNA and copy number alteration in conventional adenoma in first cohortTable S3‐b Association of expression of mRNA and copy number alteration in intramucosal cancer in first cohortTable S3‐c Association of expression of mRNA and copy number alteration in CRC in first cohortTable S4 Association of SCNAs with expression of corresponding genes in validation cohort.Click here for additional data file.
Authors: Laurens G van der Flier; Andrea Haegebarth; Daniel E Stange; Marc van de Wetering; Hans Clevers Journal: Gastroenterology Date: 2009-05-18 Impact factor: 22.682
Authors: Richard Redon; Shumpei Ishikawa; Karen R Fitch; Lars Feuk; George H Perry; T Daniel Andrews; Heike Fiegler; Michael H Shapero; Andrew R Carson; Wenwei Chen; Eun Kyung Cho; Stephanie Dallaire; Jennifer L Freeman; Juan R González; Mònica Gratacòs; Jing Huang; Dimitrios Kalaitzopoulos; Daisuke Komura; Jeffrey R MacDonald; Christian R Marshall; Rui Mei; Lyndal Montgomery; Kunihiro Nishimura; Kohji Okamura; Fan Shen; Martin J Somerville; Joelle Tchinda; Armand Valsesia; Cara Woodwark; Fengtang Yang; Junjun Zhang; Tatiana Zerjal; Jane Zhang; Lluis Armengol; Donald F Conrad; Xavier Estivill; Chris Tyler-Smith; Nigel P Carter; Hiroyuki Aburatani; Charles Lee; Keith W Jones; Stephen W Scherer; Matthew E Hurles Journal: Nature Date: 2006-11-23 Impact factor: 49.962
Authors: C R Boland; S N Thibodeau; S R Hamilton; D Sidransky; J R Eshleman; R W Burt; S J Meltzer; M A Rodriguez-Bigas; R Fodde; G N Ranzani; S Srivastava Journal: Cancer Res Date: 1998-11-15 Impact factor: 12.701
Authors: Justin Guinney; Rodrigo Dienstmann; Xin Wang; Aurélien de Reyniès; Andreas Schlicker; Charlotte Soneson; Laetitia Marisa; Paul Roepman; Gift Nyamundanda; Paolo Angelino; Brian M Bot; Jeffrey S Morris; Iris M Simon; Sarah Gerster; Evelyn Fessler; Felipe De Sousa E Melo; Edoardo Missiaglia; Hena Ramay; David Barras; Krisztian Homicsko; Dipen Maru; Ganiraju C Manyam; Bradley Broom; Valerie Boige; Beatriz Perez-Villamil; Ted Laderas; Ramon Salazar; Joe W Gray; Douglas Hanahan; Josep Tabernero; Rene Bernards; Stephen H Friend; Pierre Laurent-Puig; Jan Paul Medema; Anguraj Sadanandam; Lodewyk Wessels; Mauro Delorenzi; Scott Kopetz; Louis Vermeulen; Sabine Tejpar Journal: Nat Med Date: 2015-10-12 Impact factor: 53.440
Authors: Wang Xiaoqian; Zhang Bing; Li Yangwei; Zhi Yafei; Zhang Tingting; Wang Yi; Li Qingjun; Luo Suxia; Zhang Ling; Wang Bo; Zheng Peng Journal: Technol Cancer Res Treat Date: 2021 Jan-Dec