| Literature DB >> 29968252 |
Malgorzata A Komor1,2, Linda Jw Bosch1, Gergana Bounova3, Anne S Bolijn1, Pien M Delis-van Diemen1, Christian Rausch1, Youri Hoogstrate4, Andrew P Stubbs5, Mark de Jong6, Guido Jenster4, Nicole Ct van Grieken7, Beatriz Carvalho1, Lodewyk Fa Wessels3,8, Connie R Jimenez2, Remond Ja Fijneman1, Gerrit A Meijer1.
Abstract
Consensus molecular subtyping is an RNA expression-based classification system for colorectal cancer (CRC). Genomic alterations accumulate during CRC pathogenesis, including the premalignant adenoma stage, leading to changes in RNA expression. Only a minority of adenomas progress to malignancies, a transition that is associated with specific DNA copy number aberrations or microsatellite instability (MSI). We aimed to investigate whether colorectal adenomas can already be stratified into consensus molecular subtype (CMS) classes, and whether specific CMS classes are related to the presence of specific DNA copy number aberrations associated with progression to malignancy. RNA sequencing was performed on 62 adenomas and 59 CRCs. MSI status was determined with polymerase chain reaction-based methodology. DNA copy number was assessed by low-coverage DNA sequencing (n = 30) or array-comparative genomic hybridisation (n = 32). Adenomas were classified into CMS classes together with CRCs from the study cohort and from The Cancer Genome Atlas (n = 556), by use of the established CMS classifier. As a result, 54 of 62 (87%) adenomas were classified according to the CMS. The CMS3 'metabolic subtype', which was least common among CRCs, was most prevalent among adenomas (n = 45; 73%). One of the two adenomas showing MSI was classified as CMS1 (2%), the 'MSI immune' subtype. Eight adenomas (13%) were classified as the 'canonical' CMS2. No adenomas were classified as the 'mesenchymal' CMS4, consistent with the fact that adenomas lack invasion-associated stroma. The distribution of the CMS classes among adenomas was confirmed in an independent series. CMS3 was enriched with adenomas at low risk of progressing to CRC, whereas relatively more high-risk adenomas were observed in CMS2. We conclude that adenomas can be stratified into the CMS classes. Considering that CMS1 and CMS2 expression signatures may mark adenomas at increased risk of progression, the distribution of the CMS classes among adenomas is consistent with the proportion of adenomas expected to progress to CRC.Entities:
Keywords: adenoma; colon; colorectal cancer; neoplasia; rectum
Mesh:
Substances:
Year: 2018 PMID: 29968252 PMCID: PMC6221003 DOI: 10.1002/path.5129
Source DB: PubMed Journal: J Pathol ISSN: 0022-3417 Impact factor: 7.996
Characteristics of sample Series 1 and Series 2 collected for this study
| Characteristics | Number of samples | |||
|---|---|---|---|---|
| Series 1 | Series 2 | Total | ||
| Lesion | Adenoma | 30 | 32 | 62 |
| Histological type | Tubular | 6 | 13 | 19 |
| Tubulovillous | 20 | 16 | 36 | |
| Villous | 4 | 3 | 7 | |
| Dysplasia | High grade | 10 | 8 | 18 |
| Low grade | 20 | 24 | 44 | |
| Risk of progression | High | 9 | 4 | 13 |
| Low | 17 | 22 | 39 | |
| No information | 2 | 6 | 8 | |
| Microsatellite status | MSS | 28 | 32 | 60 |
| MSI | 2 | 0 | 2 | |
| Lesion | Carcinoma | 30 | 29 | 59 |
| Differentiation grade | Less/Not | 4 | 2 | 6 |
| Well differentiated/ moderately differentiated | 25 | 27 | 52 | |
| No information | 1 | 0 | 1 | |
| Stage | I | 7 | 9 | 16 |
| II | 13 | 10 | 23 | |
| III | 6 | 9 | 15 | |
| IV | 3 | 1 | 4 | |
| I or III | 1 | 0 | 1 | |
| Microsatellite status | MSS | 24 | 23 | 47 |
| MSI | 6 | 6 | 12 | |
MSS, microsatellite‐stable.
Figure 1Overview of the data analysis approach. Both Series 1 and Series 2 were normalised separately to the TCGA CRC dataset via a batch effect removal method 27. After normalisation, all three datasets were merged together. Series 1 and Series 2 form the ‘study dataset’. CMS classification was applied to the merged dataset. The classes were obtained with the CMS random forest classifier, and assigned when the posterior probability of belonging to a CMS class was ≥0.5. Results of the classification were extracted for the CRCs and the adenomas from the study dataset. The pie charts represent the distribution of CMS classes for CRCs (left) and adenomas (right) for the study dataset.
Figure 2Hierarchical clustering based on gene expression profiles of the top 30 most variable genes. (A) Heatmap of all three datasets before batch effect removal. The batches corresponding to the TCGA dataset, Series 1 and Series 2 can be distinguished in the heatmap. (B) Heatmap before batch correction of the Series 1 and Series 2 study datasets only. Within the two batches, one can distinguish clusters enriched with adenomas and clusters enriched with cancers. (C) Heatmap of all three datasets after batch effect removal. Samples from the three experiments do not cluster together. (D) Heatmap of the Series 1 and Series 2 study datasets after batch effect removal. Clusters enriched with adenomas or cancers can still be distinguished, meaning that batch effect correction did not remove the variability between different lesions. The legend corresponds to all of the heatmaps in this figure.
Distribution of the CMS classes in cancers and adenomas from the study dataset and the validation set
| CMS1, | CMS2, | CMS3, | CMS4, | Non‐consensus, | |
|---|---|---|---|---|---|
| Study dataset (Series 1 and Series 2) | |||||
| Cancers | 7 (12) | 15 (25) | 2 (3) | 24 (41) | 11 (19) |
| Adenomas | 1 (2) | 8 (13) | 45 (73) | 0 (0) | 8 (13) |
| Validation set (Series 3) | |||||
| Cancers | 5 (14) | 7 (19) | 1 (3) | 18 (50) | 5 (14) |
| Adenomas | 1 (2) | 5 (11) | 28 (62) | 0 (0) | 11 (24) |
Figure 3Distribution of CMS classes among adenomas at high risk and adenomas at low risk of progressing to cancer. (A) Distribution of CMS classes among 13 high‐risk adenomas. (B) Distribution of CMS classes among 39 low‐risk adenomas. No high‐risk and low‐risk adenomas were classified as CMS1 or CMS4.
Gene sets enriched in CMS2 and CMS3 adenomas
| Gene set | Process category | Size | Members in signal | Normalised enrichment score |
| FDR |
|---|---|---|---|---|---|---|
| Gene sets enriched in CMS2 adenomas in comparison with CMS3 adenomas | ||||||
| G2M checkpoint | Proliferation | 183 | 100 | 2.00 | <0.001 | <0.01 |
| E2F targets | Proliferation | 183 | 105 | 1.77 | <0.001 | 0.01 |
| MYC targets V2 | Proliferation | 57 | 78 | 1.57 | 0.005 | 0.02 |
| Mitotic spindle | Proliferation | 168 | 28 | 1.58 | 0.001 | 0.02 |
| Epithelial–mesenchymal transition | Development | 136 | 51 | 1.74 | <0.001 | 0.01 |
| Myogenesis | Development | 103 | 42 | 1.67 | 0.001 | 0.01 |
| PI3K–AKT–mTOR signalling | Signalling | 85 | 38 | 1.62 | 0.004 | 0.01 |
| Wnt–β‐catenin signalling | Signalling | 33 | 13 | 1.61 | 0.005 | 0.02 |
| TGF‐β signalling | Signalling | 50 | 14 | 1.59 | 0.007 | 0.02 |
| Coagulation | Immune | 85 | 35 | 1.64 | 0.002 | 0.01 |
| Apical junction | Cellular component | 123 | 47 | 1.49 | 0.006 | 0.04 |
| Gene sets enriched in CMS3 adenomas in comparison with CMS2 adenomas | ||||||
| Protein secretion | Pathway | 90 | 35 | –1.78 | <0.001 | 0.03 |
| Glycolysis | Metabolic | 169 | 45 | –1.52 | <0.001 | 0.08 |
| Oxidative phosphorylation | Metabolic | 194 | 85 | –1.39 | <0.001 | 0.13 |
| Fatty acid metabolism | Metabolic | 132 | 39 | –1.35 | 0.017 | 0.13 |
| Haem metabolism | Metabolic | 144 | 33 | –1.27 | 0.020 | 0.15 |
| Oestrogen response late | Signalling | 152 | 32 | –1.30 | 0.020 | 0.15 |
Gene sets were grouped in process categories according to the original hallmark gene set grouping 32. Size indicates number of genes in the gene set; members in signal indicates how many genes from the gene set contributed to the enrichment score. The statistical values, normalised enrichment score, P values and FDR were calculated with GSEA 30. Gene sets enriched in CMS2 adenomas have positive enrichment scores, and gene sets enriched in CMS3 adenomas have negative enrichment scores.