| Literature DB >> 28683746 |
Kaja C G Berg1,2, Peter W Eide1,2, Ina A Eilertsen1,2, Bjarne Johannessen1,2,3, Jarle Bruun1,2, Stine A Danielsen1,2,3, Merete Bjørnslett1,2, Leonardo A Meza-Zepeda3,4, Mette Eknæs1,2, Guro E Lind1,2, Ola Myklebost3,4,5, Rolf I Skotheim1,2,3, Anita Sveen1,2,3, Ragnhild A Lothe6,7,8.
Abstract
BACKGROUND: Colorectal cancer (CRC) cell lines are widely used pre-clinical model systems. Comprehensive insights into their molecular characteristics may improve model selection for biomedical studies.Entities:
Keywords: Colorectal cancer cell lines; Consensus molecular subtypes; Copy number aberrations; Gene expression; Genomics; Methylation; Microsatellite instability; Mutations; Protein expression; miRNA
Mesh:
Substances:
Year: 2017 PMID: 28683746 PMCID: PMC5498998 DOI: 10.1186/s12943-017-0691-y
Source DB: PubMed Journal: Mol Cancer ISSN: 1476-4598 Impact factor: 27.401
Fig. 1Overview of the 34 CRC cell lines analyzed and key findings. a The cell lines are grouped according to the gene expression-based CMSs (except Colo320, which has a neuroendocrine origin), and MSI, POLE and CIMP status are indicated. In general, the morphologic appearance of cell lines in CMS1 and CMS4 (for example LoVo and RKO) was mesenchymal, whereas cell lines in CMS2 and CMS3 (for example IS3 and WiDr) appeared more epithelial-like. b The cell lines were analyzed on the DNA, RNA and protein levels as indicated (blue background). Bioinformatic analyses (grey) were performed both on individual data levels and by integration of two or more data levels. Key findings (white) and references to figures and tables with detailed results are given (green). CIMP: CpG island methylator phenotype, CMS: consensus molecular subtypes, CNA: copy number aberrations, MSI/MSS: microsatellite instable/stable, OG: oncogene, TF: transcription factor, TS: tumor suppressor, SNV: single nucleotide variant
Fig. 2DNA aberrations reflect the type of genomic instability. a We investigated the frequencies (vertical axes) of SNVs in each of six categories (indicated in the top panels) grouped according to sequence motif (flanking nucleotides are indicated on the horizontal axes). MSI cell lines (n = 8, excluding DLD1 and HCT15) and the POLE mutated cell line HCC2998 displayed different mutation signatures associated with the respective types of hypermutation. The MSI cell lines DLD-1 and HCT15 had a distinct mutation signature with a combination of deficient mismatch repair and POLD1 mutation. b Overview of detected SNVs/indels in 37 genes included in the Cosmic Cancer Gene Census and that were mutated in at least four MSI cell lines or one MSS cell line among the 27 cell lines analyzed by targeted deep sequencing. Most genes showed clear mutation frequency differences between MSS and MSI/POLE mutated cell lines. c There was an inverse relationship between the CNA load (horizontal axis; percent of basepairs with aberrant copy number) and the SNV/indel load (vertical axis) in the cell lines, reflecting their molecular subtype, as indicated. The neuroendocrine cell line Colo320 (green circle) grouped along with the MSS cell lines, and had few SNVs/indels and a moderate number of CNAs, including gain of 8q and 13q. d MSI/POLE mutated cell lines had a lower frequency of CNAs (vertical axis) along the genome than e MSS cell lines. In each plot, chromosomes are indicated on the horizontal axes and separated by vertical lines (whole and dashed lines for chromosomes and chromosome arms, respectively). Frequent aberrations are highlighted, including gains on 7p, 7q, 8q, 12p, 13q, 20q and losses on 4p, 4q, 17p, 18q and 22q, which are chromosome arms known to be frequently affected by CNAs in primary CRCs. CNA: copy number aberration, MSI/MSS: microsatellite instable/stable, POLE: POLE mutated, SNV: single nucleotide variant
Mutation status in CRC critical genes. Cell lines were examined by Sanger sequencing, targeted sequencing or by both methods
| TP53 | KRAS | BRAF | PIK3CA | PTEN | MSI | CIMP | |
|---|---|---|---|---|---|---|---|
| CaCo2 | p.E204X | wt | wt | wt | wt | MSS | CIMP- |
| CL-11a | p.S215N | p.V14I; p.Q61H | wt | wt | wt | MSS | CIMP+ |
| CL-34a | p.S127P; p.K382fs | wt | p.V600E | wt | wt | MSI | CIMP+ |
| CL-40a | p.R248Q | p.G12D | wt | wt | wt | MSS | CIMP+ |
| Co115 | wt | wt | p.V600E | wt | p.E157fs; p.R233X | MSI | CIMP+ |
| Colo205b | p.Y107fs; p.Y103fs | wt | p.V600E | wt | wt | MSS | CIMP+ |
| Colo320 | p.R248W | wt | wt | wt | wt | MSS | CIMP- |
| Colo678a | wt | p.G12D | wt | wt | wt | MSS | CIMP+ |
| DLD-1 | p.S241F | p.G13D | wt | p.E545K; p.D549N | wt | MSI | CIMP+ |
| EB | wt | p.G12D | wt | p.E545K | wt | MSS | CIMP+ |
| FRI | p.C277F | p.G13D | wt | p.E545K | wt | MSS | CIMP- |
| HCC2998b | p.R213X | p.A146T | wt | wt | p.Y46C; p.R130Q; p.F341V | MSS | CIMP- |
| HCT116 | wt | p.G13D | wt | p.H1047R | wt | MSI | CIMP+ |
| HCT15 | p.S241F | p.G13D | wt | p.E545K; p.D549N | wt | MSI | CIMP+ |
| HT29 | p.R273H | wt | p.V600E; p.T119Sc | wt | wt | MSS | CIMP+ |
| IS1 | p.Y163H | p.G12D | wt | wt | wt | MSS | CIMP- |
| IS3 | p.Y163H | p.G12D | wt | wt | wt | MSS | CIMP- |
| KM12b | p.P72fs; p.H179R | wt | p.P403fs | wt | p.G129X; p.K267fs | MSI | CIMP+ |
| LoVo | wt | p.G13D; p.V14A | wt | wt | wt | MSI | CIMP- |
| LS1034 | p.G245S | p.A146T | wt | wt | wt | MSS | CIMP- |
| LS174T | wt | p.G12D | p.D211Gc | p.H1047R | wt | MSI | CIMP- |
| NCI-H508 | p.R273H | wt | p.G596R | p.E545K | wt | MSS | CIMP- |
| RKO | wt | wt | p.V600E | p.H1047R | wt | MSI | CIMP+ |
| SW1116 | p.A159D | p.G12A | wt | wt | wt | MSS | CIMP- |
| SW1463a | p.R248Q | p.G12C | wt | wt | wt | MSS | CIMP- |
| SW403a | p.E51X | p.G12V | wt | wt | wt | MSS | CIMP- |
| SW48 | wt | wt | p.R347Xc | p.G914Rc | wt | MSI | CIMP+ |
| SW480 | p.R273H; p.P309S | p.G12V | wt | wt | wt | MSS | CIMP- |
| SW620 | p.R273H; p.P309S | p.G12V | wt | wt | wt | MSS | CIMP- |
| SW837a | p.R248W | p.G12C | wt | wt | wt | MSS | CIMP+ |
| SW948 | p.G117fs | p.Q61L | wt | p.E542K | wt | MSS | CIMP- |
| TC71 | p.C176Y; p.R213X | p.G12D | wt | p.R88Qc | p.R233X | MSI | CIMP- |
| V9P | p.G245D | wt | wt | wt | wt | MSS | CIMP- |
| WiDr | p.R273H | wt | p.V600E; p.T119Sc | wt | wt | MSS | CIMP+ |
All variants found with targeted sequencing and not observed in Sanger sequencing data were found in regions outside codons targeted by Sanger. HT29/WiDr had two variants each in BRAF, where one was verified by Sanger, and the other was outside of codons assessed. Only non-synonymous mutations were reported from targeted sequencing data
Mutation data were available from Sanger sequencing only
Mutation data were available from targeted sequencing only
Mutations found with targeted sequencing only, but outside of regions assessed by Sanger sequencing
Fig. 3Gene expression based classification of CRC cell lines revealed a separation between colon-like and undifferentiated cell lines associated with the consensus molecular subtypes (CMS). a PCA of cell line mRNA expression data (plotted as sample-wise PC1 versus PC2) showed that the cell lines had a bimodal density distribution along PC1 (bottom plot), indicating two distinct subgroups largely separating CMS2/3 from CMS1/4. Each point represents one cell line, and is colored according to the CMS class and with point type indicating MSI-status. Dashed vertical line (red) indicates the least frequent value between the two density modes of PC1, and was used as a threshold to separate the cell lines into the two subgroups. b PC1 (horizontal axis) was strongly correlated with the sample-wise enrichment score for a set of gastro-intestinal tissue enhanced genes (vertical axis), and cell lines with high enrichment scores, left of the red dashed line, were termed “colon-like” and the remaining “undifferentiated”. c Gene set enrichment analyses comparing colon-like and undifferentiated cell lines showed that colon-like cell lines had higher expression of genes upregulated by HNF4A and lower expression of genes related to colorectal cancer stemness. Undifferentiated cell lines had higher expression of genes related to epithelial to mesenchymal transition and genes upregulated by TGFβ. The plot includes the top 15 gene sets tested (ranked by p-value) and the -log10 p-value is plotted on the horizontal axis. d Top 5 differentially expressed transcription factors and kinases (mRNA level), miRNAs and proteins between colon-like and undifferentiated cell lines. mRNAs and miRNAs are ranked by p-value while proteins are ranked by absolute log2 fold-change. The log2 fold-changes (log2FC) between the sample groups are indicated. e Classification of the individual cell lines according to the colon-like and undifferentiated subgroups. CRC: colorectal cancer, CMS: consensus molecular subtypes, log2FC: log2 fold-change, MSI/MSS: microsatellite instable/stable, PCA: principal component analysis
Fig. 4CNAs and SNVs/indels in cancer-critical genes. Among genes in the Cancer Gene Census (n = 83 genes included in the targeted sequencing panel, ranked vertically in alphabetical order), simultaneous mutations and CNAs in individual cell lines (grouped horizontally according to genomic phenotypes as indicated) were detected in CRC relevant oncogenes, including KRAS and EGFR, and tumor suppressor genes, including TP53 and APC. The cell line Colo320, which has a neuroendocrine origin, is marked by an asterisk. CNA: copy number aberration, CRC: colorectal cancer, MSI/MSS: microsatellite instable/stable, POLE: POLE mutated, SNV: single nucleotide variant
Fig. 5mRNA and protein expression levels are highly concordant among cell lines. a The density distribution (horizontal axis) of cross-cell line Pearson’s correlations (vertical axis) for expression of matched genes (microarray data) and proteins (Reverse Phase Protein Array data) (n = 194) shows an overall strong correlation. The horizontal line indicates the median correlation coefficient for all gene-protein pairs. b Differential expression analyses between colon-like and undifferentiated cell lines showed strong correspondence at the mRNA and protein level (plotted as the log2 fold-changes between the two groups of cell lines for matched protein on the vertical axis versus mRNA on the horizontal axis). The plot includes gene-protein pairs with adjusted p-value <0.1 from differential expression analysis in either mRNA or protein data. Gene-protein pairs with absolute log2 fold-change >0.5 (mRNA) between colon-like and undifferentiated cell lines are indicated by names and the rest by circles. Pearson correlation analysis (r2) indicated that 43% of the variance in the log2 fold-change at the protein level could be explained by mRNA-level log2 fold-change
Fig. 6Characteristics of individual cell lines at multiple molecular levels. The cell lines are ranked alphabetically within the colon-like (n = 18; top) and undifferentiated (n = 15; bottom) subgroups. The neuroendocrine Colo320 is found below the undifferentiated cell lines (marked by a dark grey box). a The heatmap shows standardized single sample gene set expression enrichment scores for the eight selected pathways indicated at the bottom (indicates how many standard deviations the score is above or below the mean). Red indicates relative upregulation and blue indicates relative downregulation among cell lines. b The table indicates selected molecular events characteristic of each cell line. Amp: DNA amplification, mut: “mutation” (single nucleotide variant or insertion/deletion), m: mRNA level, p: protein level, wt: wild type