| Literature DB >> 24670820 |
Magali Olivier1, Annette Weninger2, Maude Ardin1, Hana Huskova1, Xavier Castells1, Maxime P Vallée3, James McKay3, Tatiana Nedelko2, Karl-Rudolf Muehlbauer2, Hiroyuki Marusawa4, John Alexander5, Lee Hazelwood5, Graham Byrnes6, Monica Hollstein7, Jiri Zavadil1.
Abstract
Experimental models that recapitulate mutational landscapes of human cancers are needed to decipher the rapidly expanding data on human somatic mutations. We demonstrate that mutation patterns in immortalised cell lines derived from primary murine embryonic fibroblasts (MEFs) exposed in vitro to carcinogens recapitulate key features of mutational signatures observed in human cancers. In experiments with several cancer-causing agents we obtained high genome-wide concordance between human tumour mutation data and in vitro data with respect to predominant substitution types, strand bias and sequence context. Moreover, we found signature mutations in well-studied human cancer driver genes. To explore endogenous mutagenesis, we used MEFs ectopically expressing activation-induced cytidine deaminase (AID) and observed an excess of AID signature mutations in immortalised cell lines compared to their non-transgenic counterparts. MEF immortalisation is thus a simple and powerful strategy for modelling cancer mutation landscapes that facilitates the interpretation of human tumour genome-wide sequencing data.Entities:
Mesh:
Year: 2014 PMID: 24670820 PMCID: PMC5259794 DOI: 10.1038/srep04482
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Mutation patterns derived from exome data obtained from MEF immortalised cell lines.
Mutation type distributions (a–f) and sequence context (g–l) of single base substitutions. For each treatment condition, data are shown for two independent immortalised cell lines and for a set of human tumours related to the tested condition. In (a–f), the percentage of each substitution type is shown with the total number of mutations indicated in parentheses. In (g–l), heat maps of mutation sequence context are shown. The percentage of each substitution type within a triplet sequence context is colour-coded according to the percent values. Highly abundant mutations are represented in red and low abundance mutations are in yellow. (a,g) Aristolochic-acid treatment (two left panels) and upper urinary tract human tumours (right panel) from patients exposed to AA. (b,h) Benzo(a)pyrene treatment (two left panels) and lung adenocarcinomas (right panel) from heavy smokers. (c,i) N-methyl-N'-nitro-N-nitrosoguanidine treatment (two left panels) and human recurrent glioblastoma treated with temozolomide (right panel). (d,j) UVC treatment (two left panels) and human skin squamous cell carcinomas (right panel) (COSMIC v65). (e,k) AID transgene (two panels). (f,l) Data from four independent cell lines obtained by spontaneous immortalisation of the Hupki MEF primary cells (Spont, no treatment).
Significance of the mutation strand bias for all mutation types in each experimental condition (ratio of the number of mutation on the non-transcribed to transcribed strand and FDR q-values for significance)
| SBS type | Condition | AA | BaP | MNNG | UVC | HxAID-Tg | Spont |
|---|---|---|---|---|---|---|---|
| Ratio | 1.0 | 1.1 | 1.1 | 1.0 | 1.0 | 1.4 | |
| FDR-q | 1.000 | 0.881 | 1.000 | 1.000 | 1.000 | 0.277 | |
| Significance | - | - | - | - | - | - | |
| Ratio | 1.3 | 1.0 | 1.0 | 1.0 | 0.8 | 0.9 | |
| FDR-q | 0.591 | 1.000 | 1.000 | 1.000 | 0.805 | 1.000 | |
| Significance | - | - | - | - | - | - | |
| Ratio | 2.1 | 1.5 | 0.5 | 1.1 | 1.1 | 1.6 | |
| FDR-q | <10–18 | 0.038 | 0.277 | 1.000 | 1.000 | 0.147 | |
| Significance | - | - | - | - | |||
| Ratio | 0.9 | 0.6 | 0.7 | 0.8 | 1.0 | 1.0 | |
| FDR-q | 1.000 | <10–9 | 0.707 | 0.796 | 1.000 | 1.000 | |
| Significance | - | - | - | - | - | ||
| Ratio | 1.0 | 0.9 | 0.7 | 0.8 | 1.1 | 1.0 | |
| FDR-q | 1.000 | 0.805 | 0.796 | 0.796 | 0.948 | 1.000 | |
| Significance | - | - | - | - | - | - | |
| Ratio | 2.2 | 1.1 | 1.0 | 1.3 | 1.0 | 1.1 | |
| FDR-q | <10–3 | 0.805 | 0.796 | 0.026 | 1.000 | 0.796 | |
| Significance | - | - | - | - |
Significance: * q < 0.05; ** q < 0.01; *** q < 0.001.
Figure 2Analysis of mutation signatures derived from exome data obtained from MEF immortalised cell lines.
(a) Principal component analysis (PCA) of WES data using mutation signatures. PCA was computed using as input the frequency matrices of sequence context mutations (96 variables) from cell lines immortalised following exposure of primary Hupki MEFs to a carcinogen (AA, BaP, MNNG or UVC), from Hupki MEFs carrying the AID transgene (HxAID-Tg) or from Hupki MEF-derived cell lines that immortalised spontaneously (Spont). Each sample is plotted considering the value of the first and second principal components (PC1 and PC2). The percentage of variance explained by each component is indicated within brackets in each axis. A 95% confidence ellipsis is drawn for each experimental condition and the empty squares indicate the respective centre of gravity. Cells and samples are represented by round and squared solid symbols, respectively. (b) Same as in (a) but with the human tumour datasets (same as shown in Figure 1) added to the input and labelled by arrows. HxAID-Tg samples are omitted in (b) as no corresponding relevant tumour data were identified. AAN_UTUC, aristolochic acid nephropathy-related upper urinary tract urothelial carcinoma; GBM_TZM, glioblastoma after temozolomide treatment; Lung_Ca, lung carcinoma; Skin_SCC, skin squamous cell carcinoma. (c) Graphical representation of the similarity distance of each of the six MEF signatures (front-back axis) to each of the 27 human cancer signatures2 (horizontal axis, 1A through U2). The vertical axis measures the similarity of signatures between the two systems, expressed as negative log(tan(angle), see Methods. Negative values below the x-z plane correspond to angles >45° and represent dissimilarity, and are thus not shown.