| Literature DB >> 30224644 |
Andrew O Giacomelli1,2, Xiaoping Yang2, Robert E Lintner2, James M McFarland2, Marc Duby2, Jaegil Kim2, Thomas P Howard1,2,3, David Y Takeda1,2,3, Seav Huong Ly1,2, Eejung Kim2, Hugh S Gannon1,2, Brian Hurhula2, Ted Sharpe2, Amy Goodale2, Briana Fritchman2, Scott Steelman2, Francisca Vazquez1,2, Aviad Tsherniak2, Andrew J Aguirre1,2, John G Doench2, Federica Piccioni2, Charles W M Roberts1,3,4, Matthew Meyerson1,2,3, Gad Getz2,3,5,6, Cory M Johannessen2, David E Root2, William C Hahn7,8,9,10.
Abstract
Unlike most tumor suppressor genes, the most common genetic alterations in tumor protein p53 (TP53) are missense mutations1,2. Mutant p53 protein is often abundantly expressed in cancers and specific allelic variants exhibit dominant-negative or gain-of-function activities in experimental models3-8. To gain a systematic view of p53 function, we interrogated loss-of-function screens conducted in hundreds of human cancer cell lines and performed TP53 saturation mutagenesis screens in an isogenic pair of TP53 wild-type and null cell lines. We found that loss or dominant-negative inhibition of wild-type p53 function reliably enhanced cellular fitness. By integrating these data with the Catalog of Somatic Mutations in Cancer (COSMIC) mutational signatures database9,10, we developed a statistical model that describes the TP53 mutational spectrum as a function of the baseline probability of acquiring each mutation and the fitness advantage conferred by attenuation of p53 activity. Collectively, these observations show that widely-acting and tissue-specific mutational processes combine with phenotypic selection to dictate the frequencies of recurrent TP53 mutations.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30224644 PMCID: PMC6168352 DOI: 10.1038/s41588-018-0204-y
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Deletion of endogenous wild-type but not mutant TP53 impacts fitness in human cancer cells
Comparison of enrichment scores for CRISPR-Cas9 reagents targeting TP53 (a) or MDM2 (b) in cell lines whose TP53 statuses were defined using TP53 mutation, copy-number, target gene expression, and nutlin-3 sensitivity data (see Supplementary Table 1). Only cell lines with concordant functional and genetic classifications were included in these analyses. Each point represents the gene-level score for a given cell line, and error bars indicate the mean and standard deviation of each group. Cell lines in the loss-of-function (LOF) category harbor homozygous deletion, frameshift, or nonsense mutation of TP53 and express low levels of p53 protein (see Supplementary Fig. 1c). (****, P < 0.0001, Two-tailed Welch’s t-test) (c) PARIS, a rescaled normalized mutual information (RNMI)-based statistical analysis[52] was used to nominate genes whose enrichment scores were significantly different between the p53 non-functional mutant (LOF, missense, splice-site, and indel, with functional score < 0) and p53 functional wild-type (functional score > 0) cell line classes in genome-scale CRISPR-Cas9[12] and RNAi[11] screens. Reported p53 pathway components that scored as significant (FDR < 0.05) in both analyses are highlighted in red. (d) Infographic depicting differential enrichment scores of reported p53 pathway members and target genes in genome-scale CRISPR-Cas9 (lower left triangle) and RNAi (upper right triangle) screens. Dashed lines indicate transcriptional regulation. Genes scoring as significant in both analyses are outlined in yellow.
Figure 2Comprehensive mutational scanning of TP53
(a) A library comprising 8258 mutant TP53 alleles was introduced into A549 p53WT and p53NULL cells in a pooled format under conditions that favor the integration of a single vector in each cell. Library-infected p53WT cells were treated with nutlin-3, and library-infected p53NULL cells were treated with either nutlin-3 or etoposide. After 12 days, genomic DNA was harvested, PCR-amplified and subjected to next generation sequencing. (b–d) Heat maps of normalized allele enrichment scores (Z-scores) with codon-level average Z-scores plotted at right. (e) Left, The reported domain structure of p53 with residues 175, 248, and 273 highlighted: TAD = transactivation domain, PRD = Proline-rich domain, DBD = DNA-binding domain, ZN = Zinc-binding domain, 4D = tetramerization domain, CTD = C-terminal domain. Right, Total number of missense and nonsense mutations found at each codon in the IARC database[2,32]. (f–h) Density plot of alleles with silent mutations (wild-type alleles), nonsense mutations at codons 44–289 (loss-of-function alleles), missense mutations that are common in cancer, and SNV-generated missense mutations that have never been observed in cancer. Differences among all groups of alleles were significant in each condition (P < 0.0001, Wilcoxon rank sum test).
Figure 3Tissue-of-origin-selective TP53 mutations are linked to specific mutational processes
(a) Each signature in the COSMIC Mutational Signatures database contains 96 mutation probabilities, one for each trinucleotide mutation type[9,10]. To assign a baseline mutation probability to each TP53 allele for each signature, we first determined its trinucleotide mutation type, and then assigned the corresponding value from the database. Depicted here is the assignment of a mutation probability for Arg248Gln under the influence of Signature 1. (b) Fisher’s exact tests were performed to identify TP53 mutations that occur significantly more frequently in specific tumor types in the IARC[2,32] and GENIE[35] databases. The heatmap shows the relative mutation probabilities for each of the indicated TP53 mutations under the influence of each signature, depicted graphically on a white-to-red scale as the signature-specific percentile of all SNV-derived TP53 mutations (n = 2810). The percent of tumors of each tissue-of-origin in which a given signature was found by Alexandrov et al.[10] is depicted graphically on a white-to-black scale.
Figure 4The TP53 mutational spectrum modeled as a function of mutational signatures and phenotypic selection
(a) Generalized linear models were trained to predict the mutation frequency of each TP53 allele in the IARC database[2,32] using mutational signatures from the COSMIC database[9,10], phenotypic selection data from the TP53 MITE library screens, or both. (b) Position-level mutation rates predicted by the combined model are plotted downwards and observed mutation rates in the GENIE validation database are plotted upwards.