| Literature DB >> 33230179 |
Vincent Carbonnier1, Bernard Leroy2, Shai Rosenberg3,4, Thierry Soussi5,6,7.
Abstract
The diagnosis of somatic and germline TP53 mutations in human tumors or in individuals prone to various types of cancer has now reached the clinic. To increase the accuracy of the prediction of TP53 variant pathogenicity, we gathered functional data from three independent large-scale saturation mutagenesis screening studies with experimental data for more than 10,000 TP53 variants performed in different settings (yeast or mammalian) and with different readouts (transcription, growth arrest or apoptosis). Correlation analysis and multidimensional scaling showed excellent agreement between all these variables. Furthermore, we found that some missense mutations localized in TP53 exons led to impaired TP53 splicing as shown by an analysis of the TP53 expression data from the cancer genome atlas. With the increasing availability of genomic, transcriptomic and proteomic data, it is essential to employ both protein and RNA prediction to accurately define variant pathogenicity.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33230179 PMCID: PMC7683535 DOI: 10.1038/s41598-020-74892-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Functional analysis of TP53 variants. (A) TP53 variants from the core domain resulting from SNS, DNS and TNS are functionally impaired. TP53 variants were separated into 3 categories: amino (residues 1 to 99), core (residues 100 to 300) and carboxy (residues 301 to 393). Remaining activity ranging from 0 (no activity) to 1 (full activity) was determined from the normalized data of Giacomelli et al.[12] (see "Methods"). All variants: all types of substitutions; SS: SNS variants; MS: DNS and TNS variants. (B) Analysis of the activity of TP53 variants according to their frequency in the UMD_TP53 Mutation Database. Boxplots display TP53 variant loss of activity from the whole database (orange plot) or from various datasets with TP53 mutants classified into eight categories according to their frequencies in the database (blue plot). CSD data are shown in red. The read-outs are identified on the x-axis with data from the three different datasets, Kato et al. (top), Kotler et al. (middle) and Giacomelli et al. (bottom). The Y axis shows variant-normalized TP53 functionality.
Figure 2Correlation matrix for the three large-scale (12 read-outs) analyses of TP53 activity. Correlation R values are shown on the left part of the panel. Positive correlations are displayed in blue and negative correlations in red. Color intensity and the size of the square are proportional to the correlation coefficients. On the right side of the correlogram, the legend color shows the correlation coefficients and the corresponding colors. All correlations are highly significant (p < 0.001) (see supplementary Fig. 3 for a detailed view of the statistics).
Figure 3Binary correlation analysis between the various read outs of TP53 loss of activity. Overall Pearson R correlation coefficients and p values are shown at the top of each figure. Data for other read-outs are shown in Supplementary Fig. 3.
Figure 4(A) TP53 variants can be divided into two classes according to their loss of activity. The binary comparison between two studies (Kotler and Giac_A) has been split into two panels, with variants found in the CSD in the lower right panel (red) and rare TP53 variants (variants found once in or absent from the database) in the upper right panel (green). Data for other read-outs are shown in Supplementary Fig. 3. (B) Position of the 7 CSD outlier variants in different binary comparison. Two variants, p.A138V and p.V218G (shown by black arrows), display no loss of activity in the yeast assay (top figures) but are inactive using the various mammalian readouts (bottom figures). Other variants associated with splice defects are also displayed in the figure. Data for other read-outs are shown in Supplementary Fig. 5.
Figure 5Multidimensional scaling (MDS) for reduction of the 12 activity measures into bidimensional space. CSD, CSD outliers and no CSD mutations are colored red, green and black respectively.
Alternative loss of function of the 7 CSD TP53 variants.
| TP53 variant1 | Activity2 | Occurrence in UMD3 | Position in the | Potential loss of function | ||
|---|---|---|---|---|---|---|
| Score according to Kato et al.4 | Score according to Kotler et al.5 | Score according to Giacomelli et al.6 | ||||
| p.E224D | 0.91 | 0.65 | 0.56 | 48 | Last Base Exon | Splice variant |
| p.V218G | 0.86 | 0.20 | 0.20 | 35 | Exon | Differential loss of activity in yeast and mammalian cells |
| p.G187S | 0.72 | 0.57 | 0.55 | 34 | Exon | Splice variant |
| p.A138V | 0.67 | 0.13 | 0.35 | 126 | Exon | Thermosensitive; differential loss of activity in yeast and mammalian cells |
| p.S106R | 0.51 | 0.69 | 0.63 | 39 | Exon | RNA destabilization; possible splicing defect |
| p.R181H | 0.44 | 0.61 | 0.50 | 77 | Exon | Position 181 is known to impair TP53 for specific functions7 |
| p.R181C | 0.40 | 0.65 | 0.58 | 103 | Exon | |
1The sequence nomenclature used for TP53 variants in this work is in accordance with the Human Genome Variation Society's guidelines using the NM_000546.5 transcript sequence and the full-length protein NP_000537.3.
2TP53 functional data were scaled in the range of 0 to 1 with 0 corresponding to the lowest activity of TP53 variants.
3Number of tumors expressing each TP53 variant included in the TP53_UMD.
4Data from the transcriptional activity performed in yeast using 8 different p53 response elements (mean of the activities for the 8 readouts) from the work of Kato et al.[9].
5Score from the growth arrest assay in H1299 cells performed by Kotler et al.[11].
6Mean score from the three assays performed by Gacomelli et al.[12] (see "Methods" for a full description of the assays).
7Both in vitro studies and mouse models indicate that these variants are functionally defective[21,32].
Figure 6TP53 RNA expression differs in tumors depending on TP53 mutation status. Box-and-whisker plots show the interquartile ranges (boxes), median values (horizontal lines inside the boxes), and full-range distributions (whisker lines) for TP53 RNA content according to the type of TP53 mutation. RNA expression and TP53 mutational status values for multiple tumor types were extracted from cBioPortal (see "Methods").