| Literature DB >> 35251978 |
Thierry Soussi1,2, Panagiotis Baliakas1.
Abstract
Locus-specific databases are invaluable tools for both basic and clinical research. The extensive information they contain is gathered from the literature and manually curated by experts. Cancer genome sequencing projects generate an immense amount of data, which are stored directly in large repositories (cancer genome databases). The presence of a TP53 defect (17p deletion and/or TP53 mutations) is an independent prognostic factor in chronic lymphocytic leukemia (CLL) and TP53 status analysis has been adopted in routine clinical practice. For that reason, TP53 mutation databases have become essential for the validation of the plethora of TP53 variants detected in tumor samples. TP53 profiles in CLL are characterized by a great number of subclonal TP53 mutations with low variant allelic frequencies and the presence of multiple minor subclones harboring different TP53 mutations. In this review, we describe the various characteristics of the multiple levels of heterogeneity of TP53 variants in CLL through the analysis of TP53 mutation databases and the utility of their diagnosis in the clinic.Entities:
Keywords: TP53 mutation; chronic lymphocytic leukemia; genetic analysis model; mutation database; variant classification guidelines
Year: 2022 PMID: 35251978 PMCID: PMC8890000 DOI: 10.3389/fonc.2022.808886
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1The locus-specific database UMD_TP53: a central hub for multifactorial analysis. 1: TP53 variants and patient information are collected and stored in a relational database specifically developed for the storage and the analysis of genetic variants. 2: Exposome analysis: influence of the external and internal environment on the landscape of mutational events to identify the links between exposure to various types of carcinogens, specific mutational events in the TP53 gene and the development of specific cancers. 3: More than 7,000 different TP53 variants have been discovered in various types of cancer with heterogeneous LOF and GOF. 4: Multiple bioinformatics tools, including machine learning, have been developed to predict and classify TP53 variants. 5: Genome-based prognostic biomarkers can be used for several cancer types for potential incorporation into clinical prognostic staging systems or practice guidelines such as TP53 and CLL. 6: Analysis of TP53 variants points to the various functional domains of the protein essential for tumor suppression. 7: Functional analysis has led to the identification of the multiple pathways regulated by TP53. 8: Small molecules have been developed that specifically target missense TP53 variants and restore p53 transcriptional activity, thereby enabling tumor regression. Although this figure describes the TP53 database, the various aspects can be applied to other genes as well.
TP53 mutation databases.
| UMD1 | IARC2 | LOVD3 | COSMIC4 | TCGA5 | ICGC6 | MSKSCC7 | GENIE | |
|---|---|---|---|---|---|---|---|---|
| LSDB | LSDB | LSDB | CGD | CGD | CGD | CGD | CGD | |
|
| 2021R1 | R20, July 2019 |
| v94 | NA | v28 | V10 | V10 |
|
| 1991 | 1991 | 2013 | 2004 | 2008 | 2013 | 2016 | 2016 |
|
| 2021 | 2019 | Jun-21 | May-21 | Jun-21 | Mar-21 | Jun-21 | Jun-21 |
|
| 170,428 | 29,891 | 676 | 47,788 | 4,250 | 6557 | 3,249 | 4,813 |
|
| 8,046 | 4,526 | 400 | 5,705 | 1,961 | 1031 | 11,30 | |
|
| Yes | Yes | No | Yes | No | No | No | No |
|
| 6,704 | 2,273 | 6 | 4,129 | 32 studies | 86 projects | NR | NR |
|
| Yes | Yes | No | Yes | Yes | Yes | No | Yes* |
|
| Yes | No | No | No | No | No | No | No |
|
| No | Yes | No | Partial | Yes | Yes | Yes | Yes |
|
| Yes | Unknown | Unknown | Unknown | NR | NR | NR | NR |
|
| No | No | No | Yes | NR | NR | NR | NR |
|
| Yes | Partial | No | Partial | Partial | Partial | Partial | Partial |
|
| Yes | Yes | No | No | No | No | No | No |
|
| Yes | Yes | No | Yes | Yes | Yes | Yes | Yes |
|
| Yes | No | No | No | No | No | No | No |
|
| Yes | Yes | unknown | Yes | Yes | Yes | Yes | Yes |
|
| Yes | Yes | Yes | Yes | No | No | No | No |
|
| No | Yes | No | No | No | No | No | No |
|
| Yes | Yes | Yes | Yes | Yes | No* | Yes | Yes |
|
| No | No | Yes | No | No | No | No | No |
|
| Alive | on hold | Unknown | Alive | Alive | Alive | Alive | Alive |
|
| 179 | 31 | 0 | 412 | 0 | 0 | 6 CLL cases | 235 CLL cases |
|
| 4,698 | 187 | 0 | 40 | 0 | 0 | 0 | 13 |
1 http://p53.fr/tp53-database/mutation-database.
2 https://p53.iarc.fr/.
3 https://databases.lovd.nl/shared/genes/TP53; LOVD database includes mostly non-pathogenic SNPs reported in population studies.
4 https://cancer.sanger.ac.uk/cosmic.
5Only the 32 PAN cancer studies (10,967 samples) are included here.
6 https://www.cbioportal.org/.
7 https://www.synapse.org/#!Synapse:syn7222066/wiki/405659; MSKSCC data were extracted from GENIE V10.0.
8All GENIE data except MSKSCC study.
9LOVD database includes mostly non-pathogenic SNPs reported in population studies.
10Manuscript known to includes spurious data are flagged.
11Multiple publications report genetic information for the same patient.
*Only via https://genie.cbioportal.org/.
Figure 3The UMD_CLL database. (A) TP53 mutations from CLL patients included in UMD_TP53 have been manually curated to correct for study duplication. For patients analyzed via Sanger in the nineties and via NGS more recently, only the more recent data were kept in the database as the sensitivity of NGS uncovered less frequent variants. (B) The UMD_CLL database includes three independent sets of functional data used to assess the loss of function of more than 10,000 TP53 variants: A, B and C, data from Giacomelli et al. in mammalian cells; RFS, data from Kotler et al. in mammalian cells; K, data from Kato et al. in yeast cells. Correlation analysis and multidimensional scaling showed excellent agreement between these three sets of data (19). Each dataset has been used to compare the TP53 variants from UMD_CLL (red) to benign TP53 SNPs (green). (C) The landscape of TP53 variants in CLL is similar to that of other types of cancer, with 78% of tumors expressing a mutant TP53 (missense and in-frame variants) and 22% null variants (splice, nonsense and frameshift mutations); (D) Analysis of the distribution of TP53 variants in TP53 protein from CLL patients showed several unusual features, such as a frameshift mutation in codon 209. See text for more details. (E) At least 25% of CLL patients carry at least two pathogenic TP53 variants, and up to 13% carry more than four. This situation is shared only with myelodysplastic syndrome, where up to 20% of patients show two TP53 variants. As half of the CLL data in UMD_TP53 originated from Sanger analyses, it is likely that CLL intratumor heterogeneity is underestimated. (F) All TP53 variants from UMD_TP53 have been classified according to ACMG criteria. For this purpose, all newly discovered, rare, benign SNPs misidentified as pathogenic mutations have been removed from the database.
Figure 2Distribution of benign missense TP53 SNPs in the p53 protein. SNPs specific for an ethnic population are indicated by colored dots.
Figure 4TP53 status in CLL patients, a snapshot. The top panel displays a schematic view of the tumor with the two TP53 alleles. The middle panel shows cytogenetic analysis performed by FISH (left) or by SNP arrays (right). The lower panel displays an example of the read alignments from NGS. 1: No TP53 mutation: In monoclonal B-cell lymphocytosis, TP53 mutation and 17p deletion are very rare, leading to negative results for both FISH and genetic analysis. 2: TP53 mutation without LOH: In early stages of CLL, the frequency of TP53 mutation is low (less than 10%) with many cases showing no LOH. Sensitive sequencing analysis with NGS is able to identify low VAF TP53 variants (variant M1 in the lower panel). 3: TP53 mutation with LOH: In late-stage or relapsing disease, TP53 mutations associated with 17p deletion can be found in 30 to 50% of CLL patients. In the majority of cases, VAF is greater than 50% due to the loss of the second allele. This situation is commonly seen in CLL. 4: Multiple TP53 and LOH: in both early and late-stage disease, FASAY (functional analysis of separated alleles of p53 on yeast) or SMRT (single-molecule, real-time sequencing) has demonstrated a high level of intratumoral heterogeneity in CLL with the presence of multiple independent subclones expressing different pathogenic TP53 variants (M1, M2 and M3 in the lower panel). Although 17p deletion is often observed in these patients, it is difficult to determine if subclones expressing different TP53 variants are associated with it, and even more so if the VAF of the variant is low. 5: Copy neutral LOH: Following the initial mutational inactivation of one allele, the remaining wild-type allele is deleted concurrently with the duplication of the mutated allele, leading to copy neutral LOH (cnLOH). Detecting cnLOH is difficult and thus the frequency of the event is currently unknown. Without SNP array analysis and if the VAF of the variant is lower than 50%, this situation can be misidentified as a tumor without LOH. Tumors with VAF greater than 50% without obvious 17p deletion should be checked for cnLOH. 6: Bi-allelic mutations: Inactivation of the TP53 gene via different mutations in the two alleles is possible but difficult to distinguish from intratumoral heterogeneity. Although this situation is often described as plausible in many reviews, it has never been formally identified, as only single-cell sequencing would be able to validate bi-allelic TP53 inactivation.