Literature DB >> 35251978

Landscape of TP53 Alterations in Chronic Lymphocytic Leukemia via Data Mining Mutation Databases.

Thierry Soussi1,2, Panagiotis Baliakas1.   

Abstract

Locus-specific databases are invaluable tools for both basic and clinical research. The extensive information they contain is gathered from the literature and manually curated by experts. Cancer genome sequencing projects generate an immense amount of data, which are stored directly in large repositories (cancer genome databases). The presence of a TP53 defect (17p deletion and/or TP53 mutations) is an independent prognostic factor in chronic lymphocytic leukemia (CLL) and TP53 status analysis has been adopted in routine clinical practice. For that reason, TP53 mutation databases have become essential for the validation of the plethora of TP53 variants detected in tumor samples. TP53 profiles in CLL are characterized by a great number of subclonal TP53 mutations with low variant allelic frequencies and the presence of multiple minor subclones harboring different TP53 mutations. In this review, we describe the various characteristics of the multiple levels of heterogeneity of TP53 variants in CLL through the analysis of TP53 mutation databases and the utility of their diagnosis in the clinic.
Copyright © 2022 Soussi and Baliakas.

Entities:  

Keywords:  TP53 mutation; chronic lymphocytic leukemia; genetic analysis model; mutation database; variant classification guidelines

Year:  2022        PMID: 35251978      PMCID: PMC8890000          DOI: 10.3389/fonc.2022.808886

Source DB:  PubMed          Journal:  Front Oncol        ISSN: 2234-943X            Impact factor:   6.244


Introduction

In 1956, Ingram used protein sequencing to provide the first demonstration of a severe disease (human sickle-cell anemia in that work) resulting from a single amino acid substitution (1). Since then, it has been largely demonstrated that gene mutations are the basis for most genetic diseases. The development of DNA sequencing and molecular cloning technologies in the late 1970s contributed greatly to the identification of genes involved in both monogenic and polygenic disorders, including complex diseases like cancer (2). The alterations occurring in those genes are numerous and variable in nature, ranging from point mutations to large deletions or translocations. Moreover, the task of reporting, storing, classifying and analyzing them has been a major challenge (3). To provide a pertinent response to this latter, locus-specific databases (LSDBs) have been developed ( ). Although intended for single genes, LSDBs do offer great accuracy as they are curated manually by experts in the field (4, 5). They also provide information that can be used for large-scale analyses and often include structural, functional or evolutionary data (6). For constitutional mutations associated with a genetic syndrome, several LSDBs also include phenotypic data useful for the study of genotype-phenotype correlation (7).
Figure 1

The locus-specific database UMD_TP53: a central hub for multifactorial analysis. 1: TP53 variants and patient information are collected and stored in a relational database specifically developed for the storage and the analysis of genetic variants. 2: Exposome analysis: influence of the external and internal environment on the landscape of mutational events to identify the links between exposure to various types of carcinogens, specific mutational events in the TP53 gene and the development of specific cancers. 3: More than 7,000 different TP53 variants have been discovered in various types of cancer with heterogeneous LOF and GOF. 4: Multiple bioinformatics tools, including machine learning, have been developed to predict and classify TP53 variants. 5: Genome-based prognostic biomarkers can be used for several cancer types for potential incorporation into clinical prognostic staging systems or practice guidelines such as TP53 and CLL. 6: Analysis of TP53 variants points to the various functional domains of the protein essential for tumor suppression. 7: Functional analysis has led to the identification of the multiple pathways regulated by TP53. 8: Small molecules have been developed that specifically target missense TP53 variants and restore p53 transcriptional activity, thereby enabling tumor regression. Although this figure describes the TP53 database, the various aspects can be applied to other genes as well.

The locus-specific database UMD_TP53: a central hub for multifactorial analysis. 1: TP53 variants and patient information are collected and stored in a relational database specifically developed for the storage and the analysis of genetic variants. 2: Exposome analysis: influence of the external and internal environment on the landscape of mutational events to identify the links between exposure to various types of carcinogens, specific mutational events in the TP53 gene and the development of specific cancers. 3: More than 7,000 different TP53 variants have been discovered in various types of cancer with heterogeneous LOF and GOF. 4: Multiple bioinformatics tools, including machine learning, have been developed to predict and classify TP53 variants. 5: Genome-based prognostic biomarkers can be used for several cancer types for potential incorporation into clinical prognostic staging systems or practice guidelines such as TP53 and CLL. 6: Analysis of TP53 variants points to the various functional domains of the protein essential for tumor suppression. 7: Functional analysis has led to the identification of the multiple pathways regulated by TP53. 8: Small molecules have been developed that specifically target missense TP53 variants and restore p53 transcriptional activity, thereby enabling tumor regression. Although this figure describes the TP53 database, the various aspects can be applied to other genes as well. Genomic studies of tumor samples in the pre-genomic era were focused either on a small number of genes analyzed in large patient cohorts, or on a more significant number of genes but in only a few tumors. Indeed, large-scale analyses combining a multitude of genes and tumors represented a Herculean and costly task. The development of high-throughput methodologies capable of sequencing an entire genome in only a few days (next generation sequencing, NGS) has radically changed the entire field of cancer biology. In the present post-genomic era, whole genome sequencing in a multitude of tumors can be performed in a matter of days. The International Cancer Genome Consortium (ICGC, http://dcc.icgc.org/), the Cancer Genome Atlas Project (TCGA, http://cancergenome.nih.gov/) and the Sanger Institute (http://www.sanger.ac.uk/) have undertaken large-scale cancer genome analyses in different types and subtypes of cancer. That work has led to the creation of large data repositories (cancer genome databases, CGDs) freely available to the entire scientific community (8–10). Both LSDBs and CGDs can be considered as central hubs linking clinical and basic research ( ). They all make important contributions to our knowledge of the intricate pathways regulating cell fate, and our ability to identify new clinical biomarkers and develop novel therapeutic molecules. TP53 mutation databases are the perfect example of the successful use of these compilations of cancer associated alterations. Indeed, the TP53 suppressor gene is the most frequently mutated gene in human cancer and analyses of these alterations have fueled basic and clinical research, leading in turn to a number of novel therapeutics currently in phase III trials (11).

TP53 Databases and Repositories

Although multiple TP53 LSDBs have been created, only two, UMD_TP53 (Universal Mutation_Database, developed by the present team) and IARC, count 30 years of TP53 mutation analyses in various types of cancer ( ) (12, 13). Both have been regularly updated with both TP53 variants and new tools to classify them. The IARC database was updated for the last time in 2019 and is currently awaiting transfer to a new host. The next update to the UMD_TP53 will be performed in March 2022. It will bring a new innovative system to classify TP53 variant pathogenicity and a new version of Seshat to analyze variants (14).
Table 1

TP53 mutation databases.

UMD1 IARC2 LOVD3 COSMIC4 TCGA5 ICGC6 MSKSCC7 GENIE 8
LSDBLSDBLSDBCGDCGDCGDCGDCGD
Version 2021R1R20, July 2019 TP53:210617v94NAv28V10V10
Creation date 19911991201320042008201320162016
Last update 20212019Jun-21May-21Jun-21Mar-21Jun-21Jun-21
Number of entries 170,42829,891676 9 47,7884,25065573,2494,813
Unique variant 8,0464,5264005,7051,961103111,30
Cell lines data YesYesNoYesNoNoNoNo
Curated publications 6,7042,27364,12932 studies86 projectsNRNR
Online search YesYesNoYesYesYesNoYes*
Publication warning 10 YesNoNoNoNoNoNoNo
Sex/Age/Ethnicity NoYesNoPartialYesYesYesYes
Curation for duplicate publications 11 YesUnknownUnknownUnknownNRNRNRNR
Sample duplications NoNoNoYesNRNRNRNR
SNP curation YesPartialNoPartialPartialPartialPartialPartial
Availability of functional data YesYesNoNoNoNoNoNo
Availability of predictive data YesYesNoYesYesYesYesYes
ACMG criteria YesNoNoNoNoNoNoNo
Data accuracy YesYesunknownYesYesYesYesYes
Germline mutation YesYesYesYesNoNoNoNo
Familial data NoYesNoNoNoNoNoNo
Availability for Download YesYesYesYesYesNo*YesYes
Submission for analysis NoNoYesNoNoNoNoNo
Current status Aliveon holdUnknownAliveAliveAliveAliveAlive
CLL publications/cases 179310412006 CLL cases235 CLL cases
Number of TP53 variants in CLL 4,69818704000013

1 http://p53.fr/tp53-database/mutation-database.

2 https://p53.iarc.fr/.

3 https://databases.lovd.nl/shared/genes/TP53; LOVD database includes mostly non-pathogenic SNPs reported in population studies.

4 https://cancer.sanger.ac.uk/cosmic.

5Only the 32 PAN cancer studies (10,967 samples) are included here.

6 https://www.cbioportal.org/.

7 https://www.synapse.org/#!Synapse:syn7222066/wiki/405659; MSKSCC data were extracted from GENIE V10.0.

8All GENIE data except MSKSCC study.

9LOVD database includes mostly non-pathogenic SNPs reported in population studies.

10Manuscript known to includes spurious data are flagged.

11Multiple publications report genetic information for the same patient.

*Only via https://genie.cbioportal.org/.

TP53 mutation databases. 1 http://p53.fr/tp53-database/mutation-database. 2 https://p53.iarc.fr/. 3 https://databases.lovd.nl/shared/genes/TP53; LOVD database includes mostly non-pathogenic SNPs reported in population studies. 4 https://cancer.sanger.ac.uk/cosmic. 5Only the 32 PAN cancer studies (10,967 samples) are included here. 6 https://www.cbioportal.org/. 7 https://www.synapse.org/#!Synapse:syn7222066/wiki/405659; MSKSCC data were extracted from GENIE V10.0. 8All GENIE data except MSKSCC study. 9LOVD database includes mostly non-pathogenic SNPs reported in population studies. 10Manuscript known to includes spurious data are flagged. 11Multiple publications report genetic information for the same patient. *Only via https://genie.cbioportal.org/. The number of CLL-related TP53 mutations in the various databases is quite low except in UMD_TP53 ( ). Because of the clinical importance of TP53 mutations in CLL, a curated subset for that pathology, called UMD_CLL, has been added to the UMD_TP53 database ( ). The latest version of UMD_CLL includes 4,698 mutations, corresponding to 3,419 samples, as patients with multiple TP53 mutations are frequent in this disease. The characteristics of these variants are discussed in the following sections of this review.
Figure 3

The UMD_CLL database. (A) TP53 mutations from CLL patients included in UMD_TP53 have been manually curated to correct for study duplication. For patients analyzed via Sanger in the nineties and via NGS more recently, only the more recent data were kept in the database as the sensitivity of NGS uncovered less frequent variants. (B) The UMD_CLL database includes three independent sets of functional data used to assess the loss of function of more than 10,000 TP53 variants: A, B and C, data from Giacomelli et al. in mammalian cells; RFS, data from Kotler et al. in mammalian cells; K, data from Kato et al. in yeast cells. Correlation analysis and multidimensional scaling showed excellent agreement between these three sets of data (19). Each dataset has been used to compare the TP53 variants from UMD_CLL (red) to benign TP53 SNPs (green). (C) The landscape of TP53 variants in CLL is similar to that of other types of cancer, with 78% of tumors expressing a mutant TP53 (missense and in-frame variants) and 22% null variants (splice, nonsense and frameshift mutations); (D) Analysis of the distribution of TP53 variants in TP53 protein from CLL patients showed several unusual features, such as a frameshift mutation in codon 209. See text for more details. (E) At least 25% of CLL patients carry at least two pathogenic TP53 variants, and up to 13% carry more than four. This situation is shared only with myelodysplastic syndrome, where up to 20% of patients show two TP53 variants. As half of the CLL data in UMD_TP53 originated from Sanger analyses, it is likely that CLL intratumor heterogeneity is underestimated. (F) All TP53 variants from UMD_TP53 have been classified according to ACMG criteria. For this purpose, all newly discovered, rare, benign SNPs misidentified as pathogenic mutations have been removed from the database.

As early as 2005, in collaboration with C. Ishioka’s group, UMD_TP53 was updated with TP53 functional data to improve the curation of the database and develop the first tools to assess TP53 variant loss of function (LOF) (15, 16). These tools have shown tremendous value for distinguishing true oncogenic TP53 variants from passenger or artifactual mutations. Data from two recent large-scale studies analyzing TP53 LOF via multiple assays in mammalian cells have also been included in UMD_TP53 to refine TP53 variant classification (17–19). Version 1 of Seshat was released in 2018. Seshat is a web service for annotating TP53 information derived from sequencing data. It allows the use of mutation annotation format (MAF) or variant call format (VCF) files. Seshat performs accurate variant annotations using the nomenclature of the Human Genome Variation Society and the stable TP53 genomic reference provided by Locus Reference Genomic (14). Several single nucleotide polymorphisms (SNPs) in the coding region of the TP53 gene have been identified and extensively characterized. Among the missense SNPs, rs1042522 (p.Pro72Arg) is common in all populations across the globe. Contrastingly, rs1800371 (p.Pro47Ser) has been shown to be specific to the African population (20). Both SNPs are included in ClinVar and considered benign according to American College of Medical Genetics and Genomics (ACMG) criteria. In a recent survey, new TP53 missense SNPs, including five variants specific to the Asian population, were identified and characterized ( ) (21). None of these variants were found to display LOF compared to the normal TP53 gene ( ) and they are now defined as bona fide benign SNPs (21). In UMD_TP53, these variants are specifically flagged as germline SNPs. However, other LSDBs and CGDs define several of them as somatic and potentially pathogenic variants.
Figure 2

Distribution of benign missense TP53 SNPs in the p53 protein. SNPs specific for an ethnic population are indicated by colored dots.

Distribution of benign missense TP53 SNPs in the p53 protein. SNPs specific for an ethnic population are indicated by colored dots. The UMD_CLL database. (A) TP53 mutations from CLL patients included in UMD_TP53 have been manually curated to correct for study duplication. For patients analyzed via Sanger in the nineties and via NGS more recently, only the more recent data were kept in the database as the sensitivity of NGS uncovered less frequent variants. (B) The UMD_CLL database includes three independent sets of functional data used to assess the loss of function of more than 10,000 TP53 variants: A, B and C, data from Giacomelli et al. in mammalian cells; RFS, data from Kotler et al. in mammalian cells; K, data from Kato et al. in yeast cells. Correlation analysis and multidimensional scaling showed excellent agreement between these three sets of data (19). Each dataset has been used to compare the TP53 variants from UMD_CLL (red) to benign TP53 SNPs (green). (C) The landscape of TP53 variants in CLL is similar to that of other types of cancer, with 78% of tumors expressing a mutant TP53 (missense and in-frame variants) and 22% null variants (splice, nonsense and frameshift mutations); (D) Analysis of the distribution of TP53 variants in TP53 protein from CLL patients showed several unusual features, such as a frameshift mutation in codon 209. See text for more details. (E) At least 25% of CLL patients carry at least two pathogenic TP53 variants, and up to 13% carry more than four. This situation is shared only with myelodysplastic syndrome, where up to 20% of patients show two TP53 variants. As half of the CLL data in UMD_TP53 originated from Sanger analyses, it is likely that CLL intratumor heterogeneity is underestimated. (F) All TP53 variants from UMD_TP53 have been classified according to ACMG criteria. For this purpose, all newly discovered, rare, benign SNPs misidentified as pathogenic mutations have been removed from the database. The three major CGDs (ICGC, TCGA and GENIE) include data from both whole exome and whole genome sequencing of multiple tumors ( ). CGDs list fewer TP53 variants than LSDBs do. However, the former are able to show the full pattern of mutations in a single tumor, which enables analyses that are not possible with the latter. For example, CGDs enable the identification of mutual exclusivity of genomic alterations to identify genes belonging to a same functional pathway, as they do not mutate simultaneously in a same patient (22).

Shaping the Landscape of TP53 Mutations in CLL

Although most DNA damage resulting from endogenous and/or exogenous insults is successfully managed by the various DNA repair mechanisms, some does escape those processes and transform into stable mutations. Of these latter, only a few will target cancer genes and thus confer a growth advantage (driver mutations). The remaining mutations will be co-selected during the neoplastic process (passenger mutations). The number of driver mutations is very low (less than 20). However, that of passenger mutations is several orders of magnitude higher, ranging from 0.8 substitutions per megabase for hematological neoplasms such as CLL to 9 or 11 for lung cancer or melanoma respectively (23). As a result, due to the specificity of the damage caused by such insults and the specific repair mechanisms used by the cell to correct the damage, mutagenic processes generate characteristic point mutation rate spectra, which are called mutational signatures. These signatures point to the mutagenic processes active in a tumor and reveal the high tissue specificity of these mutagenic mechanisms. For these analyses, passenger mutations are preferred as they are not subject to any selective process. In contrast, mutations in driver genes are highly biased as only those able to drive a cancer phenotype will be selected, whether it is via the LOF of a tumor suppressor gene or the gain of function of an oncogene. For the latter, mutations (predominantly missense variants) are restricted to a few codons in the gene targeting key functional residues. For tumor suppressor genes, mutations (predominantly nonsense or frameshift) will lead to a null phenotype or the synthesis of an inactive truncated protein. The mode of inactivation of TP53 is unique compared to other tumor suppressor genes, with more than 80% of somatic and germline TP53 alterations being missense mutations that lead to the synthesis of a stable mutant protein that accumulates in the nucleus of tumor cells (24). The classification of TP53 as a tumor suppressor gene led to a general belief wherein the loss of TP53 function is the sole mechanism associated with TP53 mutations. In fact, this strong selection to maintain expression of mutants in tumors is known to have a vital role in transformation, including dominant activity (DN) and/or a gain of function (GOF), making TP53 variants oncogenic. The distribution of mutations in the p53 protein is also unique among oncogenes and tumor suppressor genes as nearly all of the protein’s 393 amino acid residues have been the target of at least one mutation in human cancer. Each residue in the core domain (containing the DNA-binding region) has been found to be mutated at least five times in independent tumors, and up to 6,000 times for hotspot mutants. Nevertheless, the distribution of these mutations, and therefore the landscape of TP53 variants observed in a number of types of cancer, is very heterogenous. This aspect may result from the specificity of the insults that generate the mutations (5-methylcytosine deamination at CpG dinucleotide, UV, tobacco carcinogens or chemotherapy) or from the tissue-specific selection of TP53 variants with a special growth advantage (25). At first glance, an analysis of the 3,914 cases of CLL in UMD_TP53 shows a mutation profile similar to those of other cancers ( ), with 76% of missense mutations mostly localized in the DNA-binding domain of TP53. The unusual feature of TP53 mutation in CLL is the presence of a specific hot spot variant: a deletion of two nucleotides at codon 209 (c.626_627del) leading to premature termination (p.Arg209LysfsTer6) ( ). Frameshift variants are found all along the TP53 gene in every type of cancer, but variant c.626_627del is highly predominant in CLL (15% of frameshift mutations in CLL compared to 1 to 2% in other cancer types). The sequence around codon 209 contains an inverted repeat that could explain its specific mutability. Furthermore, the observation of this variant in both untreated and treated patients indicates that it originates from an unknown endogenous mechanism. Although frameshift variants are usually not expressed due to NMD (nonsense-mediated mRNA decay) and protein instability, a specific selection for a truncated TP53 cannot be formally excluded.

Subclonality of TP53 Mutations

Whole exome and whole genome sequencing have provided new insights into the heterogeneity and evolution of tumors, with, importantly, the detection of a high number of subclones in a single tumor (26, 27). This knowledge on the subclonality of TP53 mutations is likely to have implications for biomarker discovery and/or cancer therapy, particularly in the era of targeted treatments. Furthermore, indications of a relationship between this heterogeneity and clinical outcomes are emerging. TP53 mutated subclones with variant allele frequencies (VAFs) lower than 10% (range 0.3% to 10%, depending on the study), undetectable by conventional Sanger sequencing, have been reported in multiple studies (28–33). Subclonal TP53 variants and high VAF variants have the same profile, including similar hot spot variants. Longitudinal studies have shown that some of these clones can become more prevalent during the development of the tumor, regardless of whether the patient was treated or not. These small mutated subclones have been shown to be associated with unfavorable prognoses in some studies. However, this issue remains controversial, and there is currently no use of mutated subclones in the clinic. TP53 classifications and the methods and cut-offs used to define low VAF clones must be harmonized to enable consensus. Another characteristic is the high number of CLL patients with multiple TP53 variants ( ). This feature appears specific to CLL; it has not been observed in other types of cancer (22). Bi-allelic TP53 inactivation could explain two TP53 variants but not a higher number of them (range 3 to 10) (34). This high intratumor heterogeneity has been detected in multiple independent studies and validated by specific analyses such as FASAY (functional analysis of separated alleles in yeast) and SMRT (single-molecule real-time sequencing) that confirm different allelic locations for these TP53 variants. Like for minor subclones, most TP53 variants identified in tumors bearing multiple TP53 variants are truly pathogenic. The basis of this specific selection for multiple TP53 variants during the course of CLL is currently unknown.

TP53 Mutation Heterogeneity and Pathogenicity

As early as the nineties, it was obvious that TP53 mutant LOF was heterogenous. Variants were classified as “contact” or “structural,” depending on whether the substituted amino-acid acted directly on DNA interactions (p.Arg273His) or caused a general effect on the protein structure (p.Arg175His). Several classifications for variants based on TP53 aspects have been suggested to stratify patients with TP53 mutations but none have reached the clinic due to the high heterogeneity of the variants and the specificity of the variants among cancer types. A number of predictive tools have been developed, exploiting such information as sequence phylogenetic conservation, amino acid physicochemical properties, functional domains and structural attributes. Commonly used variant effect prediction methods include SIFT (35, 36), PolyPhen (37), GERP++ (38), Condel (39), CADD (40), fathmm (41), MutationTaster (42), MutationAssessor (43), GESPA (44) and, more recently, REVEL (45) and ENVISION (46). Several of these methods, such as fathmm, Condel, CADD and REVEL, integrate data from multiple tools to improve classification accuracy. Recent methods have used machine learning processes. Their training and validation were conducted using datasets of classified variants taken from either pathogenic (COSMIC, TCGA, GENIE, HGMD) or benign (dbSNP, gnomAD or Clinvar) variant databases. Nonetheless, for TP53 and other genes, these various classifiers have heterogenous outcomes and no consensus for their use has been reached. GENIE uses SIFT and PolyPhen, whereas TCGA uses SIFT, PolyPhen and MutationAssessor, and COSMIC uses fathmm. When employing predicting methods based on phylogenetic conservation, tools based on amino acid physicochemical properties such as SIFT or PolyPhen should be used with great precaution as the relation of the deleteriousness of the protein predicted by these tools and any association with disease is far from being straightforward. To solve this issue, the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) have published standards and guidelines for the interpretation of sequence variants (47). These guidelines describe a proposition for classifying variants as “pathogenic,” “likely pathogenic”, “uncertain significance”, “likely benign” or “benign” according to a series of criteria with levels of evidence defined as “very strong”, “strong”, “moderate” or “supporting”. They have been widely adopted by clinical laboratories around the world. However, these recommendations were primarily designed for constitutional variants. Thus, their use for somatic variants requires some adjustment (48). The two main criteria used for the levels of evidence were based on population (BA1, BS1 and BS2) and functional (BS3) data, which are now fully available in UMD_TP53. One of the main advantages for TP53 over other tumor suppressor genes is the availability of a range of functional data for all possible missense mutations occurring in the coding region for the large isoform of the protein. Data from three independent large-scale saturation mutagenesis screening studies carried out in different settings (yeast or mammalian) and with different readouts (transcription, growth arrest or apoptosis) are currently available (17, 18, 49). A correlation analysis showed excellent agreement between LOF for the protein and the occurrence of these variants in different cancer types, making this criterion suitable for defining PS3 for ACMG classification ( ) (19). An analysis of the UMD_CLL database indicates that 91.9% of the TP53 variants identified in CLL, whether clonal or subclonal, are classified as pathogenic and 6.7% as VUS ( ).

A Snapshot of TP53 Mutation Status in CLL

TP53 status in tumors is complex as multiple mechanisms can impair TP53 tumor suppression pathways. Furthermore, it is quite likely that cancer specificity plays an important role in this process due to the large diversity of TP53 function and regulation among the various tissues. Although MDM2 expression is upregulated in numerous cancers, resulting in a loss of p53-dependent activities, its frequency in CLL is quite low. Other mechanisms, such as the dysregulation of the microRNA network that controls TP53, are also possible but their importance in CLL needs further investigation. In contrast, CGDs have made it possible to identify the co-occurrence or mutual exclusivity of specific genetic events. In the former, alterations of certain combinations of genes tend to co-exist in a same tumor, whereas in the latter, mostly only one out of a group of genes is altered. Individual alterations targeting similar biological processes are believed to be mutually redundant, with one alteration being sufficient to deregulate the affected process. Identifying mutual exclusivity can therefore help to identify unknown functional interactions. In CLL, this type of analysis is averted by the important genetic heterogeneity of the tumors, showing multiple subclones with different genetic alterations. Because NGS gives a global picture of these events, defining whether or not they occur in the same cells is difficult. This problem will likely be resolved once sufficient single-cell genomic analyses have been performed. As shown in , TP53 status in CLL can be very heterogeneous, as the prevalence of TP53 abnormalities, including 17p deletion and TP53 mutations, varies across the different phases of the disease (26). Furthermore, the subclonal heterogeneity of the tumors can sometimes be misleading. Indeed, bulk NGS analyses generate an averaged picture of a given population of cells, which may result in an underestimation of their true heterogeneity. Nevertheless, a general picture emerges from the various studies. TP53 mutations are not the prime event in CLL. In the early phase of the disease (stage 0), TP53 mutations appear to be either absent or infrequent, but this issue needs to be carefully reevaluated using NGS assays validated for limits of detection (LOD) ranging from 0.05% to 1% ( , panel 1). Furthermore, because these variants are usually not associated with a deletion of the second allele, FISH or SNP arrays are not suitable for early detection analyses ( , panel 2). 17p deletion and complex karyotypes occur during disease progression, leading to the conventional view of CLL with a single TP53 mutation associated with TP53 loss of heterozygosity (LOH) ( , panel 3). Targeted, high-depth, NGS of TP53 coupled with an adequate pipeline able to reach a LOD of at least 1% has led to the discovery of multiple subclones expressing different pathogenic TP53 variants ( , panel 4). Why CLL has such a propensity for TP53 mutations is currently unknown. However, it is clear that CLL depends on signals from the microenvironment and that its cells cycle between lymphoid tissue sites such as lymph nodes and peripheral blood. It is possible that the strong proliferation signals provided by the microenvironment in lymph nodes require a loss of several anti-proliferative signals such as that provided by TP53.
Figure 4

TP53 status in CLL patients, a snapshot. The top panel displays a schematic view of the tumor with the two TP53 alleles. The middle panel shows cytogenetic analysis performed by FISH (left) or by SNP arrays (right). The lower panel displays an example of the read alignments from NGS. 1: No TP53 mutation: In monoclonal B-cell lymphocytosis, TP53 mutation and 17p deletion are very rare, leading to negative results for both FISH and genetic analysis. 2: TP53 mutation without LOH: In early stages of CLL, the frequency of TP53 mutation is low (less than 10%) with many cases showing no LOH. Sensitive sequencing analysis with NGS is able to identify low VAF TP53 variants (variant M1 in the lower panel). 3: TP53 mutation with LOH: In late-stage or relapsing disease, TP53 mutations associated with 17p deletion can be found in 30 to 50% of CLL patients. In the majority of cases, VAF is greater than 50% due to the loss of the second allele. This situation is commonly seen in CLL. 4: Multiple TP53 and LOH: in both early and late-stage disease, FASAY (functional analysis of separated alleles of p53 on yeast) or SMRT (single-molecule, real-time sequencing) has demonstrated a high level of intratumoral heterogeneity in CLL with the presence of multiple independent subclones expressing different pathogenic TP53 variants (M1, M2 and M3 in the lower panel). Although 17p deletion is often observed in these patients, it is difficult to determine if subclones expressing different TP53 variants are associated with it, and even more so if the VAF of the variant is low. 5: Copy neutral LOH: Following the initial mutational inactivation of one allele, the remaining wild-type allele is deleted concurrently with the duplication of the mutated allele, leading to copy neutral LOH (cnLOH). Detecting cnLOH is difficult and thus the frequency of the event is currently unknown. Without SNP array analysis and if the VAF of the variant is lower than 50%, this situation can be misidentified as a tumor without LOH. Tumors with VAF greater than 50% without obvious 17p deletion should be checked for cnLOH. 6: Bi-allelic mutations: Inactivation of the TP53 gene via different mutations in the two alleles is possible but difficult to distinguish from intratumoral heterogeneity. Although this situation is often described as plausible in many reviews, it has never been formally identified, as only single-cell sequencing would be able to validate bi-allelic TP53 inactivation.

TP53 status in CLL patients, a snapshot. The top panel displays a schematic view of the tumor with the two TP53 alleles. The middle panel shows cytogenetic analysis performed by FISH (left) or by SNP arrays (right). The lower panel displays an example of the read alignments from NGS. 1: No TP53 mutation: In monoclonal B-cell lymphocytosis, TP53 mutation and 17p deletion are very rare, leading to negative results for both FISH and genetic analysis. 2: TP53 mutation without LOH: In early stages of CLL, the frequency of TP53 mutation is low (less than 10%) with many cases showing no LOH. Sensitive sequencing analysis with NGS is able to identify low VAF TP53 variants (variant M1 in the lower panel). 3: TP53 mutation with LOH: In late-stage or relapsing disease, TP53 mutations associated with 17p deletion can be found in 30 to 50% of CLL patients. In the majority of cases, VAF is greater than 50% due to the loss of the second allele. This situation is commonly seen in CLL. 4: Multiple TP53 and LOH: in both early and late-stage disease, FASAY (functional analysis of separated alleles of p53 on yeast) or SMRT (single-molecule, real-time sequencing) has demonstrated a high level of intratumoral heterogeneity in CLL with the presence of multiple independent subclones expressing different pathogenic TP53 variants (M1, M2 and M3 in the lower panel). Although 17p deletion is often observed in these patients, it is difficult to determine if subclones expressing different TP53 variants are associated with it, and even more so if the VAF of the variant is low. 5: Copy neutral LOH: Following the initial mutational inactivation of one allele, the remaining wild-type allele is deleted concurrently with the duplication of the mutated allele, leading to copy neutral LOH (cnLOH). Detecting cnLOH is difficult and thus the frequency of the event is currently unknown. Without SNP array analysis and if the VAF of the variant is lower than 50%, this situation can be misidentified as a tumor without LOH. Tumors with VAF greater than 50% without obvious 17p deletion should be checked for cnLOH. 6: Bi-allelic mutations: Inactivation of the TP53 gene via different mutations in the two alleles is possible but difficult to distinguish from intratumoral heterogeneity. Although this situation is often described as plausible in many reviews, it has never been formally identified, as only single-cell sequencing would be able to validate bi-allelic TP53 inactivation. Another genetic configuration observed in CLL is copy neutral LOH (cnLOH), with the same mutation in both alleles of a given cell ( , panel 5). This genetic event is attributed to mitotic recombination in tumor cells where the wild-type allele is replaced by the mutant allele leading to a large region of homozygosity that can be detected early by SNP-arrays. Inversely, this situation cannot be detected by any karyotyping analyses and could be misinterpreted as heterozygous mutation if the sequencing VAF is below 50%. The situation described in panel 6 of (two different mutations in the two alleles of a single cell) is theoretically possible and often described as a potential status in CLL and other tumors. However, such a situation appears to be very uncommon and has never been formally observed in CLL.

TP53 Clinical Considerations

TP53 mutations have been described in CLL since the early 1990s (50, 51). An association between TP53 mutations, drug resistance and poor clinical outcomes was first demonstrated in 1993 by El Rouby et al. and thereafter confirmed in further studies (52–54). In 2000, using FISH analysis for multiple chromosomal markers, Dohner et al. showed that 17p deletion, where the TP53 gene is located, was an independent predictor of disease progression and survival (55). Using either 17p deletion or TP53 mutation as a biomarker, subsequent studies confirmed this finding and resulted in TP53’s classification as a well-established prognostic marker furthermore able to provide pertinent information for establishing an appropriate course of treatment for patients. The therapeutic approach for CLL carrying TP53 mutations will be addressed in detail in another review in this series. There are, however, some biological aspects and some issues related to methodological/technical details that can be discussed here. There is no longer a place for chemo-based regimens in patients with CLLs presenting TP53 mutations. The introduction of novel targeted agents has greatly altered the clinical course of these patients, who now benefit from responses that were never observed during the chemo(immunotherapy) era (56). That said, even with the use of novel agents, CLL remains incurable. Patients with TP53 disruption (TP53 mutation/17p deletion) exhibit worse clinical outcomes compared to those without it, indicating that the management of the former is still an unmet challenge (57). This is more evident for relapsed/refractory (R/R) CLL (58–62) where data on front-line therapies are still scarce because the follow-up of clinical trials at the front line is still short (63, 64). Moreover, little is known on R/R CLL response to novel agents, a setting wherein TP53 disruption seems to be an unfavorable prognostic/predictive factor (65, 66). Another parameter that needs to be taken into consideration is that in both clinical trials and clinical care, TP53 disruptions are considered equal whatever their nature. There is thus no differentiation being made between patients with monoallelic or bi-allelic aberrations, despite data suggesting that the latter may exhibit more aggressive clinical courses (33, 60). Similarly, the number or type of mutations receives no consideration as a specific clinical feature either. Moreover, in untreated CLL with TP53 mutations, there is a subset of patients with indolent clinical courses, which suggests that other disease- and/or patient-related parameters may alter the impact of TP53 disruption (67, 68). Also, genomic instabilities at the chromosomal and molecular level, as well as the immunogenetic features of the clonotypic B-cell receptor, namely the somatic hypermutation status of the immunoglobulin heavy variable gene, have been proposed as factors that may aggravate or alleviate the impact of TP53 mutations (69–72). Finally, there is a discrepancy regarding the threshold for reporting TP53 mutations detected by NGS in the clinical setting versus the official guidelines that merits discussion. According to the latest versions of these latter, only mutations with VAF ≥10% should be reported and used for directing treatment choice (73). This conservative approach within the official guidelines is based mainly on the fact that the clinical impact of small TP53 clones, especially those below 5%, has not been demonstrated to date in prospective clinical trials. However, diagnostic laboratories are becoming more experienced in NGS data output management, and resultantly, clones down to 2-5% are being reported in the clinical setting and, in the majority of cases, taken into consideration for clinical decision-making.

Remaining Challenges and Perspectives

Compared to other cancer types, the clinical value of TP53 status in CLL has always been uncontested and it is now a required biomarker for patient stratification. It is therefore essential that TP53 analyses be performed in a standardized manner to provide consistent data across the various clinical laboratories. For this purpose, the TP53 Network of the European Research Initiative on Chronic Lymphocytic Leukemia (ERIC) had released a first recommendation in 2012 and updated it recently to take into account the emergence of NGS (73, 74). Nevertheless, considering the quick evolution of methodologies and the discovery of the subclonal heterogeneity of TP53 variants with low VAF clones, a new consensus must be reached for the controversial issue of the limit of detection used to report TP53 variants in clinical laboratories. Although conventional Sanger sequencing has been widely used in the past, it is now clear that NGS-based analysis should become mandatory for the clinical detection of low VAF clones. The current situation is unclear, with several studies suggesting that patients with low VAF TP53 clones have the same clinical prognosis as patients with high VAF ones, and other studies unable to confirm that finding. Reaching a consensus to define a robust, clinically justified LOD will be essential for improving patient stratification. Furthermore, despite their relative infrequency, it will be important to evaluate the real incidence of multiple TP53 subclonal mutations using adequate methodologies as well as their evolution during the course of disease and with different types of treatment. Whether or not this reservoir of heterogenous oncogenic TP53 variants is an essential component of the plasticity of CLL remains to be addressed.

Author Contributions

All authors listed have made a substantial, direct, and intellectual contribution to the work, and approved it for publication.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  74 in total

Review 1.  Translating p53 into the clinic.

Authors:  Chit Fang Cheok; Chandra S Verma; José Baselga; David P Lane
Journal:  Nat Rev Clin Oncol       Date:  2010-10-26       Impact factor: 66.675

2.  Integrated Analysis of TP53 Gene and Pathway Alterations in The Cancer Genome Atlas.

Authors:  Lawrence A Donehower; Thierry Soussi; Anil Korkut; Yuexin Liu; Andre Schultz; Maria Cardenas; Xubin Li; Ozgun Babur; Teng-Kuei Hsu; Olivier Lichtarge; John N Weinstein; Rehan Akbani; David A Wheeler
Journal:  Cell Rep       Date:  2019-07-30       Impact factor: 9.423

3.  Actionable Genes, Core Databases, and Locus-Specific Databases.

Authors:  Amélie Pinard; Morgane Miltgen; Arnaud Blanchard; Hélène Mathieu; Jean-Pierre Desvignes; David Salgado; Aurélie Fabre; Pauline Arnaud; Laura Barré; Martin Krahn; Philippe Grandval; Sylviane Olschwang; Stéphane Zaffran; Catherine Boileau; Christophe Béroud; Gwenaëlle Collod-Béroud
Journal:  Hum Mutat       Date:  2016-09-26       Impact factor: 4.878

Review 4.  Clinical Implications of Novel Genomic Discoveries in Chronic Lymphocytic Leukemia.

Authors:  Gregory Lazarian; Romain Guièze; Catherine J Wu
Journal:  J Clin Oncol       Date:  2017-02-13       Impact factor: 44.544

5.  Survival of Del17p CLL Depends on Genomic Complexity and Somatic Mutation.

Authors:  Sunita R Setlur; Jennifer R Brown; Lijian Yu; Haesook T Kim; Siddha Kasar; Parul Benien; Wei Du; Kevin Hoang; Andrew Aw; Bethany Tesar; Reina Improgo; Stacey Fernandes; Saranya Radhakrishnan; Josephine Klitgaard; Charles Lee; Gad Getz
Journal:  Clin Cancer Res       Date:  2016-08-08       Impact factor: 12.531

6.  The impact of TP53 mutations and TP53 deletions on survival varies between AML, ALL, MDS and CLL: an analysis of 3307 cases.

Authors:  A Stengel; W Kern; T Haferlach; M Meggendorfer; A Fasan; C Haferlach
Journal:  Leukemia       Date:  2016-09-29       Impact factor: 11.528

7.  Targeting BCL2 with Venetoclax in Relapsed Chronic Lymphocytic Leukemia.

Authors:  Andrew W Roberts; Matthew S Davids; John M Pagel; Brad S Kahl; Soham D Puvvada; John F Gerecitano; Thomas J Kipps; Mary Ann Anderson; Jennifer R Brown; Lori Gressick; Shekman Wong; Martin Dunbar; Ming Zhu; Monali B Desai; Elisa Cerri; Sari Heitner Enschede; Rod A Humerickhouse; William G Wierda; John F Seymour
Journal:  N Engl J Med       Date:  2015-12-06       Impact factor: 91.245

8.  International Cancer Genome Consortium Data Portal--a one-stop shop for cancer genomics data.

Authors:  Junjun Zhang; Joachim Baran; A Cros; Jonathan M Guberman; Syed Haider; Jack Hsu; Yong Liang; Elena Rivkin; Jianxin Wang; Brett Whitty; Marie Wong-Erasmus; Long Yao; Arek Kasprzyk
Journal:  Database (Oxford)       Date:  2011-09-19       Impact factor: 3.451

Review 9.  TP53: an oncogene in disguise.

Authors:  T Soussi; K G Wiman
Journal:  Cell Death Differ       Date:  2015-05-29       Impact factor: 15.828

10.  COSMIC: the Catalogue Of Somatic Mutations In Cancer.

Authors:  John G Tate; Sally Bamford; Harry C Jubb; Zbyslaw Sondka; David M Beare; Nidhi Bindal; Harry Boutselakis; Charlotte G Cole; Celestino Creatore; Elisabeth Dawson; Peter Fish; Bhavana Harsha; Charlie Hathaway; Steve C Jupe; Chai Yin Kok; Kate Noble; Laura Ponting; Christopher C Ramshaw; Claire E Rye; Helen E Speedy; Ray Stefancsik; Sam L Thompson; Shicai Wang; Sari Ward; Peter J Campbell; Simon A Forbes
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  1 in total

1.  Five Percent Variant Allele Frequency Is a Reliable Reporting Threshold for TP53 Variants Detected by Next Generation Sequencing in Chronic Lymphocytic Leukemia in the Clinical Setting.

Authors:  Tatjana Pandzic; Claes Ladenvall; Marie Engvall; Mattias Mattsson; Monica Hermanson; Lucia Cavelier; Viktor Ljungström; Panagiotis Baliakas
Journal:  Hemasphere       Date:  2022-08-02
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.