| Literature DB >> 35054819 |
Hajnalka Laura Pálinkás1,2, Lőrinc Pongor3,4, Máté Balajti3, Ádám Nagy3,4, Kinga Nagy1,2, Angéla Békési1,2, Giampaolo Bianchini5, Beáta G Vértessy1,2, Balázs Győrffy3,4.
Abstract
The clonal composition of a malignant tumor strongly depends on cellular dynamics influenced by the asynchronized loss of DNA repair mechanisms. Here, our aim was to identify founder mutations leading to subsequent boosts in mutation load. The overall mutation burden in 591 colorectal cancer tumors was analyzed, including the mutation status of DNA-repair genes. The number of mutations was first determined across all patients and the proportion of genes having mutation in each percentile was ranked. Early mutations in DNA repair genes preceding a mutational expansion were designated as founder mutations. Survival analysis for gene expression was performed using microarray data with available relapse-free survival. Of the 180 genes involved in DNA repair, the top five founder mutations were in PRKDC (n = 31), ATM (n = 26), POLE (n = 18), SRCAP (n = 18), and BRCA2 (n = 15). PRKDC expression was 6.4-fold higher in tumors compared to normal samples, and higher expression led to longer relapse-free survival in 1211 patients (HR = 0.72, p = 4.4 × 10-3). In an experimental setting, the mutational load resulting from UV radiation combined with inhibition of PRKDC was analyzed. Upon treatments, the mutational load exposed a significant two-fold increase. Our results suggest PRKDC as a new key gene driving tumor heterogeneity.Entities:
Keywords: DNA repair; cancer; mutation burden; next generation sequencing; non-homologous end joining; survival
Mesh:
Substances:
Year: 2022 PMID: 35054819 PMCID: PMC8775830 DOI: 10.3390/ijms23020633
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Characteristics of DNA repair pathway mutations in colorectal cancer. Summary of the analysis pipeline of selecting DNA-repair genes linked to increased mutation burden in colorectal cancer samples of the TCGA repository (A). A higher mutation load (median and interquartile ranges of the total mutation count) was observed when mutations were present in any of the DNA repair pathways (B). Simultaneous mutations in DNA repair pathways show enrichment for each signature as demonstrated by the high odds ratios of co-mutation when comparing DNA repair mechanisms summarized in a pairwise heatmap format. Percentage of patients with co-mutation in pathways is displayed within the tiles of the heatmap (C). Mutation load (boxplots showing median values with lower and upper quartile values and whiskers marking the range) related to mutation in most commonly mutated genes associated with DNA repair (D). BER = base excision repair, HR = homologous recombinational repair, MMR = mismatch repair, NER = nucleotide excision repair, and NHEJ = non-homologous end joining repair.
Top genes associated with high mutation burden in colorectal cancer.
| Cohort | Gene | Mutation Burden (Mutant Samples) | Mutation Burden (Wild Samples) | Fold Increase | Samples with Mutation (%) | Total Samples | |
|---|---|---|---|---|---|---|---|
| TCGA COAD (mutect2) |
|
|
|
|
|
|
|
|
| 3.7 × 10−24 | 2926 | 333 | 8.8 | 20.0 | 431 | |
|
| 1.3 × 10−22 | 4002 | 336 | 11.9 | 13.9 | 431 | |
|
| 1.2 × 10−20 | 2951 | 335 | 8.8 | 15.1 | 431 | |
|
| 5.1 × 10−18 | 3538 | 348 | 10.2 | 10.2 | 431 | |
| DFCI |
| 2.2 × 10−14 | 919 | 123 | 7.5 | 7.3 | 619 |
|
| 1.2 × 10−13 | 948 | 123 | 7.7 | 6.3 | 619 | |
|
| 1.4 × 10−12 | 819 | 123 | 6.7 | 7.4 | 619 | |
|
| 1.3 × 10−9 |
|
|
|
|
| |
|
| 1.1 × 10−8 | 1122 | 126 | 8.9 | 3.2 | 619 | |
| GenenTech |
| 4.4 × 10−4 | 909 | 82 | 11.1 | 18.1 | 72 |
|
| 1.2 × 10−3 | 2393 | 86 | 27.8 | 5.6 | 72 | |
|
| 1.5 × 10−3 | 1218 | 84 | 14.5 | 6.9 | 72 | |
|
| 2.2 × 10−3 |
|
|
|
|
| |
|
| 3.2 × 10−3 | 1195 | 84 | 14.2 | 6.9 | 72 |
Figure 2Founder mutations lead to an increased mutation load. Theoretical histogram showing the number of genes with a mutation as a function of the mutation frequency in a sample (A). The area under the curve (AUC) equals the total mutation burden (TMB) in a sample. Theoretical and actual representation using a representative example (with TCGA sample identifier) of founder mutation and different mutation load patterns in colorectal cancer patients with a single (B), double (C), and exponential (D) clonal expansion. The red arrows show the incidence of the founder mutation. Mutations in the PRKDC gene are indicated in the TCGA samples.
Figure 3Localization of PRKDC mutations show uniform distribution in the entire gene using lollipop plot figures representing mutations in an information-dense manner indicating the positions in amino acid coordinates within the domain organization scheme of the gene (A,B). Some mutations are localized in annotated domains of PRKDC (NUC194, FAT, and PI3–PI4 kinase domains), while other mutations are localized in structurally/functionally unknown protein segments (grey line). Distribution of different types of mutations color coded in PRKDC (A) and mutation frequency for each sample using different shades of grey (B). Expression of PRKDC is significantly higher in colorectal cancer samples (microarray gene expression data from the NCBI Gene Expression Omnibus (GEO) database, see Section 4.8) (C).
Figure 4Experimental overview of the in vitro experiments. Cell lines were treated with UV to induce DNA damage and inhibitors against PRKDC and ATM. Five rounds of treatment with recovery periods were applied. Multiclone cell colonies (approx. three colonies/well) were cultured in 96-well plates. DNA was extracted from approximately 10 colonies, creating a polyclonal mixture, where drug-induced mutations were amplified over detection thresholds. Samples were subjected to whole-exome sequencing for mutation analysis.
Figure 5The number of mutations show a significant increase in samples treated with UV and either PRKDC or both inhibitors. Boxplot showing the number of mutations including SNVs and indels in the samples treated with UV, iPRKDC, iATM, or both. The mean numbers of mutations in each category are represented by squares.
Results of the survival analysis for the expression of the top genes associated with high mutation burden in colorectal cancer patients. RFS = relapse-free survival, CI = 95% confidence interval.
| Gene | HR | CI | Expression Fold Change (Tumor vs. Normal) | |
|---|---|---|---|---|
|
| 0.72 | 0.58–0.9 | 4.40 × 10−3 | 6.44 |
|
| 0.73 | 0.58–0.91 | 5.60 × 10−3 | 4.30 |
|
| 0.81 | 0.64–1.02 | 6.90 × 10−2 | 2.74 |
|
| 1.18 | 0.93–1.49 | 1.68 × 10−1 | 1.48 |
|
| 1.17 | 0.94–1.46 | 1.70 × 10−1 | 1.40 |
|
| 1.48 | 1.13–1.95 | 4.70 × 10−3 | 1.38 |
|
| 0.67 | 0.53–0.85 | 8.0 × 10−4 | 1.09 |
|
| 0.61 | 0.47–0.77 | 5.30 × 10−5 | 0.95 |