| Literature DB >> 27924834 |
Igor B Rogozin1,2, Artem G Lada3,4, Alexander Goncearenco1, Michael R Green3, Subhajyoti De5, German Nudelman6, Anna R Panchenko1, Eugene V Koonin1, Youri I Pavlov3,7.
Abstract
Follicular lymphoma (FL) is an uncurable cancer characterized by progressive severity of relapses. We analyzed sequence context specificity of mutations in the B cells from a large cohort of FL patients. We revealed substantial excess of mutations within a novel hybrid nucleotide motif: the signature of somatic hypermutation (SHM) enzyme, Activation Induced Deaminase (AID), which overlaps the CpG methylation site. This finding implies that in FL the SHM machinery acts at genomic sites containing methylated cytosine. We identified the prevalence of this hybrid mutational signature in many other types of human cancer, suggesting that AID-mediated, CpG-methylation dependent mutagenesis is a common feature of tumorigenesis.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27924834 PMCID: PMC5141443 DOI: 10.1038/srep38133
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Mutable DNA sequence motifs analyzed in this work.
Variants of DNA sequences corresponding to a defined motif (left column, bold) are shown to the right in the double-stranded DNA form. Mutation-prone bases are in red and underlined.
Association between known mutable motifs and the DNA sequence context of somatic mutations in exomes of follicular lymphoma.
| Mutable motif | Mutator protein/system | Reference | Excess of mutations in the motif | Type of statistical test | P- value |
|---|---|---|---|---|---|
| CpG methylation | 2.3 | Fisher | <10−8 | ||
| Monte-Carlo | <0.001 | ||||
| T | APOBEC1 (A1) or A3A or A3B | 0.9 | Fisher | NS | |
| Monte-Carlo | NS | ||||
| WR | AID | 1.2 | Fisher | <10−7 | |
| Monte-Carlo | <0.001 | ||||
| WR | AID component of somatic hypermutation | 1.0 | Fisher | NS | |
| Monte-Carlo | NS | ||||
| WR | SHM/AID in FL | This study | 2.7 | Fisher | <10−8 |
| Monte-Carlo | <0.001 | ||||
| W | DNA polymerase η (pol η) | 1.1 | Fisher | <2 × 10−5 | |
| Monte-Carlo | <0.001 | ||||
| T | A1/A3A/A3B | 0.8 | Fisher | NS | |
| Monte-Carlo | NS | ||||
| WR | AID | 0.9 | Fisher | NS | |
| Monte-Carlo | NS | ||||
| W | pol η | 1.1 | Fisher | <2 × 10−5 | |
| Monte-Carlo | <0.001 | ||||
*The correlation was measured using Fisher exact test (Fisher) and Monte Carlo (MC) test. Mutable positions in consensus sequences are underlined (R = A or G; Y = T or C; M = A or C; K = G or T; D = A,T or G; H = A, T, or C; W = A or T, Fig. 1). The excess of mutations in motifs was calculated using the ratio Fm/Fn, where Fm is the fraction of somatic mutations observed in the given mutable motif (the number of mutated motifs divided by the number of mutations), and Fn is the frequency of the motif in the DNA neighborhood of somatic mutations (the number of motif positions divided by the total number of all un-mutated positions in the 120 bp window).
**NS, no significant excess.
Difference between the mutability of AID motifs WRC/GYW with vs without an extra 3′ GC pair (Fig. 1) in various cancer genomes (the Sanger COSMIC Whole Genome Project)*.
| Tissue/cancer | Fraction mutated WR | Fraction mutated Y | P-value, Fisher test |
|---|---|---|---|
| Follicular lymphoma | 0.046 (9,438) | 0.039 (37,315) | 0.005 |
| Blood | 0.063 (16,668) | 0.050 (71,551) | <10−10 |
| Blood: Acute Myeloid Leukemia | 0.067 (10,226) | 0.048 (43,958) | <10−10 |
| Blood: GCB Lymphomas | 0.063 (5,033) | 0.050 (21,674) | 5 × 10−5 |
| 0.037 (118,009) | 0.037 (449,770) | NS | |
| 0.022 (55,861) | 0.029 (221,217) | NS | |
| 0.02 (58,896) | 0.029 (231,074) | NS | |
| Colon | 0.102 (261,993) | 0.073 (1,018,315) | <10−10 |
| Kidney | 0.039 (45,835) | 0.032 (179,222) | <10−10 |
| Liver | 0.036 (115,921) | 0.030 (480,198) | <10−10 |
| 0.036 (221,301) | 0.036 (872,132) | NS | |
| Ovary | 0.044 (33,436) | 0.037 (132,132) | <10−10 |
| Pancreas | 0.077 (60,409) | 0.057 (242,824) | <10−10 |
| Prostate | 0.072 (27,084) | 0.064 (106,577) | 3 × 10−5 |
| Rectum | 0.086 (46,433) | 0.068 (177,479) | <10−10 |
| 0.004 (201,866) | 0.042 (824,249) | NS | |
| Stomach | 0.090 (204,610) | 0.061 (802,363) | <10−10 |
| Uterus | 0.054 (84,699) | 0.046 (317,894) | <10−10 |
Tissue types without significant correlation (taking into account the Bonferroni correction for multiple tests) between the motif and somatic mutations are underlined. “Fraction mutated CGYW” and “Fraction mutated YCG/CGR and SNCG/CGNS” are fractions of mutated motifs (the number of the mutated motifs divided by the total number of motifs in the analyzed data set). Absence of significant excess of mutations in CGYW/WRCG (NS, no significant excess) indicates that there is no connection between mutagenesis of CG and GYW motifs.
**Total number of motifs are in brackets.
Preferential mutability of WRC/GYW and somatic mutations in various cancer Whole Genomes and Whole Exomes (the Sanger COSMIC Whole Genome Project)*.
| Tissue | Fraction of mutations observed in the mutable motif | Fraction of motifs in surrounding regions | Excess of mutations in the motif | P-value Fisher exact test |
|---|---|---|---|---|
| Blood | 0.265 (10,633) | 0.222 (630,338) | 1.19 | <10−10 |
| Blood: Acute Myeloid Leukemia | 0.254 (6,844) | 0.221 (405,754) | 1.15 | <10−10 |
| Blood: GCB Lymphomas | 0.296 (2,747) | 0.221 (165,805) | 1.34 | <10−10 |
| 0.146 (85,203) | 0.230 (4,877,914) | 0.64 | NS | |
| 0.093 (38,750) | 0.229 (2,247,271) | 0.41 | NS | |
| 0.074 (41,454) | 0.229 (2,399,601) | 0.32 | NS | |
| Colon | 0.257 (175,109) | 0.221 (8,986,392) | 1.16 | <10−10 |
| 0.225 (32,382) | 0.228 (1,875,298) | 0.99 | NS | |
| 0.226 (74,161) | 0.222 (4,426,455) | 1.02 | NS | |
| 0.174 (180,284) | 0.226 (9,867,123) | 0.77 | NS | |
| 0.229 (22,340) | 0.225 (1,309,849) | 1.02 | NS | |
| Pancreas | 0.227 (35,165) | 0.220 (2,112,760) | 1.03 | 2 × 10−3 |
| 0.22 (13,036) | 0.222 (775,226) | 0.99 | NS | |
| 0.223 (23,330) | 0.226 (1,343,391) | 0.99 | NS | |
| 0.050 (198,098) | 0.224 (8,986,960) | 0.22 | NS | |
| Stomach | 0.274 (115,652) | 0.221 (6,977,875) | 1.24 | <10−10 |
| Uterus | 0.254 (55,999) | 0.229 (3,212,849) | 1.11 | <10−10 |
Tissue types without significant correlation (taking into account the Bonferroni correction for multiple tests) between the motif and somatic mutations are underlined. The excess of mutations in motifs was calculated using the ratio Fsm/Fc, where Fsm is the fraction of somatic mutations observed in the studied mutable motif (the number of mutated motifs divided by the number of mutations), and Fc is the frequency of the motif in the DNA context of somatic mutations (the number of motif positions divided by the total number of all un-mutated positions in surrounding regions).
*Absence of significant excess of mutations in WRC/GYW (NS, no significant excess) suggests that there is no connection between mutagenesis and WRC/GYW motifs.
Figure 2Tumors types with mutation enrichment in the hybrid AID/CpG motif tend to possess an excess of mutations with pure AID signature.
Figure 3The methylation ratio in WRCG motifs and non- WRCG motifs (YCG/CGR and SNCG/CGNS motifs).
The fraction of motifs in each bin (0–20% methylation ratio, 20–40% methylation ratio, etc.) is shown.