| Literature DB >> 28125682 |
Chaohui Yuan1, Charles C Chu2,3, Xiao-Jie Yan2, Davide Bagnara2, Nicholas Chiorazzi2,3, Thomas MacCarthy1.
Abstract
The targeting of mutations by Activation-Induced Deaminase (AID) is a key step in generating antibody diversity at the Immunoglobulin (Ig) loci but is also implicated in B-cell malignancies such as chronic lymphocytic leukemia (CLL). AID has previously been shown to preferentially deaminate WRC (W = A/T, R = A/G) hotspots. WGCW sites, which contain an overlapping WRC hotspot on both DNA strands, mutate at much higher frequency than single hotspots. Human Ig heavy chain (IGHV) genes differ in terms of WGCW numbers, ranging from 4 for IGHV3-48*03 to as many as 12 in IGHV1-69*01. An absence of V-region mutations in CLL patients ("IGHV unmutated", or U-CLL) is associated with a poorer prognosis compared to "IGHV mutated" (M-CLL) patients. The reasons for this difference are still unclear, but it has been noted that particular IGHV genes associate with U-CLL vs M-CLL. For example, patients with IGHV1-69 clones tend to be U-CLL with a poor prognosis, whereas patients with IGHV3-30 tend to be M-CLL and have a better prognosis. Another distinctive feature of CLL is that ~30% of (mostly poor prognosis) patients can be classified into "stereotyped" subsets, each defined by HCDR3 similarity, suggesting selection, possibly for a self-antigen. We analyzed >1000 IGHV genes from CLL patients and found a highly significant statistical relationship between the number of WGCW hotspots in the germline V-region and the observed mutation frequency in patients. However, paradoxically, this correlation was inverse, with V-regions with more WGCW hotspots being less likely to be mutated, i.e., more likely to be U-CLL. The number of WGCW hotspots in particular, are more strongly correlated with mutation frequency than either non-overlapping (WRC) hotspots or more general models of mutability derived from somatic hypermutation data. Furthermore, this correlation is not observed in sequences from the B cell repertoires of normal individuals and those with autoimmune diseases.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28125682 PMCID: PMC5268644 DOI: 10.1371/journal.pone.0167602
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Comparison of the number of WGCW hotspots to mean mutation frequency.
Number of WGCW hotspots in the germline IGHV gene sequence (horizontal axis) vs mean mutation frequency of each IGHV gene in CLL (vertical axis). Linear regression fit for all data points is shown by gray line; orange line shows fit with three outliers (encircled points) removed.
Fig 2Profile of AID hotspots for (A) IGHV 3–21*01 and (B) IGHV3-48*03. Each colored dot in each panel represents an AID regular hot/cold/neutral spot as labeled at the edges. Double dots are overlapping hotspots (WGCW). CDR1 and CDR2 are indicated in gray shading. Although these genes have few WGCW hotspots, they both have a particularly dense region of regular GYW hotspots in CDR2, particularly so for IGHV 3–21*01. The blue boxes highlight dense clusters of non-overlapping WRC/GYW hotspots.
Fig 3Comparison of the number of WGCW hotspots to mean mutation frequency aggregated by gene.
This plot is equivalent to Fig 1, except that each point here is aggregated by gene (e.g. IGHV3-23) by combining all patients with IGHV3-23 alleles (e.g. IGHV3-23*01, IGHV3-23*02, etc.) into single data points. The regression line (orange) was fit excluding the outliers (encircled points).
Potential amino acid changes at WGCW hotspots.
| Amino-acid | Abbr | U.total | U.norm | U.FW | U.CDR | M.total | M.norm | M.FW | M.CDR |
|---|---|---|---|---|---|---|---|---|---|
| Ala | A | 13 | 16.0% | 4 | 9 | 10 | 18.9% | 7 | 3 |
| Cys | C | 5 | 6.2% | 5 | 0 | 0 | 0.0% | 0 | 0 |
| Ile | I | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Leu | L | 13 | 16.0% | 8 | 5 | 9 | 17.0% | 7 | 2 |
| Met | M | 1 | 1.2% | 1 | 0 | 1 | 1.9% | 1 | 0 |
| Phe | F | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Trp | W | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Val | V | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Gly | G | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| His | H | 2 | 2.5% | 2 | 0 | 1 | 1.9% | 1 | 0 |
| Pro | P | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Ser | S | 26 | 32.1% | 5 | 21 | 15 | 28.3% | 3 | 12 |
| Thr | T | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Tyr | Y | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Arg | R | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Asn | N | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Asp | D | 0 | 0.0% | 0 | 0 | 0 | 0.0% | 0 | 0 |
| Gln | Q | 16 | 19.8% | 14 | 2 | 14 | 26.4% | 8 | 6 |
| Glu | E | 3 | 3.7% | 0 | 3 | 0 | 0.0% | 0 | 0 |
| Lys | K | 2 | 2.5% | 0 | 2 | 3 | 5.7% | 0 | 3 |