| Literature DB >> 22808135 |
Yanwen Jiang1, T David Soong, Ling Wang, Ari M Melnick, Olivier Elemento.
Abstract
The processes of somatic hypermutation (SHM) and class switch recombination introduced by activation-induced cytosine deaminase (AICDA) at the Immunoglobulin (Ig) loci are key steps for creating a pool of diversified antibodies in germinal center B cells (GCBs). Unfortunately, AICDA can also accidentally introduce mutations at bystander loci, particularly within the 5' regulatory regions of proto-oncogenes relevant to diffuse large B cell lymphomas (DLBCL). Since current methods for genomewide sequencing such as Exon Capture and RNAseq only target mutations in coding regions, to date non-Ig promoter SHMs have been studied only in a handful genes. We designed a novel approach integrating bioinformatics tools with next generation sequencing technology to identify regulatory loci targeted by SHM genome-wide. We observed increased numbers of SHM associated sequence variant hotspots in lymphoma cells as compared to primary normal germinal center B cells. Many of these SHM hotspots map to genes that have not been reported before as mutated, including BACH2, BTG2, CXCR4, CIITA, EBF1, PIM2, and TCL1A, etc., all of which have potential roles in B cell survival, differentiation, and malignant transformation. In addition, using BCL6 and BACH2 as examples, we demonstrated that SHM sites identified in these 5' regulatory regions greatly altered their transcription activities in a reporter assay. Our approach provides a first cost-efficient, genome-wide method to identify regulatory mutations and non-Ig SHM hotspots.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22808135 PMCID: PMC3395700 DOI: 10.1371/journal.pone.0040332
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Approach to detect aberrant SHM target genes genome-wide.
H3K4me3 ChIP-seq and SNV detection information of NB, GCB, OCI-Ly1, and OCI-Ly8 cells.
| Naïve B Cells | Germinal Center B Cells | OCI-Ly1 | OCI-Ly8 | |
| Uniquely AlignedNon-clonalReads | 25639608 | 25801467 | 16181704 | 3909119 |
| Total SNVs | 45567 | 40087 | 39740 | 23663 |
| High Confidence SNVs(% of Total) | 32538 (71.41%) | 26258 (65.50%) | 23152 (58.26%) | 14062 (59.43%) |
| Percentage of High Confidence SNVs in dbSNP132 | 74.97% | 67.70% | 86.93% | 80.26% |
| H3K4me3 EnrichedRefSeq Promoters | 20225 | 16418 | 19014 | 18351 |
| High ConfidencePromoter SNVs | 11390 | 6479 | 15475 | 8630 |
| Percentage of High Confidence PromoterSNVs in dbSNP132 | 93.64% | 94.35% | 91.58% | 92.97% |
| High Confidence Novel Promoter SNVs | 724 | 366 | 1314 | 607 |
| Genes Containing High ConfidenceNovel Promoter SNVs# | 529 | 236 | 841 | 436 |
Filtering by SNV ratio ≥0.33, quality score ≥90, and p≥0.05 for comparison of average quality score comparison between reads that have reference base and reads have putative SNV.
For the complete list of genes that contain high confidence novel promoter SNVs, please see supporting information.
Figure 2Novel promoter SNVs target important B cell genes in DLBCL.
iPAGE analysis using Gene Ontology database (#) [43] or a lymphoid specific gene set database (*) [44] showing strong enrichment of genes that are important for germinal center B cells and DLBCLs contain novel promoter SNVs in OCI-Ly1 and OCI-Ly8 cells.
SHM hotspot genes in NB, GCB, OCI-Ly1, and OCI-Ly8 cells.
| Cell Type | Gene Name | SNV Number |
| SNV in RGYW (Obs) | SNV in RGYW (Exp) | z-score RGYW | SNV in WA (Obs) | SNV in WA (Exp) | z-score WA | Ts/Tv |
|
|
| 63(31) | 0 | 6 | 4 |
| 1 | 3 | −1.17 | 36/27 |
|
| 6(6) | 1.3E−08 | 1 | 0 |
| 0 | 1 | −0.71 | 3/3 | |
|
| 5(4) | 6.5E−07 | 0 | 0 | −0.6 | 2 | 0 |
| 2/3 | |
|
| 3(2) | 0.00087 | 0 | 0 | −0.57 | 2 | 0 |
| 3/0 | |
|
| 3(3) | 0.00087 | 0 | 0 | −0.34 | 2 | 0 |
| 3/0 | |
|
| 3(2) | 0.00087 | 0 | 0 | −0.48 | 1 | 0 |
| 1/2 | |
|
|
| 35(13) | 0 | 0 | 2 | −1.38 | 8 | 4 |
| 24/11 |
|
| 5(2) | 3.7E−08 | 1 | 0 |
| 1 | 1 | 0.67 | 0/5 | |
|
| 3(3) | 0.00015 | 0 | 0 | −0.37 | 2 | 0 |
| 2/1 | |
|
| 3(3) | 0.00015 | 0 | 0 | −0.47 | 1 | 0 |
| 1/2 | |
|
| 3(2) | 0.00015 | 1 | 0 |
| 0 | 0 | −0.7 | 2/1 | |
|
| 3(3) | 0.00015 | 0 | 0 | −0.45 | 1 | 0 |
| 3/0 | |
|
|
| 87(44) | 0 | 10 | 4 |
| 31 | 11 |
| 58/29 |
|
| 46(42) | 0 | 5 | 2 |
| 17 | 6 |
| 33/13 | |
|
| 30(29) | 0 | 5 | 2 |
| 9 | 3 |
| 17/13 | |
|
| 15(15) | 0 | 5 | 1 |
| 2 | 1 |
| 12/3 | |
|
| 14(11) | 0 | 3 | 1 |
| 1 | 1 | 0.03 | 7/7 | |
|
| 13(12) | 0 | 2 | 1 |
| 3 | 1 |
| 9/4 | |
|
| 12(2) | 0 | 2 | 1 |
| 1 | 1 | 0.04 | 6/6 | |
|
| 11(11) | 0 | 2 | 1 |
| 6 | 2 |
| 6/5 | |
|
| 9(9) | 1E−12 | 2 | 1 |
| 2 | 1 |
| 5/4 | |
|
| 6(4) | 1.6E−07 | 0 | 0 | −0.62 | 2 | 1 |
| 3/3 | |
|
| 6(3) | 1.6E−07 | 1 | 0 |
| 3 | 1 |
| 4/2 | |
|
| 6(5) | 1.6E−07 | 0 | 0 | −0.5 | 3 | 1 |
| 4/2 | |
|
| 6(6) | 1.6E−07 | 0 | 0 | −0.69 | 4 | 1 |
| 2/4 | |
|
| 5(2) | 7.3E−06 | 0 | 0 | −0.53 | 2 | 0 |
| 1/4 | |
|
| 4(2) | 0.00022 | 0 | 0 | −0.42 | 1 | 0 |
| 4/0 | |
|
| 4(1) | 0.00022 | 0 | 0 | −0.41 | 1 | 0 |
| 2/2 | |
|
| 4(3) | 0.00022 | 0 | 0 | −0.41 | 1 | 0 |
| 2/2 | |
|
| 4(4) | 0.00022 | 0 | 0 | −0.45 | 1 | 0 |
| 3/1 | |
|
| 3(2) | 0.00423 | 1 | 0 |
| 1 | 0 |
| 1/2 | |
|
| 3(3) | 0.00423 | 0 | 0 | −0.42 | 1 | 0 |
| 2/1 | |
|
| 3(3) | 0.00423 | 0 | 0 | −0.51 | 2 | 0 |
| 2/1 | |
|
| 3(3) | 0.00423 | 2 | 0 |
| 1 | 0 | 0.85 | 0/3 | |
|
| 3(3) | 0.00423 | 0 | 0 | −0.41 | 2 | 0 |
| 1/2 | |
|
| 3(1) | 0.00423 | 0 | 0 | −0.42 | 1 | 0 |
| 2/1 | |
|
| 3(2) | 0.00423 | 0 | 0 | −0.42 | 1 | 0 |
| 3/0 | |
|
| 3(0) | 0.00423 | 2 | 0 |
| 0 | 0 | −0.55 | 1/2 | |
|
| 3(3) | 0.00423 | 0 | 0 | −0.41 | 1 | 0 |
| 1/2 | |
|
| 3(3) | 0.00423 | 1 | 0 |
| 0 | 0 | −0.58 | 0/3 | |
| Ly8 |
| 38(29) | 0 | 4 | 2 |
| 2 | 4 | −1.19 | 25/13 |
|
| 16(5) | 0 | 0 | 1 | −1.04 | 4 | 2 |
| 9/7 | |
|
| 6(4) | 8.6E−09 | 2 | 0 |
| 0 | 0 | −0.6 | 4/2 | |
|
| 6(5) | 8.6E−09 | 1 | 0 |
| 0 | 1 | −0.72 | 6/0 | |
|
| 6(6) | 8.6E−09 | 3 | 0 |
| 0 | 0 | −0.69 | 5/1 | |
|
| 4(4) | 2.7E−05 | 1 | 0 |
| 1 | 1 | 0.67 | 3/1 | |
|
| 4(4) | 2.7E−05 | 1 | 0 |
| 1 | 1 | 0.57 | 2/2 | |
|
| 4(4) | 2.7E−05 | 2 | 0 |
| 1 | 1 | 0.36 | 4/0 | |
|
| 4(2) | 2.7E−05 | 3 | 0 |
| 0 | 0 | −0.69 | 2/2 | |
|
| 3(1) | 0.00108 | 2 | 0 |
| 0 | 0 | −0.77 | 2/1 | |
|
| 3(2) | 0.00108 | 1 | 0 |
| 0 | 1 | −0.85 | 2/1 | |
|
| 3(3) | 0.00108 | 0 | 0 | −0.34 | 1 | 0 |
| 1/2 | |
|
| 3(3) | 0.00108 | 0 | 0 | −0.3 | 1 | 0 |
| 2/1 | |
|
| 3(3) | 0.00108 | 2 | 0 |
| 0 | 0 | −0.69 | 1/2 | |
|
| 3(3) | 0.00108 | 1 | 0 |
| 0 | 0 | −0.44 | 3/0 |
Column 2 lists the genes where non-Ig SHM hotspots were identified. Column 3 shows the novel SNVs identified at the 5′ gene regulatory region of these SHM hotspot genes, the numbers in the parentheses represent the numbers of novel SNVs downstream of TSS. The B-H corrected p values estimating the probability of observing the number of mismatches (column 3) by chance are listed in column 4. Column 5 shows the number of novel SNVs observed in the RGYW motifs, whereas column 6 shows the number of novel SNVs expected in the RGYW motifs given the local sequence context. The Z-score for enrichment of novel SNVs within the RGYW motifs of each non-Ig hotspot is listed in column 7. Column 8 shows the number of novel SNVs observed in WA motifs, whereas column 9 shows the number of novel SNVs expected in WA motifs given the local sequence context. The Z-score for enrichment of novel SNVs within the WA motifs of each non-Ig hotspot is listed in column 10. Column 11 shows the numbers of Transition (Ts) and Transversion (Tv) of novel SNVs in the 5′ regulatory region of each non-Ig hotspot gene.
Figure 3Detection of SHM.
(A). A snapshot of UCSC genome browser showing H3K4me3 ChIP-seq reads density at BACH2 promoter. The top three tracks represent SHMs, novel SNVs, and known SNVs (SNP132) detected in OCI-Ly1 by applying SHMseeqer to OCI-Ly1 H3K4me3 ChIP-seq short reads. (B). OCI-Ly1 H3K4me3 ChIP-seq short reads spanning chr6 position 91062534 (BACH2 intron 1) where a SHM was detected (shaded). (C). Sanger sequencing trace showing detection of both G (wild-type) and A (mutation) at chr6 position 91062534 in OCI-Ly1. (D). Overall validation rates of selected SNVs/SHMs within BACH2, BCL2, BCL6, BTG2, and MYO1E loci. N indicates number of SNVs/SHMs validated by Sanger sequencing in each locus.
Figure 4Aberrant SHMs affect promoter activity.
The effects of selected SHMs on BCL6 or BACH2 promoter activities were tested by dual luciferase assay in OCI-Ly1. Reporter activity of promoter bearing individual SHM was normalized to reporter activity of the wild-type promoter. Error bars indicate standard errors of three independent experiments.