| Literature DB >> 23469094 |
Lin Huang1, Miles D Lange, Yangsheng Yu, Song Li, Kaihong Su, Zhixin Zhang.
Abstract
VH replacement occurs through RAG-mediated recombination between the cryptic recombination signal sequence (cRSS) near the 3' end of a rearranged VH gene and the 23-bp RSS from an upstream unrearranged VH gene. Due to the location of the cRSS, VH replacement leaves a short stretch of nucleotides from the previously rearranged VH gene at the newly formed V-D junction, which can be used as a marker to identify VH replacement products. To determine the contribution of VH replacement products to mouse antibody repertoire, we developed a Java-based VH Replacement Footprint Analyzer (VHRFA) program and analyzed 17,179 mouse IgH gene sequences from the NCBI database to identify VH replacement products. The overall frequency of VH replacement products in these IgH genes is 5.29% based on the identification of pentameric VH replacement footprints at their V-D junctions. The identified VH replacement products are distributed similarly in IgH genes using most families of VH genes, although different families of VH genes are used differentially. The frequencies of VH replacement products are significantly elevated in IgH genes derived from several strains of autoimmune prone mice and in IgH genes encoding autoantibodies. Moreover, the identified VH replacement footprints in IgH genes from autoimmune prone mice or IgH genes encoding autoantibodies preferentially encode positively charged amino acids. These results revealed a significant contribution of VH replacement products to the diversification of antibody repertoire and potentially, to the generation of autoantibodies in mice.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23469094 PMCID: PMC3585286 DOI: 10.1371/journal.pone.0057877
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Immunoglobulin VH, DH, and JH gene usages in the mouse IgH sequence repertoire.
The mouse IgH gene sequence data set containing 17,179 entries was downloaded from NCBI databases. The potential VH, DH, and JH germline gene assignments were performed using the IMGT/V-QUEST program by sending batches of sequences by the VHRFA program. Clonally redundant IgH sequences were removed if they contain identical CDR3 regions. The usages of different families of VH germline genes (A), DH genes (B), and JH (C) genes in the functional or non-functional unique IgH genes were analyzed.
Frequencies of VH replacement footprint motifs with different length in the N1 and N2 regions of the mouse IgH genes.
| Number of unique Sequences | Number of Sequences with V, D, J genes | Minimal Length of VH replacement footprint | VH replacement footprint motifs in the N1 region | VH replacement footprint motifs in the N2 region |
| Frequency of VH replacement products (%) | |
| Test IgH genes | 271 | 252 | 3 | 101 | 65 | 0.0001 | 40.1 |
| 4 | 55 | 23 | 0.0001 | 21.8 | |||
| 5 | 14 | 4 | 0.0308 | 5.5 | |||
| 6 | 2 | 0 | 0.4786 | 0.79 | |||
| 7 | 1 | 0 | 0.3168 | 0.39 | |||
| NCBI IgH genes | 11309 | 10159 | 3 | 3384 | 2622 | 0.0001 | 33.55 |
| 4 | 1609 | 979 | 0.0001 | 15.95 | |||
| 5 | 534 | 256 | 0.0001 | 5.29 | |||
| 6 | 179 | 50 | 0.0001 | 1.77 | |||
| 7 | 45 | 8 | 0.0001 | 0.45 |
Unique sequences were identified after removal of IgH sequences with identical CDR3 regions.
Total number of IgH gene sequences with clearly identifiable VH, DH, and JH genes.
N1 region refers to the V-D junction.
N2 region refers to the D-J junction.
The frequencies of potential VH replacement footprint motifs in the N1 and N2 regions were compared by two-tailed Chi-square with Yate’s correction. p<0.05 was considered significant and p<0.0001is considered extremely significant.
Numbers of IgH gene sequences with VH replacement “footprint” motifs in the N1 regions were divided by the total number of IgH gene sequences with V, D, J gene assignment.
Mouse IgH gene sequences were previously described.
The mouse IgH gene sequences were downloaded from the NCBI database on May 7, 2011. The GI numbers of these sequences were included in Table S1.
List of potential VH replacement products in the test IgH sequences.
| Sequence ID | VH gene | 3′ VH | P | N1 | DH | Potential footprint donor | Position |
| FJ816520 | IGHV1S132 | tgtgcaaga | gg | IGHD2-14 | IHGV8-10, IGHV8-14, IGHV8S2 | Y | |
| FJ150867 | IGHV14-3 | tgtgcaaga | gg | IGHD1-1 | IGHV3-3, IGHV10-3, IGHV13-1 | Y | |
| FJ150854 | IGHV1S132 | tgtgcaaga | gc | IGHD2-12 | IGHV7-1 | Y | |
| GU907018 | IGHV1-9 | tgtgccaga | g | IGHD1-1 | IGHV8-10, IGHV8-14, IGHV8S2 | Y | |
| FJ816537 | IGHV1-74 | tgtgcaa |
| IGHD2-12 | IGHV3-3, IGHV10-3, IGHV13-1 | Y | |
| FJ816495 | IGHV1-47 | tgtgcaagg |
| IGHD1-1 | IGHV3-3, IGHV10-3, IGHV13-1 | Y | |
| GU907010 | IGHV1-5 | tgtacaaga |
| IGHD2-1 | IGHV10-1, IGHV12-3 | Y | |
| GU907038 | IGHV1-4 | tgtgcaaga | tc |
| IGHD2-3 | IGHV3-1 | Y |
| FJ816546 | IGHV1-4 | tgtgcaag |
| IGHD1-1 | IGHV8-12, IGHV1-11, IGHV12-3 | Y | |
| FJ816592 | IGHV14-1 | tgtgc |
| IGHD2-14 | IGHV2-6-7 | Y | |
| FJ816442 | IGHV14-1 | tgtgcta |
| IGHD1-1 | IGHV2-3, IGHV2-6-6 | Y | |
| FJ816522 | IGHV2-9-1 | tgtgccagaga | tc | ggg | IGHD2-14 | IGHV7-3 | N |
| GU906999 | IGHV14-3 | tgtgctaga | g | IGHD1-1 | IGHV8-10, IGHV8-14, IGHV8S2 | N | |
| GU906995 | IGHV14-3 | tgtgctgga | g | IGHD1-1 | IGHV8-10, IGHV8-14, IGHV8S2 | N |
The identified VH replacement footprints in the N1 regions are underlined.
The relative positions of the potential donors and recipient VH genes in the identified VH replacement product were analyzed to determine if the VH replacement occurred through an upstream VH gene replacing a downstream VH gene (Y) or a downstream VH gene replacing an upstream gene (N). Only functional VH germline genes were used in this analysis.
Frequencies of VH replacement products in IgH genes using different families of mouse VH genes.
| VH family | Number of IgH gene sequences | Motifs in the N1 region | Frequency of VH replacement products (%) |
| VH1/J558 | 6530 | 314 | 4.81 |
| VH2/Q52 | 665 | 55 | 8.27c |
| VH3/36-60 | 565 | 30 | 5.31 |
| VH4/X-24 | 57 | 3 | 5.26 |
| VH5/7183 | 998 | 68 | 6.81 |
| VH6/J606 | 131 | 6 | 4.58 |
| VH7S107 | 253 | 8 | 3.16 |
| VH8/3609 | 139 | 9 | 6.47 |
| VH9/VGAM3-8 | 144 | 11 | 7.64 |
| VH10/VH10 | 127 | 4 | 3.15 |
| VH11/CP3 | 37 | 0 | 0 |
| VH12/CH27 | 43 | 3 | 6.98 |
| VH13/3609N | 7 | 1 | 14.29 |
| VH14/SM7 | 459 | 26 | 5.66 |
| VH15/VH15A | 4 | 0 | 0 |
| VH5-2/7183.2 | 56 | 0 | 0 |
Number of IgH gene sequences with VH replacement “footprint” motifs in the N1 regions divided by the total number of IgH gene sequences assigned to a VH gene family.
Functional IgH genes using the VH5-2/7183.2 gene were analyzed for potential VH replacement footprints in the N1 regions.
The frequency of VH replacement products using VH2/Q52 family of VH genes is significantly higher than the overall frequency of VH replacement products in mouse IgH genes.
Figure 2Enrichment of VH replacement products in IgH genes derived from different strains of autoimmune prone mice and IgH genes encoding autoantibodies.
The frequencies of VH replacement products in IgH genes derived from different strains of mice were analyzed using the VHRFA program based on the keyword linked to each IgH gene in the NCBI database. VH replacement products were assigned based on the identification of (A) 5-mer VH replacement footprints, (B) 4-mer VH replacement footprints, or (C) 3-mer VH replacement footprints within the VH-DH junctions (N1 regions). The frequencies of VH replacement products in different subcategories were compared with that in the BALB/c mice. n, number of IgH sequences in each subcategory. Statistical significance was determined using a two-tailed Chi square test with Yate’s correction. p<0.05 (*) is considered significant and p<0.0001 (**) is considered extremely significant. The detailed sequence analysis and the identified VH replacement products with 5-mer VH replacement footprints correlating with keywords are included in Table S6.
Figure 3VH replacement footprints preferentially contribute charged amino acids to the CDR3 regions.
(A) The frequencies of charged amino acids encoded by the identified pentameric VH replacement footprints or the N1 regions of non-VH replacement products were compared. Detailed amino acid sequences of the IgH CDR3 regions are listed in Table S6. (B) The frequencies of individual amino acid encoded by the identified VH replacement footprints or the N1 regions of non-VH replacement products were compared. n, amino acids encoded by the identified VH replacement footprints or the N1 regions of non-VH replacement products. (C) The frequencies of individual amino acid encoded by the 3′ end of VH germline genes and DH regions were compared. n, amino acids encoded by the VH gene 3′ ends or DH regions. (D) Usages of different amino acids encoded by the identified VH replacement footprints in functional VH replacement products and non-functional VH replacement products. n, amino acids encoded by the identified VH replacement footprints. Statistical significance was determined using a two-tailed Chi square test with Yate’s correction. n, number of amino acid residues encoded by indicated sequences. p<0.05 (*) is considered significant and p<0.0001 (**) is considered extremely significant.
Figure 4Comparison of the amino acids encoded by VH replacement footprints and the IgH CDR3 lengths of VH replacement products.
(A) The usages of different amino acids encoded by VH replacement footprints with 5, 4, or 3 nucleotides were compared. n, number of amino acid residues encoded by the identified VH replacement footprints with different lengths. Statistical significance was determined using a two-tailed Chi square test with Yate’s correction. p<0.05 (*) is considered significant and p<0.0001 (**) is considered extremely significant. (B) Comparison of the IgH CDR3 lengths of VH replacement products containing the 5-mer or the 3-mer VH replacement products with the CDR3 length of the total functional IgH genes. n, number of IgH sequences or VH replacement products with 3- or 5-mer VH replacement footprints. Statistical significance was determined using unpaired t test. p<0.05 (*) is considered significant and p<0.0001 (**) is considered extremely significant.
Figure 5The enriched VH replacement products identified in different strains of autoimmune prone mice or IgH genes encoding autoantibodies have been positively selected during autoimmune responses.
(A) Analysis of the frequencies of positively charged versus negatively charged amino acids encoded by the 3′ end VH genes and the identified VH replacement footprints from different strains of mice or IgH genes encoding autoantibodies. Statistical significance was determined using a two-tailed Chi square test with Yate’s correction. p<0.05 (*) is considered significant. (B) Comparison of the amino acids encoded by the identified VH replacement footprints in MRL/lpr mice. n, numbers of amino acids encoded by the identified VH replacement footprints. (C) Mutation status analysis of identified VH replacement products and non-VH replacement products from different subgroups of IgH genes.