| Literature DB >> 23267360 |
Yi Sun1, Zhancai Liu, Zhaoyong Li, Zhengxing Lian, Yaofeng Zhao.
Abstract
The VH replacement process is a RAG-mediated secondary recombination in which the variable region of a rearranged VHDJH is replaced by a different germline VH gene. In almost all human and mouse VH genes, two sequence features appear to be crucial for VH replacement. First, an embedded heptamer, which is located near the 3' end of the rearranged VH gene, serves as a cryptic recombination signal sequence (3'cRSS) for the VH replacement process. Second, a short stretch of nucleotides located downstream of the 3'cRSS serve as a footprint of the original VH region, frequently encoding charged amino acids. In this review, we show that both of these two features are conserved in the VH genes of all jawed vertebrates, which suggests that the VH replacement process may be a conserved mechanism.Entities:
Keywords: 3′cRSS; VH replacement; VH replacement footprint; charged amino acid; immunoglobulin; vertebrate
Year: 2012 PMID: 23267360 PMCID: PMC3526766 DOI: 10.3389/fimmu.2012.00392
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Frequency of the 3′cRSS heptamer in the functional germline VH genes of 12 species.
| Mouse | IGHV1 (J558) | 56.9 (29/51) | 94.1 (48/51) | 5.9 (3/51) | IMGT database |
| IGHV2 (Q52) | 100 (8/8) | 100 (8/8) | − | ||
| IGHV3 (36–60) | 100 (6/6) | 100 (6/6) | − | ||
| IGHV4 (X-24) | 100 (1/1) | 100 (1/1) | − | ||
| IGHV5 (7183) | 90 (9/10) | 90 (9/10) | 10 (1/10) | ||
| IGHV6 (J606) | − | − | 100 (5/5) | ||
| IGHV7 (S107) | 100 (3/3) | 100 (3/3) | − | ||
| IGHV8 (3609) | 100 (6/6) | 100 (6/6) | − | ||
| IGHV9 (VGAM3-8) | − | 100 (4/4) | − | ||
| IGHV10 (VH10) | 100 (2/2) | 100 (2/2) | − | ||
| IGHV11 (CP3) | − | − | 100 (2/2) | ||
| IGHV12 (CH27) | 100 (1/1) | 100 (1/1) | − | ||
| IGHV13 (3609N) | 50 (1/2) | 50 (1/2) | 50 (1/2) | ||
| IGHV14 (SM7) | 50 (2/4) | 50 (2/4) | 50 (2/4) | ||
| IGHV15 (VH15A) | 100 (1/1) | 100 (1/1) | − | ||
| IGHV16 | − | 100 (1/1) | − | ||
| Norway rat ( | IGHV1 | 41.7 (10/24) | 87.5 (21/24) | 12.5 (3/24) | IMGT database |
| IGHV2 | 28.1 (9/32) | 59.4 (19/32) | 40.6 (13/32) | ||
| IGHV3 | 100 (4/4) | 100 (4/4) | − | ||
| IGHV4 | 50 (1/2) | 50 (1/2) | 50 (1/2) | ||
| IGHV5 | 71.4 (15/21) | 80.9 (17/21) | 19.1 (4/21) | ||
| IGHV6 | 12.5 (1/8) | 12.5 (1/8) | 87.5 (7/8) | ||
| IGHV7 | 66.7 (4/6) | 66.7 (4/6) | 33.3 (2/6) | ||
| IGHV8 | 87.5 (7/8) | 87.5 (7/8) | 12.5 (1/8) | ||
| IGHV9 | − | 75 (3/4) | 25 (1/4) | ||
| IGHV10 | − | − | 100 (2/2) | ||
| IGHV11 | 40 (2/5) | 40 (2/5) | 60 (3/5) | ||
| IGHV12 | − | − | 100 (1/1) | ||
| Guinea pig ( | VH1 | 90.4 (19/21) | 95.2 (20/21) | 4.8 (1/21) | Guo et al., |
| VH2 | 93.8 (15/16) | 100 (16/16) | − | ||
| VH3 | 78.8 (41/52) | 84.6 (44/52) | 15.4 (8/52) | ||
| Rabbit | IGHV1 | − | 100 (12/12) | − | Ros et al., |
| − | − | ||||
| African elephant ( | VH1 | 66.7 (2/3) | 66.7 (2/3) | 33.3 (1/3) | Guo et al., |
| VH2 | 100 (2/2) | 100 (2/2) | − | ||
| VH3 | 83.3 (5/6) | 83.3 (5/6) | 16.7 (1/6) | ||
| VH4 | 85.3 (29/34) | 88.2 (30/34) | 11.8 (4/34) | ||
| VH5 | 100 (2/2) | 100 (2/2) | − | ||
| VH7 | 100 (1/1) | 100 (1/1) | − | ||
| Gray short-tailed opossum ( | VH1 | 94.7 (18/19) | 94.7 (18/19) | 5.3 (1/19) | Wang et al., |
| VH2 | − | 100 (1/1) | − | ||
| VH3 | 100 (1/1) | 100 (1/1) | − | ||
| Painted turtle | VH1 | 89.7 (26/29) | 96.6 (28/29) | 3.4 (1/29) | Ensembl database |
| VH2 | 10 (1/10) | 10 (1/10) | 90 (9/10) | ||
| VH3 | 100 (2/2) | 100 (2/2) | − | ||
| VH4 | − | − | 100 (1/1) | ||
| VH5 | 100 (3/3) | 100 (3/3) | − | ||
| VH6 | − | − | 100 (2/2) | ||
| VH7 | 50 (3/6) | 50 (3/6) | 50 (3/6) | ||
| VH8 | 100 (1/1) | 100 (1/1) | − | ||
| VH9 | 75 (3/4) | 75 (3/4) | 25 (1/4) | ||
| VH10 | 100 (9/9) | 100 (9/9) | − | ||
| VH11 | − | 100 (1/1) | − | ||
| Anole lizard | Group I | 46.7 (7/15) | 86.7 (13/15) | 13.3 (2/15) | Ensembl database |
| Group II | 35.3 (6/17) | 76.5 (13/17) | 23.5 (4/17) | ||
| Group III | 53.3 (8/15) | 80 (12/15) | 20 (3/15) | ||
| Group IV | 65 (13/20) | 100 (20/20) | − | ||
| Ac VH38265 | 100 (1/1) | 100 (1/1) | − | ||
| Ac VH338184 | 100 (1/1) | 100 (1/1) | − | ||
| Ac VH377057 | − | − | 100 (1/1) | ||
| Ac VH405628 | − | − | 100 (1/1) | ||
| Western clawed frog | VH1 | 90.9 (10/11) | 100 (11/11) | − | Ensembl database |
| VH2 | 83.3 (5/6) | 83.3 (5/6) | 16.7 (1/6) | ||
| VH3 | − | 20 (1/5) | 80 (4/5) | ||
| VH4 | 50 (1/2) | 100 (2/2) | − | ||
| VH5 | 100 (3/3) | 100 (3/3) | − | ||
| VH6 | 100 (1/1) | 100 (1/1) | − | ||
| VH8 | 100 (6/6) | 100 (6/6) | − | ||
| VH9 | − | 100 (1/1) | − | ||
| VH10 | 100 (1/1) | 100 (1/1) | − | ||
| VH11 | 100 (2/2) | 100 (2/2) | − | ||
| Zebrafish ( | IGHV1 | 100 (4/4) | 100 (4/4) | − | Danilova et al., |
| IGHV2 | 100 (3/3) | 100 (3/3) | − | ||
| IGHV3 | − | 100 (1/1) | − | ||
| IGHV4 | 85.7 (6/7) | 85.7 (6/7) | 14.3 (1/7) | ||
| IGHV5 | − | 66.7 (2/3) | 33.3 (1/3) | ||
| IGHV6 | 100 (1/1) | 100 (1/1) | − | ||
| IGHV7 | 100 (1/1) | 100 (1/1) | − | ||
| IGHV8 | 75 (3/4) | 100 (4/4) | − | ||
| IGHV9 | 50 (2/4) | 50 (2/4) | 50 (2/4) | ||
| IGHV10 | − | 100 (1/1) | − | ||
| IGHV11 | 100 (2/2) | 100 (2/2) | − | ||
| IGHV13 | − | 100 (1/1) | − | ||
| IGHV14 | 100 (1/1) | 100 (1/1) | − | ||
| Atlantic salmon ( | IGHV1 | 63.6 (7/11) | 100 (11/11) | − | Yasuike et al., |
| IGHV2 | 100 (3/3) | 100 (3/3) | − | ||
| IGHV3 | − | 100 (1/1) | − | ||
| IGHV4 | − | 80 (4/5) | 20 (1/5) | ||
| IGHV6 | 12.5 (1/8) | 100 (8/8) | − | ||
| IGHV7 | 100 (2/2) | 100 (2/2) | − | ||
| IGHV8 | 100 (10/10) | 100 (10/10) | − | ||
| IGHV9 | − | 100 (1/1) | − | ||
| IGHV10 | 100 (1/1) | 100 (1/1) | − | ||
| IGHV12 | 100 (1/1) | 100 (1/1) | − | ||
| IGHV15 | 100 (2/2) | 100 (2/2) | − | ||
| IGHV16 | 100 (4/4) | 100 (4/4) | − | ||
| IGHV17 | − | 100 (1/1) | − | ||
| Channel catfish ( | VH1 | 100 (2/2) | 100 (2/2) | − | Bengten et al., |
| VH2 | − | − | 100 (1/1) | ||
| VH3 | 20 (1/5) | 20 (1/5) | 80 (4/5) | ||
| VH5 | − | − | 100 (1/1) | ||
| VH6 | − | − | 100 (2/2) | ||
| VH7 | − | − | 100 (5/5) | ||
| VH9 | 100 (1/1) | 100 (1/1) | − | ||
| VH11 | − | 50 (1/2) | 50 (1/2) | ||
| VH12 | − | − | 100 (2/2) | ||
| VH14 | 100 (1/1) | 100 (1/1) | − | ||
The denominator inside the parentheses represents the total number of functional VH genes in the corresponding VH family, and the numerator represents the number of functional VH genes that contain the specified type of heptamer in the corresponding VH family. If a VH gene had more than one allele, only the 01 allele was used in the analysis of the 3′cRSS.
All of the functional VH genes were obtained from the VH repertoire of the C57BL/6 strain (imgt.org/IMGTrepertoire).
The functional germline VH genes of the painted turtle were identified from the genome scaffolds JH585110, JH585065, and JH585278 (pre.ensembl.org/Chrysemys_picta_bellii).
The functional germline VH genes of the anole lizard were identified from the genome scaffold GL343491 (ensembl.org/Anolis_carolinensis). The groups of the anole lizard VH genes are classified according to the phylogenetic study by Gambon Deza et al. (2009). Ac VH338184, Ac VH377057, and Ac VH405628 are three functional germline VH genes that are not included in the following phylogenetic analysis.
The functional germline VH genes of the western clawed frog were identified from the genome scaffolds GL173803, GL173909, and GL173564 (ensembl.org/Xenopus_tropicalis). Two VH genes from VH4 family are not included in the following phylogenetic analysis.
Figure 1Phylogenetic analysis of the VH genes from 12 species. Phylogenetic trees were constructed using MrBayes3.1.2 and viewed in TreeView. Multiple DNA sequence alignments for the tree construction were performed using ClustalX2. Only the FR1-3 regions (as defined by the IMGT numbering system) of each sequence were utilized to construct the trees. Each VH family is represented with one sequence per species, which was chosen among the functional VH genes of the species. The optional VH genes of most species are named according to the nomenclature from the IMGT database (imgt.cines.fr) or other references. The only exceptions were the anole lizard, painted turtle, and western clawed frog, for which the VH genes were named using our own annotation. The species abbreviations used are as follows: Mm, mouse (Mus musculus); Rn, Norway rat (Rattus norvegicus); Cap, guinea pig (Cavia porcellus); Oc, rabbit (Oryctolagus cuniculus); La, African elephant (Loxodonta Africana); Md, gray short-tailed opossum (Monodelphis domestica); Cp, painted turtle (Chrysemys picta); Ac, anole lizard (Anolis carolinensis); Xt, western clawed frog (Xenopus tropicalis); Dr, zebrafish (Danio rerio); Ss, Atlantic salmon (Salmo salar); Ip, channel catfish (Ictalurus punctatus).
Frequency of the 3′cRSS heptamer in the functional germline VH genes of the eight groups.
| Group A | 56.9 (66/116) | 90.5 (105/116) | |
| Group B | 67.2 (127/189) | 78.3 (148/189) | |
| Group C | 67.3 (138/205) | 79.5 (163/205) | |
| Group D | 55.6 (10/18) | 88.9 (16/18) | |
| Group E | 68.5 (50/73) | 87.7 (64/73) | |
| Group F | 94.1 (16/17) | 94.1 (16/17) | |
| Group G | 36.7 (11/30) | 76.7 (23/30) | |
| Group H | 39.1 (9/23) | 69.6 (16/23) | |
Figure 2Charged amino acids encoded by the nucleotide sequence following the 3′cRSS of the functional germline VH genes. (A) Two examples of amino acids that are encoded by the nucleotide sequence following the 3′cRSS of the functional germline VH genes in three reading frames. The typical length of the nucleotide sequence following the 3′cRSS is 7 nt in tetrapods (above) and 9 nt in teleosts (below). The 3′cRSSs in the two sequences are shown in gray. (B) Average frequency of the charged amino acids encoded by the nucleotide sequence following the 3′cRSS of the functional germline VH genes in three reading frames and by all functional DH germline genes in 11 species. All functional germline DH sequences are available from the IMGT database (for Norway rat), Ensembl genome database (for painted turtle, genome scaffold JH584564), and other references (Ros et al., 2004; Danilova et al., 2005; Bengten et al., 2006; Zhao et al., 2006; Wang et al., 2009; Wei et al., 2009; Yasuike et al., 2010; Guo et al., 2011, 2012). For each DH gene, the reading frame which encodes the highest percentage of the charged amino acids are chosen to calculate the average percentage of the charged amino acids encoded by DH genes in certain species; thus, the actual average percentage of the charged amino acids encoded by the DH genes in certain species is much lower than the value showed in the figure.