| Literature DB >> 31781122 |
Justyna Mika1, Sylwia Kabacik2, Christophe Badie2, Joanna Polanska1, Serge M Candéias3.
Abstract
The genes coding for the antigenic T cell receptor (TR) subunits are assembled in thymocytes from discrete V, D, and J genes by a site-specific recombination process. A tight control of this activity is required to prevent potentially detrimental recombination events. V, D, and J genes are flanked by semi-conserved nucleotide motives called recombination signal sequences (RSSs). V(D)J recombination is initiated by the precise introduction of a DNA double-strand break exactly at the border of the genes and their RSSs by the RAG recombinase. RSSs are therefore physically separated from the coding region of the genes before assembly of a rearranged TR gene. During a high throughput profiling of TRB genes in mice, we identified rearranged TRB genes in which part or all of a flanking RSS was retained in V-D or D-J coding joints. In some instances, this retention of germline DNA resulted from the use of an upstream alternative RSS. However, we also identified TRB sequences where retention of germline DNA occurred in the absence of alternative RSS, suggesting that RAG activity was mis-targeted during recombination. Similar events were also identified in human rearranged TRB and TRG genes. The use of alternative RSSs during V(D)J recombination illustrates the complexity of RAG-RSSs interactions during V(D)J recombination. While the frequency of errors resulting from mis-targeted RAG activity is very low, we believe that these RAG errors may be at the origin of oncogenic translocations and are a threat for genetic stability in developing lymphocytes.Entities:
Keywords: RSS; T cell receptor genes; V(D)J recombination; errors; lymphoid neoplasms; translocations
Mesh:
Substances:
Year: 2019 PMID: 31781122 PMCID: PMC6857471 DOI: 10.3389/fimmu.2019.02637
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1TRBJ1-7 is used in murine TRB coding joints through rearrangement of a new RSS. (A) Examples of murine rearranged TRB genes using TRBJ1-7. The TRBV gene used in each rearrangement is indicated on the left, and its sequence is shown from the Cys codon (TGT) until the V-D junction. The eventual P nucleotides are indicated in bold at the end of the V gene sequence. In the N/P-TRBD-P/N column, the TRBD gene is capital letters. The eventual P nucleotides are indicated in bold at the ends of the TRBD gene sequence. N nucleotides are in lowercase letters. The P/GL DNA column shows a stretch of nucleotides originally classified as N nucleotides, identical to germline DNA, with the eventual P nucleotides, in bold (see text for details). The 6 first nucleotides of the TRBJ1-7 gene are shown on the left column. The sequences in the top section are a random selection of TRB gene rearrangements having 30 to 39 nt of homology with germline DNA upstream of TRBJ1-7. The lower section shows four sequences in which the homology extends for 50–68 base upstream of TRBJ1-7. The complete collection of sequences is shown in Supplementary Table 1. (B) A new RSS located 40 bp upstream of the TRBJ1-7 gene, in the TRBJ1-6 gene sequence. The sequence of the TRBJ1 region encompassing TRBJ1-6 and TRBJ1-7 is shown. The boundaries of the genes are indicated by the dashed lines below the sequence. The open arrowhead above the sequence indicates the border of the TRBJ1-7 RSS described in the literature, separated from TRBJ1-7 coding sequence by a dash. Its heptamer and nonamer are underlined. The heptamer of the TRBJ1-6 gene is also underlined. The black arrowhead at position −40 indicates the position of the new RSS characterized in this study, embedded in the sequence of the TRBJ1-6 gene. Its heptamer and nonamer are shaded.
Figure 2Rearrangement of the murine TRBJ1-2 through a new RSS. (A) Sequence of rearranged TRB genes using TRBJ1-2 through a new upstream RSS. The TRBV gene used in each rearrangement is indicated on the left, and its sequence is shown from the Cys codon (TGT) until the V-D junction. The eventual P nucleotides are indicated in bold at the end of the V gene sequence. In the N/P-TRBD-P/N column, the TRBD gene is capital letters. The eventual P nucleotides are indicated in bold at the ends of the TRBD gene sequence. N nucleotides are in lowercase letters. The P/GL DNA column shows a stretch of nucleotides originally classified as N nucleotides, identical to germline DNA, with the eventual P nucleotides, in bold (see text for details). The 6 first nucleotides of the TRBJ1-2 gene are shown on the left column. (B) A new RSS located 31 bp upstream of the TRBJ1-2 gene. The boundaries of the TRBJ1-2 gene are indicated by the dashed lines below the sequence. The open arrowhead above the sequence indicates the border of the TRBJ1-2 RSS described in the literature, separated from TRBJ1-2 coding sequence by a dash. Its heptamer and nonamer are underlined. The black arrowhead at position −31 indicates the position of the new RSS characterized in this study. Its heptamer and nonamer are shaded.
Figure 3Rearranged TRB genes using TRBJ1-5, TRBJ2-5, and TRBJ2-6 including germline DNA in their D-J coding joint. (A) Sequence of these genes. The TRBV gene used in each rearrangement is indicated on the left, and its sequence is shown from the Cys codon (TGT) until the V-D junction. In the N/P-TRBD-P/N column, the TRBD gene is capital letters. The eventual P nucleotides are indicated in bold at the ends of the TRBD gene sequence. N nucleotides are in lowercase letters. The GL DNA column shows a stretch of nucleotides originally classified as N nucleotides, identical to germline DNA. The 6 first nucleotides of the TRBJ genes are shown on the left column. (B) A putative cryptic RSS located 47 bp upstream of the TRBJ2-6 gene. The open arrowhead above the sequence indicates the border of the TRBJ2-6 RSS described in the literature, separated from TRBJ2-6 coding sequence by a dash. Its heptamer and nonamer are underlined. The black arrowhead at position −47 indicates the position of the putative cryptic RSS used in the three rearranged TRB genes shown in (A). Its heptamer and nonamer are shaded.
Figure 4Rearranged TRB genes including germline DNA in a coding joint in CBA/Ca peripheral blood lymphocytes. The first column indicates the V and D or V and J involved in the coding joint including germline DNA retention. The gene for which GL DNA retention is observed is indicated in the second column, and the third column indicates the length (L), in nucleotides, of this stretch of GL DNA. The sequence of the coding joint is given in the last column, aligned on the left when GL DNA retention is observed from a RSS located downstream of the rearranged gene (V), and on the right when it is observed from a RSS located upstream of the rearranged gene (5′D, J). The sequence of the V-D and D-J coding joints are shown in the last column, anchored on the six last (for V) or first (for D and J) nucleotides of the genes. The stretch of GL DNA in each coding joint is underlined and the heptamer and, when present, the nonamer of the gene's RSS are overlaid in gray. When present, N nucleotides are shown in lower case letters and P nucleotides in bold letters. The D1/J1-5 and D2/J2-5 rearrangements are those reported in Figure 3. Underlined N nucleotides are identical to the sequence of TRBD genes.
Figure 5Rearranged TRB including germline DNA in a coding joint in C57BL/6 and Balb/c mice. The first column indicates the gene for which GL DNA retention is observed. The second column indicates the length (L), in nucleotides, of this stretch of GL DNA, and the third the origin of the TRB gene (C, C57BL/6; B, Balb/c; T, thymus; S, spleen; 1 or 2, the ID of the mouse). The sequence of the rearranged TR gene or of the V-D and D-J coding joints is given in the last column, aligned on the left when GL DNA retention is observed from a RSS located downstream of the rearranged gene (V, 3′D), and on the right when it is observed from a RSS located upstream of the rearranged gene (5′D, J). The sequence are anchored on the six last (for V, 3′D) or first (for 5′D and J) nucleotides of the genes. The heptamer of the RSS and, when present, the nomaner, are shaded in gray in the stretch of GL DNA in each coding joint. When present, N nucleotides are shown in lower case letters and P nucleotides in bold letters. Underlined N nucleotides are identical to the sequence of the TRBD1 gene.
Figure 6Distribution of germline DNA retention length at genes with RSS-12 and RSS-23 in murine TRB coding joints. These violin plots represent the distribution and median value of GL DNA retention length at coding ends in rearranged TRB genes in Balb/c and C57BL/6 mice. The median value is higher at genes with a RSS-23 (p = 4.0*10−3 by a U-Mann-Whitney test). Dots represent outlying values.
Summary of GL DNA retention in human TRB genes.
| HIP01181 | 102017 | 2 | TRBJ1-3 (22), TRBJ1-6 (25) |
| HIP05941 | 267329 | 4 | TRBD1 (11,11,17), TRBJ2-1 (21) |
| HIP13244 | 234627 | 2 | TRBD1 (14,12) |
| HIP13309 | 429849 | 5 | TRBV28-1 (11), TRBD1 (27,16,11), TRBD2 (14) |
| HIP13350 | 245731 | 3 | TRBD1 (13,12), TRBD2 (14) |
| HIP13376 | 411273 | 9 | TRBD1 (18,17,15,14,11,11), TRBJ1-6 (32), TRBJ2-4 (19,14) |
| HIP13511 | 292766 | 8 | TRBD1 (17,15,12,11,11), TRBD2 (12), TRBJ1-6 (28), TRBJ2-7 (12) |
| HIP13741 | 378826 | 3 | TRBD1 (13), TRBD2 (28), TRBJ2-3 (11) |
| HIP13749 | 289975 | 5 | TRBV28.1 (10,10), TRBD1 (19,13,11) |
| HIP13769 | 236598 | 1 | TRBD1 (14) |
| HIP13803 | 277626 | 5 | TRBD1 (23), TRBJ1-2 (22), TRBJ2-4 (37,17), TRBJ2-6 (11) |
| HIP13806 | 157436 | 3 | TRBV28-1 (10), TRBD1 (17), TRBJ1-6 (28) |
| HIP13823 | 190464 | 2 | TRBD1 (20,15) |
| HIP13831 | 149119 | 1 | TRBD1 (13) |
| HIP13939 | 267205 | 5 | TRBD1 (32,11), TRBD2 (14,10), TRBJ1-5 (18) |
| HIP13951 | 409325 | 7 | TRBV11-3 (8), TRBD1 (18,14,11), TRBD2 (15), TRBJ1-5 (20), TRBJ2-4 (11) |
| HIP13958 | 306691 | 3 | TRBD1 (16,11), TRBD2 (19) |
| HIP13967 | 220861 | 5 | TRBD1 (18,13,11), TRBJ2-1 (21), TRBJ2-7 (12) |
| HIP13975 | 363702 | 3 | TRBD1 (14), TRBJ1.6 (28,19) |
| HIP13981 | 213139 | 1 | TRBD1 (16) |
| HIP13992 | 105882 | 0 | |
| HIP14064 | 310173 | 1 | TRBD2 (10) |
| HIP14121 | 258248 | 7 | TRBV11.3 (14), TRBV28-1 (10), TRBD1 (32,17,14,12), TRBJ2-6 (28) |
| HIP14152 | 82020 | 0 | |
| HIP14209 | 174386 | 2 | TRBD1 (26,11) |
| HIP14213 | 260624 | 6 | TRBD1 (16,15,14,11), TRBJ1-5 (20,20) |
| All | 6635892 | 93 | Average nb of GL nucleotide/seq: 16.33 |
Summary of GL DNA retention in human TRG genes (except TRGVB).
| HIP01181 | 127569 | 2 | TRGV2-1 (23), TRGV8-1 (25) |
| HIP05941 | 325934 | 5 | TRGV6-1 (21), TRGV11-1 (29,27,20,19) |
| HIP13244 | 274413 | 4 | TRGV2-1 (22), TRGV11-1 (29,29,27) |
| HIP13309 | 412483 | 9 | TRGV7-1 (21), TRGV8-1 (23,21), TRGV11-1 (29,29,29,26,26,23) |
| HIP13376 | 406349 | 3 | TRGV11-1 (26,26,21) |
| HIP13511 | 327805 | 4 | TCRGV2-1 (26), TRGV10-1 (23), TRGV11-1 (29,22) |
| HIP13741 | 426644 | 3 | TRGV8-1 (21), TRGV11-1 (29,27) |
| HIP13769 | 272913 | 4 | TRGV2-1 (21,14), TRGV11-1 (29,19) |
| HIP13803 | 329253 | 1 | TRGV8-1 (21) |
| HIP13806 | 212234 | 1 | TRGV11-1 (26) |
| HIP13823 | 219204 | 2 | TRGV10-1 (24), TRGV11-1 (26) |
| HIP13831 | 172321 | 2 | TRVG2-1 (23), TRGV11-1 (29) |
| HIP13939 | 311247 | 2 | TRGV5P-1 (23), TRGV11-1 (21) |
| HIP13951 | 444612 | 3 | TRGV8-1 (17), TRGV11-1 (29,25) |
| HIP13958 | 346863 | 2 | TRGV2-1 (20), TRGV11-1 (29) |
| HIP13975 | 677 | Outlier, not considered | |
| HIP13981 | 252444 | 2 | TRGV11-1 (30), TRGJP1 (14) |
| HIP13992 | 127238 | 3 | TRGV7-1 (22), TRGV11-1 (29,26) |
| HIP14064 | 374918 | 4 | TRGV11 (29,26,24,21) |
| HIP14121 | 291792 | 3 | TRGV10-1 (27), TRGV11-1 (28,24) |
| HIP14152 | 99534 | 1 | TRGV10-1 (23) |
| HIP14209 | 213320 | 1 | TRGV11-1 (29) |
| HIP14213 | 314526 | 4 | TRGV2-1 (39), TRGV7-1 (21), TRGV9-1 (25), TRGV11-1 (19) |
| All | 6284293 | 65 | Average nb of GL nucleotide/seq: 24.6 |
Figure 7Distribution of germline DNA retention length in coding joints in human rearranged TRB and TRG genes. These violin plots represent the distribution and median value of GL DNA retention length at coding ends in rearranged human TRB and TRG genes. The median value of germline DNA retention length is higher in TRG coding joints (p < 10−9 by a U-Mann-Whitney test). Dots represent outlying values.
Figure 8Germline DNA retention frequency in murine and human TRB and TRG coding joints. These histograms represents the Wilson frequency estimators and 95% confidence intervals of the frequency of occurrence of germline DNA retention in coding joints in the different datasets analyzed in this study.