| Literature DB >> 28856181 |
Rachele Antonacci1, Mariagrazia Bellini1, Vito Castelli1, Salvatrice Ciccarese1, Serafina Massari2.
Abstract
These data are presented in support of structural and evolutionary analysis of the published article entitled "The occurrence of three D-J-C clusters within the dromedary TRB locus highlights a shared evolution in Tylopoda, Ruminantia and Suina" (Antonacci et al., 2017) [1]. Here we describe the genomic structure and the gene content of the T cell receptor beta chain (TRB) locus in Camelus dromedarius. As in the other species of mammals, the general genomic organization of the dromedary TRB locus consists of a pool of TRBV genes located upstream of in tandem TRBD-J-C clusters, followed by a TRBV gene with an inverted transcriptional orientation. A peculiarity of the dromedary TRB locus structure is the presence of three TRBD-J-C clusters, which is a common feature of sheep, cattle and pig sequences.Entities:
Keywords: Camelus dromedarius; Dromedary genome; IMGT; T cell receptor; TRB locus
Year: 2017 PMID: 28856181 PMCID: PMC5562110 DOI: 10.1016/j.dib.2017.08.002
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Description of the TRB genes in the Camelus dromedarius genome assembly. The position of all genes and their classification and functionality are reported.
| TRBV1 | F | NW_011591622 | 861263-861886 |
| TRBV2 | F | NW_011591622 | 932263-932714 |
| TRBV3 | P | NW_011591622 | 927952-928412 |
| TRBV5S1 | F | NW_011591622 | 937384-937843 |
| TRBV5S2 | F | NW_011591622 | 940879-941358 |
| TRBV5S3 | F | NW_011591622 | 955293-955748 |
| TRBV6 | F | NW_011591622 | 944809-945237 |
| TRBV7S1 | F | NW_011591622 | 947134-947581 |
| TRBV7S2 | F | NW_011591622 | 962228-962689 |
| TRBV8 | F | NW_011591622 | 950124-950593 |
| TRBV9 | P | NW_011591622 | 965923-966346 |
| TRBV10 | F | NW_011591622 | 970368-970809 |
| TRBV11 | F | NW_011591622 | 975860-976308 |
| TRBV12S1 | P | NW_011591622 | 981727-982197 |
| TRBV12S2 | P | NW_011591622 | 992125-992569 |
| TRBV14 | P | NW_011591622 | 995472-995906 |
| TRBV15S1 | F | NW_011591622 | 997569-998023 |
| TRBV15S2 | F | NW_011591622 | 999129-999583 |
| TRBV16 | F | NW_011591622 | 1003645-1004098 |
| TRBV19 | F | NW_011591622 | 1018094-1018641 |
| TRBV20 | F | NW_011591622 | 1020910-1021565 |
| TRBV21S1 | F | NW_011591622 | 1028337-1028797 |
| TRBV21S2 | F | NW_011591151 | 70843-70731 |
| TRBV21S3 | P | NW_011591151 | 62738-62511 |
| TRBV22 | F | NW_011591151 | 46518-46381 |
| TRBV23 | P | NW_011591151 | 60590-60480 |
| TRBV24 | P | NW_011591151 | 56428-56106 |
| TRBV25 | F | NW_011591151 | 52347-52219 |
| TRBV26 | F | NW_011591151 | 66428-66297 |
| TRBV27 | F | NW_011591151 | 41158-41032 |
| TRBV28 | F | NW_011591151 | 32762-32640 |
| TRBV29 | F | NW_011591151 | 27109-26837 |
| TRBD1 | F | NW_011591151 | 9932-9943 |
| TRBJ1-1 | F | NW_011591151 | 9247-9294 |
| TRBJ1-2 | F | NW_011591151 | 9116-9159 |
| TRBJ1-3 | F | NW_011591151 | 8861-8910 |
| TRBJ1-4 | F | NW_011591151 | 8258-8308 |
| TRBJ1-5 | F | NW_011591151 | 7982-8031 |
| TRBJ1-6 | F | NW_011591151 | 7491-7543 |
| TRBC1 | F | NW_011591151 | EX1 4773-5166 |
| EX2 4311-4328 | |||
| EX3 4044-4150 | |||
| EX4 3711-3731 | |||
| TRBC3 | nd | NW_011591151 | EX2 2866-2883 |
| EX3 2599-2705 | |||
| EX4 2266-2286 | |||
| TRBJ3-1 | F | NW_011620189 | 653-702 |
| TRBJ3-1 | F | NW_011601111 | 2234-2283 |
| TRBJ3-2 | F | NW_011601111 | 2426-2476 |
| TRBJ3-3 | F | NW_011601111 | 2642-2690 |
| TRBJ3-4 | nd | NW_011601111 | 2787-2814 |
| TRBJ2-2 | F | NW_011616084 | 215-265 |
| TRBJ2-3 | nd | NW_011616084 | 2-46 |
| TRBJ2-6 | nd | NW_011607149 | 185-231 |
| TRBC2 | nd | NW_011593440 | EX1 1911-2149 |
| EX2 2622-2639 | |||
| EX3 2800-2906 | |||
| EX4 3190-3210 | |||
| TRBV30 | F | NW_011593440 | 14509-14160 |
nd: not defined (indicates that the nt sequence of the gene is incomplete and its functionality cannot be defined).
L-PART1/ V-exon for TRBV genes and coding sequence for TRBD and TRBJ.
Description of the Camdro TRBV pseudogenes.
| TRBV3 | ● | ||||
| TRBV9 | ● | ● | |||
| TRBV12S1 | ● | ||||
| TRBV12S2 | ● | ||||
| TRBV14 | ● | ||||
| TRBV21S3 | ● | ● | |||
| TRBV23 | ● | ||||
| TRBV24 | ● | ● |
Description of the unrelated TRB genes in the Camelus dromedarius genome assembly. The position of all genes and their classification and functionality are reported.
| MOXD2 | F | NW_011591622 | 850155-856730 |
| TRY1 | F | NW_011591622 | 870036-876394 |
| TRY2 | F | NW_011591622 | 882909-888072 |
| TRY3 | nd | NW_011623391 | 1-2387 |
| TRY4 | F | NW_011591151 | 13974-17714 |
| EPBH6 | F | NW_011593440 | 46466-60647 |
nd: not defined (indicates that the nt sequence of the gene is incomplete and its functionality cannot be defined).
Fig. 1The IMGT Protein display of the dromedary TRBV genes. Only functional genes and in-frame pseudogenes are shown. The description of the strands and loops and of the FR-IMGT and CDR-IMGT is according to the IMGT unique numbering for V-REGION [6]. The amino acid length of the CDR-IMGT AA is also indicated in square brackets.
Camelus dromedarius D-J-C region genomic clones. The primer sequences, the PCR conditions and the size of each clone are reported.
| Clone | Primer pairs sequence (5′-3′) | Primer location | T annealing | Product length (bp) |
|---|---|---|---|---|
| pSCBJ11 | JB11U: CTTTGGAGAAGGCACCAG | TRBJ1-1 gene | 55/58 | 4396 |
| CB2L: TGGTTGCGGGGGTTGTGC | TRBC gene exon 1 | |||
| pSCJ22KN | CB2U: GCACAACCCCCGCAACCA | TRBC gene exon 1 | 53/55 | 5000 |
| JB34L: GCCAAAGTACTGAGTGTT | TRBJ3-4 gene | |||
| pSCBJ27U | JB34U: AACACTCAGTACTTTGGC | TRBJ3-4 gene | 56/58 | 4077 |
| CB2L: TGGTTGCGGGGGTTGTGC | TRBC gene exon 1 | |||
| pSCBD3 | CB2U: GCACAACCCCCGCAACCA | TRBC gene exon 1 | 55/56 | 4848 |
| JB23L: CCGCCGAAAAACAGTGTC | TRBJ2-3 gene | |||
| pSCMG1 | JB23U: GACACTGTTTTTCGGCGG | TRBJ2-3 gene | 55/58 | 3160 |
| CB2L: TGGTTGCGGGGGTTGTGC | TRBC gene exon 1 | |||
| pSCB2C8 | CB2U: GCACAACCCCCGCAACCA | TRBC gene exon 1 | 62 | 1331 |
| 3UTR:GTTGAGCTCACTTTGCAGGG | TRBC2 gene 3UTR |
Fig. 2Nucleotide and deduced amino acid sequences of the dromedary TRBD (a), TRBJ (b) and TRDC (c) genes. The consensus sequence of the heptamer and nonamer are provided at the top of the figure and underlined. The numbering adopted for the gene classification is reported on the left of each gene. The gene sequence retrieved from the Ca_dromedarius_V1.0 genomic assembly is highlighted in red. In (a), the inferred amino acid sequence of the TRBD genes in the three coding frames are reported. In (b), the donor splice site for each TRBJ is shown. The canonical FGXG amino acid motifs are underlined. The unusual TRBJ3.6 gene motif is in italics. In (c), IMGT Protein display of the dromedary TRBC genes. Descriptions of the strands and loops were collected according to the IMGT unique numbering for C-DOMAIN [7].
| Subject area | |
|---|---|
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility |