Literature DB >> 28856181

Data characterizing the genomic structure of the T cell receptor (TRB) locus in Camelus dromedarius.

Rachele Antonacci1, Mariagrazia Bellini1, Vito Castelli1, Salvatrice Ciccarese1, Serafina Massari2.   

Abstract

These data are presented in support of structural and evolutionary analysis of the published article entitled "The occurrence of three D-J-C clusters within the dromedary TRB locus highlights a shared evolution in Tylopoda, Ruminantia and Suina" (Antonacci et al., 2017) [1]. Here we describe the genomic structure and the gene content of the T cell receptor beta chain (TRB) locus in Camelus dromedarius. As in the other species of mammals, the general genomic organization of the dromedary TRB locus consists of a pool of TRBV genes located upstream of in tandem TRBD-J-C clusters, followed by a TRBV gene with an inverted transcriptional orientation. A peculiarity of the dromedary TRB locus structure is the presence of three TRBD-J-C clusters, which is a common feature of sheep, cattle and pig sequences.

Entities:  

Keywords:  Camelus dromedarius; Dromedary genome; IMGT; T cell receptor; TRB locus

Year:  2017        PMID: 28856181      PMCID: PMC5562110          DOI: 10.1016/j.dib.2017.08.002

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data These data insight into the genomic structure of the T cell receptor (TRB) locus in Camelus dromedaries. This results in the first, mostly complete, map of the TRB locus in a species of the Tylopoda suborder. The dromedary TRB locus characterization can be used to increase the understanding in the evolution of Camelidae and to contribute to solving the relative placement of this species within the Artiodactyla order. The availability of the sequence of the dromedary TRB locus allows researchers to concentrate on functional study and provides a tool to use this specie as a valuable model for immunological research.

Data

Data presented in the text include tables and figures giving information on the genomic structure and the gene content of the dromedary TRB locus, a mammalian species belonging to the Camelus genus. This information was obtained by integrating the sequence data deduced from the public genomic assembly [2] with sequences obtained by PCR experiments conducted in our laboratory. Table 1 describes position, classification and functionality of the TRB genes retrieved from the dromedary public genome assembly. Table 2 shows the description of the dromedary TRBV pseudogenes. Table 3 describes position, classification and functionality of the unrelated TRB genes recovered from the dromedary public genome assembly. Fig. 1 shows the deduced amino acid sequences of the dromedary TRBV genes according to IMGT unique numbering for the V-REGION [6]. Table 4 provides the list of the genomic clones of the dromedary TRBD-J-C region with the primer pairs used and the PCR conditions. Fig. 2 shows the TRBD, the TRBJ and the TRBC gene sequences.
Table 1

Description of the TRB genes in the Camelus dromedarius genome assembly. The position of all genes and their classification and functionality are reported.

Gene classificationFunctionalityaNCBI Reference SequencePositionb
TRBV1FNW_011591622861263-861886
TRBV2FNW_011591622932263-932714
TRBV3PNW_011591622927952-928412
TRBV5S1FNW_011591622937384-937843
TRBV5S2FNW_011591622940879-941358
TRBV5S3FNW_011591622955293-955748
TRBV6FNW_011591622944809-945237
TRBV7S1FNW_011591622947134-947581
TRBV7S2FNW_011591622962228-962689
TRBV8FNW_011591622950124-950593
TRBV9PNW_011591622965923-966346
TRBV10FNW_011591622970368-970809
TRBV11FNW_011591622975860-976308
TRBV12S1PNW_011591622981727-982197
TRBV12S2PNW_011591622992125-992569
TRBV14PNW_011591622995472-995906
TRBV15S1FNW_011591622997569-998023
TRBV15S2FNW_011591622999129-999583
TRBV16FNW_0115916221003645-1004098
TRBV19FNW_0115916221018094-1018641
TRBV20FNW_0115916221020910-1021565
TRBV21S1FNW_0115916221028337-1028797
TRBV21S2FNW_01159115170843-70731
TRBV21S3PNW_01159115162738-62511
TRBV22FNW_01159115146518-46381
TRBV23PNW_01159115160590-60480
TRBV24PNW_01159115156428-56106
TRBV25FNW_01159115152347-52219
TRBV26FNW_01159115166428-66297
TRBV27FNW_01159115141158-41032
TRBV28FNW_01159115132762-32640
TRBV29FNW_01159115127109-26837
TRBD1FNW_0115911519932-9943
TRBJ1-1FNW_0115911519247-9294
TRBJ1-2FNW_0115911519116-9159
TRBJ1-3FNW_0115911518861-8910
TRBJ1-4FNW_0115911518258-8308
TRBJ1-5FNW_0115911517982-8031
TRBJ1-6FNW_0115911517491-7543
TRBC1FNW_011591151EX1 4773-5166
EX2 4311-4328
EX3 4044-4150
EX4 3711-3731
TRBC3ndNW_011591151EX2 2866-2883
EX3 2599-2705
EX4 2266-2286
TRBJ3-1FNW_011620189653-702
TRBJ3-1FNW_0116011112234-2283
TRBJ3-2FNW_0116011112426-2476
TRBJ3-3FNW_0116011112642-2690
TRBJ3-4ndNW_0116011112787-2814
TRBJ2-2FNW_011616084215-265
TRBJ2-3ndNW_0116160842-46
TRBJ2-6ndNW_011607149185-231
TRBC2ndNW_011593440EX1 1911-2149
EX2 2622-2639
EX3 2800-2906
EX4 3190-3210
TRBV30FNW_01159344014509-14160

nd: not defined (indicates that the nt sequence of the gene is incomplete and its functionality cannot be defined).

L-PART1/ V-exon for TRBV genes and coding sequence for TRBD and TRBJ.

Table 2

Description of the Camdro TRBV pseudogenes.

TRBV genesDefective LeaderFrameshiftStop codonDefective splice sitesDefective RSS
TRBV3
TRBV9
TRBV12S1
TRBV12S2
TRBV14
TRBV21S3
TRBV23
TRBV24
Table 3

Description of the unrelated TRB genes in the Camelus dromedarius genome assembly. The position of all genes and their classification and functionality are reported.

Gene classificationFunctionalityaNCBI reference sequencePosition
MOXD2FNW_011591622850155-856730
TRY1FNW_011591622870036-876394
TRY2FNW_011591622882909-888072
TRY3ndNW_0116233911-2387
TRY4FNW_01159115113974-17714
EPBH6FNW_01159344046466-60647

nd: not defined (indicates that the nt sequence of the gene is incomplete and its functionality cannot be defined).

Fig. 1

The IMGT Protein display of the dromedary TRBV genes. Only functional genes and in-frame pseudogenes are shown. The description of the strands and loops and of the FR-IMGT and CDR-IMGT is according to the IMGT unique numbering for V-REGION [6]. The amino acid length of the CDR-IMGT AA is also indicated in square brackets.

Table 4

Camelus dromedarius D-J-C region genomic clones. The primer sequences, the PCR conditions and the size of each clone are reported.

ClonePrimer pairs sequence (5′-3′)Primer locationT annealingProduct length (bp)
pSCBJ11JB11U: CTTTGGAGAAGGCACCAGTRBJ1-1 gene55/584396
CB2L: TGGTTGCGGGGGTTGTGCTRBC gene exon 1
pSCJ22KNCB2U: GCACAACCCCCGCAACCATRBC gene exon 153/555000
JB34L: GCCAAAGTACTGAGTGTTTRBJ3-4 gene
pSCBJ27UJB34U: AACACTCAGTACTTTGGCTRBJ3-4 gene56/584077
CB2L: TGGTTGCGGGGGTTGTGCTRBC gene exon 1
pSCBD3CB2U: GCACAACCCCCGCAACCATRBC gene exon 155/564848
JB23L: CCGCCGAAAAACAGTGTCTRBJ2-3 gene
pSCMG1JB23U: GACACTGTTTTTCGGCGGTRBJ2-3 gene55/583160
CB2L: TGGTTGCGGGGGTTGTGCTRBC gene exon 1
pSCB2C8CB2U: GCACAACCCCCGCAACCATRBC gene exon 1621331
3UTR:GTTGAGCTCACTTTGCAGGGTRBC2 gene 3UTR
Fig. 2

Nucleotide and deduced amino acid sequences of the dromedary TRBD (a), TRBJ (b) and TRDC (c) genes. The consensus sequence of the heptamer and nonamer are provided at the top of the figure and underlined. The numbering adopted for the gene classification is reported on the left of each gene. The gene sequence retrieved from the Ca_dromedarius_V1.0 genomic assembly is highlighted in red. In (a), the inferred amino acid sequence of the TRBD genes in the three coding frames are reported. In (b), the donor splice site for each TRBJ is shown. The canonical FGXG amino acid motifs are underlined. The unusual TRBJ3.6 gene motif is in italics. In (c), IMGT Protein display of the dromedary TRBC genes. Descriptions of the strands and loops were collected according to the IMGT unique numbering for C-DOMAIN [7].

The IMGT Protein display of the dromedary TRBV genes. Only functional genes and in-frame pseudogenes are shown. The description of the strands and loops and of the FR-IMGT and CDR-IMGT is according to the IMGT unique numbering for V-REGION [6]. The amino acid length of the CDR-IMGT AA is also indicated in square brackets. Nucleotide and deduced amino acid sequences of the dromedary TRBD (a), TRBJ (b) and TRDC (c) genes. The consensus sequence of the heptamer and nonamer are provided at the top of the figure and underlined. The numbering adopted for the gene classification is reported on the left of each gene. The gene sequence retrieved from the Ca_dromedarius_V1.0 genomic assembly is highlighted in red. In (a), the inferred amino acid sequence of the TRBD genes in the three coding frames are reported. In (b), the donor splice site for each TRBJ is shown. The canonical FGXG amino acid motifs are underlined. The unusual TRBJ3.6 gene motif is in italics. In (c), IMGT Protein display of the dromedary TRBC genes. Descriptions of the strands and loops were collected according to the IMGT unique numbering for C-DOMAIN [7]. Description of the TRB genes in the Camelus dromedarius genome assembly. The position of all genes and their classification and functionality are reported. nd: not defined (indicates that the nt sequence of the gene is incomplete and its functionality cannot be defined). L-PART1/ V-exon for TRBV genes and coding sequence for TRBD and TRBJ. Description of the Camdro TRBV pseudogenes. Description of the unrelated TRB genes in the Camelus dromedarius genome assembly. The position of all genes and their classification and functionality are reported. nd: not defined (indicates that the nt sequence of the gene is incomplete and its functionality cannot be defined). Camelus dromedarius D-J-C region genomic clones. The primer sequences, the PCR conditions and the size of each clone are reported.

Experimental design, materials and methods

Analysis of the dromedary TRB locus retrieved from the genome assembly: identification of the related and unrelated TRB genes

We employed the recent submission to NCBI (BioProject PRJNA234474) of a draft genome sequence from the Arabian camel [2] to identify the TRB locus in this species. A standard BLAST search (Basic Local Alignment Search Tool. http://blast.ncbi.nlm.nih.gov/Blast.cgi.) of the dromedary genomic resource was then performed by using human and sheep TRB gene sequences to assess their physical location in the dromedary genome. We directly retrieved a sequence of 457871 pb (gaps included) from the PRJNA234474_Ca_dromedarius_V1.0 assembly that corresponds to eight distinct unplaced and not continuous scaffolds (Fig. 1 in [1]). The sequence comprises the MOXD2 and the EPHB6 genes that flank the 5′ and 3′ ends, respectively, of all mammalian TRB loci studied to date. All dromedary TRB genes have been recognized and annotated while taking into account both the human sequence and the sheep genomic D-J-C region as a reference [3], [4], [5] (Table 1). The functionality of V, J and C genes was predicted through the manual alignment of sequences adopting the following parameters: (a) identification of the leader sequence at the 5′ of the TRBV genes; (b) determination of proper recombination signal (RS) sequences located at 3′ of the TRBV, 5′ of the TRBJ, and 3′ and 5′ ends of the TRBD genes, respectively; (c) determination of correct acceptor and donor splicing sites; (d) estimation of the expected length of the coding regions; (e) absence of frameshifts and stop signals in the coding regions of the genes. We annotated 33 TRBV germline genes (twenty-five functional genes and eight pseudogenes) (Table 2), one TRBD, 13 TRBJ and two complete and one incomplete TRBC genes. The analysis of the 3′ part of the locus revealed the potential presence of three D-J-C clusters similar to clusters found in sheep [4], [5]. We also identified and annotated four trypsin-like serine protease (TRY) genes (Table 3). In this context, downstream of the TRBV1 gene, proceeding from 5′ to 3′, we found as in humans two protease genes that we recognized tentatively, according to their genomic position, as TRY1 (alias PRSS58 or TRYX3) and TRY2 (alias TRY2P), respectively. A third TRY gene, named TRY3, was homologous to a gene located after the TRY2P gene in humans that was found within the NW_011623391 unplaced scaffold. Extrapolation of the synteny with the human sequence predicts that the NW_011623391 scaffold should be juxtaposed within the dromedary TRB locus, upstream of the TRBV3 gene (Fig. 1 in [1]). An additional TRY gene, classified as TRY4, was found before the D-J-C region. Thus, unlike humans, only one TRY gene encompasses the array of the TRBV genes. All dromedary TRY genes appear putatively functional with the presence of correct acceptor and donor splicing site and an absence of frameshifts and stop codon in their coding regions. The genomic structure of the MOXD2 and EPHB6 genes, which delimit the TRB locus, was also defined (Table 3).

Protein display of the dromedary TRBV genes

The deduced amino acid sequences of the germline TRBV genes were manually aligned according to IMGT unique numbering for the V-REGION [6] to maximize the percentage of identity (Fig. 1). Only potential functional genes and in-frame pseudogenes are shown. All sequences exhibit the typical framework regions (FR) and complementarity determining regions (CDR) as well as the four amino acids: cysteine 23 (1st-CYS) in FR1-IMGT, tryptophan 41 (CONSERVED-TRP) in FR2-IMGT, hydrophobic amino acid 89, and cysteine 104 (2nd-CYS) in FR3-IMGT [6]. Conversely, CDR-IMGT varies in amino acid composition and length. It should be noted that the TRBV21 genes show a difference in length of one amino acid in the FR3 that corresponds to a C′′ strand that is shorter and has a diverse amino acid sequence for TRBV21S2 compared to the TRBV21S1 gene.

Isolation of the dromedary TRBD-J-C region and analysis of the gene content

To isolate the entire TRBD-J-C region, we set up six different PCRs to produce six consecutive amplicons that cover the region between the first TRBJ and the last TRBC gene. Mostly, for each amplification, we used a primer pair, a gene-specific primer designed on the sequence of the TRBJ genes identified within the cDNA clones (see [1]), and a conserved primer constructed on the first exon of the TRBC genes. For the isolation of the TRBC2 gene, a 3'UTR lower primer derived from the sequence of the genomic assembly was used. Amplification consisted of an initial denaturation step at 93 °C for 2 min followed by 10 amplification cycles that each comprised a denaturation step at 93 °C for 10 s, an annealing step with a low temperature (53–56 °C, according to the melting temperature of the primers) for 30 s, an extension step at 68 °C for 7 min, followed by 25 cycles with a higher annealing temperature (55–58 °C, according to the melting temperature of the primers) and a gradually increasing extension time of 20 s as well as a final incubation at 68 °C for 7 min. A 30-deoxyadenosine overhang was added to blunt-ended amplicons by incubation with 1.0 unit of Platinum Taq DNA Polymerase (Invitrogen) at 72 °C for 10 min. These products were purified and cloned into the StrataClone TA-vector per the manufacturer's instructions. For each sample, 6 to 10 colonies were propagated and bi-directionally sequenced using M13 and T7 vector-specific primers. All plasmid sequence data were manually analysed. For the list of the clones with the primer pairs used and the PCR conditions see Table 4. All the obtained amplicons were sequenced (Acc. no. LT837971). The sequenced region is schematically illustrated in Fig. 3 in [1]. The nucleotide and deduced amino acid sequences of the TRBD, TRBJ and TRBC genes classified according to the similarity to the sheep sequence are shown in Fig. 2.
Subject areaBiology, genetics, genomics
More specific subject areaGenetics, Genomics and Molecular Biology
Type of dataTables and figures
How data was acquiredA standard BLAST search (Basic Local Alignment Search Tool.http://blast.ncbi.nlm.nih.gov/Blast.cgi.) of the public dromedary genomic assembly, Long PCR on genomic DNA and cloning
Data formatAnalyzed
Experimental factorsSequence analysis and dromedary DNA extraction
Experimental featuresDromedary lung genomic DNA was prepared from a single healthy animal. PCRs were performed by High Fidelity DNA polymerase. The PCR products were purified and cloned into the TA-vector system.
Data source locationBari and Lecce, Italy
Data accessibilityThe whole dromedary genome shotgun sequence is available at GenBank (ID: GCA_000767585.1). Sequence data published with this article were registered in EMBL database with the Accession numberLT837971
  8 in total

Review 1.  IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains.

Authors:  Marie-Paule Lefranc; Christelle Pommié; Manuel Ruiz; Véronique Giudicelli; Elodie Foulquier; Lisa Truong; Valérie Thouvenin-Contet; Gérard Lefranc
Journal:  Dev Comp Immunol       Date:  2003-01       Impact factor: 3.636

2.  IMGT unique numbering for immunoglobulin and T cell receptor constant domains and Ig superfamily C-like domains.

Authors:  Marie-Paule Lefranc; Christelle Pommié; Quentin Kaas; Elodie Duprat; Nathalie Bosc; Delphine Guiraudou; Christelle Jean; Manuel Ruiz; Isabelle Da Piédade; Mathieu Rouard; Elodie Foulquier; Valérie Thouvenin; Gérard Lefranc
Journal:  Dev Comp Immunol       Date:  2005       Impact factor: 3.636

3.  Organization, structure and evolution of 41kb of genomic DNA spanning the D-J-C region of the sheep TRB locus.

Authors:  R Antonacci; S Di Tommaso; C Lanave; E P Cribiu; S Ciccarese; S Massari
Journal:  Mol Immunol       Date:  2007-07-27       Impact factor: 4.407

4.  Erratum: Camelid genomes reveal evolution and adaptation to desert environments.

Authors:  Huiguang Wu; Xuanmin Guang; Mohamed B Al-Fageeh; Junwei Cao; Shengkai Pan; Huanmin Zhou; Li Zhang; Mohammed H Abutarboush; Yanping Xing; Zhiyuan Xie; Ali S Alshanqeeti; Yanru Zhang; Qiulin Yao; Badr M Al-Shomrani; Dong Zhang; Jiang Li; Manee M Manee; Zili Yang; Linfeng Yang; Yiyi Liu; Jilin Zhang; Musaad A Altammami; Shenyuan Wang; Lili Yu; Wenbin Zhang; Sanyang Liu; La Ba; Chunxia Liu; Xukui Yang; Fanhua Meng; Shaowei Wang; Lu Li; Erli Li; Xueqiong Li; Kaifeng Wu; Shu Zhang; Junyi Wang; Ye Yin; Huanming Yang; Abdulaziz M Al-Swailem; Jun Wang
Journal:  Nat Commun       Date:  2015-01-28       Impact factor: 14.919

5.  Camelid genomes reveal evolution and adaptation to desert environments.

Authors:  Huiguang Wu; Xuanmin Guang; Mohamed B Al-Fageeh; Junwei Cao; Shengkai Pan; Huanmin Zhou; Li Zhang; Mohammed H Abutarboush; Yanping Xing; Zhiyuan Xie; Ali S Alshanqeeti; Yanru Zhang; Qiulin Yao; Badr M Al-Shomrani; Dong Zhang; Jiang Li; Manee M Manee; Zili Yang; Linfeng Yang; Yiyi Liu; Jilin Zhang; Musaad A Altammami; Shenyuan Wang; Lili Yu; Wenbin Zhang; Sanyang Liu; La Ba; Chunxia Liu; Xukui Yang; Fanhua Meng; Shaowei Wang; Lu Li; Erli Li; Xueqiong Li; Kaifeng Wu; Shu Zhang; Junyi Wang; Ye Yin; Huanming Yang; Abdulaziz M Al-Swailem; Jun Wang
Journal:  Nat Commun       Date:  2014-10-21       Impact factor: 14.919

6.  The occurrence of three D-J-C clusters within the dromedary TRB locus highlights a shared evolution in Tylopoda, Ruminantia and Suina.

Authors:  Rachele Antonacci; Mariagrazia Bellini; Angela Pala; Micaela Mineccia; Mohamed S Hassanane; Salvatrice Ciccarese; Serafina Massari
Journal:  Dev Comp Immunol       Date:  2017-05-31       Impact factor: 3.636

7.  IMGT®, the international ImMunoGeneTics information system® 25 years on.

Authors:  Marie-Paule Lefranc; Véronique Giudicelli; Patrice Duroux; Joumana Jabado-Michaloud; Géraldine Folch; Safa Aouinti; Emilie Carillon; Hugo Duvergey; Amélie Houles; Typhaine Paysan-Lafosse; Saida Hadi-Saljoqi; Souphatta Sasorith; Gérard Lefranc; Sofia Kossida
Journal:  Nucleic Acids Res       Date:  2014-11-05       Impact factor: 19.160

8.  Extensive analysis of D-J-C arrangements allows the identification of different mechanisms enhancing the diversity in sheep T cell receptor beta-chain repertoire.

Authors:  Silvia Di Tommaso; Rachele Antonacci; Salvatrice Ciccarese; Serafina Massari
Journal:  BMC Genomics       Date:  2010-01-04       Impact factor: 3.969

  8 in total
  5 in total

Review 1.  The Camel Adaptive Immune Receptors Repertoire as a Singular Example of Structural and Functional Genomics.

Authors:  Salvatrice Ciccarese; Pamela A Burger; Elena Ciani; Vito Castelli; Giovanna Linguiti; Martin Plasil; Serafina Massari; Petr Horin; Rachele Antonacci
Journal:  Front Genet       Date:  2019-10-17       Impact factor: 4.599

2.  The T Cell Receptor (TRB) Locus in Tursiops truncatus: From Sequence to Structure of the Alpha/Beta Heterodimer in the Human/Dolphin Comparison.

Authors:  Giovanna Linguiti; Sofia Kossida; Ciro Leonardo Pierri; Joumana Jabado-Michaloud; Geraldine Folch; Serafina Massari; Marie-Paule Lefranc; Salvatrice Ciccarese; Rachele Antonacci
Journal:  Genes (Basel)       Date:  2021-04-14       Impact factor: 4.096

Review 3.  Evolution of the T-Cell Receptor (TR) Loci in the Adaptive Immune Response: The Tale of the TRG Locus in Mammals.

Authors:  Rachele Antonacci; Serafina Massari; Giovanna Linguiti; Anna Caputi Jambrenghi; Francesco Giannico; Marie-Paule Lefranc; Salvatrice Ciccarese
Journal:  Genes (Basel)       Date:  2020-06-05       Impact factor: 4.096

4.  The expansion of the TRB and TRG genes in domestic goats (Capra hircus) is characteristic of the ruminant species.

Authors:  Francesco Giannico; Serafina Massari; Anna Caputi Jambrenghi; Adriano Soriano; Angela Pala; Giovanna Linguiti; Salvatrice Ciccarese; Rachele Antonacci
Journal:  BMC Genomics       Date:  2020-09-11       Impact factor: 3.969

5.  Overview of the Germline and Expressed Repertoires of the TRB Genes in Sus scrofa.

Authors:  Serafina Massari; Mariagrazia Bellini; Salvatrice Ciccarese; Rachele Antonacci
Journal:  Front Immunol       Date:  2018-11-05       Impact factor: 7.561

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.