| Literature DB >> 34875068 |
Taciana Manso1, Géraldine Folch1, Véronique Giudicelli1, Joumana Jabado-Michaloud1, Anjana Kushwaha1, Viviane Nguefack Ngoune1, Maria Georga1, Ariadni Papadaki1, Chahrazed Debbagh1, Perrine Pégorier1, Morgane Bertignac1, Saida Hadi-Saljoqi1, Imène Chentli1, Karima Cherouali1, Safa Aouinti1, Amar El Hamwi1, Alexandre Albani1, Merouane Elazami Elhassani1, Benjamin Viart1, Agathe Goret1, Anna Tran1, Gaoussou Sanou1, Maël Rollin1, Patrice Duroux1, Sofia Kossida1.
Abstract
IMGT®, the international ImMunoGeneTics information system®, http://www.imgt.org/, is at the forefront of the immunogenetics and immunoinformatics fields with more than 30 years of experience. IMGT® makes available databases and tools to the scientific community pertaining to the adaptive immune response, based on the IMGT-ONTOLOGY. We focus on the recent features of the IMGT® databases, tools, reference directories and web resources, within the three main axes of IMGT® research and development. Axis I consists in understanding the adaptive immune response, by deciphering the identification and characterization of the immunoglobulin (IG) and T cell receptor (TR) genes in jawed vertebrates. It is the starting point of the two other axes, namely the analysis and exploration of the expressed IG and TR repertoires based on comparison with IMGT reference directories in normal and pathological situations (Axis II) and the analysis of amino acid changes and functions of 2D and 3D structures of antibody and TR engineering (Axis III).Entities:
Mesh:
Substances:
Year: 2022 PMID: 34875068 PMCID: PMC8728119 DOI: 10.1093/nar/gkab1136
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.IMGT resources. (A) Overview of IMGT databases and tools for genes, sequences and structures. (B) Main databases and datasets in the three axes of IMGT information system.
New IMGT concepts and their definitions
| New IMGT concepts | Definition | |
|---|---|---|
| IMGT label | IMGT-LOCUS-UNIT | gDNA of an immunoglobulin (IG) or T cell receptor (TR) IMGT locus unit from chromosome genomic assembly, that starts at the 5 prime (5′) end of the most 5′ IG or TR GENE-UNIT in the locus and ends at the 3 prime (3′) end of the most 3′ IG or TR GENE-UNIT in the locus |
| IMGT qualifiers | IMGT_locus_3prime_borne | Name of the gene identified as the 3′ borne of an IMGT-LOCUS-UNIT |
| IMGT_locus_3prime_gene | IMGT gene name of the most 3′ IG or TR GENE-UNIT of an IMGT-LOCUS-UNIT | |
| IMGT_locus_5prime_borne | Name of the gene identified as the 5′ borne of an IMGT-LOCUS-UNIT | |
| IMGT_locus_5prime_gene | IMGT gene name of the most 5′ IG or TR GENE-UNIT of an IMGT-LOCUS-UNIT | |
| IMGT_locus_length | Length of an IMGT-LOCUS-UNIT in kb or in bp | |
| IMGT_locus_name | Name of an IMGT-LOCUS-UNIT, that includes the Latin genus and species name and the IMGT locus type | |
| IMGT_locus_orientation | Orientation of an IMGT-LOCUS-UNIT on a chromosome, is either forward (FWD) or reverse (REV) | |
| IMGT_locus_positions | Positions of an IMGT-LOCUS-UNIT on a chromosome | |
| IMGT_locus_type | IMGT locus type (in higher vertebrates: IGH, IGK, IGL, TRA, TRB, TRG, TRD) of an IMGT-LOCUS-UNIT | |
54 fully annotated IG and TR loci are available in IMGT databases and tools, among these 54 loci, 50 have an IMGT locus accession number and 4 (with * in this table) have accession numbers from ENA, NCBI and Ensembl contigs built before the creation of IMGT Locus accession numbers. Note that the IMGT® biocuration of the first two fully annotated species, human (Homo sapiens) and mouse (Mus musculus) are not shown in this table. More information is available in http://www.imgt.org/IMGTrepertoire/LocusGenes/
| Taxon | Species | NCBI Assembly | Locus | Chromosomal localization | NCBI Chromosome Accession numbers | IMGT locus Accession numbers |
|---|---|---|---|---|---|---|
| MAMMALIA EUTHERIA (placentals) |
| ARS-UCD1.2 | IGK | 11 | CM008178.2 | IMGT000047 |
| IGL | 17 | CM008184.2 | IMGT000046 | |||
| TRA | 10 | CM008177.2 | IMGT000049 | |||
| TRD | 10 | CM008177.2 | IMGT000049 | |||
|
| Unknown | IGH | 21q24 | Unknown | * | |
|
| Unknown | TRG | 4 | Unknown | * | |
|
| CamDro3 | IGK | 28 | CM016654.2 | IMGT000061 | |
|
| CanFam3.1 | IGH | 8 | CM000008.3 | IMGT000001 | |
| IGK | 17 | CM000017.3 | IMGT000002 | |||
| IGL | 26 | CM000026.3 | IMGT000003 | |||
| TRA | 8 | CM000008.3 | IMGT000004 | |||
| TRB | 16 | CM000016.3 | IMGT000005 | |||
| TRD | 8 | CM000008.3 | IMGT000004 | |||
| TRG | 18 | CM000018.3 | IMGT000006 | |||
|
| Basenji_breed-1.1 | IGK | 17 | CM016447.1 | IMGT000067 | |
|
| ARS1 | IGK | 11 | CM004572.1 | IMGT000009 | |
| IGL | 17 | CM004578.1 | IMGT000033 | |||
|
| EquCab3.0 | IGH | 24 | CM009171.1 | IMGT000040 | |
| IGK | 15 | CM009162.1 | IMGT000053 | |||
|
| EquCab2.0 | IGK | 15 | CM000391.2 | IMGT000060 | |
|
| Felis_catus_9.0 | IGK | A3 | CM001380.3 | IMGT000050 | |
| IGL | D3 | CM001389.3 | IMGT000038 | |||
| TRA | B3 | CM001383.3 | IMGT000045 | |||
| TRB | A2 | CM001379.3 | IMGT000037 | |||
| TRD | B3 | CM001383.3 | IMGT000045 | |||
| TRG | A2 | CM001379.3 | IMGT000036 | |||
|
| Macaca_fascicularis_5.0 | TRB | 3 | CM001921.1 | IMGT000075 | |
|
| Mmul_10 | IGH | 7 | CM014342.1 | IMGT000064 | |
| IGK | 13 | CM014348.1 | IMGT000063 | |||
| IGL | 10 | CM014345.1 | IMGT000062 | |||
| TRB | 3 | CM014338.1 | IMGT000073 | |||
| TRG | 3 | CM014338.1 | IMGT000059 | |||
|
| Mmul_8.0.1 | TRA | 7 | CM002991.3 | IMGT000013 | |
| TRB | 3 | CM002984.2 | IMGT000012 | |||
| TRD | 7 | CM002991.3 | IMGT000013 | |||
|
| MusPutFur1.0 | TRB | Unknown | Unplaced genomic scaffold | IMGT000023 | |
|
| OryCun2.0 | TRA | 17 | CM000806.1 | IMGT000031 | |
| TRB | Unknown | Unplaced genomic scaffold | IMGT000032 | |||
| TRD | 17 | CM000806.1 | IMGT000031 | |||
| TRG | 10 | CM000799.1 | IMGT000030 | |||
|
| Oar_v4.0 | IGK | 3 | CM001584.2 | IMGT000010 | |
| IGL | 17 | CM001598.2 | IMGT000034 | |||
|
| Oar_rambouillet_v1.0 | IGL | 17 | CM008488.1 | IMGT000041 | |
| TRA | 7 | CM008478.1 | IMGT000048 | |||
| TRB | 4 | CM008475.1 | IMGT000042 | |||
| TRD | 7 | CM008478.1 | IMGT000048 | |||
|
| Rn_Celera Alternate Assembly AC_000074.1 | IGH | 6q32,33 | CM000236.2 | * | |
|
| Sscrofa11.1 | TRB | 18 | CM000829.5 | IMGT000039 | |
|
| turTru1 (Ensembl assembly) | TRA | Unknown | Ensembl genomic scaffold | IMGT000016 | |
| IMGT000017 | ||||||
| IMGT000018 | ||||||
| IMGT000020 | ||||||
| TRD | Unknown | IMGT000016 | ||||
| IMGT000017 | ||||||
| IMGT000018 | ||||||
|
| NIST Tur_tru v1 | TRG | Unknown | Unplaced genomic scaffold | IMGT000015 | |
| Aves |
| GRCg6 | IGH | 31 | CM003638.2 | IMGT000014 |
| Gallus_gallus-5.0 | Unknown | Unplaced genomic scaffold | IMGT000007 | |||
| Teleostei |
| GRCz11 | IGH | 3 | CM002887.2 | * |
|
| Omyk_1.0 | IGH | 13 | CM007947.1 | IMGT000043 | |
| 12 | CM007946.1 | IMGT000044 | ||||
|
| ICSASG_v2 | IGH | 6 | CM003284.1 | IMGT000028 | |
| 3 | CM003281.1 | IMGT000029 |
IMGT/V-QUEST reference directories for the analysis of rearranged V-DOMAIN (release 202135–4 on 2 September 2021).
| IMGT/V-QUEST reference directories | |||
|---|---|---|---|
| Taxon | Species | IG | TR |
| MAMMALIA EUTHERIA (placentals) |
| IGH, IGK, IGL | TRA, TRB, TRG, TRD |
|
| IGH, IGK, IGL | TRA, TRB, TRG, TRD | |
|
| TRA, TRG | ||
|
| IGH, IGK, IGL | TRA, TRG, TRD | |
|
| IGK | TRB, TRG | |
|
| IGH, IGK, IGL | TRA, TRB, TRG, TRD | |
|
| IGK, IGL | ||
|
| IGH, IGK | ||
|
| IGK, IGL | TRA, TRB, TRG, TRD | |
|
| IGH | TRB | |
|
| IGH, IGK, IGL | TRA, TRB, TRG, TRD | |
|
| TRB | ||
|
| IGH, IGK, IGL | TRA, TRB, TRG, TRD | |
|
| IGH, IGK, IGL | TRA, TRB, TRD | |
|
| IGH, IGK, IGL | ||
|
| IGH, IGK, IGL | TRB | |
|
| TRA, TRG, TRD | ||
|
| IGH | ||
| MAMMALIA PROTHERIA (monotremes) |
| IGH | |
| Aves |
| IGH, IGL | |
| Teleostei |
| IGH, IGI | TRA, TRD |
|
| IGH | TRB | |
|
| IGH | ||
IMGT/V-QUEST reference directory based alignment results for nucleotide V-DOMAIN analysis
| IMGT/V-QUEST reference directory sets | IMGT tools | Results for IG and TR V-DOMAIN |
|---|---|---|
| V, D, J reference sequences per species and per locus | IMGT/V-QUEST IMGT/HighV-QUEST | 1. Introduction of IMGT gaps according to the IMGT unique numbering ( |
| 2. Identification of the closest germline V, D and J genes and alleles | ||
| 3. Delimitation of the FR-IMGT and CDR-IMGT | ||
| Closest germline V gene and allele | 5. Identification of indels and their corrections (optional) ( | |
| 6. Evaluation of the percentage of identity for the V-REGION | ||
| 7. Description of mutations and amino acid (AA) changes (transitions, transversions, codon change, qualification of AA change according to the eleven IMGT AA classes ( | ||
| Closest V, D, J genes and alleles | Performed by the integrated IMGT/JunctionAnalysis ( | 8. Analysis of the Junction |
| IMGT/V-QUEST IMGT/HighV-QUEST | 9. Evaluation of the V-DOMAIN functionality | |
| Performed by the integrated IMGT/Automat ( | 10. Complete V-DOMAIN annotation ( | |
| IMGT/V-QUEST | 11. Advanced functionality for Clinical application: search for CLL subsets #2 and #8 (optional) ( |
Figure 2.Axis III workflow overview. The left panel displays all analysis processes: the data input panel shows the sources of the data present in the Axis III databases. The data analysis procedure is done by IMGT experts and it includes the analysis of the amino acid sequences and the 3D structures. Data are stored in IMGT/2Dstructure-DB and IMGT/3Dstructure-DB and linked to IMGT/mAb-DB. The right panel shows the user interface that provides flexible ways to query the data sets. The result page panel shows the online tools, such as IMGT/DomainGapAlign (Chain Details) and IMGT/Collier-de-Perles included in IMGT/2Dstructure-DB and Paratope/Epitope description and 3D structure incorporated in IMGT/3Dstructure-DB. IMGT/mAb-DB centralizes the information present in the other databases and adds several metadata for therapeutic proteins.