| Literature DB >> 30515166 |
Swapnil Mahajan1, Randi Vita1, Deborah Shackelford1, Jerome Lane1, Veronique Schulten1, Laura Zarebski1, Martin Closter Jespersen2, Paolo Marcatili2, Morten Nielsen2,3, Alessandro Sette1,4, Bjoern Peters1,4.
Abstract
The Immune Epitope Database (IEDB) is a free public resource which catalogs experiments characterizing immune epitopes. To accommodate data from next generation repertoire sequencing experiments, we recently updated how we capture and query epitope specific antibodies and T cell receptors. Specifically, we are now storing partial receptor sequences sufficient to determine CDRs and VDJ gene usage which are commonly identified by repertoire sequencing. For previously captured full length receptor sequencing data, we have calculated the corresponding CDR sequences and gene usage information using IMGT numbering and VDJ gene nomenclature format. To integrate information from receptors defined at different levels of resolution, we grouped receptors based on their host species, receptor type and CDR3 sequence. As of August 2018, we have cataloged sequence information for more than 22,510 receptors in 18,292 receptor groups, shown to bind to more than 2,241 distinct epitopes. These data are accessible as full exports and through a new dedicated query interface. The later combines the new ability to search by receptor characteristics with previously existing capability to search by epitope characteristics such as the infectious agent the epitope is derived from, or the kind of immune response involved in its recognition. We expect that this comprehensive capture of epitope specific immune receptor information will provide new insights into receptor-epitope interactions, and facilitate the development of novel tools that help in the analysis of receptor repertoire data.Entities:
Keywords: AIRR; BCR; CDR; IEDB; TCR; antibody; epitope; repertoire sequencing
Mesh:
Substances:
Year: 2018 PMID: 30515166 PMCID: PMC6255941 DOI: 10.3389/fimmu.2018.02688
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Information captured in the IEDB. Detailed information related to the immune exposure of the host, type of assay used to test the immune response, and the reference of the data is captured in the IEDB. Data shown in this figure is from IEDB Assay ID: 1479091.
Data structure and grouping of captured receptor information.
| Receptor name | PMEL17 | |||
| Source organism | Homo sapiens | Homo sapiens | Homo sapiens | |
| Sequence identifier | Chain1: NCBI:5EU6_D | |||
| Protein sequence | Chain1: MKQEVTQIPAALS… | |||
| Nucleotide sequence | – | |||
| V gene | Chain1: TCRAV21 | Chain1: TRAV21*01 | Chain1: TRAV21*01 | |
| D gene | – | – | – | |
| J gene | – | Chain1: TRAJ53*01 | Chain1: TRAJ53*01 | |
| Receptor type | αβ | αβ | αβ | αβ |
| Chain type | Chain1: α | Chain1: α | Chain1: α | Chain1: α |
| Variable domain sequence | – | Chain1:KQEVTQIPA… | Chain1:KQEVTQIPA… | |
| CDR1 sequence | Chain1:DSAIYN | Chain1:DSAIYN | Chain1:DSAIYN | |
| CDR1 positions | – | Chain1: 28-33 | ||
| CDR2 sequence | Chain1:IQSSQRE | Chain1:IQSSQRE | Chain1:IQSSQRE | |
| CDR2 positions | – | Chain1: 51-57 | ||
| CDR3 sequence | Chain1: AVLSSGGSNYKLTF | Chain1: AVLSSGGSNYKLT | Chain1: AVLSSGGSNYKLT | Chain1: AVLSSGGSNYKLT |
| CDR3 positions | – | Chain 1: 92-104 | ||
Receptor data captured from publications is shown in ‘assay receptor’ column (IEDB assay ID: 2723539). The values in distinct receptor column were used for creating distinct receptor entries by combining receptors from different assays. If variable domain sequence was not available then CDR 1, 2 and 3 sequences were used to create distinct receptors. Similarly, the values in receptor group column are used for clustering similar distinct receptors in a group.
Figure 2Assay receptors. The curated and calculated assay receptor information is displayed side by side on the assay details pages in the IEDB. Data shown in this figure is from the IEDB Assay ID: 2723539.
Figure 3Receptor groups. Receptors are grouped based on their type, CDR3 sequence/s and host organism. Next generation repertoire sequencing experiments can report only a single chain CDR3 sequence for a receptor. Therefore, we group receptors hierarchically in groups with identical single chain CDR3 sequences (receptor group ID: 11040) which are divided in receptor groups based on CDR3 sequences from the other chain (receptor group ID: 1162 and 1525).
Figure 4Capturing engineered, camelid and other special receptor types in the IEDB. The nanobodies and HCAbs in the IEDB are captures under heavy and heavy-heavy receptor types. The heavy and light chain variable domains in the scFv are captured as individual chains under scFv receptor type. The diabodies are captured as constructs. The heavy and light chain pairs in the diabodies which bind to two different epitopes are captured as two different assays.
Figure 5Querying IEDB using antibody or TCR sequences. (A) In the past, user query results in the IEDB were displayed in different tabs named epitopes, antigens, assays, and references. We added a new results tab for receptors to display different receptor groups corresponding to the user query. (B) All the receptor information in results can be downloaded using “export results” link in the “receptors” tab. Similarly, more detailed results are downloaded from “assays” tab. (C) Users can filter any query results by receptor full length protein or CDR sequences using the new receptor search panel. The example shown is to filter results by antibody (receptor type is BCR heavy-light) heavy chain with “CSYAGGKSLV” as CDR3 sequence.
Figure 6Receptor details. Receptor details are split into 3 sections. (A) The first section is a short summary of receptor group. This section has information on accessions of receptor chains and PDB IDs of receptor-antigen complexes involving individual receptors from this receptor group, if available. (B) The second section provides information of individual receptors in the receptor group. This section provides CDR sequences, VDJ gene usage, variable domain sequences and epitopes which are recognized by each receptor. (C) The last section provides a short summary of epitopes recognized by receptor group including assays and publications, e.g., an antibody in group ID 651 recognizes two different epitopes from Dengue and one epitope from Zika genome polyproteins.
Receptor groups.
| Total curated receptors | 22,510 | 2,241 |
| Distinct receptors | 19,537 | |
| Receptor groups | 18,292 | |
| TCR groups | 16,949 | 536 |
| BCR groups | 1,343 | 1,714 |
Figure 7Distribution of the available receptors from different organisms. Over 90% of the antigen receptor data in the IEDB are from humans and around 8% from mice.