| Literature DB >> 33448242 |
Wing Ki Wong1, Sarah A Robinson1, Alexander Bujotzek2, Guy Georges2, Alan P Lewis3, Jiye Shi4, James Snowden4, Bruck Taddese5, Charlotte M Deane1.
Abstract
Solving the structure of an antibody-antigen complex gives atomic level information of the interactions between an antibody and its antigen, but such structures are expensive and hard to obtain. Alternative experimental sources include epitope mapping and binning experiments, which can be used as a surrogate to identify key interacting residues. However, their resolution is usually not sufficient to identify if two antibodies have identical interactions. Computational approaches to this problem have so far been based on the premise that antibodies with similar sequences behave similarly. Such approaches will fail to identify sequence-distant antibodies that target the same epitope. Here, we present Ab-Ligity, a structure-based similarity measure tailored to antibody-antigen interfaces. Using predicted paratopes on model antibody structures, we assessed its ability to identify those antibodies that target highly similar epitopes. Most antibodies adopting similar binding modes can be identified from sequence similarity alone, using methods such as clonotyping. In the challenging subset of antibodies whose sequences differ significantly, Ab-Ligity is still able to predict antibodies that would bind to highly similar epitopes (precision of 0.95 and recall of 0.69). We compared Ab-Ligity's performance to an existing tool for comparing general protein interfaces, InterComp, and showed improved performance on antibody cases achieved in a substantially reduced time. These results suggest that Ab-Ligity will allow the identification of diverse (sequence-dissimilar) antibodies that bind to the same epitopes from large datasets such as immune repertoires. The tool is available at http://opig.stats.ox.ac.uk/resources.Entities:
Keywords: Antibody; comparison; paratope; structure
Year: 2021 PMID: 33448242 PMCID: PMC7833755 DOI: 10.1080/19420862.2021.1873478
Source DB: PubMed Journal: MAbs ISSN: 1942-0862 Impact factor: 5.857
Number of positive and negative comparisons in the datasets, based on Ab-Ligity’s definition of similar epitopes
| Set | Positive | Negative |
|---|---|---|
| Full | 719 | 29,557 |
| CDRH3 ≤ 0.8 set | 266 | 29,519 |
Precision and recall on the full and CDRH3 ≤ 0.8 sets, using the selected paratope similarity thresholds for Ab-Ligity (0.1) and InterComp (0.6), based on Ab-Ligity’s definition of similar epitopes
| Method | Ab-Ligity | InterComp | ||
|---|---|---|---|---|
| Set | Precision | Recall | Precision | Recall |
| Full | 0.95 | 0.85 | 0.92 | 0.77 |
| CDRH3 ≤ 0.8 set | 0.95 | 0.69 | 0.86 | 0.59 |
Precision and recall on the full set, using heavy chain and light chain only homology models and the predicted paratopes on the corresponding single domain
| Method | Ab-Ligity | InterComp | ||
|---|---|---|---|---|
| Domain | Precision | Recall | Precision | Recall |
| VH | 0.88 | 0.78 | 0.74 | 0.82 |
| VL | 0.67 | 0.89 | 0.17 | 0.94 |
Figure 1.Analysis of anti-lysozyme (HEL) antibodies with dissimilar CDRH3 sequences and highly similar epitopes
Figure 2.Analysis of two anti-gp120 antibodies. VRC01 antibody (PDB code and chain IDs: 4LSS_HL_G) is colored in blue; VRC07 (PDB code and chain IDs: 4OLU_HL_G) is shown in pink. The gp120 core antigen is displayed as white surfaces and the superimposed antibodies are in cartoons. The CDRH3 loops are in solid shades of the cartoon representation. The PDB code, heavy and light chain ID, and antigen chain ID are separated by (‘_’) and listed in the legend with the corresponding CDRH3 sequences. CDRH3 sequences are shown by aligning their IMGT positions and ‘-’ indicates a gap in the alignment according to the IMGT numbering scheme.[29] Crystal paratope residues within the CDRs are in bold, and Parapred-predicted paratopes within the CDRs are underlined. The other five CDR sequences are listed in Supplementary Table S16. The Ab-Ligity paratope similarity score for the pair is listed
Figure 3.The Ab-Ligity workflow
Residue groupings for tokenization
| Group | Residues |
|---|---|
| Aliphatic | Glycine (G), Alanine (A), Valine (V), Leucine (L), Isoleucine (I), Proline (P) |
| Hydroxyl | Serine (S), Threonine (T) |
| Sulfur | Cysteine (C), Methionine (M) |
| Aromatic | Phenylalanine (F), Tyrosine (Y), Tryptophan (W) |
| Acidic | Aspartic acid (D), Glutamic acid (E) |
| Amine | Asparagine (N), Glutamine (Q) |
| Basic | Histidine (H), Lysine (K), Arginine (R) |