| Literature DB >> 35133949 |
David Prihoda1,2, Jad Maamary3, Andrew Waight4, Veronica Juan4, Laurence Fayadat-Dilman4, Daniel Svozil1,5, Danny A Bitton2.
Abstract
Despite recent advances in transgenic animal models and display technologies, humanization of mouse sequences remains one of the main routes for therapeutic antibody development. Traditionally, humanization is manual, laborious, and requires expert knowledge. Although automation efforts are advancing, existing methods are either demonstrated on a small scale or are entirely proprietary. To predict the immunogenicity risk, the human-likeness of sequences can be evaluated using existing humanness scores, but these lack diversity, granularity or interpretability. Meanwhile, immune repertoire sequencing has generated rich antibody libraries such as the Observed Antibody Space (OAS) that offer augmented diversity not yet exploited for antibody engineering. Here we present BioPhi, an open-source platform featuring novel methods for humanization (Sapiens) and humanness evaluation (OASis). Sapiens is a deep learning humanization method trained on the OAS using language modeling. Based on an in silico humanization benchmark of 177 antibodies, Sapiens produced sequences at scale while achieving results comparable to that of human experts. OASis is a granular, interpretable and diverse humanness score based on 9-mer peptide search in the OAS. OASis separated human and non-human sequences with high accuracy, and correlated with clinical immunogenicity. BioPhi thus offers an antibody design interface with automated methods that capture the richness of natural antibody repertoires to produce therapeutics with desired properties and accelerate antibody discovery campaigns. The BioPhi platform is accessible at https://biophi.dichlab.org and https://github.com/Merck/BioPhi.Entities:
Keywords: Antibody humanization; deep learning; deimmunization; human-likeness; humanness; immune repertoires; immunogenicity; machine learning
Mesh:
Substances:
Year: 2022 PMID: 35133949 PMCID: PMC8837241 DOI: 10.1080/19420862.2021.2020203
Source DB: PubMed Journal: MAbs ISSN: 1942-0862 Impact factor: 5.857
Figure 1.BioPhi integrated pipeline for bulk humanization (Sapiens) and humanness evaluation (OASis).
Figure 2.Diversity of human antibody germlines compared to human antibody repertoires from OAS.
Figure 3.OASis provides agranular and interpretable humanness score that is able to separate therapeutic antibodies of different origin.
Evaluation of antibody humanness scores. A granular score reports how different residues or stretches of the sequence contribute to the overall humanness. An interpretable score explains the humanness of a particular sequence comprehensively to the user, for example by near sequence matches or prevalence statistics. Diversity is reported as the size of the reference sequence library (in orders of magnitude). Separately for each column, values above 75% percentile are marked in bold, values below 25% percentile are marked in italics.
| | Granularity | Interpretability | Diversity (seqs) | Humanness classification | Immunogenicity | ||
|---|---|---|---|---|---|---|---|
| Method | Accuracy (%) | ROC AUC | R | R2 | |||
| Z-score | ✕ no | ✓ yes | |||||
| T20 | ✕ no | ✓ yes | 10^4 | ||||
| AbLSTM | ✕ no | ✕ no | 10^4 | 87.8 | 93.7 | −0.47 | 0.22 |
| MG Score | ✕ no | ✕ no | 91.8 | 95.9 | −0.46 | 0.21 | |
| IgReconstruct | ✕ no | ✓ yes | 92.6 | 96.7 | |||
| Germline content | ✓ yes | ✓ yes | 92.8 | 96.8 | −0.50 | 0.25 | |
| Hu-mAb | ✕ no | ✕ no | 10^7 | ||||
| OASis identity (loose) | ✓ yes | ✓ yes | 91.9 | 96.4 | −0.48 | 0.23 | |
| OASis identity (relaxed) | ✓ yes | ✓ yes | −0.51 | 0.26 | |||
| OASis identity (medium) | ✓ yes | ✓ yes | 92.6 | 96.6 | |||
| OASis identity (strict) | ✓ yes | ✓ yes | 91.9 | 95.6 | |||
Figure 4.Attention within the Sapiens neural network captures long-range dependencies between antibody loops.
Figure 5.Sapiens achieved abalanced humanness-preservation tradeoff on 25 antibodies with known parental sequences.
Evaluation of humanization methods. Separately for each column, values above 75% percentile are marked in bold, values below 25% percentile are marked in italics. Humanness change was computed as average absolute difference of OASis medium identity (or T20 score) of the humanized sequence and the parent sequence. Therefore, an increase of +34% refers to the absolute change in the humanness score (e.g. from 40% to 74%), not a relative change. Parental preservation was calculated as sequence identity of the parental and humanized sequence under Kabat numbering, in full sequence or Vernier regions only. Humanizing mutation precision was calculated as number of shared mutations (made both in predicted sequence and in experimentally humanized sequence), divided by total number of mutations made in the predicted sequence
| | | | Humanness improvement | Preservation | Humanizing mutation precision | |||
|---|---|---|---|---|---|---|---|---|
| Method | OASis | T20 | Total | Vernier | Total | Vernier | ||
| 25 known pairs | Experimental | 86% | - | - | ||||
| Sapiens*1 | +30% | +10% | 90% | |||||
| Sapiens*2 | +33% | +12% | 86% | 87% | 74% | 42% | ||
| Sapiens*3 | +12% | 86% | 86% | 73% | 40% | |||
| Sapiens*4 | 86% | 86% | 72% | 40% | ||||
| Hu-mAb | 91% | 72% | ||||||
| Straight graft | ||||||||
| Vernier graft | +32% | +11% | 85% | 70% | - | |||
| Experimental | +30% | +11% | 84% | 87% | - | - | ||
| 152 putative pairs | Automatic | Sapiens*1 | +30% | +10% | 89% | 72% | 48% | |
| Hu-mAb | ||||||||
| Straight graft | +33% | 84% | ||||||
| Vernier graft | +11% | 87% | 70% | - | ||||
| Manual | Sapiens *1 | |||||||
| Hu-mAb | 48% | |||||||
| Straight graft | 45% | |||||||
| Vernier graft | +30% | +11% | 85% | - | ||||
Figure 6.Evaluation of humanization methods on alarge scale using 152 humanized therapeutic antibodies with putative parental sequences.