| Literature DB >> 32182714 |
Duck Kyun Yoo1,2, Seung Ryul Lee1,3, Yushin Jung4, Haejun Han4, Hwa Kyoung Lee1,2, Jerome Han1,2, Soohyun Kim1,3, Jisu Chae1,3, Taehoon Ryu4, Junho Chung1,2,3.
Abstract
c-Met is a promising target in cancer therapy for its intrinsic oncogenic properties. However, there are currently no c-Met-specific inhibitors available in the clinic. Antibodies blocking the interaction with its only known ligand, hepatocyte growth factor, and/or inducing receptor internalization have been clinically tested. To explore other therapeutic antibody mechanisms like Fc-mediated effector function, bispecific T cell engagement, and chimeric antigen T cell receptors, a diverse panel of antibodies is essential. We prepared a chicken immune scFv library, performed four rounds of bio-panning, obtained 641 clones using a high-throughput clonal retrieval system (TrueRepertoireTM, TR), and found 149 antigen-reactive scFv clones. We also prepared phagemid DNA before the start of bio-panning (round 0) and, after each round of bio-panning (round 1-4), performed next-generation sequencing of these five sets of phagemid DNA, and identified 860,207 HCDR3 clonotypes and 443,292 LCDR3 clonotypes along with their clonal abundance data. We then established a TR data set consisting of antigen reactivity for scFv clones found in TR analysis and the clonal abundance of their HCDR3 and LCDR3 clonotypes in five sets of phagemid DNA. Using the TR data set, a random forest machine learning algorithm was trained to predict the binding properties of in silico HCDR3 and LCDR3 clonotypes. Subsequently, we synthesized 40 HCDR3 and 40 LCDR3 clonotypes predicted to be antigen reactive (AR) and constructed a phage-displayed scFv library called the AR library. In parallel, we also prepared an antigen non-reactive (NR) library using 10 HCDR3 and 10 LCDR3 clonotypes predicted to be NR. After a single round of bio-panning, we screened 96 randomly-selected phage clones from the AR library and found out 14 AR scFv clones consisting of 5 HCDR3 and 11 LCDR3 AR clonotypes. We also screened 96 randomly-selected phage clones from the NR library, but did not identify any AR clones. In summary, machine learning algorithms can provide a method for identifying AR antibodies, which allows for the characterization of diverse antibody libraries inaccessible by traditional methods.Entities:
Keywords: antibody discovery; c-Met; machine learning; next-generation sequencing; phage display; random forest
Mesh:
Substances:
Year: 2020 PMID: 32182714 PMCID: PMC7175295 DOI: 10.3390/biom10030421
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Workflow of the machine learning-guided selection of antigen-reactive HCDR3 and LCDR3 clonotypes with confirmation of their reactivity.
Number of CDR3 clonotypes obtained from the bio-panning procedure.
| Clonotypes | Round 0 | Round 1 | Round 2 | Round 3 | Round 4 | Total |
|---|---|---|---|---|---|---|
| HCDR3 | 390,814 | 395,459 | 402,854 | 311,678 | 308,547 | 860,207 |
| LCDR3 | 272,317 | 253,899 | 250,630 | 187,314 | 117,239 | 443,292 |
Figure 2Distribution of the minimum depth of predictor variables (clonal abundance at round 0–4 of bio-panning) from individual decision trees in the RF prediction model for CDR3 clonotypes. Minimum depth value is colored according to its depth and mean value is calculated and displayed at points.
Figure 3The most influential variable interactions and distributions of CDR3 clonotypes. (a) Clonal abundance at the most influential interaction is plotted with binding property label from training data used in the random forest (RF) prediction model. AR, antigen-reactive, NR, antigen non-reactive. (b) Clonal abundance at the most influential interaction is plotted with a binding property label from validation data used in the RF prediction model. (c) Clonal abundance at the most influential interaction is plotted with confidence value (probability) from HiSeq-identified CDR3 clonotypes. Clonotypes with higher confidence values are distributed near the root variable axis (highlighted with a dashed blue circle) while clonotypes having lower confidence values are distributed below the y = x axis (dotted line) (highlighted with a dashed red circle).
Figure 4Reactivity of phage-displayed scFv clones in phage ELISA. Binding reactivity of 15 unique clones identified from the AR library and 16 unique clones from the NR library are shown. Wells in microtiter plates were either coated with recombinant mouse c-Met or just blocked with 3% BSA in PBS. Phage clones, HRP-conjugated anti-M13 antibody, and HRP substrate solution were added sequentially with intermittent washing.
Amino acid sequences of AR CDR3 clonotypes identified from AR library.
| Clone ID | HCDR3 AA * Sequence | LCDR3 AA * Sequence |
|---|---|---|
| AR1 | GSGGVDSIDA | GSYDNTYAGI |
| AR2 | SADGYGWDTAGNMDA | GSIDSNYDGI |
| AR3 | TAGTCTTSCNAGAYIDA | GGYDGSSAA |
| AR4 | TTCSGSYGWCADSIDA | GAYDSSYIGI |
| AR5 | SADSCATCATYPSEIDT | GSFDSSYVGM |
| AR6 | SADSCATCATYPSEIDT | GSFDSSYVGM |
| AR7 | SADSCATCATYPSEIDT | GSIDSNYDGI |
| AR8 | SADSCATCATYPSEIDT | GSYDSSYVGL |
| AR9 | SADSCATCATYPSEIDT | GSYDSSYDGV |
| AR10 | SADSCATCATYPSEIDT | GSFDSSYTGI |
| AR11 | SADSCATCATYPSEIDT | GSIDSRYVGI |
| AR12 | SADSCATCATYPSEIDT | GSYDSSYVGYVGV |
| AR13 | SADSCATCATYPSEIDT | GSYDNTYAGI |
| AR14 | SADSCATCATYPSEIDT | GGYDSSSGA |
| AR15 | SADSCATCATYPSEIDT | GAYDSSYIGI |
* AA: amino acid.
Clinical usage of small molecule inhibitors targeting c-Met in cancer therapy.
| Drug Name | Targets | FDA Approval Status | Approved Year |
|---|---|---|---|
| Tivantinib | c-Met, microtubule | None | N.A.* |
| Foretinib | c-Met, VEGFR-2 * | None | N.A. |
| Cabozantinib | c-Met, VEGFR, Axl | Medullary thyroid cancer | 2012 |
| Crizotinib | c-Met, ALK *, ROS1, RON * | ALK or ROS-1 positive NSCLC * | 2011 |
| Capmatinib | c-Met, EGFR *, ErbB-3 | None | N.A. |
| AMG337 | c-Met | None | N.A. |
| AZD6094 | c-Met | None | N.A. |
| BMS777607/ASLAN002 | c-Met, Axl, Tyro3, RON | None | N.A. |
| Glesatinib | c-Met, Axl | None | N.A. |
| Tepotinib | c-Met | None | N.A. |
* VEGFR-2: Vascular endothelial growth factor-2, ALK: Anaplastic lymphoma kinase, RON: Receptor d’Origine nantais, EGFR: Epidermal growth factor receptor, NSCLC: Non-small cell lung cancer, N.A.: not available.