| Literature DB >> 34546288 |
Constantin Schneider1, Andrew Buchanan2, Bruck Taddese3, Charlotte M Deane1.
Abstract
MOTIVATION: Antibodies are one of the most important classes of pharmaceuticals, with over 80 approved molecules currently in use against a wide variety of diseases. The drug discovery process for antibody therapeutic candidates however is time- and cost-intensive and heavily reliant on in-vivo and in-vitro high throughput screens. Here, we introduce a framework for structure-based deep learning for antibodies (DLAB) which can virtually screen putative binding antibodies against antigen targets of interest. DLAB is built to be able to predict antibody-antigen binding for antigens with no known antibody binders.Entities:
Year: 2021 PMID: 34546288 PMCID: PMC8723137 DOI: 10.1093/bioinformatics/btab660
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.DLAB-Re improves docking performance on the crystal structure dataset (A, B) and the model dataset (C, D). On crystal structure data, ZDOCK ranking and DLAB-Re ranking perform similarly and well. On models, the ZDOCK baseline performance is considerably worse and DLAB-Re significantly improves ranking performance. (A, C) DLAB-Re ranks the top 500 poses generated by ZDOCK better than ZDOCK, enriching the ratio of pairings with fnat > 0.5 poses ranked highly. The dashed line indicates the fraction of docking runs in which a pose with fnat > 0.5 is present in the 500 assessed poses.(B, D) Using the DLAB-Re-max score to remove 40%, 60% or 80% of the antibody–antigen pairings, respectively, can remove antibody–antigen pairings which did not yield high-fnat poses. This selects for pairings for which fnat > 0.5 poses exist in the top 500 poses generated by ZDOCK (dashed line) and for which DLAB-Re ranks the top 500 poses well (solid line). Error bars are ±one standard deviation, approximated, as described in Section 3
Fig. 2.DLAB-VS and ZDock binder classification performance. For each approach, the ratio of pairings for which the binding antibody was ranked in the top 2%, top 5%, top 10% and top 20% respectively is shown. (A) Comparison of the performance of ZDock and DLAB-VS binder classification on the model dataset to the random expectation (‘random’) of finding the binder in the top N%. Using the combination of DLAB-VS and ZDock scores (‘DLAB-VS+ZDock’) detailed in Section 3 and supplementing it with the DLAB-Re-max thresholding (‘DLAB-VS+ZDock + thresholding’), the classification performance on the model dataset can be improved significantly. (B) Performance on the post-snapshot model dataset after removing any CDR sequences with overlap to the model dataset (at 90% CDR sequence identity as defined above) both without (‘DLAB-VS+ZDock’) and with (‘DLAB-VS+ZDock + thresholding’) DLAB-Re-max score thresholding. Performance for the CDR sequences clustering with sequences in the model dataset is shown in Supplementary Figure S12