| Literature DB >> 25760769 |
Olga Obrezanova1, Andreas Arnell, Ramón Gómez de la Cuesta, Maud E Berthelot, Thomas R A Gallagher, Jesús Zurdo, Yvette Stallwood.
Abstract
Aggregation is a common problem affecting biopharmaceutical development that can have a significant effect on the quality of the product, as well as the safety to patients, particularly because of the increased risk of immune reactions. Here, we describe a new high-throughput screening algorithm developed to classify antibody molecules based on their propensity to aggregate. The tool, constructed and validated on experimental aggregation data for over 500 antibodies, is able to discern molecules with a high aggregation propensity as defined by experimental criteria relevant to bioprocessing and manufacturing of these molecules. Furthermore, we show how this tool can be combined with other computational approaches during early drug development to select molecules with reduced risk of aggregation and optimal developability properties.Entities:
Keywords: CDR, complementarity determining region; CH1, heavy chain constant domain; CL, light chain constant domain; ELISA, enzyme-linked immunosorbent assay; Fab, fragment antigen-binding; Fv, fragment variable; IgG, immunoglobulin G; ODA, oligomer detection assay; SE-HPLC, size exclusion high pressure liquid chromatography; VH, heavy chain variable region; VL, light chain variable region; aggregation; aggregation prediction; biotherapeutics; developability assessment; monoclonal antibody
Mesh:
Substances:
Year: 2015 PMID: 25760769 PMCID: PMC4622581 DOI: 10.1080/19420862.2015.1007828
Source DB: PubMed Journal: MAbs ISSN: 1942-0862 Impact factor: 5.857
Figure 1.Counts of representatives of germline families in the antibody set. (A) heavy chain germline families, (B) κ and λ light chain germline families.
Figure 2.The antibody sequences are plotted in the space of 2 principal components. The principal component analysis was performed based on sequence distance measure. Each marker corresponds to an antibody sequence. Crosses denote κ ‘design’ subset, squares – κ ‘PDB’ subset, black circles – therapeutic antibodies. Diamonds and pluses denote λ ‘design’ and λ ‘PDB’ subsets, respectively.
Performance statistics of aggregation models on training and test sets and by Monte-Carlo simulations
| Set/method of validation | Number of antibodies | Overall accuracy | Accuracy in Low | Accuracy in High | Specificity in Low | Specificity in High |
|---|---|---|---|---|---|---|
| training set | 256 | 92% | 89% | 95% | 95% | 88% |
| MC-whole set | 341 | 72% | 75% | 68% | 75% | 69% |
| test set | 85 | 76% | 78% | 75% | 81% | 71% |
| training set | 355 | 91% | 88% | 95% | 96% | 84% |
| MC-whole set | 472 | 68% | 67% | 71% | 76% | 60% |
| test set | 117 | 70% | 75% | 64% | 72% | 68% |
Figure 3.Descriptor importance for the most important 50 descriptors for the κ model. Solid bars indicate descriptor importance estimated from the model, hashed bars indicate descriptor importance estimated using MC validation. Both descriptor importance measures were normalized to fit the same axis range. The descriptor name indicates an AHo position of the central residue for this descriptor and an amino acid scale name (in parenthesis), the first character signifies location of the central residue, f is for frameworks and c is for CDRs. (A) Light chain variable region descriptors. (B) Heavy chain variable region descriptors.
Accuracy of predictive models evaluated on 49 antibody variants from ‘external’ data set
| Low | 43 (42 κ + 1 λ) | 36 | 84% |
| High | 6 (4 κ + 2 λ) | 5 | 83% |
| Overall | 49 (46 κ + 3 λ) | 41 | 84% |
Figure 4.Percentage of soluble aggregate measured by SE-HPLC for re-engineered variants of antibody D1.3. WT is parental antibody. Hashed bars indicate antibodies with predicted Low aggregation risk, the solid bar indicates antibody with predicted High aggregation risk.
Predicted aggregation risk for the wild type, the reference and the 7 variants of the anti-IFNγ antibody
| Variant | Predicted aggregation risk |
|---|---|
| WT | High |
| REF | High |
| #1–5 | Low |
| #6–7 | High |
Figure 5.DRB1 scores for the engineered anti-IFNγ antibody variants. Solid, hashed and dotted bars indicate the DRB1 scores for the whole antibodies, heavy variable chains and light variable chains, respectively. 80% of marketed therapeutic humanized antibodies have DRB1 scores not exceeding 1400 (data not shown).
Figure 6.In vitro characterization of engineered anti-IFNγ antibody variants – (A) product titer measured post Protein A purification (in mg/ml), (B) percentage of monomer, (C) binding affinity log(EC50) (EC50 in pM).
Comparison of the aggregation prediction tool to the Developability Index tool on antibody variants from the ‘external’ data set
| DI prediction tool | Lonza's aggregation prediction tool | |||
|---|---|---|---|---|
| Observed aggregation class | Number of antibodies | Accuracy | Number of antibodies | Accuracy |
| Low | 40 | 48% | 43 | 84% |
| High | 5 | 100% | 6 | 83% |
| Overall | 45 | 53% | 49 | 84% |