| Literature DB >> 29276518 |
Aleksandr Kovaltsuk1, Konrad Krawczyk1, Jacob D Galson2, Dominic F Kelly3, Charlotte M Deane1, Johannes Trück2.
Abstract
Next-generation sequencing of immunoglobulin gene repertoires (Ig-seq) allows the investigation of large-scale antibody dynamics at a sequence level. However, structural information, a crucial descriptor of antibody binding capability, is not collected in Ig-seq protocols. Developing systematic relationships between the antibody sequence information gathered from Ig-seq and low-throughput techniques such as X-ray crystallography could radically improve our understanding of antibodies. The mapping of Ig-seq datasets to known antibody structures can indicate structurally, and perhaps functionally, uncharted areas. Furthermore, contrasting naïve and antigenically challenged datasets using structural antibody descriptors should provide insights into antibody maturation. As the number of antibody structures steadily increases and more and more Ig-seq datasets become available, the opportunities that arise from combining the two types of information increase as well. Here, we review how these data types enrich one another and show potential for advancing our knowledge of the immune system and improving antibody engineering.Entities:
Keywords: Antibodies; B cell; Developability; Ig-seq; Next-generation sequencing; antibody modeling; computational modeling
Year: 2017 PMID: 29276518 PMCID: PMC5727015 DOI: 10.3389/fimmu.2017.01753
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1(A) Schematic representation of an antibody IgG structure. (B) Structure of the Fv region. (C) Genetic composition of VH and VL chains [IMGT numbering (9)]: VH is colored blue; VL is green; CDRs are labeled and depicted in different colors; and disulfide bonds are in yellow.
Figure 2Two aligned pairs of VH chains extracted from SAbDab, the antibody structural database (29). Complementarity determining region-3 of the heavy chain (CDR-H3) sequences in pair (A) belong to different CDR-H3 clonotypes but adopt very similar structural configurations with a root mean square deviation (RMSD) of ~1 Å. Pair (B) includes germline precursor (4JDV) and matured (3U7W) anti-gp120 antibodies (78, 79). Although CDR-H3 sequences of pair (B) are members of the same clonotype, the RMSD shows that their CDR-H3 shapes are structurally distinct (RMSD > 2 Å). CDR-H3 loops and their amino-acid sequences are in purple and cyan colors, mismatched amino acid are in bold. The RMSD of the backbone atom positions of proteins provides a pairwise measurement of the three-dimensional dissimilarity between two sets of coordinates where solved or predicted structures are available. Sub-Angstrom RMSD indicates structurally identical shapes, while an RMSD value greater than 2 Å for a short segment indicates structurally distinct configurations (80).
Figure 3Generalized workflow of antibody modeling. First, heavy and light chain frameworks are determined by homology modeling using templates from known structures. Next, the VH/VL orientation is calculated. The third step is modeling non-H3 complementarity determining regions (CDRs), followed by modeling and grafting of CDR-H3 onto the pre-assembled scaffold. Finally, sidechains are added to the resultant structure and it is refined.
Summary of currently available resources for computational/structural annotation of antibody sequences.
| Tool type | Tool name and reference | Short tool description |
|---|---|---|
| ANTIBODY NUMBERING | ANARCI ( | Variety of schemes (North, Chothia, Kabat, IMGT, AHo). Both online and command line versions are available |
| ANTIBODY NUMBERING | Abnum ( | Online numbering tool that operates with Kabat and Chothia schemes |
| SEQUENCE ANALYSIS | IgBLAST ( | Nucleotide and amino-acid antibody sequence analysis in IMGT and KABAT schemes |
| SEQUENCE ANALYSIS | IMGT/HighV-QUEST ( | Online antibody nucleotide sequence analysis in IMGT numbering scheme |
| STRUCTURE DATABASE | SabDab ( | Weekly updating database of all publically available antibody structures. |
| STRUCTURE/SEQUENCE DATABASE | abYsis ( | Database of antibody structures and sequences |
| SEQUENCE DATABASE | DIGIT ( | Database of antibody sequences |
| ANTIBODY MODELING | ABodyBuilder ( | Homology modeling (30 s per model) |
| ANTIBODY MODELING | PIGSPro ( | Homology modeling |
| ANTIBODY MODELING | Kotai Antibody Builder ( | Homology modeling (90 min per model) |
| ANTIBODY MODELING | Accelrys ( | Hybrid modeling (30 min per model) |
| ANTIBODY MODELING | RosettaAntibody ( | |
| ANTIBODY MODELING (COMMERCIAL) | Chemical Computing group ( | Homology modeling tool combined with molecular dynamics (30 min per model) |
| CDR-H3 MODELING | Sphinx ( | Length-independent hybrid modeling (30 min per model) |
| CDR-H3 MODELING | PLOP ( | |
| CDR-H3 MODELING | FREAD ( | Homology modeling (2 min per model) |
| PARATOPE PREDICTION | Paratome ( | Structural consensus to identify additional antigen recognizing regions outside the CDRs |
| PARATOPE PREDICTION | i-Patch ( | Statistical inference to devise a likelihood for a position to form a potential contact |
| PARATOPE PREDICTION | proABC ( | Sequence-based method that leverages machine learning to predict residues that form interactions |
Many of these tools have online presence and links to these are available on our website .