| Literature DB >> 25071986 |
Dimitris Nikoloudis1, Jim E Pitts1, José W Saldanha2.
Abstract
Classification of antibody complementarity-determining region (CDR) conformations is an important step that drives antibody modelling and engineering, prediction from sequence, directed mutagenesis and induced-fit studies, and allows inferences on sequence-to-structure relations. Most of the previous work performed conformational clustering on a reduced set of structures or after application of various structure pre-filtering criteria. In this study, it was judged that a clustering of every available CDR conformation would produce a complete and redundant repertoire, increase the number of sequence examples and allow better decisions on structure validity in the future. In order to cope with the potential increase in data noise, a first-level statistical clustering was performed using structure superposition Root-Mean-Square Deviation (RMSD) as a distance-criterion, coupled with second- and third-level clustering that employed Ramachandran regions for a deeper qualitative classification. The classification of a total of 12,712 CDR conformations is thus presented, along with rich annotation and cluster descriptions, and the results are compared to previous major studies. The present repertoire has procured an improved image of our current CDR Knowledge-Base, with a novel nesting of conformational sensitivity and specificity that can serve as a systematic framework for improved prediction from sequence as well as a number of future studies that would aid in knowledge-based antibody engineering such as humanisation.Entities:
Keywords: Antibody structure; CDR conformation; Canonical model; Clustering; Dynamic hybrid tree-cut; Humanisation; Nested architecture; Prediction; Redundant repertoire
Year: 2014 PMID: 25071986 PMCID: PMC4103072 DOI: 10.7717/peerj.456
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Summary of clustering dataset contents.
Total clustered members per CDR include outliers and singletons.
| Total PDB files | 1,351 |
| Files containing structures from two | 8/5 |
| Total antibody structures | 1,359 |
| Total number of CDRs | 13,086 |
| CDRs with missing C | 374 |
| Total clustered CDRs | 12,712 |
| CDR-L1 clustered | 2,155 |
| CDR-L2 clustered | 2,174 |
| CDR-L3 clustered | 2,164 |
| CDR-H1 clustered | 2,057 |
| CDR-H2 clustered | 2,130 |
| CDR-H3 clustered | 2,032 |
| Total non-redundant CDR sequences | 2,827 |
| PDB files with lambda isotypes | 194 |
| Heavy only | 77 |
| Light only | 78 |
| PDB files with bound antibodies | 673 |
Figure 1Superposition of 7-residue and 11-residue CDR-L2.
The 5 C-terminal residues of 1A4K (in red) 7 residue CDR-L2 (L52–L56) are superposed to the equivalent portion of 3FFD (in blue) 11 residue CDR-L2. Position L51 is highlighted in green, as the best insertion point in the structural numbering scheme. Graphics created with Swiss-PdbViewer (http://www.expasy.org/spdbv/).
Figure 2Superposition of Light Framework 3 with an insertion onto a typical LFR3.
Residues L60–L75 of crystal structure 1PW3 (in red), containing an insertion, are superposed onto a typical example of the equivalent Light chain fragment (here 1A4K, in blue). The new insertion point was introduced in position L67 (highlighted in green). Graphics created with Swiss-PdbViewer (http://www.expasy.org/spdbv/).
Figure 3Superposition of Heavy Framework 3 with an insertion onto a typical HFR3.
The Cα-trace of a two-leg superposition of residues H65–H73 and H76–H78 of crystal structures 3RPI (in yellow) and 3SE8 (in red), containing an insertion, onto the equivalent residues of a typical structure without an insertion (here 3MLY, in blue). The proposed insertion point H74 is highlighted in green in 3MLY and is shown with its side chain (Ser). Graphics created with Swiss-PdbViewer (http://www.expasy.org/spdbv/).
Modifications brought to the numbering scheme.
Modifications brought to the numbering scheme in the light of new and atypical sequences. LFR3, light chain framework 3; HFR3, heavy chain framework 3. CDR-H3 insertion positions H100uvw were not required in the present dataset, but were added for the technical continuity up to the pre-existing positions H100xyz and for future use. Thus 3U1S has a CDR-H3 length of 31 residues.
| Locus | Numbering | Maximum | Structures with the | CDR |
|---|---|---|---|---|
| CDR-L1 | – | 17 | N/A | L24-L34 |
| CDR-L2 | L51abcd | 11 | 2GSG, 2H32, 2H3N, 2OTU, | L50-L56 |
| LFR3 | L67ab | N/A | 1PW3 | N/A |
| CDR-L3 | L95cd | 13 | 2GSG, 2OTU, 2QHR, 3FFD, | L89-L97 |
| CDR-H1 | H31cdefghijk | 24 | 3K3Q | H23–H35 |
| CDR-H2 | H52ef | 15 | 3TWC, 3TYG | H50–H58 |
| HFR3 | H74abcdefg | N/A | 3SE8 | N/A |
| CDR-H3 | H100nopqrstuvw | 34 | 3U1S | H95-H102 |
Figure 4Ramachandran plot divided into conformational regions.
A: α-helix region; B: β-sheet region; D: δ-region; G: γ-region; L: left-handed helix region; P: polyproline II region. For the construction of reduced-Ramachandran logos (r-RL), residues belonging to regions with similar conformations were represented by the same letter: (A/D) = A, (B/P) = B, (L/G) = L. For the construction of full-Ramachandran logos (f-RL), each conformational region was represented individually. E.g., Ramachandran logos for CDR-L3 1TJH_L:r-RL: BBAABBBBB f-RL: BBDABPPPB.
Figure 5Example of the nested clusters architecture.
Level-1 cluster H1-13-III (i.e., the third top-level cluster of 13-residues CDR-H1), defined by RMSD-based hierarchical clustering, contains 3 Level-2 clusters, the members of each sharing the same reduced-Ramachandran logo, and in total 11 Level-3 clusters, the members of each sharing the same full-Ramachandran logo. All Level-3 clusters share the same reduced-Ramachandran logo with their parent Level-2 cluster, but each one displays a distinct full-Ramachandran logo.
Figure 6Illustration of the parameters taken into account for the dendrogram pruning of CDR-L1/12 residues with the Dynamic Hybrid method. The minimum gap statistic (gmin) defines the minimum required distance between the average core scatter and the joining height of the clusters (‘Gap’), for successful cluster formation. In this example, gmin is set lower than the displayed Gaps, so nodes above its value were considered as different clusters.
Summary table for the clustering of CDR-L1.
Unique sequence percentages per cluster are based on the total number of unique sequences in that length (i.e., total non-redundant sequences in that length, including outliers/singletons) and are rounded to the closest integer or to the closest first decimal, if lower than 0.5%. Two different clusters of the same length may contain the same CDR sequence.
| Cluster | Population | Unique | Medoid | Medoid | Level-3 | Level-2 | Cluster diameter (Å) | Best | Species in cluster | Isotype |
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| L1-7-I | 2 | 1 (100%) | 3RPI_L | BBBGPBB | 1 | 1 | 0.00 (3RPI_L-3RPI_B) | 2.65 (3RPI_L) | HUMAN |
|
|
| ||||||||||
| L1-9-I | 7 | 3 (75%) | 3NGB_K | PBAPBBPPB | 6 | 2 | 0.96 (3NGB_L-3Sé_L) | 2.0 (3Sé_L) | HUMAN |
|
| L1-9-II | 3 | 1 (25%) | 3TV3_L | PLPDBAPBB | 3 | 2 | 0.47 (3TYG_L-3TWC_L) | 1.29 (3TV3_L) | HUMAN |
|
|
| ||||||||||
| L1-10-I | 126 | 27 (96%) | 1sy6_L | BPABPBABBB | 31 | 3 | 0.91 (3C09_B-2Z92_B) | 1.6 (3OZ9_L) | HUMAN, MOUSE |
|
|
| ||||||||||
| L1-11-I | 973 | 160 (89%) | 2fjf_W | BPABPDGDPBB | 120 | 28 | 1.29 (3fct_C-1ty7_L) | 1.2 (3D9A_L) | HUMAN, MOUSE, |
|
| L1-11-II | 35 | 9 (5%) | 1w72_L | PBPLAAABBPB | 17 | 5 | 1.05 (2g75_D-1lil_A) | 1.9 (3Q6G_L) | HUMAN, |
|
| L1-11-III | 23 | 7 (4%) | 3MLV_L | PBADAADBPBB | 7 | 1 | 0.76 (3UJI_L-1nfd_G) | 1.6 (3UJI_L) | HUMAN, MOUSE |
|
| L1-11-IV | 2 | 1 (1%) | 1tzh_A | PBPBPAAPBBB | 2 | 1 | 0.23 (1tzh_A-1tzh_L) | 2.6 (1tzh_A) | MOUSE |
|
|
| ||||||||||
| L1-12-I | 33 | 15 (58%) | 1hq4_A | BBABPBPAADBB | 13 | 5 | 1.02 (3LS4_L-3EO1_D) | 1.9 (1ORS_A) | HUMAN, MOUSE |
|
| L1-12-II | 24 | 6 (23%) | 2fx7_L | BPABPPPLLPBB | 7 | 1 | 0.66 (2b4c_L-1dn0_A) | 1.76 (2fx7_L) | HUMAN |
|
| L1-12-III | 14 | 2 (8%) | 3JUY_C | BPABPBAALPBB | 8 | 2 | 0.51 (1ob1_A-1n0x_M) | 1.8 (1n0x_L) | HUMAN, MOUSE |
|
| L1-12-IV | 10 | 2 (8%) | 2OTU_A | BPPAADADPPBB | 3 | 1 | 0.27 (2OTU_C-2GSG_C) | 1.68 (2OTU_A) | MOUSE |
|
|
| ||||||||||
| L1-13-I | 61 | 19 (73%) | 3IYW_L | BBBAADAADBPBB | 26 | 7 | 0.87 (2ig2_L-2b0s_L) | 1.43 (3N9G_L) | HUMAN, MOUSE |
|
| L1-13-II | 14 | 6 (23%) | 1pew_B | BPABGPAAABPBB | 5 | 1 | 0.66 (3H0T_A-3BDX_B) | 1.6 (1pew_A) | HUMAN |
|
| L1-13-III | 6 | 1 (4%) | 3FKU_X | BBBAADAAAAGBB | 6 | 2 | 0.35 (3FKU_U-3FKU_Y) | 3.2 (3FKU_X) | HUMAN |
|
|
| ||||||||||
| L1-14-I | 112 | 10 (40%) | 1oar_L | BBAAGPPBAAALPB | 36 | 5 | 1.08 (1oau_O-1nj9_L) | 1.5 (1oaq_L) | HUMAN, MOUSE, |
|
| L1-14-II | 66 | 9 (36%) | 3U4E_B | BBBAADAAABABBB | 50 | 25 | 1.34 (1mcw_W-1mcq_A) | 1.5 (3KDM_L) | HUMAN, SEAL |
|
| L1-14-III | 3 | 1 (4%) | 1lgv_B | BLAAAPPLAGDPBB | 2 | 1 | 0.15 (1lhz_B-1jvk_B) | 1.94 (1jvk_B) | HUMAN |
|
| L1-14-IV | 3 | 1 (4%) | 1jvk_A | BLAPPAPGBPDPBB | 3 | 3 | 0.44 (1lhz_A-1lgv_A) | 1.94 (1jvk_A) | HUMAN |
|
| L1-14-V | 3 | 3 (12%) | 7fab_L | BBBAADAADLBPBB | 3 | 1 | 0.36 (3H42_L-1aqk_L) | 1.84 (1aqk_L) | HUMAN |
|
| L1-14-VI | 3 | 1 (4%) | 2H3N_A | PPPGABPAADBPBB | 3 | 2 | 0.39 (2H3N_C-2H32_A) | 2.3 (2H3N_A) | HUMAN |
|
| L1-14-VII | 3 | 1 (4%) | 1mcs_B | PPAPPDPLPBDAPB | 3 | 3 | 1.18 (1mcn_B-1mcc_B) | 2.7 (1mcc_B) | HUMAN |
|
|
| ||||||||||
| L1-15-I | 26 | 13 (38%) | 2Y5T_B | BBABPDPBLLBPPBB | 14 | 3 | 0.67 (3ZTJ_L-2nz9_C) | 2.0 (1h0d_A) | HUMAN, MOUSE |
|
| L1-15-II | 6 | 3 (9%) | 1i9r_L | BPABPDBBADBBPBB | 4 | 1 | 0.61 (3PHO_A-1i7z_C) | 2.0 (3PHQ_A) | HUMAN, MOUSE |
|
|
| ||||||||||
| L1-16-I | 309 | 66 (89%) | 3QCU_L | BBABPAPPAALPBPBB | 81 | 13 | 1.24(2GJZ_A-1f3d_L) | 1.22 (1mju_L) | HUMAN, MOUSE, |
|
| L1-16-II | 3 | 1 (1%) | 1cfv_L | BPABPDDABABPLPBB | 3 | 1 | 0.22 (2bfv_L-1bfv_L) | 2.1 (1bfv_L) | MOUSE |
|
| L1-16-III | 3 | 1 (1%) | 3FO9_L | BPABPDPABPLBBPBB | 3 | 1 | 0.36 (3FO9_A-1axt_L) | 1.9 (3FO9_L) | MOUSE |
|
| L1-16-IV | 2 | 1 (1%) | 1l7s_L | BPABPPPPBLGPPABB | 1 | 1 | 0.00 (1VPO_L-1l7s_L) | 2.15 (1l7s_L) | MOUSE |
|
| L1-16-V | 2 | 1 (1%) | 1nak_L | BBABPDGBDLDDPPBB | 1 | 1 | 0.04 (1nak_M-1nak_L) | 2.57 (1nak_L) | MOUSE |
|
|
| ||||||||||
| L1-17-I | 171 | 36 (100%) | 3MNZ_A | BBABPDPPAADLPPPBB | 71 | 17 | 1.51 (1xcq_G-1him_H) | 1.45 (1q9r_A) | HUMAN, MOUSE, |
|
|
|
|
|
| |||||||
Summary table for the clustering of CDR-H2.
Level-2 clusters are shown exceptionally (marked with an asterisk) when no level-1 cluster is formed (minimum of 2 members required).
| Cluster | Population | Unique | Medoid | Medoid | Level-3 | Level-2 | Cluster diameter (Å) | Best resolution | Species in cluster |
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| H2-8-I | 6 | 2 (100%) | 1f2x_K | BBBGAPBB | 3 | 3 | 1.00 (2OJZ_H-1f2x_K) | 1.89 (2OK0_H) | CAMEL, MOUSE |
|
| |||||||||
| H2-9-I | 412 | 110 (94%) | 1fe8_I | BBPAALPBB | 31 | 9 | 1.47 (3K81_B-2aj3_B) | 1.2 (3D9A_H) | HUMAN, MOUSE, |
| H2-9-II | 11 | 3 (3%) | 2ak1_H | BBBLLDBBB | 3 | 2 | 1.03 (2aju_H-1mco_H) | 1.5 (2aju_H) | HUMAN, MOUSE |
| H2-9-III | 4 | 1 (1%) | 3UAJ_H | BBBADPPPB | 2 | 1 | 0.14 (3UC0_H-3UAJ_H) | 2.71 (3UC0_H) | CHIMPANZEE |
| H2-9-IV | 4 | 2 (2%) | 1ken_H | BLPAAGAAG | 4 | 4 | 1.46 (1ken_T-1bgx_H) | 2.2 (1ay1_H) | MOUSE |
| H2-9-V | 2 | 1 (1%) | 1YC7_A | BBBPPLPBB | 1 | 1 | 0.13 (1YC7_A-1YC7_B) | 1.6 (1YC7_A) | CAMEL |
| H2-9-VI | 2 | 1 (1%) | 2aj3_D | BBBABGPBB | 1 | 1 | 0.19 (2aj3_D-2aj3_F) | 2.03 (2aj3_D) | HUMAN |
|
| |||||||||
| H2-10-I | 822 | 238 (62%) | 1uyw_H | BBPAAALPBB | 62 | 16 | 1.40 (1rzi_J-1a5f_H) | 1.22 (1mju_H) | HUMAN, MOUSE, |
| H2-10-II | 417 | 101 (27%) | 1i8i_B | BBPAALABBB | 48 | 18 | 1.46 (3GJE_H-1R24_B) | 1.06 (2X1Q_A) | HUMAN, MOUSE, |
| H2-10-III | 75 | 1 (0.3%) | 2KH2_B_20 | BBPAALDPBP | 35 | 6 | 1.31 | N/A | MOUSE |
| H2-10-IV | 19 | 1 (0.3%) | 1vhp_A_11 | BBPGALAPBB | 3 | 2 | 0.88 (1vhp_A_5-1vhp_A_6) | N/A | HUMAN |
| H2-10-V | 7 | 2 (1%) | 1G9E_A | BBPBDLDBPB | 7 | 3 | 1.19 (1G9E_9-1d6v_H) | 2.0 (1d6v_H) | LLAMA, |
| H2-10-VI | 4 | 1 (0.3%) | 1bzq_M | BBPAPABBPB | 1 | 1 | 0.03 (1bzq_L-1bzq_N) | 2.8 (1bzq_K) | DROMEDARY |
| H2-10-VII | 4 | 2 (1%) | 1cfv_H | BBPAALBPDB | 2 | 2 | 0.86 (1zv5_A-1bfv_H) | 2.0 (1zv5_A) | MOUSE, CAMEL |
| H2-10-VIII | 3 | 1 (0.3%) | 2fd6_H | BBBGBAABBB | 2 | 2 | 0.89 (3BT2_H-2fat_H) | 1.77 (2fat_H) | MOUSE |
| H2-10-IX | 3 | 2 (1%) | 2fjg_B | BPBAPLLPPB | 3 | 2 | 0.97 (3LMJ_H-2fjg_H) | 2.2 (3LMJ_H) | HUMAN |
| H2-10-X | 2 | 1 (0.3%) | 3NCY_Q | BBPAADPBBB | 1 | 1 | 0.00 (3NCY_Q-3NCY_P) | 3.2 (3NCY_Q) | MOUSE |
|
| |||||||||
| H2-11-O-1-1* | 1 | 1 (33%) | 2X6M_A | BBPPLLPDPBB | 1 | 1 | N/A | 1.62 (2X6M_A) | DROMEDARY |
| H2-11-O-2-1* | 1 | 1 (33%) | 3SE9_H | BBPAADLPBBB | 1 | 1 | N/A | 2.0 (3SE9_H) | HUMAN |
| H2-11-O-3-1* | 1 | 1 (33%) | 3H0T_B | BBBBBLBPBBB | 1 | 1 | N/A | 1.89 (3H0T_B) | HUMAN |
|
| |||||||||
| H2-12-I | 160 | 36 (95%) | 3IJH_B | BBPPAAALLPBB | 19 | 7 | 1.62 (4fab_H-2aeq_H) | 1.45 (1dlf_H) | HUMAN, MOUSE, |
| H2-12-II | 4 | 1 (3%) | 1aif_H | BPBDALPABBBB | 2 | 2 | 1.32 (1iai_I-1aif_H) | 2.9 (1aif_H) | MOUSE |
| H2-12-III | 4 | 1 (3%) | 3IXX_G | Not available | 1 | 1 | 0.1 (3IXX_I-3IXY_G) | 15.0 (3IXX_G) | MOUSE |
| H2-12-IV | 3 | 1 (3%) | 3QHZ_I | BBPAAPAPBBBP | 2 | 2 | 0.4 (3QHZ_H-3LZF_H) | 1.55 (3QHZ_H) | HUMAN |
|
| |||||||||
| H2-15-I | 3 | 1 (33%) | 1i3v_B | BBPDBPABADBPP | 3 | 1 | 0.55 (1i3v_A-1i3u_A) | 1.95 (1i3u_A) | LLAMA |
| H2-15-II | 2 | 1 (33%) | 3TYG_H | BBBAPBBADBDGBBB | 2 | 1 | 0.29 (3TYG_H-3TV3_H) | 1.29 (3TV3_H) | HUMAN |
|
|
|
|
| ||||||
Summation of clustered lengths per CDR.
(A) Summation of clustered lengths per CDR, with population, non-redundant sequences, number of clusters and outliers information. CDR lengths that were clustered for the first time are highlighted in bold/italics. (B) The complete CDR-H3 conformation, using the H95-H102 extents definition, has not been extensively clustered before; therefore only lengths that were not considered in Kuroda et al. (2009) are noted as new for conformity with the literature. CDR-H3 lengths 4 and 24 are marked with an asterisk as the corresponding structures are also found in North, Lehmann & Dunbrack (2011), but acknowledged as 2 residues longer, due to different CDR-H3 extents (H93-H102).
| (A) | ||||||
|
|
|
|
|
|
|
|
| L1 |
|
|
|
|
|
|
|
|
|
|
|
|
| |
| 10 | 127 | 28 | 1 | 126 | 1 | |
| 11 | 1,042 | 180 | 4 | 1,033 | 9 | |
| 12 | 82 | 26 | 4 | 81 | 1 | |
| 13 | 81 | 26 | 3 | 81 | 0 | |
| 14 | 207 | 25 | 7 | 193 | 14 | |
| 15 | 80 | 34 | 2 | 32 | 48 | |
| 16 | 352 | 74 | 5 | 319 | 33 | |
| 17 | 172 | 36 | 1 | 171 | 1 | |
| Total | 10 lengths | 2,155 | 434 | 30 | 2,048 | 107 |
|
|
|
|
|
|
|
|
| L2 | 7 | 2,161 | 278 | 3 | 2,159 | 2 |
| 11 | 13 | 3 | 2 | 13 | 0 | |
| Total | 2 lengths | 2,174 | 281 | 5 | 2,172 | 2 |
|
|
|
|
|
|
|
|
| L3 |
|
|
|
|
|
|
| 7 | 5 | 2 | 1 | 5 | 0 | |
| 8 | 138 | 43 | 6 | 136 | 2 | |
| 9 | 1,725 | 358 | 6 | 1,720 | 5 | |
| 10 | 113 | 27 | 12 | 107 | 6 | |
| 11 | 142 | 38 | 9 | 135 | 7 | |
| 12 | 19 | 6 | 4 | 19 | 0 | |
| 13 | 12 | 2 | 3 | 11 | 1 | |
| Total | 8 lengths | 2,164 | 480 | 42 | 2,143 | 21 |
|
|
|
|
|
|
|
|
| H1 | 10 | 6 | 2 | 1 | 6 | 0 |
| 12 | 2 | 2 | 0 | 0 | 2 | |
| 13 | 1,845 | 450 | 11 | 1,681 | 164 | |
| 14 | 72 | 17 | 1 | 70 | 2 | |
| 15 | 128 | 29 | 3 | 125 | 3 | |
| 16 | 3 | 2 | 1 | 2 | 1 | |
|
|
|
|
|
|
| |
| Total | 7 lengths | 2,057 | 503 | 17 | 1,884 | 173 |
| (B) | ||||||
|
|
|
|
|
|
|
|
| H2 | 8 | 6 | 2 | 1 | 6 | 0 |
| 9 | 436 | 117 | 6 | 435 | 1 | |
| 10 | 1,508 | 381 | 10 | 1,356 | 152 | |
|
|
|
|
|
|
| |
| 12 | 171 | 38 | 4 | 171 | 0 | |
| 15 | 6 | 3 | 2 | 5 | 1 | |
| Total | 6 lengths | 2,130 | 544 | 23 | 1,973 | 157 |
|
|
|
|
|
|
|
|
| H3 | 3 | 18 | 4 | 1 | 18 | 0 |
|
|
|
|
|
|
| |
| 5 | 93 | 28 | 6 | 85 | 8 | |
| 6 | 33 | 12 | 3 | 30 | 3 | |
| 7 | 97 | 41 | 7 | 69 | 28 | |
| 8 | 168 | 46 | 7 | 141 | 27 | |
| 9 | 181 | 55 | 8 | 132 | 49 | |
| 10 | 377 | 98 | 35 | 292 | 85 | |
| 11 | 231 | 64 | 26 | 151 | 80 | |
| 12 | 206 | 51 | 21 | 174 | 32 | |
| 13 | 130 | 42 | 22 | 105 | 25 | |
| 14 | 128 | 40 | 19 | 104 | 24 | |
| 15 | 96 | 23 | 18 | 81 | 15 | |
| 16 | 40 | 16 | 8 | 28 | 12 | |
| 17 | 28 | 14 | 6 | 19 | 9 | |
| 18 | 37 | 11 | 6 | 31 | 6 | |
| 19 | 48 | 12 | 9 | 46 | 2 | |
| 20 | 13 | 4 | 3 | 13 | 0 | |
|
|
|
|
|
|
| |
| 22 | 33 | 4 | 2 | 31 | 2 | |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
|
|
|
|
| 0 |
| |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
| Total | 25 lengths | 2,032 | 585 | 213 | 1,620 | 412 |
| Cumulative | 58 lengths | 12,712 | 2,827 | 330 | 11,840 | 872 |
Comparison of level-1 conformational clusters obtained in CDR-L1 with external sets.
The cluster medoid/median or representative of the external sets was used for identification of correspondences. In brackets, next to each correspondence, is the full, 3-level classification in this work of the representative of the external set and the number of corresponding members in full population comparison. Martin & Thornton (1996) cluster 14F is marked with a question mark, because its representative (2BJL, superseded by 4BJL) actually has a 13-residue CDR-L1.
| This work | ||
|---|---|---|
| L1-7-I | – | – |
| L1-9-I | – | – |
| L1-9-II | – | – |
| L1-10-I | 10A/1 | L1-10-1 |
| L1-11-I | 11A/2 | L1-11-1 |
| L1-11-II | – | L1-11-3 |
| L1-11-III | 11B/- | – |
| L1-11-IV | – | – |
| L1-12-I | – | L1-12-1 |
| L1-12-II | – | L1-12-2 |
| L1-12-III | – | – |
| L1-12-IV | – | L1-12-3 |
| L1-13-I | 13A/5 | L1-13-1 |
| L1-13-II | – | L1-13-2 |
| L1-13-III | – | – |
| L1-14-I | 14B/7 | L1-14-1 |
| L1-14-II | 14C/- | L1-14-2 |
| L1-14-III | – | – |
| L1-14-IV | – | – |
| L1-14-V | 14A/6 | – |
| L1-14-VI | – | – |
| L1-14-VII | – | – |
| L1-15-I | – | L1-15-1 |
| L1-15-II | – | – |
| L1-16-I | 16A/4 | L1-16-1 |
| L1-16-II | – | – |
| L1-16-III | – | – |
| L1-16-IV | – | – |
| L1-16-V | – | – |
| L1-17-I | 17A/3 | L1-17-1 |
|
| ||
| L1-12-O | 12A/6 | – |
| L1-14-O | 14D/- | – |
| L1-15-O | 15A/5 | L1-15-2 |
| L1-16-O | 16B/- | – |
Comparison of level-1 conformational clusters obtained in CDR-H2, with external sets.
See notes in Table 9.
| This work | ||
|---|---|---|
| H2-8-I | – | H2-8-1 |
| H2-9-I | 9A/1 | H2-9-1 |
| H2-9-II | – | H2-9-2 |
| H2-9-III | – | – |
| H2-9-IV | – | – |
| H2-9-V | – | – |
| H2-9-VI | – | – |
| H2-10-I | 10A/2 | H2-10-1 |
| H2-10-II | 10B/3 | H2-10-2 |
| H2-10-III | – | – |
| H2-10-IV | – | – |
| H2-10-V | – | – |
| H2-10-VI | – | – |
| H2-10-VII | – | – |
| H2-10-VIII | – | – |
| H2-10-IX | – | – |
| H2-10-X | – | – |
| H2-12-I | 12A/4 | H2-12-1 |
| H2-12-II | – | – |
| H2-12-III | – | – |
| H2-12-IV | – | – |
| H2-15-I | – | H2-15-1 |
| H2-15-II | – | – |
|
| ||
| H2-10-O | 10C/3 | H2-10-3 |
Figure 7A comparative view of all CDR-H1/13 residue clusters obtained in this work (in yellow), superposed to their correspondences from North, Lehmann & Dunbrack (2011), where applicable. Level-1 clusters from this work expand whenever possible towards closely-related variants, which are then further classified at levels 2 and 3 (complete 3-level classification in this work of the external median is given in brackets). This can be appreciated in clusters H1-13-I and H1-13-III from this work. The last four structures of this figure correspond to cluster medians from North, Lehmann & Dunbrack (2011) that were classified as outliers/singletons in this work.
Summary table for the clustering of CDR-L2.
| Cluster | Population | Unique | Medoid | Medoid | Level-3 | Level-2 | Cluster diameter (Å) | Best resolution | Species in cluster | Isotype |
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| L2-7-I | 2109 | 272 (98%) | 1dn0_C | LLDPPPP | 121 | 45 | 1.56 (3KYM_O-1nj9_A) | 1.2 (3D9A_L) | HUMAN, MOUSE, | |
| L2-7-II | 39 | 8 (3%) | 3RIA_L | GADBBPP | 12 | 5 | 1.04 (3RHW_K-3FKU_X) | 1.66 (3QHF_L) | HUMAN, MOUSE | |
| L2-7-III | 11 | 2 (1%) | 2a6k_L | LLGGPPD | 8 | 3 | 0.75 (2ZJS_L-2V7H_L) | 2.5 (2a6i_A) | MOUSE |
|
|
| ||||||||||
| L2-11-I | 10 | 2 (67%) | 2OTU_A | BPAALPBBPPP | 1 | 1 | 0.39 (2GSG_C-2OTU_C) | 1.68 (2OTU_A) | MOUSE |
|
| L2-11-II | 3 | 1 (33%) | 2H32_A | BDBAABBBPPA | 3 | 2 | 0.2 (2H3N_C-2H3N_A) | 2.3 (2H3N_A) | HUMAN |
|
|
|
|
|
| |||||||
Summary table for the clustering of CDR-L3.
| Cluster | Population | Unique | Medoid | Medoid | Level-3 | Level-2 | Cluster diameter (Å) | Best resolution | Species in cluster | Isotype |
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| L3-5-I | 10 | 4 (100%) | 3NGB_C | BBGAB | 2 | 1 | 0.26 (3U7W_L-3Sé_L) | 1.9 (3SE8_L) | HUMAN |
|
|
| ||||||||||
| L3-7-I | 5 | 2 (100%) | 3IU3_L | BBDDDLP | 4 | 1 | 0.49 (1mim_L-1dfb_L) | 2.6 (1mim_L) | HUMAN, MOUSE |
|
|
| ||||||||||
| L3-8-I | 105 | 28 (65%) | 1q9w_C | BPDABGBB | 13 | 4 | 0.76 (3KJ4_L-1m71_A) | 1.45 (1q9r_A) | HUMAN, MOUSE, |
|
| L3-8-II | 12 | 6 (14%) | 1tzh_L | BPDBBPBP | 9 | 5 | 1.18 (3DGG_C-1e6j_L) | 1.77 (2fat_L) | HUMAN, MOUSE |
|
| L3-8-III | 7 | 4 (9%) | 3hfl_L | BPDPABPB | 7 | 4 | 0.98 (1ORS_A-1ehl_L) | 1.7 (1YQV_L) | MOUSE |
|
| L3-8-IV | 5 | 2 (5%) | 3JWD_L | BPDADPDP | 3 | 3 | 0.66 (2J88_L-1rz7_L) | 2.0 (1rz7_L) | HUMAN, MOUSE |
|
| L3-8-V | 4 | 3 (7%) | 1za3_A | BPAABPDP | 4 | 4 | 0.85 (3DGG_A-1za3_L) | 2.3 (3DGG_A) | HUMAN |
|
| L3-8-VI | 3 | 1 (2%) | 3MCL_L | BBAAPBPA | 1 | 1 | 0.24 (3O11_A-3O11_L) | 1.7 (3MCL_L) | CHIMERA OF |
|
|
| ||||||||||
| L3-9-I | 1528 | 328 (92%) | 1dqj_A | BBDABPPPB | 153 | 34 | 1.72 (1maj_A_11-1kel_L) | 1.2 (3D9A_L) | HUMAN, MOUSE, |
|
| L3-9-II | 136 | 20 (6%) | 2XZQ_L | BBBBGDBPB | 36 | 8 | 1.36 (2E27_L-1pw3_B) | 1.5 (1oaq_L) | HUMAN, MOUSE, | |
| L3-9-III | 40 | 8 (2%) | 1op3_L | BPBBADBBB | 15 | 8 | 1.4 (8fab_C-3C08_L) | 1.75 (1op3_L) | HUMAN, MOUSE, | |
| L3-9-IV | 8 | 1 (0,3%) | 3MLR_L | BBBBBBBPA | 3 | 2 | 1.24 (3MLV_M-3MLS_M) | 1.8 (3MLR_L) | HUMAN |
|
| L3-9-V | 6 | 1 (0,3%) | 3RU8_L | BBBGLLBBB | 2 | 2 | 0.34 (1n0x_L-1hzh_M) | 1.8 (1n0x_L) | HUMAN |
|
| L3-9-VI | 2 | 1 (0,3%) | 3FN0_L | BBGBPBABA | 1 | 1 | 0.23 (3Q1S_L-3FN0_L) | 1.8 (3FN0_L) | HUMAN |
|
|
| ||||||||||
| L3-10-I | 22 | 2 (7%) | 3MUG_I | BBPBADLPBB | 4 | 2 | 0.76 (3MUH_L-3LRS_B) | 1.8 (3U2S_L) | HUMAN |
|
| L3-10-II | 20 | 2 (7%) | 1MCD_A | BBPBGLLBBP | 15 | 9 | 1.28 (1mcc_A-1lil_B) | 2.0 (2mcg_2) | HUMAN, SEAL |
|
| L3-10-III | 9 | 4 (15%) | 3B5G_A | BBBPAALPPB | 4 | 1 | 1.19 (3GO1_L-2XZA_L) | 1.36 (2XZC_L) | HUMAN |
|
| L3-10-IV | 9 | 4 (15%) | 3EYO_C | BBDABBPPPB | 5 | 2 | 0.51 (3F12_A-3eyq_C) | 1.8 (1jgu_L) | MOUSE |
|
| L3-10-V | 7 | 3 (11%) | 2dd8_L | BBBBADDGPB | 5 | 1 | 1.06 (3UJI_L-3G6A_L) | 1.6 (3UJI_L) | HUMAN |
|
| L3-10-VI | 5 | 1 (4%) | 3MLY_L | BBPBAALBPB | 1 | 1 | 0.24 (3MLZ_L-3MLY_M) | 1.7 (3MLY_L) | HUMAN |
|
| L3-10-VII | 5 | 3 (11%) | 3TV3_L | BBPBGADPBB | 4 | 3 | 0.84 (3TWC_L-1mcw_M) | 1.29 (3TV3_L) | HUMAN |
|
| L3-10-VIII | 4 | 1 (4%) | 2fl5_A | BBBPADLPPB | 2 | 2 | 0.18 (2fl5_C-2fl5_L) | 3.0 (2fl5_L) | HUMAN |
|
| L3-10-IX | 4 | 1 (4%) | 1mqk_L | BPDBGBPPBB | 3 | 1 | 0.31 (3HB3_D-1qle_L) | 1.28 (1mqk_L) | MOUSE |
|
| L3-10-X | 3 | 1 (4%) | 3IDY_L | BBDABBPPBB | 2 | 1 | 0.23 (3IDY_C-3IDX_L) | 2.5 (3IDX_L) | HUMAN |
|
| L3-10-XI | 2 | 1 (4%) | 1i7z_A | BPBBABPPBB | 2 | 1 | 0.12 (1i7z_A-1i7z_C) | 2.3 (1i7z_A) | CHIMERA OF |
|
| L3-10-XII | 17 | 3 (11%) | 1mcn_B | BBPPPADBBP | 17 | 15 | 1.55 (1mch_B-1lil_A) | 2.0 (2mcg_1) | HUMAN, SEAL |
|
|
| ||||||||||
| L3-11-I | 74 | 24 (63%) | 3G04_A | BBPBAADLBPB | 27 | 2 | 1.72 (3MAC_L-2rhe_A) | 1.43 (3N9G_L) | HUMAN, MOUSE, | |
| L3-11-II | 24 | 1 (3%) | 1yym_Q | BPDAPBPPBPB | 6 | 3 | 0.34 (1yym_L-1g9n_L) | 1.99 (2NY1_C) | HUMAN |
|
| L3-11-III | 11 | 3 (8%) | 2OMN_A | BBPBAPABABB | 7 | 3 | 1.01 (3GHE_L-2OMB_D) | 1.5 (3KDM_L) | HUMAN |
|
| L3-11-IV | 8 | 1 (3%) | 2QR0_G | BBBLPDDBABB | 8 | 4 | 0.36 (2QR0_S-2QR0_K) | 3.5 (2QR0_A) | HUMAN |
|
| L3-11-V | 5 | 2 (5%) | 3NH7_M | BBPBAPLLBBB | 4 | 3 | 0.59 (4bjl_A-3NH7_O) | 2.4 (4bjl_A) | HUMAN |
|
| L3-11-VI | 4 | 2 (5%) | 2JB6_A | BBBPAALDBBB | 2 | 1 | 0.77 (3UJJ_L-2JB5_L) | 2.0 (3UJJ_L) | HUMAN |
|
| L3-11-VII | 4 | 1 (3%) | 3EFF_C | BBDAPBBLAGB | 4 | 4 | 0.6 (3PJS_C-3EFF_A) | 3.8 (3EFF_A) | MOUSE |
|
| L3-11-VIII | 3 | 1 (3%) | 2b1h_L | BBPBDAALBPB | 2 | 1 | 0.2 (2b1a_L-2b0s_L) | 2.0 (2b1h_L) | HUMAN |
|
| L3-11-IX | 2 | 1 (3%) | 1nfd_E | BBBBGAALPPB | 1 | 1 | 0.17 (1nfd_E-1nfd_G) | 2.8 (1nfd_E) | MOUSE |
|
|
| ||||||||||
| L3-12-I | 6 | 1 (17%) | 2X7L_D | BBGBAAGADGBB | 1 | 1 | 0.04 (2X7L_K-2X7L_B) | 3.17 (2X7L_B) | Not available |
|
| L3-12-II | 6 | 1 (17%) | 1q1j_L | BBBPAPAALPPB | 2 | 1 | 0.31 (3GHB_L-3C2A_M) | 2.1 (3C2A_L) | HUMAN |
|
| L3-12-III | 4 | 2 (33%) | 3LZF_L | BBPBAPGAGBPB | 3 | 3 | 0.9 (3QHZ_M-3GBN_L) | 1.55 (3QHZ_M) | HUMAN |
|
| L3-12-IV | 3 | 2 (33%) | 3LQA_L | BBPBAPGAAGPB | 3 | 3 | 1.35 (3P30_L-3LMJ_L) | 2.2 (3LMJ_L) | HUMAN |
|
|
| ||||||||||
| L3-13-I | 6 | 1 (50%) | 2OTU_A | BBBBPPAABPBBB | 1 | 1 | 0.18 (2OTW_C-2OTU_G) | 1.68 (2OTU_A) | MOUSE |
|
| L3-13-II | 3 | 1 (50%) | 2QHR_L | BBBBBBLLBPBBB | 3 | 3 | 0.97 (3FFD_B-2GSG_A) | 2.0 (2QHR_L) | MOUSE |
|
| L3-13-III | 2 | 1 (50%) | 3MLW_L | BBBPDABABPPPB | 2 | 2 | 0.48 (3MLW_L-3MLW_M) | 2.7 (3MLW_L) | HUMAN |
|
|
|
|
|
| |||||||
Summary table for the clustering of CDR-H1.
Level-2 clusters are shown exceptionally (marked with an asterisk) when no level-1 cluster is formed (minimum of 2 members required).
| Cluster | Population | Unique | Medoid | Medoid | Level-3 | Level-2 | Cluster diameter (Å) | Best resolution | Species in cluster |
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| H1-10-I | 6 | 2 (100%) | 1kxq_H | BPABPBABBB | 2 | 1 | 0.61 (3eba_A-1kxq_F) | 1.6 (1kxq_E) | CAMEL, HUMAN |
|
| |||||||||
| H1-12-O-1* | 1 | 1 (50%) | 1ghf_H | BBBBPAAABPBB | 1 | 1 | N/A | 2.7 (1ghf_H) | MOUSE |
| H1-12-O-2* | 1 | 1 (50%) | 3IY2_B | PBBLBABBABBB | 1 | 1 | N/A | 18.0 (3IY2_B) | MOUSE |
|
| |||||||||
| H1-13-I | 1555 | 390 (87%) | 2OSL_A | PPBLBPAADBPBB | 214 | 23 | 1.74 (2GK0_B-1rzi_J) | 1.2 (3D9A_H) | MOUSE, HUMAN, |
| H1-13-II | 49 | 6 (1%) | 3F7Y_A | PBBGPBBAAPBBB | 19 | 6 | 1.77 (3QXW_D-3GKZ_A) | 1.72 (2IH3_A) | MOUSE, HUMAN, |
| H1-13-III | 24 | 4 (1%) | 1bzq_K | BPBLPABBPABBB | 11 | 3 | 1.12 (2P42_D-1jtp_B) | 1.1 (2P45_B) | MOUSE, |
| H1-13-IV | 8 | 2 (0.4%) | 3QXV_A | BBABPBAPPBPBB | 7 | 2 | 0.97 (3QXV_B-1YC7_A) | 1.6 (1YC7_A) | DROMEDARY, |
| H1-13-V | 9 | 3 (1%) | 2X7L_J | PPBLPAPABBPBB | 4 | 2 | 1.11 (2W9E_H-1YC8_B) | 2.7 (1YC8_B) | CAMEL |
| H1-13-VI | 9 | 1 (0.2%) | 2WZP_E | BPABBABPLPBBB | 4 | 2 | 0.35 (2WZP_J-2BSE_E) | 2.6 (2WZP_D) | CAMELID |
| H1-13-VII | 2 | 1 (0.2%) | 1SHM_A | PPBGPAAPPBPBB | 2 | 1 | 0.19 (1SHM_A-1SHM_B) | 1.9 (1SHM_A) | HUMAN, MOUSE, |
| H1-13-VIII | 8 | 5 (1%) | 3EZJ_B | BPBGPAAAPDBBB | 6 | 2 | 1.05 (1zvy_A-1SJX_A) | 1.5 (1zvh_A) | HUMAN, CAMEL, LLAMA |
| H1-13-IX | 3 | 1 (0.2%) | 3GBM_H | BBPGGAPBDBPBB | 3 | 1 | 0.17 (3GBN_H-3GBM_I) | 2.2 (3GBN_H) | HUMAN |
| H1-13-X | 3 | 1 (0.2%) | 2X89_A | BBBLPPLLBBPBB | 2 | 1 | 0.22 (2X89_C-2X89_B) | 2.16 (2X89_A) | HUMAN |
| H1-13-XI | 2 | 1 (0.2%) | 1ngx_B | PPALPPBABBPBB | 1 | 1 | 0.01 (1ngx_H-1ngx_B) | 1.8 (1ngx_B) | HUMAN, MOUSE |
|
| |||||||||
| H1-14-I | 70 | 16 (94%) | 1kcv_H | BBBLBPAAABGBBB | 47 | 11 | 1.37 (2f58_H-2ajz_H) | 1.3 (1ncw_H) | HUMAN, MOUSE |
|
| |||||||||
| H1-15-I | 117 | 25 (86%) | 2HWZ_H | BBBLBBAAPPLPBBB | 43 | 12 | 1.68 (3BQU_B-3B2V_H) | 1.5 (3IFL_H) | HUMAN, MOUSE |
| H1-15-II | 6 | 2 (7%) | 3BAE_H | BBBLBBAAAALPPBB | 4 | 2 | 0.86 (3BKC_H-3AAZ_A) | 1.59 (3BAE_H) | HUMAN, MOUSE |
| H1-15-III | 2 | 1 (3%) | 3FZU_H | BBBLPAPPAADBPBB | 2 | 1 | 0.28 (3FZU_C-3FZU_H) | 2.5 (3FZU_H) | HUMAN |
|
| |||||||||
| H1-16-I | 2 | 1 (50%) | 3eak_B | PBBGLAABPAAAPPBB | 1 | 1 | 0.42 (3eak_A-3eak_B) | 1.95 (3eak_A) | CAMEL |
|
| |||||||||
| H1-24-O-1* | 1 | 1 (100%) | 3K3Q_A | PPBLBALDLGAAG | 1 | 1 | N/A | 2.6 (3K3Q_A) | LLAMA |
|
|
|
|
| ||||||
Comparison of level-1 conformational clusters obtained in CDR-L2 with external sets.
See notes in Table 9. In North, Lehmann & Dunbrack (2011), the CDR extents were defined as L49-L56, instead of L50-L56; hence a direct comparison is not possible. Nonetheless, since position L49 is fairly conserved structurally and for reference reasons, a correspondence of the longer by 1 residue clusters is shown, based on the representative of those clusters (in square brackets and in full-italics).
| This work | ||
|---|---|---|
| L2-7-I | 7A/1 |
|
| L2-7-II | – |
|
| L2-7-III | 7B/1 | – |
| L2-11-I | – |
|
| L2-11-II | – |
|
Comparison of level-1 conformational clusters obtained in CDR-L3, with external sets.
See notes in Table 9. In Kuroda et al. (2009), no cluster representatives are available, so the cluster member with the best resolution was arbitrarily selected in each case, in order to identify the correspondences with the results from the present study.
| This work | |||
|---|---|---|---|
| L3-5-I | – | – | – |
| L3-7-I | 7A/4 | L3-7-1 | 4(1MIM) |
| L3-8-I | 8B/- | L3-8-1 | 3B(1PZ5) |
| L3-8-II | L3-8-cis6-1 | 7(2FAT) | |
| L3-8-III | 8A/3 | L3-8-2 | 3A(1YQV) |
| L3-8-IV | – | – | – |
| L3-8-V | – | – | – |
| L3-8-VI | – | – | – |
| L3-9-I | 9A/1 | L3-9-cis7-1 | 1(1MJU) |
| L3-9-II | 9C/4 | L3-9-1 | 1A (1A6V) |
| L3-9-III | 9B/2 | L3-9-cis6-1 | (9-)2 (2FBJ) |
| L3-9-IV | – | – | – |
| L3-9-V | – | – | – |
| L3-9-VI | – | – | – |
| L3-10-I | – | – | – |
| L3-10-II | – | – | – |
| L3-10-III | – | L3-10-1 | – |
| L3-10-IV | – | L3-10-cis7,8-1 | 5(1JGU) |
| L3-10-V | – | – | – |
| L3-10-VI | – | – | – |
| L3-10-VII | 10B/-( | – | – |
| L3-10-VIII | – | – | – |
| L3-10-IX | – | – | – |
| L3-10-X | – | – | – |
| L3-10-XI | – | L3-10-cis8-1 | – |
| L3-10-XII | 10C/- | – | – |
| L3-11-I | 11A/5 | L3-11-1 | (11-)2 (2FB4) |
| L3-11-II | – | L3-11-cis7-1 | 8(2NY1) |
| L3-11-III | – | – | – |
| L3-11-IV | – | – | – |
| L3-11-V | 11B/- | – | – |
| L3-11-VI | – | – | – |
| L3-11-VII | – | – | – |
| L3-11-VIII | – | – | – |
| L3-11-IX | – | – | – |
| L3-12-I | – | – | – |
| L3-12-II | – | L3-12-1 | – |
| L3-12-III | – | – | – |
| L3-12-IV | – | – | – |
| L3-13-I | – | L3-13-1 | – |
| L3-13-II | – | – | – |
| L3-13-III | – | – | – |
|
| |||
| L3-10-O | 10A/5 | – | – |
Comparison of level-1 conformational clusters obtained in CDR-H1 with external sets.
See notes in Table 9. In Martin & Thornton (1996), the CDR extents definition is significantly different (H26-H35), but correspondences based on median structures are shown for reference (in square brackets and full-italics).
| This work | ||
|---|---|---|
| H1-10-I | – | H1-10-1 |
| H1-13-I |
| H1-13-1 |
| H1-13-II | – | H1-13-8 |
| H1-13-III | – | H1-13-6 |
| H1-13-IV | – | – |
| H1-13-V | – | – |
| H1-13-VI | – | – |
| H1-13-VII | – | – |
| H1-13-VIII | – | H1-13-5 |
| H1-13-IX | – | – |
| H1-13-X | – | – |
| H1-13-XI | – | – |
| H1-14-I |
| H1-14-1 |
| H1-15-I |
| H1-15-1 |
| H1-15-II | – | – |
| H1-15-III | – | – |
| H1-16-I | – | – |
|
| ||
| H1-12-O | – | H1-12-1 |
| H1-13-O |
| H1-13-3 |
| H1-16-O | – | H1-16-1 |
| H1-24-O | – | – |