| Literature DB >> 27881076 |
David T Barkan1, Xiao-Li Cheng1, Herodion Celino1, Tran T Tran1, Ashok Bhandari1, Charles S Craik2,3, Andrej Sali4,2,3, Mark L Smythe5,6.
Abstract
BACKGROUND: Disulfide-rich peptides (DRPs) are found throughout nature. They are suitable scaffolds for drug development due to their small cores, whose disulfide bonds impart extraordinary chemical and biological stability. A challenge in developing a DRP therapeutic is to engineer binding to a specific target. This challenge can be overcome by (i) sampling the large sequence space of a given scaffold through a phage display library and by (ii) panning multiple libraries encoding structurally distinct scaffolds. Here, we implement a protocol for defining these diverse scaffolds, based on clustering structurally defined DRPs according to their conformational similarity.Entities:
Keywords: Clustering; Disulfide-rich peptides; Drug discovery; Interleukin-23; Knottins; Phage display; Structure conservation
Mesh:
Substances:
Year: 2016 PMID: 27881076 PMCID: PMC5120537 DOI: 10.1186/s12859-016-1350-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Protocol details. a Pipeline workflow. b Example of hierarchical clustering using toy data, portrayed as a tree where the leaves are DRPs and each inner node represents a cluster containing all DRPs in the sub-tree rooted at that node. Numbers at the branch point are the values of the distance metric when calculated across the two sub-trees that are being merged at the inner node. The red line is the empirically selected cutoff (here, 0.7); all sub-trees to the right of this cutoff represent the final clusters
Fig. 2Determination of clustering cutoffs. Top row: example of resulting clusters following the initial native overlap hierarchical clustering step. Each image represents a different cutoff applied for determining the final clusters for that step. These images informed the decision of which cutoff to impose in the final protocol. a Conotoxin and small hairpin clusters at the native overlap cutoff of 0.7, which was ultimately selected as the final cutoff. b At a cutoff of 0.6, the same conotoxin and small hairpin DRPs were assigned to the same cluster despite assuming different secondary structures. c At a cutoff of 0.8, conotoxin DRPs were assigned to separate clusters despite each cluster fold consisting of circular loops and short helical regions. Bottom row: example of a resulting cluster following the knottin reclustering step, with each image representing the knottin cluster containing the most DRPs after applying a different cutoff. Only the disulfide bonds in the DRPs are displayed, in yellow. The cutoffs assessed were (d) 2.0 Å RMSD, (e) 1.5 Å RMSD, and (f) 2.5 Å RMSD. 2.0 Å was selected as the optimal cutoff and used in the final protocol
Fig. 3Cluster DRP coverage. Clusters were sorted by size from most to least populated and each cluster was assigned an index starting with 1. At each index i, the cumulative number of DRPs in that cluster and all clusters with index less than i was calculated and divided by the total number of DRPs in the dataset, resulting in the coverage. Coverage as a function of index is displayed. Coverage curves are shown after completion of successive steps of the procedure (red: initial clustering; green: knottin reclustering; purple: longer singleton post-processing; blue: shorter singleton post-processing)
Fig. 4Cluster visualization. The top 20 clusters by size are displayed. Singleton DRPs are removed for clarity. DRPs are colored according to sequence conservation within the cluster, ranging from yellow (high conservation) to gray (moderate) to blue (low conservation). Disulfide bonds are shown in red
Matrix of structural diversity across clusters
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
| .64 | .42 | .25 | .67 | .39 | .50 | .31 | .67 | .42 | .31 | .41 | .53 | .34 | .69 | .39 | .39 | .33 | .39 | .58 |
|
| .64 |
| .39 | .27 | .73 | .42 | .58 | .39 | .48 | .47 | .33 | .24 | .42 | .47 | .70 | .70 | .50 | .27 | .45 | .70 |
|
| .42 | .39 |
| .52 | .34 | .38 | .48 | .67 | .40 | .36 | .38 | .41 | .50 | .32 | .46 | .33 | .39 | .57 | .32 | .33 |
|
| .25 | .27 | .52 |
| .31 | .53 | .47 | .75 | .33 | .31 | .67 | .26 | .31 | .26 | .32 | .24 | .31 | .63 | .25 | .33 |
|
| .67 | .73 | .34 | .31 |
| .38 | .55 | .38 | .37 | .42 | .38 | .37 | .34 | .45 | .69 | .61 | .44 | .45 | .23 | .77 |
|
| .39 | .42 | .38 | .53 | .38 |
| .74 | .53 | .50 | .33 | .65 | .22 | .28 | .34 | .54 | .45 | .36 | .47 | .34 | .37 |
|
| .50 | .58 | .48 | .47 | .55 | .74 |
| .53 | .33 | .39 | .58 | .26 | .34 | .45 | .61 | .58 | .50 | .47 | .27 | .57 |
|
| .31 | .39 | .67 | .75 | .38 | .53 | .53 |
| .40 | .31 | .50 | .30 | .41 | .37 | .39 | .27 | .42 | .63 | .25 | .37 |
|
| .67 | .48 | .40 | .33 | .37 | .50 | .33 | .40 |
| .72 | .30 | .33 | .38 | .39 | .50 | .55 | .39 | .37 | .43 | .63 |
|
| .42 | .47 | .36 | .31 | .42 | .33 | .39 | .31 | .72 |
| .25 | .50 | .36 | .42 | .56 | .42 | .47 | .33 | .34 | .28 |
|
| .31 | .33 | .38 | .67 | .38 | .65 | .58 | .50 | .30 | .25 |
| .20 | .19 | .29 | .39 | .33 | .31 | .47 | .25 | .33 |
|
| .41 | .24 | .41 | .26 | .37 | .22 | .26 | .30 | .33 | .50 | .20 |
| .41 | .35 | .30 | .30 | .28 | .22 | .30 | .30 |
|
| .53 | .42 | .50 | .31 | .34 | .28 | .34 | .41 | .38 | .36 | .19 | .41 |
| .29 | .53 | .45 | .25 | .31 | .36 | .38 |
|
| .34 | .47 | .32 | .26 | .45 | .34 | .45 | .37 | .39 | .42 | .29 | .35 | .29 |
| .42 | .45 | .50 | .32 | .48 | .55 |
|
| .69 | .70 | .46 | .32 | .69 | .54 | .61 | .39 | .50 | .56 | .39 | .30 | .53 | .42 |
| .64 | .44 | .36 | .32 | .60 |
|
| .39 | .70 | .33 | .24 | .61 | .45 | .58 | .27 | .55 | .42 | .33 | .30 | .45 | .45 | .64 |
| .42 | .30 | .27 | .61 |
|
| .39 | .50 | .39 | .31 | .44 | .36 | .50 | .42 | .39 | .47 | .31 | .28 | .25 | .50 | .44 | .42 |
| .39 | .43 | .47 |
|
| .33 | .27 | .57 | .63 | .45 | .47 | .47 | .63 | .37 | .33 | .47 | .22 | .31 | .32 | .36 | .30 | .39 |
| .30 | .47 |
|
| .39 | .45 | .32 | .25 | .23 | .34 | .27 | .25 | .43 | .34 | .25 | .30 | .36 | .48 | .32 | .27 | .43 | .30 |
| .36 |
|
| .58 | .70 | .33 | .33 | .77 | .37 | .57 | .37 | .63 | .28 | .33 | .30 | .38 | .55 | .60 | .61 | .47 | .47 | .36 |
|
For each cluster, the DRP with the highest average native overlap value to all other DRPs in the cluster was selected as the representative member to be used as the basis for phage display libraries (the calculation of native overlap is described in Methods). These average native overlap values for the representative DRPs are displayed along the matrix diagonal in bold. Additionally, pairwise structural alignments of all representatives were computed with SALIGN; the resulting native overlap values are displayed in off-diagonal cells in the matrix
Summary of clusters
| Cluster | Name | # Members | Avg Seq Id | Avg Length | Scaffold |
|---|---|---|---|---|---|
| 1 | Knottin CSαβ I | 115 | 21.8 | 38 | 2crdA |
| 2 | Knottin Classic ICK | 98 | 23.4 | 24 | 2jtbA |
| 3 | Insulin | 58 | 42.4 | 23 | 3gkyC |
| 4 | Small Conotoxin | 52 | 24.9 | 12 | 1e76A |
| 5 | Knottin Cyclotide ICK | 48 | 30.9 | 30 | 3e4hA |
| 6 | Small Hairpin | 42 | 15.9 | 16 | 1wo0A |
| 7 | EGF-like Hairpins | 39 | 17.0 | 19 | 2oqjL |
| 8 | Medium Conotoxin | 35 | 21.0 | 17 | 2uz6K |
| 9 | α-Defensin | 30 | 53.4 | 31 | 3lo2B |
| 10 | β-Defensin | 25 | 51.1 | 37 | 2nlsA |
| 11 | Large Hairpin | 22 | 21.5 | 17 | 1gm2A |
| 12 | Crambin | 19 | 56.8 | 46 | 1orlA |
| 13 | Helix-Loop-Helix | 19 | 12.2 | 34 | 1bzbA |
| 14 | LDL Receptor | 17 | 30.4 | 39 | 2kriB |
| 15 | Knottin CSαβ II | 12 | 19.7 | 29 | 1du9A |
| 16 | PMP Inhibitor | 11 | 59.0 | 35 | 2f91B |
| 17 | TNF Receptor | 11 | 42.6 | 39 | 1xu2R |
| 18 | Large Conotoxin | 10 | 24.6 | 19 | 1tckA |
| 19 | Tryptase Inhibitor | 10 | 42.2 | 39 | 2kmoA |
| 20 | Anti-microbial Peptide | 9 | 49.2 | 30 | 1mmcA |
Clusters are sorted by number of members. Name: Manually assigned name, derived from the most frequent SCOP fold assignment for each cluster. Avg Seq Id: Average pairwise sequence identity of all DRPs in the cluster. Avg Length: Average sequence length of all DRPs in the cluster, derived from the sequence resolved in the PDB structure. Scaffold: Selected representative for the cluster
Fig. 5Phage display experiment. a Structure of the peptide scaffold for phage library 1KATr1. Variable residue positions are colored red, and disulfide bonds in yellow. The same representation is used for the 1KVFr1 (b) and 1ZDCr1 (c) library scaffolds. d Enrichment ratios across successive rounds of phage panning for the three libraries. Panning was discontinued after the fourth round for 1KATr1 and 1KVFr1 due to a lack of enrichment. e Standard curve resulting from competition ELISA experiment to assess inhibition of IL-23/IL-23R complex formation by the PN-05-84 clone