| Literature DB >> 30832207 |
Rachael A Mansbach1, Timothy Travers2,3, Benjamin H McMahon4, Jeanne M Fair5, S Gnanakaran6.
Abstract
Marine cone snails are carnivorous gastropods that use peptide toxins called conopeptides both as a defense mechanism and as a means to immobilize and kill their prey. These peptide toxins exhibit a large chemical diversity that enables exquisite specificity and potency for target receptor proteins. This diversity arises in terms of variations both in amino acid sequence and length, and in posttranslational modifications, particularly the formation of multiple disulfide linkages. Most of the functionally characterized conopeptides target ion channels of animal nervous systems, which has led to research on their therapeutic applications. Many facets of the underlying molecular mechanisms responsible for the specificity and virulence of conopeptides, however, remain poorly understood. In this review, we will explore the chemical diversity of conopeptides from a computational perspective. First, we discuss current approaches used for classifying conopeptides. Next, we review different computational strategies that have been applied to understanding and predicting their structure and function, from machine learning techniques for predictive classification to docking studies and molecular dynamics simulations for molecular-level understanding. We then review recent novel computational approaches for rapid high-throughput screening and chemical design of conopeptides for particular applications. We close with an assessment of the state of the field, emphasizing important questions for future lines of inquiry.Entities:
Keywords: computational studies; conopeptides; conotoxins; docking; drug design; ion channels; machine learning; molecular dynamics; review
Mesh:
Substances:
Year: 2019 PMID: 30832207 PMCID: PMC6471681 DOI: 10.3390/md17030145
Source DB: PubMed Journal: Mar Drugs ISSN: 1660-3397 Impact factor: 5.118
Different categories used to classify the conopeptides, along with the basic type of categorization and a brief description.
| Category | Type | Description |
|---|---|---|
| Gene superfamily | sequence | Clustering of precursor region |
| Cysteine framework | sequence | Arrangement of cysteines |
| Loop class | sequence | Number of amino acids between cysteines |
| Disulfide connectivity | structure | Pattern of disulfide bond formation |
| Fold | structure | General three-dimensional structure |
| Subfold | structure | More specific three-dimensional structure |
| Pharmacological family | action | Target and mode of action (agonist, antagonist, etc.) |
Figure 1Comparison of two conotoxin sequences from Gene Superfamily A. Matching precursor signal sequence regions are in blue, places where the precursor regions do not match are in green, and the mature toxin regions are in red. The remainder of the sequences comprising the N-terminal and C-terminal pro-regions are in black. (a) sequence of conotoxin Ac4.1 from Conus achatinus; (b) sequence of conotoxin Ac1.2 from Conus achatinus.
Figure 2Illustration of sequence, cysteine framework and loop class for (a) -conotoxin ImI (Protein Data Bank or PDB structure 1G2G [51]) and (b) -conotoxin CnIIIC (PDB structure 2YEN [52]). In the illustrative 3D structures on the left, disulfide bonds are represented as yellow sticks. Images of peptides were generated with Pymol [53].
Summary of cysteine frameworks, with defining pattern and number of cysteines. Data compiled from the Conoserver, an automatically-updating online repository of conopeptide data [17,54]. In the entry for framework X, .[PO] represents an interceding hydroxyproline residue.
| Framework Name | Pattern | No. Cysteines |
|---|---|---|
| I | CC-C-C | 4 |
| II | CCC-C-C-C | 6 |
| III | CC-C-C-CC | 6 |
| IV | CC-C-C-C-C | 6 |
| V | CC-CC | 4 |
| VI/VII | C-C-CC-C-C | 6 |
| VIII | C-C-C-C-C-C-C-C-C-C | 10 |
| IX | C-C-C-C-C-C | 6 |
| X | CC-C.[PO]C | 4 |
| XI | C-C-CC-CC-C-C | 8 |
| XII | C-C-C-C-CC-C-C | 8 |
| XIII | C-C-C-CC-C-C-C | 8 |
| XIV | C-C-C-C | 4 |
| XV | C-C-CC-C-C-C-C | 8 |
| XVI | C-C-CC | 4 |
| XVII | C-C-CC-C-CC-C | 8 |
| XVIII | C-C-CC-CC | 6 |
| XIX | C-C-C-CCC-C-C-C-C | 10 |
| XX | C-CC-C-CC-C-C-C-C | 10 |
| XXI | CC-C-C-C-CC-C-C-C | 10 |
| XXII | C-C-C-C-C-C-C-C | 8 |
| XXIII | C-C-C-CC-C | 6 |
| XXIV | C-CC-C | 4 |
| XXV | C-C-C-C-CC | 6 |
| XXVI | C-C-C-C-CC-CC | 8 |
| XXVII | C-CC-C-C-C | 6 |
Figure 3The thirteen major conopeptide folds described in Akondi et al. [3], shown by representative examples from the PDB [56]. Disulfide bonds are represented as yellow sticks. All images rendered in Pymol [53]. (a) Fold A, also referred to as the “globular” fold: conotoxin -RgIA, PDB structure 2JUT [57]; (b) Fold B: conotoxin -CnIIIC, PDB structure 2YEN [58]; (c) Fold C, also referred to as “cysteine knot” fold: conotoxin -EVIA, PDB structure 1G1P [59]; (d) Fold D, also referred to as “ribbon” fold: ribbon isoform of conotoxin -GI, PDB structure 1XGB [60]; (e) Fold E, also referred to as “beaded” or “beads-on-a-string” fold: conotoxin -CMrVIA, PDB structure 2B5Q [61]; (f) Fold G: conotoxin -PIXIVA, PDB structure 2FQC [62]; (g) Kunitz fold: conkunitzin-S2, PDB structure 2J6D [63]; (h) Fold H: conotoxin MrIIIe, PDB structure 2EFZ [64]; (i) Fold I: Conotoxin -PIVA, PDB structure 1P1P [65]; (j) Fold J: contryphan-Vn, PDB structure 1NXN [66]; (k) Fold K: conantokin-G, PDB structure 1AD7 [15]; and (l) Fold L: conomarphin, PDB structure 2YYF [67]. There is no representative structure in the PDB for Fold F. The interested reader is referred to the original paper by Zhang et al. [68], which contains the characterization of the only determined structure of this fold.
Figure 4Different disulfide connectivities lead to different conotoxin structures. (a) With four cysteines, two different connectivities can lead to either a “globular” structure with -helical content (left, PDB 1PEN for conotoxin -PnIA [72]) or a flattened “ribbon” structure which often, but not always, displays -sheet content (right, PDB 2EW4 for conotoxin -MrIA [73]). (b) With six cysteine residues, two connectivities that differ only in the first two disulfide bonds can lead to either a “cysteine knot” structure with -sheet content (left, PDB 1AG7 for conotoxin -GS [74]) or another structure with no discernable secondary structure content (right, PDB 2EFZ for conotoxin MrIIIe [64]).
Summary of pharmacological families, their targets, and their modes of action. Data compiled from the Conoserver [17,54] and the Uniprot database [75].
| Family | Target | Mode of Action |
|---|---|---|
| Nicotinic acetylcholine receptors (nAChRs) | orthosteric, allosteric inhibition | |
| Neuronal pacemaker cation currents | increase calcium current | |
| Voltage-gated sodium channels (VGSCs) | agonist, delayed inactivation | |
| Presynaptic calcium channels or G protein-coupled presynaptic receptors (GPCRs) | blocker | |
| VGSC | agonist, no delayed inactivation | |
| Voltage-gated potassium channels (VGPCs) | blocker | |
| VGSC | antagonist, blocker | |
| Alpha-1 adrenergic receptors | allosteric inhibitor | |
| Serotonin-gated ion channels | antagonist | |
| Somatostatin receptor | antagonist | |
| Neuronal noradrenaline transporter | unknown | |
| Voltage-gated calcium channels (VGCCs) | blocker |
Figure 5Comparison of different categories with pharmacological family. (a) Comparison between cysteine framework and pharmacological target for all pairs of 243 peptides with determined pharmacological targets downloaded from the Conoserver [17,54]. Black indicates that two peptides are assigned different categories, while white indicates the two peptides are assigned the same category. The lower triangular shows cysteine framework; the upper triangular pharmacological target. (b) Comparison between cysteine framework and pharmacological target for the subset of 106 sequences with more than four cysteines. In (c) and (d), we show a comparison between fold and subfold class and pharmacological family for all pairs of 80 peptides with a defined pharmacological target and fold/subfold classes as described in Akondi et al. [3].
Figure 6Two-dimensional embedding of loop class, demonstrating its relationship to (a) fold and (b) pharmacological family. Data compiled from Akondi et al. [3], which includes 103 peptides with measured structures, 80 of which had identified pharmacological targets. Loop class was represented as a seven-dimensional vector, with vectors representing classes containing fewer than eight cysteines padded with negative ones for direct comparison. In all images, size of a given marker indicates the number of conopeptides with identical loop class and category, while color indicates category. Red arrows draw the reader’s attention to differences in clustering between the two panels. Blue arrows indicate an example of a structural isomer. The embedding was done for visualization purposes employing the t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm as implemented in Scikit Learn [79,80].
Figure 7Impact of disulfide connectivity on conotoxin structure. (a) The same sequence can adopt different structural folds given different disulfide connectivities [69]. Conotoxin -BuIA in globular form with connectivity 1-3, 2-4 (top, PDB structure 2I28 [82]) and in ribbon form with connectivity 1-4, 2-3 (bottom, PDB structure 2NS3 [83]). (b) Different sequences can adopt similar structural folds given the same disulfide connectivity. Conotoxin -EI in cyan (PDB structure 1K64 [84]) and conotoxin -IMI in pink (PDB structure 1G2G [51]), both in globular form with connectivity 1-3, 2-4. All images rendered in Pymol with disulfide bonds represented as yellow sticks [53].
Figure 8A visual overview of Section 3, with a focus on the different computational techniques employed.
Summary of work cited in Section 3. For each study, we note the broad goal and related subsection, the toxins involved, the methods employed, and a brief description of the results. Studies are listed in the order they are mentioned in the main text.
| Goal | Toxin(s) | Methods | Results/Citations |
|---|---|---|---|
|
| toxins targeting VGSC, VGCC, VGPC | Support vector machines (SVMs) | Predictor of sequence to target with average accuracy 90.3% [ |
| SVMs | Predictor of sequence to target with average accuracy 95.3% [ | ||
| SVMs | Predictor of sequence to target with average accuracy 94.2% [ | ||
| Radial basis function network | Predictor of sequence to target with average accuracy 89.7% [ | ||
| Random forests | Predictor of sequence to target with average accuracy 97.3% [ | ||
| RNA sequences from ten species | Logit, Label spreading, Perceptron | ConusPipe identifies potential conotoxins from sequence [ | |
|
| BtIIIA | MD simulation | Structure refinement [ |
| MD simulation | Structure refinement [ | ||
| MD simulation | Structure refinement [ | ||
| conantokin G | MD simulation | Structure refinement in complex with calcium [ | |
| sr11a, | MD, homology modeling | Structure refinement of sr11a [ | |
| Vt3.1 | MD, secondary structure predictors | Structure determination [ | |
| conantokins conBk-A, conBk-B, conBk-C | MD, secondary structure predictors | Structure determination [ | |
| Docking, MD | Validation of in silico predictions [ | ||
| Homology modeling, MD | Revealed molecular interactions between toxin and nAChR [ | ||
| Docking | Validation of homology models for eukaryotic sodium channels [ | ||
| Docking, biased MD, unbiased MD | Characterization of insertion into mammalian and bacterial channels [ | ||
| Docking | Testing of ToxDock algorithm [ | ||
|
| Docking | Identification of charged ring interaction with Shaker VGPC [ | |
| Docking | Identification of charged ring interaction with Shaker VGPC [ | ||
| Docking | Identification of charged ring interaction with VGSC [ | ||
| Docking, MD | Charge more important than steric for selectivity for | ||
| Docking, MD | Binding versus selectivity to nAChR subtypes [ | ||
| Quantum and classical MD | Find electronic changes from mutation and relate to h-bonding [ | ||
| Constant-pH MD | Probe pH effects on protonation [ | ||
| cyclic | MD simulation | Cyclicization effects on interaction with VGPC [ | |
| Docking | Hydrophobic effect on | ||
| MD simulation | Receptor side chain length relation to affinity for | ||
| MD simulation | Effect of dicarba instead of disulfide bridges on | ||
| Docking, MD | Tripeptide tail interaction with | ||
| Docking, MD | Receptor side chain orientation effects on binding to | ||
| Homology modeling, Docking | His-5, Gln-13 key residues control | ||
| MD simulation | Mechanism of selectivity for | ||
| MD simulation | Residue Glu-198 controls affinity for rat over human | ||
| Docking | Conserved proline controls affinity with | ||
| Docking, MD | Arg-7, Arg-9 controls affinity and selectivity for | ||
| Homology modeling, MD | Phe-9 controls binding with | ||
| MD simulation | Selectivity for | ||
| MD simulation | Selectivity for | ||
| Docking, MD | Key methionine residue responsible for toxicity [ | ||
|
| Docking | Affinities for different nAChR binding sites [ | |
| MD simulation | Verified and explained proposed binding orientation in VGSC [ | ||
| Docking | Identification of two different nAChR binding modes [ | ||
| Docking | Structural isomers bind to different sites on | ||
| MD, binding energy calculations | Identify binding site to | ||
| conantokin-T, conantokin-G | MD simulation | Determined metal-binding models [ | |
| Docking | Microscopic interactions of rapid unbinding from | ||
| Docking | Differences between conotoxin and snake toxin binding at nAChR [ | ||
| Docking, MD | Binding interaction investigation [ | ||
| Docking, MD | Molecular basis of binding to VGSC [ | ||
| Docking, MD | Systematic analysis of binding modes to Na | ||
| Docking, MD, umbrella sampling | Predict specificity for 8 different Na | ||
| Umbrella sampling | Predict IC50 values for Ca | ||
| Umbrella sampling | Dissociation constants from Na | ||
| Umbrella sampling | Binding pathway characterization [ | ||
| Umbrella sampling | Identification of multiple pathways in binding to | ||
| Random accelerated MD, Steered MD | Characterize multiple unbinding pathways from | ||
| MD, coarse-grained Brownian dynamics | Contribution of long-range electrostatics, h-bonding, hydrophobicity to approach and insertion into VGPC [ | ||
|
| Simplified quantum chemical calculations | Rapid simulations of folding/unfolding [ | |
| MD simulation | Size/shape fluctuations, translational and rotational diffusivity determination [ | ||
| MrIIIe | MD simulation | Effects of electric field strength [ | |
| MD simulation | Ensembles of different isomers thermodynamically favorable under different solvent conditions [ | ||
| MD simulation | Effects of removal of successive disulfide bonds and characterization of folding types [ | ||
| cyclic | MD simulation | Removal of disulfide bonds does not perturb structure [ | |
| MD simulation | Removal of disulfide bonds perturbs non-cyclic structures [ | ||
|
| contulakin-G | MD simulation, binding free energy calculations | Design of neurotensin analogues [ |
| MD simulation | Design of methionine-lacking mutant [ | ||
| 148 conopeptides with 3D structures in PDB [ | Docking, MD | Design of conotoxin targeting LPAR6 [ | |
| Homology search | Find sequences with insecticidal properties [ | ||
| MD simulation | Find strong copper-binding conopeptides to remove metals from environment [ | ||
| conantokin-G and mutants | Docking | Design of EAR16, EAR18 that reversibly block GluN2B NMDA receptor [ | |
| Docking, genetic algorithms | Design mutant with double the binding affinity for | ||
| Docking, genetic algorithms | Drug repurposing algorithm [ | ||
| Docking, Protein Surface Topography | Design mutant with nanomolar affinity for | ||
| MD, binding energy calculation | Assess importance of different disulfide bonds to binding with VGSC [ |
Figure 9Illustration of binding modes and key residues. Figure shows conotoxin -GIC in complex with the acetylcholine binding protein (AchBP) from Aplysia californica, often employed for homology modeling of nAChRs. Structure downloaded from the PDB server (code 5CO5). We highlight the key residues for selectivity identified by the study that also characterized the crystal structure through a combination of experimental and computational techniques [128]—in red, His-5, and in magenta, Gln-13, shown to control binding affinity and selectivity, respectively, for the nACHr subtype. (a) shows a zoom in of one of the binding pockets, with hydrogen bonds represented as yellow dotted lines, while (b) shows a top view. In both panels, the main body of the conotoxins are colored green and AchBP is colored blue. Images rendered with Pymol [53].