Ariana A Vasconcelos1,2, Jorge C Estrada3, Victor David3, Luciana S Wermelinger4, Fabio C L Almeida1,2, Russolina B Zingali3. 1. Instituto de Bioquímica Médica (IBqM) Leopoldo de Meis, Centro Nacional de Ressonância Magnética Nuclear, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil. 2. Centro Nacional de Ressonância Magnética Nuclear (CNRMN), Centro Nacional de Biologia Estrutural e Bioimagem (CENABIO), Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil. 3. Laboratório de Hemostase e Venenos, Instituto de Bioquímica Médica (IBqM) Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil. 4. Faculdade de Farmácia, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
Abstract
Disintegrins are small cysteine-rich proteins found in a variety of snake venom. These proteins selectively modulate integrin function, heterodimeric receptors involved in cell-cell and cell-matrix interaction that are widely studied as therapeutic targets. Snake venom disintegrins emerged from the snake venom metalloproteinase and are classified according to the sequence size and number of disulfide bonds. Evolutive structure and function diversification of disintegrin family involves a stepwise decrease in the polypeptide chain, loss of cysteine residues, and selectivity. Since the structure elucidation of echistatin, the description of the structural properties of disintegrins has allowed the investigation of the mechanisms involved in integrin-cell-extracellular matrix interaction. This review provides an analysis of the structures of all family groups enabling the description of an expanded classification of the disintegrin family in seven groups. Each group presents a particular disulfide pattern and sequence signatures, facilitating the identification of new disintegrins. The classification was based on the disintegrin-like domain of the human metalloproteinase (ADAM-10). We also present the sequence and structural signatures important for disintegrin-integrin interaction, unveiling the relationship between the structure and function of these proteins.
Disintegrins are small cysteine-rich proteins found in a variety of snake venom. These proteins selectively modulate integrin function, heterodimeric receptors involved in cell-cell and cell-matrix interaction that are widely studied as therapeutic targets. Snake venom disintegrins emerged from the snake venom metalloproteinase and are classified according to the sequence size and number of disulfide bonds. Evolutive structure and function diversification of disintegrin family involves a stepwise decrease in the polypeptide chain, loss of cysteine residues, and selectivity. Since the structure elucidation of echistatin, the description of the structural properties of disintegrins has allowed the investigation of the mechanisms involved in integrin-cell-extracellular matrix interaction. This review provides an analysis of the structures of all family groups enabling the description of an expanded classification of the disintegrin family in seven groups. Each group presents a particular disulfide pattern and sequence signatures, facilitating the identification of new disintegrins. The classification was based on the disintegrin-like domain of the human metalloproteinase (ADAM-10). We also present the sequence and structural signatures important for disintegrin-integrin interaction, unveiling the relationship between the structure and function of these proteins.
Integrin antagonists comprise molecules that can bind and interfere with the activity of the cellular receptors integrins. Some proteins or peptides are found in snake venoms such as the C-type lectins EMS6 (Marcinkiewicz et al., 2000) and Vixapatin (Momic et al., 2012) or disintegrins (Calvete, 2005a), or found in other animals venoms such as arthropods (Khamessi et al., 2018). The snake venom disintegrin family comprises a group of cysteine-rich proteins (40–100 amino acids) found in the venom from snakes from Elapidae, Viperidae, Atractaspididae, and Colubridae families (Kini and Evans, 1992; Calvete et al., 2005b; Arruda Macedo et al., 2015). These proteins are released in the venom as a result of the proteolytic process of the so-called PII snake venom metalloproteinases (SVMP) (Kini and Evans, 1992; Calvete et al., 2005b). It is worth emphasising that venoms from other venomous animals also contain metalloproteinases similar to that from snake venoms (Xia et al., 2013; Borges et al., 2016). Whether these molecules are able to generate disintegrins is yet to be elucidated. Snake venoms disintegrins were firstly described by the capacity to inhibit the platelet fibrinogen receptor integrin αIIbβ3 (Huang et al., 1987). Disintegrins are capable of modulating the function of a broad range of integrins (Gould et al., 1990; McLane et al., 2004), a family of heterodimeric receptors that play a fundamental role in mediating physiological and pathological processes, such as hemostasis and cancer (Phillips et al., 1980; Hynes, 1987). Many aspects of the disintegrin protein family were reviewed recently: as tools for antitumor activity (Schönthal et al., 2020), for antithrombotic agents (Kuo et al., 2019), recombinant and chimeric proteins on research (Huang et al., 2016; David et al., 2018; Uzair et al., 2018; Cesar et al., 2019; Lazarovici et al., 2019). This review focuses on structural studies of snake venom disintegrins and the features of integrin-disintegrin interaction.
Structural and Evolutive Bases of Disintegrin Molecules
Evolutive Diversification of Metalloproteinases Into Disintegrins
The emergence of disintegrins results from the divergence of snake venom Zn2+-Metalloproteinases (SVMPs). These proteins are present in the venom of the majority of venomous snakes and are capable of degrading the extracellular matrix and/or proteins belonging to the hemostasis system (de Lima et al., 2009; Fox and Serrano, 2009; Mackessy, 2009). SVMPs share a common ancestor with matrix-degrading metalloproteinases, the A Disintegrin And Metalloproteinase (ADAM) family (Moura-da-Silva et al., 1996). SVMPs are classified according to their domain organization: the PIII class contains the metalloproteinase domain followed by the C-terminal disintegrin-like and cysteine-rich domain; the PII class, contains the disintegrin domain at the C-terminal of the metalloproteinase domain and the PI class contains only the metalloproteinase domain (Fox and Serrano, 2009). The subfamily of ADAM containing the thrombospondin type-1 motif (ADAMTS) are also related to SVMPs, but the crystallographic structures showed that the disintegrin-like domain of ADAMTS adopts a different fold and are not structurally homologous to disintegrins (Takeda et al., 2012). For this reason, the analysis presented in this review will consider only the ADAM family.The evolution of disintegrin family was extensively studied by many authors (for review see: Juárez et al., 2008; Calvete, 2013; Fox and Serrano, 2009). The accepted hypothesis is that SVMPs have evolved relatively late from a common ancestor by speciation and positive Darwinian selection. The evolution started by gene duplication of the ancestral PIII disintegrin-like domain followed by neofunctionalization in the snake venoms generating the disintengrin domains of PII (Fox and Serrano, 2009). The evolution from disintegrin-like (PIII) to disintegrin domain (PII) ocurred through the successive loss of disulfide bonds and reduction in size to the different snake venom disintegrin subfamilies (long, medium-sized, dimeric, and short (Kini and Evans, 1992; Calvete, 2005a). The emergence of PII SVMP precursors follows another key event that includes deletion of the C-terminal cysteine-rich domain, due to a mutation causing the appearance of a stop codon, and the removal disulfide bonds (Calvete et al., 2003; Juárez et al., 2006a). PII SVMP undergoes limited proteolysis, releasing disintegrins domains (short, medium, and long) into the venom (Kini and Evans, 1992; Jia et al., 1996). Most proteins from dimeric or short groups are synthesized from short-coding mRNAs, lacking the metalloproteinase domain (Juárez et al., 2006b; Okuda et al., 2002; Bazaa et al., 2007). In fact, analysis of the pre-sequence of the cDNA disintegrin jerdostatin suggests that this disintegrin originates from a short-coding gene, instead of a proteolytic process such as the majority of disintegrin from PII SVMP (Sanz et al., 2005). Moreover, Bazaa et al., 2007 suggested that the shortening of the gene is due to the loss of introns and coding regions that contribute to the formation of the short-coding disintegrins. In summary, disintegrins originate from a multigene family from metalloproteinases that undergoes an accelerated evolutionary pathway resulting in great diversification (Juárez et al., 2008).
Structural Diversification of Disintegrins
Disintegrins have been classified according to the number of residues and disulfide bridges, into four subfamilies: smaller ones are the short disintegrins composed of 49–51 residues and four disulfide bridges; the medium size disintegrins with about 70 residues and six bridges; the largest consist in the long disintegrin with about 84 residues cross-linked by seven disulfide bridges; and homo- and heterodimers disintegrins (McLane et al., 1998; Calvete et al., 2003). Dimeric disintegrins contain subunits about 67 residues cross-linked by 2 interchain cysteine linkages and 4 intra-chain disulfide bonds (Calvete et al., 2000; Bilgrami et al., 2005; Figure 1). The evolution pathway of disintegrin structure diversification involved the reduction of the polypeptide chain and selective loss of pairs of cysteine residues that form disulfide bonds (Calvete et al., 2003; Calvete, 2005a, 2010; Bazaa et al., 2007). Phylogenetic analysis suggests that PII-dimeric and short disintegrins represent the more recent diverging lineages of disintegrins (Figure 1A) (Juárez et al., 2008).
FIGURE 1
Evolutionary relationships between disintegrins and current classification. (A) Dendrogram shows the evolutionary relationships between the different disintegrin subfamilies. The dendrogram also includes the disintegrin-like domain of ADAMs and PIII-SVMPs from which the different snake venom disintegrin subfamilies (long, medium, dimeric and short) evolved through the successive loss of disulfide bonds and size reduction. (B) Current classification of disintegrins: long (∼84 amino acids and 7 disulfide bonds) (connections between cysteines in lavender), medium (∼70 amino acids and 6 disulfide bonds) (connections between cysteines in red), dimeric (∼67 amino acids and 4 intrachain disulfide bonds for each subunit and 2 interchain) (connections in green), and short (41–51 amino acids and 4 disulfide bonds) (connections in blue). The integrin-binding RGD, KGD, MGD, MDL, KTS, and RTS tripeptide motifs localization is indicated by the magenta rectangle. (Adapted from Juaréz et al., 2008).
Evolutionary relationships between disintegrins and current classification. (A) Dendrogram shows the evolutionary relationships between the different disintegrin subfamilies. The dendrogram also includes the disintegrin-like domain of ADAMs and PIII-SVMPs from which the different snake venom disintegrin subfamilies (long, medium, dimeric and short) evolved through the successive loss of disulfide bonds and size reduction. (B) Current classification of disintegrins: long (∼84 amino acids and 7 disulfide bonds) (connections between cysteines in lavender), medium (∼70 amino acids and 6 disulfide bonds) (connections between cysteines in red), dimeric (∼67 amino acids and 4 intrachain disulfide bonds for each subunit and 2 interchain) (connections in green), and short (41–51 amino acids and 4 disulfide bonds) (connections in blue). The integrin-binding RGD, KGD, MGD, MDL, KTS, and RTS tripeptide motifs localization is indicated by the magenta rectangle. (Adapted from Juaréz et al., 2008).Additionally, disintegrins present much more complex structural diversity. To illustrate, disintegrins jarastatin (Bothrops jararaca)
, triflavin (Trimeresurus flavoviridis), and obtustatin exhibit different disulfide bridge patterns and structural features. Jarastatin and triflavin are disintegrins that contain six disulfide bonds with the same cysteine pattern, however, these disintegrins have different cystine connectives (Huang T. F. et al., 1991; Coelho et al., 1999; Cidade et al., 2006). Likewise, triflavin has an elongated and rigid structure composed of turns and antiparallel β-strands, and obtustatin has a compact globular structure composed of turns and without regular secondary structure (Huang T. F. et al., 1991; Coelho et al., 1999; Fujii et al., 2003; McLane et al., 2004; Cidade et al., 2006; Wermelinger et al., 2009). These cysteine pairing characteristics motivated us to perform a re-analysis of the classification of disintegrins based on the 3D structure to seek a better structural/function comprehension of disintegrins (see point section 4.1).Moreover, despite the majority of disintegrins following canonical structure features, some of them follow a different pathway (Calvete et al., 2003). The disintegrin graminelysin belongs to medium-size disintegrins but has some features of PIII derived disintegrins, such as the Cys13-Cys16 disulfide bond, which represent an intermediate step in the evolution pathway of medium-sized disintegrins (Wu et al., 2001). Bilitoxin-1 is a long disintegrin containing an additional cysteine residue involved in a disulfide bond homodimer (Nikai et al., 2000). Another example includes the identification of different PII SVMP transcripts found in the venom of Bothrops neuwiedi, suggesting recombination of PII SVMP with the catalytic domain of PI or PIII SVMP (Moura-da-Silva et al., 2011). This fact is well described below, where the groups are presented based on structural and cysteine pairing features. We also show a cladogram that illustrates the evolutionary path of the disintegrins.
Disintegrin Structure: NMR and Crystallization Studies
The number of disintegrins purified from the venom or identified by proteome or transcriptome analysis grows each year. The NMR studies expanded the panorama of the structure of disintegrins. Nevertheless, the structure of only 16 disintegrins has been solved by NMR or crystallography until now. Figure 2 presents the timeline of experimental information on disintegrin structure.
FIGURE 2
Timeline of the structure determination of Disintegrins. In a time-lapse of 5 years from the first disintegrin discovery.
Timeline of the structure determination of Disintegrins. In a time-lapse of 5 years from the first disintegrin discovery.
An Expanded Classification of Disintegrins Based on the Structural Features, Disulfide Pattern, and Comparison With Ancestral Disintegrin Fold
In this section, we propose an expanded classification based on the structural features of disintegrins and use the disintegrin-like domain of a PIII ADAM as a reference for folding and disulfide pattern. This reference is suitable since it is well established by several in-death phylogenetic studies (Juárez et al., 2008; Fox and Serrano, 2009; Calvete, 2013) that the PIII disintegrin-like domain is ancestral of all disintegrins. As previously mentioned in this work, disintegrins have a vast structural diversity. Within the disintegrin family there is a wide variety of disulfide bridge patterns and protein sizes and, at the same time, conserved structural features (Huang T. F. et al., 1991; Coelho et al., 1999; Cidade et al., 2006; Cheng et al., 2012). For instance, jarastatin and triflavin disintegrins contain six disulfide bonds with the same cysteine pattern while the disintegrin domain of ADAM metalloproteinases present the same size, structural homology, and different disulfide pattern (Huang T.-F. et al., 1991; Coelho et al., 1999; Cidade et al., 2006).Pairing cysteine residues in disintegrins is already known to play an important role in exposing the RGD binding motif that mediates inhibition of platelet aggregation, neutrophils, or endothelial cell function (Blodel and White, 1992; Takagi et al., 2002; Calvete et al., 2003; Calvete, 2005a). In addition, the modulatory activity of the disintegrins depends on the proper pairing of the cysteine residues, contributing to the conformation of the disintegrin structure (Niewiarowski et al., 1994). The conservative aspect of cysteine residues and the disulfide bond pattern between the disintegrins subfamily contribute to the hypothesis of strong selection for maintaining the active conformation of these proteins (Juárez et al., 2008). Therefore, we decided to investigate in more detail the patterns of protein disulfide bridges within the structure of some members of the disintegrin family, which show disintegrin fold either as an isolated protein or as a domain of a larger protein.To select these proteins, we used the PFAM family (PF0020) classification, which describes all the subfamilies and their correspondent structures, contains annotations and multiple sequence alignments. We analyzed the sequence and structure alignment and proposed a classification based on the pattern of disulfide bonds using the ancestral subfamily ADAM as a reference. We assigned letters from A to N for each possible position occupied by these cysteines. The analysis pointed out seven possible disulfide patterns, named group 1, for proteins of ADAM subfamily; group 2, for disintegrins similar to bitistatin A; group 3, bitistatin B; group 4, for kistrin; group 5 for salmosin; group 6, for dimers; group 7, for obtustatin. Figure 3 shows the sequence alignment of each group proposed in this classification.
FIGURE 3
Sequence alignments of representative disintegrins. The sequences were obtained through research in the Blast software (Altschul et al., 1990; Gish and States, 1993). The pink color highlights the sequence signatures, green the residues that are not conserved within the signatures, orange the cysteines. Dotted blue squares show the conserved L/I that compose the hydrophobic core. RGD/KGD or related three-peptides are underlined. The codes representing the sequences were obtained from PDB and UniProt. Abbreviated: VAP-1 (Vascular apoptosis-inducing protein 1); RRV-X (Russell’s viper venom factor X activator); EcoHIT (Viper PIII hemorrhagic SVMP Echis coloratus); EcH (Metalloproteinase Echis pyramidum leakeyi); Gloydius shedaoensis (G. shedaoensis); SSV (saw-scaled Viper). The multiple sequence alignment was performed using the ClustalW2 server (https://www.genome.jp/tools-bin/clustalw).
Sequence alignments of representative disintegrins. The sequences were obtained through research in the Blast software (Altschul et al., 1990; Gish and States, 1993). The pink color highlights the sequence signatures, green the residues that are not conserved within the signatures, orange the cysteines. Dotted blue squares show the conserved L/I that compose the hydrophobic core. RGD/KGD or related three-peptides are underlined. The codes representing the sequences were obtained from PDB and UniProt. Abbreviated: VAP-1 (Vascular apoptosis-inducing protein 1); RRV-X (Russell’s viper venom factor X activator); EcoHIT (Viper PIII hemorrhagic SVMP Echis coloratus); EcH (Metalloproteinase Echis pyramidum leakeyi); Gloydius shedaoensis (G. shedaoensis); SSV (saw-scaled Viper). The multiple sequence alignment was performed using the ClustalW2 server (https://www.genome.jp/tools-bin/clustalw).The classification of the disulfide bond patterns followed the tertiary structure alignments with the reference protein ADAM10 Extracellular Domain (ADAM10, PDB id 6BE6). Only the alignments of primary structure did not provide sufficient information to classify the pattern within the disintegrin family, whereas the comparison with ADAM10 provided the exact location and classification for each cysteine. Figure 4 shows the disulfide pattern for each group and the superposition of one component of each group with ADAM10. It illustrates the rationale of the classification proposed here.
FIGURE 4
Disulfide bonding pattern for each group and the structural superposition of a component of each group with ADAM10. Each group has a characteristic pattern that allowed us to be divided into seven groups, where (A) Group 1 of the ADAM/SVPM; (B) Group 2 of Bitistatin A; (C) Group 3 of Bitistantin B; (D) Kistrin Group 4; (E) Group 5 of Salmosin; (F) Group 6 of the dimers; (G) Group 7 of the Obtustatin/Echistatin (here represented by Obtustatin). In the connectivity scheme, each cysteine is represented by a letter that follows the order from A to N with different colors, where cysteines involved in interchain connections or connections between domains can occur (orange square), and still, cysteines replaced by another amino acid residue (light blue square). (Superposition color). Note that the structures of all groups can be superposed, except for salmosin, which shows a different conformation at the N-terminus.
Disulfide bonding pattern for each group and the structural superposition of a component of each group with ADAM10. Each group has a characteristic pattern that allowed us to be divided into seven groups, where (A) Group 1 of the ADAM/SVPM; (B) Group 2 of Bitistatin A; (C) Group 3 of Bitistantin B; (D) Kistrin Group 4; (E) Group 5 of Salmosin; (F) Group 6 of the dimers; (G) Group 7 of the Obtustatin/Echistatin (here represented by Obtustatin). In the connectivity scheme, each cysteine is represented by a letter that follows the order from A to N with different colors, where cysteines involved in interchain connections or connections between domains can occur (orange square), and still, cysteines replaced by another amino acid residue (light blue square). (Superposition color). Note that the structures of all groups can be superposed, except for salmosin, which shows a different conformation at the N-terminus.For group 1 (Figure 4A), we analyzed 11 representatives, which have a disintegrin domain with up to 14 cysteines. They are organized into 6 disulfide bonds and two cysteines that may form disulfide bonds with cysteines in the other domain of the protein (Jia et al., 1996). The letter-based pattern is defined as AE BC DJ F GI HM L kN, where F and L, which appear here unmatched, are cysteines involved in connections with the other domains. This group is formed by mammalian (ADAM) and snake venom metalloproteinases (SVMP), both containing the disintegrin domain with the same fold.Group 2 and 3 represent two disulfide patterns (alternate folding) for the same sequence. They were first described for the disintegrin bitistatin, group 2 for bitistatin A (Figure 4B), and 3 for bitistatin B (Figure 4C). The disulfide pattern for group 2 is AD BG CF EK HJ IM LN and AG BF CD EK HJ IM LN for group 3 differing only at disulfide bonds at the N-terminal region.Group 4 (Figure 4D) is the group of kistrin, which is also presented as only 1 disintegrin monomeric domain. It consists of 15 proteins that have 12 cysteines, all involved in disulfide bonds. The pattern is AF BE DJ GI HM KN with the cysteines of positions C and L were evolutionarily replaced by other amino acids. We noted that the disulfide pattern is the same as that found for the ancestral subfamily ADAM, differing only at the N-terminal region. Despite this, the detailed analysis of the sequence alignment (Figure 3) of the studied groups showed that the ADAM group (group 1) and the Kistrin group (group 4), have many conserved amino acid sequences in the N-terminal portion.Group 5 (Figure 4E) are also monomeric proteins with 1 disintegrin domain. It consists of 11 proteins, but the only member with a high-resolution structure available is the salmosin. This disintegrin has a different disulfide pattern, represented by AD BE FI HJ GM KN. Interestingly, the cysteines that occupied the C and L positions in the ADAM ancestor were replaced by other amino acids throughout the evolutionary process and for this reason, they are not included in this pairing. Therefore, members of this group include 12 cysteines involved in 6 disulfide bonds. Salmosin is the only available disintegrin structure solved so far that cannot be completely superposed, differing at the N-terminal region, with the reference group (group 1) and other disintegrins.Group 6 (Figure 4F) are the dimeric disintegrins. They include 4 representatives with resolved structures. The dimeric disintegrins are small and present the N-terminal region quite conserved among themselves, but distinct when compared to the other disintegrins. They have 10 cysteines, of which 8 are involved in intra-chain disulfide bonds and two are involved in inter-chain cystine linkages (cysteines E and F). Also, they present evolutionary replacement at the L position for other amino acids. The pattern observed for this group is DJ EF GI HM KN, containing the sequence signature NPCC at the N-terminal (Figure 3 and Table 1).
TABLE 1
Signatures of the sequences of the study groups based on the classification of seven groups.
Sequence
G1
G2
G3
G4
G5
G6
G7A
G7B
GxECDC
✔
✔
✔
✔
✔
—
—
—
CCDAATCKLxxGAQC
—
—
—
✔
✔
—
—
—
GPCCR
—
—
—
—
—
—
✔
✔
CRxARGD
—
✔
✔
✔
✔
—
—
—
CCxQCxF
—
✔
✔
✔
—
—
—
—
DDxCxG
—
—
—
✔
✔
—
—
—
DCPR
—
—
—
✔
—
✔
—
✔
NxCC
—
—
—
✔
✔
✔
—
—
x variation of amino acid residue.
Signatures of the sequences of the study groups based on the classification of seven groups.x variation of amino acid residue.Group 7 (Figure 4G) comprises the small disintegrins, which have members that have, in addition to the RGD motif, KTS, and RTS sequences, with a GJ HM IL KN binding pattern, these disintegrins have 8 cysteines that form 4 disulfide bonds. In addition, group 7 displays few sequence signatures in common with other groups (Table 1).The new classification of the disintegrin family provided the description of the sequence signatures that will enable localization of an unknown sequence within a group and to be able to predict properties such as structural features and functional capabilities. We looked for sequence signatures within each group that may help to classify and model the structure of an unknown sequence. The signature GxECDC sequence is common for the groups G1, G2, G3, G4, and G5, absent in groups G6 and G7 (short and dimeric disintegrins, Table 1). Here, we emphasize that group 7 was subdivided into 7A and 7B.The sequences CRxARGD and CCxQCxF are common to groups G2, G3, G4, and G5, which relate to medium and long proteins. The CRxARGD signature is shared with the salmosin group, a more distinct group due to its more differentiated disulfide bond pattern.We also observed signatures present in some related groups, such as CCDAATCKLxxGAQC and DDxCxG, which were common to groups G4 and G5. Both groups harbor medium-sized disintegrins, but with different folds. We also observed that short disintegrins exhibit the highest variability in their integrin recognition motifs, including, in addition to RGD, the KTS and RTS motifs. This variability in the motifs for integrin recognition was also observed for G6, a group of dimeric disintegrins, which present in addition to RGD, the motifs MDL and VGD and share the DCPR signature at the c-terminal portion with the G4 groups of Kistrin and and some members of G7B Echistatin.We present a simple cladogram analysis based solely on the alignment of the amino acid sequences presented in Figure 3 using Mega (Kumar et al., 2018) (Figure 5), which showed a good correlation with the classification of disintegrins based on their structural properties (Figure 4) and corroborating previous evolutionary and structural studies (Calvete et al., 2003; Calvete, 2005a). We want to make it clear that the cladogram showed in Figure 5 reports well the clade of each group, but not the temporal evolution line. For an in-depth phylogenetic analysis, see Juárez et al., 2008, Calvete, 2013 and Fox and Serrano, 2009. The goal of presenting the cladogram is to support the classification. This analysis, together with data from the literature, showed that the evolutionary process that resulted in structural and functional diversification within the family of disintegrins, involved a reduction in the number of cysteine residues and successive losses of disulfide bonds (Juárez et al., 2008; Calvete, 2010; Carbajo et al., 2015). The different disulfide bond patterns (Figure 4) highlighted the idea that the disintegrins represent an example of the divergent evolution of a conserved structural motif (Carbajo et al., 2015).
FIGURE 5
The sequence clustering of disintegrin. Simplified neighbor-joining method, revealing the main sequence clusters (clades) of the disintegrins using the ClustalW2 server (https://www.genome.jp/tools-bin/clustalw) and MEGA software (Molecular Evolutionary Genetics Analysis) (Kumar et al., 2018) as a tool for sequence alignment, and clustering. This analysis report is useful for sequence clustering in clades, but not for the temporal evolution analysis. The goal of presenting the cladogram is to support the classification. The ancestor of an ADAM gene (orange), and the emergence of the disintegrin family through the successive loss of disulfide bonds. The cysteine residues that are lost along the evolutionary pathway result in divisions in disintegrins long (lavender) that comprises groups 2 and 3; medium disintegrins (red), group 4; medium disintegrins with a non-canonical disulfide pattern, which has Salmosin as a representative of group 5 (yellow); dimeric disintegrins, gathered here in group 6; and short disintegrins divided into groups 7A, Obtustatin group (blue) and group 7B, Echistatin group (pink).
The sequence clustering of disintegrin. Simplified neighbor-joining method, revealing the main sequence clusters (clades) of the disintegrins using the ClustalW2 server (https://www.genome.jp/tools-bin/clustalw) and MEGA software (Molecular Evolutionary Genetics Analysis) (Kumar et al., 2018) as a tool for sequence alignment, and clustering. This analysis report is useful for sequence clustering in clades, but not for the temporal evolution analysis. The goal of presenting the cladogram is to support the classification. The ancestor of an ADAM gene (orange), and the emergence of the disintegrin family through the successive loss of disulfide bonds. The cysteine residues that are lost along the evolutionary pathway result in divisions in disintegrins long (lavender) that comprises groups 2 and 3; medium disintegrins (red), group 4; medium disintegrins with a non-canonical disulfide pattern, which has Salmosin as a representative of group 5 (yellow); dimeric disintegrins, gathered here in group 6; and short disintegrins divided into groups 7A, Obtustatin group (blue) and group 7B, Echistatin group (pink).
Disintegrin Structural Properties
Since the purification of Echistatin, isolated for the first time in 1988 from the venom of Echis carinatus snake, some disintegrins’ structures were solved leading to a better understanding of the structure-activity relationship. As mentioned before, the disintegrins vary in size and fold, but they display many conserved structural features. Table 2 summarizes the information about all disintegrins that had their structure solved so far. They are stabilized by multiple disulfide bonds (from 4 to 7), they lack a canonical hydrophobic core and well-defined secondary structures.
TABLE 2
Summary of the sequence, ligands, structural information and references of the disintegrins structures reported in PDB database.
Disintegrin name
Length (aa)
Number of cysteines
Ligand
Specie
Loop
Method
RMSD resolution (Å)
PDB ID
Group
References
Bitistatin
89
14
αvβ3
Bitis arietans
RGD
NMR
0.50
2MOP/2MP5
2–3
Carbajo et al. (2015)
Kistrin
68
12
αvβ3
Agkistrodon rhodostoma
RGD
NMR
0.47
1N4Y
4
Cooke et al. (1992)
Rhodostomin
68
12
αvβ3/αIIbβ3
Calloselasma rhodostoma
RGD
NMR
0.61
2PJF
4
Guo et al. (2001)
Rhodostomin R46E mutant
68
12
αIIbβ3
Calloselasma rhodostoma
RGD
CRYSTAL
NA
4R5U
4
Huang et al. (2015a)
Rhodostomin D51E mutant
68
12
αvβ3
Calloselasma rhodostoma
RGD
NMR
0.59
2PJG/2PJF
4
Chen et al. (2009)
Rhodostomin P48A mutant
68
12
αvβ1
Calloselasma rhodostoma
RGD
NMR
0.57
2PJI
4
Chuang et al. (2007)
Rhodostomin G50L mutant
68
12
αvβ3
Calloselasma rhodostoma
RGD
NMR
0.45
2LJV
4
Chuang et al. (2012)
RhodostominARGDP mutant
68
12
αIIbβ3/αvβ3
Calloselasma rhodostoma
RGD
CRYSTAL
NA
4M4C
4
Chang et al. (2014)
RhodostominKKKRT mutant
68
12
αIIbβ3/αvβ3
Calloselasma rhodostoma
RGD
CRYSTAL
NA
4R5Rb
4
Huang et al. (2015b)
Rhodostomin ARLDDL mutant
72
12
αvβ3
Calloselasma rhodostoma
RGD
CRYSTAL
NA
3UCI
4
Shiu et al. (2012)
Disintegrin with the AKGDWN motif
68
12
αIIbβ3
Calloselasma rhodostoma
RGD
NMR
0.75
1Q7I
4
Chuang et al. (2004)
Rhodostomin mutante Protein RGD Motif C-terminal mutation
79
12
αIIbβ3
Calloselasma rhodostoma
RGD
NMR,
0.54
2M7F
4
Chang et al. (2014)
Rhodostomin mutante Protein RGD Motif C-terminal mutation
79
12
αIIbβ3
Calloselasma rhodostoma
RGD
NMR
0.51
2M7H
4
Chuang et al. (2013)
Rhodostomin mutante Protein RGD Motif C-terminal mutation
76
12
αIIbβ3
Calloselasma rhodostoma
RGD
NMR
0.60
2M75
4
Chuang et al. (2013)
Flavoridin
70
12
αIIbβ3
Trimeresurus flavoviris
RGD
NMR
0.85
1FVL
4
Senn and Werner, (1993)
Trimestatin
70
12
αvβ3
Trimeresurus flavoviris
RGD
CRYSTAL
NA
1J2L
4
Fujii et al. (2003)
Acostatin
64
10
αIIbβ3
Agkistrodon contortrix contortrix
RGD
CRYSTAL
NA
3C05
4–5
Moiseeva et al. (2008)
Albolabrin
84
12
αIIbβ3
Trimeresurus albolabris
RGD
NMR
NA
NA
5
Smith et al. (1996)
Salmosin
70
12
αIIbβ3
Agkistrondon halys
RGD
NMR
0.49
1L3X
5
Shin et al. (2003)
Schistatin
64
10
αIIbβ3/αvβ3
Echis carinatus
RGD
CRYSTAL
NA
1RMR
6
Bilgrami et al. (2004)
Heterodimer from Saw-scaled viper
64
10
αIIbβ3/αvβ3
Echis carinatus
RGD
CRYSTAL
NA
1TEJ
6
Bilgrami et al. (2005)
Saw-scaled viper
64
10
α4β1/α4β7
Echis carinatus
MLD
CRYSTAL
NA
1Z1X
6
Hassan et al. (2005)
Jerdostatin
46
8
α1β1
Protobothrops jerdonii
RTS
NMR
0.64
2W9O
7A
Carbajo et al. (2011)
Viperistatin
44
8
α1β1
Vipera palestinae
KTS
NMR
NA
NA
7A
Kisiel et al. (2004)
Obtustatin
41
8
α1β1
Vipera lebetina obtusa
KTS
NMR
0.46
1MPZ
7A
Moreno-Murciano, (2003)
Echistatin
49
8
αIIbβ3
Echis carinatus
RGD
NMR
0.52
2ECH/IRO3/6LSQ
7B
Chen et al. (1991)
Summary of the sequence, ligands, structural information and references of the disintegrins structures reported in PDB database.The disintegrins structure consists of a series of loops tightly held together by disulfide bonds with almost no regular secondary structure. All hydrophobic residues are at least partially exposed to the protein surface, except for one leucine/isoleucine residue that is buried in a protein core. This I/L residue is present in all groups and should be important for protein stabilization. More studies are necessary to understand the role of the hydrophobic residues in the overall folding. The exposure of hydrophobic residues and folding without a canonical hydrophobic core is not exclusive for disintegrins and it is also present in defensins (Machado et al., 2018; Pinheiro-Aguiar et al., 2020). It was recently proposed that these proteins are stabilized by hydrophobic surface clusters acting as an independent folding unit. In the absence of a canonical hydrophobic core, the surface clusters promote the folding by the interaction of exposed hydrophobic residues with the adjacent side chains regulated by solvation forces (Almeida et al., 2021).There are exceptions for the disintegrin general structure. One exception is bitistatin, which is mainly found in two different forms, as bitistatin A, and B. Both have an identical primary structure, their molecular architecture can be defined, generally, as a fold with an elongated shape, including a “ladder” of seven disulfide bonds, representing the dominant organizational characteristic of polypeptide folds. Some differences are found in the intra-domain interactions between the two bitistatins. In bitistatin A, some hydrophobic interactions are present between Val4-Pro21 and between Ile9 and the fragment Glu11-Cys18. Also, some inter-domain interactions are found with the aromatic ring of Tyr44 and the side chain of the sequence Ile9-Glu14, and between Leu36 and the residues Ser40-Tyr44. In contrast, the Bitistatin B has the Pro3 positioned adjacent to the side chain Glu11-Gln12, where the Val4 interacts with the residues Gly6-Gln12. An important role of the Leu10 was described, in Bitistatin B, to form a hydrophobic core to stabilize the protein with an Asn43-Tyr44 interaction. On the other hand, Bitistatin A shows a hydrophobic interaction with the Leu36 to the Ser40-Tyr44 (Carbajo et al., 2015).Also, these groups share some signature sequences (Table 1) from group 1 (Adams and SVMP) to group 5 (medium disintegrins), which corroborate the evolutionary idea that disintegrins derived from long to short disintegrins (Bazaa et al., 2007).Another example is acostatin classified as group 6, a heterodimeric disintegrin isolated from the venom of Agkistrodon contortrix contortrix. The crystal structure shows a tetramer (dimer of dimer), with each dimer with a similar fold. The structures present the same disulfide pattern of group 6, with Cys residues DJ, GI, HM, and KN forming intrachain disulfide bridges and E and F interchain disulfide bridges. Both dimers form an identical disulfide pattern. (Figure 4) (Moiseeva et al., 2008). Interestingly, if only the sequence were to be taken into account, as in the cladogram construction, chain α would be wrongly classified as group 4 and chain β as group 5 (Figure 5).One of the most different structures reported so far is the Salmosin (group 5). This disintegrin isolated from the Agkistrodon halys venom has the RGD motif conserved with an unusual finger shape and is distal from the rigid nucleus of the C terminal domain. In addition, although the RGD motif does not interact with the hydrophobic nucleus of the protein, it has been stabilized by a network of molecular contacts through a small beta antiparallel sheet comprising residues of Ile46-Ala50 and Asp54-Tyr58. The distribution of electrostatic charge on the surface of the salmosin differs dramatically, in comparison with other disintegrins, showing a cluster of negatively charged residues near the RGD loop (Shin et al., 2003). NMR data further indicated that salmosin has a topology similar to kistrin (member of group 4D), although the two molecules have entirely different disulfide bond patterns. Furthermore, it has been shown that salmosin is also made up of several closed folds and irregular loops, including residues Gly3-Gly9, Cys15-Cys21, Lys22-Lys27, Leu33-Leu38, Gly44-Ile46, and Gly62-Gly65, and that these loops are stabilized by a disulfide bond matrix across Cys AD, C, BE, FI, HJ, GM, L and KN (Shin et al., 2003).
Integrins
Integrins are a large family of surface receptors, involved in cell-cell and cell interactions with extracellular matrix components, present in biological processes such as angiogenesis and hemostasis (Hynes, 1992; Ata and Antonescu, 2017). Integrins are heterodimers formed by one α and one β-subunit, stabilized by noncovalent bonds (Barczyk et al., 2010) (Figure 6A). Each integrin subunit combines to form 24 heterodimers, composed of 18 α subunits and 8 β (Barczyk et al., 2010). Each subunit has a transmembrane polypeptide type I; containing three domains, a glycosylated domain, a hydrophobic domain, and an endo-domain. The α subunit can vary in the range of 120–180 Kda. At the N-terminus, the α subunit possesses seven homologous domains of 50 amino acids each. The extracellular portion of α subunit is composed of two calf domains, one thigh domain, one β-helix domain (β-propeller), and one ligand-binding I domain (interactive domain) named αI (Shattil and Newman, 2004; Barczyk et al., 2010; Dermont et al., 2010). β-subunit varies from 95 to 117 Kda. Each β-subunit contains a divalent cation binding site located at 100 residues from the amino terminus (Berman et al., 2003). The extracellular portion of the β subunit has four EGF-like domains, one type I domain inside in the hybrid domain that composes the integrin head, known as βI domain (Dermont et al., 2010; Coller, 2015). αI and βI domains possess specific binding regions for metallic ions. The presence of magnesium ion in the domain I of αI, known as metal ion dependent-adhesion site (MIDAS), modulates the binding of integrin to the specific ligand. Some integrins, such as αIIbβ3, lacks αI domain, in that case, the ligand-binding lies between βI domain and β-propeller of the α subunit (Dermont et al., 2010; Coller, 2015). Ligand-binding by αI and βI domains promotes a conformational change in the extracellular integrin portion, promoting separation of transmembrane and cytoplasmic portion, a process known as outside-in activation. Intracellular signaling promotes conformational changes in the cytoplasmatic integrin tail until the ligand-binding domain, known as inside-out activation (Barczyk et al., 2010; Dermont et al., 2010). Three conformational states are reported for integrin receptors: a low-affinity state, where the integrin structure is bent (Figure 6A); an intermediate affinity state triggered by the binding to the ligand, where the integrin structure is elongated, however partially activated; and a high affinity conformational, also triggered by the ligand-binding, where the integrin structure is extended and opened (Figure 6A) (Barczyk et al., 2010; Dermont et al., 2010). Structural studies on integrins have been solved mainly by x-ray crystallography of the extracellular structure fragments, αI domain, transmembrane and cytosolic complex, and intracellular protein in complex with the cytosolic tail of integrins (Liddington, 2014). Although few examples of structural experiments of integrin were reported, some integrins described on disintegrin structure studies, such as αIIbβ3 and αvβ3, have been characterized.
FIGURE 6
Integrin structure and activation. (A) General representation of integrin in the inactive closed state and the active ligand-bound state. The subunit domains are highlighted, where: EGF, epidermal growth factor; βTD, beta-tail domain. (B) crystal structure of alphaVbeta3 integrin interacting with fibronectin. The chains are represented by different colors, in which cyan we have the alpha chain, in green, we have the beta chain and in orange we have fibronectin. We can also see more broadly that the RGD domain (highlighted in magenta) of fibronectin is involved in the interaction with the integrin alphaVbeta3 (PDB 4MMX). The RGD motif binds to the head of αVβ3, in this interaction Tyr122 of the β subunit is important and is also conserved in other integrins, such as α5β1 and β2 integrins, which, like αVβ3, are drug targets (Goodman and Picard, 2012; Nagae et al., 2012). In addition, Asp 118 and 150 also participate in the interaction.
Integrin structure and activation. (A) General representation of integrin in the inactive closed state and the active ligand-bound state. The subunit domains are highlighted, where: EGF, epidermal growth factor; βTD, beta-tail domain. (B) crystal structure of alphaVbeta3 integrin interacting with fibronectin. The chains are represented by different colors, in which cyan we have the alpha chain, in green, we have the beta chain and in orange we have fibronectin. We can also see more broadly that the RGD domain (highlighted in magenta) of fibronectin is involved in the interaction with the integrin alphaVbeta3 (PDB 4MMX). The RGD motif binds to the head of αVβ3, in this interaction Tyr122 of the β subunit is important and is also conserved in other integrins, such as α5β1 and β2 integrins, which, like αVβ3, are drug targets (Goodman and Picard, 2012; Nagae et al., 2012). In addition, Asp 118 and 150 also participate in the interaction.In addition to structural characteristics of αIIbβ3 integrin, crystallography study of integrin αvβ3 reveals that NH2-terminal segments of the α and β subunits consist of an ovoid head with two parallel tails. The head is formed by a seven-bladed β-propeller from αv and a βA domain. Also, four solvent-exposed Ca2+ binding sites are found in the A-B β hairpin loops of blades 4 to seven at the propeller´s bottom. Finally, the αv ends in the thigh and calf-1 and 2 domains. Each calf domain contains two antiparallel β sheets, one with four strands and the other one with five (Xiong et al., 2001). Binding to RGD peptide, reveals that the pentagonal peptide fits into a crevice between the propeller and the βA domains on the integrin head. Also, the binding is associated with tertiary and quaternary changes in the integrin; affecting the α-1 α-2 loops and helices and the α2-C, F-α7 loops. At the same time, the βA and the αv propeller suffer a small change, with the two domains moving closer together at the peptide-binding site (Xiong et al., 2001). A study of the crystal structure of integrin αvβ3 with a cyclic pentapeptide ligand RGD showed that the peptide fits in a crevice between the β-propeller and the βA domains on the integrin head. Also, this study reveals that the Arg side chain gets into a narrow groove at the top of the propeller domain. Furthermore, the Asp made contact with the βA involving the Asp carboxylate group, this made a kind of network of polar interactions, making the Asp side chain interact primarily with the βA residues. On the other hand, the Gly residue makes contact in an interface between the α and the β, making several hydrophobic interactions with the αV. The fact that the Asp interacts with the βA suggests this residue is responsible for the recognition of the βA (Xiong et al., 2001). Figure 6B shows the interaction of αvβ3 integrin with the RGD domain of fibronectin, representing the fit of the protein into the domains of the integrin. Possibly, disintegrins interact similarly, but a high-resolution structure of the disintegrin/integrin complex is not available.
Structural Features of Disintegrin-Integrin Interaction
Only a very few reports show the interaction between disintegrin and integrins. The recognition of Integrins by disintegrins is mediated by the specific binding loop of each disintegrin, some of them containing the RGD sequence. In that matter, it is of interest to look upon the interaction of peptides containing RGD sequence and integrins.
Functional Diversification of Disintegrins Motifs and Bases of Integrin-Modulation
Functionally, the evolution of the disintegrin family is influenced by positive Darwinian selection, guiding the adaptation of the conformation loop and C-terminal to the target integrin receptor (Juárez et al., 2008). The evolution of integrin-binding motif on disintegrins emerges from an ancestral Arg-Gly-Asp (RGD) sequence, according to phylogenetic and codon substitution studies, to a panel of integrin receptors targets (Juárez et al., 2008). RGD sequence emerged from a subgroup of PIII-SVMP with 66RDECD70 sequence. Minimum mutations accomplish the conversion of RDE to RGD sequence, and three mutations (DNA) were the minimum codon changes required to the emergence of inhibitory integrin motifs (Juárez et al., 2008). Dimeric disintegrins present different integrin-binding motifs, suggesting fast evolution and cumulative structure changes (Calvete et al., 2003). Disintegrin inhibitory loop motifs are represented by RGD, capable of modulating integrins such as αIIbβ3, αvβ3, and α5β1. Variations of RGD motif include KGD, VGD, MGD, and WGD. KTS and RTS selectively modulate α1β1; MLD motifs, capable of modulating integrins such as α9β1, α4β1 and α4β7 (Table 2); (Sanz et al., 2006). Most RGD-disintegrins are monomeric proteins and this motif is present in all homo-dimeric and some subunits of hetero-dimers disintegrins (Walsh and Marcinkiewicz, 2011).Different structural features of disintegrins define the interaction with the integrin receptors. Conserved aspartate within the disintegrin motif might be responsible for binding to integrin receptors, while the specificity is ruled by the other two residues within the disintegrin motif. Further, the residues flanking the tripeptide motif, the conformation of the motif loop, and the C-terminal of the disintegrin sequence defined the integrin-binding and selectivity (Calvete, 2005a). This characteristic modulates the selectivity of disintegrins to different integrins, as represented in Table 3, which shows the selectivity interactions of disintegrins, divided according to each group of our classification, through different receptors, indicating the affinity of each member.
TABLE 3
Affinity and interactions with integrins to each group of disintegrins according to the classification proposed.
Groups
Integrins
REF
Disintegrin
β3
β1
β 6
β7
β5
β2
αIIb
αv
α2
α5
α6
α3
α4
α9
α1
α2
α7
αv
α4b
αv
αM
G 1
VAP1
—
—
No
Yes
Yes
Yes
No
—
—
—
—
—
—
—
—
Araki et al. (2002)
SVMP Elapid (atragin)
Yes
Yes
—
No
—
—
—
—
—
—
—
—
—
—
—
Wu et al. (2016)
Factor X activator (Russells viper venom factor X activator RVV-X)
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
—
Takeya et al. (1992)
ADAM 22
—
Yes
No*
Yes
No*
No*
Yes
—
No*
—
—
—
—
—
D’Abaco et al. (2006)
ADAM 10
—
—
—
Yes
—
—
—
No
—
—
—
Yes
—
—
—
Eto et al. (2002); Jones et al. (2013); Siney et al. (2017)
G 2/3
Bitistatin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Calvete et al. (1994); Pfaff et al. (1994); Mcquade et al. (2004)
G 4
Kistrin
Yes
Yes
No
—
—
—
—
—
—
—
—
—
—
—
Juliano et al. (1996); Dennis et al. (1990); Tselepis et al. (1997)
Rhodostomin
Yes
Yes
—
No
—
No
—
—
—
No
—
—
—
No
Chang and Lo (1998); Yeh et al. (2001); Yang et al. (2005)
Flavoridin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Pfaff et al. (1994)
Tzabcanin
—
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Saviola et al. (2016)
Barbourin
Yes
No
—
No
—
—
—
—
—
—
—
—
—
—
—
Pfaff et al. (1994)
Cerastin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Scarborough et al. (1991)
Tergeminin
Yes
Yes
—
No
—
—
—
—
—
—
—
—
—
—
—
Scarborough et al. (1993)
Jararacin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Scarborough et al. (1993)
Lutosin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Scarborough et al. (1993)
Viridin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Scarborough et al. (1993)
Molossin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Scarborough et al. (1993)
Triflavin
Yes
Yes
—
Yes
—
—
—
—
—
—
—
—
—
—
—
Sheu et al. (1996); Kaku et al. (1997)
Cotiarin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Scarborough et al. (1993)
G 5
Salmosin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Park et al. (1998); Kang et al. (1999); Marcinkiewicz et al. (2003)
Adinbitor
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
—
Tur-Fu et al. (1991)
Jarastatin
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Yes
Coelho et al. (2004); Wermelinger et al. (2009)
Saxatilin
Yes
—
—
—
—
—
—
—
No
No
—
—
—
—
—
Hong et al. (2002); Sohn et al. (2008)
Albolabrin
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
—
Lu et al. (1994)
G 6
Disintegrin VLO4
—
—
Yes
Yes
No
—
—
—
No
No
—
—
—
—
—
Calvete et al. (2003)
Labein
Yes
—
—
Yes
Yes
Yes
—
—
—
—
Yes
—
—
—
—
Gasmi et al. (2001); Eble et al. (2003); Kusuma et al. (2012)
Disintegrin VB7A
—
—
Yes
Yes
No
—
—
—
No
No
—
—
—
—
—
Calvete et al. (2003)
Acostatin
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
—
Okuda et al. (2002)
Schistatin
Yes
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
Okuda et al. (2001)
disintegrin from Saw-scaled viper
Yes
Yes
—
—
—
—
Yes
—
—
—
—
—
Yes
—
—
Okuda et al. (2001)
Disintegrin EO4A
—
—
Yes
Yes
No
Yes
—
—
No
No
—
—
—
—
—
Calvete et al. (2003)
G 7A
Labestatin
—
—
—
—
—
—
—
—
Yes
—
—
—
—
—
—
Olfa et al. (2005)
Viperistatin
—
—
—
—
—
—
—
—
Yes
Yes
—
—
—
—
—
Staniszewska et al. (2009); Bazan-Socha et al. (2014)
Jerdostatin
—
—
—
—
—
—
—
—
Yes
—
—
—
—
—
—
Sanz et al. (2005)
Obustatin
—
—
—
—
—
—
—
—
Yes
—
—
—
—
—
—
Marcinkiewicz et al. (2003); Calvete et al. (2007); Ghazaryan et al. (2015)
G 7B
Multisquamatin
Yes
—
—
—
—
—
—
—
—
—
—
—
—
—
—
Trikha et al. (1994); Okuda et al. (2001)
Echistatin
Yes
Yes
Yes
—
—
—
—
—
Yes
—
—
—
—
—
McLane et al. (1996); Surazyński et al. (2005); Szabo et al. (2012); Jing and Sun (2019)
Yes – disintegrin interact with integrin, No – disintegrin not interact with integrin, * – not interact with the alpha portion.
Affinity and interactions with integrins to each group of disintegrins according to the classification proposed.Yes – disintegrin interact with integrin, No – disintegrin not interact with integrin, * – not interact with the alpha portion.Moreover, the modulatory activity of disintegrins depends on the appropriate pairing of cysteine residues, contributing to the conformation of the disintegrin structure (Niewiarowski et al., 1994). The conservative aspect of cysteine residues and disulfide bond pattern among disintegrins contribute to the hypothesis of strong selection to maintain the active conformation of these proteins (Juárez et al., 2008).
Studies of the Structure-activity Relationship
Integrins recognize many physiological ligands, including soluble and surface proteins. X-ray crystallographic structures of the extracellular domains of αVβ3 have provided insights into the integrin structure-function relationship (Xiong et al., 2002; Van Agthoven et al., 2014). Disintegrins containing RGD or KGD motifs have been reported as unique and potentially useful tools to investigate integrin-ligand interactions. This is because these motifs serve as an integrin ligand-binding site, through which it plays a key role in interacting with integrin receptors. However, these studies are still scarce, for example, there is no report of a structure obtained by crystallography of a disintegrin-integrin complex, most of the studies were conducted by docking analysis.In general, the RGD motif and the C-terminal of disintegrins have been studied to modulate the Integrin-Disintegrin interaction. To illustrate, the molecular docking of rhodostomin into Integrin αIIbβ3, reveal a series of interactions between the Arg residue of the disintegrin with Asp 224 of the Integrin by salt bridges and with Tyr189 and Ser255 by hydrogen bonds. Also, the Asp residue of the RGD loop demonstrates to interact with the Ser123 of the β3 subunit (Chang et al., 2017). Likewise, the C-terminus of echistatin showed to interact with integrin αvβ3. The molecular docking of the echistatin into the integrin αvβ3 showed that the M28 of the side chain may interact with the D126 of the β3 subunit and the H44 of the C-terminus and the K45 of the C-terminus may interact with the side chain of M180 of the β3 subunit (Chen et al., 2020). In this regard, variations in the C-terminal region of disintegrins may be a requirement for the recognition of different integrins. Otherwise, the hydrophobic residues of flavoridin and kistrin dictate the specificity for the αIIbβ3 integrin. Also, analyses of the trimestatin reveal that the residues Pro53 and Trp67 may be important for the recognition of the β3 subunit (Bilgrami et al., 2004).Up to appoint, the interactions between the RGD loop and the C-terminus of the disintegrins play a key role in integrin recognition. Flavoridin shows the contact of the RGD loop and the C terminal domains, involved between the residues Cys27, Ala28, Asp29 and residues Gly7, Asn11, Cys13, Leu21 and the other between Cys13, Ala25, and Cys26. Further, the studies of Senn 1993 found a connection of the C-terminal of the molecule in the residues Cys64 to Trp67 to the loop containing the RGD sequence, which suggests a role of the C-terminal in the recognition and interaction of the disintegrin with the integrin (Senn and Werner, 1993). Many studies have shown that the residues that flank the RGD motif and in the C-terminal region of the disintegrins affect their specificities and binding affinities to integrins (Dennis et al., 1993; Marcinkiewicz et al., 1999; Chen et al., 2009; Cheng et al., 2012).Finally, the study of the interaction of jararacin and jarastatin with αIIbβ3 integrin by docking reveals, in the first place, that jararacin has more interactions with this integrin. The RGD motif of the jararacin interacts with both subunits of the integrin, making hydrogen bonds and ionic interactions, and indicates that the N-terminus and C-terminus region interact with both subunits of the integrin too. In the same way, the docking complex of the jarastatin showed hydrogen and ionic interactions and interactions of the N-terminus and C-terminus with both parts of the integrin (Wermelinger et al., 2009).
Mutants in the Study of the Structure-Activity Relationship
Several studies have shown that the amino acid residues that flank the RGD motif and the C-terminal region of the disintegrins modulate their specificity of interaction with integrin complexes (Dennis et al., 1993; Scarborough et al., 1993; McLane et al., 1998; Rahman et al., 1998; Cheng et al., 2012; Shiu et al., 2012). For example, disintegrins that have an ARGDW sequence showed a greater affinity for binding to the αIIbβ3 integrin, while disintegrins with an ARGDN sequence preferentially bind to αvβ3 and α5β1 integrins (Scarborough et al., 1993). To a better understanding of these interactions, some mutants were produced over the years. To investigate the structural basis of the integrin inhibitory potency, some studies have made mutations in the amino acid residues of the interaction loop of the disintegrins and analyzed the effects produced on the structure and activity of the disintegrins by replacing the native motif (Chen et al., 2009; Carbajo et al., 2011; Calvanese et al., 2015; Chang et al., 2017). These studies have revealed that mutations in the disintegrin that change the aspartate to glutamate, in the RGD loop, decrease their activity, as for Kistrin, which showed a 100-fold decrease in activity with this mutation (Dennis et al., 1993). Likewise, the rhodostomin (Rho) D51E mutant (2PJG/2PJF) was 1,000 times less active than Rho in inhibiting integrins.A powerful tool to investigate this is NMR spectroscopy and molecular docking has been used as valuable tools to study the relationship between the structure, dynamics, and function of the mutant strains of the rhodostomin protein (2PJF) (Table 2) to understand the important structural requirements for the recognition of integrins. The structural study carried out by Chen et al. (2009), found that Rho and its mutant have the same tertiary fold with three double-stranded antiparallel beta-sheets. There are no structural differences between the RG [D/E] loop. Two small differences between Rho and its mutant D51E were found only in its backbone dynamics and 3D structures. The relaxation parameter R2 value of E51 is 13% higher than that of residue D51. A difference in charge separation of 1.76 A was found between the positive (R49) and negative (D51 or E51) side chains.The coupling of Rho to the αvβ3 integrin, by molecular docking, showed that the amide and carbonyl groups of the main structure of the amino acid D51 of Rho formed hydrogen bonds with the amino acids R216 and R214 of the integrin. However, this hydrogen bond does not exist in the structure of the complex formed by the mutant protein D51E and the integrin. Therefore, the study suggests that the hydrogen bonds between the side chain and the backbone of the D51 residue of Rho and integrin are important for their binding to integrin (Chen et al., 2009). In addition, the Rho mutant P48A (2PJI) has its inhibitory capacity to αvβ1 integrin increased by 4.4 times. Docking of P48A showed no difference in the structure of the complex with α5β1 integrin, pointing out the importance of the dynamics, especially of the RGD loop. The mutant P48A caused differences in the order parameter (S2), conformational exchange contribution (Rex), and local correlation time (τe). The authors showed that the thermal flexibility (S2), which are motions in the pico to nanoseconds timescale, are increased for residues R49, G50, and D51 in the mutant P48A (Shiu et al., 2012).The Rho mutant G50L (2LJV) is a disintegrin that specifically binds to αvβ3 integrin. According to Shiu et al. (2012), the docking models of the mutant G50L and integrin showed that the amino acid L50 mutant G50L can be accommodated by a cavity within the interface between the αv and β3 subunits of the αvβ3 integrin. In contrast, such a pocket is not found in αIIbβ3 integrin, due to the formation of hydrogen bonds between αIIb residue Y190 and residue R216 of β3 subunits, resulting in blocking the bonding of residue L50. It was also observed that the G50L mutation increased the rigidity of the RLD motif, and the adjacent residues exhibited a slow conformational exchange. This finding shows that the slow movements of the RLD motif also play a vital role in modulating the integrin recognition link (Chuang et al., 2012; Shiu et al., 2012).The ARLDDL mutant (3UCI), a potent and selective αvβ3 integrin antagonist, was designed to investigate the function-structure-dynamic relationship. The 3D structure of the ARLDDL mutant was determined by X-ray crystallography, and its tertiary fold is the same as the reported disintegrin structures. The only difference found in the RLD motif of the ARLDDL loop was a compact β-turn structure with a distance of 5.5 Å between R49 (Cα) and D52 (Cα) compared to those of disintegrins ranging from 6.8 to 8.4 Å (Shiu, 2011; Shiu et al., 2012).In the study by Chang et al. (2017), it was observed that the content of the sequence of the RGD loop and the C terminal of the disintegrins mutually affected their conformations, resulting in functional and structural differences in the integration of the integrin. Structural analysis by NMR showed that Rho mutants containing a 48ARGDWN-65PRNPWNG sequence exhibited the highest selectivity in inhibiting cell adhesion mediated by αIIbβ3 integrin. The results, of molecular docking, suggested that the content of the sequence and the length of the C-terminal regions in the disintegrins are critical to their ability to bind to the αIIbβ3 integrin (Chang et al., 2017).Carbajo et al. (2011) made changes to the structure of jerdostatin by replacing the native RTS motif with KTS. These authors demonstrated by NMR that wild-type jerdostatin and its mutant R24K present a common structure, but different dynamic profiles. They found differences in movements on the picosecond to nanosecond time scale and deceleration movements for some residues of the R24K mutant compared to the wild type of jerdostatin (Carbajo et al., 2011). According to the authors, these findings may explain the reduction in the inhibitory potency of the integrin of the mutant jerdostatin R24K (IC50 703 nM) compared to the wild type (IC50 180 nM) (Carbajo et al., 2011). A complementary study by Calvanese et al. (2015), used Molecular Dynamics (MD) simulations of the two molecules (wild type jerdostatin and it is mutant) to explore in atomic resolution the structural bases of their different dynamic behaviors, to identify the atomic movements that could differentiate their behaviors dynamic and, therefore, their properties/activities. The analysis confirmed the combined movements between the recognition loop and the C-terminal tail considered relevant to the functional capacity of jerdostatin. It also revealed the residues that dominate such a mechanism. Indicating that both wild-type jerdostatin and the R24K mutant share a common structure but differ in global movements. Studies like this are important to clarify whether disintegrins movements can be functional for integrin binding (Sanz-Soler et al., 2012; Calvanese et al., 2015).This kind of study helped the development of pharmacological agents that block platelet aggregation by inhibition of integrin αIIbβ3. Some commercialized drugs are the eptifibatide, a cyclic heptapeptide that originated from the disintegrin barbourin, and the tirofiban, which originated from the disintegrin echistatin. The barbourin is a disintegrin isolated from the venom of Sistrurus miliarius barbouri, which possesses a KGD loop and a high affinity to integrin αIIbβ3 (Tcheng and O’Shea, 2002). Jing and Lu 2005, investigated a mutant proinsulin chimera with eight amino acids from barbourin (CAKGDWNC, respectively). Interestingly, they found that the protein inhibits human platelet aggregation induced by ADP and retains its binding activity to the insulin receptor. Also, Xiao et al., 2004, showed how the eptifibatide fits into the Integrin αIIbβ3, demonstrating some hydrophobic contacts with the Lys, based on the binding loop of barbourin, with Phe231 in the αIIb β-propeller. In addition, the Asp224 of the residue αIIb may form hydrogen bonds with the eptifibatide.Otherwise, interaction studies between tirofiban (based on echistatin) showed that the co-crystal between the drug and the Integrin αIIbβ3 reveals that the sulfonamide groups (in the tirofiban) interacted with Tyr166 and Arg214 by hydrogen bonds in the β3 subunit and the butyl and pyridyl groups may interact with the Phe160 and Tyr190 in the αIIb subunit by hydrogen bond interaction (Xiao et al., 2004). Interestingly, the interaction with free integrin reveals poor changes in integrin αIIbβ3 structure, changing the globular conformation of the free Integrin to an open conformation in just 4% of the free integrins, seen by transmission electron microscopy (Hantgan, et al., 2002).
Conclusion and Future Directions
To conclude, disintegrins are divided into five different groups according to their polypeptide length and the number of disulfide bonds. In this work, we propose a classification based on patterns of disulfide bonds, where this classification resulted in the division into seven groups, organized by disulfide bind pattern. Through comparing amino acid sequences by multiple-sequence alignment, a brief phylogenetic analysis, and extensive literature review, we support the view that the different disintegrin subfamilies evolved from a common ADAM (a disintegrin and metalloproteinase-like) and that structural diversification occurred through disulfide bond engineering.A deep analysis of the conserved cysteine residues in each disintegrin subfamily (Figure 4) strongly indicates that structural diversity of disintegrins was achieved during evolution through selective loss of disulfide bonds. It is well-known that disulfide bonds play a key role in the stability and impose distinct protein folding, with a specific orientation of the loop regions of these proteins. Recognition of integrins by disintegrins is mediated by the specific binding loop of each disintegrin. As well that, the residues flanking the tripeptide motif, the motif loop conformation, and the C-terminus of the disintegrin sequence influence integrin binding and selectivity (Calvete, 2005a). The C-terminal region of disintegrins and the flexibility of the RGD loop (located at the apex of the loop), may be a requirement for the recognition of different integrins, specifically for the β subunit. These two parts mutually affected their conformations, making it crucial to the Integrin modulation. This characteristic modulates the selectivity of disintegrins to different integrins, as mentioned in this work. Furthermore, the modulatory activity of disintegrins depends on the proper pairing of cysteine residues, contributing to the conformation of the disintegrin structure (Niewiarowski et al., 1994). The conservative aspect of cysteine residues and the disulfide bond pattern between disintegrins contribute to the hypothesis of strong selection to maintain the active conformation of these proteins (Juárez et al., 2008).Finally, few structural studies show the interaction between disintegrins and integrins. In our work, the selective interactions of disintegrins are shown, divided according to each group of our classification, through different receptors, indicating the affinity of each member to differents integrins, exhibit the importance of the structural studies to the comprehension of the interaction of the Disintegrins to its receptor targets. This knowledge is fundamental for the designing of new drugs that target integrins, as it is shown that disulfide arrangements of disintegrins have an impact on integrin/disintegrin interaction. Also, it is important to point out that only a small percentage of the available disintegrins from venoms has been investigated so far and these molecules present opportunities for larger Integrin engagement surface with good stability, increasing the opportunity for the development of new drugs (Trim et al., 2021).
Authors: C Marcinkiewicz; R R Lobb; M M Marcinkiewicz; J L Daniel; J B Smith; C Dangelmaier; P H Weinreb; D A Beacham; S Niewiarowski Journal: Biochemistry Date: 2000-08-15 Impact factor: 3.162
Authors: Nicole Kusuma; Delphine Denoyer; Johannes A Eble; Richard P Redvers; Belinda S Parker; Rebecca Pelzer; Robin L Anderson; Normand Pouliot Journal: Int J Cancer Date: 2011-05-05 Impact factor: 7.396