| Literature DB >> 30984766 |
Christopher E Dempsey1, Maria Giulia Bigotti1, Josephine C Adams1, Andrea Brancaccio1,2.
Abstract
Dystroglycan (DG) is an adhesion complex that links the cytoskeleton to the surrounding extracellular matrix in skeletal muscle and a wide variety of other tissues. It is composed of a highly glycosylated extracellular α-DG associated noncovalently with a transmembrane β-DG whose cytodomain interacts with dystrophin and its isoforms. Alpha-dystroglycan (α-DG) binds tightly and in a calcium-dependent fashion to multiple extracellular proteins and proteoglycans, each of which harbors at least one, or, more frequently, tandem arrays of laminin-globular (LG) domains. Considerable biochemical and structural work has accumulated on the α-DG-binding LG domains, highlighting a significant heterogeneity in ligand-binding properties of domains from different proteins as well as between single and multiple LG domains within the same protein. Here we review biochemical, structural, and functional information on the LG domains reported to bind α-dystroglycan. In addition, we have incorporated bioinformatics and modeling to explore whether specific motifs responsible for α-dystroglycan recognition can be identified within isolated LG domains. In particular, we analyzed the LG domains of slits and agrin as well as those of paradigmatic α-DG non-binders such as laminin-α3. While some stretches of basic residues may be important, no universally conserved motifs could be identified. However, the data confirm that the coordinated calcium atom within the LG domain is needed to establish an interaction with the sugars of α-DG, although it appears that this alone is insufficient to mediate significant α-DG binding. We develop a scenario involving different binding modes of a single LG domain unit, or tandemly repeated units, with α-DG. A variability of binding modes might be important to generate a range of affinities to allow physiological regulation of this interaction, reflecting its crucial biological importance.Entities:
Keywords: binding affinities; dystroglycan; laminin globular (LG) domains; protein modeling; protein structure
Year: 2019 PMID: 30984766 PMCID: PMC6450144 DOI: 10.3389/fmolb.2019.00018
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
Figure 1(A) The dystrophin-glycoprotein complex (DGC). Dystroglycan is embedded within the membrane as a pivotal member of the complex establishing multiple contacts with extracellular, transmembrane, and intracellular partners. In skeletal muscle it creates a “molecular bridge” between cells and the surrounding tissues offering stability upon muscle contraction. (B) Domain architecture of representative α-dystroglycan binding partners. LG domains representing α-DG-binding sites are boxed in green. Each domain has been assessed using InterProScan 5.2 (https://www.ebi.ac.uk/interpro/) database as described in the Supplementary section. Codes: LN-N-term. (Laminin, N-terminal), LN-EGF (Laminin-type epidermal growth factor-like), LN IV (Laminin IV domain), LNαDI (Laminin alpha, domain I), LNDII (Laminin Domain II), NtA (N-terminal of agrin), SEA (Sperm protein, Enterokinase and Agrin), EGF-like (Epidermal growth factor-like), IG-like (Immunoglobulin-like), Leu-r (leucine-rich), Cys-r (cysteine-rich). Not to scale.
LG-containing human proteins and proteoglycans with emphasis on those known to bind α-dystroglycan.
| Laminin-α1 (present in laminin-111 and laminin-121) | LNA1 | P25391 | 3,075 | LG1(181)-LG2(177)-LG3(188)-LG4(173)-LG5(181) | LG4-5 | Binding is calcium dependent | Andac et al., | |
| Laminin-α2 (present in laminin-211, laminin-221 and laminin-213) | LNA2 | P24043 | 3,122 | LG1(184)-LG2(182)-LG3(185)-LG4(172)-LG5(181) | LG1-3 LG4-5 | Binding is calcium dependent | Talts et al., | |
| Laminin-α3 (present in aminin-332, laminin-311 and laminin-321) | LNA3 | Q16787 | 3,333 | LG1(202)-LG2(163)-LG3(161)-LG4(165)-LG5(174) | Not binding | From sequence comparison, predicted not to bind calcium (Timpl et al., | Timpl et al. ( | - PDB not available |
| Laminin-α4 (present in laminin-411, laminin-421 and laminin-423) | LNA4 | Q16363 | 1,823 | LG1(203)-LG2(180)-LG3(169)-LG4(172)-LG5(174) | LG1-3 | Lower affinity than the corresponding modules from LNA1 and LNA2 | Talts et al., | - PDB not available |
| Laminin-α5 (present in laminin-511, laminin-521 and laminin-523) | LNA5 | O15230 | 3,695 | LG1(194)-LG2(175)-LG3(169)-LG4(174)-LG5(173) | LG4-5 | Lower affinity than LG4-5 from α1 or α2 | Shimizu et al., | - PDB not available |
| Agrin | AGR | O00468 | 2,067 | LG1(177)-LG2(184)-LG3(182) | LG1-2 | LG3 does not bind DG. LG1 might represent the best binder in the LG1-2 tandem. Affinity is influenced by a splice site (A: KSRK) on LG2. The variant with KSRK binds less strongly. LG2 is not alone sufficient for high-affinity binding | Campanelli et al., | |
| Perlecan | PRL | P98160 | 4,391 | LG1(186)-LG2(181)-LG3(189) | LG1-3 | Only tandem arrays can bind. No binding, or very weak binding, with isolated domains | Friedrich et al., | |
| Pikachurin | PKC | Q63HQ2 | 1,017 | LG1(179)-LG2(180)-LG3(180) | LG1-3 | It is not known if isolated domains can bind | Sato et al., | |
| Neurexin1α | NRX1A | Q9ULB1 | 1,477 | LG1(188)-LG2(183)-LG3(193)-LG4(174)-LG5(176)-LG6(169) | LG1-6 | There is a plethora of splicing isoforms. Usually, the forms with no inserts bind better to α-DG | Sugita et al., | |
| Neurexin2α | NRX2A | Q9P2S2 | 1,712 | LG1(179)-LG2(198)-LG3(194)-LG4(173)-LG5(176)-LG6(209) | ? | |||
| Neurexin3α | NRX3A | Q9Y4C0 | 1,643 | LG1(176)-LG2(183)-LG3(193)-LG4(173)-LG5(176)-LG6(171) | ? | |||
| Neurexin1β | NRX1B | P58400 | 442 | LG1(169) | It is exactly the same as LG6 of Neurexin1α | Rudenko et al., | ||
| Neurexin2β | NRX2B | P58401 | 666 | LG1(209) | ? | |||
| Neurexin3β | NRX3B | Q9HDB5 | 637 | LG1(171) | ? | |||
| Slit-1 | SLT1 | O75093 | 1,534 | LG1(174) | ? | The KVR motif is NOT conserved | ||
| Slit-2 | SLT2 | O94813 | 1,529 | LG1(174) | A double mutation of two basic residues (KVR to AVA) abolishes binding. | Wright et al., | ||
| Slit-3 | SLT3 | NP_001258875 | 1,530 | LG1(175) | ? | The KVR motif is conserved |
The list includes the laminin-α3 chain (reported not to bind DG). α-DG binding has not yet been confirmed for the two paralogs of Slit-2.
In bold: LG domains which are able to bind α-DG in an isolated fashion, and potential templates used for computational modeling. Laminin-α3 chain sequence refers to the isoform 3A.
Figure 2Crystal structure of perlecan LG3 domain. β-strands are annotated according to the scheme of Le et al. (2011). In this representation (A) strands M and N are “fused” and a short sequence in the D-E loop is missing density in the crystal structure. The Ca2+ ion in the perlecan LG3 Ca2+ binding site (ligand residues are highlighted) is shown as a sphere. (B) homology model of a slit-2 LG domain constructed on the perlecan LG3 structure template. The slit-2 model produces a potential Ca2+-binding domain with a very similar arrangement of backbone and side chain ligands for Ca2+ as in the LG3 domain, suggesting that the slit-2 LG domain is likely to bind Ca2+. See Figure 8 for additional interpretation of slit LG domain homology models.
Figure 3(A) Structure of laminin-α2 LG4 cocrystallised with a LARGE polysaccharide moiety. LG domain side chain and backbone carbonyl ligands coordinating the Ca2+ ion are identified with black bold and italic annotations, respectively. LG domain groups making direct interaction with the carbohydrate are identified with blue annotations; these are: R2803 side chain stacking with the Xyl2 ring; G2826 backbone carbonyl with a GlcA1 ring hydroxyl group and D2783 and I2874 backbone amide NH groups with the GlcA3 carboxyl. The two carbohydrate oxygen atoms coordinating the Ca2+ ion are identified with red stars (Briggs et al., 2016), PDB:5IK5. (B) Structure of perlecan overlaid with the disaccharide unit shown in (A). To construct the model in (B) the perlecan LG (PDB:3SH5) and laminin-α2 LG4 domains were superimposed, and the disaccharide from laminin-α2 directly extracted onto perlecan. Apart from a small difference in the positions of the Ca2+ ions in the two structures, the disaccharide “fits” into the perlecan structure and provides equivalent Ca2+ coordinating ligands as in the laminin-α2 LG4 structure. Equivalent groups providing potential hydrogen bonds with carbohydrate (G4276; V4324; A4325) are annotated in blue. The similarities suggest that perlecan should coordinate carbohydrate units of α-DG in a manner similar to that of laminin-α2 LG4.
Amino acid ligands for Ca2+ in X-ray structures and modeled LG domains.
| Laminin α1 LG4 2JD4 (Mg2+) | NQM | FMFD | GK | |
| Laminin α2 LG4 5IK5 (Ca2+) | NHA | FSYD | KKAD | |
| Laminin α2 LG5 1QU0 (Ca2+) | QKM | MFHVD | SAST | |
| Laminin α3 LG4 model on 2JD4 | – | FALG | LPGN | |
| Laminin α5 LG3 model on 2JD4 | RASP | – | PPP | |
| Laminin α5 LG4 model on 2JD4 | VAQ | – | HQGA | |
| Agrin LG3 | GLERS | MMYD | LGAT | |
| Perlecan LG3 | GVEVGEAGQGK | VFRYQ | PGP | |
| Neurexin 1α LG2 2H0B (Ca2+) | GKSA | SLVIN | EDYT | |
| Neurexin 1β LG1 2R1D (Ca2+) | SGLG | VKFN | GRQLT | |
| Slit-1 LG1 | GDN | VSYD | GKHY | |
| Slit-2 LG1 | GDK | ASYD | NLSK | |
| Slit-3 LG1 | KGDN | VYD | QKQP |
K is reported as a N in the NCBI sequence: a mutation introduced to eliminate N-glycosylation. Amino acids in bold and italics provide side chain (carboxyl, amide or hydroxyl) or backbone carbonyl Ca.
Figure 4Homology models of laminin LG domains. Laminin-α5 LG3 (A) and laminin-α5 LG4 (C) built on the crystal structure template of murine laminin-α1 LG4 (B, PDB:2JD4, NCBI Code: P19137). Neither the LG3 nor the LG4 domain of laminin-α5 has a canonical Ca2+ binding site. However, each of the laminin-α5 LG domain models places potential Ca2+ chelating ligands near the expected position of a Ca2+ ion. Conformational flexibility in the loops on the edge of these domains may allow weak Ca2+ binding that could explain observations of EDTA attenuation of α-DG binding by laminin-α5. The laminin-α5 LG4 model has a basic patch (RHR) in a topologically-equivalent location on the B-C loop as other basic patches of other known α-DG-binding LG domains including laminin-α1 LG4 (middle). Note that 2JD4 is the murine laminin-α1 LG4 structure; the human laminin-α1 LG4 equivalent has RKK rather than RKR in this basic patch motif. In red: residues involved in Ca2+ binding (italic numbers denote backbone carbonyl ligands), in purple: Arg residues of the basic patch.
Figure 5(A) Comparison of the Ca2+ binding site in Ca2+-bound (green) and Ca2+-free (blue) perlecan. Ca2+-bound perlecan PDB:3SH5; Ca2+-free perlecan PDB:3SH4. The structure of the binding site is maintained in the absence of Ca2+. Similar observations have been made with agrin in its free and Ca2+ - bound states (Stetefeld et al., 2004). The Ca2+ ligands are highlighted; italic type denotes backbone carbonyl ligands. (B) Homology model of laminin-α3 LG4. The model was built on the crystal structure template of laminin-α1 LG4 (PDB:2JD4). This domain lacks potential Ca2+-chelating acidic or amide groups on the D-E and J-K loops (see residues labeled in red); the side chain carboxyl (D3051) on the F-G loop is poorly positioned to chelate Ca2+. This domain is predicted to have a very weak Ca2+ binding site.
Figure 6Homology models of agrin LG2 KSRK-containing splice variant. Models were obtained using as template: (A) human perlecan LG3 (3SH5), (B) chicken agrin LG3 (1PZ7) and (C) murine laminin-α1 LG4 (2JD4). The potential Ca2+ binding site is relatively well defined and independent of the structural template. The Ca2+ ions modeled into the putative Ca2+ binding sites are shown as spheres. The position of the KSRK sequence (green) that lies in the loop between the J and K strands (see text) is not well-constrained in the models. These models illustrate the variability in structures of modeled LG domains obtained when using a range of different structural templates. Note that the strongest sequence homology (≈35% identity) is between human agrin LG2 and perlecan LG3.
Figure 7Homology model of human agrin LG2 showing the structural context of mutations that cause congenital myasthenic syndrome (CMS). The domain scaffold (strands and loops) is shown in blue with the Ca2+ atom in white whilst the missense mutations positions (numbering referring to human agrin, NCBI code: O00468) are reported in purple. The model was constructed using the perlecan LG3 domain (PDB: 3SH5; see Figure 2) as a template.
Figure 8Potential Ca2+ sites of the LG domains of the three Slits: (A) Slit-1; (B) Slit-2; (C) Slit-3. Models were built on templates of LG domains with known structures (see main text) and viewed from the top edge of the β-sandwich (where “top” is equivalent to the upper part of the LG domain structures as represented in Figure 2). Slit-2 has a similar arrangement of potential Ca2+-chelating groups (red), as perlecan (see Figure 2). In isoforms 2 and 3 the KVR sequence (basic residues in purple) that supports α-DG binding is adjacent to the Ca2+ site; for comparison, the basic KSRK sequence that hinders α-DG binding in the agrin LG2 splice variant (see Figure 6) would lie to the right of the Ca2+ binding site in this view.
Figure 9A color code for LG domain-α-dystroglycan binding affinities. Based on available biochemical information, the α-DG binding partners are classified by the binding affinities of their multiple LG domains, represented using the color code reported in the figure. The affinity scale has been estimated semi-quantitatively based on available binding data (see Sciandra et al., 2013): tighter affinity refers to the lower nanomolar range (0.1–1 nM), medium affinity to the higher nanomolar range (10–100 nM), weak or very weak binding within the micromolar range. The laminin (LN) sub-family (referring to the five different α chains, A1 to A5) is on the left whilst agrin (AGR), perlecan (PRL) and other binding partners (PKC: pikachurin, NRX: neurexin, SLT2: Slit-2) are on the right. The drawings are not to scale. Numbers between domains indicate the number of amino acids in the linkers separating them.
Figure 10Multiple sequence alignment of selected isolated LG domains that are binders of α-DG. The sequences refer to all the strong binders identified and include also the LG3 module of the laminin α2 chain (reported to be a weaker binder). Secondary structure elements, as retrieved from the available 3D structure of human perlecan (3SH4/3SH5), are rendered as follows: β-strands, from A to N (red), α-helices (purple), calcium-coordinating amino acid positions (orange and indicated by an asterisk). Sequences were aligned in MUSCLE 3.8 and the alignment presented in Boxshade. Basic residues belonging to the loops neighboring the coordinated Ca2+ are reported in blue, the ones belonging to the opposite side of the domain are turquoise. The specific LG domain sequences in the alignment are identified as follows: NRX1A/1B: neurexin α1/β1, LNA1/2/5: laminin α1/α2/α5, SLT2: slit-2, PRL: perlecan.
Figure 11Multiple sequence alignment of selected isolated LG domains that are non-binders of α-DG. Laminin α3 has been reported not to bind α-DG. Therefore, it is assumed that none of its tandem LG domains would be able to bind α-DG. Secondary structure elements as retrieved from the available 3D structure of human perlecan (3SH4/3SH5): β-strands, from A to N (red), α-helices (purple), calcium-coordinating amino acid positions (orange and indicated by an asterisk). Basic residues belonging to the loops neighboring the coordinated Ca2+ are reported in blue whereas the one belonging to the opposite side of the domain in turquoise. Code: LNA3_LG1 to LNA3_LG5 (1st to 5th module from laminin α3).
Figure 12Unrooted phylogenetic diagram of the protein sequence relationships of the LG domains analyzed in this study. The alignment of 246 positions was prepared in MAFFT and the Newick output rendered in iTOL. Codenames as in Table 1. Asterisks indicate known α-DG-binding LG domains.
Figure 13LG-domains binding modes toward α-DG. Model of how different tandem assembly combinations might be required for fine modulation of α-DG binding affinity. The distance between the two disaccharide units that interact with two adjacent LG domains has not been determined, and the cartoon only depicts an example scenario. Because of their size, tandem LG domains cannot bind to adjacent GlcA-Xyl disaccharide units. A disaccharide unit is indicated, with Xyl: Xylose and GlucA: Glucuronic acid. The grouping on the right illustrates that LG domains that are able to bind α-DG as isolated units (in red) have not been found as neighbors. For color code, see Figure 9.