
The structure of DC-SIGNR with a portion of its repeat domain lends insights to modeling of the receptor tetramer.

Greg A Snyder, Marco Colonna, Peter D Sun.

Abstract

The dendritic cell-specific ICAM-3 non-integrin (DC-SIGN) and its close relative DC-SIGNR recognize various glycoproteins, both pathogenic and cellular, through the receptor lectin domain-mediated carbohydrate recognition. While the carbohydrate-recognition domains (CRD) exist as monomers and bind individual carbohydrates with low affinity and are permissive in nature, the full-length receptors form tetramers through their repeat domain and recognize specific ligands with high affinity. To understand the tetramer-based ligand binding avidity, we determined the crystal structure of DC-SIGNR with its last repeat region. Compared to the carbohydrate-bound CRD structure, the structure revealed conformational changes in the calcium and carbohydrate coordination loops of CRD, an additional disulfide bond between the N and the C termini of the CRD, and a helical conformation for the last repeat. On the basis of the current crystal structure and other published structures with sequence homology to the repeat domain, we generated a tetramer model for DC-SIGN/R using homology modeling and propose a ligand-recognition index to identify potential receptor ligands.

Year:  2005        PMID: 15784257      PMCID: PMC7094344          DOI: 10.1016/j.jmb.2005.01.063

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


Introduction

The dendritic cell-specific ICAM-3 non-integrin (DC-SIGN) and its close relative DC-SIGNR are members of the C-type lectin family. Originally discovered as a human immunodeficiency virus (HIV)-binding protein, DC-SIGN has been shown to bind carbohydrates on various pathogens, including Ebola, Mycobacterium tuberculosis, hepatitis C virus and cytomegalovirus.1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 In the case of HIV-1 infection, DC-SIGN and DC-SIGNR (together referred to as DC-SIGN/R) have been proposed to facilitate the viral infection of T-cells in trans through binding to HIV gp120.5, 12 However, recent evidence suggests that DC-SIGN/R function as antigen-capturing receptors that facilitate the presentation of HIV-1 antigen by dendritic cells. DC-SIGN/R each consist of four domains: a cytoplasmic domain with a di-leucine motif for internalization, a single-spanning transmembrane region, a region with seven and one-half repeats of 23 amino acid residues, and a carbohydrate-recognition domain (CRD). DC-SIGNR is 77% identical with DC-SIGN in amino acid sequence and differs mainly in tissue and cellular expression patterns, although recent reports indicate that it may also differ in binding and processing of pathogens.14, 15, 16 It has been established that DC-SIGN/R specifically recognize high-mannose carbohydrates. Previous structural studies have shown the molecular details of this interaction. More recently, however, DC-SIGN/R were shown to recognize terminal fucose and galactose-containing carbohydrates, such as blood group antigens B, Lewisᵃ, and Lewisˣ structures, in addition to mannose.18, 19 While comparison of various carbohydrate substructures gives insight into how model carbohydrate compounds are recognized by the CRD of the receptor, the overall receptor binding affinity appears to depend on the multivalent nature of the ligand. 
For example, while DC-SIGN/R bind to model carbohydrate with millimolar affinity, the receptors recognize HIV gp120, which carries multiple high-mannose-based, N-linked glycosylations with nanomolar affinity. This carbohydrate valency-dependent avidity effect was shown to be the result of DC-SIGN/R tetramerization through its repeat region.17, 21 To understand the nature of the receptor-carbohydrate interaction, we have determined the crystal structure of DC-SIGNR with a portion of the repeat domain. We propose a tetramer model for the intact extracellular receptor, and formulate a scheme to predict the potential ligands.

Results

Description of the overall structure

The crystals of DC-SIGNR CRD with its last repeat belong to the orthorhombic space group P2₁2₁2₁. The protein solution was supplemented with 10 mM mannose and 5 mM CaCl2 prior to crystallization. Crystals were obtained under several conditions that included polyethylene glycol of various molecular masses (2000–8000 Da) and buffers with a pH between 6.0 and 8.0. Crystals contained one molecule per asymmetric unit with a solvent content of 32.8% (v/v) (Matthews coefficient of 1.8). Molecular replacement rotation and translation correlation coefficients were ranked and yielded a single solution well above the background. The initial phased map had clear electron density for both the main chains and the side-chains. After rebuilding of loops, the electron density was continuous throughout the structure, and the final structure consists of residues 260–398, with Met282 modeled in two alternative conformations. The final refined crystallographic R-factors are 17.75% and 19.3% for Rwork and Rfree, respectively, at 1.41 Å resolution (Table 1). The overall structure is a typical long-form C-type lectin, and the CRD portion superimposes with a root-mean-square (r.m.s.) deviation of 0.67 Å (for 126 Cα atoms) on the CRD-only structure (Figure 1(b)). Although both mannose and calcium were present in the crystallization solution, only calcium was observed, bound in the canonical calcium-binding site. This structure contains additional residues at both the amino and carboxyl termini, including a disulfide bond linking the two termini as well as a short α-helix at the beginning of the repeat domain.
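The relation between the unit cell, the Matthews coefficient and the solvent content quoted above can be checked with a short calculation. The following is a minimal sketch rather than the procedure used by the authors: the cell lengths are taken from Table 1, four asymmetric units per cell is the standard multiplicity of this space group, the factor 1.23 Å³/Da is the usual Matthews approximation for protein volume, and the construct mass `ASSUMED_MASS_DA` is a hypothetical value that is not stated in the text.

```python
def cell_volume(a, b, c):
    """Volume (Å^3) of an orthorhombic unit cell (all angles 90 degrees)."""
    return a * b * c


def matthews_coefficient(volume, mass_da, z):
    """V_M in Å^3/Da: cell volume divided by the total protein mass in the cell."""
    return volume / (z * mass_da)


def solvent_fraction(vm):
    """Matthews approximation: protein occupies about 1.23 Å^3 per dalton."""
    return 1.0 - 1.23 / vm


# Cell lengths (Å) from Table 1.
volume = cell_volume(38.23, 54.88, 62.32)

# Hypothetical mass of the crystallized construct (assumption, not from the text).
ASSUMED_MASS_DA = 17_900

# One molecule per asymmetric unit, four asymmetric units per cell.
vm = matthews_coefficient(volume, ASSUMED_MASS_DA, z=4)

print(f"cell volume = {volume:.0f} Å^3")
print(f"V_M = {vm:.2f} Å^3/Da, solvent content = {100 * solvent_fraction(vm):.1f}%")
```

With the assumed mass, the sketch gives a Matthews coefficient near 1.8 Å³/Da and a solvent content of about one-third, in line with the values reported above. A different construct mass would shift both numbers.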
Table 1

Crystallographic data collection and refinement statistics

Parameter                          Value

A. Data collection
Space group                        P2₁2₁2₁
Resolution (Å) a                   20.0–1.41 (1.45–1.41)
Cell dimensions
  a (Å)                            38.23
  b (Å)                            54.88
  c (Å)                            62.32
No. observations                   25,340 (2102)
Completeness (%)                   97.5 (99.2)
Rsym (%) b                         6.9 (22.2)
I/σ(I)                             37.1 (5.6)

B. Refinement statistics
Rwork/Rfree (%) c                  17.7/19.3
No. atoms
  Protein                          1143
  Water                            103
  Other                            1
Ramachandran plot
  Most favored (%)                 91.9
  Allowed (%)                      8.1
  Generously allowed (%)           0
  Forbidden (%)                    0
r.m.s. deviations
  Bond lengths (Å)                 0.015
  Bond angles (deg.)               1.37

a Values within parentheses are for the highest-resolution shell.

b Rsym = Σ|Ih − 〈Ih〉|/ΣIh, where 〈Ih〉 is the mean intensity of multiple measurements of symmetry-equivalent reflections.

c Rwork or Rfree = Σ||Fo| − |Fc||/Σ|Fo|, where Fc is the calculated and Fo is the observed structure factor amplitude of reflection h for the working or 5% free set, respectively.

Figure 1

The structure of DC-SIGNR with repeat domain R8. (a) A representation of the four domains of DC-SIGN and DC-SIGNR including cytoplasmic, transmembrane (TM), repeats 1–8 (R1–R8) and the carbohydrate recognition domain (CRD). The portion of DC-SIGNR crystallized, DC-SIGNR R8, is shown (amino acid residues 249–399). The four trypsin digestion sites are indicated as 1, 2, 3, and 4. (b) The crystal structure of DC-SIGNR R8 (blue) showing the position of the carbohydrate-binding calcium (yellow, Ca2), the disulfide that links the N and the C termini, and the visible portion of R8 forming a helical repeat. (c) Superposition of DC-SIGNR R8 (blue) with DC-SIGNR R7 (PDB code 1SL6; gray). Carbohydrate has been omitted for clarity.


The N-terminal disulfide and R8 repeat

Compared with the CRD-only structure, additional amino acids were present at both the N and C termini. At the C terminus we observed additional density for amino acid residues 395–398, including a disulfide bond between Cys395 and Cys265 that brings the N and C termini into close proximity (Figure 1). As a result, the ring of the N-terminal histidine residue (His267) stacks against the ring of the C-terminal Phe396. The helical repeat domain has been shown to be responsible for tetramerization of the receptor. Our DC-SIGNR R8 construct contains the last repeat, which immediately precedes the CRD. This repeat region encompasses residues Gln249–Cys265. A portion of this repeat, Ala260–Cys265, is ordered in the structure and forms a short α-helix. Although the rest of the R8 repeat appears disordered in our crystal, presumably due to its proximity to the N terminus of the expressed recombinant R8 construct, the presence of a short helix is consistent with the secondary structure prediction that the repeat domain is mainly α-helical. The N-terminal CRD disulfide bond (Cys265–Cys395) and the helical repeat conformation were observed recently in the structure of DC-SIGNR R7 (CRD with its last two repeats).19, 22 The r.m.s. deviation between the CRD of DC-SIGNR R8 and that of R7 is 0.76 Å for 129 Cα atoms. However, the hinge angle between the CRD and the repeat domain differs by about 40° between the two structures (100° and 60°, respectively), indicating flexibility between the CRD and the repeat domain of the receptor (Figure 1).

The calcium and carbohydrate-binding sites

The primary calcium site involved in binding carbohydrate (Ca2) has a well-ordered calcium ion in this structure. The calcium ion is coordinated directly by the amino acid residues Glu359, Asn361, Glu366, and Asp378, and by a water molecule (W19). With the exception of Asn377, which is rotated out of the calcium coordination, the ligand positions are well conserved between the R8 and CRD-only structures of DC-SIGNR (Figure 2(a) and Table 2). In contrast, no attributable electron density was found near the binding sites of the two auxiliary calcium ions (Ca1 and Ca3), and the two residues coordinating the auxiliary calcium ions, Asn362 and Asn365, moved 4.0 Å and 1.9 Å, respectively, compared to the CRD-only structure (1K9J). The movement of Asn362 and Asn365 effectively disrupts the coordination of Ca1 and Ca3, further evidence that both auxiliary calcium ions are absent from the DC-SIGNR R8 structure. Despite the presence of mannose in the crystallization buffer and the existence of additional electron density at the putative carbohydrate-binding site, attempts to fit mannose were not satisfactory, and water molecules were instead built throughout the carbohydrate-binding site.
Figure 2

Carbohydrate-binding and calcium-binding sites. (a) Cα traces of the DC-SIGNR R8 (blue) and DC-SIGNR CRD (gray) primary calcium-binding sites showing side-chains involved in coordinating calcium. In the absence of carbohydrate, Asn377 is not involved in calcium coordination. The calcium ion, seen in nearly the same position in both structures, is shown in yellow (Ca2). A water molecule (red) is present in the DC-SIGNR R8 structure at the location where ligand normally binds. (b) The secondary calcium-binding site, showing the Cα loop in DC-SIGNR R8 and in the DC-SIGNR CRD. Loop movement from closed to open is observed between the ligand-bound and apo structures, respectively. Calcium ions present only in the structure of the DC-SIGNR CRD are shown in gray (Ca1 and Ca3). Side-chain movements between the two structures are summarized in Table 2.

Table 2

Comparison of the calcium ligand distances

                                     Distance (Å)
Calcium ion   Residue    Atom    R8       1K9J          Change Δ (Å)
Ca2           Glu 359    OE2     2.14     (OE1) 2.63    0.49
Ca2           Asn 361    OD1     2.04     2.44          0.4
Ca2           Glu 366    OE1     2.03     2.43          0.4
Ca2           Asn 377    OD1     5.79     2.47          3.32
Ca2           Asp 378    O       2.10     2.42          0.32
Ca2           Asp 378    OD1     2.02     2.34          0.32
Ca2           W 19               2.04     N/A
Ca2           Man C2     O4      N/A      2.57
Ca2           Man C2     O3      N/A      2.49
Ca2–Ca2                                                 0.29
Ca3           Glu 336    OE1     12.94    2.28          10.66
Ca3           Asn 365    OD1     4.32     2.46          1.86
Ca3           Asp 367    OD2     3.33     2.54          0.79
Ca3           Asp 367    OD1     2.35     2.58          0.23
Ca1           Asp 332    OD1     3.01     2.58          0.43
Ca1           Asp 332    OD2     1.99     2.49          0.5
Ca1           Glu 336    OE2     13.09    2.58          10.51
Ca1           Asn 362    OD1     6.44     2.43          4.01
Ca1           Glu 366    O       3.27     2.45          0.82
Ca1           Asp 367    OD1     2.35     2.33          0.02

The comparison between the current apo-DC-SIGNR R8 and the mannose-containing CRD structure showed that both the primary calcium/carbohydrate-binding loop (residues 361–366) and the secondary calcium-binding loop (residues 332–339) assume an “open” conformation in the apo state and a “closed” conformation in the carbohydrate-bound state of the receptor (Figure 2). In the presence of carbohydrate, the closed conformation of the primary carbohydrate-binding loop (residues 361–366) is defined by the coordinating hydrogen bonds between the side-chains of Asn361 and Ser363 and the bound N-acetyl-d-glucosamine (GlcNAc1), as well as by the interactions of Asn362 and Asn365 with their bound Ca1 and Ca3. In the absence of carbohydrate, however, Ser363 moved 4.9 Å toward the solvent, resulting in a more open conformation for the primary carbohydrate-binding loop. In conjunction with this loop movement, Asn362 and Asn365 lost their coordination geometry for the auxiliary calcium sites. 
The ejection of the auxiliary Ca1 and Ca3, in turn, resulted in the displacement toward the solvent of another calcium coordination residue, Glu336 from the secondary calcium-binding loop, so that this loop also adopts an open conformation (Table 2 and Figure 2). Interestingly, an arginine residue from a symmetry-related molecule, Arg397, is found near the putative Ca1 and Ca3 sites, where it forms a hydrogen bond with the secondary calcium-binding loop and, as a surrogate for the missing calcium ions, neutralizes the partial negative charges of the region. Despite the presence of 5 mM CaCl2 in the crystallization setup, both Ca1 and Ca3 appear to be absent, suggesting that these auxiliary calcium sites are of low affinity compared to the primary calcium-binding site (Ca2), and that their occupancies are coupled to the binding of the carbohydrate ligand. In other words, they are glycan-induced calcium-binding sites. In the absence of bound carbohydrate, both calcium coordination loops adopt an open conformation, ejecting the auxiliary calcium ions, and become less ordered. The result suggests that the function of these glycan-induced auxiliary calcium ions is to stabilize the conformation of the glycan-binding loops synergistically with the bound glycan rather than to pre-form the glycan-binding loop.23, 24, 25
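The side-chain movements discussed above can be read directly from the calcium-ligand distances in Table 2. The following sketch simply recomputes the change column as the absolute difference between the apo (R8) and carbohydrate-bound (1K9J) distances and lists the ligands that move by more than a cutoff. The 1.0 Å cutoff and the names `DISTANCES` and `displaced` are illustrative assumptions, not part of the original analysis.

```python
# (calcium ion, residue, atom, distance in R8, distance in 1K9J), all in Å, from Table 2.
# Rows without a value in both structures (W19, mannose O3/O4) are omitted.
# For Glu359 the R8 distance is to OE2 and the 1K9J distance is to OE1.
DISTANCES = [
    ("Ca2", "Glu359", "OE2", 2.14, 2.63),
    ("Ca2", "Asn361", "OD1", 2.04, 2.44),
    ("Ca2", "Glu366", "OE1", 2.03, 2.43),
    ("Ca2", "Asn377", "OD1", 5.79, 2.47),
    ("Ca2", "Asp378", "O", 2.10, 2.42),
    ("Ca2", "Asp378", "OD1", 2.02, 2.34),
    ("Ca3", "Glu336", "OE1", 12.94, 2.28),
    ("Ca3", "Asn365", "OD1", 4.32, 2.46),
    ("Ca3", "Asp367", "OD2", 3.33, 2.54),
    ("Ca3", "Asp367", "OD1", 2.35, 2.58),
    ("Ca1", "Asp332", "OD1", 3.01, 2.58),
    ("Ca1", "Asp332", "OD2", 1.99, 2.49),
    ("Ca1", "Glu336", "OE2", 13.09, 2.58),
    ("Ca1", "Asn362", "OD1", 6.44, 2.43),
    ("Ca1", "Glu366", "O", 3.27, 2.45),
    ("Ca1", "Asp367", "OD1", 2.35, 2.33),
]

CUTOFF = 1.0  # Å; arbitrary threshold chosen for illustration


def displaced(rows, cutoff=CUTOFF):
    """Return (ion, residue, change) for ligands whose distance changes by more than cutoff."""
    result = []
    for ion, residue, _atom, apo, bound in rows:
        change = abs(apo - bound)
        if change > cutoff:
            result.append((ion, residue, round(change, 2)))
    return result


for ion, residue, change in displaced(DISTANCES):
    print(f"{ion}: {residue} moves {change:.2f} Å")
```

With this cutoff, only Asn377 at the primary site and Glu336, Asn362 and Asn365 at the auxiliary sites are flagged, which matches the residues singled out in the text.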

Modeling of the DC-SIGN/R tetramer

A homology search was performed using sequences corresponding to various lengths of the repeat domain of DC-SIGNR against known structures in the Protein Data Bank (PDB). The resulting sequence identities between segments of known structures and portions of the DC-SIGNR repeats are 32% between residues 32–117 of the focal adhesion kinase (PDB code 1K05) and repeats R1–R3 of DC-SIGNR (Figure 3), 31% between residues 328–390 of MutS (PDB code 1NNE) and repeats R5–R7, 33% between residues 23–67 of the large ribosomal subunit from Deinococcus radiodurans (PDB code 1NKW) and repeats R6 and R7, and 37% between residues 60–106 of the monomeric isocitrate dehydrogenase (PDB code 1ITW) and repeats R6–R8. All homologous structures are helical in nature.
Figure 3

Alignment of homologous repeat sequences. CLUSTAL W(1.74) sequence alignment of the ecto-domain of DC-SIGNR repeats 1–8 (indicated by R1–R8) sequence with homologous sequences used to predict the boundary of helical regions. Helical regions are denoted by the letter H and coil or turn regions are indicated by the letter C. Because repeats are nearly identical, differing by only a single amino acid residue in some cases, most of the homologous matches can be translated along the repeats.

Both homology modeling and sequence-based secondary structure prediction resulted in similar secondary structure assignments, including the boundaries of helices, turns and loops throughout the R1–R8 repeat domain of DC-SIGNR. Additional structural information was included in the modeling of the tetramer: gel-filtration experiments on truncated receptors showing that receptor tetramerization requires repeats R5–R8, and analytical ultracentrifugation observations suggesting an elongated shape for the tetramer. Based on the overlapping homologous structures and the biophysical shape considerations, and using the focal adhesion kinase (PDB code 1K05) as a template (Supplementary Data Figure 1), a polyalanine model of the DC-SIGNR tetramer was built manually using the crystallographic program O and subjected to energy minimization using CNS (Figure 4). The tetramer model displays 4-fold symmetry, with the core tetramerization domain adopting a four-helix bundle structure similar to that of the focal adhesion kinase (see Supplementary Data for a more detailed description of the model). The arrangement of the R7 and R8 helices in this model agrees with the recently deposited structure of DC-SIGNR containing both R7 and R8 repeats (PDB code 1SL6). The dimensions of the model proposed here are ∼80 Å×80 Å×190 Å, with individual CRDs separated by ∼50 Å. On the basis of the model, the tetrameric CRD head encompasses an area of approximately 6400 Å².
Figure 4

Model of the extracellular portion of the DC-SIGN/R tetramer. (a) A side-view of the model tetramer. The boundary of the repeat domain, CRD and carbohydrate (from DC-SIGNR CRD PDB code 1K9J) are shown as well as a view of the model looking down onto the top of the tetramer. (b) A single helical tetramer model is shown with helical breaks in the region near proline residues.

We carried out limited proteolysis using trypsin to compare the proposed model of helical repeat bundles for the tetrameric DC-SIGNR with a model consisting of an elongated linear concatenation of helical repeats (Figure 4). Since identical trypsin digestion sites are found within each repeat region, a differential use of each potential trypsin site would suggest differential protection from the protease. The tight packing of the proposed model predicts a biased protease sensitivity for the different repeats, with the repeats that pack in the tetramer core less accessible than the peripheral repeats, while the linear helical concatenation model predicts an equal protease sensitivity for each repeat. The digestion with trypsin was carried out using a recombinantly expressed and refolded full extracellular DC-SIGNR, termed DC-SIGNR R1, that has been characterized as a tetramer. Digestion of DC-SIGNR R1 with trypsin resulted in four major fragments, F1 (∼25 kDa), F2 (∼9 kDa), F3 (∼19 kDa), and F4 (∼7 kDa), with F1 and F2 appearing before F3 and F4 in time-based digestions (Supplementary Data Figure 3). No other intermediate fragment could be identified. Amino-terminal sequencing revealed that fragments F1 (residues 14–237) and F2 (residues 238–313) result from cleavages at trypsin sites between repeats R1 and R2 (site 1) and within the CRD (site 3). Fragments F3 (residues 14–179) and F4 (residues 270–313) appeared to be derived from F1 and F2 by further digestion at sites 3 and 4, respectively (Figure 1). 
These results indicate that most of the tetramerization repeats (R2–R8) remain resistant to digestion by trypsin, consistent with it being a compact tetramer unit rather than an elongated linear helical tetramer in which all repeats appear equally susceptible to protease. Digestion experiments with subtilisin are consistent with these results, indicating protease-sensitive sites being primarily between repeats R1 and R2, and after the helical repeat domain at the beginning of the CRD region.
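As a rough consistency check, the fragment sizes estimated from the gel can be compared with masses computed from the sequenced residue ranges. The sketch below uses the residue ranges exactly as given above and a rule-of-thumb average residue mass of 110 Da, which is an assumption and not a value from the text. The names `FRAGMENTS` and `estimated_mass_kda` are illustrative.

```python
# Rule-of-thumb average mass of one amino acid residue (assumption, not from the text).
AVERAGE_RESIDUE_MASS_DA = 110.0

# Residue ranges of the tryptic fragments, as reported from N-terminal sequencing.
FRAGMENTS = {
    "F1": (14, 237),
    "F2": (238, 313),
    "F3": (14, 179),
    "F4": (270, 313),
}


def fragment_length(first, last):
    """Number of residues in an inclusive residue range."""
    return last - first + 1


def estimated_mass_kda(first, last):
    """Approximate fragment mass in kDa from its length alone."""
    return fragment_length(first, last) * AVERAGE_RESIDUE_MASS_DA / 1000.0


for name, (first, last) in FRAGMENTS.items():
    length = fragment_length(first, last)
    print(f"{name}: {length} residues, about {estimated_mass_kda(first, last):.1f} kDa")
```

Under this approximation F1, F2 and F3 come out within about 1 kDa of the apparent masses quoted above, while F4 comes out lighter than its apparent mass of ∼7 kDa. An estimate based on length alone ignores sequence composition and anomalous gel migration, so such a difference is not unexpected for a small fragment.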

Evaluating potential DC-SIGN/R ligands

Earlier studies of DC-SIGN/R CRD binding to model carbohydrate compounds suggest that the receptors prefer a high-mannose type of carbohydrate.17, 18, 26 More recently, the receptors were also shown to recognize sialyl-Lewis-like carbohydrates. The dissociation constants (Kd) between the DC-SIGN/R CRD and the model compounds, however, are millimolar at best, while functional ligand recognition by the receptor has better than micromolar affinity. Thus, much of the receptor-ligand binding affinity appears to be derived from an avidity effect of the DC-SIGN/R tetramer. The requirement of tetramer binding for ligand recognition would, in turn, impose limitations on ligand selection. Namely, ligands carrying multiple glycosylations capable of engaging the multimeric DC-SIGN/R CRD simultaneously would be preferred by the receptor. The surface area encompassed by the tetrameric CRD in our current DC-SIGN/R model is approximately 6400 Å², or 1600 Å² per CRD molecule. This requires the potential ligands of DC-SIGN/R to possess a surface glycosylation level exceeding one glycan molecule per 1600 Å² of their surface area. This enables us to formulate a potential ligand index i (equation (1)) to evaluate potential ligands of DC-SIGN/R on the basis of their surface glycosylation density, where N is the number of predicted potential glycosylation sites and M is the molecular mass of the candidate protein. A potential DC-SIGN/R ligand would possess an index greater than 1.0, and proteins with indices less than 1.0 are less likely to be ligands of the receptor. The calculation of this potential ligand index for a number of viral envelope glycoproteins as well as for some cell-surface glycoproteins is summarized in Table 3. Of the potential viral targets of DC-SIGN/R, HIV-1, coronavirus and Marburg virus are known to bind DC-SIGN. In addition, HRSV, influenza and human foamy viruses appear to be good candidates for DC-SIGN/R. 
Among the cellular targets, in addition to the known ICAM-3 ligand, several surface glycoproteins also score favorably for DC-SIGN binding.
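The idea behind the index can be sketched numerically. The exact form of equation (1) is not reproduced here, so the following is an illustration under stated assumptions rather than the authors' formula: the candidate protein is treated as a sphere (the Discussion states that a globular shape is assumed), its volume is estimated from M with a typical protein partial specific volume of 0.73 cm³/g (an assumption), and the index is the number of glycosylation sites N multiplied by 1600 Å² and divided by the resulting surface area. The function names are hypothetical.

```python
import math

AREA_PER_CRD = 1600.0           # Å^2 per CRD, from the tetramer model (6400 Å^2 / 4)
PARTIAL_SPECIFIC_VOLUME = 0.73  # cm^3/g, typical protein value (assumption)
AVOGADRO = 6.022e23             # molecules per mole


def volume_per_dalton():
    """Volume in Å^3 occupied by one dalton of protein (1 cm^3 = 1e24 Å^3)."""
    return PARTIAL_SPECIFIC_VOLUME * 1e24 / AVOGADRO


def sphere_surface_area(mass_kda):
    """Surface area (Å^2) of a sphere with the volume of a protein of this mass."""
    volume = mass_kda * 1000.0 * volume_per_dalton()
    radius = (3.0 * volume / (4.0 * math.pi)) ** (1.0 / 3.0)
    return 4.0 * math.pi * radius ** 2


def ligand_index(n_sites, mass_kda):
    """Glycans per 1600 Å^2 of surface, assuming a uniformly glycosylated sphere."""
    return n_sites * AREA_PER_CRD / sphere_surface_area(mass_kda)


# N and M taken from Table 3.
for name, n_sites, mass_kda in [("HIV-1 gp120", 24, 54.0), ("IgG", 8, 150.0)]:
    print(f"{name}: index about {ligand_index(n_sites, mass_kda):.1f}")
```

Under these assumptions the sketch gives about 4.9 for HIV-1 gp120 and about 0.8 for IgG, close to the 4.8 and 0.8 listed in Table 3. The small difference for gp120 suggests that the constants used by the authors differ slightly from the ones assumed here.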
Table 3

Probability index of potential DC-SIGN/R ligands

Glycoprotein             Description                       Mass (kDa)   Number of potential glycosylations a   Potential ligand index i

A. Viral proteins
gp120                    HIV-1                             54.0         24                                     4.8
GP                       Marburg virus                     74.4         23                                     3.7
Spike glycoprotein E2    Coronavirus-229E                  128.6        30                                     3.4
Glycoprotein G           HRSV-A2                           32.5         7                                      1.9
Hemagglutinin            Influenza A                       39.6         8                                      1.9
Env polyprotein          Human foamy virus                 113.7        15                                     1.8
Env polyprotein          Spuma retrovirus                  113.4        15                                     1.8
GH                       Herpes simplex 1                  91.1         10                                     1.5
GB                       Herpes simplex 1                  100.3        10                                     1.3
GD                       Herpes simplex 1                  43.3         3                                      0.7
Envelope glycoprotein    HTLV                              34.6         5                                      1.3
Envelope glycoprotein    Dengue type 3                     49.7         3                                      0.6
Glycoprotein E           Hemorrhagic fever                 22.4         1                                      0.4
Envelope glycoprotein    West Nile virus                   18.4         1                                      0.4

B. Cellular targets
Mucin (Muc-1)            Tumor marker                      108.0        100 (O)                                12.9
Bovine Mucin (BSM)       Mucosal secretion                 158.4        5(N)/171(O)                            17.2
CD24                     Adhesion molecule mucin-like      8.08         2/15(O)                                10.9
CD43                     Leukosialin, mucin                40.3         26(O)                                  6.4
ICAM-3                   Adhesion molecule                 49.1         15                                     2.3
CD45                     Tyr phosphatase                   63.4         16                                     3
CD16                     Fc Receptor                       21.0         6                                      2.4
ICAM-2                   Adhesion molecule                 22.5         6                                      2.2
ICAM-1                   Adhesion molecule                 49.2         8                                      1.8
CD47                     Integrin-associated protein       35.2         6                                      1.6
CD44                     Hermes antigen                    81.5         11(O)                                  1.6
CD31                     PECAM-1                           82.5         10(O)                                  1.5
IgG                      Antibody                          150.0        8                                      0.8
KIR 2DL2                 NK receptor                       21.6         2                                      0.7
HLA-CW3                  MHC I                             44.8         2                                      0.4

a The O-linked glycans are indicated as (O). Otherwise, the numbers indicate N-linked glycans.


Discussion

DC-SIGN and DC-SIGNR are part of an antigen-capturing network of receptors expressed on dendritic cells. Previously, the structures of a mannose-bound and a Lewisˣ-bound form of the receptor showed critical residues involved in both calcium and carbohydrate interactions.18, 19 Our current structure of DC-SIGNR R8 represents an apo form of the receptor. The structure revealed that much of the CRD adopts a conformation very similar to that observed in the carbohydrate-bound receptor, with the exception of two loops that are involved in the coordination of carbohydrate (residues 361–366) and auxiliary calcium ions (residues 332–339) in the bound form. In the absence of bound carbohydrate, both loops adopt open conformations that are likely attributable to the loss of interactions with the putative carbohydrate and calcium. The absence of two bound auxiliary calcium ions compared with the structure of the carbohydrate-bound receptor suggests that the auxiliary calcium sites are of low affinity compared to the primary calcium site, and that their occupancy appears to be ligand-induced. The multivalent nature of DC-SIGN/R indicates that recognition of small carbohydrate compounds by an individual CRD alone is not sufficient to achieve the high-affinity interactions of DC-SIGN and DC-SIGNR with pathogen ligands such as HIV-1 gp120. The functional receptors have been shown to be tetramers.17, 21 In addition, biochemical studies with repeat domain deletion mutants have shown that a minimum of three repeats is necessary to form tetramers, with additional repeats functioning to stabilize the tetramer. On the basis of the current crystal structures and available biophysical data, a tetramer for the entire extracellular DC-SIGNR receptor was constructed by homology modeling, in which the repeat regions form helical bundles that bring together their CRDs with 4-fold symmetry. This helical bundle-mediated oligomerization superficially resembles the trimer of rat mannose-binding protein. 
While the receptor repeat domain is conserved in most species, a notable exception is that of Old World Rhesus monkey, whose DC-SIGNR gene (CD209L2) is missing all the repeats and DC-SIGN gene is missing the fourth repeat. CD209L2 is predicted to be a monomer and has been shown to be less efficient in binding to both ICAM-3 and HIV gp120. The fourth repeat in our model serves as a connecting helix between the two helical bundles. Deletion of this repeat would most likely shorten this connecting helix but may not affect the formation of the helical bundles (R6–R8 and R1–R3). The results of trypsin digestion studies appear to support a model in which the helical repeats are protected from protease by forming tightly packed helical bundles rather than by forming a single elongated helical domain (Figure 4). Using this DC-SIGNR tetramer model and the assumption that high-affinity ligand binding requires simultaneous engagement of multiple CRD of the tetrameric receptor, we formulated a prediction scheme for potential ligands of DC-SIGN/R based on their predicted gross glycosylation density. The results show that several viral envelope glycoproteins, including HIV-1 gp120, Marburg virus GP, coronavirus spike protein, and HRSV glycoprotein G, possess high ligand indices. Among them, gp120 of HIV, GP of Ebola, and the spike protein of coronavirus are known ligands of DC-SIGN. Of the potential cellular targets, in addition to the known ICAM-3 ligand, mucins are notably ranked high in our scoring scheme. The low-scoring molecules, such as IgG, KIR2DL2 and HLA-CW3 did not exhibit binding to DC-SIGN/R (data not shown). It should be noted that the receptor-ligand binding will also depend on the geometrical constraint, including the distance between and the orientation of the CRDs. The distance between glycans, in general, should correlate with their surface density. 
Situations in which local variation in spacing leaves the glycans either too close together or too far apart to engage the multimeric CRD simultaneously would clearly affect recognition by the receptor. Nevertheless, the known flexibility of glycans and the observed variation in the hinge angle between the CRD and repeat domains of DC-SIGNR illustrate the built-in flexibility of both the receptor and its ligands, which lends some degree of freedom to receptor-ligand recognition. This intrinsic flexibility would lead to greater variability in distance and orientation, and reduce the importance of the geometric constraint. The main reasons for using the surface area of the CRD instead of the distance are (1) to enable us to derive a prediction scheme based on the surface glycan density of a potential ligand, and (2) to make the prediction less dependent on the precise conformation, and thus the degree of correctness, of the tetramer model. In addition, equation (1) assumes a globular shape for proteins and a uniform distribution of their glycosylation. Clearly, both the local distribution of glycans and the actual shape of the protein presenting the glycans influence receptor recognition. For example, despite the low scores for Dengue and hemorrhagic fever viruses, evidence suggests that these viruses are recognized by DC-SIGN/R.28, 29 Although the Dengue virus envelope protein is not heavily glycosylated, the crystal structures of both the type 2 and type 3 Dengue envelope protein E showed that the two conserved glycosylation sites are located at the protein dimer interface, resulting in four glycans distributed symmetrically ∼32 Å apart across the interface.30, 31 This generates four closely packed glycans, which enables recognition by the tetrameric DC-SIGN. The equation is thus a first-order approximation that does not reflect variations in protein shape or in the distribution of glycosylation. 
In conclusion, the mechanism of receptor-carbohydrate recognition may be more complicated than previously thought. The high-affinity binding strategy employed by these receptors appears to be twofold. First, the structure of each individual CRD determines the preference of the receptor for particular carbohydrate structures. Secondly, and perhaps more importantly, the high-affinity interaction as well as ligand specificity rely on receptor oligomerization, which would increase the affinity of ligand binding and impose constraints on the density and distribution of carbohydrates found on target pathogens.

Materials and Methods

Protein expression, purification and crystallization

DNA encoding amino acid residues 250–399 of human DC-SIGNR, which includes the last repeat (R8) and the CRD and is referred to as DC-SIGNR R8, was inserted into the pET 22b vector (Figure 1(a)). The expression of the full-length extracellular domain of DC-SIGN and DC-SIGNR has been described. Proteins were expressed as inclusion bodies in Escherichia coli BL21 (DE3) and reconstituted in vitro. Refolded DC-SIGNR R8 was loaded onto a Source 15Q column (Amersham) and further purified by size-exclusion chromatography using a Superdex S200 column (Amersham). The peak fractions were then concentrated to 10 mg/ml and characterized using SDS-PAGE, N-terminal sequencing and mass spectrometry. Initial crystallization screening trials were carried out by microbatch experiments using an automated crystallization robot (Douglas Instruments Oryx 6).32, 33 Repeated attempts to crystallize the entire ectodomain of either DC-SIGN or DC-SIGNR did not yield any diffraction-quality crystals. In contrast, rod-like crystals of the DC-SIGNR R8 construct appeared in many conditions within six hours of setup. Crystal growth conditions were optimized by fine-screening of pH and precipitant concentration. Crystals used for X-ray data collection were grown by the hanging-drop, vapor-diffusion method in a well solution of 100 mM MgCl2, 100 mM sodium cacodylate (pH 6.5), 12% (w/v) polyethylene glycol 3000.

X-ray data collection and structure determination

Crystals of DC-SIGNR R8 were briefly transferred into well solution supplemented with 20% (v/v) glycerol and flash-frozen in a liquid nitrogen stream at 100 K. The X-ray diffraction data were collected on a 3×3 charge-coupled device detector at the Structural Biology Center Collaborative Access Team beamline 19ID and processed using HKL2000. The crystals diffracted to 1.41 Å and were indexed to the orthorhombic space group P2₁2₁2₁ with cell dimensions a=38.2 Å, b=54.8 Å, and c=62.3 Å. Molecular replacement using the coordinates for DC-SIGNR CRD (PDB accession code 1K9J) provided phase information. Diffraction data from 41–3.0 Å were used for the rotation and translation functions with the program AMoRe. After rigid body refinement using the program packages AMoRe and CNS, a complete model was built with the occupancies for disordered side-chains and loops set to zero. Initial refinement in CNS included simulated annealing, conjugate gradient minimization and individual temperature factor refinement. Further refinement using maximum likelihood methods was performed with the program Refmac 5. The final geometry of the structure was evaluated using the program PROCHECK. Least-squares superpositions were performed using the program LSQMAN.

Modeling of the DC-SIGN tetramer

The PDB was searched with the program BLASTp using sequences corresponding to one, two, three, four and all eight repeat domains of DC-SIGNR in various combinations. From this search we identified a representative set of structures that includes focal adhesion kinases, Taq MutS and DNA-binding proteins, with sequence identity of 30–70% (PDB accession codes 1K05, 1P85, 1IOM, 1NNE, 1NKW, 1EWR, 1ITW and 1HP7). The homologous portions of these structures were aligned on the basis of sequence homology to each corresponding repeat subunit of DC-SIGNR, and their secondary structure was viewed using the program O. The structure of the focal adhesion kinase (PDB code 1K05) was used as a template for tetramer formation, with the additional structures being used primarily to predict the location of turns. The final model was refined in CNS using rigid body refinement and energy minimization. Ribbon diagrams were prepared using the program MOLSCRIPT.

Evaluating potential ligands of DC-SIGN/R

Since both receptors use multiple CRDs to modulate avidity-mediated binding to carbohydrates found on a variety of pathogens, we derived a formula to evaluate and identify potential receptor ligands. Let the surface area encompassed by the tetramer of DC-SIGNR CRD be S_o and the surface area of a protein of interest be S; the binding of DC-SIGN/R then requires the number of glycosylations N to satisfy equation (1). An index for potential ligands can thus be defined as in equation (2). When the likelihood index i is greater than 1, the protein of interest possesses, on average, a higher glycosylation density than is required for binding to DC-SIGN/R and, conversely, when i is less than 1, the target protein is underglycosylated for DC-SIGN/R binding. Assuming a spherical shape for proteins, which is only a crude approximation but nonetheless gives the correct power-dependence on the molecular mass, the surface area S of a given protein can be calculated, to a first approximation, from its molecular mass by equation (3), where N_A is Avogadro's constant (6.022×10²³ mol⁻¹), M is the molecular mass (in Da), and D is the average density of a protein, which has a value of 1.3–1.4 g/ml. If D=1.4 g/ml is taken and equation (3) is substituted in equation (2), the index can be expressed in terms of N, M and S_o, where S_o is in Å².
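The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the function names are hypothetical, and the factor of four (one glycan per CRD of the tetramer within the area S_o, so that the requirement reads N ≥ 4S/S_o and the index is i = N·S_o/(4S)) is an assumption made here rather than a value taken from the text. The spherical surface area follows directly from the stated definitions, S = 4π(3M/(4π·N_A·D))^(2/3), converted from cm³ to Å³.

```python
import math

AVOGADRO = 6.022e23        # mol^-1, value quoted in the text
ANGSTROM3_PER_ML = 1.0e24  # 1 ml = 1 cm^3 = 1e24 cubic angstroms


def spherical_surface_area(mass_da, density_g_per_ml=1.4):
    """Surface area (in square angstroms) of a sphere with the mass and density of the protein."""
    # Volume of one molecule: M / (N_A * D), converted from ml to cubic angstroms.
    volume = mass_da / (AVOGADRO * density_g_per_ml) * ANGSTROM3_PER_ML
    radius = (3.0 * volume / (4.0 * math.pi)) ** (1.0 / 3.0)
    return 4.0 * math.pi * radius ** 2


def likelihood_index(n_glycans, mass_da, tetramer_area,
                     sites_per_tetramer=4, density_g_per_ml=1.4):
    """Ratio of the observed glycan count to the count required for tetramer binding.

    sites_per_tetramer=4 is an assumption of this sketch (one glycan per CRD);
    tetramer_area is S_o in square angstroms and must be supplied by the caller.
    """
    surface = spherical_surface_area(mass_da, density_g_per_ml)
    required = sites_per_tetramer * surface / tetramer_area
    return n_glycans / required
```

With D = 1.4 g/ml the surface area reduces to roughly 5.4·M^(2/3) Å², so under the assumptions of this sketch the index scales as N·S_o/M^(2/3). A value of S_o has to be taken from the tetramer model; none is assumed here.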

Protein Data Bank accession code

Coordinates have been deposited with the Protein Data Bank under accession code 1XPH.
  39 in total

1.  The CCP4 suite: programs for protein crystallography.

Authors: 
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  1994-09-01

Review 2.  DC-SIGN: escape mechanism for pathogens.

Authors:  Yvette van Kooyk; Teunis B H Geijtenbeek
Journal:  Nat Rev Immunol       Date:  2003-09       Impact factor: 53.106

3.  Crystallography & NMR system: A new software suite for macromolecular structure determination.

Authors:  A T Brünger; P D Adams; G M Clore; W L DeLano; P Gros; R W Grosse-Kunstleve; J S Jiang; J Kuszewski; M Nilges; N S Pannu; R J Read; L M Rice; T Simonson; G L Warren
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  1998-09-01

4.  L-SIGN (CD 209L) is a liver-specific capture receptor for hepatitis C virus.

Authors:  Jason P Gardner; Robert J Durso; Robert R Arrigale; Gerald P Donovan; Paul J Maddon; Tatjana Dragic; William C Olson
Journal:  Proc Natl Acad Sci U S A       Date:  2003-04-03       Impact factor: 11.205

5.  Characterization of DC-SIGN/R interaction with human immunodeficiency virus type 1 gp120 and ICAM molecules favors the receptor's role as an antigen-capturing rather than an adhesion receptor.

Authors:  Greg A Snyder; Jennifer Ford; Parizad Torabi-Parizi; James A Arthos; Peter Schuck; Marco Colonna; Peter D Sun
Journal:  J Virol       Date:  2005-04       Impact factor: 5.103

6.  Dendritic-cell-specific ICAM3-grabbing non-integrin is essential for the productive infection of human dendritic cells by mosquito-cell-derived dengue viruses.

Authors:  Erika Navarro-Sanchez; Ralf Altmeyer; Ali Amara; Olivier Schwartz; Franck Fieschi; Jean-Louis Virelizier; Fernando Arenzana-Seisdedos; Philippe Desprès
Journal:  EMBO Rep       Date:  2003-07       Impact factor: 8.807

7.  Sequence and expression of a membrane-associated C-type lectin that exhibits CD4-independent binding of human immunodeficiency virus envelope glycoprotein gp120.

Authors:  B M Curtis; S Scharnowske; A J Watson
Journal:  Proc Natl Acad Sci U S A       Date:  1992-09-01       Impact factor: 11.205

8.  HCV and HIV binding lectin, DC-SIGNR, is expressed at all stages of HCV induced liver disease.

Authors:  G Cole; N Coleman; E Soilleux
Journal:  J Clin Pathol       Date:  2004-01       Impact factor: 3.411

Review 9.  DC-SIGN: a novel HIV receptor on DCs that mediates HIV-1 transmission.

Authors:  T B H Geijtenbeek; Y van Kooyk
Journal:  Curr Top Microbiol Immunol       Date:  2003       Impact factor: 4.291

10.  DC-SIGN and DC-SIGNR bind ebola glycoproteins and enhance infection of macrophages and endothelial cells.

Authors:  Graham Simmons; Jacqueline D Reeves; Case C Grogan; Luk H Vandenberghe; Frédéric Baribaud; J Charles Whitbeck; Emily Burke; Michael J Buchmeier; Elizabeth J Soilleux; James L Riley; Robert W Doms; Paul Bates; Stefan Pöhlmann
Journal:  Virology       Date:  2003-01-05       Impact factor: 3.616

