| Literature DB >> 35904547 |
Valery Lutsyk1, Pawel Wolski1, Wojciech Plazinski1,2.
Abstract
Carbohydrates play an essential role in a large number of chemical and biochemical processes. High structural diversity and conformational heterogeneity make it problematic to link their measurable properties to molecular features. Molecular dynamics simulations carried out at the level of classical force fields are routinely applied to study the complex processes occurring in carbohydrate-containing systems, while the usefulness of such simulations relies on the accuracy of the underlying theoretical model. In this article, we present the coarse-grained force field dedicated to glucopyranose-based carbohydrates and compatible with the recent version of the Martini force field (v. 3.0). The parameterization was based on optimizing bonded and nonbonded parameters with a reference to the all-atom simulation results and the experimental data. Application of the newly developed coarse-grained carbohydrate model to oligosaccharides curdlan and cellulose displays spontaneous formation of aggregates of experimentally identified features. In contact with other biomolecules, the model is capable of recovering the protective effect of glucose monosaccharides on a lipid bilayer and correctly identifying the binding pockets in carbohydrate-binding proteins. The features of the newly proposed model make it an excellent candidate for further extensions, aimed at modeling more complex, functionalized, and biologically relevant carbohydrates.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35904547 PMCID: PMC9367002 DOI: 10.1021/acs.jctc.2c00553
Source DB: PubMed Journal: J Chem Theory Comput ISSN: 1549-9618 Impact factor: 6.578
Figure 1Chemical formulas of the considered glucose-based saccharides exploiting different linkage types, atomistic structures of the corresponding trimers, and the illustration of the mapping scheme on the example of a single residue. The shown bead numbering is used to define the force field terms (Tables –7). Analogous bead numbering is applied in the case of monosaccharides (Table ).
Parameters for α(1 → 4)-Linked Glucopyranose Di-, Oligo-, and Polysaccharides (Bond Stretching, Bond-Angle Bending, and Improper- and Regular-Dihedral Distortion) in the Presently Proposed Force Fielda
| type | topological pattern | parameters | ||
|---|---|---|---|---|
| bonds: | B1-B4 | 18,000 | 0.251 | |
| B2-B3 | 32,000 | 0.280 | ||
| B3-B4 | 34,000 | 0.279 | ||
| B2-B4 | 60,000 | 0.292 | ||
| B2-B8 | 32,000 | 0.280 | ||
| angles | B1-B4-B2 | 340 | 86 | |
| B1-B4-B3 | 580 | 142 | ||
| B3-B2-B8 | 310 | 103 | ||
| B4-B2-B8 | 180 | 103 | ||
| B1-B2-B8 | 210 | 105 | ||
| B2-B8-B5 | 30 | 107 | ||
| B2-B8-B6 | 240 | 148 | ||
| B2-B8-B7 | 120 | 99 | ||
| improper dihedrals | B4-B3-B2-B1 | 200 | 9 | |
| B8-B2-B7-B6 | 170 | 22.3 | ||
| B2-B8-B4-B3 | 300 | –67 | ||
| regular dihedrals | B4-B2-B8-B6 | –12.8 | 161 | 1 |
| B3-B2-B8-B7 | –2.2 | 165 | 1 | |
| B1-B3-B7-B5 | –3.6 | –72 | 5 | |
Force constants correspond to eqs –5. Atom numbering concerns only the first linkage; the parameters for any subsequent, nth linkage can be obtained by increasing the corresponding bead numbers by 4n.
Eq .
Eq .
Concerns only the nonreducing end.
Parameters for α(1 → 6)-Linked Glucopyranose Di-, Oligo-, and Polysaccharides (Bond Stretching, Bond-Angle Bending, and Improper- and Regular-Dihedral Distortion) in the Presently Proposed Force Fielda
| type | topological pattern | parameters | ||
|---|---|---|---|---|
| bonds: | B1-B4 | 21,000 | 0.251 | |
| B2-B3 | 35,000 | 0.280 | ||
| B2-B4 | 60,000 | 0.322 | ||
| B3-B4 | 24,000 | 0.289 | ||
| B2-B5 | 13,000 | 0.225 | ||
| B3-B7 | 2400 | 0.820 | ||
| B1-B9 | 180 | 0.790 | ||
| angles | B1-B4-B2 | 600 | 80 | |
| B1-B4-B3 | 560 | 135 | ||
| B3-B2-B5 | 180 | 109 | ||
| B4-B2-B5 | 100 | 105 | ||
| B2-B5-B6 | 28 | 122 | ||
| improper dihedrals | B4-B2-B3-B1 | 250 | –7 | |
| B2-B3-B5-B4 | 110 | –74 | ||
| regular dihedrals | B3-B2-B5-B6 | –3 | 155 | 2 |
| B2-B5-B6-B7 | –3 | 170 | 5 | |
| B2-B5-B6-B7 | 3 | 60 | 1 | |
| B2-B4-B8-B6 | 5 | –25 | 1 | |
| B2-B4-B8-B6 | –4 | 165 | 3 | |
Force constants correspond to eqs –5. Atom numbering concerns only the first linkage; the parameters for subsequent, nth linkage can be obtained by increasing the corresponding bead numbers by 4n.
Eq .
Eq .
Parameters for Glucopyranose Monomers (Bond Stretching, Bond-Angle Bending, and Improper-Dihedral Distortion) in the Presently Proposed Force Fielda
| type | topological pattern | parameters | |||
|---|---|---|---|---|---|
| α-anomer | β-anomer | ||||
| bonds: | B1-B4 | 12,000 | 0.268 | 12,000 | 0.268 |
| B2-B3 | 20,000 | 0.284 | 24,000 | 0.291 | |
| B3-B4 | 28,000 | 0.291 | 28,000 | 0.291 | |
| B2-B4 | 28,000 | 0.342 | 32,000 | 0.355 | |
| angles | B1-B4-B2 | 340 | 85 | 450 | 81 |
| B1-B4-B3 | 580 | 132 | 580 | 132 | |
| improper dihedrals | B4-B3-B2-B1 | 200 | 9 | 200 | 9 |
Force constants correspond to eqs –4.
Eq .
Parameters for β(1 → 4)-Linked Glucopyranose Di-, Oligo-, and Polysaccharides (Bond Stretching, Bond-Angle Bending, and Improper- and Regular-Dihedral Distortion) in the Presently Proposed Force Fielda
| type | topological pattern | parameters | ||
|---|---|---|---|---|
| bonds: | B1-B4 | 14,100 | 0.250 | |
| B2-B3 | 37,500 | 0.268 | ||
| B3-B4 | 27,000 | 0.273 | ||
| B2-B4 | 53,200 | 0.257 | ||
| B2-B8 | 7500 | 0.267 | ||
| B2-B6 | 16,300 | 0.520 | ||
| B4-B8 | 3770 | 0.542 | ||
| angles | B1-B4-B2 | 220 | 91 | |
| B1-B4-B3 | 159 | 143 | ||
| B3-B2-B8 | 245 | 115 | ||
| B1-B2-B8 | 350 | 127 | ||
| B2-B8-B5 | 16 | 123 | ||
| B2-B8-B7 | 52 | 93 | ||
| improper dihedrals | B4-B3-B2-B1 | 200 | 9 | |
| B8-B2-B7-B6 | 212 | 11 | ||
| B2-B3-B8-B4 | 229 | 9 | ||
| regular dihedrals | B3-B2-B8-B7 | –35 | –135 | 1 |
Force constants correspond to eqs –5. Atom numbering concerns only the first linkage; the parameters for subsequent, nth linkage can be obtained by increasing the corresponding bead numbers by 4n.
Eq .
Eq .
Concerns only the nonreducing end.
Parameters for β(1 → 3)-Linked Glucopyranose Di-, Oligo-, and Polysaccharides (Bond Stretching, Bond-Angle Bending, and Improper- and Regular-Dihedral Distortion) in the Presently Proposed Force Fielda
| type | topological pattern | parameters | ||
|---|---|---|---|---|
| bonds: | B1-B4 | 10,000 | 0.268 | |
| B2-B3 | 38,000 | 0.240 | ||
| B3-B4 | 24,000 | 0.287 | ||
| B2-B4 | 50,000 | 0.287 | ||
| B2-B7 | 12,000 | 0.250 | ||
| angles | B1-B4-B2 | 220 | 78 | |
| B1-B4-B3 | 400 | 123 | ||
| B3-B2-B7 | 60 | 75 | ||
| B1-B2-B7 | 80 | 148 | ||
| B2-B7-B6 | 110 | 128 | ||
| B2-B7-B8 | 40 | 65 | ||
| improper dihedrals | B4-B3-B2-B1 | 120 | 15 | |
| B7-B2-B8-B6 | 200 | 22 | ||
| B2-B7-B3-B4 | 200 | 3 | ||
| regular dihedrals | B3-B2-B7-B6 | –20 | –174 | 1 |
Force constants correspond to eqs –5. Atom numbering concerns only the first linkage; the parameters for subsequent, nth linkage can be obtained by increasing the corresponding bead numbers by 4n.
Eq .
Eq .
Parameters for β(1 → 2)-Linked Glucopyranose Di-, Oligo-, and Polysaccharides (Bond Stretching, Bond-Angle Bending, and Improper-Dihedral Distortion) in the Presently Proposed Force Fielda
| type | topological pattern | parameters | ||
|---|---|---|---|---|
| bonds: | B1-B4 | 10,000 | 0.265 | |
| B2-B3 | 50,000 | 0.263 | ||
| B3-B4 | 24,000 | 0.272 | ||
| B2-B4 | 50,000 | 0.290 | ||
| B2-B7 | 6000 | 0.274 | ||
| B4-B7 | 600 | 0.530 | ||
| angles | B1-B4-B2 | 200 | 80 | |
| B1-B4-B3 | 400 | 132 | ||
| B3-B2-B7 | 30 | 88 | ||
| B1-B2-B7 | 60 | 165 | ||
| B2-B7-B6 | 50 | 97 | ||
| B2-B7-B8 | 40 | 148 | ||
| improper dihedrals | B4-B3-B2-B1 | 150 | 11 | |
| B7-B6-B8-B2 | 300 | 15 | ||
| B2-B4-B3-B7 | 200 | 1 | ||
| regular dihedrals | B3-B2-B7-B6 | –22 | –141 | 1 |
Force constants correspond to eqs –5. Atom numbering concerns only the first linkage; the parameters for subsequent, nth linkage can be obtained by increasing the corresponding bead numbers by 4n.
Eq .
Eq .
CG Systems Considered in the Present Studya
| purpose | compositon | no. of solvent molecules | box vector lengths [nm3] | simulation time | remarks |
|---|---|---|---|---|---|
| conformational properties, SASA | α-Glc monomer | 323 | 3.0 × 3.0 × 3.0 nm3 | 5000 ns | unbiased MD |
| conformational properties, SASA, log | β-Glc monomer | 323 | 3.0 × 3.0 × 3.0 nm3 | 5000 ns unbiased MD,20 ns per TI window | Unbiased MD, TI, various nonbonded parameters |
| log | β-Glc monomer | 450 | 4.8 × 4.8 × 4.8 nm3 | 20 ns per TI window | TI, octanol solvent, various nonbonded parameters |
| solution density | 100–2000 β-Glc monomers | 1514–14,177 | 12 × 12 × 12 nm3 | 10 ns | unbiased MD, various Glc content |
| lipid bilayer | DPPC lipid bilayer | 11,645–11,663 | 10 × 10 × 18 nm3 | 500 ns | unbiased MD,various temperatures |
| influence on the lipid bilayer | 4200 β-Glc monomers + DPPC lipid bilayer | 4242–4299 | 10 × 10 × 23.5 nm3 | 500 ns | unbiased MD, various temperatures |
| log | α(1 → 4)-linked Glc octamer | 8100 | 9.9 × 9.9 × 9.9 nm3 | 50 ns per TI window | TI, various nonbonded parameters |
| log | α(1 → 4)-linked Glc octamer | 5064 | 10.8 × 10.8 × 10.8 nm3 | 50 ns per TI window | TI, octanol solvent, various nonbonded parameters |
| conformational properties, SASA | β(1 → 4)-linked Glc dimer | 3354 | 7 × 7 × 7 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | α(1 → 4)-linked Glc dimer | 3353 | 7 × 7 × 7 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | β(1 → 2)-linked Glc dimer | 3348 | 7 × 7 × 7 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | β(1 → 3)-linked Glc dimer | 3350 | 7 × 7 × 7 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | α(1 → 6)-linked Glc dimer | 3351 | 7 × 7 × 7 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | β(1 → 4)-linked Glc tetramer | 4871 | 8 × 8 × 8 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | α(1 → 4)-linked Glc tetramer | 4872 | 8 × 8 × 8 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | β(1 → 2)-linked Glc tetramer | 4870 | 8 × 8 × 8 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | β(1 → 3)-linked Glc tetramer | 4874 | 8 × 8 × 8 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | α(1 → 6)-linked Glc tetramer | 4874 | 8 × 8 × 8 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | β(1 → 4)-linked Glc octamer | 9187 | 10 × 10 × 10 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | α(1 → 4)-linked Glc octamer | 9195 | 10 × 10 × 10 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | β(1 → 2)-linked Glc octamer | 9181 | 10 × 10 × 10 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | β(1 → 3)-linked Glc octamer | 9187 | 10 × 10 × 10 nm3 | 500 ns | unbiased MD |
| conformational properties, SASA | α(1 → 6)-linked Glc octamer | 9197 | 10 × 10 × 10 nm3 | 500 ns | unbiased MD |
| aggregation properties | 3 β(1 → 3)-linked Glc chains (26 residues) | 29,364 | 15 × 15 × 15 nm3 | 200 ns | unbiased MD, initiated from triple-helix structure |
| aggregation properties | 1 α(1 → 4)-linked Glc chain (32 residues) | 132,476 | 25 × 25 × 25 nm3 | 250 ns | unbiased MD, initiated from V-helix structure, various solvents: water and 1-bead solvents (C1 to C6) |
| aggregation properties | 50 α(1 → 4)-linked Glc octamers | 271,906 | 32 × 32 × 32 nm3 | 1500 ns | unbiased MD |
| aggregation properties | 100 β(1 → 4)-linked Glc octamers | 112,182 | 24 × 24 × 24 nm3 | 4575 ns | unbiased MD |
| aggregation properties | 10 β(1 → 3)-linked Glc octamers | 35,376 | 16 × 16 × 16 nm3 | 10,000 ns | unbiased MD |
| aggregation properties | 50 α(1 → 4)-linked Glc polymers (26 residues) | 606,152 | 42 × 42 × 42 nm3 | 170 ns | unbiased MD |
| aggregation properties | 50 β(1 → 4)-linked Glc polymers (26 residues) | 606,248 | 42 × 42 × 42 nm3 | 780 ns | unbiased MD |
| aggregation properties | 40 β(1 → 3)-linked Glc polymers (26 residues) | 124,888 | 25 × 25 × 25 nm3 | 650 ns | unbiased MD |
| interactions with proteins | 1 protein (PDB: 1I8A) and + 1 monomer of β-Glc | 8579 + 98 Na+ + 91 Cl– | 10 × 10 × 10 nm3 | 10,000 ns | unbiased MD |
| Interactions with proteins | 1 protein (PDB: 4O7P) and + 25 maltose molecules | 27,444 + 310 Na+ + 294 Cl– | 15 × 15 × 15 nm3 | 5000 ns | unbiased MD |
| Interactions with proteins | 1 protein (PDB: 2OVW) and + 25 cellobiose molecules | 27,750 + 305 Na+ + 307 Cl– | 15 × 15 × 15 nm3 | 10,000 ns | unbiased MD |
If not indicated otherwise, the solvent was Martini 3 water, for which 1 CG bead = 4 water molecules.
Figure 2Results of the CG TI simulations aimed at determining the free energies associated with the transfer of glucose mono- and oligosaccharides from water to n-octanol (equivalent to log P). (A) Results for monosaccharides with rings composed of T, S, and R beads. (B) Results for monosaccharides with rings composed of T, S, and S beads. (C) Results for octamers with rings composed of T, S, and R beads (recalculated with respect to a single residue). Theoretical data are compared with predictions of KOWWIN (Estimation Programs Interface Suite for Microsoft Windows, v 4.11) and ALOGPS 2.1[51] programs as well as with the experimental data.[52]
Figure 3Comparison of the SASA values obtained from either AA of CG simulations. Two alternative assignments of the bead type for one of the ring beads were accepted: (A) S bead; (B) R bead.
Figure 4Comparison of the average values of the selected bead-bead distances, bead-bead-bead regular angles, and improper-dihedral angles calculated from unbiased MD simulations within either the AA or CG force field for different types of glycosidic linkages. Horizontal and vertical bars denote the fluctuation associated with a given descriptor, expressed as the standard deviation on the data set.
Figure 6Distributions of gyration radii (Rg) and end-to-end distance (e2e) values calculated within either the AA or CG force field. Calculations concerned homooctasaccharides containing different types of glycosidic linkages.
Figure 5Distributions of dihedral angle values calculated within either the AA or CG force field. The angles correspond to the selected rotation around glycosidic linkages of different types.
Figure 7Density of aqueous solutions of β-d-glucopyranose as a function of the molar ratio. Results from CG simulations (circles) are compared with the experimental data[54] (squares).
Figure 8(A) Exemplary snapshots from the MD simulation of octamers of glucopyranose, exploiting different types of glycosidic linkages. (B) Cellulose II-like sheet formed during the simulation of aqueous solution of cellulose oligomers. (C) Triple helices of curdlan formed during the simulation of the aqueous solution of curdlan oligomers (up) or obtained as a result of simulation of the XRD data-based structure (down).
Figure 9(A, B) Snapshots of CG DPPC bilayers at T = 290 K: (A) gel phase formed in pure water; (B) fluid phase stabilized in the presence of 4 M glucose. Lipid tails are depicted in green, head groups are shown in pink and violet, and glucose molecules shown in white. Water beads are represented by blue balls. (C) Average area per lipid of a DPPC bilayer as a function of temperature. Glucose-free (green) and 5 M glucose solution (blue) systems were considered.
Figure 10(A, B) Binding cavity-carbohydrate molecule distance recorded during MD simulations for protein–carbohydrate systems. Lowest values correspond to the binding event. (C, D) CG structures of the protein–carbohydrate complex and (E, F) atomistic, XRD structures, corresponding to them. In the case of panels (C–F), the same color code was used, highlighting, for example, carbohydrates (green) and two types of aromatic amino-acid residues, responsible for CH-π binding: tryptophane (violet) and tyrosine (red).