| Literature DB >> 35141275 |
Vivek S Bharadwaj1, Luke P Westawker1, Michael F Crowley1.
Abstract
Apiose is a naturally occurring, uncommon branched-chain pentose found in plant cell walls as part of the complex polysaccharide Rhamnogalacturonan II (RG-II). The structural elucidation of the three-dimensional structure of RG-II by nuclear magnetic resonance (NMR) spectroscopy is significantly complicated by the ability of apiose to cross-link via borate ester linkages to form RG-II dimers. Here, we developed a computational approach to gain insight into the structure-spectra relationships of apio-borate complexes in an effort to complement experimental assignments of NMR signals in RG-II. Our protocol involved structure optimizations using density functional theory (DFT) followed by isotropic magnetic shielding constant calculations using the gauge-invariant atomic orbital (GIAO) approach to predict chemical shifts. We evaluated the accuracy of 23 different functional-basis set (FBS) combinations with and without implicit solvation for predicting the experimental 1H and 13C shifts of a methyl apioside and its three borate derivatives. The computed NMR predictions were evaluated on the basis of the overall shift accuracy, relative shift ordering, and the ability to distinguish between dimers and monomers. We demonstrate that the consideration of implicit solvation during geometry optimizations in addition to the magnetic shielding constant calculations greatly increases the accuracy of NMR chemical shift predictions and can correctly reproduce the ordering of the 13C shifts and yield predictions that are, on average, within 1.50 ppm for 13C and 0.12 ppm for 1H shifts for apio-borate compounds.Entities:
Keywords: 1H and 13C; Apiose; Borate; DFT; Gauge invariance approach; NMR chemical shifts; Quantum chemistry; Rhamnogalacturonan II
Year: 2022 PMID: 35141275 PMCID: PMC8820409 DOI: 10.3389/fmolb.2021.756219
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
FIGURE 1Topological structure of Rhamnogalacturonan II (RG-II) depicted as linked sugar molecules, with apiose as a blue pentagon.
FIGURE 2Compounds 1–4 of methyl apioside and apio–borate esters (Ishii and Ono, 1999).
Nuclear magnetic resonance (NMR) functional–basis sets (FBS) 1–23 identified by their FBS combinations for NMR shielding constant calculations.
| NMR FBS no. | Functional | Basis set | Literature reference for NMR predictions |
13C MAEs for |
|---|---|---|---|---|
| 1 | B3LYP ( | 6-31G(d) |
| 1.50 |
| 2 | B3LYP ( | 6-311+G(2d,p) |
| 2.28 |
| 3 | B3LYP ( | cc-pVDZ |
| 2.37 |
| 4 | B3LYP ( | aug-cc-pVDZ |
| 1.93 |
| 5 | BMK ( | 6-31G(d) |
| 2.58 |
| 6 | BMK ( | 6-311G(d) |
| 3.39 |
| 7 | mPW1PW91 ( | 6-311+G(2d,p) |
| 1.85 |
| 8 | PBE0 ( | 6-311+G(2d,p) |
| 1.87 |
| 9 | WC04 ( | 6-31g(d) |
| 7.34 |
| 10 | WP04 ( | aug-cc-pvdz |
| 1.88 |
| 11 | CAM-B3LYP ( | 6-311+G(2d,p) |
| 2.13 |
| 12 | mPW1LYP | 6-311+G(2d,p) |
| 2.13 |
| 13 | B3LYP ( | 6- 311G(d, p) |
| 2.36 |
| 14 | CSGT-LC-TPSSTPSS ( | cc-pVTZ |
| 1.95 |
| 15 | PBE ( | 6-311G** |
| 1.91 |
| 16 | mPW1PW91 ( | 6-31G(d) |
| 1.90 |
| 17 | B3LYP ( | TZVP |
| 2.52 |
| 18 | BP86 | TZVP |
| 3.27 |
| 19 | B3PW91 ( | 6-31+G(d) |
| 1.72 |
| 20 | B3LYP ( | 6-311G++(2d,2p) |
| 2.31 |
| 21 | PBE ( | TZ2p |
| 2.04 |
| 22 | WC04 ( | aug-cc-pVDZ | — | 5.91 |
| 23 | WP04 ( | 6-31G(d) | — | 1.79 |
MAEs, mean absolute errors.
Specific route lines are listed in Supplementary Table S3.
Use of FBS in the literature for predicting chemical shifts.
MAEs for 13C predictions using implicit solvation during both geometry optimizations and shielding constant calculations.
FIGURE 3Protocol for quantum mechanical (QM)-based nuclear magnetic resonance (NMR) shift calculations. The dark blue and gray arrows indicate the analysis methodology for the compound of interest and the reference compound, respectively.
FIGURE 4Mean absolute errors (MAEs) in parts per million for 13C nuclear magnetic resonance (NMR) predictions for NMR functional–basis sets (FBS) 1–10 represented as donut plots for gas-phase geometry optimizations. The donut size reflects the averaged MAE for each NMR FBS. The four colors within each donut indicate errors for each compound, and the shades within each color indicate errors on a specific carbon atom. MAEs and r2 values are shown in blue and red, respectively. Donuts on white and blue backgrounds indicate NMR values calculated in the gas phase and with implicit solvation, respectively.
13C shifts for compounds 1–4 from Ishii and Ono (1999) compared to the computationally predicted shifts using either gas phase or implicit solvation (IEFPCM).
| Compound | Solvation model | Chemical shifts (ppm) (downfield → upfield) | MAE | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Optimization | NMR (FBS 1) | C1 | C3 | C2 | C4 | C3′ | OCH3 | ||
|
| – | Ishii et al. | 110.20 | 80.08 | 77.30 | 74.32 | 64.23 | 56.78 | – |
| Gas phase | IEFPCM | 108.53 | 79.62 | 78.76 | 73.17 | 64.47 | 52.73 | 1.50 | |
| IEFPCM | IEFPCM | 108.38 | 79.25 | 78.59 | 73.04 | 63.26 | 53.35 | 1.60 | |
|
| – | Ishii et al. | 110.70 | 86.92 | 83.42 | 75.08 | 65.38 | 55.08 | – |
| Gas phase | IEFPCM | 107.08 | 84.03 | 85.83 | 77.21 | 66.50 | 50.30 | 2.82 | |
| IEFPCM | IEFPCM | 110.41 | 83.92 | 83.30 | 72.33 | 64.20 | 52.24 | 1.70 | |
|
| – | Ishii et al. | 110.18 | 87.09 | 83.25 | 74.86 | 65.13 | 55.06 | – |
| Gas phase | IEFPCM | 108.45 | 84.92 | 85.85 | 76.95 | 66.43 | 50.96 | 2.33 | |
| IEFPCM | IEFPCM | 109.90 | 85.71 | 84.37 | 73.92 | 65.20 | 52.18 | 1.11 | |
|
| – | Ishii et al. | 110.28 | 86.94 | 83.42 | 74.90 | 65.13 | 55.06 | – |
| Gas phase | IEFPCM | 106.76 | 84.25 | 86.90 | 76.68 | 66.11 | 50.49 | 2.84 | |
| IEFPCM | IEFPCM | 110.48 | 85.39 | 84.24 | 72.16 | 63.64 | 52.22 | 1.61 | |
IEFPCM, polarizable continuun model using the integral equation formalism variant; FBS, functional–basis set; MAE, mean absolute error.
NMR shielding constant calculations using FBS 1 and implicit solvation (IEFPCM).
Incorrect ordering of C3/C2 shifts as compared with the experimental data of Ishii et al. is highlighted in red.
Accounting for implicit solvation during geometry optimizations decreases the MAEs.
FIGURE 5Mean absolute errors (MAEs, blue) in parts per million and r 2 values (red) for 13C nuclear magnetic resonance (NMR) predictions. NMR functional–basis sets (FBS) 11–23 are compared to the top-performing methods from NMR FBS 1–10, shown at the top (1, 7, 8, and 10). Donut plots for all NMR FBS 1-23 are shown in Supplementary Figure S2.
Statistical analysis [mean absolute errors (MAEs), mean absolute percent errors (MAPEs), and r 2 values] of the performance of the top 6 nuclear magnetic resonance (NMR) functional–basis set (FBS) combinations for predicting 13C and 1H shifts using implicit solvation during both geometry optimizations and NMR shielding constant calculations.
| NMR FBS no. | MAE (ppm) | MAPE (%) |
| MSE (ppm) | SD (ppm) | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| 13C | 1H | 13C | 1H | 13C | 1H | 13C | 1H | 13C | 1H | |
| 1 | 1.50 | 0.12 | 2.16 | 2.98 | 0.995 | 0.892 | −1.21 | −0.02 | 1.39 | 0.16 |
| 16 | 1.90 | 0.11 | 2.63 | 2.69 | 0.996 | 0.913 | −1.80 | −0.05 | 1.19 | 0.14 |
| 19 | 1.72 | 0.10 | 2.51 | 2.64 | 0.996 | 0.918 | −1.52 | 0.00 | 1.43 | 0.14 |
| 23 | 1.79 | 0.13 | 2.53 | 3.44 | 0.993 | 0.927 | −1.38 | −0.11 | 1.56 | 0.13 |
| 7 | 1.85 | 0.13 | 2.61 | 3.44 | 0.996 | 0.893 | −0.49 | −0.06 | 2.19 | 0.16 |
| 8 | 1.87 | 0.14 | 2.60 | 3.48 | 0.996 | 0.888 | −0.29 | −0.06 | 2.26 | 0.16 |
The combination of low MAE values and low standard deviation (SD) between individual errors suggests both high accuracy and precision for NMR FBS 1 and 23.
MSE, mean signed error.
FIGURE 6Mean absolute error (MAE) improvements upon solvation for the top-performing nuclear magnetic resonance (NMR) functional–basis sets (FBS): 1 (A), 19 (B), 23 (C), 16 (D), 7 (E), and 8 (F). Arrows are color-coded according to the compound number. Vertical arrows indicate MAE improvements due to the consideration of solvation during the geometry optimization, while horizontal arrows signify improvements during shielding constant calculations. The arrows point into the quadrant showing the most improvement. The listed MAE values (ppm) are the average of all compounds for that solvent combination. Their size indicates the size of MAE improvement upon a change in solvation model, where the full-length of a box is equal to an MAE improving by 1.23 ppm.
FIGURE 7Balancing accuracy with precision. Comparison of the top 6 performing nuclear magnetic resonance (NMR) functional–basis sets (FBS): 1 (A), 19 (B), 23 (C), 16 (D), 7 (E), and 8 (F). Implicit solvation was modeled for both geometry optimizations and shielding constant calculations. Each of the six carbons is denoted by a different shape, while each of the four compounds is denoted by a different color. The black line represents the literature values for the experimental chemical shifts, whereas the red line represents the best-fit line for the computed chemical shifts used to extract the r2 values.