| Literature DB >> 30523255 |
Cheng-Wei Cheng1,2,3, Yixuan Zhou4, Wen-Harn Pan5, Supriya Dey4, Chung-Yi Wu4, Wen-Lian Hsu6, Chi-Huey Wong7,8.
Abstract
The programmable one-pot oligosaccharide synthesis method was designed to enable the rapid synthesis of a large number of oligosaccharides, using the software Optimer to search Building BLocks (BBLs) with defined relative reactivity values (RRVs) to be used sequentially in the one-pot reaction. However, there were only about 50 BBLs with measured RRVs in the original library and the method could only synthesize small oligosaccharides due to the RRV ordering requirement. Here, we increase the library to include 154 validated BBLs and more than 50,000 virtual BBLs with predicted RRVs by machine learning. We also develop the software Auto-CHO to accommodate more data handling and support hierarchical one-pot synthesis using fragments as BBLs generated by the one-pot synthesis. This advanced programmable one-pot method provides potential synthetic solutions for complex glycans with four successful examples demonstrated in this work.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30523255 PMCID: PMC6283847 DOI: 10.1038/s41467-018-07618-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1The workflow of Auto-CHO program. a The basic concept of Auto-CHO. b The detailed workflow. Auto-CHO allows users to input a desired glycan structure and the program returns with one-pot glycan synthesis options. To facilitate the ability of Auto-CHO, we have not only expanded the thioglycoside BBLs but also constructed a virtual BBL library by enumerating monosaccharide structures with different protecting group combinations and theoretical RRVs that are estimated by our RRV predictor. RRV predictors are trained by SVM regression models, and features from experimental BBLs with known RRVs. The predictor with the best performance by leave-one-out cross-validation (LOOCV) has been chosen as the final model. Independent test has also been applied to validate the result. Since the use of virtual BBL in glycosylation may not have been validated and it is uncertain if it can be successfully used in the one-pot synthesis process, text mining could be used to identify those virtual BBL candidates that have been reported in literature and thus, have a good chance of participating in the one-pot synthetic reaction. The synthetic methods provided by Auto-CHO have been further validated by four synthetic experiments in this study
LOOCV performances of RRV predictors with different sugar classes, feature types, or settings
| Dataset size | Sugar class | Feature type | FS | Feature size | PCC | MAE | RAE |
|---|---|---|---|---|---|---|---|
| 136 | Hex, HexNAc, SA | BP + CS (Norm.) | No | 25 | 0.6964 | 7519.60 | 0.6161 |
| 117 | Hex, HexNAc | BP + CS (Norm.) | No | 16 | 0.6803 | 4220.57 | 0.4428 |
| 117 | Hex, HexNAc | BP + CS (Bina.) | No | 131 | 0.7675 | 3876.85 | 0.4067 |
| 117 | Hex, HexNAc | BP + CS (Bina. + Norm.) | No | 144 | 0.7768 | 3868.84 | 0.4059 |
| 117 | Hex, HexNAc | MD | No | 1444 | 0.7326 | 4131.05 | 0.4334 |
| 117 | Hex, HexNAc | BP + CS + MD | No | 1595 | 0.8803 | 2537.81 | 0.2662 |
| 117 | Hex, HexNAc | BP + CS + MD | Yes | 222 | 0.9706 | 1291.32 | 0.1355 |
|
|
|
|
|
|
|
|
The optimized performance is shown in bold
FS Feature selection; BP basic properties; CS (calculated) chemical shifts; MD molecular descriptors; Norm. normalized real values; Bina. binarization (Methods)
Predicted and observed RRVs of representative virtual building blocks
|
|
Fig. 2Illustration of the programmable one-pot synthesis of Globo-H by Auto-CHO. a The synthetic solution shows [1 + 3 + 2] strategy. The internal fragment can be synthesized by another one-pot approach. b The synthetic solution shows [1 + 2 + 3] strategy without further fragmentation
Fig. 3Examples of programmable one-pot synthesis. a Auto-CHO suggests that SSEA-4 can be synthesized with three BBLs: sialyl disaccharide 1 with RRV = 1462, monosaccharide 2 with RRV = 32.0, and reducing end acceptor 3 with RRV = 0. The calculated overall yield is 94% and the experimental yield is 43%. b Synthesis of a heparin pentasaccharide with differential protecting groups in color to allow selective introduction of the sulfate groups. c Synthesis of the oligoLacNAc module for the assembly of N-glycans with LacNAc repeats