| Literature DB >> 24673652 |
Feng Qiu1, James B McAlpine, David C Lankin, Ian Burton, Tobias Karakach, Shao-Nong Chen, Guido F Pauli.
Abstract
The interpretation of NMR spectroscopic information for structure elucidation involves decoding of complex resonance patterns that contain valuable molecular information (δ and J), which is not readily accessible otherwise. We introduce a new concept of 2D-NMR barcoding that uses clusters of fingerprint signals and their spatial relationships in the δ-δ coordinate space to facilitate the chemical identification of complex mixtures. Similar to widely used general barcoding technology, the structural information of individual compounds is encoded as a specifics pattern of their C,H correlation signals. Software-based recognition of these patterns enables the structural identification of the compounds and their discrimination in mixtures. Using the triterpenes from various Actaea (syn. Cimicifuga) species as a test case, heteronuclear multiple-bond correlation (HMBC) barcodes were generated on the basis of their structural subtypes from a statistical investigation of their δH and δC data in the literature. These reference barcodes allowed in silico identification of known triterpenes in enriched fractions obtained from an extract of A. racemosa (black cohosh). After dereplication, a differential analysis of heteronuclear single-quantum correlation (HSQC) spectra even allowed for the discovery of a new triterpene. The 2D barcoding concept has potential application in a natural product discovery project, allowing for the rapid dereplication of known compounds and as a tool in the search for structural novelty within compound classes with established barcodes.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24673652 PMCID: PMC4004191 DOI: 10.1021/ac500188j
Source DB: PubMed Journal: Anal Chem ISSN: 0003-2700 Impact factor: 6.986
The Reference HMBC Barcodes for the Ten Principal Types of Actaea Triterpenes
The HMBC cross-peaks were denoted by the corresponding 1H–13C correlations and discriminated by their spatial locations in the δH–δC coordinate of the HMBC spectrum, using the average δH and δC values listed in S4, Supporting Information.
Figure 1The basic workflow of 2D NMR barcoding. S1: The 2D NMR data of reference compounds are statistically analyzed to generate a virtual spectrum, representing a particular type of structure. S2: The distribution and correlation patterns of signals in the virtual 2D NMR spectrum are used as reference barcodes for the structural identification. S3: The chemical components of a (residually) complex sample are identified by in silico matching of their experimental NMR signals with the reference NMR barcodes.
Figure 2The triterpene components in sample A were identified by matching their HMBC correlations with the reference HMBC barcodes. (A) The 1H NMR signals of cyclopropane methylene protons indicated the significant (residual) chemical complexity of sample A, containing at least five Actaea triterpenes. (B) The HMBC spectrum of sample A was acquired and used for chemical identification. (C) By matching with the reference HMBC barcodes, these five Actaea triterpenes were identified as: (24R,25S)-24,25-epoxy-(26S)-26-hydroxyacta-(16S,23R)-16,23;23,26-binoxoside (26β-hydroxyacteol, 1), (24R,25S)-24,25-epoxy-(26R)-26-hydroxyacta-(16S,23R)-16,23;23,26-binoxoside (26α-hydroxyacteol, 2), (23R)-23-acetoxy-(24S)-24,25-epoxy-(15R)-15-hydroxy-16-oxoactanoside (23-O-acetylshengmanol, 3), (15R)-15,25-dihydroxyacta-(16S,23R,24R)-16,23;16,24-binoxoside (24-epi-cimigenol, 4), and (24S)-24-acetoxy-(15R,16R)-15,16,25-trihydroxyacta-(23R)-16,23-monoxoside (hydroshengmanol, 5).
The Individual Triterpenes Identified in the Residually Complex Samples A and B by Means of in Silico Matching of Their 2D HMBC Correlations with the Reference HMBC Barcodes
| sample | cpd | HMBC matches | structural types, new systematic name [common name] |
|---|---|---|---|
| A | (1.79, 63.6), (1.79, 65.8), (1.79, 98.7) | (24 | |
| (1.64, 63.1), (1.64, 64.0), (1.64, 98.4) | (24 | ||
| (1.25, 65.4), (1.25, 58.8), (1.41, 65.4), (1.41, 58.8) | (23 | ||
| (1.26, 84.3), (1.26, 68.7), (1.45, 84.3), (1.45, 68.7) | (15 | ||
| (1.47, 82.9), (1.47, 71.4), (1.48, 82.9), (1.48, 71.4) | (24 | ||
| B | (1.71, 83.6), (1.79, 83.6), (1.71, 83.6), (1.79, 83.6) | (23 | |
| (1.71, 83.6), (1.79, 83.6), (1.71, 83.6), (1.79, 83.6) | (23 | ||
| (1.53, 63.0), (1.53, 68.5) | (24 | ||
| (1.51, 63.0), (1.51, 68.5) | (24 | ||
| no matches | previously unknown |
Figure 3The triterpene components in sample B were identified by matching their HMBC correlations with the reference HMBC barcodes. (A) The 1H NMR signals of cyclopropane methylene protons suggested the presence of at least five Actaea triterpenes in sample B. (B) The full HMBC spectrum of sample B. (C) By matching with the reference HMBC barcodes, only two different skeleton types of triterpenes were found to be present, in contrast to sample A which contained five (Figure 2). The integral ratio between methyl and cyclopropane methylene protons further confirmed four of the five triterpenes as being two (23R,24R)-23,24-dihydroxyacta-(16S,22R)-16,23;22,25-binoxosides (cimiracemosides, 6 and 7) and two (24R,25R)-24,25-epoxyacta-(16S,23S)-16,23;23,26-binoxosides (26-deoxyacteols, 8 and 9). However, one triterpene (10) could not be matched to a known triterpene skeleton and was subsequently identified by differential analysis of HSQC spectra (Figure 4).
Figure 4The structural elucidation of compound 10 by differential analysis of HSQC spectra. (A) The NP-TLC analysis of the corresponding VLC fractions indicated that Fr-4 contained the same components as both nearby fractions Fr-2 and Fr-6, albeit in different ratios. (B) After subtracting the HSQC signals of Fr-2 and Fr-4 from Fr-6, a residual HSQC spectrum was obtained, showing only the signals belonging to 10.
The δH and δC (in ppm) Values of the Side Chains of the Cycloartane Triterpenes, Actein (11)a and the Newly Identified 27-Hydroxyactein (10)b
The 1H and 13C chemical shifts of actein were taken from ref (25).
The δC values were determined from the residual HSQC spectra (see main text); JH27a,H27b was 12.6 Hz.
Figure 5The H-27 signals and side-chain structures of the 26-hydroxyactein (10) newly identified in Sample B. This triterpene was present as an epimeric mixture containing a 7:3 ratio of 26β–OH (10a) and 26α–OH (10a) as indicated by two pairs of doublet signals (J 12.6 Hz) of their geminal protons H-27 at δH 4.949/4.437 and δH 4.606/4.444, respectively.