Literature DB >> 35696108

The chemical aspects of Raman spectroscopy: Statistical structure-spectrum relationship in the analyses of bioflavonoids.

Chih-Hsien Wang1, Chia-Chi Huang1, Wenlung Chen1, Yen-Shi Lai2.   

Abstract

Raman spectroscopy has been accepted as a useful tool for the characterization of natural products. However, to identify a specific compound in a mixture sample of natural products using Raman spectra alone is highly challenging if not impossible. We demonstrated an effective solution to such issues using a method combining statistical Raman spectroscopy and Mass spectrometry. The method was validated with a successful application to the identification of the major anthocyanin components in a purple yam (Dioscorea purpurea) extract. Of particular interest is that statistical grouping of the bioflavonoid standards that formed the database of this study was found to correspond closely to the conventional chemical classification. An initial theory on the chemical aspects of Raman spectroscopy pertaining to the connectivity of Raman-active functional groups in bioflavonoids was developed based on the statistical correlation between chemical classification and Raman spectroscopy.

Entities:  

Year:  2020        PMID: 35696108      PMCID: PMC9261862          DOI: 10.38212/2224-6614.1058

Source DB:  PubMed          Journal:  J Food Drug Anal            Impact factor:   6.157


1. Introduction

Dioscorea alata L., D. alata L. var. purpurea (Dioscorea purpurea in short) is a species of yam featuring purple colored flesh characteristic of anthocyanins. It is a common food and a traditional medicine as well because of its functional constituents. A number of these constituents, including allantoin, crude anthocyanin, β-carotene lecithin, and dioscorin were investigated in one of our previous studies [1]. Except anthocyanin, these active compounds were identified unambiguously in this earlier work by Fourier transform Raman spectroscopy (FT-Raman). FT-Raman with Nd: YAG lasers at 1064 nm, Michelson interferometers, and high-sensitivity GaAs detectors provided the advantages of high frequency precision, easy sample preparation, and low fluorescence interference, and so has been widely applied in food and agriculture research [2-6]. Raman active molecules undergo a change in polarizability during molecular vibration leading to a set of Raman fingerprints that is useful for the characterization of phytochemicals [7,8]. With incompletely purified samples, however, the power of characterization can decrease drastically due to overlapping fingerprints. Assemblies of phenolic fragments are ubiquitous in plant secondary metabolites such as bioflavonoids that include anthocyanins. Investigations of bioflavonoids using IR and Raman spectroscopies [9,10] have been conducted on purified standards and compilations of aromatic and hydroxy vibrational frequencies [11] were available. In our case, with omission of a tedious purification process, the composition of the principal D. purpurea anthocyanin fraction was so complex it was difficult to identify the major anthocyanin based on this standardized information. This was a common issue in the study of many other plant materials and affected the throughput of Raman analysis of mixture in general. There was a practical need for methods that catalog the huge collection of standard spectral data in a way that mixture can be effectively analyzed by a transformed pattern in lieu of molecular fingerprinting. A few studies have reorganized Raman spectral data with statistical methods such as hierarchical clustering and principal component analysis. The results provided correlations between spectral data and non-spectral properties such as color of flower petals in one example and species of citrus fruits in another [12,13]. The objective of the current study was to develop a simple spectral data processing method to aid the identification of anthocyanins in purple yam samples. Our method involved comparison of the FT-Raman spectrum of a real sample to a collection of standard bioflavonoids spectra. Specifically, the real sample was an HPLC fraction of purple yam extract. The comparison was made based on similarity distances between Raman spectra using a Pearson correlation algorithm [14]. Mass spectra of the real sample were subjected to structure analyses and used to validate the results of Pearson correlation. As bioflavonoids were chemically classified according to the connectivity pattern of their phenolic fragments, we further studied the correlation between structure classifications and Raman spectra of the bioflavonoid standards. Rationalizations of the outcome in the form of structure-spectrum relationships were developed.

2. Materials and methods

Materials used in this study and the preparation of the real sample are detailed in the Supporting Information.

2.1. Spectral acquisition

FT-Raman spectra were obtained on a Bruker RFS-100 FT-spectrophotometer (Bruker Optik GmbH, Lubeck, Germany). Nd:YAG laser at 1064 nm (Coherent Lubeck GmbH, Lubeck, Germany) pumped by a diode laser was employed for excitation. Aiming of light on the samples were assisted by a HeNe laser beam cooperated with the excitation light. The output laser power was 100 mW. The resolution was at 4 cm−1, and the scattered radiation was collected at 180° with a set of reflectors. GaAs detector was cooled by liquid nitrogen, and Raman spectra were recorded over the Raman shift range 1–3500 cm−1. Each sample was scanned 500 times. FT-Raman spectra reported in this study were all-original and were not smoothed, normalized, or baseline corrected by data manipulation.

2.2. Data treatment

Spectral data, in the Raman shift range of 1–3500 cm−1 at 2 wavenumber-unit intervals, amounting to a total of 1815 Raman shift variables, was processed and tabulated in Microsoft Excel. Statistical analyses were conducted in the IBM SPSS environment. Statistical score by Pearson correlation was used to measure the co-variance of Raman intensities within the space of 1815 dimensions of Raman shift. The initial result was then compared by means of similarity matrix, hierarchical clustering and multidimensional scaling. Statistical analyses of the structure indices and canonical correlation between spectral and structural data were conducted in a similar manner. For more detail about these methods and their applications, see Ref. [15]. Some result presentations such as conditional formatting and graphing were made with the aid of Microsoft Excel.

3. Result and discussion

3.1. FT-Raman database of flavonoid compounds

Twenty-eight bioflavonoids were chosen as the standards for this study and their FT-Raman spectra were acquired. These standards included 10 anthocyanins, 8 catechins (aka flavonols), and 10 flavonoids, with the last subgroup being loosely defined to allow for negative control. In Fig. 1 we show the basic skeleton and a representative molecule for each of these three subclasses. Table 1 gives the ID numbers and structure indices of the twenty-eight compounds. These indices were used as the basis of structure correlation later. The structures and FT-Raman spectra (printed over the range 350 cm−1 to 1800 cm−1) of all 28 standards are included in the Supporting Information.
Fig. 1

Basic skeleton, conventional classification, and examples of flavonoids subclasses.

Table 1

Pre-categorization and structure indices of 28 flavonoid standards.

NoNameC2C3C4C5C6C7C8C2′C3′C4′C5′C6′
anthocyanins1callistephin chlorideOGluHOHHOHHHOMeOHHH
2cyanidin chlorideOHHOHHOHHHOHOHHH
3delphinidin chlorideOHHOHHOHHHOHOHOHH
4malvidin chlorideOHHOHHOHHHOMeOHOMeH
5malvin chlorideOGluHOGluHOHHHOMeOHOMeH
6oenin chlorideOGluHOHHOHHHOMeOHOMeH
7pelargonidin chlorideOHHOHHOHHHHOHHH
8pelargonin chlorideOGluHOGluHOHHHHOHHH
9peonidin chlorideOHHOHHOHHHOMeOHHH
10petunidin chlorideOHHOHHOHHHOHOHOMeH
catechins11(+)-catechinβHβOH,αHH,HOHHOHHHOHOHHH
12(−)-catechin hydrateαHαOH,βHH,HOHHOHHHOHOHHH
13(−)-epicatechin gallateαHβOGal,αHH,HOHHOHHHOHOHHH
14(−)-epicatechinβHβOH,αHH,HOHHOHHHOHOHHH
15(−)-epigallocatechin gallateβHαOGal,βHH,HOHHOHHHOHOHOHH
16(−)-epigallocatechinβHαOH,βHH,HOHHOHHHOHOHOHH
17(−)-gallocatechin gallateαHαOGal,βHH,HOHHOHHHOHOHOHH
18(−)-gallocatechinαHαOH,βHH,HOHHOHHHOHOHOHH
flavonoids193-hydroxyflavoneOHOOHHHHHHHHH
20apigeninHOOOHHOHHHHOHHH
21daidzeinHOOHHOHHHHOHHH
22flavoneHOOHHHHHHHHH
23flavononeαHH,HOOHHHHHHHHH
24genisteinHOOOHHOHHHHOHHH
25kaempferolOHOOOHHOHHHHOHHH
26luteolinHOOOHHOHHHOHOHHH
27myricetinOHOOOHHOHHHOHOHOHH
28quercetinOHOOOHHOHHHOHOHHH
29real sample

3.2. Partial purification of Dioscorea purpurea anthocyanins by HPLC and preparation of the real sample

Initially, fractionation of the yam extract was preceded by hydrolysis and de-glycosylation treatments in an effort to release the anthocyanidins for analyses. We have since realized that these pretreatments were detrimental to the preservation of the anthocyanin integrity. Therefore, the yam extract was fractionated by HPLC without any pretreatments this time. The principal anthocyanin fractions included ten peaks, as shown in the chromatogram (Fig. S1, Supporting Information). Further analyses by Mass spectrometry showed that none of these peaks were homogeneous (Table S1, Supporting Information). The most populous fraction in the chromatogram, namely peak number 5, was used as the real sample in the FT-Raman study. It is difficult to identify an anthocyanin from the FT-Raman spectrum (Fig. S2, Supporting Information) of this sample. By referring to the closest literature, Raman peak assignments could be exhausted as follows: 476 cm−1 (τCC vibration), 540 cm−1 (δCC vibration), 1184 cm−1 (OH stretching), 1134, 1325, 1530 and 1598 cm−1 (νCC vibration), and 1631 cm−1 (νCC and νCO vibrations) [16]. This, however, did little to help one reconstruct the structure of the compound because information about the connectivity of functional groups that differentiates various anthocyanins is absent.

3.3. Statistical treatments of Raman spectra

Peak intensities of the 28 standard Raman spectra as well as the real sample were tabulated across wavenumber ticks between 1 cm−1 and 3500 cm−1 for a total of 29 cases and 1815 variables (ticks). This table was imported to the SPSS software wherein similarity distances between all possible pairs of the 29 spectra were calculated against the 1815 variables based on Pearson product-moment. The results were shown in Fig. 2 as a proximity matrix where each cell contained a number proportional to the degree of pairwise similarity ranging from 0.00 to 1.00, with 1.00 indicating statistical identicality. By color-coding all the cells with a number greater than 0.60, three blocks were formed as outlined by the blue rectangles which coincided more or less with the preassignments in Table 1. Based on the simple formula of the catechins block (row/column 11–18) had the highest rate at 100%, followed by the anthocyanins (row/column 1–10) at 82%, and the flavonoids (row/column 19–28) at a borderline 52% that conformed well with the allowance for negative controls.
Fig. 2

Spectral proximity matrix of the bioflavonoid standards and real sample.

In terms of individuals in each block, petunidin (No. 10) was highly similar to cyanidin (No. 2) and delphinidin (No. 3) but much less so to the remaining anthocyanins. Flavonone (No. 23) was notable for its similarity to the catechins as opposed to the other flavonoids. Other poor flavonoids performers included apigenin (No. 20), for its similarity to the anthocyanins, and flavone (No. 22) for essential dissociation from all the groups. Kaempferol (No. 25) and quercetin (No. 28) were both borderline between anthocyanins and flavonoids. A remarkable outcome was that glycosylation was not a determining cause of dissimilarity so that malvidin (No. 4) was indifferentiable from malvin (No. 5), malvidin from oenin (No. 6), and pelargonidin (No. 7) from pelargonin (No. 8). Esterification by gallic acid was not determining either. Thus, epicatechin gallate (No. 13) was indifferentiable from epicatechin (No. 14), epigallocatechin gallate (No. 15) from epigallocatechin (No.16) and gallocatechin gallate (No. 17) from gallocatechin (No. 18). This was fortunate for us since bypassing the hydrolysis/deglycosylation pretreatments during real sample preparation would be inconsequential for validating the statistical outcomes with Mass spectrometry as discussed later. Finally, the Raman spectrum of the real sample (No. 29) was found to associate with the anthocyanins only, highly similar to petunidin (No. 10) and cyanidin (No. 2). We next applied a multiple dimensional scaling (MDS) to the set of 29 Raman spectra to confirm the results of the Pearson correlation. This task assigned each spectrum to a specific location in a conceptual two-dimensional space such that the distances between points in the space matched the given dissimilarities. As shown in Fig. 3 the MDS trended toward three groups, tightly with the catechins (VAR 11–18) and anthocyanins (VAR1-10) and loosely with the flavonoids (VAR19-28). Again, the real sample (VAR29) was shown to be distant from the catechins and the flavonoids and resembled cyanidin (VAR2) and oenin (VAR6) better than petunidin (VAR10) as measured by the concentric purple dashed circle.
Fig. 3

Multiple dimensional scaling of the bioflavonoid standards and real sample.

Hierarchical clustering of the 29 spectra resulted in the dendrogram in Fig. 4. Three groups were formed at the heterogeneity level around 15. The flavonoids were the most dissociated, with apigenin, daidzein, flavonone and genistein being misplaced with the anthocyanins and the catechins. The real sample was clustered with the anthocyanins on a secondary level, being closest to cyanidin, delphinidin, and petunidin.
Fig. 4

Hierarchical clustering of the bioflavonoid standards and real sample.

In summary, all three statistical models agreed in two terms. There were clear associations within the catechins and the anthocyanins, but less so with the flavonoids as it was by design of negative control. The real sample was recognized as an anthocyanin and, specifically, an analog of cyanidin. For independent support, a cross reference was made to a structure analysis of the real sample’s MS spectrum. The results (peak 5, Table S1, Supporting Information) suggested that the anthocyanins in this sample were cyanidin glucoside or pyranoside esters of ferulic acid or sinapic acid. Since glycosylation/esterification would not prevent the statistical model from recognizing anthocyanidins by the Raman spectra of anthocyanins or esters, the statistical models have concluded highly consistently with MS in this case.

3.4. The chemical aspects of Raman spectroscopy

Though broadly applied to the study of chemicals, a particularly underdeveloped application of Raman spectroscopy was in the elucidation of chemical structures. We have learned from the current study that statistical modeling of Raman data cataloged bioflavonoids in a way similar to structure-based grouping. To gain more insight into the intersection of Raman spectrum and chemical structure, we conducted a statistical analysis of the structure of the 28 bioflavonoid standards. The structure indices in Table 1 were recoded to scale values as shown in Table S2 (Supporting Information). With the constant variables C6, C8, C2′, and C6′ in Table 1 excluded, distances between all possible pairs of the 28 structures were calculated by Pearson product-moment and the resultant proximity matrix is shown in Fig. 5. Application of the 60%-or-better-similarity rule cataloged the structures into anthocyanins, catechins, and flavonoids at 88%, 100%, and 62% responsive rates. By this result, it would seem that natural products chemists had run their own statistical analyses in mind when the conventional classification of bioflavonoids was devised decades ago. As there appeared to be a common place between Figs. 2 and 5, we were curious whether there was a dependency between the two seemingly independent datasets.
Fig. 5

Structural proximity matrix of the bioflavonoid standards.

Assignment of values to the structure indices in the recoding process was random. Therefore, they were standardized to Z scores. The Z scores of the structure variables C2, C3, C4, C5, C7, C3′, C4′, and C5′ were then correlated to the intensity of the 1815 spectrum variables using the canonical correlation algorithm provided by SPSS. The task was set to tell how the index variables will predict the spectrum variables and the results are presented in Fig. 6, which showed the association of individual structure indices and Raman shifts with a significance level of p < 0.005 (two-tailed). For discussion purpose, one exception was allowed for C4, which had a significance of 0.005 < p < 0.007. Fig. 6 shows that the structure influence on Raman spectra was long range and involved electronic resonance as a key mechanism.
Fig. 6

A presentation of Structure-Spectrum Relationship of the bioflavonoid standards (for the individual structure indices see Table 1).

Molecular orbital hybridizations at C2 and C3 are key for the conventional classification of flavonoids, i.e. sp3 in catechins and sp2 in anthocyanins/flavonoids. Anthocyanin/flavonoid C2=C3 bonds are conjugated to both ring A and ring B and are in place to influence the vibrations of the phenols by resonance with the oxygen lone pairs. Indeed, C2 and C3 were found to correlate with benzene ring stretching (1550–1630 cm−1). C3 might also affect the phenolic COH bending in the range of 1300–1360 cm−1 [17]. C4 was significant in anthocyanins where it could resonate with the oxygen lone pairs at C3 and couple with glucose vibrations at 2960–2900 cm−1 and 560-500 cm−1 [18]. The lower p value of C4 was consistent with the observation in Fig. 2 that glycosylation did not lessen the similarity between anthocyanin and anthocyanidin. Hydroxy substituents on C5, C7, and C4′ could contribute their lone pairs to an extended π-system involving ring C and affect the vibrations of the benzene rings as well as the phenol groups. C3′ and C5′ are meta to the rest of the molecules in all three subclasses of compounds and, being deprived of chance of remote couplings via resonance, have found no significance in Fig. 6. Retrospectively, the above scheme explains why petunidin (No. 10), cyanidin (No. 2), and delphinidin (No. 3) were not differentiated by FT-Raman despite that they were structurally differentiated at C5′. Flavonone (No. 23) was included in the catechins flock in Fig. 2 because they all have saturated C2–C3 bonds. The enhanced weight of C2–C3 in the absence of resonance was presumably also the main driver behind the high responsive rate within the catechins themselves. The structure-spectrum relationship would also have predicted the placement of apigenin (No. 20) and kaempferol (No. 25) with the anthocyanins because they all have sp2-hybridized C2–C3 as well as a C4′ OH group. The significant structural presence or absence of hydroxy groups at C3′ and C5′ was statistically inconsequential because C3′ and C5′ were not an effector. The effect of C4 was not universal, which might be an addition reason why its p value was high in the canonical correlation. Thus, monoglycosylation, though a massive structure modification, did not warrant a decisive spectral differentiation of malvidin (No.4) from malvin (No. 5), neither from oenin (No. 6). The same went to pelargonidin (No. 7) and pelargonin (No. 8). The effect of C4 was more significant between the bis-glucoside malvin and monoglucoside oenin as shown in Fig. 2. Bis-glucosidation would pose higher conformational restraint to and elevate the C4-mediated coupling with the sugar vibrations.

4. Conclusions

Similarity between spectra was a drawback in the analysis of flavonoids mixture by Raman spectroscopy. We have established a method of processing FT-Raman spectral data to turn this around. In this method, Pearson correlation, multiple dimensional scaling, and hierarchical clustering based on similarity measures to classify 28 bioflavonoids standards as anthocyanins, catechins, and flavonoids. The model was applied successfully to identify the major anthocyanin components in a purple yam extract as cyanidin derivatives. It was hoped that this would set a precedent for resolving similar issues in the studies of other plant materials in general; and, with this example in order, the FT-Raman data of the 28 bioflavonoid standards can be repeatedly used by us or an interested colleague to analyze another bioflavonoid-containing real sample. Statistical classifications of the bioflavonoid standards based on their Raman spectra and structure indices were both consistent with the conventional classification. Statistical correlation between the structure indices and Raman spectra was established using a canonical correlation method. The result was highly selective and could be explained using a long-range coupling theory. By this we have developed a structure-spectrum relationship for bioflavonoids which can be summarize as follows: (1) Hybridization status at C2 and C3 had a determinant predicting effect on the Raman spectra. Vice versa, by examining the aromatic ring stretching and phenolic COH bending alone, attribution to a subclass of C2–C3 hybridizations can often be made. (2) C4 is only important when it is sp2hybridized and where conformational difference exists to affect its participation in resonance with neighboring groups. (3) Bioflavonoids different only in the substitution at C3′ and C5′ may not be differentiated by Raman spectroscopy if no other rules apply.
  9 in total

1.  FT-Raman spectroscopic simultaneous determination of fructose and glucose in honey.

Authors:  Apostolos N Batsoulis; Nikolaos G Siatis; Athanasios C Kimbaris; Eleftherios K Alissandrakis; Christos S Pappas; Petros A Tarantilis; Paschalis C Harizanis; Moschos G Polissiou
Journal:  J Agric Food Chem       Date:  2005-01-26       Impact factor: 5.279

2.  Structural investigation of edible zein films/coatings and directly determining their thickness by FT-Raman spectroscopy.

Authors:  Bao-Lian Hsu; Yih-Ming Weng; Yu-Hsiu Liao; Wenlung Chen
Journal:  J Agric Food Chem       Date:  2005-06-29       Impact factor: 5.279

3.  Compositional and conformational analysis of yam proteins by near infrared fourier transform Raman spectroscopy.

Authors:  Yu-Hsiu Liao; Chih-Hsien Wang; Chin-Yin Tseng; Hsiu-Ling Chen; Long-Liu Lin; Wenlung Chen
Journal:  J Agric Food Chem       Date:  2004-12-29       Impact factor: 5.279

Review 4.  Profiling methods for the determination of phenolic compounds in foods and dietary supplements.

Authors:  James M Harnly; Seema Bhagwat; Long-Ze Lin
Journal:  Anal Bioanal Chem       Date:  2007-07-25       Impact factor: 4.142

5.  Estimation of cellulose crystallinity of lignocelluloses using near-IR FT-Raman spectroscopy and comparison of the Raman and Segal-WAXS methods.

Authors:  Umesh P Agarwal; Richard R Reiner; Sally A Ralph
Journal:  J Agric Food Chem       Date:  2012-12-27       Impact factor: 5.279

6.  Surface-enhanced Raman scattering (SERS) study of anthocyanidins.

Authors:  Chiara Zaffino; Bianca Russo; Silvia Bruni
Journal:  Spectrochim Acta A Mol Biomol Spectrosc       Date:  2015-04-25       Impact factor: 4.098

7.  Red Walnut: Characterization of the Phenolic Profiles, Activities and Gene Expression of Selected Enzymes Related to the Phenylpropanoid Pathway in Pellicle during Walnut Development.

Authors:  Martina Persic; Maja Mikulic-Petkovsek; Heidi Halbwirth; Anita Solar; Robert Veberic; Ana Slatnar
Journal:  J Agric Food Chem       Date:  2018-03-08       Impact factor: 5.279

Review 8.  Raman spectroscopy in the analysis of food and pharmaceutical nanomaterials.

Authors:  Ying-Sing Li; Jeffrey S Church
Journal:  J Food Drug Anal       Date:  2014-02-01       Impact factor: 6.157

9.  Raman spectroscopy coupled with chemometric methods for the discrimination of foreign fats and oils in cream and yogurt.

Authors:  Nazife Nur Yazgan Karacaglar; Tugba Bulat; Ismail Hakki Boyaci; Ali Topcu
Journal:  J Food Drug Anal       Date:  2018-07-04       Impact factor: 6.157

  9 in total
  1 in total

Review 1.  A Narrative Review of Recent Advances in Rapid Assessment of Anthocyanins in Agricultural and Food Products.

Authors:  Muhammad Faisal Manzoor; Abid Hussain; Nenad Naumovski; Muhammad Modassar Ali Nawaz Ranjha; Nazir Ahmad; Emad Karrar; Bin Xu; Salam A Ibrahim
Journal:  Front Nutr       Date:  2022-07-19
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.