Literature DB >> 29976396

Elemental profile and oxygen isotope ratio (δ18O) for verifying the geographical origin of Chinese wines.

Shuangxi Fan1,2, Qiding Zhong2, Hongbo Gao2, Daobing Wang2, Guohui Li2, Zhanbin Huang1.   

Abstract

The elemental profile and oxygen isotope ratio (δ18O) of 188 wine samples collected from the Changji, Mile, and Changli regions in China were analyzed by inductively coupled plasma mass spectrometry (ICP-MS), inductively coupled plasma optical emission spectroscopy (ICP-OES) and isotope ratio mass spectrometry (IRMS), respectively. By combining the data of δ18O and the concentration data of 52 elements, the analysis of variance (ANOVA) technique was firstly applied to obtain the important descriptors for the discrimination of the three geographical origins. Ca, Al, Mg, B, Fe, K, Rb, Mn, Na, P, Co, Ga, As, Sr, and δ18O were identified as the key explanatory factors. In the second step, the key elements were employed as input variables for the subsequent partial least squares discrimination analysis (PLS-DA) and support vector machine (SVM) analyses. Then, cross validation and random data splitting (training set: test set = 70:30, %) were performed to avoid the over-fitting problem. The average correct classification rates of the PLS-DA and SVM models for the training set were both 98%, while for the test set, these values were 95%, 97%, respectively. Thus, it was suggested that the combination of oxygen isotope ratio (δ18O) and elemental profile with multi-step multivariate analysis is a promising approach for the verification of the considered three geographical origins of Chinese wines.
Copyright © 2018. Published by Elsevier B.V.

Entities:  

Keywords:  Elemental profile; Geographical origin; Isotope ratio (δ(18)O); Multivariate analysis; Oxygen

Mesh:

Substances:

Year:  2018        PMID: 29976396      PMCID: PMC9303025          DOI: 10.1016/j.jfda.2017.12.009

Source DB:  PubMed          Journal:  J Food Drug Anal            Impact factor:   6.157


1. Introduction

Many countries and geographical areas are well known for producing high quality wines, underpinning that the geographical origin of a wine is being of great commercial value [1]. After years of development, China has established several important and distinctive geographical origins for wine production such as the Changji, Mile, and Changli regions. However, numerous economical profit-driven fraudulent events have been reported in the Chinese wine industry concerning the false declaration of geographical origins, thereby significantly damaging the reputation of the Chinese wine market. Unfortunately, there is still no effective official method or national standard to verify the geographical origin of wine produced in China. As such, significant effort has been directed to develop a scientific method for the characterization of Chinese wines according to their geographical origin, which would be particularly beneficial to individual producers, to the Chinese wine industry and to control authentication. To date, studies concerning the geographical origins of wines produced in China focused mainly on multi-element analysis by inductively coupled plasma mass spectrometry (ICP-MS) coupled with appropriate chemometric techniques [2,3]. In addition, only a relatively low number of Chinese wine samples have been analyzed using site-specific natural isotopic fractionation-nuclear magnetic resonance (SNIF-NMR), isotope ratio mass spectrometry (IRMS) and 1H NMR techniques [4-6]. These previous investigations were laboratory researches with limited sample sizes in China. In addition, although various wine databases in different parts of the world have been established [7], these do not include Chinese wines so far. In the past few decades, a number of studies for assessing the geographical origin of wines have been reported abroad [8-11]. For example, the analysis of different isotope ratios has been employed, including 88Sr/86Sr [12-14] and 11B/10B [15] analyses using ICP-MS, and 13C/12C and δ18O analyses in wine ethanol and wine water, respectively, using IRMS [16-19]. Other approaches, such as Fourier transform infrared spectroscopy (FTIR), near infrared spectroscopy (NIR), and nuclear magnetic resonance (NMR), are also suitable [4,20-22]. To date, various studies have demonstrated that multi-elemental trace analysis methods based on ICP-MS are useful to verify wine samples from different origins in Romanian [23,24], Canada [25,26], South Africa [27-29], Spain [30-32], Germany [33,34], Portugal [35], New Zealand [36], the Czech Republic [37], Argentina [38], Portugal, and France [12]. Indeed, the verification of the different geographical origins of wines by multi-element multivariate analysis is demonstrated in Table 1.
Table 1

ICP-MS method presented in the literature for the verification of geographical origin of wines.

Elements employedChemometric methodsGeographical originReferences
Cr, Sr, Rb, Ni, Ag, Cu, Co, Be, V, Pb, Zn, MnPCARomanian[23]
As, Cd, Cs, Li, Te, Zr, Mo, Ni, Sb, Ti, U, Y, REEs, Nb, Rb, Be, Co, Ga, TI, WPCAGermany[34]
Al, Cd, V, Ba, Li, Ni, Co, Pb, SbPCA and LDASpain[31]
Cs, Ga, Ni, Pb, Rb, Sr, Cd, Co, MnLDANew Zealand[36]
Bi, Sb, Fe, Mo, Ni, As, Ba, Th, Cs, Cu, Rb, Al, B, Ti, TI, Br, Cd, Co, Se, Sr, P, Pb, U, Ag, Mn, Cl, V, Ca, Ce, Zn, La, Li, MgPCA and DACanada[26]
B, Nb, Se, Si, TI, U, Cs, Cl, Mn, Ga, Li, Sr, Ni, Ba, Rb, Sc, W, Mg, La, AlDASouth Africa[27]
REEs, Au, Pd, Sb, Zr, Ni, Pb, Co, Cu, Re, Ti, TI, Cd, Ga, Li, Pt, Rb, Sr, Te, V, W, Sn, Cs, As, Ba, BeSIMCASpain (Canary Islands)[32]
Zn, As, Ba, Co, Li, V, Ni, Sr, Pb, Mo, Rb, Cd, Cu, La, U, Bi, Th, Cs, CeMDSCanada[25]
Ca, Sr, Mg, Cs, V, Li, Rb, Zn, Co, Mn, B, Fe, PbDAGermany[33]
Cs, Ag, Z, Ba, Rb, Li, Cu, Cd, Al, Sb, As, V, Ni, Be, Sr, Ti, U, Pb, Co, CrPCA, HCA and FACzech Republic[37]
Al, REEs, Y, Hf, B, Sc, Sr, Co, Cr, Cs, Fe, Mn, Mo, Ca, Zn, W, V, U, Cd, Ni, Th, Ti, TI, Ga, Li, Cu, Rb, Nb, Be, As, Ba, Pb, SbQDAPortugal[35]
Tl, Li, Se, Rb, La, Ga, Cl, Sc, Nb, Cs, Mg, Al, U, Sr, Ba, W, B, Si, Mn, NiDASouth Africa[29]
Li, Be, V, Mn, Co, Ni, Cu, Ge, As, Rb, Sr, Mo, Cd, Ba, Hg, TI, Pb, BiPCA and LDAArgentina[38]
11B/10BSouth Africa[27]
87Sr/86SrPortugal and France[35]

PCA: Principal component analysis, HCA: hierarchical clustering analysis, LDA: linear discriminant analysis, DA: discriminant analysis, SIMCA: soft independent modeling class analogy, QDA: quadratic discriminant analysis, MDS: multidimensional scaling, FA: factor analysis.

A combination of chemical composition determination with subsequent multivariate data analysis can also be regarded as a profiling approach to classify products based on their origin, variety, or other properties [39]. The most common multivariate analysis methods for such verification purposes include principal component analysis (PCA) and hierarchical clustering analysis (HCA) for unsupervised grouping, and discriminant analysis (DA) to select important variables for discrimination [11]. Furthermore, we wish to investigate the non-linear support vector machines (SVM) model and the linear partial least squares discrimination analysis (PLS-DA) technique in our study, as these methods have rarely been used to authenticate the geographical origins of Chinese wines. In this study, we herein report 52 trace elements and oxygen isotope ratio (δ18O) of water in 188 wine samples collected from the Changji, Mile, and Changli regions in China were determined by ICP-MS, ICP-OES and IRMS, respectively. Multivariate statistical techniques were used to extract significant information relating to the geographical origin of the wine samples based on oxygen isotope ratio (δ18O) and elemental profile. Initially, analysis of variance (ANOVA) was employed to determine the key trace elements required to discriminate between different geographical origins. Subsequent multivariate analysis (PLS-DA and SVM) was then carried out for classification of the wine samples.

2. Materials and methods

2.1. Chemicals and reagents

All reagents used in this study were of analytical grade or better. The HNO3 and ethanol employed were of chromatographic grade (Sigma Aldrich, USA). Ultrapure water (resistivity, 18.2 MΩ cm) was obtained from a Milli-Q system from Millipore (Milford, MA, USA). The microwave digestion system was purchased from CEM MARS, USA. Prior to use, all glassware was washed by immersing in a 20% (v/v) HNO3 solution over 24 h, then rinsing four times with ultrapure water. The chemicals and regents used for IRMS analysis were described in detail in our previous work for the rapid determination of the stable oxygen isotope ratio (δ18O) of water in alcoholic beverages [40].

2.2. Wine samples

A total of 188 wine samples were collected from three distinctive wine-producing regions in China, namely the Changji (90 samples), Mile (41 samples), and Changli (57 samples) regions. Five grape varieties of red wines (Cabernet Sauvignon, Rose Honey, Merlot, Pinot Noir, Cabernet Gernischt) and four grape varieties of white wines (Long Yan, Chardonnay, Crystal, Riesling) were examined. All wine samples were stored at 4 °C and in the absence of light. To guarantee the authenticity of the samples, all wine samples were taken from different wine-producing manufacturers, collected from the vintage years of 2010, 2011, 2012, 2013 and 2014. Each wine sample was obtained from one variety only and all wine samples met the requirements of the executive national standard of China [41]. The wine samples grown in the three geographical origins are outlined in Table 2. Different wine varieties from the Changji and Mile regions were collected, which covered the major grape varieties in the different geographical origins.
Table 2

Wine samples measured in three geographical origins (Changli, Xinjiang and Yunnan).

Geographical originGrape variety

Cabernet SauvignonRieslingPinot NoirMerlotCabernet GernischetChardonnayLongyanCrystalRose honey
Chanli57
Changji517108104
Mile921911

2.3. Sample preparation for ICP-MS and ICP-OES analysis

The organic components in the wine samples were removed using a microwave digestion method. More specifically, the wine sample (5 mL) was transferred into the digestion tube and HNO3 (5 mL) was added. The resulting mixture was then transferred into the microwave and digestion was carried out according to the following temperature program: 120 °C for 5 min, 160 °C for 5 min, and 180 °C for 15 min. After cooling to 25 °C, the mixture was transferred into a 50 mL volumetric flask. The digestion tubes were rinsed four times with ultra-pure water, and this water was transferred into the volumetric flask and diluted with ultrapure water to reach the desired final volume of 50 mL. A blank calibration sample which was made of HNO3 and water (2:98, v/v) and a quality control sample were also prepared using the above procedure.

2.4. Sample preparation for IRMS analysis

The sample preparation procedures for IRMS analysis were described in detail in our previous work [40].

2.5. IRMS analysis

GC-P-IRMS system and analytical conditions for IRMS analysis were the same as our previous work [40].

2.6. ICP-MS and ICP-OES analysis

Experimental measurements were carried out using an iCAP-6300 ICP-OES spectrometer (Thermo Fisher Scientific, USA) and an iCAP Q ICP-MS instrument (Thermo Fisher Scientific, USA). Following the preparation of each wine sample, all measurements were carried out in triplicate and the results were averaged. Detailed instrumental settings regarding the optimized operating conditions for ICP-MS and ICP-OES analyses are presented in Tables 3 and 4.
Table 3

Optimized ICP-MS operating parameters for the determination of trace elements.

Instrument parametersCondition
RF power1500 W
Coolant gas0 L min−1
Carrier gas1.17 L min−1
Nebulizer pump0.1 rsp
Integration time per point0.1 s (As: 1s, Cd: 1s, Se: 1s)
Scanning modePeak jumping
Observation point/peak3
Sampling depth8.0 mm
Replicates measured3
Isotopes measured7Li, 9Be, 47Ti, 51V, 52Cr, 59Co, 60Ni, 63Cu, 69Ga, 72Ge, 75As, 82Se, 88Sr, 89Y, 90Zr, 93Nb, 95Mo, 111Cd, 120Sn, 121Sb, 125Te, 138Ba, 140Ce, 141Pr, 146Nd, 147Sm, 153Eu, 157Gd, 159Tb, 163Dy, 165Ho, 166Er , 169Tm, 172Yb, 175Lu, 205Tl, 206Pb, 209Bi
Table 4

Optimized ICP-OES operating parameters for the determination of trace elements.

Instrument parametersCondition
RF power1150 W
Plasma flow15 L/min
Coolant gas1.5 L/min
stabilization time15 s
Washing time between samples30 s
Observation time3 s
Sample uptake rate1.5 mL/min
Replicates measured3
Elements measured and wavelengths (nm)Mg (285.2), Al (396.1), Na (589.5), Si (251.6), P (213.6), K (769.8), Ca (317.9), Mn (257.6), Fe (259.9), Zn (206.2), Rb (780.0), B (249.7), P (178.7), Sr (421.5)
External calibrating solutions (0, 1, 2.5, 5, 10, and 20 μg/L) for ICP-MS analysis were prepared by combining the appropriate dilution of a 100 mg/L multi-element stock standard solution (Analytical grade, General Research Institute for Nonferrous Metals, Beijing, China) containing As, Be, Dy, Co, Er Cr, Ba, Cu, Nd, Li, V, Ni, Y, Cd, Pb, Ho, Sb, Tl, Sn, Yb, Sr, Ti, Ce, Gd, Pr, Eu, Tb, Tm, Lu, Sm, Ga, Zr, Nb, Te, and Bi, and a 1 g/L multi-element stock standard solution (Analytical grade, General Research Institute for Nonferrous Metals, Beijing, China) containing Ge, Mo, and Se. External calibrating solutions (0, 1, 2.5, 5, 10, and 25 mg/L) for ICP-OES analysis were prepared by combining the appropriate dilution of a 100 mg/L multi-element stock standard solution (Analytical grade, General Research Institute for Nonferrous Metals, Beijing, China) containing B, Al, Zn, Sr, Cu, Fe, Mg, and Mn, and single standard element solutions (100 mg/L) of P, Rb, Si, Ca, K, and Na (Analytical grade, General Research Institute for Nonferrous Metals, Beijing, China). Blank solutions and working standard solutions were prepared by adding 2% (v/v) HNO3 and 8% (v/v) ethanol. To correct for instrumental drift, In (2 μg/L) was used as an internal standard. Weighted regression of the calibration curves was employed to quantify the concentrations of trace elements. A total of 188 wine samples from three different regions of China were investigated by ICP-MS, ICP-OES and IRMS, respectively. Trace analyses were performed by these methods to determine elemental profile and oxygen isotope ratio (δ18O) of water in the wine samples.

2.7. Statistical analysis

To facilitate data processing, all wine samples belonging to the same region were assigned to following codes (CJ for Changji, ML for Mile, and CL for Changli). The geographical locations of the three different Chinese wine-growing regions investigated are herein shown in Fig. 1.
Fig. 1

The three Chinese wine-growing regions in a map.

Multivariate data analysis was then performed, where one-way ANOVA was employed to detect any significant differences between the wine samples grouped by their indicated origins [42]. Prior to the ANOVA test in which the level of significance was set at p < 0.05, Levene’s test and the Shapir-o–Wilk test were applied that a typical procedure required for performing ANOVA to assess the homogeneity and normality of variances. Fisher’s least significant difference test was also implemented to check for differences between the different groups in the data matrix. The key elements for determining the geographical origins of the samples were identified by ANOVA in a first step, and were employed as input variables for the subsequent PLS-DA and SVM analyses. The supervised PLS-DA and SVM methods were employed for classification analysis. Before all performed techniques, the data were standardized to zero mean and unit variance using R version 3.2.3 (R Core Team, Vienna, Austria) statistical software. Creation and evaluation of the classification model were then performed by using splitting data sets: (1) the training set (132 samples, 70%) for building classification model with internal cross validation, and (2) the test set (56 samples, 30%) for external validation. The stratified random function was used for data splitting by R software using the caret package. The SPSS software version 19.0 (SPSS Inc., Chicago, USA) and R software were used for statistical analysis. More specifically, ANOVA analysis was carried out using SPSS, while PLS-DA was carried out by R using the packages: mixOmics [43], mdatools [44], while SVM was performed using the e1071 package [45].

3. Results and discussion

3.1. Quality control

The quality control for IRMS analysis was consistent with our previous work [40]. For analysis by both ICP-MS and ICP-OES, a 3-fold standard deviation of 10 replicate tests of the blank solution was used to estimate the analytical limit of detection (LOD) for each element [38]. The LODs of all measured elements are shown in Table 5. As the concentrations of several elements (Zr, Nb, Te, Bi, and Ge) in all the samples were mostly below their LODs, these elements were not used for subsequent multivariate data analysis. Therefore, the final data matrix contains 188 rows (wine samples) and 48 columns (47 elements and δ18O) (adding up in 9024 data points). The accuracy and reproducibility of the two methods (i.e., ICP-MS and ICP-OES) were also assessed. More specifically, the accuracy was checked by performing recovery experiments, where external calibrating solutions (0.2, 0.4, and 0.8 mg/L) for ICP-OES and external calibrating solutions (2.5, 5, and 10 μg/L) for ICP-MS were added to the quality control sample. The spiked wine samples were then measured 10 times, and the recoveries of the various elements were calculated. As indicated in Table 5, the recoveries ranged from 78 to 120% for ICP-OES and ranged from 94 to 112% for ICP-MS. Repeatability was then determined by measuring the multi-element concentrations of multiple bottles of the same kind of wine. Two kinds of wine (red and white) were considered, with six bottles of each being obtained. Three bottles of each kind of wine were randomly selected, and these 6 bottles of wine were prepared for the repeatability experiments according to the procedure described in sample preparation section of this study. Each wine sample was analyzed in triplicate. The relative standard deviations (RSDs) of the elemental concentrations in red wine ranged from 6 to 10%, while the RSDs of the elemental concentrations in white wine ranged from 5 to 9%, thereby indicating that the analytical methods employed herein were sufficiently repeatable. As all wine samples were analyzed in a single analysis sequence, the interday repeatability was not examined specifically. However, to ensure the quality of the obtained data, a quality control sample and a blank calibration sample were tested every six samples and monitored carefully.
Table 5

Analytical characteristics (LOD, recovery) of the determination of elements of wines by ICP-OES and ICP-MS.

Analytical methodElementLOD (μg/L)Recovery (%)aElementLOD (μg/L)Recovery (%)a
ICP-MS7Li0.8120.5 ± 2118Sn1.7104.8 ± 2
9Be1.0120 ± 3125Te0.993.6 ± 3
47Ti6.8103.5 ± 5137Ba1.2103.7 ± 0.5
52Cr0.498.5 ± 5140Ce0.4104.5 ± 0.4
51V0.2115.7 ± 1141Pr1.8103.9 ± 2
53Cr7.2106.1 ± 2146Nd1.394.1 ± 1
59Co2.4105.4 ± 4147Sm0.4114 ± 2
60Ni1.698.4 ± 4153Eu1.8105.3 ± 3
121Sb8.4101.6 ± 3157Gd1.3107.5 ± 0.9
69Ga1.6106.1 ± 1159Tb0.4105.1 ± 0.8
72Ge0.493 ± 4163Dy1.8116.4 ± 2
75As0.2110.4 ± 3165Ho1.3104.9 ± 1
82Se5.899.4 ± 1166Er0.4103.9 ± 3
88Sr2.3112.6 ± 2169Tm1.8103.5 ± 0.7
89Y0.7106 ± 0.6172Yb1.3102.7 ± 2
90Zr2.6117.7 ± 2175Lu0.4104.6 ± 1
93Nb1.9109.6 ± 1205Tl1.896.6 ± 2
95Mo0.7109.8 ± 2206Pb1.3103.8 ± 4
111Cd1.9101 ± 1209Bi1.995.6 ± 2
ICP-OESNa (589.5)58.694.4 ± 1Mn (257.6)16.497.2 ± 2
Mg (285.2)32.8105.7 ± 3Fe (259.9)43.6111 ± 2
Al (396.1)16.297.9 ± 1Zn (206.2)24.4102.8 ± 3
Si (251.6)56.495.5 ± 2Rb (780.0)61.2100.3 ± 1
P (213.6)23.796.8 ± 0.7B (249.7)79.6101.7 ± 0.8
K (769.8)60.2106.8 ± 2Cu (327.3)19.1112.2 ± 2
Ca (317.9)47.194.7 ± 2Sr (421.5)13.296.4 ± 1

Mean ± standard deviation (n = 10).

3.2. Analysis of variance (ANOVA)

Following analysis of the various wine samples from three different regions within China using ICP-OES, ICP-MS and IRMS, ANOVA was employed to identify the key variables from 52 elements and δ18O that may differentiate between the different geographical origins of the wine samples [46]. Based on the ANOVA results, it was figured out that the wine samples from the CJ, ML, and CL regions could be discriminated by their respective Ca, Al, Mg, B, Fe, K, Rb, Mn, Na, P, Co, Ga, As, Sr contents and δ18O values. After application of the ANOVA test, these identified key elements were regarded as reliable descriptors or explanatory factors [47]. The δ18O has strong relationship with the location, which is consistent with the previous investigation of geographical origin for Romanian wines [48]. A list of elements previously described in the literature can be found in Table 1. Statistical analysis of Table 1 shows that elements identified herein were also used as indicative descriptors for geographical origins in a number of previous studies. As indicated in Table 6, Ca was found in three studies. The frequencies of the other descriptors (i.e., Al, Mg, B, Fe, Rb, Mn, P, Co, Ga, As, and Sr) found in previous studies were 6, 4, 6, 3, 12, 8, 1, 11, 6, 7, and 12. Furthermore, Table 1 only shows the ICP-MS method presented in the literature; however, as Na and K are usually determined by ICP-OES, these elements have been omitted from the table. Fig. 2 shows a box plot of the concentrations of these key elements found by our present study, where the bottom and top of the box are the first and third quartiles of the data for each geographical origin. From Fig. 2, it is apparent that it was not possible to use each individual element for the discrimination of the different geographical origins. Therefore it is necessary to establish an robust model to accurately discriminate between Chinese wines from different geographical regions.
Table 6

The frequencies of important elements used as descriptors in a number of previous studies.

ElementsCaAlMgBFeRbMnPCoGaAsSr
Frequencies364631281116712

ANOVA is not the only one method used in the previous literature to extract important elements for the discrimination of geographical origins, other approaches are also used.

Fig. 2

Box plot of the concentration of important elements Al (a), B (b), Ca (c), Fe (d), K (e), Mg (f), Mn (g), Na (h), P (i), Rb (j), Co (k), Ga (m), As (n), Sr (o), δ18O (p) for the discrimination of wines from three geographical origins (Changli (CL), Changji (CJ), Mile (ML)) in China.

3.3. Partial least squares discrimination analysis (PLS-DA)

The 14 mineral elements and δ18O extracted by ANOVA were used as the input variables for subsequent PLS-DA analysis to achieve accurate classification [42,49]. The Bayes method function was selected for the data analysis, as this function has previously been widely used in discrimination analysis. The lowest value of the predicted residual error sum of squares (PRESS) can be calculated by leave-one-out cross validation (LOOCV) and this is subsequently employed to determine the optimum number of latent variables used in the PLS-DA model. The latent variables can then be used for subsequent linear model generation. Based on the cross-validation rule, the first two latent variables (i.e., X-variate 1 and X-variate 2) were used to achieve the optimal prediction accuracy. Fig. 3 demonstrates the score plot of the first two X-variates for the entire data set located in the reduced 2-dimensional space. As indicated, the first two latent variables (X-variate 1: 27% and X-variate 2: 22%) accounted for 49% of the total variance in the raw data, allowing an obvious distinction to be observed between the wine samples from the three different origins. Indeed, PLS-DA is required to maximize the variance between groups rather than within groups, ultimately providing key criteria for selecting the important variables based on the model prediction ability. The importance of the variables generated from PLS-DA can be classified as follows: Mn > K > Co > P > Rb > Ca > Sr > δ18O > Fe > Mg > Na > Al > B > Ga > As. Several overlapped wine samples from the CJ and ML regions were also observed. A total of 8 wine samples fell outside the 95% confidence interval (Hotellings T2 ellipsis) of the respective classes. The reason for these outlier samples may be that the raw data of outlier samples was recorded by artificial errors. The raw data should be checked again. Although a number of samples could not be distinguished in this plane, the overall classification performance figure achieved by PLS-DA was applied for discrimination of the wine samples from different geographical origins in China, as outlined in Table 7, which shows the internal LOOCV classification results for the training set of PLS-DA model. The correct classification rates of the three regions (i.e., CJ, ML, and CL) for the training set were 100, 97, and 97%, respectively, while those for the test set used for external validation were 100, 92, and 94%, respectively. In addition, the average correct classification rates of the PLS-DA model for the training and test sets were 98 and 95%, respectively. From the PLS-DA classification results, it can be concluded that PLS-DA can be considered an efficient method for discriminating between the geographical origins of Chinese wines, although a few samples were misclassified.
Fig. 3

Score plot of X-variate 1 vs X-variate 2 for wine samples (188) from three regions (CL, CJ and ML) in China. Solid lines represent the 95% confidence interval (Hotellings T2 ellipsis).

Table 7

PLS-DA and SVM classification results of China regional wines.

Data setGeographical originPLS-DASVM


MSaP (%)bP (%)cMSaP (%)bP (%)c
Training setCJ (n = 61)01009829798
ML (n = 32)1970100
CL (n = 39)197197
Test setCJ (n = 26)01009519697
ML (n = 12)1920100
CL (n = 18)194194

Misclassified samples.

Percentage of samples correctly classified.

The average correct classified rates for the training set and the test set.

3.4. Support vector machine (SVM)

To test a second data evaluation method and verify the mis-classified samples from the PLS-DA method, the SVM algorithm was performed using the same training and test sets. Suitable kernel functions and parameters were selected for the SVM model to aid in establishing an optimized model with high stability and prediction accuracy [30,50]. In this study, the radial basis function (RBF) kernel with a Gaussian profile was chosen for non-linear SVM classification analysis [51]. Two of the most significant parameters for the RBF are C and γ, and so these parameters should be carefully considered. More specifically, C is the cost parameter, i.e., the penalty that can balance the empirical risk minimization and structural risk minimization principles. To a certain extent, C can determine the performance of the SVM model. In addition, γ is the band width of the RBF kernel, and determines the sensitivity of the SVM model. It also affects the function regression error, which plays a vital role in selecting the number of initial eigenvalues/eigenvectors for the SVM model. As such, an inappropriate γ value may lead to over-fitting [52]. To reduce overfitting and achieve the optimal error performance, a grid-search algorithm combined with 10-fold cross-validation was employed. In this case, the grid-search algorithm was based on an elaborate search in a restrained range, and reflected a two-dimensional minimization process. In this study, C was optimized in the range of 2(−3:3), while γ was optimized in the range of 2(−4:1), to give values of C = 0.5 and γ = 0.125 as optimal parameters for the SVM model. Finally, a SVM model has been developed that employs the RBF function to minimize the model error with the optimal parameters (i.e., C and γ) and the optimal hyper plane to differentiate between the different geographical origins of wines. Table 7 shows the internal 10-fold cross-validation classification results for the training set of SVM model. As indicated, the correct classification rates of the CL, CJ, and ML regions for the training set were 97, 97, and 100%, respectively, while those for the test set used for external validation were 94, 96, and 100%, respectively. As such, the average correct classification rate of the CL, CJ, and ML regions for the training set was 98%, while for the test set, the average correct classification rate was 97%. These results suggest that the SVM model is an effective approach for discriminating between different regional wine samples.

3.5. Joint analysis of PLS-DA and SVM

As both PLS-DA and SVM are conducted using the same data set, the misclassified samples for both methods are comparable, and the correct classification performance of the SVM model was found to be similar to that of the PLS-DA model. In addition, the average rate of correct classification for both models was >95%. We therefore propose that the joint data analysis of PLS-DA and SVM techniques could be used to successfully verify the geographical origins of wine samples. The workflow chart of this joint data analysis is shown in Fig. 4, where the prediction classification results of Chinese wine based on their geographical origin were obtained by PLS-DA in the first step, prior to use of the SVM model in a second data evaluation step. As such, the prediction results from the PLS-DA and SVM models can be used to verify one another, thus if any wine samples are misclassified by both the PLS-DA and SVM models, these samples require further investigation. Therefore, the comprehensive classification results, obtained using the described workflow chart, suggest that joint data analysis using these methods may be preferable to determine the authenticity of the three claimed geographical origins (i.e., CJ, ML, and CL) of Chinese wines.
Fig. 4

The workflow chart of joint analysis of PLS-DA and SVM.

3.6. Comparing with the results of previous study

The work of Thiel et al. [34], 88 German commercial wines collected from four regions were analyzed for their concentration of 33 elements by ICP-OES and ICP-MS. The important discriminative elements (Li, Mg, Pb, W, Y, Cs, Sr, As, Si, Ti, B, Be, Sn) for distinguishing geographical origins of German white wines were identified by Plackett-Burman design and stepwise discriminant analysis. In our work, 52 elements were determined by ICP-OES and ICP-MS, oxygen isotope ratio (δ18O) was also determined by IRMS. Even though more wine samples collected from three regions were investigated in our study, better classification rates (>95%) were achieved than the work of Thiel et al. [34]. The latest work of Kokkinofta et al. [53], only four grape varieties (including red and white wines) from two regions were used for the geographic discrimination of Cypriot wines. Mg, Na, P, K, Ca, Mn, Fe, Zn, B, and Cu at mg/L were determined by inductively coupled plasma atomic emission spectroscopy (ICP-AES). The trace elements at ug/L were not tested in their study. Some important elements could contribute to the discrimination of wine regions might be missed. In our work, a total of 9 grape varieties (including red and white wines) from three regions were considered. 14 elements at mg/L and 38 elements at ug/L were determined. Therefore more grape varieties in three geographical origins and more mineral elements in our Chinese wine samples were analyzed. SNIF-NMR which was used by Kokkinofta et al. [53], was relative expensive equipment. Therefore it was not possible for every winery in China to use this kind of instrument. The work of Geană et al. [13], 87Sr/86Sr isotope ratio, Ga and Al, were identified as the key indicators for geographic differentiation of the Romanian wines. In our present work, Mn, K and Co which obtained from PLS-DA model, were the most significant descriptors for discriminating Chinese wine regions. Meanwhile, Mg, Al, Na, P, K, Ca, Mn, Fe, Zn, Rb, B, and P which were determined by ICP-OES, are highly abundant elements in Chinese wine. The key important elements in our present study for geographic differentiation also included these elements. The same finding were also confirmed by Geană. [13]. Highly abundant elements could contribute to the discrimination of wine regions. Therefore ICP-OES method appears to be advantageous by the fact that it is a simple and low-cost technique. The work of Geană. [13], stepwise linear discriminant analysis (LDA) was used to identify key elements for classification of the wine samples, however, in our work PLS-DA was employed. The methods used to extract key elements for Romanian wines in the work of Geană. [13] and Chinese wines in our work were totally different. The work of Geană. [13], only red wines collected from protected designation of origin (PDO) were investigated. Similarly, only white wines could be classified in the work of Thiel et al. [34]. In our work, both red and white wines were covered for the discrimination of Chinese regional wines. The recent work of Dinca et al. [48], C and O stable isotope ratios and ten trace elements (Co, V, Zn, Mn, Cu, Cr, Ni, Pb, Sr, and Rb) was used for the discrimination of Romanian regional wines. Only ANOVA and DA were used for multivariate data analysis. Misclassified Romanian wine samples in their study and previous work of Geană. [13], no second data evaluation method was employed. On the other hand, only ten trace elements and two stable isotope ratios were used and 100 wine samples collected from five regions were used in the study by Dinca et al. [48]. In our work 14 mineral elements from 52 elements and δ18O were extracted by ANOVA to the analysis of the discrimination of 188 wine samples collected from three geographical origins. With large wine sample sizes and more elements used for the multivariate data analysis, better classification rates (>95%) of Chinese regional wines could still be achieved in our study. In addition, in our work SVM was employed as a second data evaluation method for those misclassified samples from the PLS-DA method. The prediction results from the PLS-DA and SVM models can be used to verify against one another. Therefore our workflow for discriminating geographical origins of wines was more accurate, robust and rigorous than the previous work. Our results also indicate that the variables employed for discrimination, the selection of suitable multivariate methods and high-quality data are imperative for achieving accurate and reliable classification results. The same viewpoint is supported by both Cozzolino et al. [20] and Villagra et al. [54].

4. Conclusion

To the best of our knowledge, this is the first work to establish elemental profile and oxygen isotope ratio (δ18O) database for Chinese wines. Herein we also reported first time an innovative approach by the application of joint analysis of PLS-DA and SVM models for the successful classification of Chinese regional wines. A number of important descriptors for indicating geographical origins were extracted by ANOVA, and adequate classification rates (>95%) were obtained by both PLS-DA and SVM models. It was demonstrated that the combination of oxygen isotope ratio (δ18O) and elemental profile with multi-step multivariate analysis is a very promising tool to verify the authenticity of Chinese wines from the Changji, Mile, and Changli regions. However, it needs to be underpinned that the research reported herein represents only a starting point for a possible strategy to control the quality and authenticity of Chinese wines. For setting up a profound system for inspecting and controlling the correct labeling of the geographical origin of Chinese wines, additional samples with different vintages from the three investigated regions should be analyzed and samples originating from other geographical origins in China or abroad should be also collected, analyzed, and taken into the statistical model appropriately in the further. However, the data collection methods and the demonstrated approaches for data evaluation proposed in this study are considered to be essential to provide a solid backbone for the establishment of a Chinese wine database.
  22 in total

1.  Multi-element analysis of South African wines by ICP-MS and their classification according to geographical origin.

Authors:  Paul P Coetzee; Francois E Steffens; Riette J Eiselen; Ockert P Augustyn; Lieve Balcaen; Frank Vanhaecke
Journal:  J Agric Food Chem       Date:  2005-06-29       Impact factor: 5.279

2.  Influence of winemaking practices on the concentration of rare earth elements in white wines studied by inductively coupled plasma mass spectrometry.

Authors:  Eugenio C Rossano; Zoltán Szilágyi; Antonio Malorni; Gabriella Pocsfalvi
Journal:  J Agric Food Chem       Date:  2007-01-24       Impact factor: 5.279

3.  Internal standardization--atomic spectrometry and geographical pattern recognition techniques for the multielement analysis and classification of Catalonian red wines.

Authors:  Mònica Iglesias; Emili Besalú; Enriqueta Anticó
Journal:  J Agric Food Chem       Date:  2007-01-24       Impact factor: 5.279

4.  Usefulness of chemometrics and mass spectrometry-based electronic nose to classify Australian white wines by their varietal origin.

Authors:  Daniel Cozzolino; Heather E Smyth; Wies Cynkar; Robert G Dambergs; Mark Gishen
Journal:  Talanta       Date:  2005-10-03       Impact factor: 6.057

5.  Determination of the geographical origin of Brazilian wines by isotope and mineral analysis.

Authors:  S V Dutra; L Adami; A R Marcon; G J Carnieli; C A Roani; F R Spinelli; S Leonardelli; C Ducatti; M Z Moreira; R Vanderlinde
Journal:  Anal Bioanal Chem       Date:  2011-07-01       Impact factor: 4.142

6.  Geographical origin identification of Romanian wines by ICP-MS elemental analysis.

Authors:  Irina Geana; Andreea Iordache; Roxana Ionete; Adrian Marinescu; Aurora Ranca; Monica Culea
Journal:  Food Chem       Date:  2012-12-05       Impact factor: 7.514

7.  Targeted and nontargeted wine analysis by (1)h NMR spectroscopy combined with multivariate statistical analysis. Differentiation of important parameters: grape variety, geographical origin, year of vintage.

Authors:  Rolf Godelmann; Fang Fang; Eberhard Humpfer; Birk Schütz; Melanie Bansbach; Hartmut Schäfer; Manfred Spraul
Journal:  J Agric Food Chem       Date:  2013-05-29       Impact factor: 5.279

8.  Multi-element analysis of wines by ICP-MS and ICP-OES and their classification according to geographical origin in Slovenia.

Authors:  Vid S Selih; Martin Sala; Viktor Drgan
Journal:  Food Chem       Date:  2014-01-03       Impact factor: 7.514

9.  [Application of ICP-MS to the detection of forty elements in wine].

Authors:  Yu-kui Rui; Qing-quan Yu; Yin-hua Jin; Jing Guo; Yun-bo Luo
Journal:  Guang Pu Xue Yu Guang Pu Fen Xi       Date:  2007-05       Impact factor: 0.589

10.  Classification of aniseed drinks by means of cluster, linear discriminant analysis and soft independent modelling of class analogy based on their Zn, B, Fe, Mg, Ca, Na and Si content.

Authors:  J M Jurado; A Alcázar; F Pablos; M J Martín; A G González
Journal:  Talanta       Date:  2005-02-19       Impact factor: 6.057

View more
  5 in total

Review 1.  Selected Instrumental Techniques Applied in Food and Feed: Quality, Safety and Adulteration Analysis.

Authors:  Graciela Artavia; Carolina Cortés-Herrera; Fabio Granados-Chinchilla
Journal:  Foods       Date:  2021-05-13

2.  Geographic classification of U.S. Washington State wines using elemental and water isotope composition.

Authors:  Shirley Orellana; Anne M Johansen; Carey Gazis
Journal:  Food Chem X       Date:  2019-02-12

3.  Application of Spectroscopic UV-Vis and FT-IR Screening Techniques Coupled with Multivariate Statistical Analysis for Red Wine Authentication: Varietal and Vintage Year Discrimination.

Authors:  Elisabeta-Irina Geană; Corina Teodora Ciucure; Constantin Apetrei; Victoria Artem
Journal:  Molecules       Date:  2019-11-17       Impact factor: 4.411

4.  The Use of Macro, Micro, and Trace Elemental Profiles to Differentiate Commercial Single Vineyard Pinot noir Wines at a Sub-Regional Level.

Authors:  Courtney K Tanabe; Jenny Nelson; Roger B Boulton; Susan E Ebeler; Helene Hopfer
Journal:  Molecules       Date:  2020-05-30       Impact factor: 4.411

5.  From Soil to Grape and Wine: Geographical Variations in Elemental Profiles in Different Chinese Regions.

Authors:  Xiaoyun Hao; Feifei Gao; Hao Wu; Yangbo Song; Liang Zhang; Hua Li; Hua Wang
Journal:  Foods       Date:  2021-12-15
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.