Literature DB >> 31827484

Hand-Held Near-Infrared Spectroscopy for Authentication of Fengdous and Quantitative Analysis of Mulberry Fruits.

Hui Yan1, Yi-Chao Xu1, Heinz W Siesler2, Bang-Xing Han3, Guo-Zheng Zhang1.   

Abstract

Recently, miniaturization of Raman, mid-infrared (MIR) and near-infrared (NIR) spectrometers have made substantial progress, and marketing companies predict this segment of instrumentation a significant growth rate within the next few years. This increase will be based on a more frequent implementation for industrial quality and process control and a broader adoption of spectrometers for in-the-field testing, on-site measurements, and every-day-life consumer applications. The reduction in size, however, must not lead to compromises in measurement performance and the hand-held instrumentation will only have a real impact if spectra of comparable quality to laboratory spectrometers can be obtained. The present communication will, on the one hand, explain the instrumental reasons why NIR spectroscopy is presently the most advanced technique regarding miniaturization and on the other hand, it will emphasize the impact of NIR spectroscopy for plant analysis by discussing in some detail a qualitative and a quantitative application example.
Copyright © 2019 Yan, Xu, Siesler, Han and Zhang.

Entities:  

Keywords:  authentication of fengdous; hand-held spectrometers; instrumentation; near-infrared (NIR); nutritional parameters of mulberry fruits; qualitative and quantitative analysis

Year:  2019        PMID: 31827484      PMCID: PMC6890835          DOI: 10.3389/fpls.2019.01548

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Miniaturization of vibrational spectrometers started more than two decades ago, but only within the last decade real hand-held Raman, MIR and NIR scanning spectrometers have become commercially available and have been utilized for a broad range of analytical applications (Sorak et al., 2012; Guillemain et al., 2017; Crocombe, 2018; Karunathilaka et al., 2018; Soriano-Disla et al., 2018; Vargas Jentzsch et al., 2018). While the weight of the majority of Raman and MIR spectrometers is still in the s1 kg range, the miniaturization of NIR spectrometers has advanced down to the ∼100 g level and developments are underway to integrate them into mobile phones (Tino et al., 2016). Furthermore, miniaturized NIR systems have recently reached the <1,000 US$ level. Therefore, only the acquisition of NIR systems can be taken into consideration for private use whereas hand-held Raman and MIR spectrometers will be restricted to industrial, military or homeland security applications and public use by first responders, customs or environmental institutions. Because of the substantial progress in the miniaturization of near-infrared spectrometers in combination with a drastic cost reduction, marketing experts predict this type of instrumentation a significant growth rate. These trends have made hand-held NIR spectroscopy also attractive for everyday life consumer applications of a new, non-expert user community ranging from food testing to the detection of fraud and adulteration in a broad area of materials. Notwithstanding this wide-spread application range of hand-held NIR spectroscopy, the focus of this communication will be for plant analytical aspects only. The discussion of a qualitative and a quantitative analytical problem shall serve as examples, to demonstrate the vital role that hand-held NIR spectroscopy will play in the near future for plant analysis. Before these selected qualitative and quantitative case studies are discussed, however, an overview of the various instrumental features of the most frequently used hand-held NIR spectrometers will be given.

Instrumentation

The recent progress in miniaturization of hand-held NIR spectrometers has taken advantage of new micro-technologies such as MEMS (Micro-Electro-Mechanical Systems), MOEMS (Micro-Opto-Electro- Mechanical Systems), DMD™ (digital mirror device), or LVFs (Linear Variable Filters) and has led to a drastic reduction of spectrometer size (the weight of the spectrometers discussed in this communication varies between 100 and 200 g) while allowing excellent performance due to the high-precision implementation of essential elements in the final device (Wolffenbuttel, 2005). High-volume manufacturability will further reduce costs and thereby contribute towards broader dissemination of such instruments. In what follows the specific instrumental features of four different hand-held NIR spectrometers will be shortly outlined. Based on the type of detector, the hand-held NIR spectrometers can be classified in the two categories of array-detector and single-detector instruments (Wolffenbuttel, 2005). Probably the first commercial, real hand-held NIR spectrometer (VIAVI MicroNIR 1700 (formerly JDSU), Santa Rosa, CA, USA) has an array detector that covers the wavelength range from 908 to 1,676 nm and uses an LVF as a monochromator. It has so far been used for a multiplicity of applications ranging from authentication of seafood and determination of food nutrients to the analysis of hydrocarbon contaminants in soil and authentication and quantitative determination of pharmaceutical drugs (Altinpinar et al., 2013; O’Brien et al., 2013; Jantra et al., 2017; Yan and Siesler, 2018b). However, compared to an array detector, the price for a single detector is much lower, and in an attempt to further reduce the hardware costs, new developments focus on systems with single detectors. Thus, the DLP NIRscan Nano EVM (Dallas, TX, USA), for example, is based on Texas Instruments’ DMD™ in combination with a grating and a single-element detector and also covers the wavelength range from 900 to 1,701 nm. Very recently a MEMS-based FT-NIR instrument, that contains a single-chip Michelson interferometer with a monolithic opto-electro-mechanical structure has been introduced by Si-Ware Systems (Cairo, Egypt). Contrary to most of the other handheld spectrometers, this instrument can scan FT-NIR spectra over the extended range from 1,298 to 2,606 nm. Finally, Spectral Engines (Helsinki, Finland) developed miniaturized NIR spectrometers, that are based on a tunable Fabry-Perot interferometer. In order to cover the NIR wavelength region 1,350–2,450 nm, however, four spectrometers are required. The schematic principles of the different monochromator designs of the described NIR spectrometers are summarized in .
Figure 1

The optical schemes of hand-held NIR spectrometers based on different monochromator principles (A) VIAVI MicroNIR 1700, linear variable filter; (B) DLP NIRscan Nano EVM with Texas Instruments´ digital micromirror device (DMD™); (C) Si-Ware Systems, MEMS-based FT-NIR spectrometer; (D) Spectral Engines NIR spectrometer with tunable Fabry-Perot interferometer.

The optical schemes of hand-held NIR spectrometers based on different monochromator principles (A) VIAVI MicroNIR 1700, linear variable filter; (B) DLP NIRscan Nano EVM with Texas Instruments´ digital micromirror device (DMD™); (C) Si-Ware Systems, MEMS-based FT-NIR spectrometer; (D) Spectral Engines NIR spectrometer with tunable Fabry-Perot interferometer.

Applications

Although the NIR technique is usually applied for a broad range of industrial material quality and control applications (Grassi et al., 2018; Piao et al., 2018; Silva et al., 2018; Yan and Siesler, 2018a; Yan and Siesler, 2018b), the present communication is targeted at practical, everyday life applications in order to attract the attention of a prospective non-expert user community. These days, qualitative and quantitative analysis is more than ever needed also by ordinary people. Because both fraud and adulteration are widely spread, and public health awareness has grown strongly over the last years, the control of nutritional parameters of everyday life food and pharmaceuticals has become an important issue. Therefore, the progress in miniaturization and increasing affordability of hand-held NIR spectrometers make them an attractive tool to fight the above evils efficiently in the public domain. To demonstrate the potential of hand-held NIR spectrometers for plant analysis, a qualitative and a quantitative application example will be presented here.

Identification of Fengdous

In China, the stem of the Dendrobium is processed into a fengdou (), that is considered a convenient dosage form of not only a valuable health-care food but also Chinese traditional medicine (TCM) with efficacy in liver protection, treatment of pharyngitis and many other diseases (Chen and Guo, 2001). Fengdou processed from Dendrobium officinale Kimura et Migo (DOK) have not only high medicinal value but are also in short supply, and, are very expensive. Therefore, it would be desirable to discriminate them from fengdou based on Dendrobium devonianum Paxt (DDP) with lower efficacy and correspondingly much lower (1/4–1/5) price. However, this is not possible by visual inspection only ().
Figure 2

Schematic diagram of the fengdou processing.

Schematic diagram of the fengdou processing. Because of the high public interest, an analytical method based on hand-held NIR spectroscopy with the DLP NIRscan Nano EVM system in combination with a partial least squares discriminant analysis (PLS-DA) evaluation method was developed, to rapidly discriminate fengdou processed either from DOK or DDP.

Materials and Methods

Samples

A total of 468 fengdou samples based on DOK (288) and DDP (180) were collected from Luosiwan (Yunnan, China), and the calibration and validation sets were randomly distributed at a ratio of 2:1.

Measurement of Spectra

NIR spectra were collected with the DLP NIRScan Nano EVM spectrometer by accumulating 32 scans in the wavelength range of 909–1,649 nm (209 wavelength variables) in approximately 7 s. After each measurement, the sample was rotated for approximately 120°, and the average of three spectra was then used as the final raw spectrum (). A certified reflection standard (Labsphere, North Sutton, NA, USA) was used to measure the reference spectrum.
Figure 3

The sample presentation to record NIR spectra of fengdous.

The sample presentation to record NIR spectra of fengdous.

Evaluation of Spectra

Spectral Pretreatment
Due to the fact that NIR spectra frequently contain interferences of background information, drift, and noise, the raw NIR spectra were subjected to spectral preprocessing. For this purpose, the first derivative based on a Savitzky Golay smoothing procedure with a five data point window and a 2nd order polynomial followed by a standard normal variate (SNV) transformation as a scatter correction was used.
Competitive Adaptive Reweighted Sampling
In NIR spectroscopy, the spectral information is not evenly distributed over the whole wavelength range under investigation. Some data may be superimposed by noise or contain irrelevant information, that can decrease the performance of the calibration models. Therefore, the selection of the informative variables is a significant preprocessing step (Li et al., 2009; Li et al., 2013; Yun et al., 2013). In this work, the competitive adaptive reweighted sampling (CARS), based on the simple but effective principle “survival of the fittest” was applied to select the optimal combinations of spectral variables (Zhang et al., 2015). Compared to the moving window algorithm and Monte Carlo uninformative variable elimination procedure, CARS shows a strong capability of increasing the predictive accuracy (Li et al., 2009). For the present analysis, the CARS was run by the libPLS toolbox (www.libpls.net) based on the best combination of pretreated spectra.

PLSDA Analysis

The informative spectral variables determined by CARS were used to develop classification models with the PLS-DA. PLS-DA is a linear classification method that is based on the well-known partial least-squares (PLS) regression. In this work, the leave-one-out (LOO) method was applied to obtain the optimal number of latent variables (LVs) of each model, and the LVs with the lowest root mean square error (RMSE) of cross-validation set (RMSECV) were employed to establish the PLS-DA classification model. The indices of class accuracy, which are described in the following equation, were calculated to evaluate the performance of each classification model. The higher the accuracy values, the better the predictive ability of the classification model: All calculations were performed in MATLAB environment (R2009, Mathworks, Natick, MA, USA) and PLS-DA models were built using the “PLS Toolbox 6.21” from Eigenvector Research (Manson, WA, USA).

Results and Discussion

NIR Spectra

In the raw NIR spectra of the fengdou calibration set, the mean spectra of all DOK and DDP calibration samples, and the spectra of after the different pretreatment steps are shown. As can be seen from the pretreated NIR spectra in , the 1st derivative eliminates most of the baseline shift, whereas the SNV is applied for the scatter correction. The bands at 981 nm, 1,199 nm, and 1,450 nm can be assigned to the 2nd overtones of the N‒H, C‒H and O‒H stretching vibrations, respectively, while the band at 1,568 nm is the 1st overtone of the N‒H stretching vibration.
Figure 4

The raw NIR spectra of the fengdou calibration set (A), the mean spectra of the DOK and DDP calibration samples (B), the NIR spectra after pretreatment by the 1st derivative (C), and the NIR spectra pretreated by the 1st derivative and subsequent SNV (D).

The raw NIR spectra of the fengdou calibration set (A), the mean spectra of the DOK and DDP calibration samples (B), the NIR spectra after pretreatment by the 1st derivative (C), and the NIR spectra pretreated by the 1st derivative and subsequent SNV (D). The diagrams of the wavelength optimization variable screening are shown in . As the number of sampling operations increase, the number of selected wavelength variables decreases first gradually, and then quickly. It embodies the algorithm’s ability of an initial rough selection followed by a fine-tuning (). The gradual zone for the RMSECV screening process indicates that wavelength variables irrelevant to the type of fengdou were removed, and the growth zone indicates that the essential variables relating to the type of fengdou were excluded. Finally, the trend of the regression coefficient of each wavelength variable in the screening process was achieved. The position of "*" in the figure corresponds to the minimum value of the RMSECV (). The 65 selected variables, finally selected for the calibration procedure, are shown in .
Figure 5

Wavelength-variable screening by CARS: (A) number of sampled variables versus number of sampling runs; (B) RMSECV versus number of sampling runs; (C) regression coefficients path versus number of sampling runs; (D) wavelength variables selected by CARS.

Wavelength-variable screening by CARS: (A) number of sampled variables versus number of sampling runs; (B) RMSECV versus number of sampling runs; (C) regression coefficients path versus number of sampling runs; (D) wavelength variables selected by CARS.

Identification of DOK

After spectral pretreatment by 1st derivative, SNV and mean centering, the CARS wavelength optimization algorithm was used to filter out the wavelengths with high information, and then the optimized wavelength variables were used to develop a classification model with the PLS-DA method. The results showed that for the calibration, cross-validation and prediction sets the accuracy is 93.9%, 89.6%, and 84.1%, respectively (). As shown by the blue dots (calibration set) and the red dots (test set) in this graph, the samples clearly cluster in two categories and can be readily discriminated. Furthermore, the probabilities of being identified as DOK were calculated and summarized in . For the majority of samples, the probability was 1 or 0, which means that these samples were either DOK or DDP. Probability values >0.5 or <0.5 refer to DOK or DDP, respectively.
Figure 6

Classification results of the PLS-DA method: (A) DOK classification results of the calibration and test set, (B) classification probability of DOK of the calibration samples.

Classification results of the PLS-DA method: (A) DOK classification results of the calibration and test set, (B) classification probability of DOK of the calibration samples. Sensitivity and specificity are statistical measures of the performance of a binary classification test and are very important for qualitative analysis. Sensitivity (also called the true positive rate) measures the proportion of actual positives that are correctly identified as such. Specificity (also called the true negative rate), on the other hand, measures the proportion of actual negatives that are correctly identified. In this study, for the calibration set, cross-validation set, and test set, the sensitivities are 0.927, 0.875, 0.896, and the corresponding specificities are 0.950, 0.917, 0.783, respectively. The sensitivity and specificity derived from the PLS-DA model for the test set samples are represented in . In , the threshold value used to classify the DOK is drawn as a dashed line. With the increase of the threshold value, the specificity increases, i.e., the number of false-positives DECREASES. Likewise, a sensitivity decrease represents the INCREASE of the false-negatives. With the receiver operator characteristic curve (ROC) graph in similar information is provided in a different format. The presented results, clearly demonstrate that handheld spectroscopy, combined with CARS-PLS-DA data evaluation, can be utilized for the rapid discrimination of fengdous produced from DOK or DDP.
Figure 7

Specificity and sensitivity of the calibration model: (A) predicted ROC, (B) predicted responses.

Specificity and sensitivity of the calibration model: (A) predicted ROC, (B) predicted responses.

Quantitative Analysis of Mulberry Fruits

The mulberry fruits have a bumpy surface, and because of the fruits’ tightly-packed and seed-bearing ovaries, they have a superficial resemblance to blackberries (Huang et al., 2011). The mulberry fruits are eaten, mostly unprocessed, in their fresh state. As traditional Chinese medicine, the fresh mulberry fruit is used in the treatment of sore throats, fever, hypertension, and anemia (Kamiloglu et al., 2013); they are also used widely in the production of jams, pies, tarts, marmalades, juices, wines, and liquors, natural dyes and in the pharmaceutical, food and cosmetic industry (Huang et al., 2011; Khalifa et al., 2018). Mulberry fruits contain high nutrient and bioactive contents, including soluble solids content (SSC), polyphenols, flavonoids, ascorbic acid, fatty acids, minerals, and anthocyanin (Lou et al., 2012). The SSC and dry matter content (DMC) are closely related to senses and nutrition. polyphenols and flavonoids (contained in polyphenols) have many pharmacological effects. polyphenols are naturally secreting, and biologically active substances and a wide range of polyphenols are provided by mulberry fruits such as flavanols, phenolic acids, derivatives, and anthocyanins. Polyphenols show activities of antioxidant, detoxification, induction of apoptosis, antiangiogenic and antiproliferation, and so on (Khalifa et al., 2018). Polyphenols in mulberry fruits and their corresponding functionalities vary considerably according to the genetic diversity, climatic, agricultural practices, processing conditions, and stability during storage (Khalifa et al., 2018). Flavonoids are found mostly in glycosylated form, and they have complex flavonol glycosides profiles including 13 quercetin derivatives, five kaempferol derivatives, and O-methylated flavonol-analogs, such as rhamnetin and isorhamnetin. Levels of quercetin glycoside are reported to increase as the fruit ripens from white to black stages (Sánchez-Salcedo et al., 2015). The flavonoids variation in different breeds of mulberries is significant (Sánchez-Salcedo et al., 2015). Fruit quality has traditionally been determined by visual inspection of the external appearance and its internal content determined by destructive methods, which require operators with the expertise to perform the analysis in a professional laboratory. However, this is impractical for routine analysis by ordinary people. In recent times, consumers have grown conscious of the health benefits of the ingredients of this fruit and a new approach to determine their concentrations is required. In this context it has been reported, that NIR spectroscopy can be used to nondestructively analyze the internal contents, including the SSC, DMC, and total polyphenol content (TPC) of apples (Pissard et al., 2012). Furthermore, Chen et al. employed FT-NIR spectroscopy to determine the TPC in green tea (Chen et al., 2008). In view of this prior knowledge, the demand for a new analytical procedure of mulberry fruits, that will require little to no training originated. In the present work, this issue is addressed by applying the hand-held NIR spectrometer MicroNIR 1700 for a feasibility study of the fast determination of SSC, DMC, polyphenols, and flavonoids in fresh mulberry fruits. The mulberry varieties applied in this work are Zhongmu 1, 8632, Mengchang 4, and Dashi. A total of 434 mulberry fruits (6–9 maturity) were collected from the conservation of mulberry germplasm resources of the Institute of Sericulture, Chinese Academy of Agricultural Sciences (Zhenjiang, Jiangsu, China).

Measurement of NIR Spectra

As shown in , NIR diffuse reflection spectra of mulberry fruits were collected with the MicroNIR 1700 spectrometer by accumulating 50 scans with an integration time of 15 ms, and 125 wavelength variables in the range from 908 to 1,676 nm. Triplicate measurements were made at different spots, and the average of the three spectra was used as the final spectrum of the sample for further processing. The measurements were performed at an environmental temperature of 25 °C and a humidity of about 40%.
Figure 8

Presentation of the mulberry fruit for NIR spectra measurement with the MicroNIR 1700.

Presentation of the mulberry fruit for NIR spectra measurement with the MicroNIR 1700.

Reference Analysis

Determination of Soluble Solids Content
After collection of the NIR spectra, the SSC was determined immediately by a refractometer. First, the equipment was calibrated to zero with distilled water, then the detection surface was dried, and then a few drops of mulberry fruit juice were applied to the detection surface. The juice drops were spread on the prism surface by gently closing the cover of the refractometer, and the corresponding refractive index value was taken.
Determination of Dry Matter Content
The DMC was obtained by measuring the weight percentage of the dried fruit against the corresponding value of the fresh fruit. The weight of the fresh mulberry fruit was measured as m1, and then the fruit was dried at 65 °C for 24 h and finally dried to constant weight m2 at 105 °C. The DMC was calculated as DMC (%) = (m2/m1) × 100 (%).
Determination of Total Polyphenol Content
The TPC of mulberry fruit was determined by the Folin–Ciocalteau method (Yu and Dahlgren, 2000).
Determination of Total Flavonoid Content
The content of total flavonoids content (TFC) in the investigated mulberry fruits was measured by colorimetry (Marinova et al., 2005). The standard normal variate (SNV) transformation and the 1st derivative based on a Savitzky Golay smoothing procedure with a five data point window and a 2nd order polynomial were applied.
Wavelength Optimization
In this work, two kinds of wavelength selection methods have been applied: genetic algorithm (GA) and CARS. GA is an adaptive search procedure based on the mechanism of genetics and natural selection (Shao et al., 2004; Yan et al., 2011). At first, the GA algorithm randomly generates a population (each individual in the population represents a way of solving the problem) that is composed of a binary string (called chromosome). The bit value “1” represents a selected variable whereas “0” is a variable that is not selected. The fitness of an individual (its ability to adapt to the environment) is calculated; high-quality individuals are retained, low-quality individuals are out. New individuals are generated through inheritance and evolved through natural selection. In this way, eventually, the solution of the problem is achieved. In the present work, the parameters chromosomes 30, mutation 1% and cross-over 50% were adopted in the GA to optimize the variables. The principle of the CARS technique has been described for the previous application example and will not be repeated here.
PLS Calibration
PLS calibration was developed using the PLS toolbox (version 6.21, Eigenvector Inc., Manson, WA, USA), and internal cross-validation (CV) was used to select the optimum number of factors. CV estimated the prediction error by splitting all samples into 20 segments, and one segment was reserved for validation, and the remaining (Næs et al., 2002) segments were used for calibration. This process was repeated until all segments were used for validation once.
Calibrations and Validation Statistics
Calibration and validation statistics included the RMSEof calibration set (RMSEC), RMSECV and RMSE of prediction set (RMSEP) and R-squares (Fan et al., 2016). The RMSEC, RMSECV, and RMSEP were used to evaluate the feasibility of the model and its predictive ability. The lower the RMSEP and the closer its value to the RMSEC, the stronger is the prediction ability, and the greater is the robustness of the model. The residual predictive deviation (RPD) defined by the Std Dev/RMSEC of the calibration set was also included to estimate how well the calibration model can predict the compositional data. Generally, an RPD value greater than three can be considered as very good for prediction purposes (Fearn, 2002).

Validation With Unknown Samples

Unknown mulberry fruit samples were collected as a test set to validate the prediction capability of the calibration models developed for SSC, DMC, TPC, and TFC.

Results and Discussion

Reference Values. The reference values of SSC, DMC, TPC, and TFC in mulberry fruits were determined after the spectra were recorded. As shown in , the mean of SSC, DMC, TPC, and TFC were 10.21 Brix, 11.92%, 3.06 mg/g, and 2.26 mg/g, respectively, and the corresponding standard deviation values were 3.16 Brix, 2.26%, 1.25 mg/g and 0.84 mg/g, respectively. The coefficients of variation (C.V.) were 30.96%, 18.94%, 40.95%, and 37.32%, respectively, which suggested that the parameters vary strongly, especially for the TPC and TFC. It is indicative that the collected samples are representative, and the calibration model will show good performance for the determination of unknown samples.
Table 1

Statistical analysis of the reference results of the 4 parameters of mulberry fruits.

Statistical ParametersSSC (Brix)DMC(g/g, %)TPC (mg/g)TFC(mg/g)
TotalCal.*Test *TotalCal.TestTotalCal.TestTotalCal.Test
Number1137637946331916130815427
Mean10.1610.2110.0711.9411.9211.873.073.063.092.342.262.51
Max17.3917.3916.3716.5416.5416.436.676.676.484.014.013.98
Min3.803.804.007.877.177.870.971.220.971.071.071.09
Range13.5913.5912.378.679.378.555.705.455.512.942.942.90
Std.3.103.163.022.292.262.351.231.251.190.830.840.78
C.V.30.5130.9629.9419.2018.9419.8139.9340.9538.5035.2837.3830.97

*Cal and Test stand for calibration and test set, respectively.

Statistical analysis of the reference results of the 4 parameters of mulberry fruits. *Cal and Test stand for calibration and test set, respectively.

NIR Spectra

The raw NIR spectra of the calibration set are shown in . The absorption bands at 990 nm and 1,450 nm are related to the 2nd and 1st overtones of the ν(OH) stretching vibration, respectively. The absorption bands from 1,110 nm to 1255 nm belong to the 2nd overtones of ν(CH) stretchng vibrations.
Figure 9

The raw NIR spectra of the mulberry fruit calibration set.

The raw NIR spectra of the mulberry fruit calibration set.

Spectral Pretreatment

Different methods were used to pretreat the spectral data. The spectra pretreated by SNV only, and a combination of SNV + 1st derivative are shown in , respectively, and specifically in the second pretreatment, an accentuation of spectral features can be observed.
Figure 10

Calibration spectra pretreated by SNV (A), and by SNV + 1st derivative (B).

Calibration spectra pretreated by SNV (A), and by SNV + 1st derivative (B). The results in show that the pretreated spectra can significantly affect the prediction accuracy of the model. Because the SNV method corrects for scattering effects caused by sample roughness and particle heterogeneity (Yan and Siesler, 2018b) the prediction accuracy of the SSC and DMC calibration models is improved. For the TPC and TFC, the SNV followed by the 1st derivative yielded the best calibration performance. Obviously, besides the scatter correction effect of the SNV, the first derivative contributes spectral features that are beneficial for the calibration of low-content and complex components (such as the polyphenols and flavonoids).
Table 2

The influence of spectra pretreatment methods on the calibration performance (the best calibration results are reproduced in bold numbers).

ParametersPretreatment MethodsFactorsRc2RMSECRcv2RMSECVRp2RMSEP
SSCNone90.91420.91980.8721.19310.90010.9667
SNV70.91290.92660.88831.09620.88911.0412
SNV + 1st70.9180.89890.88671.10410.89741.0168
1st70.91120.93580.87511.22630.87811.076
1st + SNV60.91130.93520.88561.10380.88791.059
DSCNone70.90160.70310.84090.96950.86131.0215
SNV70.91480.6540.86830.88060.91640.7328
SNV + 1st70.93240.58250.88940.81630.89620.8901
1st70.91190.6650.86210.89180.89190.9753
1st + SNV50.87030.80720.82770.96560.86470.9776
TPCNone70.81760.53070.73120.6880.82880.5764
SNV70.86750.45240.80570.59170.84850.5184
SNV + 1st60.88290.42530.83430.53640.83850.537
1st60.8650.45670.81650.57030.84220.5568
1st + SNV50.83010.51230.75580.650.83950.5722
TFCNone70.77330.39770.66320.520.56650.5717
SNV70.81460.35960.71240.480.70270.4662
SNV + 1st60.82490.34940.72030.47370.73640.4023
1st60.80880.36520.72080.47510.58640.5625
1st + SNV60.82920.34510.74310.45670.71160.417
The influence of spectra pretreatment methods on the calibration performance (the best calibration results are reproduced in bold numbers).

Wavelength Selection

shows a diagram of the NIR wavelength selection screening for the SSC content that is similar to the previous application example. By the CARS selection, the most sensitive wavelength variables were obtained (see ). For SSC, TPC, and TFC, the performance of CARS was better than that of GA. As shown in for DMC, 54 variables were selected in 200 runs of the genetic algorithms and subsequently used for the development of a PLS model. The different variables selected by these two methods for the four components are shown in . It is of interest that the variables at about 900 nm, 1,110 nm and in the 1,380–1,440 nm range, selected for TFC are also selected for TPC; the reason maybe that the flavonoids belong to the class of polyphenols and these variables are important for both, TPC and TFC.
Figure 11

Wavelength-variable screening by CARS (A), and GA (B).

Table 3

Comparison of the impact of the two wavelength selection methods CARS and GA on the calibration performance of the four quality parameters of mulberry fruits (the bold numbers highlight the best calibration results).

MethodsParametersVariablesFactorsRc2RMSECRcv2RMSECVRPDRp2RMSEP
CarsSSC950.91790.89980.89791.04623.510.93130.8843
DMC1040.90360.69570.88420.78413.250.91940.7961
TPC1950.89890.39520.86430.48183.160.86510.4884
TFC1150.81540.35880.77110.4122.340.71770.4061
GASSC1470.92870.83820.90461.01083.770.90431.0146
DMC5470.92950.59500.89770.76083.800.90710.7758
TPC7560.89420.40420.85850.49163.090.86420.4876
TFC2760.79140.38150.72990.45362.200.71530.4097
Figure 12

The wavelength variables selected by CARS (♦, •, ▴) and GA (▪) for the four parameters under investigation.

Wavelength-variable screening by CARS (A), and GA (B). Comparison of the impact of the two wavelength selection methods CARS and GA on the calibration performance of the four quality parameters of mulberry fruits (the bold numbers highlight the best calibration results). The wavelength variables selected by CARS (♦, •, ▴) and GA (▪) for the four parameters under investigation.

Analysis of the Calibration Statistics

The number of optimal factors chosen for a calibration model has a significant impact on its prediction ability. When the number of factors is too low, the model does not entirely reflect the characteristics of the substance, which leads to lower prediction accuracy. Too many factors lead to over-fitting and yield an—apparently—high prediction accuracy. However, when the model is applied to unknown samples, the prediction effect is weak because the model is not robust. Cross-validation was applied to the calibration models with the smallest optimal number of factors. For SSC, DSC, TPC, and TFC, the optimal number of factors are 5,7,5 and 5, respectively. In the graphs of the RMSEC and RMSECV versus the number of factors are shown for the SSC, DMC, TPC, and TFC. The errors mark the final choice of the optimum number of factors for the individual parameter.
Figure 13

The effect of the number of factors on the RMSEC and RMSECV for SSC (A), DMC (B), TPC (C) and TFC (D).

The effect of the number of factors on the RMSEC and RMSECV for SSC (A), DMC (B), TPC (C) and TFC (D). The calibration parameters for the different components are summarized in . Although only nine wavelength variables were selected for SSC, the calibration performance is the highest. The Rc2 and Rcv2 are 0.9179 and 0.8979, and the corresponding RMSEC and RMSECV are 0.8998 Brix and 1.0462 Brix, respectively. The high R2 values and the low RMSEs are characteristic of a good prediction capability. Furthermore, the R2 and RMSE values for the calibration and cross-validation are similar, which indicates that the calibration model is robust. For DMC, the best calibration is built with the 54 wavelength variables selected by GA. The R2 values for the calibration and cross-validation are 0.9295 and 0.8977, respectively, and the corresponding RMSEC and RMSECV are 0.5950% and 0.7608%, also suggesting a good calibration performance. However, the robustness is not as good as that of the SSC calibration, because of the larger difference between the statistical parameters of the calibration and the cross-validation. For TPC, 19 wavelength variables were selected for the calibration, and the R2 values are not as high as that of the DMC calibration. Therefore, the calibration yields results of lower accuracy than the DMC calibration, and furthermore, its robustness is also lower. Finally, the performance of the TFC calibration with 11 wavelength variables is also not as high as that of the TPC component. The Rc2 and Rcv2 are 0.8154 and 0.7711, respectively, with the consequence of lower calibration accuracy. The RPD values are also included to estimate how well the calibration model can predict the compositional data (Williams and Sobering, 1993; Fearn, 2002). The RPDs for SSC, DMC, TPC, and TFC are 3.77, 3.80, 3.16 and 2.34, respectively, which furnish evidence that SSC, DMC, and TPC can be accurately predicted in the investigated concentration range, whereas, at best, a medium quality calibration has been achieved for TFC. The scatter plots of the measured versus the predicted parameters are shown in . In agreement with the previously discussed calibration statistics results, the scatter distances from the regression lines also reflect that proper calibrations have been developed for SSC, DMC and TPC whereas for TFC a comparatively lower calibration performance has been achieved.
Figure 14

Scatter plots of the measured and predicted parameter values of the calibration and test samples: SSC (A), DMC (B), TPC (C) and TFC (D).

Scatter plots of the measured and predicted parameter values of the calibration and test samples: SSC (A), DMC (B), TPC (C) and TFC (D).

Validation With Test Samples

In order to test the performance of the calibrations, a series of test samples (defined as “unknowns” despite available reference values) were used to validate the prediction accuracy. Their calibration statistics results have been summarized in . The Rp2 for SSC, DMC, TPC and TFC are 0.9313, 0.9071, 0.8651 and 0.9071, respectively, and the corresponding RMSEPs are 0.8843 Brix, 0.7758 %, 0.4884 mg/g and 0.4061 mg/g, respectively. The similar accuracy for the calibration set and cross-validation set suggests that the calibrations are robust. A detailed comparison of prediction and reference results is provided in . In general, for the SSC and DMC, the absolute and relative errors are small, which meets the application requirements. Large relative errors were obtained for the TPC and TFC, but because the absolute errors are small, the calibrations are suitable for screening purposes of consumers, who use a handheld NIR spectrometer to detect whether the mulberry fruits contain a high content of TPC or TFC that is beneficial for the human body.
Table 4

Prediction results for the “unknown” test samples.

ParametersNo.MeasuredPredictedAbsolute ErrorRelative Error(%)No.MeasuredPredictedAbsolute ErrorRelative Error(%)
SSC (Brix)S14.004.140.143.57S2010.0010.800.808.00
S25.504.44−1.06−19.30S2110.2010.410.212.06
S36.006.370.376.24S2210.4011.441.0410.00
S46.106.650.559.02S2310.009.07−0.93−9.30
S56.606.20−0.40−6.09S2413.9013.46−0.44−3.17
S66.906.23−0.67−9.77S2511.009.87−1.13−10.27
S77.408.280.8811.89S2611.2011.10−0.10−0.89
S87.606.12−1.48−19.48S2711.409.96−1.44−12.63
S97.809.051.2516.05S2811.6012.470.877.50
S108.006.90−1.10−13.78S2912.4013.260.866.94
S118.005.99−2.01−25.17S3012.7013.140.443.46
S128.309.381.0812.99S3112.9014.761.8614.42
S138.408.00−0.40−4.81S3213.2013.10−0.10−0.76
S148.908.20−0.70−7.85S3313.8013.60−0.20−1.45
S159.0010.331.3314.82S3414.2015.070.876.13
S169.009.200.202.21S3515.0015.750.755.00
S179.609.58−0.07−0.72S3615.7015.66-0.04−0.25
S189.709.34−0.02−0.20S3716.4016.920.523.17
S199.809.17−0.63−6.41
DMC (%)D17.877.61−0.27−3.37D1712.1611.57−0.59−4.83
D28.408.480.080.90D1812.1912.390.201.63
D38.8210.391.5717.79D1912.3111.69−0.61−4.98
D48.969.940.9810.94D2012.5112.29−0.21−1.72
D59.2510.641.3915.00D2112.7713.240.473.71
D69.479.45−0.03−0.29D2213.1411.43−1.71−13.04
D79.669.45−0.21−2.14D2313.2912.21−1.08-8.13
D810.0910.280.201.96D2413.4213.940.513.83
D910.1710.340.181.74D2513.8213.11−0.71−5.15
D1010.2310.20−0.03−0.33D2613.9913.37−0.62−4.40
D1110.8210.74−0.08−0.76D2714.9913.69−1.29−8.63
D1210.9110.920.010.05D2815.3815.35−0.02−0.15
D1310.9311.180.252.28D2915.7414.64−1.10−6.96
D1411.3110.17−1.13−10.02D3015.9615.44−0.52−3.28
D1511.4211.00−0.42−3.70D3116.4315.18−1.25−7.61
D1611.6010.83−0.77−6.63
TPC (mg/g)P12.052.270.2311.01P163.012.46−0.55−18.29
P21.581.800.2214.04P173.082.59−0.49−15.89
P31.401.38−0.02−1.38P183.132.86−0.27−8.61
P41.861.28−0.58−31.05P193.323.22−0.10−2.92
P51.941.27−0.66−34.30P203.493.22−0.27−7.63
P62.092.870.7837.45P213.573.24−0.33−9.26
P73.203.320.123.71P223.692.67−1.02−27.65
P82.351.77−0.58−24.63P233.893.22−0.67−17.29
P92.421.99−0.43−17.85P244.074.340.276.66
P102.552.660.114.41P254.103.96−0.14−3.44
P112.602.48−0.12−4.67P264.605.030.439.38
P122.742.19−0.56−20.28P274.683.54−1.14−24.37
P132.793.390.6021.58P285.334.82−0.51−9.54
P142.883.110.227.78P296.486.42−0.06−0.97
P152.972.90−0.07−2.43P300.970.48−0.49−50.44
TFC (mg/g)F11.742.050.3017.46F152.902.84−0.06−2.00
F21.982.530.5527.98F163.043.170.134.39
F33.133.140.010.17F173.843.42−0.42−10.92
F42.182.640.4721.52F181.371.830.4633.80
F52.211.92−0.29−13.04F192.022.00−0.02−1.04
F62.332.940.6126.17F202.732.15−0.58−21.23
F72.392.780.3916.30F213.193.500.319.76
F82.432.23−0.20−8.08F223.282.97−0.31−9.37
F92.622.25−0.37−14.23F231.371.600.2316.66
F102.832.15−0.68−24.00F241.090.92−0.16−15.07
F113.032.59−0.44−14.58F252.382.680.3012.45
F123.082.15−0.93−30.16F261.832.170.3318.30
F133.533.04−0.48−13.67F271.231.510.2822.99
F143.984.030.041.06
Prediction results for the “unknown” test samples.

Conclusions

Generally, hand-held NIR instruments have launched vibrational spectroscopy into a new era of in-the-field and on-site analysis. In the present communication hand-held NIR spectrometers were applied for qualitative and quantitative plant analytical case studies. In the qualitative example, it was demonstrated that high-value fengdous based on DOK plants can be successfully discriminated from lower quality fengdous of DDP plants. The quantitative application example outlined in detail the assay of the nutritional parameters SSC, DMC, TPC, and TFC of mulberry fruits by hand-held NIR spectroscopy. In both cases, the analysis of the spectroscopic data was performed with chemometric evaluation routines in combination with wavelength selection methods. Although the measurement and evaluation routines have not yet reached the convenience for public use by a non-expert user community, the integration of NIR spectrometers into mobile phones and the development of apps for specific analytical procedures in food, plant and material quality control will significantly change the every-day-life of consumers in the near future.

Data Availability Statement

All datasets generated for this study are included in the article/supplementary material.

Author Contributions

HY: Investigation, Data curation, Methodology. Y-CX: Investigation, Data curation. HS: Methodology, Supervision. B-XH: Funding acquisition, Investigation. G-ZZ: Funding acquisition, Investigation.

Funding

This Work Was Supported by a Special Project for the Construction of a Modern Agricultural Technology System (Grant Number CARS-18, CARS-21), National Key Research and Development Program of China (2017YFC1700701), Anhui Provincial Science Fund for Distinguished Young Scholars (1808085J17), Jiangsu Province Natural Science Foundation (Grant Number BK20131239).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  4 in total

Review 1.  Miniaturized NIR Spectroscopy in Food Analysis and Quality Control: Promises, Challenges, and Perspectives.

Authors:  Krzysztof B Beć; Justyna Grabska; Christian W Huck
Journal:  Foods       Date:  2022-05-18

2.  Characterization of Substrates and Surface-Enhancement in Atomic Force Microscopy Infrared Analysis of Amyloid Aggregates.

Authors:  Stanislav Rizevsky; Kiryl Zhaliazka; Tianyi Dou; Mikhail Matveyenka; Dmitry Kurouski
Journal:  J Phys Chem C Nanomater Interfaces       Date:  2022-02-17       Impact factor: 4.177

3.  Assessment of the Analytical Performance of Three Near-Infrared Spectroscopy Instruments (Benchtop, Handheld and Portable) through the Investigation of Coriander Seed Authenticity.

Authors:  Claire McVey; Una Gordon; Simon A Haughey; Christopher T Elliott
Journal:  Foods       Date:  2021-04-27

Review 4.  Handheld Devices for Food Authentication and Their Applications: A Review.

Authors:  Judith Müller-Maatsch; Saskia M van Ruth
Journal:  Foods       Date:  2021-11-23
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.