Literature DB >> 29410795

Non-destructive geographical traceability of sea cucumber (Apostichopus japonicus) using near infrared spectroscopy combined with chemometric methods.

Xiuhan Guo1,2, Rui Cai3, Shisheng Wang1,2, Bo Tang1,2, Yueqing Li1,2, Weijie Zhao1,2.   

Abstract

Sea cucumber is the major tonic seafood worldwide, and geographical origin traceability is an important part of its quality and safety control. In this work, a non-destructive method for origin traceability of sea cucumber (Apostichopus japonicus) from northern China Sea and East China Sea using near infrared spectroscopy (NIRS) and multivariate analysis methods was proposed. Total fat contents of 189 fresh sea cucumber samples were determined and partial least-squares (PLS) regression was used to establish the quantitative NIRS model. The ordered predictor selection algorithm was performed to select feasible wavelength regions for the construction of PLS and identification models. The identification model was developed by principal component analysis combined with Mahalanobis distance and scaling to the first range algorithms. In the test set of the optimum PLS models, the root mean square error of prediction was 0.45, and correlation coefficient was 0.90. The correct classification rates of 100% were obtained in both identification calibration model and test model. The overall results indicated that NIRS method combined with chemometric analysis was a suitable tool for origin traceability and identification of fresh sea cucumber samples from nine origins in China.

Entities:  

Keywords:  Apostichopus japonicus; chemometrics; geographical origin; near infrared spectroscopy; total fat content; traceability

Year:  2018        PMID: 29410795      PMCID: PMC5792872          DOI: 10.1098/rsos.170714

Source DB:  PubMed          Journal:  R Soc Open Sci        ISSN: 2054-5703            Impact factor:   2.963


Introduction

Sea cucumber is a typical bottom-dwelling animal which pertains to the phylum Echinodermata (Aspidochirotida: Holothuroidea). Apostichopus japonicus, a species of sea cucumber mainly distributed in northern waters of the western Pacific, has a long history of dietary and medical uses in Asian countries, especially in China, Japan and Korea, owing to its various healthcare functions, including anti-tumour [1], immunoregulation [2], anti-atherosclerosis, antioxidant and antiapoptotic effects [3]. In China, there are more than 20 species of edible sea cucumbers widely distributed in the areas of the Yellow Sea, the Bohai Sea and the East Sea. Among them, the Yellow Sea and the Bohai Sea are primary producing areas famous for the high quality of sea cucumber, such as A. japonicus, which is one of the most valuable species and possesses large market share in China. Since sea cucumbers of the same species growing in different sea regions have different qualities and medical applications, and they are easily confused, difficult to be identified by sensory methods (sight, smell, taste, etc.), the origin of A. japonicus is considered to be a main factor influencing the price and consumer preference. For making exorbitant profits, many tortious acts such as confusing origins of products, selling seconds at best quality prices and adulteration are increased noticeably, and the resulting unscrupulous competitions have damaged consumer benefits and the reputation of geographical indication sea cucumbers. With regard to these severe food safety problems, the urgent countermeasure is to establish rapid and feasible methods for source identification and traceability of sea cucumber. Many research works have been carried out for origin traceability and identification of sea cucumber. Multi-element analysis with pattern recognition techniques [4] and the stable isotope ratios and fatty acid (FA) profiles analysis [5] have been used to evaluate the applicability in the origin traceability of sea cucumbers from different sampling points in China Sea. Diffuse reflectance mid-infrared Fourier transform spectroscopy [6] and species-specific polymerase chain reaction method [7] have been used for rapid identification of dried sea cucumber products from different geographical areas. These traditional methods are precise and reliable; however, most of them are often time-consuming and sophisticated, or require complex pre-processing [8,9]. Hence, rapid and accurate control methods are usually preferred, and one of the most frequently chosen and proposed method is near infrared spectroscopy (NIRS) technology. NIRS combined with chemometric methods is time-saving, cost-efficient and without the need of tedious sample preparation, and it has shown a good adaptability in authentication [10,11], origin traceability, discrimination on food safety problems [12-15], qualitative and quantitative analysis of natural products [16,17], pharmaceuticals [18-20], as well as monitoring and controlling product quality in agri-food processing [21-23]. In recent years, many studies on the quality control of seafood, distinction of geographical origin and content determination by NIRS have been reported [24-27]. However, the identification of the geographical origins of fresh sea cucumber using NIRS has not been reported so far. Chemometric methods offer more possibilities to analyse a large amount of data obtained by NIRS. One of the main issues of NIRS model establishment is to select the most informative wavelength regions. To extract the effective information and to remove irrelevant features, various methods have been developed, and one of these approaches is the ordered predictor selection (OPS) algorithm [28]. OPS has been used to select wavelengths for NIR analysis of coffee [29] and chemicals [30]. In our preliminary experiment, the result reveals the difficulty of choosing satisfactory wavelengths for identification of fresh sea cucumber. In this study, we intend to take the total FA content as the index for the wavelength region selection for identification model establishment by OPS, because the total fat contents and profile have been successfully used as biomarkers to confirm and trace the geographical origins of A. japonicus in some reports [5,31]. The principal component analysis combined with Mahalanobis distance (PCA-MD) algorithm and the scaling to the first range algorithm [32] have been successfully applied in our previous work for origin traceability identification of Chinese propolis [12] and Corydalis tuber [33], respectively. In view of the power of these methods in identifying the origins of natural products with complex components, they were adopted as identification chemometric methods in this study. To supply a new access to develop rational quality standards for improving the quality control of various sea cucumber (A. japonicus) origins, the major objective of this study was to develop a rapid, cheap and non-destructive method for identification of fresh A. japonicus from nine sampling locations in northern China Sea and East China Sea by using NIRS. The detailed aims of this study were to (i) select suitable wavelengths which could give best results for origin identification of A. japonicus by using OPS algorithm, (ii) establish NIRS identification model with PCA-MD and scaling to the first range algorithms to identify origin of A. japonicus and (iii) develop NIRS quantitative model for contents of total fat in A. japonicus with partial least-squares (PLS) regression.

Material and methods

Apparatus and reagents

The NIR spectra of A. japonicus were collected using an MPA™ Fourier transform near infrared spectrometer (Bruker Optics, Germany), which was equipped with reflectance fibre-optic probe, indium-gallium-arsenide (InGaAs) and lead sulfide (PbS) detector. Data analysis was performed by Matlab 7.1 software and OPUS 6.5 software from Bruker Optics. OPS algorithm was performed using OPS_Toolbox package, which was available at www.deq.ufv.br/chemometrics. Methanol and chloroform (Merck, Germany) were of HPLC grade. The other chemical reagents used in the experiments were bought from Energy Chemical Corp. (Shanghai, China). Ultrapure water was produced by a Milli-Q water system (Millipore Corp., MA, USA). N2 (99.999% purity) and He (99.999% purity) were provided by Junfeng Gas Co. (Dalian, China).

Sample collection and preparation

We collected 189 A. japonicus samples (3-year-old adult individuals) from nine habitats of three water environments, including main producing areas of A. japonicus of the northern China Sea and the East China Sea. All the samples were grown in 30 m depth deep sea and were not artificially fed. The detailed information is listed in table 1 and locations of sampling sites are shown in figure 1. All samples were dissected as soon as they were caught, and their digestive tracts were cleared away immediately. The body walls were kept in a freezer at −20°C during sampling and 2-day transport to laboratory. Then, each sample (body wall) was transected into two even parts (Part-A and Part-A′). Part-A was used without any pretreatment prior to spectrum acquisition, and Part-A′ was used for determination of total fat contents. Completely frozen Part-A′ was freeze-dried at −50°C for 48 h, and the dried samples were pulverized with a mortar and pestle, passed through an 80-mesh sieve, and then stored under dry conditions.
Table 1.

Information of sea cucumber A. japonicus samples collected from different origins of China Sea.

water environmentoriginsample codeslatitude--longitude pairsnumbersampling timeaverage length (cm)average weight (g)
Yellow SeaChanghaiCH39°15′42.8′′ N,22Nov 201515.3 ± 2.9114 ± 12.3
122°36′1′′ E
Bohai SeaWanfangdianWFD39°46′3′′ N,17Nov 201516.0 ± 1.6115 ± 11.5
121°29′31′′ E
Yellow SeaZhangzi IslandZZ39°1′8′′ N,20Nov 201516.3 ± 1.8109 ± 20.7
122°45′34′′ E
Yellow SeaPulandianPLD39°26′54′′ N,22Nov 201516.3 ± 2.0115 ± 12.7
122°22′32′′ E
Yellow SeaQingdaoQD35°55′57′′ N,20Dec 201516.0 ± 1.9110 ± 10.6
120°21′44′′ E
Yellow SeaRushanRS36°46′46′′ N,21Dec 201515.7 ± 3.4115 ± 12.7
121°34′59′′ E
Yellow SeaYantaiYT37°26′54′′ N,30Dec 201516.1 ± 1.2115 ± 12.0
121°26′19′′ E
Bohai SeaLaizhouLZ37°15′34′′ N,17Dec 201515.9 ± 2.1112 ± 16.7
119°52′22′′ E
East China SeaXiapuXP26°52′54′′ N,20Dec 201516.2 ± 2.0116 ± 10.2
120°5′47′′ E
Figure 1.

Sampling locations of A. japonicus from the Bohai Sea, Yellow Sea and East China Sea.

Sampling locations of A. japonicus from the Bohai Sea, Yellow Sea and East China Sea. Information of sea cucumber A. japonicus samples collected from different origins of China Sea.

Spectra collection

To minimize the deviation caused by human factors, each A. japonicus sample was placed into a test tube with 1.2 cm inner diameter, and the reflectance fibre-optic probe with 1.1 cm diameter was plugged into the tube and ‘fixed’ tightly on the surface of the sample to record NIR spectra. Each spectrum was measured in diffuse reflection mode in the range of 800−2500 nm with a 2 nm spectral resolution and an average of 64 scans. A background spectrum was recorded prior to analysing each sample. The temperature was kept around 25°C and the humidity was kept constant. The original spectrum for establishing NIR model was the average of three spectra recorded in different sites of each sample.

Quantitative analysis of total fat contents in Apostichopus japonicus

FA analyses of the samples were carried out according to Zhang et al. [5] and Xu et al. [31]. The Folch method of lipid extraction [34] was used in this study for total lipid extraction from A. japonicus. The powdered sample (0.5 g) was added to 10 ml of chloroformmethanol solution (2:1, v/v), and extracted by mechanical shaking for 30 min at 37°C, and extracted overnight, and then filtered. The extract was washed with 1/5 vol of 0.88% NaCl solution and left to stand for separation. The chloroform layer was collected, dried by anhydrous sodium sulfate and then concentrated under a nitrogen stream to obtain the total lipids fraction. A 20 mg sample of the total lipids was dissolved in 1 ml of 1% sulfuric acid--methanol solution in a 60°C water bath for 60 min to carry out methylation and then cooled. Subsequently, 1 ml of n-hexane was added, and the mixture was sonicated for 30 s and then left to stand. The supernatant was reserved and analysed by an Agilent 7890A/7000B triple quadrupole gas chromatography/mass spectrometry (GC/MS) instrument equipped with a capillary column HP-5 (30 m × 0.25 mm ID coated with 0.25 µm film thickness; Agilent, Santa Clara, CA). The programme of temperature was as follows: 70°C for 3 min, held at 220°C for 33 min after increasing at a rate of 3°C min−1. Temperatures of the injector and detector were set at 220 and 280°C, respectively. The injection volume was 1.0 µl. The carrier gas was He (99.999% purity), introduced at a flow rate of 1.2 ml min−1 in constant-flow mode. For MS measurement, the transfer-line temperature was 250°C, and the ion-source temperature was 230°C. Ionization of samples was carried out in electron ionization mode at electron energy of 70 eV. The GC retention time and the MS spectra of samples were, respectively, compared against the GC retention time of Supelco 37 Component FAME Mix (Supelco, Bellfonte, PA) and the MS spectra of a standard library (NIST2010) to determine the types of FA. Total fat content value was defined as the sum of the identified individual FA value and reported as mean values ± standard deviation, and analysed by one-way analysis of variance (ANOVA). Significant differences were determined by using Tukey's honestly significant difference (HSD) post hoc test (SPSS Statistics 20.0).

Chemometric processing

To establish a feasible NIRS identification model and PLS model, pre-processing methods were applied to eliminate unnecessary physical information and magnify relevant variations in original spectra. Different pre-processing methods were compared in accordance with their predictive performance, including the first derivative (FD), second derivative (SD), standard normal variate (SNV), [35] multiplicative scatter correction (MSC), [36] vector normalization (VN; firstly, the spectra are centred; then, the sum of all squares of all Y-values is calculated, and the respective spectrum is divided by the square root of this sum; the vector norm of the resulting spectrum is 1) [36], and SD-VN. To avoid enhancing the noise by SD, spectra were smoothed at first by Savizky–Golay (S.G) smoothing algorithm [37]. In this study, visible and short wavelength region (800−1000 nm) was excluded due to the high level of noise caused by instruments. The region of 1800−2500 nm was not applied on account of the absorption spectra were truncated as small amount of low energy light, which could not pass through the samples, meaning absorption spectra above 1850 nm showed stray light [38,39]. The OPS algorithm was applied to select suitable wavelength regions for PLS regression of total FA contents. Then the selected wavelength regions were used to build identification model. The NIR spectra were organized into a matrix format (m × n), and matrix was transformed into matrix by using pre-processing method. Total FA content values were arranged as the dependent variables (), and the pre-processed spectra (matrix ) of A. japonicus samples were used as the independent variables to develop PLS regression model. To validate PLS model, leave-one-out cross validation was used, and the dataset was split as follows: 148 samples were randomly selected to be calibration set, and the remaining 41 samples were arranged in test set. A ‘two-step’ NIRS identification model (Step I tandem Step II model) was established for classifying the origins of A. japonicus samples. The 144 samples were used to establish the calibration model, and the remaining 45 samples (five for each origin) were used for validation. One hundred and forty four samples were classified by Step I model at first, and the unidentified spectra were subsequently entered into the tandem Step II model for further classification. The optimum factors for model establishment of each step (pre-processing method, wavelength regions and identification chemometric method) were selected, respectively, that would enhance the predictive ability of NIRS model. PCA-MD algorithm and scaling to the first range algorithm were used in Step I and Step II models as the identification chemometric methods, respectively. The detailed formulae could be found in [33]. Selectivity (S) was used to evaluate the discriminating ability of identification model, which had been successfully used in our previous work [33,40,41]. S > 1 means the classes can be completely separated; S = 1 means the classes are in contact; and S < 1 means the classes are overlapping. The larger the S value is, the better the discrimination result is, and the more accurate the prediction model is. S is calculated by formula (2.1), where D denotes the distance from the centre of one category to another. The distance between the unknown spectra and the average spectrum in calibration set (Hit) is calculated, and the threshold for each category (DT) is calculated by formula (2.2), where SDev is the standard deviation of Hit; Q is a coefficient, and 0.25 is usually selected [12]. and

Results and discussion

Quantitative analysis of total fat contents by gas chromatography/mass spectrometry

According to the method described in §2.4, the determination of total fat contents of A. japonicus body wall from nine origins by GC/MS, and the results of ANOVA and Tukey's HSD post hoc test are listed in table 2, and those values were used to establish NIRS quantitative model as the standard values. The total fat contents reveal significant differences between A. japonicus from different origins. Samples from LZ showed statistically higher levels of total fat contents compared with those of samples from other origins. By contrast, samples from XP showed lower levels of total fat contents.
Table 2.

The value of total fat content in the body wall of A. japonicus from nine origins in China. XP, Xiapu; LZ, Laizhou; YT, Yantai; RS, Rushan; QD, Qingdao; WFD, Wanfangdian; PLD, Pulandian; CH, Changhai; ZZ, Zhangzi Island.

OriginXPQDRSYTLZWFDCHPLDZZ
ΣFc (%)4.63 ± 0.38b4.96 ± 0.92ab5.01 ± 0.73ab5.13 ± 1.35ab5.96 ± 2.34a5.84 ± 2.05ab5.66 ± 0.53ab5.76 ± 0.92ab5.33 ± 1.05ab

a,bDifference of total fat contents is significant (p ≤ 0.05).

cTotal fat content was reported as mean values ± standard deviation.

The value of total fat content in the body wall of A. japonicus from nine origins in China. XP, Xiapu; LZ, Laizhou; YT, Yantai; RS, Rushan; QD, Qingdao; WFD, Wanfangdian; PLD, Pulandian; CH, Changhai; ZZ, Zhangzi Island. a,bDifference of total fat contents is significant (p ≤ 0.05). cTotal fat content was reported as mean values ± standard deviation.

Results of partial least-squares model

The OPS algorithm was conducted as follows. The 189 original spectra of A. japonicus samples were organized into a format matrix (189 × 700). After the pre-processing investigation, S.G smoothing with 17 spectral points before second derivative was chosen for quantitative model due to the lowest root mean square error of cross-validation (RMSECV) and highest correlation coefficient of cross-validation (R2) of calibration model. The equation of RMSECV, the root mean square error of prediction (RMSEP) and R2 could be found in [42]. The original and pre-processed spectra are shown in figure 2.
Figure 2.

Original (a), second derivative after Savitsky–Golay smoothing with a window size of 17 points pre-processed (b) and second derivative coupled vector normalization pre-processed (c) NIR diffuse reflectance spectra of A. japonicus samples.

Original (a), second derivative after Savitsky–Golay smoothing with a window size of 17 points pre-processed (b) and second derivative coupled vector normalization pre-processed (c) NIR diffuse reflectance spectra of A. japonicus samples. The number of latent variables (l) of PLS model was also determined according to RMSECV and R2 values. The results of total FA content PLS models achieved by using different pre-processing methods with optimum parameters are listed in table 3. When 17 points smoothing and SD was adopted, l equals 8, the lowest RMSECV (0.31) and highest R2 (0.93) could be obtained.
Table 3.

The statistics of PLS models of total fat contents with different pre-processing methods. S.G(9.0)-SD, Savizky–Golay smoothing (17 points) before second derivative; FD, first derivative; SD, second derivative; SNV, standard normal variate; MSC, multiplicative scatter correction; VN, vector normalization; l, the number of latent variables; RMSECV, the root mean square error of cross-validation; RMSEP, the root mean square error of prediction; R2, correlation coefficient.

pre-processed methodlRMSECVRMSEPR2 (calibration)R2 (test)
S.G(17.0)-SD80.310.450.930.90
SNV90.910.990.740.62
MSC71.872.010.450.40
VN91.091.890.440.37
SD + VN80.310.530.900.81
FD91.231.760.710.56
The statistics of PLS models of total fat contents with different pre-processing methods. S.G(9.0)-SD, Savizky–Golay smoothing (17 points) before second derivative; FD, first derivative; SD, second derivative; SNV, standard normal variate; MSC, multiplicative scatter correction; VN, vector normalization; l, the number of latent variables; RMSECV, the root mean square error of cross-validation; RMSEP, the root mean square error of prediction; R2, correlation coefficient. After l value was investigated to build the optimal regression vector used in OPS, 30 regions from 700 absorbance variables were selected to build PLS model of total FA content. The detailed information of selected wavelength regions and relevant absorption bands of functional groups are listed in table 4. The selected regions were mainly associated with the vibration of functional groups of FAs.
Table 4.

Wavelength regions selected by OPS algorithm for establishing PLS regression model of total fat contents and identification model.

region selectedselected wavelength regions (nm)vibrational modescfunctional groups
11145–1208asecond overtone of C–HCH2 CH3
21210–1223a,bsecond overtone of C–HCH CH2
31225–1246asecond overtone of C–HCH
41287–1295afirst overtone of C–H combination bandsCH3
61300–1310a,bfirst overtone of C–H combination bandsCH3
71321–1346a,bfirst overtone of C–H combination bandsCH3
81352–1364afirst overtone of C–H combination bandsCH3
91368–1378afirst overtone of C–H combination bandsCH2 CH3 ArOH
101381–1385a,bfirst overtone of C–H combination bandsCH2 CH3 ArOH
111386–1390bfirst overtone of C–H combination bandsCH2 CH3 ArOH
121391–1393afirst overtone of C–H combination bandsCH2 CH3 ArOH
131394–1400a,bfirst overtone of C–H combination bandsCH2 CH3 ArOH ROH
141402–1410a,bfirst overtone of C–H combination bandsCH2 CH3 ArOH ROH
151412–1418bfirst overtone of C–H combination bandsCH2 CH3 ArOH ROH
161420–1422afirst overtone of C–H combination bandsCH CH2 ROH
171450–1452bfirst overtone of O–HCH ROH CONH2 CONHR
181452–1455a,bfirst overtone of O–HCH ROH CONH2 CONHR
191458–1462a,bfirst overtone of O–HROH CONH2 CONHR
201464–1473a,bfirst overtone of O–HROH CONH2 CONHR
211532–1538a,bfirst overtone of N–HRNH2
221540–1566a,bfirst overtone of N–HRNH2
231568–1602afirst overtone of N–HRNH2
241604–1622bfirst overtone of C–HArCH
251624–1645a,bfirst overtone of C–HArCH
261648–1651a,bfirst overtone of C–HArCH CH3
271654–1660a,bfirst overtone of C–HCH3
281665–1670a,bfirst overtone of C–HCH2 CH3
291674–1680a,bfirst overtone of C–HCH CH2 CH3
301688–1690a,bfirst overtone of C–HCH CH2 CH3
311698–1700a,bfirst overtone of C–HCH CH2 CH3
321712–1734a,bfirst overtone of C–HCH CH2
331734–1738bfirst overtone of C–H and S–HCH CH2 SH
341740–1750afirst overtone of C–H and S–HCH CH2 SH
351750–1758a,bfirst overtone of C–HCH
361760–1770afirst overtone of C–HCH

aWavelength regions for establishing Step I calibration model.

bWavelength regions for establishing Step II calibration model.

cThe spectral band assignments were referred to in [29,32,42,43].

Wavelength regions selected by OPS algorithm for establishing PLS regression model of total fat contents and identification model. aWavelength regions for establishing Step I calibration model. bWavelength regions for establishing Step II calibration model. cThe spectral band assignments were referred to in [29,32,42,43]. The above optimized PLS model was used to predict the total fat contents of 41 samples in the test set, and the scatter plots in figure 3 show the correlation between NIR prediction value and reference GC/MS measurement for fat contents by optimized PLS model. The calibration and prediction data revealed good correlation with reference measurement data in figure 3 where many data points fell on or were close to the unity line.
Figure 3.

Correlation between the NIRS predicted values and the reference values of total fat in A. japonicus samples by PLS in Step I quantitative model (148 samples in calibration set, in black; 41 samples in test set, in red).

Correlation between the NIRS predicted values and the reference values of total fat in A. japonicus samples by PLS in Step I quantitative model (148 samples in calibration set, in black; 41 samples in test set, in red).

Results of classification model

The flow chart of procedures for the ‘two-step’ identification model establishment is shown in figure 4. The spectral band assignments (in table 4) were presented in [29,32,42,43]. The 30 wavelength regions selected by OPS for PLS model of total fat contents were also used for establishment of Step I identification model. S.G smoothing with 17 points before SD was applied as a pre-processed method, and PCA-MD method was performed on selected wavelength regions (in table 4, with superscript a). After distinguishing by Step I model, 49 samples (red block in figure 4) entered into the tandem Step II for further identification due to the ambiguous classification. The detailed discrimination results (values of S) of Step I model are listed in table 5. In table 5, the samples listed in column ‘GroupstepI or GroupstepII’ were the ones that got the minimum difference with the samples in column ‘Group’ (minimum S value is obtained), and these minimum S values are listed in column ‘SI or SII’. Taking XP samples in the first line (column Group) as an example, after the Step I classification, the ones that got the minimum difference with them were the WFD samples (column GroupstepI), and the S value of Step I was 2.48 (SI > 1), which indicates that they can be completely separated, not necessary for the Step II classification. But as to the CH samples in the seventh line, after the Step I classification, the ones that got the minimum difference with them were the ZZ samples (column GroupstepI), and the S value of them was 0.51 (SI < 1), which indicates that they have not been separated (overlapping), therefore they needed to go through the Step II classification, and luckily the S value of Step II is 1.10 (SII > 1), which indicates that they have been successfully separated into the correct clusters. The largest SI value was obtained from XP (2.48), and the SI value of each group in Step I model was larger than 1.06, except SI values of CH (0.51), PLD (0.04) and ZZ (0.04), which indicated significant differences among A. japonicus samples from XP, WFD, QD, YT, RS and LZ origins. The results could also be seen from figure 5, where the three-dimensional principal component space provided an overview of the ability of the NIRS identification model to classify A. japonicus from nine origins in China Sea. The best discrimination was observed on the score plots in three-dimensional space represented by PC2 (10.3%), PC3 (2.4%) and PC4 (0.3%) vectors. Though the variance contribution rate of PC1 was up to 83.4%, PC1 was not adopted in model construction due to its ambiguous classification. It was speculated that PC1 contained main chemical information of the samples rather than different information between samples. As shown in figure 5, sea cucumber samples from nine different origins were classified into seven concentrated districts. The XP samples could be completely separated from samples of other origins, and the good classification result of samples from XP could be attributed to the significantly different water environments and food sources between the East China Sea and the northern China Sea. On the contrary, the distributions of samples from PLD, ZZ and CH were partially overlapped, which was coincident with their adjacent geographical locations in Yellow Sea, resulting in the similarity in water environments and food sources.
Figure 4.

The flow chart of procedures for the ‘two-step’ identification model establishment. Samples in green blocks represent unambiguous classified and samples in red block represent ambiguous classified. XP, Xiapu; LZ, Laizhou; YT, Yantai; RS, Rushan; QD, Qingdao; WFD, Wanfangdian; PLD, Pulandian; CH, Changhai; ZZ, Zhangzi Island.

Table 5.

The results of identification model. SI: S value of Step I model; SII: S value of Step II model; S > 1: the classes are completely separated; S = 1: the classes are in contact; S < 1: the classes are overlapping.

GroupGroupStepISIGroupStepIISII
XPWFD2.48
QDRS1.06
RSQD1.06
YTRS1.30
LZRS2.21
WFDPLD2.05
CHZZ0.51ZZ1.10
PLDZZ0.04ZZ1.31
ZZPLD0.04PLD1.31
Figure 5.

Score plots with three principal components (PCs) for Step I calibration set samples from nine origins including Changhai (CH), Wafangdian (WFD), ZhangziIsland (ZZ), Pulandian (PLD), Qingdao (QD), Rushan (RS), Yantai (YT), Laizhou (LZ), Xiapu (XP), which were obtained by second derivative after Savitsky–Golay smoothing with a window size of 17 points pre-processed.

The flow chart of procedures for the ‘two-step’ identification model establishment. Samples in green blocks represent unambiguous classified and samples in red block represent ambiguous classified. XP, Xiapu; LZ, Laizhou; YT, Yantai; RS, Rushan; QD, Qingdao; WFD, Wanfangdian; PLD, Pulandian; CH, Changhai; ZZ, Zhangzi Island. Score plots with three principal components (PCs) for Step I calibration set samples from nine origins including Changhai (CH), Wafangdian (WFD), ZhangziIsland (ZZ), Pulandian (PLD), Qingdao (QD), Rushan (RS), Yantai (YT), Laizhou (LZ), Xiapu (XP), which were obtained by second derivative after Savitsky–Golay smoothing with a window size of 17 points pre-processed. The results of identification model. SI: S value of Step I model; SII: S value of Step II model; S > 1: the classes are completely separated; S = 1: the classes are in contact; S < 1: the classes are overlapping. The 49 samples (red block in figure 4) that were not successfully classified by Step I model were subsequently separated by the tandem Step II model. SD-VN was used as the pre-processing method, and the scaling to the first range algorithm was performed on 25 wavelength regions selected by OPS (with superscript b, listed in table 4). Finally, SII for CH, PLD and ZZ were all larger than 1.10, which indicated that A. japonicus samples from CH, PLD and ZZ origins could be properly classified. And the overall results of Step II are shown in table 5. The 45 samples in the validation set were classified using the established models, and none of them was identified as the wrong class. The overall results indicated that, though the sea cucumber samples investigated in this work belonged to the same species, different geographical regions and water environments would lead to discriminative constituents which revealed discrepant spectral information of samples, and NIRS technique combined with chemometric analysis was a feasible tool to correctly classify A. japonicus samples from nine origins in China Sea.

Conclusion

In this work, the origin traceability and identification method developed by NIRS combined with chemometric analysis provides a satisfactory predictive ability for A. japonicus samples from nine origins in China. The correct classification rate by means of the classification model reached to 100%. Moreover, the established PLS model was suitable for the determination of total fat contents of sea cucumber. It is worth mentioning that the NIRS method does not require any complex pre-processing of samples, and the utilization of reflectance fibre-optic probe makes it possible for in situ online detection of wild sea cucumbers. Nonetheless, before the NIRS method can be practically applied to origin traceability and identification in sea cucumber industry, more extensive samples are needed to improve the accuracy and applicability of the identification models.
  23 in total

1.  Separation and identification of flavonoids from complex samples using off-line two-dimensional liquid chromatography tandem mass spectrometry.

Authors:  Jing Zeng; Xiuli Zhang; Zhimou Guo; Jiatao Feng; Jianguo Zeng; Xingya Xue; Xinmiao Liang
Journal:  J Chromatogr A       Date:  2011-11-28       Impact factor: 4.759

2.  Estimation of total soluble solids in grape berries using a hand-held NIR spectrometer under field conditions.

Authors:  Ruben Urraca; Andres Sanz-Garcia; Javier Tardaguila; Maria P Diago
Journal:  J Sci Food Agric       Date:  2015-10-23       Impact factor: 3.638

3.  Effective 2D-RPLC/RPLC enrichment and separation of micro-components from Hedyotis diffusa Willd. and characterization by using ultra-performance liquid chromatography/quadrupole time-of-flight mass spectrometry.

Authors:  Cunman Li; Yanyan Zhao; Zhimou Guo; Xiuli Zhang; Xingya Xue; Xinmiao Liang
Journal:  J Pharm Biomed Anal       Date:  2014-07-09       Impact factor: 3.935

4.  Performance of NIR handheld spectrometers for the detection of counterfeit tablets.

Authors:  Aurélie Guillemain; Klara Dégardin; Yves Roggo
Journal:  Talanta       Date:  2017-01-06       Impact factor: 6.057

5.  Chemometric models for the quantitative descriptive sensory analysis of Arabica coffee beverages using near infrared spectroscopy.

Authors:  J S Ribeiro; M M C Ferreira; T J G Salva
Journal:  Talanta       Date:  2010-11-10       Impact factor: 6.057

6.  Qualitative and quantitative analysis of endophyte alkaloids in perennial ryegrass using near-infrared spectroscopy.

Authors:  Milton C Soto-Barajas; Iñigo Zabalgogeazcoa; Inmaculada González-Martin; Beatriz R Vázquez-de-Aldana
Journal:  J Sci Food Agric       Date:  2017-05-29       Impact factor: 3.638

7.  Estimation of cellulose crystallinity of sugarcane biomass using near infrared spectroscopy and multivariate analysis methods.

Authors:  Ítalo P Caliari; Márcio H P Barbosa; Sukarno O Ferreira; Reinaldo F Teófilo
Journal:  Carbohydr Polym       Date:  2016-12-05       Impact factor: 9.381

Review 8.  Anticancer and cancer preventive compounds from edible marine organisms.

Authors:  Marta Correia-da-Silva; Emília Sousa; Madalena M M Pinto; Anake Kijjoa
Journal:  Semin Cancer Biol       Date:  2017-04-07       Impact factor: 15.707

9.  Use of near infrared spectroscopy to predict microbial numbers on Atlantic salmon.

Authors:  N B Tito; T Rodemann; S M Powell
Journal:  Food Microbiol       Date:  2012-07-25       Impact factor: 5.516

10.  Traceability of 'Limone di Siracusa PGI' by a multidisciplinary analytical and chemometric approach.

Authors:  M Amenta; S Fabroni; C Costa; P Rapisarda
Journal:  Food Chem       Date:  2016-05-18       Impact factor: 7.514

View more
  4 in total

Review 1.  A Review of the Discriminant Analysis Methods for Food Quality Based on Near-Infrared Spectroscopy and Pattern Recognition.

Authors:  Jian Zeng; Yuan Guo; Yanqing Han; Zhanming Li; Zhixin Yang; Qinqin Chai; Wu Wang; Yuyu Zhang; Caili Fu
Journal:  Molecules       Date:  2021-02-01       Impact factor: 4.411

2.  Histamine Control in Raw and Processed Tuna: A Rapid Tool Based on NIR Spectroscopy.

Authors:  Sergio Ghidini; Luca Maria Chiesa; Sara Panseri; Maria Olga Varrà; Adriana Ianieri; Davide Pessina; Emanuela Zanardi
Journal:  Foods       Date:  2021-04-18

3.  Rapid Geographical Origin Identification and Quality Assessment of Angelicae Sinensis Radix by FT-NIR Spectroscopy.

Authors:  Zhen-Yu Zhang; Ying-Jun Wang; Hui Yan; Xiang-Wei Chang; Gui-Sheng Zhou; Lei Zhu; Pei Liu; Sheng Guo; Tina T X Dong; Jin-Ao Duan
Journal:  J Anal Methods Chem       Date:  2021-01-12       Impact factor: 2.193

Review 4.  Application of Visible/Infrared Spectroscopy and Hyperspectral Imaging With Machine Learning Techniques for Identifying Food Varieties and Geographical Origins.

Authors:  Lei Feng; Baohua Wu; Susu Zhu; Yong He; Chu Zhang
Journal:  Front Nutr       Date:  2021-06-17
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.