| Literature DB >> 36234771 |
Bei Li1,2, Miao Liu3, Feng Lin3, Cui Tai1, Yanfei Xiong3, Ling Ao3, Yumin Liu4, Zhixin Lin1, Fei Tao1, Ping Xu1.
Abstract
Reliable methods are always greatly desired for the practice of food inspection. Currently, most food inspection techniques are mainly dependent on the identification of special components, which neglect the combination effects of different components and often lead to biased results. By using Chinese liquors as an example, we developed a new food identification method based on the combination of machine learning with GC × GC/TOF-MS. The sample preparation methods SPME and LLE were compared and optimized for producing repeatable and high-quality data. Then, two machine learning algorithms were tried, and the support vector machine (SVM) algorithm was finally chosen for its better performance. It is shown that the method performs well in identifying both the geographical origins and flavor types of Chinese liquors, with high accuracies of 91.86% and 97.67%, respectively. It is also reasonable to propose that combining machine learning with advanced chromatography could be used for other foods with complex components.Entities:
Keywords: Chinese liquors; GC × GC/TOF-MS; food inspection; machine learning
Mesh:
Year: 2022 PMID: 36234771 PMCID: PMC9572226 DOI: 10.3390/molecules27196237
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.927
Figure 1Three-dimensional visualization of raw HS-SPMEDVB/CAR-PDMS /GC × GC/TOF-MS data. (a) chromatography 3D plot of Fen liquor extracted by the HS-SPME and (b) the LLE. More peaks were detected by the HS-SPMEDVB/CAR-PDMS compared to the LLE, especially, in 17 min in 1st dimensional time according to the 3D plot. Some low abundance peaks invisible using the HS-SPMEDVB/CAR-PDMS could be detected by performing the LLE around 25 min in 1st dimensional time.
Figure 2Score-loading biplot of GC × GC/TOF-MS data from the PLS model of Chinese liquor showed the correlation among fibers and the area of 292 compounds. Typical data points representing the experimental solid-phase microextraction fiber. Colorized box representing different fibers and black triangles representing the area of 292 compounds. Each SPME experiment was triplicated.
Figure 3Results of the number of peaks of the different fibers and the Pareto chart of the 75-μm CAR/PDMS. Comparison of the extraction efficiency for each fiber by testing the different groups used in the experiment and measured as number of peaks (a). Standardized main effect Pareto chart, representing the estimated effects of the parameters obtained from the P-B design for the determination of the 75-μm CAR/PDMS, Vertical line in the chart defines 95% confidence level (b).
Figure 4Multivariate statistical analysis of different flavoring types of Chinese liquors. (a) PLS score scatter plot of the different flavor liquors based on a correlation analysis. Ellipses and shapes show clustering of the samples. The plot shows a strong correlation between the various flavor liquors. (b) Hierarchical clustering heat map of twelve Chinese liquors, with the degree of 339 compounds change in the six different flavoring type liquors. Individual samples (horizontal axis) and compounds (vertical axis) are separated and the top dendrogram is scaled to represent the distance between each branch. Different bar colors represent six types of flavor liquors.
Figure 5Total mass spectrum of 262 kinds of Chinese liquors obtained using the GC × GC/TOF-MS method.
Figure 6The output results of the SVM machine learning about thirty-six geographical origins of Chinese liquors (a,b), and six flavor types of Chinese liquors (c,d).