| Literature DB >> 35754890 |
Huan Fang1, Yue Chen1, Hai-Long Wu1, Yao Chen1,2, Tong Wang1, Jian Yang3, Hai-Yan Fu4, Xiao-Long Yang4, Xu-Fu Li5, Ru-Qin Yu1.
Abstract
Geographical origin and authenticity are two core factors to promote the development of traditional Chinese medicine (TCM) herbs perception in terms of quality and price. Therefore, they are important to both sellers and consumers. Herein, we propose an efficient, accurate method for discrimination of genuine and non-authentic producing areas of TCM by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). Take Atractylodes macrocephala Koidz (AMK) of compositae as an example, the MALDI-TOF MS spectra data of 120 AMK samples aided by principal component analysis-linear discriminant analysis (PCA-LDA), partial least squares discriminant analysis (PLS-DA) and random forest (RF) successfully differentiated Zhejiang province, Anhui province and Hunan province AMK according to their geographical location of origin. The correct classification rates of test set were above 93.3%. Furthermore, 5 recollected AMK samples were used to verify the performance of the classification models. The outcome of this study can be a good resource in building a database for AMK. The combined utility of MALDI-TOF MS and chemometrics is expected to be expanded and applied to the origin traceability of other TCMs. This journal is © The Royal Society of Chemistry.Entities:
Year: 2022 PMID: 35754890 PMCID: PMC9171747 DOI: 10.1039/d2ra02040h
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Fig. 1The flow chart for geographical origin traceability of AMK.
Fig. 2The geographic map of AMK samples from three different provinces.
Fig. 3The MALDI-TOF MS profiles of AMK from Anhui (A), Hunan (B) and Zhejiang (C).
Fig. 4The MALDI-TOF MS profiles of main active ingredients of AMK.
Fig. 5The plots of the scores on the first two canonical variables (CVs) of PCA-LDA (A) and the first two latent variables (LVs) of PLS-DA (B).
The optimized parameters and correct classification rates (CCRs, %) for PCA-LDA, PLS-DA and RF model
| PCA-LDA | PLS-DA | RF | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| CVs | CV | Training | Test | LVs | CV | Training | Test |
| OOB | Training | Test |
| 16 | 94.4 | 98.9 | 93.3 | 7 | 91.1 | 100.0 | 96.7 | 100 | 13.3 | 98.9 | 100.0 |
CVs is the number of canonical variables.
CV is cross-validation.
LVs is the number of latent variables.
T is the number of decision tree.
OOB is the out-of-bag error.