Literature DB >> 28911496

Is it possible to rapidly and noninvasively identify different plants from Asteraceae using electronic nose with multiple mathematical algorithms?

Hui-Qin Zou1, Gang Lu1, Yong Liu1, Rudolf Bauer2, Ou Tao1, Jian-Ting Gong1, Li-Ying Zhao1, Jia-Hui Li1, Zhi-Yu Ren1, Yong-Hong Yan1.   

Abstract

Many plants originating from the Asteraceae family are applied as herbal medicines and also beverage ingredients in Asian areas, particularly in China. However, they may be confused due to their similar odor, especially when ground into powder, losing their typical macroscopic characteristics. In this paper, 11 different multiple mathematical algorithms, which are commonly used in data processing, were utilized and compared to analyze the electronic nose (E-nose) response signals of different plants from Asteraceae family. Results demonstrate that three-dimensional plot scatter figure of principal component analysis with less extracted components could offer the identification results more visually; simultaneously, all nine kinds of artificial neural network could give classification accuracies at 100%. This paper presents a rapid, accurate, and effective method to distinguish Asteraceae plants based on their response signals in E-nose. It also gives insights to further studies, such as to find unique sensors that are more sensitive and exclusive to volatile components in Chinese herbal medicines and to improve the identification ability of E-nose. Screening sensors made by other novel materials would be also an interesting way to improve identification capability of E-nose.
Copyright © 2015. Published by Elsevier B.V.

Entities:  

Keywords:  Asteraceae family; Chinese herbal medicine; artificial neural networks; discriminative model; electronic nose

Year:  2015        PMID: 28911496      PMCID: PMC9345448          DOI: 10.1016/j.jfda.2015.07.001

Source DB:  PubMed          Journal:  J Food Drug Anal            Impact factor:   6.157


1. Introduction

Plants used as Chinese herbal medicines (CHM) in traditional Chinese medicine system are getting more international attention based on their modern biological activities and as alternative treatments for some chronic [1] and refractory diseases [2]. Among such plants, most are also used as food ingredients, chemical materials, etc. especially in Asian countries. However, raw plants with low quality or even from the wrong source [3] are sold in the markets, resulting in wrong clinic medication, which will surely bring in economic loss, poor clinical effects, or even poisoning [4]. Therefore, there is an urgent need to establish rapid, accurate, and practical methods for medicinal plants identification. In recent years, many modern techniques have been introduced into CHM analysis, including high-performance liquid chromatography–UV fingerprints [5], mass spectrometry, DNA genetic analysis and so on. The whole chemical profile of CHM could be expressed in different fingerprints that are used to identify original materials, especially combined with multivariate statistical analyses. As for the analyses of volatile components in CHM, gas chromatography, and gas chromatography–mass spectrometry [6] are the most popular ways to determine volatile components in CHM. However, these methods normally only detect one or more chemical compositions, and most of the given information reflects the fragments instead of the holistic state of the volatile components. They are also time-consuming, need complex sample pretreatment, and do not give environmental protection. Compared to them, electronic nose (E-nose) is a simple, rapid, and noninvasive technology that require less sample and no organic reagents. The initial and unique chemical form of the volatile components in CHM could be reflected by their response to E-nose, which can be used to identify different CHM [7]. E-nose, which has already been applied in various fields in recent decades, is a very promising method for identifying different samples based on their different volatile components. In these studies, E-nose had been used for identifying different rice plants [8,9] and predicting storage time and quality of peanuts [10], peach [11], coffee [12], winter jujube [13], grass carp [14], etc. It has also been employed in clinical breath diagnosis [15]. The data processing methods differ in different areas, but all reach the same goal: establishing the discriminative model from response signals in E-nose with different mathematic algorithms [16]. Questions are as follows: is the data processing method unique for each research? Which one is most suitable and how to filtrate it? Could they simply be equally employed? Therefore, one of the key topics is how to screen and select the efficient discriminative model in the application of E-nose, which could certainly promote its practicability and generalization. In CHM identification, E-nose has been used in identification [17], quality assessment [18], and geoherbalism evaluation [19] combined with various multiple mathematical algorithms, such as diffusion maps, or principal component analysis (PCA). However, there are few studies on the comparison of various classification models including PCA, partial least squares (PLS), and artificial neural network (ANN). In this study, 11 different multiple mathematical algorithms, which are commonly used in data processing, were utilized and compared to analyze the E-nose response signals of different plants from Asteraceae family. Results demonstrate that three-dimensional plot scatter figure of PCA with less extracted components could offer the identification results more visually; simultaneously all nine kinds of ANN could give classification accuracies of 100%.

2. Materials and methods

2.1. Plant materials

Eight different species of plants, all originating from Asteraceae family, were purchased from Beijing Tongrentang Co., Ltd. (Beijing, China) and identified by Professor Y. H. Yan in Beijing University of Chinese Medicine (Beijing, China). Samples were labeled as Bai Zhu, Cang Zhu, Gong Ju, Ye Ju Hua, Ai Ye, Mu Xiang, E Bu Shi Cao, and Niu Bang Zi (Table 1).
Table 1

Labels and origins of eight plants from Asteraceae family.

NumberLabelHerbal name
1 Bai Zhu Dried Rhizoma of Atractylodes macrocephala Koidz.
2 Cang Zhu Dried Rhizoma of Atractylodes lancea (Thunb.) DC.
3 Gong Ju Dried Flos of Chrysanthemum morifolium Ramat.
4 Ye Ju Hua Dried Flos of Chrysanthemum indicum L.
5 Ai Ye Dried Folium of Artemisia argyi Levl. et Vant.
6 Mu Xiang Dried Radix of Aucklandia lappa Decne.
7 E Bu Shi Cao Dried Herba of Centipeda minima (L.) A. Br. et Aschers.
8 Niu Bang Zi Dried Fructus of Arctium lappa L.

2.2. E-nose

E-nose (α-FOX3000; Alpha MOS, Toulouse, France) consists of 12 metal oxide semiconductor (MOS) sensors, a head space sampler and a signal processing system. Twelve commercial metal oxide sensors are placed in two rectangular chambers, six in each. A list of their information and their application is in Table 2. They are LY2/LG, LY2/G, LY2/AA, LY2/GH, LY2/gCTL, LY2/gCT, T30/1, P10/1, P10/2, P40/1, T70/2, and PA/2, respectively numbered as S1, S2, S3, …, S12. The sensor response was expressed as the ratio of conductance (G/G).
Table 2

Main application of 12 metal oxide semiconductor sensors in α-FOX3000 E-nose.

No.NameMain application
S1LY2/LGOxidizing gas
S2LY2/GAmmonia, carbon monoxide
S3LY2/AAEthanol
S4LY2/GHAmmonia/organic amine
S5LY2/gCTLHydrogen sulfide
S6LY2/gCTPropane/butane
S7T30/1Organic solvents
S8P10/1Hydrocarbons
S9P10/2Methane
S10P40/1Fluorine
S11T70/2Aromatic compounds
S12PA/2Ethanol, ammonia/organic amine
Ground into small particles, 0.2 g of each sample was accurately weighed into a 10-mL septa-sealed bottle and loaded into the auto-sampler tray. After incubation with optimized parameters in the previous research (temperature 30°C, time 300 seconds), 2000 μL of headspace air was injected into the E-nose system automatically via a syringe and detected by the MOS sensor array. The conductance ratio of each sensor changed during the measurement process. The measurement phase lasted for 120 seconds, which was enough for all the sensors to reach stable values and return to the baseline. Signals were collected by the computer and the data acquisition cycle was 1 second. Six repeated samples were prepared for each kind of plant and a total of 48 measurements were performed by the dynamic headspace sampling procedure. The E-nose responses values of those plants were extracted and recorded by the computer. Then different kinds of classification models were established to process the data and identify them.

2.3. Data analysis

Raw data from E-nose system were imported into software by peak values of responses. Afterwards, the processed data set was analyzed by PCA and PLS using Simca-P software (version 11.0; Umetrics AB, Umea, Sweden) after components screening and data dimension reduction. Visual identification could be obtained both from PCA and PLS including two- and three-dimensional scatter plot figures. The same peak values data set was processed via nine ANN classifiers [Bayes net, naïve Bayes bet, naïve Bayes updateable, logistic analysis, multiple layer perception, radial basis function (RBF) network, NB tree, random tree, and random forest] from WEKA software (http://www.cs.waikato.ac.nz/ml/weka/) to get the exact classification accuracies. In order to investigate the inner network structure, the same data set was imported into SPSS statistics software version 17.0, (IBM Corporation, Armonk, NY, USA) and then three layers (input layer, hidden layer, and output layer) of RBF-ANN were illustrated.

2.4. Evaluation of the classifiers

To evaluate the established models, a 10-fold cross-validation method was applied to avoid over-fitting and get the classification accuracy. The classification results should not be considered if the classification accuracy was < 80%.

3. Results and discussion

3.1. E-nose responses of herbal samples from Asteraceae family

When detecting the sensor response to a given sample, the response values are used as: R = G/G, where R is the response, G is the conductance of a sensor in the reference air, and G is the conductance of the sensor in the sample gas. Fig. 1 shows the typical responses of 12 MOS sensors with one sample of Ai Ye (dried folium of Artemisia argyi Levl. et Vant.). Each line represents the signals of an Ai Ye sample in one of the 12 MOS sensors. The horizontal axis is the time line, a total of 120 seconds; the vertical axis is the response value of the MOS sensor. The curves represent the resistance value of each sensor against time due to the electro-valve action when the volatile compounds reached the detection chamber. In the initial period, the response value of each sensor was low and then increased continuously, and finally stabilized after a few seconds or minutes. In this study, 12 maximum response values of each sample from 12 MOS sensors were extracted and analyzed individually.
Fig. 1

Typical responses of 12 metal oxide semiconductors measuring of a Ai Ye sample.

The repeatability of the established method was evaluated with six parallel tests of the samples. The relative standard deviation (n = 6) values of 12 MOS sensors were calculated. The results were all < 3%, proving a high repeatability of E-nose response.

3.2. PCA and PLS analysis and dimension reduction

PCA is a typical data dimension reduction analysis method that applies linear transformation to process original data from high dimensions into low dimensions. These processed data carry enough crucial information to find the inner correlation and difference among these massive amounts of information. Basically speaking, PCA helps us to determine which samples are different from the others and which principal components extracted from the original variances contribute more to this difference. It is commonly used in many areas such as human demography, quantitative geography, and molecular dynamics. PLS is a fundamental method for data analysis based on optimal lurie function confirmation via minimizing the square of errors in the original data set. Theoretically speaking, PLS is an important derivation algorithm from multivariate linear regression analysis, canonical correlation analysis and PCA. Two of its outstanding features are: (1) it is practical in PLS system to regression modeling with less amount of variables than samples; and (2) it is easier in PLS system to distinguish the systematic noise, even some kind of random noise, which enables PLS to identify samples even with minor differences. Additionally, PLS is as popular as PCA in several fields, including economics, mechanical control engineering, drug design, etc. In this paper, both of them were employed to find out the characteristic information of each Asteraceae plant’s odor response signals in E-nose. As illustrated in two-dimensional scatter plots of PCA and PLS (Fig. 2), samples from eight kinds of herbal plants from Asteraceae family could be successfully classified. The left-most samples are from Cang Zhu and the lower-most samples from Bai Zhu, far away from the other six (E Bu Shi Cao, Mu Xiang, Gong Ju, Ai Ye, Ye Ju Hua, and Niu Bang Zi). From the point of sensory smell, the former two are strong while the later six are slight. In terms of chemical compositions, those two, Cang Zhu and Bai Zhu, contain more volatile oil. The others have less, or even major in other kind of chemical components. For instance, the required detecting compound in Ye Ju Hua in Chinese pharmacopeia is linarin, and in the case of Niu Bang Zi, it is arctiin, which is also glycosides. Therefore, the response signal of these plants in E-nose also offer potential information closely related to their chemical constituents, which indicates that E-nose could be a promising method to identify herbal plants not only qualitatively but also quantitatively.
Fig. 2

Two dimensional scatter plots of principal component analysis (PCA) and partial least squares (PLS) for E-nose response signals of eight plants from Asteraceae family.

From the demonstrations in the three-dimensional scatter plots of PCA and PLS (Fig. 3), it could be visually gained that PCA did a better job than PLS. This may be a result of the different numbers of extractive components for modeling between them. Seven principal components were extracted via PCA but three more via PLS. That means there is more redundant information in the PLS classifier.
Fig. 3

Three dimensional scatter plots of principal component analysis (PCA) and partial least squares (PLS) for E-nose response signals of eight plants from Asteraceae family.

3.3. Identification results of discriminative models from nine different ANN classifiers

Nine different kinds of ANN classifiers were utilized to obtain the precise classification accuracies and a 10-fold cross-validation method was employed to train them. The results gave us 100% for all nine (Table 3), which shows feasibility and veracity of them to identify eight kinds of herbal plants combined with E-nose. This leads us to the idea that there is no need to choose one exclusive classifier to work with E-nose. Coupled with the results of PCA and PLS, it should be significant to focus on attribution screening and redundant information reduction in the future.
Table 3

Identification accuracies of nine types of artificial neural networks classifiers.

Artificial neural networks classifiersClassification accuracy through 10-folder cross-validation (%)
Bayes net100
Naïve Bayes net100
Naïve Bayes updateable100
Logistic analysis100
Multiple layer perception100
Radial basis function network100
NB tree100
Random tree100
Random forest100
A typical architecture of RBF-ANN for training and identification is normally composed of three layers, namely the input layer, the hidden layer, and the output layer (Fig. 4). In this initial RBF-ANN model, the input layer contains 12 units from 12 MOS sensors. All the original data of the input layer were imported into the hidden layer and then calculated by RBF. Afterwards identification results were gained and the samples were divided into eight different groups. The ANN-related calculating mechanism could be explained in this way.
Fig. 4

Architecture of three layers of radial basis function artificial neural network with 12 units in the input layer (codes stand for samples and S1–S12 stand for 12 metal oxide semiconductor sensors).

4. Conclusion

Many plants originating from the Asteraceae family are applied as herbal medicines and also beverage ingredients in Asian areas, particularly in China. However, they may be confused due to their similar odor, especially when they were ground into powder, losing their typical macroscopic characteristics. In this paper, E-nose was employed to extract and analyze the volatile components fingerprints of eight species of Asteraceae plants. PCA, PLS, and nine ANN classifiers were applied to establish the discriminative models. In PCA and PLS, eight plants could be clearly identified and PCA did a better job as told from the three dimensions scatter plot. Nine classification accuracies at 100% showed us successful models for authentication. This paper presents a rapid, accurate, and effective method to distinguish Asteraceae plants based on their response signals in E-nose. Also, it gives insights to further studies, such as to search for some kind of unique sensor that is more sensitive and exclusive to volatile components in CHM, to improve the identification ability of E-nose. Screening sensors made by other novel materials would be also an interesting way to improve the identification capability of E-nose.
  10 in total

1.  Authentication is fundamental for standardization of Chinese medicines.

Authors:  Zhongzhen Zhao; Yani Hu; Zhitao Liang; Jessie Pui-Sai Yuen; Zhihong Jiang; Kelvin Sze-Yin Leung
Journal:  Planta Med       Date:  2006-08-10       Impact factor: 3.352

2.  Detecting internal quality of peanuts during storage using electronic nose responses combined with physicochemical methods.

Authors:  Zhenbo Wei; Jun Wang; Weilin Zhang
Journal:  Food Chem       Date:  2015-01-06       Impact factor: 7.514

3.  Electronic nose: current status and future trends.

Authors:  Frank Röck; Nicolae Barsan; Udo Weimar
Journal:  Chem Rev       Date:  2008-01-19       Impact factor: 60.622

4.  A novel bioelectronic nose based on brain-machine interface using implanted electrode recording in vivo in olfactory bulb.

Authors:  Qi Dong; Liping Du; Liujing Zhuang; Rong Li; Qingjun Liu; Ping Wang
Journal:  Biosens Bioelectron       Date:  2013-05-29       Impact factor: 10.618

5.  Detection of coffee flavour ageing by solid-phase microextraction/surface acoustic wave sensor array technique (SPME/SAW).

Authors:  Nicole Barié; Mark Bücking; Ullrich Stahl; Michael Rapp
Journal:  Food Chem       Date:  2014-12-17       Impact factor: 7.514

6.  Geoherbalism evaluation of Radix Angelica sinensis based on electronic nose.

Authors:  Sihao Zheng; Weiguang Ren; Linfang Huang
Journal:  J Pharm Biomed Anal       Date:  2014-12-05       Impact factor: 3.935

7.  Winter jujube (Zizyphus jujuba Mill.) quality forecasting method based on electronic nose.

Authors:  Guohua Hui; Jiaojiao Jin; Shanggui Deng; Xiao Ye; Mengtian Zhao; Minmin Wang; Dandan Ye
Journal:  Food Chem       Date:  2014-08-19       Impact factor: 7.514

8.  Salvia officinalis for hot flushes: towards determination of mechanism of activity and active principles.

Authors:  Sinikka Rahte; Richard Evans; Philippe J Eugster; Laurence Marcourt; Jean-Luc Wolfender; Andreas Kortenkamp; Deniz Tasdemir
Journal:  Planta Med       Date:  2013-05-13       Impact factor: 3.007

9.  Targeting cancer-related inflammation: Chinese herbal medicine inhibits epithelial-to-mesenchymal transition in pancreatic cancer.

Authors:  Juan Zhang; Peng Wang; Huaqiang Ouyang; Jianhua Yin; Aihua Liu; Chunzheng Ma; Luming Liu
Journal:  PLoS One       Date:  2013-07-29       Impact factor: 3.240

10.  Characterization of odor-active compounds in cooked meat of farmed obscure puffer (Takifugu obscurus) using gas chromatography-mass spectrometry-olfactometry.

Authors:  Ning-Ping Tao; Rong Wu; Pei-Gen Zhou; Sai-Qi Gu; Wei Wu
Journal:  J Food Drug Anal       Date:  2014-05-22       Impact factor: 6.157

  10 in total
  4 in total

1.  Exploration on Varying Patterns of Morphological Features and Quality of Armeniacae Semen Amarum in Rancid Process Based on Colorimeter, Electronic Nose, and GC/MS Coupled With Human Panel.

Authors:  Yuanyang Shao; Huirong Chen; Hongxin Lin; Huishang Feng; Jianting Gong; Guangzhao Cao; Weifeng Hong; Yuebao Yao; Huiqin Zou; Yonghong Yan
Journal:  Front Pharmacol       Date:  2022-05-03       Impact factor: 5.988

2.  Determination of the geographical origin of Tetrastigma hemsleyanum Diels & Gilg using an electronic nose technique with multiple algorithms.

Authors:  Zhizhuang Wu; Xiaodan Ye; Fangyuan Bian; Ganglei Yu; Guibing Gao; Jiande Ou; Yukui Wang; Yueqiao Li; Xuhua Du
Journal:  Heliyon       Date:  2022-09-28

3.  Organoleptic Evaluation of Amomi Fructus and Its Further Background Verified via Morphological Measurement and GC Coupled with E-Nose.

Authors:  Dong Xu; Yan Lin; Rudolf Bauer; Hui-Rong Chen; Rui-Qi Yang; Hui-Qin Zou; Yong-Hong Yan
Journal:  Evid Based Complement Alternat Med       Date:  2018-03-05       Impact factor: 2.629

4.  Phenolic constituents of Pulicaria undulata (L.) C.A. Mey. sub sp. undulata (Asteraceae): Antioxidant protective effects and chemosystematic significances.

Authors:  Sameh R Hussein; Mona M Marzouk; Maha M Soltan; Emad K Ahmed; Mahmoud M Said; Ahmed R Hamed
Journal:  J Food Drug Anal       Date:  2016-12-22       Impact factor: 6.157

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.