Xue Jintao1, Yang Quanwei2, Jing Yun1, Liu Yufei1, Li Chunyan3, Yang Jing4, Wu Yanfang1, Li Peng1, Wan Guangrui1. 1. Department of TCM, School of Pharmacy, Xinxiang Medical University, Xinxiang, PR China. 2. Department of pharmacy, Wu Han NO.1 Hospital, Wuhan, Hubei Province, PR China. 3. Department of TCM, School of Pharmacy, Xinxiang Medical University, Xinxiang, PR China; Department of pharmacy, Sanquan Medical College, Xinxiang, PR China. 4. Department of pharmacy, Puyang Health School, Puyang, Henan Province, PR China.
Abstract
BACKGROUND: Gegen (Puerariae Labatae Radix) is one of the important medicines in Traditional Chinese Medicine. The studies showed that Gegen and its preparation had effective actions for atherosclerosis. OBJECTIVE: Near-infrared (NIR) was used to develop a method for rapid determination of puerarin during percolation and concentration process of Gegen. MATERIALS AND METHODS: About ten batches of samples were collected with high-performance liquid chromatography analysis values as reference, calibration models are generated by partial least-squares (PLS) regression as linear regression, and artificial neural networks (ANN) as nonlinear regression. RESULTS: The root mean square error of prediction for the PLS and ANN model was 0.0396 and 0.0365 and correlation coefficients (r (2)) was 97.79% and 98.47%, respectively. CONCLUSIONS: The NIR model for the rapid analysis of puerarin can be used for on-line quality control in the percolation and concentration process. SUMMARY: Near-infrared was used to develop a method for on-line quality control in the percolation and concentration process of GegenCalibration models are generated by partial least-squares (PLS) regression as linear regression and artificial neural networks (ANN) as non-linear regressionThe root mean square error of prediction for the PLS and ANN model was 0.0396 and 0.0365 and correlation coefficients (r (2)) was 97.79% and 98.47%, respectively. Abbreviations used: NIR: Near-Infrared Spectroscopy; Gegen: Puerariae Loabatae Radix; TCM: Traditional Chinese Medicine; PLS: Partial least-squares; ANN: Artificial neural networks; RMSEP: Root mean square error of validation; R2: Correlation coefficients; PAT: Process analytical technology; FDA: The Food and Drug Administration; Rcal: Calibration set; RMSECV: Root mean square errors of cross-validation; RPD: Residual predictive deviation; SLS: Straight Line Subtraction; MLP: Multi-Layer Perceptron; MSE: Mean square error.
BACKGROUND: Gegen (Puerariae Labatae Radix) is one of the important medicines in Traditional Chinese Medicine. The studies showed that Gegen and its preparation had effective actions for atherosclerosis. OBJECTIVE: Near-infrared (NIR) was used to develop a method for rapid determination of puerarin during percolation and concentration process of Gegen. MATERIALS AND METHODS: About ten batches of samples were collected with high-performance liquid chromatography analysis values as reference, calibration models are generated by partial least-squares (PLS) regression as linear regression, and artificial neural networks (ANN) as nonlinear regression. RESULTS: The root mean square error of prediction for the PLS and ANN model was 0.0396 and 0.0365 and correlation coefficients (r (2)) was 97.79% and 98.47%, respectively. CONCLUSIONS: The NIR model for the rapid analysis of puerarin can be used for on-line quality control in the percolation and concentration process. SUMMARY: Near-infrared was used to develop a method for on-line quality control in the percolation and concentration process of GegenCalibration models are generated by partial least-squares (PLS) regression as linear regression and artificial neural networks (ANN) as non-linear regressionThe root mean square error of prediction for the PLS and ANN model was 0.0396 and 0.0365 and correlation coefficients (r (2)) was 97.79% and 98.47%, respectively. Abbreviations used: NIR: Near-Infrared Spectroscopy; Gegen: Puerariae Loabatae Radix; TCM: Traditional Chinese Medicine; PLS: Partial least-squares; ANN: Artificial neural networks; RMSEP: Root mean square error of validation; R2: Correlation coefficients; PAT: Process analytical technology; FDA: The Food and Drug Administration; Rcal: Calibration set; RMSECV: Root mean square errors of cross-validation; RPD: Residual predictive deviation; SLS: Straight Line Subtraction; MLP: Multi-Layer Perceptron; MSE: Mean square error.
Nowadays, high quality of drug products demands on system monitor not only in raw materials, but also in manufacturing processes and processing. The process analytical technology (PAT) initiative by the Food and Drug Administration makes specfic recommendations in improving the pharmaceutical manufacturing and quality assurance through process analysis and process control, since it will lead to idendification of critical sources of variability, better opportunities to manage process variability, and improved product quality.[123]PAT has also become more and more important in the pharmaceutical industry of Traditional Chinese Medicine (TCM). As is well-known, TCM and its formulae, which have developed a unique system in the past 3000 years, have good effect for different diseases.[4] However, the quality of TCM and its formulae were highly correlated to the quality of manufacturing processes, the crude herbs, and geographical origins (genuineness),[5] so the application of PAT for process analysis was highly desired in food and pharmaceutical industries.Near-infrared spectroscopy (NIR), associated with chemometrics, has been found to be an effective tool for process analysis.[3] As there are advantages of fast acquisition, no sample preparation, noninvasive, nondestructive, and simultaneous determination of physical properties, and content of several chemical components, NIR spectroscopy with the fiberoptic probes had been widely used in qualitative discrimination and quantitative determination of herb materials, separation monitoring, analysis of extraction process, and so on.[678910]Gegen (Puerariae Lobatae Radix) is among the earliest and most important medicine used in TCM.[10] Modern studies showed that the main active constituents, puerarin, had been widely used in cardiovascular diseases, acute dysentery, hypertension, alcoholism, deafness, cancer, and diarrhea.[9101112] The previous studies of our group showed that Gegen and its TCM preparation XinMaiJia could effectively inhibit the occurrence and development of atherosclerosis.[13141516]The objective of this study was to evaluate the possibilities of rapid determination of puerarin during percolation and concentration process of Gegen by NIR. The approach taken was to develop two calibration models by partial least-squares (PLS) as linear regression and artificial neural networks (ANN) as nonlinear regression. The calibration models were optimized individually with high-performance liquid chromatography (HPLC) analysis values as reference. The results of the validation showed the two models were robust, accurate, and repeatable for off-line fast analysis or on-line quality control.
MATERIALS AND METHODS
Samples and reagents
Ten batches of Gegen samples were obtained from different provinces of China. During the percolation and concentration process, 100 samples were collected at different times for HPLC analysis and recording NIR spectral.Puerarin standards (Batch No. 110752–201313) were obtained from the National Institute for the Control of Pharmaceutical and Biological Products (Beijing, PR China). HPLC grade methanol was purchased from Tedia Company Inc. (Ohio, USA). Water was purified by an ultrapure water instrument. All the other reagents were of analytical grade.
Determination of reference values by high-performance liquid chromatography
As referring to the Chinese Pharmacopoeia (volume II, 2010 edition), the reference values were analyzed immediately after the NIR measurements. A 1260 HPLC system (Agilent Technologies Inc., USA) consisting of ultraviolet–visible detector was used for the quantitative determination of puerarin at a wavelength of 250 nm. A Kromasil C18 column (250 mm × 4.6 mm, 5 μm) was employed in 25°C column temperature. The elution system was composed of methanol–water with 0.1% citric acid (25:75) at a flow rate of 1.0 mL/min. The aliquots of 10 μl sample injections were injected into the HPLC for the content analysis.
Near-infrared data collection
As shown in Figure 1, all NIR spectra were recorded by the Bruker Matrix-I FT-NIR spectrometer (Bruker Optik, Ettlingen, Germany) equipped with a PbS detector and a fiberoptic probe. The spectra were obtained over the spectral region of 12000–4000 cm−1 at a resolution of 8 cm−1 with air absorbance as the reference standard. Each sample measurement was repeated 2 times with 32 scans per spectrum.
Figure 1
Near-infrared spectra of Gegen samples
Near-infrared spectra of Gegen samples
Near-infrared data processing
As the most prominent absorption bands of NIR were broad, weak, extensively overlapped bands, so NIR quantification heavily relies on multivariate calibration methods, which were known as linear regression and nonlinear regression. PLS was most frequently used in linear regression, which was depending on a linear relationship between the reference values and the spectral intensity. ANN was most widely used mathematical algorithms for overcoming nonlinearity as the presence of nonlinearity in the spectra, changes of the effective light path, experimental conditions, and instrument variations.[81718192021]To develop the best calibration model, the NIR spectroscopic models were constructed by PLS with the OPUS software and ANN with NeuroSolutions for Excel 6.30 (NeuroDimension Inc., Gainesville, FL, USA), respectively.
RESULTS AND DISCUSSION
Development of a linear prediction model by partial least squares
Among the 100 samples, 80 samples were divided randomly into calibration set to establish the calibration model and the remaining 20 samples were into validation set to validate the PLS model. To build the best PLS model, the calibration set must exhibit the full range of potential variations, so when divided into calibration set and validation set, the samples with maximum and minimum contents of puerarin were assigned in the calibration set.In PLS method, it is well-known that the spectral pretreatment methods, spectral range, and the number of PLS factors are critical parameters for the optimum model, so the best parameter was evaluated and selected based on the correlation coefficients of the calibration set (Rcal), the root mean square errors of cross-validation (RMSECV), and the residual predictive deviation (RPD), and the model with the best prediction ability was with highest Rcal and RPD as well as lowest RMSECV.According to the above criteria, the calibration models were optimized individually by considering spectral pretreatment methods, spectral range, and number of model factors. As shown in Table 1, 11 spectral pretreatment methods were selected to eliminate or standardize the impact of the spectra and reduce baseline offsets and spectral noise. When compared the parameters of the model, the spectral pretreatment method “straight line subtraction (SLS)” had better performance than other methods. The spectra preprocessed by SLS are presented in Figure 2. Then, different spectral ranges were compared as listed in Table 2, so the best spectral region for the PLS model was in the range from 11995.2 to 4597.5 cm−1 which had more efficient information than other spectra range. At last, the number of PLS factors was selected as shown in Figure 3; when number of model factors <7, the RMSECV value declines sharply and the r2 increases significantly with the increment of PLS factors. However, when more than 7, the value of RMSECV and R2 became gradually, so including more factors in PLS model will not make it set better, but rupture “over-fitting”. Therefore, the optimal number of PLS factors was 7.
Table 1
Parameters of different spectral pretreatment methods
Figure 2
Results of pretreated spectral (a) raw near-infrared spectra, (b) pretreated by straight line subtraction
Table 2
Parameters of different spectral regions
Figure 3
Parameters of different model factors (Up: Root mean square errors of cross-validation factor, Down: r2-factor)
Parameters of different spectral pretreatment methodsResults of pretreated spectral (a) raw near-infrared spectra, (b) pretreated by straight line subtractionParameters of different spectral regionsParameters of different model factors (Up: Root mean square errors of cross-validation factor, Down: r2-factor)
Development of a nonlinear prediction model by artificial neural networks
Unlike sample assign method of the PLS model, samples in the ANN model were divided into three groups. There were 70 samples in training set, which was used to establish the calibration model, and 20 samples were selected randomly as test set to test the calibration models. The validation set including 43 samples was used to validate the ANN models.In the study, the ANN model was built based on the multi-layer perceptron network, which was a feed-forward network trained by back propagation. The back-propagation algorithm, the most widely used in ANN, refers to a process which was first performed on an input pattern to calculate the net error from the output to the hidden neurons, then the output error was used to change the weight values in the backward direction, so the error was slowly propagated backward through the hidden layers to minimize the error of the model.[2223]The ANN structure was designed individually in spectral pretreatment methods, parameters of network topology, number of hidden neurons, and times of epoch, etc., and the optimum structure was selected in a step-by-step manner based on the evaluation parameters: Higher R and lower mean square error (MSE).[2223]As shown in Table 3, the parameters of the best ANN model in this work were that vector normalization as spectral pretreatment method, 7 as the number of hidden neurons, TanhAxon function as the transfer function, and 1.1, 0.7, 0.1 and 0.7 as step size and momentum of hidden and output layer neurons, respectively. The times of epoch were 800 as recommended by NeuroSolutions 6.30 to prevent over fitting or early stopping.
Table 3
The information of the artificial neural networks models
The information of the artificial neural networks models
Evaluation and validation for the partial least squares and artificial neural networks models
The validation set was used to validate the predictive ability of the optimized PLS model as linear regression model and the optimized ANN model as nonlinear regression model. The results showed than the root MSE of prediction (RMSEP) for the PLS and ANN model was 0.0396 and 0.0365 and correlation coefficients (r2) was 97.79% and 98.47%, respectively. The T and F tests indicated there was no significant difference between the predicted result and the reference value, so the established two models were robust, accurate, and repeatable for monitoring the percolation and concentration process.Compared to the PLS model, both RMSEP and r2 in the ANN models were much better, so the ANN model had more advantages for the rapid analysis of puerarin and routine screening in the quality control of the XinMaiJia preparation.
CONCLUSIONS
In this study, NIR spectroscopy was developed for the rapid analysis of puerarin in the percolation and concentration process of Gegen. By means of PLS and ANN regression, robust NIR models were built with high correlations and low error of predictions. Compared with the HPLC methods, NIR realized fast acquisition, no sample preparation, and nondestructive, so it can siginificantly save manpower and time. As a PAT tool, NIR could be carried out in off-line fast analysis or on-line quality control in the quality control of the XinMaiJia preparation, and this method will significantly improve the efficiency of quality control in TCM.
Financial support and sponsorship
Nil.
Conflicts of interest
There are no conflicts of interest.Wan Guangrui
ABOUT AUTHOR
Wan Guangrui, is a Professor at the School of Pharmacy, Xinxiang Medical University, Xinxiang, Henan Province, PR China. His research interest is in the area of pharmacodynamic study of the active ingredients of Traditional Chinese Medicine (TCM). In the past years, he mainly studied the TCM preparation XinMaiJia that effectively inhibit the occurrence and development of atherosclerosis.
Authors: N Heigl; A Greiderer; C H Petter; O Kolomiets; H W Siesler; M Ulbricht; G K Bonn; C W Huck Journal: Anal Chem Date: 2008-10-11 Impact factor: 6.986