Literature DB >> 30250839

Raman Spectroscopy for Pharmaceutical Quantitative Analysis by Low-Rank Estimation.

Xiangyun Ma1, Xueqing Sun1, Huijie Wang1, Yang Wang1, Da Chen2, Qifeng Li1.   

Abstract

Raman spectroscopy has been widely used for quantitative analysis in biomedical and pharmaceutical applications. However, the signal-to-noise ratio (SNR) of Raman spectra is always poor due to weak Raman scattering. The noise in Raman spectral dataset will limit the accuracy of quantitative analysis. Because of high correlations in the spectral signatures, Raman spectra have the low-rank property, which can be used as a constraint to improve Raman spectral SNR. In this paper, a simple and feasible Raman spectroscopic analysis method by Low-Rank Estimation (LRE) is proposed. The Frank-Wolfe (FW) algorithm is applied in the LRE method to seek the optimal solution. The proposed method is used for the quantitative analysis of pharmaceutical mixtures. The accuracy and robustness of Partial Least Squares (PLS) and Support Vector Machine (SVM) chemometric models can be improved by the LRE method.

Entities:  

Keywords:  Raman spectroscopy; chemometric model; low-rank estimation; pharmaceuticals; quantitative analysis

Year:  2018        PMID: 30250839      PMCID: PMC6139353          DOI: 10.3389/fchem.2018.00400

Source DB:  PubMed          Journal:  Front Chem        ISSN: 2296-2646            Impact factor:   5.221


Intruduction

Raman spectroscopy is one of the vibrational spectroscopic techniques that has been commonly applied in quantitative analysis (Strachan et al., 2004; Numata and Tanaka, 2011; Ai et al., 2018). Being non-invasive and marker-free, it has been proved to be an effective tool in the field of physics, chemistry, and biology (Graf et al., 2007; Neugebauer et al., 2010; Ryu et al., 2012; Tan et al., 2017). Coupled with chemometrics methods, it has the advantages of high sensitivity and resolution in biomedical and pharmaceutical quantitative analysis. The quantitative analysis based on Raman spectra at low signal-to-noise ratio (SNR) levels is still problematic (Li, 2008; Chen et al., 2014). Generally, a Raman spectrum can be divided into two parts: the signal containing desired information and the noise containing unwanted information. Basically, the latter may include photon-shot noise, sample-generated noise, instrument-generated noise, computationally generated noise, and externally generated noise (Pelletier, 2003). Due to the inherently weak property of Raman scattering, the noise will lead to a deterioration in SNR of Raman spectra, affecting the accuracy of quantitative analysis. For instance, data of online monitoring in limited integration time always tend to be inaccurate (Han et al., 2017; Virtanen et al., 2017). Some approaches of preprocessing Raman spectra to minimize this problem have been proposed (Clupek et al., 2007; Ma et al., 2017), such as first and second derivatives (Johansson et al., 2010), polynomials fitting (Vickers et al., 2001), Fourier transform (Pelletier, 2003), and wavelet transform (Chen et al., 2011; Li et al., 2013). Among these approaches, wavelet transform can extract peak information and remove background noise, which has been the most widely used preprocessing method (Du et al., 2006). However, the processing of Raman spectra can be further optimized to improve the accuracy of pharmaceutical quantitative analysis. In this paper, we introduce a simple and feasible Raman spectroscopic analysis method based on Low-Rank Estimation (LRE). Our experiments are implemented based on the Partial Least Squares (PLS) and Support Vector Machine (SVM) chemometric models. The aim of this experimental design is to enhance the quality of pharmaceutical quantitative analysis by significantly improving the accuracy and robustness of the chemometric models used.

Materials and methods

Pharmaceutical substances (norfloxacin, penicillin potassium, and sulfamerazine) were purchased from Dalian Meilun Biotechnology Co., Ltd (China) and used without further purification. These substances were well blended in different proportions, pulverized, and compressed into three-component tablets. Other physical properties of these tablets (such as density, height, and diameter) were kept completely consistent. Mixed solutions were also prepared with methanol and ethanol in 100 different proportions. Raman spectral data were recorded by using a Renishaw inVia Raman spectrometer (Gloucestershire, U.K.). This system consisted of a 785-nm diode laser (~40 mW) and a 1,200 l/mm grating. In this work, the integration times of Raman spectra were 0.1–0.5 s. PLS and SVM regression methods were used to model and predict pharmaceutical concentration of the samples based on their Raman spectra. Eighty-five samples were selected as the training set and the remaining 15 samples as the testing set, based on Kennard-Stone (KS) algorithm. The parameters of PLS and SVM models were tuned based on grid search algorithm. The optimal parameters were obtained by k-folder cross-validation. The accuracy and robustness of above-mentioned chemometric models were further improved by conventional Wavelet Transform (WT) method and Low-Rank Estimation (LRE) method, respectively. In the WT method, the signals were split into different frequency components to remove simultaneously low-frequency background and high-frequency noise components. The Symlet wavelet filter (sym11, scale = 7) was optimally selected to provide the sharpest peaks associated with the analytes of interest. The LRE method was originally developed by our group in three-dimension to speed up Raman spectral imaging (Li et al., 2018). In this study, we used the LRE method in two-dimension to process the observed Raman spectral data matrix. In this method, the alternating least squares (ALS) algorithm is used to estimate the largest singular value of the matrix (Kroonenberg and Leeuw, 1980; Halko et al., 2011). The matrix estimation has two sets of parameters. Each set is estimated in turn by solving a least-squares problem and holding the other set fixed. After both sets have been estimated once, the procedure is repeated until convergence. The Frank-Wolfe (FW) algorithm is applied in the LRE method to seek the optimal solution. Recently, the FW algorithm has been popularly used in machine learning due to its characteristics of simple implementation and modest memory requirement (Jaggi, 2013; Guo et al., 2017). The steps of the LRE method are detailed in Table 1.
Table 1

The detail steps of the LRE method.

Algorithm: The algorithm for the LRE method
Input: the raw Raman spectral data matrix A;        the maximum number of iteration N, ranging from 5 to 20;        the low-rank constraint factor m, ranging from 0.01 to 0.001;
1: Initialize X0 = 0. X0 is an initial solution of the algorithm.
2: for i = 0,1,…,N do, ai represents the i-th iteration of any variable a.
3: Compute the search direction s, si+1 = ALS(AXi)
4: Compute the step length r, ri+1=argminr[0,1](A-(Xi+r(si+1-Xi)))
5: Xi+1 = (1−ri+1)Xi+ri+1si+1
6: stopping criterion: ALS(Xi+1)si+1>m
7: end for
8: The last iteration of X is the final solution of the LRE method.
Output X
The detail steps of the LRE method. Through being processed by the LRE method, the low-rank training and testing sets can be obtained from the raw training and testing data matrices, respectively. In general, an abundant data matrix can enhance the effect of the LRE method. When a number of testing spectral data is small, the training spectral data can be added to the raw testing data matrix as a supplement. The added spectral data are only used to strengthen the impact of the LRE method. The conventional regression models are applied to the low-rank training and testing sets to perform quantitative Raman analysis.

Results and discussion

Noise-free Raman spectral dataset is a low-rank matrix. In Figure 1, the red line shows the ranks of Raman spectral data matrix in an integration time of 1 s, suggesting that the Raman spectra have low-rank property when the noise is low. The low-rank property comes from high correlations among spectral signatures. Each spectral signature can be represented by a linear combination of a small number of pure spectral endmembers, which is known as linear spectral mixing model (Iordache et al., 2011; Golbabaee and Vandergheynst, 2012). The blue and green plots show singular values of the matrix in a shorter integration time, which implies that the ranks of Raman spectra increase with decreasing integration time owing to a greater proportion of the noise. The low-rank property can be used as a constraint to improve the accuracy of pharmaceutical quantitative analysis (Yi et al., 2017).
Figure 1

The ranks of the Raman spectra in different integration time.

The ranks of the Raman spectra in different integration time. Raw Raman spectra recorded for three pure pharmaceutical substances are shown in Figure 2A. Thirty Raman spectra obtained from three-component tablets with different proportions are shown in Figure 2B. It is clear that each pharmaceutical component has its own special characteristic peaks. However, their respective Raman bands are overlapped. Particularly, Raman signals of lower-concentration component are almost swamped and covered by those of higher- concentration one, which represents a common problem in practice for biomedical and pharmaceutical quantitative analysis. For clarity, the Raman spectra in Figure 2B were collected in an integration time of 5 s, which have a high SNR. In our experiments, the integration times of Raman spectra are in the range of 0.1–0.5 s, which is over 10 times shorter than that shown in Figure 2. Under this condition, the spectral signals are weaker and have poor SNR.
Figure 2

Raman spectra of (A) pure pharmaceuticals substances: (a) penicillin potassium, (b) norfloxacin, and (c) sulfamerazine, (B) their three component tablets containing different proportions in the integration time of 5s.

Raman spectra of (A) pure pharmaceuticals substances: (a) penicillin potassium, (b) norfloxacin, and (c) sulfamerazine, (B) their three component tablets containing different proportions in the integration time of 5s. The comparisons of predicted and actual values for norfloxacin are illustrated in Figure 3, which indicates the advantage of the LRE method for pharmaceutical quantitative analysis. The coefficient of determination (R2) and root mean square error (RMSE) of the chemometric models used for quantitative analysis of three pharmaceutical components are listed in Table 2. The unsatisfactory results of the raw spectral data show that the pre-treatment of Raman spectra is necessary. In this study, the LRE method and conventional wavelet transform (WT) method are applied to improve the accuracy of quantitative analysis. As shown in Figure 3, both the conventional WT and LRE methods can improve the predicted results. However, it is clear that the LRE method has a better performance than the conventional WT method in enhancing the prediction accuracy for pharmaceutical quantitative analysis.
Figure 3

Actual vs. predicted values of norfloxacin based on the PLS (A) and SVM (B) model, where the black solid line are diagonals. Raw Raman spectra are collected in an integration time of 0.2 s.

Table 2

R2 and RMSE values of the chemometric models for three pharmaceutical components.

NorfloxacinPenicillin potassiumSulfamerazine
ModelMethodsR2RMSER2RMSER2RMSE
PLSRaw0.75040.07800.86920.12180.73230.0608
WT0.85980.06420.95480.09740.88620.0376
LRE0.95530.02590.98480.05220.96090.0225
SVMRaw0.82970.10970.84600.12640.81350.0679
WT0.88080.08410.91250.08210.89070.0444
LRE0.95580.04680.97490.07550.97010.0397
Actual vs. predicted values of norfloxacin based on the PLS (A) and SVM (B) model, where the black solid line are diagonals. Raw Raman spectra are collected in an integration time of 0.2 s. R2 and RMSE values of the chemometric models for three pharmaceutical components. As shown in Table 2, the raw Raman spectra are all collected in an integration time of 0.2 s. The LRE method is significantly better than the conventional WT method in terms of R2 and RMSE for all components. Quantitation limit (QL) for each pharmaceutical substance is calculated. By definition in ICH guideline (ICH Harmonised Tripartite Guideline, 2005), QL is the lowest concentration of an analyte that can be quantitatively determined with suitable precision and accuracy. It is most often determined as 10 times the standard deviation of the noise from the blank. The LRE method can be used reliably with more than a 15-fold improvement of the practicalQL. Through being processed by the LRE method, QL values for norfloxacin, penicillin potassium, and sulfamerazine are 0.17, 0.13, and 0.19%, respectively. These results reveal that the LRE method can simultaneously improve the performance of quantitative analysis for pharmaceutical multi-component mixtures. Table 3 lists R2 and RMSE values of the chemometric models used for quantitative analysis of norfloxacin in different integration times. The integration times of raw Raman spectra are 0.1, 0.2, and 0.5 s. Raman spectrum's SNR is always proportional to integration time. For evaluating spectral quality, the SNR is defined as the ratio of the peak value of the signal to the root mean square of the noise. For integration times of 0.1, 0.2, and 0.5 s, the average SNR of Raman spectra are 2.47, 3.66, and 6.21, respectively. R2 and RMSE values of the chemometric models for methanol in different integration times are listed Table 4. The average SNR of the Raman spectra in the integration times of 0.1, 0.2, and 0.5 s are 2.13, 3.34, and 5.89, respectively.
Table 3

R2 and RMSE values of the chemometric models for norfloxacin in different integration times.

0.1 s0.2 s0.5 s
ModelMethodsR2RMSER2RMSER2RMSE
Raw0.72860.09390.76060.07330.87310.0476
PLSWT0.85030.06300.87470.06270.96100.0446
LRE0.94960.02960.96260.02360.97840.0229
Raw0.78030.09590.81160.08940.91360.0781
SVMWT0.86730.09760.89870.07890.92510.0668
LRE0.95880.04490.96650.02290.97640.0210
Table 4

R2 and RMSE values of the chemometric models for methanol in different integration times.

0.1 s0.2 s0.5 s
ModelMethodsR2RMSER2RMSER2RMSE
Raw0.70781.99800.80861.46550.84581.3075
PLSWT0.83110.65510.87760.57500.91780.4553
LRE0.90170.57940.93010.46920.94010.4117
Raw0.71580.86310.83820.76690.88130.6148
SVMWT0.83610.70300.87010.62040.94280.4506
LRE0.92770.64170.96280.51120.97680.3964
R2 and RMSE values of the chemometric models for norfloxacin in different integration times. R2 and RMSE values of the chemometric models for methanol in different integration times. As shown in Tables 3, 4, the accuracy of the quantitative analysis raises with increasing SNR. According to R2 and RMSE values, it can be proved that the LRE method has a better performance than the conventional WT method. The degree of improvement is higher for low-SNR Raman spectra, which indicates that the LRE method has good noise immunity. In summary, all predicted results of the Raman spectra preprocessed by the LRE method are in good agreement with corresponding actual values. This method can be applied to improve the accuracy of quantitative analysis based on both PLS and SVM models. It is unrelated to the selection of chemometric models. The LRE method is not restricted by the state of a sample, meaning that it is applicable to both solid and liquid samples. Therefore, it can be regarded as an efficient tool with satisfactory prediction accuracy for pharmaceutical quantitative analysis, especially in the case of low-SNR spectra.

Conclusion

The LRE method has been successfully applied in Raman spectroscopy for pharmaceutical quantitative analysis. It is a simply and feasibly method that can improve the accuracy and robustness of PLS and SVM chemometric models. Our data show that the LRE method has advantages in improving R2 and RMSE for quantitative analysis of pharmaceutical multi-component mixtures, especially in the case of low-SNR spectra. The LRE method will promote the development of Raman spectroscopy in biomedical and pharmaceutical quantitative analysis.

Author contributions

XM participated in the lab work, supervising lab work, interpretation of data, drafting the manuscript, performing the statistical analysis. XS participated in the lab work, interpretation of data, drafting the manuscript, performing the statistical analysis. HW design of the work, interpretation of data. YW supervised the research, performing the statistical analysis. DC supervised the research, final approval of the version to be published. QL participated in the lab work, supervising lab work, final approval of the version to be published.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  11 in total

Review 1.  Quantitative analysis using Raman spectrometry.

Authors:  M J Pelletier
Journal:  Appl Spectrosc       Date:  2003-01       Impact factor: 2.388

2.  Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching.

Authors:  Pan Du; Warren A Kibbe; Simon M Lin
Journal:  Bioinformatics       Date:  2006-07-04       Impact factor: 6.937

3.  Spatially resolved Raman spectroscopy of single- and few-layer graphene.

Authors:  D Graf; F Molitor; K Ensslin; C Stampfer; A Jungen; C Hierold; L Wirtz
Journal:  Nano Lett       Date:  2007-02       Impact factor: 11.189

4.  Adaptive wavelet transform suppresses background and noise for quantitative analysis by Raman spectrometry.

Authors:  Da Chen; Zhiwen Chen; Edward Grant
Journal:  Anal Bioanal Chem       Date:  2011-02-18       Impact factor: 4.142

5.  Rapid qualitative and quantitative determination of food colorants by both Raman spectra and Surface-enhanced Raman Scattering (SERS).

Authors:  Yu-Jie Ai; Pei Liang; Yan-Xiong Wu; Qian-Min Dong; Jing-Bin Li; Yang Bai; Bi-Jie Xu; Zhi Yu; Dejiang Ni
Journal:  Food Chem       Date:  2017-09-07       Impact factor: 7.514

6.  Single-Drop Raman Imaging Exposes the Trace Contaminants in Milk.

Authors:  Zong Tan; Ting-Ting Lou; Zhi-Xuan Huang; Jing Zong; Ke-Xin Xu; Qi-Feng Li; Da Chen
Journal:  J Agric Food Chem       Date:  2017-07-19       Impact factor: 5.279

7.  Identification and differentiation of single cells from peripheral blood by Raman spectroscopic imaging.

Authors:  Ute Neugebauer; Joachim H Clement; Thomas Bocklitz; Christoph Krafft; Jürgen Popp
Journal:  J Biophotonics       Date:  2010-08       Impact factor: 3.207

8.  Recovery of Raman spectra with low signal-to-noise ratio using Wiener estimation.

Authors:  Shuo Chen; Xiaoqian Lin; Clement Yuen; Saraswathi Padmanabhan; Roger W Beuerman; Quan Liu
Journal:  Opt Express       Date:  2014-05-19       Impact factor: 3.894

9.  Improving the resolution and the throughput of spectrometers by a digital projection slit.

Authors:  Xiangyun Ma; Huijie Wang; Yang Wang; Da Chen; Wenliang Chen; Qifeng Li
Journal:  Opt Express       Date:  2017-09-18       Impact factor: 3.894

10.  Speeding up Raman spectral imaging by the three-dimensional low rank estimation method.

Authors:  Qifeng Li; Xiangyun Ma; Huijie Wang; Yang Wang; Xinwei Zheng; Da Chen
Journal:  Opt Express       Date:  2018-01-08       Impact factor: 3.894

View more
  2 in total

1.  Raman Mapping-Based Reverse Engineering Facilitates Development of Sustained-Release Nifedipine Tablet.

Authors:  Ningyun Sun; Liang Chang; Yi Lu; Wei Wu
Journal:  Pharmaceutics       Date:  2022-05-13       Impact factor: 6.525

2.  A Spatial-Temporal Analysis of Cellular Biopolymers on Leaf Blight-Infected Tea Plants Using Confocal Raman Microspectroscopy.

Authors:  Alireza Sanaeifar; Dapeng Ye; Xiaoli Li; Liubin Luo; Yu Tang; Yong He
Journal:  Front Plant Sci       Date:  2022-04-18       Impact factor: 6.627

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.