| Literature DB >> 27029624 |
Lan Sun1, Chang Hsiung2, Christopher G Pederson2, Peng Zou2, Valton Smith2, Marc von Gunten2, Nada A O'Brien2.
Abstract
Near-infrared spectroscopy as a rapid and non-destructive analytical technique offers great advantages for pharmaceutical raw material identification (RMID) to fulfill the quality and safety requirements in pharmaceutical industry. In this study, we demonstrated the use of portable miniature near-infrared (MicroNIR) spectrometers for NIR-based pharmaceutical RMID and solved two challenges in this area, model transferability and large-scale classification, with the aid of support vector machine (SVM) modeling. We used a set of 19 pharmaceutical compounds including various active pharmaceutical ingredients (APIs) and excipients and six MicroNIR spectrometers to test model transferability. For the test of large-scale classification, we used another set of 253 pharmaceutical compounds comprised of both chemically and physically different APIs and excipients. We compared SVM with conventional chemometric modeling techniques, including soft independent modeling of class analogy, partial least squares discriminant analysis, linear discriminant analysis, and quadratic discriminant analysis. Support vector machine modeling using a linear kernel, especially when combined with a hierarchical scheme, exhibited excellent performance in both model transferability and large-scale classification. Hence, ultra-compact, portable and robust MicroNIR spectrometers coupled with SVM modeling can make on-site and in situ pharmaceutical RMID for large-volume applications highly achievable.Entities:
Keywords: MicroNIR; NIR; Near-infrared spectroscopy; RMID; SVM; large-scale classification; model transferability; raw material identification; support vector machine
Mesh:
Substances:
Year: 2016 PMID: 27029624 PMCID: PMC4871175 DOI: 10.1177/0003702816638281
Source DB: PubMed Journal: Appl Spectrosc ISSN: 0003-7028 Impact factor: 2.388
Figure 1.MicroNIR spectrometer equipped with a vial holder and tethered to a rugged 7” Windows 8.1 tablet for pharmaceutical raw material identification.
Prediction accuracy as a function of number of spectrometers to build training models.
| Classifier | nTU1 | nTU2 | nTU3 | nTU4 | nTU5 | nTU6 | # Units for 100% |
|---|---|---|---|---|---|---|---|
| SIMCA | 97.40 | 98.85 | 99.50 | 99.81 | 99.96 | 100 | 6 |
| PLS-DA | 99.66 | 99.68 | 99.90 | 99.96 | 99.99 | 100 | 6 |
| LDA | 86.41 | 97.84 | 98.83 | 99.98 | 100 | 100 | 5 |
| QDA | 86.47 | 97.88 | 98.85 | 99.98 | 100 | 100 | 5 |
| SVM-rbf | 99.66 | 100 | 100 | 100 | 100 | 100 | 2 |
| SVM-linear | 100 | 100 | 100 | 100 | 100 | 100 | 1 |
| Hier-SVM- linear | 100 | 100 | 100 | 100 | 100 | 100 | 1 |
Figure 2.PCA-SVM plot for 19 pharmaceutical compounds based on the SVM model.
Figure 3.Model validation using different lots of material and different MicroNIR units. (a) Same-unit-same-lot (SUSL) validation; (b) same-unit-cross-lot (SUXL) validation; (c) cross-unit-cross-lot (XUXL) validation. Unit# denotes the spectrometer number. T#P# denotes the spectrometer number for training (T#) and the spectrometer number for testing or prediction (P#).
Model validation using different lots of materials and different MicroNIR spectrometers.
| Classifier | SUSL | SUXL | XUXL | |||
|---|---|---|---|---|---|---|
| AVG[ | STD[ | AVG | STD | AVG | STD | |
| SIMCA | 100 | 0 | 90.93 | 0.22 | 83.92 | 8.92 |
| PLS-DA | 99.98 | 0.06 | 93.94 | 1.03 | 92.69 | 1.94 |
| LDA | 98.49 | 2.99 | 93.52 | 1.32 | 85.64 | 7.93 |
| QDA | 98.49 | 2.99 | 93.55 | 1.25 | 85.73 | 7.91 |
| SVM-rbf | 99.72 | 0.47 | 98.38 | 2.33 | 98.01 | 2.26 |
| SVM-linear | 100 | 0 | 99.54 | 1.14 | 99.88 | 0.65 |
AVG: Average
STD: Standard deviation
Figure 4.NIR spectra of the training set from 253 pharmaceutical compounds. (a) Raw spectra; (b) pretreated spectra.
Comparison of different models for classification of the 253 pharmaceutical compounds.
| Classifier | No. of spectra | Prediction success rate (%) | No. of missed predictions |
|---|---|---|---|
| SIMCA | 2566 | 97.54 | 63 |
| PLS-DA | 2566 | 85.23 | 379 |
| LDA | 2566 | 99.61 | 10 |
| QDA | 2566 | 99.73 | 7 |
| SVM-rbf | 2566 | 85.78 | 365 |
| SVM-linear | 2566 | 96.57 | 88 |
| hier-SVM-linear | 2566 | 100 | 0 |
Figure 5.Estimation probabilities of class membership.
Model validation using the large-scale classification model to predict samples from different sources with different MicroNIR units.
| Classifier | T[ | T-Unit7 P-Unit2 | T-Unit7 P-Unit3 | T-Unit7 P-Unit4 | T-Unit7 P-Unit5 | T-Unit7 P-Unit6 | AVG[ | STD[ |
|---|---|---|---|---|---|---|---|---|
| SIMCA | 84.84 | 76.10 | 84.75 | 84.92 | 85.40 | 83.90 | 83.32 | 3.57 |
| PLS-DA | 85.71 | 85.89 | 85.00 | 85.71 | 85.71 | 85.69 | 85.62 | 0.31 |
| LDA | 33.02 | 31.97 | 57.75 | 45.87 | 67.78 | 70.57 | 51.16 | 16.86 |
| QDA | 33.10 | 32.13 | 58.33 | 47.78 | 68.41 | 70.73 | 51.75 | 16.91 |
| SVM-rbf | 91.67 | 92.08 | 93.42 | 90.32 | 90.79 | 86.67 | 90.83 | 2.30 |
| SVM-linear | 92.30 | 92.95 | 95.33 | 92.54 | 92.38 | 87.80 | 92.22 | 2.44 |
| hier-SVM-linear | 94.92 | 93.03 | 97.92 | 95.71 | 95.56 | 92.85 | 95.00 | 1.89 |
T: Training
P: Prediction
AVG: Average
STD: Standard deviation