| Literature DB >> 31137688 |
Lan Sun1, Chang Hsiung2, Valton Smith3.
Abstract
Recent developments in compact near infrared (NIR) instruments, including both handheld and process instruments, have enabled easy and affordable deployment of multiple instruments for various field and online or inline applications. However, historically, instrument-to-instrument variations could prohibit success when applying calibration models developed on one instrument to additional instruments. Despite the usefulness of calibration transfer techniques, they are difficult to apply when a large number of instruments and/or a large number of classes are involved. Direct model transferability was investigated in this study using miniature near-infrared (MicroNIR™) spectrometers for both classification and quantification problems. For polymer classification, high cross-unit prediction success rates were achieved with both conventional chemometric algorithms and machine learning algorithms. For active pharmaceutical ingredient quantification, low cross-unit prediction errors were achieved with the most commonly used partial least squares (PLS) regression method. This direct model transferability is enabled by the robust design of the MicroNIR™ hardware and will make deployment of multiple spectrometers for various applications more manageable.Entities:
Keywords: MicroNIR™; NIR; PLS; PLS-DA; SIMCA; SVM; TreeBagger; calibration transfer; direct model transferability; hier-SVM
Mesh:
Substances:
Year: 2019 PMID: 31137688 PMCID: PMC6571657 DOI: 10.3390/molecules24101997
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Spectra of example polymer samples by three instruments: (a) raw spectra; (b) preprocessed spectra by Savitzky-Golay 1st derivative (5 smoothing points and 3rd polynomial order) and standard normal variate (SNV).
Prediction success rates (%) of polymer classification.
| Algorithm | Unit# Kit# for Modeling | Unit# Kit# for Testing | |||||
|---|---|---|---|---|---|---|---|
| Unit1 K1 | Unit2 K1 | Unit3 K1 | Unit1 K2 | Unit2 K2 | Unit3 K3 | ||
| PLS-DA | Unit 1 K1 | 99.64 | 89.68 | 83.99 | 95.87 | 88.91 | 82.39 |
| Unit 2 K1 | 91.96 | 100 | 81.52 | 90.87 | 99.57 | 84.49 | |
| Unit 3 K1 | 76.74 | 75.32 | 100 | 75.07 | 73.12 | 99.20 | |
| SIMCA | Unit 1 K1 | 100 | 99.42 | 96.45 | 99.35 | 97.32 | 96.81 |
| Unit 2 K1 | 98.77 | 100 | 95.43 | 97.68 | 99.93 | 95.80 | |
| Unit 3 K1 | 96.30 | 93.29 | 100 | 96.09 | 92.17 | 100 | |
| TreeBagger | Unit 1 K1 | 100 | 97.11 | 95.80 | 98.04 | 95.94 | 96.30 |
| Unit 2 K1 | 97.83 | 100 | 93.55 | 94.49 | 98.26 | 96.16 | |
| Unit 3 K1 | 95.14 | 98.41 | 100 | 96.09 | 98.84 | 98.84 | |
| SVM | Unit 1 K1 | 100 | 99.86 | 97.54 | 98.26 | 97.90 | 97.83 |
| Unit 2 K1 | 98.70 | 100 | 97.03 | 94.93 | 98.26 | 98.26 | |
| Unit 3 K1 | 97.83 | 96.18 | 100 | 96.30 | 95.00 | 99.57 | |
| Hier-SVM | Unit 1 K1 | 100 | 100 | 97.97 | 97.83 | 97.83 | 97.25 |
| Unit 2 K1 | 99.93 | 100 | 98.26 | 98.26 | 99.13 | 99.13 | |
| Unit 3 K1 | 99.13 | 100 | 100 | 96.88 | 97.83 | 100 | |
Number of missed predictions of polymer classification in the format of number of missed predictions/total number of predictions.
| Algorithm | Unit# Kit# for Modeling | Unit# Kit# for Testing | |||||
|---|---|---|---|---|---|---|---|
| Unit1 K1 | Unit2 K1 | Unit3 K1 | Unit1 K2 | Unit2 K2 | Unit3 K3 | ||
| PLS-DA | Unit 1 K1 | 1/276 | 143/1386 | 221/1380 | 57/1380 | 153/1380 | 243/1380 |
| Unit 2 K1 | 111/1380 | 0/277 | 255/1380 | 126/1380 | 6/1380 | 214/1380 | |
| Unit 3 K1 | 321/1380 | 342/1386 | 0/276 | 344/1380 | 371/1380 | 11/1380 | |
| SIMCA | Unit 1 K1 | 0/276 | 8/1386 | 49/1380 | 9/1380 | 37/1380 | 44/1380 |
| Unit 2 K1 | 17/1380 | 0/277 | 63/1380 | 32/1380 | 1/1380 | 58/1380 | |
| Unit 3 K1 | 51/1380 | 93/1386 | 0/276 | 54/1380 | 108/1380 | 0/1380 | |
| TreeBagger | Unit 1 K1 | 0/276 | 40/1386 | 58/1380 | 27/1380 | 56/1380 | 51/1380 |
| Unit 2 K1 | 30/1380 | 0/277 | 89/1380 | 76/1380 | 24/1380 | 53/1380 | |
| Unit 3 K1 | 67/1380 | 22/1386 | 0/276 | 54/1380 | 16/1380 | 16/1380 | |
| SVM | Unit 1 K1 | 0/276 | 2/1386 | 34/1380 | 24/1380 | 29/1380 | 30/1380 |
| Unit 2 K1 | 18/1380 | 0/277 | 41/1380 | 70/1380 | 24/1380 | 24/1380 | |
| Unit 3 K1 | 30/1380 | 53/1386 | 0/276 | 51/1380 | 69/1380 | 6/1380 | |
| Hier-SVM | Unit 1 K1 | 0/276 | 0/1386 | 28/1380 | 30/1380 | 30/1380 | 38/1380 |
| Unit 2 K1 | 1/1380 | 0/277 | 24/1380 | 24/1380 | 12/1380 | 12/1380 | |
| Unit 3 K1 | 12/1380 | 0/1386 | 0/276 | 43/1380 | 30/1380 | 0/1380 | |
Figure 2Spectra of samples with the highest and the lowest active pharmaceutical ingredient (API) concentrations measured by three instruments: (a) selected raw spectra based on the acetylsalicylic acid (ASA) concentration; (b) selected preprocessed spectra based on the ASA concentration by Savitzky-Golay 1st derivative (5 smoothing points and 2nd polynomial order) and SNV; (c) selected raw spectra based on the ascorbic acid (ASC) concentration; (d) selected preprocessed spectra based on the ASC concentration by Savitzky-Golay 2nd derivative (7 smoothing points and 3rd polynomial order) and SNV; (e) selected raw spectra based on the caffeine (CAF) concentration; (f) selected preprocessed spectra based on the CAF concentration by Savitzky-Golay 1st derivative (17 smoothing points and 3rd polynomial order) and SNV.
The normalized root mean square error of prediction (NRMSEP, %) for ASA.
| Test Sets | No Correction | Bias | PDS | GLS | ||
|---|---|---|---|---|---|---|
| Unit 1 | Unit 2 | Unit 3 | Unit 1 | Unit 1 | Unit 1 | |
| Unit 1 | 3.4 | 3.5 | 3.5 | - | - | - |
| Unit 2 | 4.0 | 4.2 | 3.9 | 3.7 | 3.3 | 3.6 |
| Unit 3 | 4.3 | 4.5 | 4.2 | 4.1 | 3.5 | 4.4 |
The normalized root mean square error of prediction (NRMSEP, %) for ASC.
| Test Sets | No Correction | Bias | PDS | GLS | ||
|---|---|---|---|---|---|---|
| Unit 1 | Unit 2 | Unit 3 | Unit 1 | Unit 1 | Unit 1 | |
| Unit 1 | 3.0 | 2.6 | 2.7 | - | - | - |
| Unit 2 | 2.7 | 2.7 | 2.6 | 2.3 | 3.5 | 2.6 |
| Unit 3 | 2.5 | 2.5 | 2.7 | 2.2 | 3.1 | 2.4 |
The normalized root mean square error of prediction (NRMSEP, %) for CAF.
| Test Sets | No Correction | Bias | PDS | GLS | ||
|---|---|---|---|---|---|---|
| Unit 1 | Unit 2 | Unit 3 | Unit 1 | Unit 1 | Unit 1 | |
| Unit 1 | 4.0 | 4.6 | 3.7 | - | - | - |
| Unit 2 | 4.1 | 4.7 | 4.2 | 4.2 | 4.3 | 3.2 |
| Unit 3 | 4.2 | 4.9 | 4.0 | 4.1 | 6.2 | 3.9 |
Figure 3Predicted values versus reference values using models developed on Unit 1: (a) validation sets by Unit 1 and Unit 2 for ASA prediction; (b) validation sets by Unit 1 and Unit 3 for ASA prediction; (c) validation sets by Unit 1 and Unit 2 for ASC prediction; (d) validation sets by Unit 1 and Unit 3 for ASC prediction; (e) validation sets by Unit 1 and Unit 2 for CAF prediction; (f) validation sets by Unit 1 and Unit 3 for CAF prediction. The corresponding bias, R2 for prediction, and root mean square error for prediction (RMSEP) are presented in each plot.
Figure 4The Bland-Altman plots comparing the cross-unit prediction results and the same-unit prediction results using models developed on Unit 1: (a) validation sets by Unit 1 and Unit 2 for ASA prediction; (b) validation sets by Unit 1 and Unit 3 for ASA prediction; (c) validation sets by Unit 1 and Unit 2 for ASC prediction; (d) validation sets by Unit 1 and Unit 3 for ASC prediction; (e) validation sets by Unit 1 and Unit 2 for CAF prediction; (f) validation sets by Unit 1 and Unit 3 for CAF prediction.
Figure 5Reduced Q residuals versus reduced Hotelling’s T2 for models developed on Unit 1: (a) validation sets by Unit 1 and Unit 2 for ASA prediction; (b) validation sets by Unit 1 and Unit 3 for ASA prediction; (c) validation sets by Unit 1 and Unit 2 for ASC prediction; (d) validation sets by Unit 1 and Unit 3 for ASC prediction; (e) validation sets by Unit 1 and Unit 2 for CAF prediction; (f) validation sets by Unit 1 and Unit 3 for CAF prediction.
Polymer materials used for the classification study.
| No. | Polymer Type | No. | Polymer Type |
|---|---|---|---|
| 1 | PolyStyrene-General Purpose | 24 | Polyethylene-High Density |
| 2 | PolyStyrene-High Impact | 25 | Polypropylene-Copolymer |
| 3 | Styrene-Acrylonitrile (SAN) | 26 | Polypropylene-Homopolymer |
| 4 | ABS-Transparent | 27 | Polyaryl-Ether |
| 5 | ABS-Medium Impact | 28 | Polyvinyl Chloride-Flexible |
| 6 | ABS-High Impact | 29 | Polyvinyl Chloride-Rigid |
| 7 | Styrene Butadiene | 30 | Acetal Resin-Homopolymer |
| 8 | Acrylic | 31 | Acetal Resin-Copolymer |
| 9 | Modified Acrylic | 32 | Polyphenylene Sulfide |
| 10 | Cellulose Acetate | 33 | Ethylene Vinyl Acetate |
| 11 | Cellulose Acetate Butyrate | 34 | Urethane Elastomer (Polyether) |
| 12 | Cellulose Acetate Propionate | 35 | Polypropylene-Flame Retardant |
| 13 | Nylon-Transparent | 36 | Polyester Elastomer |
| 14 | Nylon-Type 66 | 37 | ABS-Flame Retardant |
| 15 | Nylon-Type 6 (Homopolymer) | 38 | Polyallomer |
| 16 | Thermoplastic Polyester (PBT) | 39 | Styrenic Terpolymer |
| 17 | Thermoplastic Polyester (PETG) | 40 | Polymethyl Pentene |
| 18 | Phenylene Oxide | 41 | Talc-Reinforced Polypropylene |
| 19 | Polycarbonate | 42 | Calcium Carbonate-Reinforced Polypropylene |
| 20 | Polysulfone | 43 | Nylon (Type 66–33% Glass) |
| 21 | Polybutylene | 44 | Thermoplastic Rubber |
| 22 | Ionomer | 45 | Polyethylene (Medium Density) |
| 23 | Polyethylene-Low Density | 46 | ABS-Nylon Alloy |