| Literature DB >> 32641779 |
F Korinth1, A S Mondol1, C Stiebing1, I W Schie1,2, C Krafft3, J Popp1,4.
Abstract
Shifted excitation Raman difference spectroscopy (SERDS) is a background correction method for Raman spectroscopy. Here, the difference spectra were directly used as input for SERDS-based classification after an optimization procedure to correct for photobleaching of the autofluorescence. Further processing included a principal component analysis to compensate for the reduced signal to noise ratio of the difference spectra and subsequent classification by linear discriminant analysis. As a case study 6,028 Raman spectra of single pollen originating from plants of eight different genera and four different growth habits were automatically recorded at excitation wavelengths 784 and 786 nm using a high-throughput screening Raman system. Different pollen were distinguished according to their growth habit, i.e. tree versus non-tree with an accuracy of 95.9%. Furthermore, all pollen were separated according to their genus, providing also insight into similarities based on their families. Classification results were compared using spectra reconstructed from the differences and raw spectra after state-of-art baseline correction as input. Similar sensitivities, specificities, accuracies and precisions were found for all spectra with moderately background. Advantages of SERDS are expected in scenarios where Raman spectra are affected by variations due to detector etaloning, ambient light, and high background.Entities:
Mesh:
Year: 2020 PMID: 32641779 PMCID: PMC7343813 DOI: 10.1038/s41598-020-67897-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Overview of the data processing steps. top row, spectra of one birch pollen grain; bottom row, spectra of one hazel pollen grain; (a) raw spectra at the three different excitation wavelengths; (b) raw difference spectra; (c) normalized and optimized difference spectra; the color codes of excitation wavelengths are indicated on top.
Figure 2Mean and standard deviation of difference spectra (784–786 nm) after normalization and optimization for tree pollen (a) and non-tree pollen (b), spectra reconstructed from differences after SNIP baseline correction of tree pollen (c) and non-tree pollen (d) and Raman spectra (λex = 784 nm) after SNIP baseline correction for tree pollen (e) and non-tree pollen (f). High wavenumber regions were multiplied by 0.5.
Figure 3Box and whiskers LDA score plot for classification of tree vs. non-tree. Red, predicted classes of the scores of the test data set; green, real classes of the scores of the test data set.
Confusion matrix of non-tree versus tree classification: real classes vs. predicted classes of the classified test spectra; sensitivity, specificity, accuracy and precision in %.
| Tree vs. non-tree | Real classes | |
|---|---|---|
| Non-tree | Tree | |
| Non-tree, predicted | 2,700 | 133 |
| Tree, predicted | 86 | 2,371 |
| Sensitivity | 96.9 | 94.7 |
| Specificity | 94.7 | 96.9 |
| Accuracy | 95.9 | 95.9 |
| Precision | 95.3 | 96.5 |
Figure 4LDA score plots for classification and separation of different tree pollen genera. (a) LD1–LD2 plane of the score plot; (b) LD2–LD3 plane of the score plot; (c) LD1–LD3 plane of the score plot; (d) 3D score plot.
Confusion matrix for classification and separation of tree pollen types.
| Real classes | ||||
|---|---|---|---|---|
| Alder | Hazel | Larch | Birch | |
| Alder, predicted | 947 | 107 | 0 | 19 |
| Hazel, predicted | 54 | 696 | 2 | 16 |
| Larch, predicted | 3 | 2 | 59 | 0 |
| Birch, predicted | 52 | 14 | 0 | 533 |
| Sensitivity | 89.7 | 85.0 | 96.7 | 93.8 |
| Specificity | 91.3 | 95.7 | 99.8 | 96.6 |
| Accuracy | 90.6 | 92.2 | 99.7 | 96.0 |
| Precision | 88.3 | 90.6 | 92.2 | 89.0 |
Upper part: real classes vs. predicted classes of the classified test spectra; lower part: sensitivity, specificity, accuracy and precision in %.
Figure 5LDA score plot for classification and separation of different non-tree pollen types. (a) LD1–LD2 plane of the score plot; (b) LD2–LD3 plane of the score plot; (c) LD1–LD3 plane of the score plot; (d) 3D score plot.
Confusion matrix for classification and separation of non-tree pollen types.
| Real classes | ||||
|---|---|---|---|---|
| Cyclamen | Rumex | Mugwort | Moor grass | |
| Cyclamen, predicted | 897 | 0 | 18 | 0 |
| Rumex, predicted | 0 | 267 | 16 | 168 |
| Mugwort, predicted | 16 | 0 | 676 | 0 |
| Moor grass, predicted | 0 | 96 | 10 | 622 |
| Sensitivity | 98.2 | 73.6 | 93.9 | 78.7 |
| Specificity | 99.0 | 92.4 | 99.2 | 94.7 |
| Accuracy | 98.8 | 89.9 | 97.8 | 90.2 |
| Precision | 98.0 | 59.2 | 97.7 | 85.4 |
Upper part: real classes vs. predicted classes of the classified test spectra; lower part: sensitivity, specificity, accuracy and precision in %.