| Literature DB >> 32295273 |
Peng Gu1, Yao-Ze Feng1,2, Le Zhu1, Li-Qin Kong1, Xiu-Ling Zhang3, Sheng Zhang1, Shao-Wen Li3, Gui-Feng Jia1,2.
Abstract
A universal method by considering different types of culture media can enable convenient classification of bacterial species. The study combined hyperspectral technology and versatile chemometric algorithms to achieve the rapid and non-destructive classification of three kinds of bacterial colonies (Escherichia coli, Staphylococcus aureus and Salmonella) cultured on three kinds of agar media (Luria-Bertani agar (LA), plate count agar (PA) and tryptone soy agar (TSA)). Based on the extracted spectral data, partial least squares discriminant analysis (PLS-DA) and support vector machine (SVM) were employed to established classification models. The parameters of SVM models were optimized by comparing genetic algorithm (GA), particle swarm optimization (PSO) and grasshopper optimization algorithm (GOA). The best classification model was GOA-SVM, where the overall correct classification rates (OCCRs) for calibration and prediction of the full-wavelength GOA-SVM model were 99.45% and 98.82%, respectively, and the Kappa coefficient for prediction was 0.98. For further investigation, the CARS, SPA and GA wavelength selection methods were used to establish GOA-SVM simplified model, where CARS-GOA-SVM was optimal in model accuracy and stability with the corresponding OCCRs for calibration and prediction and the Kappa coefficients of 99.45%, 98.73% and 0.98, respectively. The above results demonstrated that it was feasible to classify bacterial colonies on different agar media and the unified model provided a continent and accurate way for bacterial classification.Entities:
Keywords: Visible-Near-infrared hyperspectral imaging; bacterial contamination; bacterial pathogens; grasshopper optimization algorithm; optimization; support vector machine; variable selection
Mesh:
Year: 2020 PMID: 32295273 PMCID: PMC7221630 DOI: 10.3390/molecules25081797
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Spectra of bacterial colonies. Note: E. coli, SA and SE are abbreviations of Escherichia coli, Staphylococcus aureus and Salmonella, respectively; LA, PA and TSA are abbreviations of Luria–Bertani agar, plate counting agar and tryptone soy agar, respectively. For example, E. coli LA represents the Escherichia coli cultured on Luria–Bertani agar.
Figure 2Score plot of bacterial colonies.
Performance of full wavelength models.
| Model | Optimization Methods | LVs | cp | g | OCCR (%) | Kappa | |
|---|---|---|---|---|---|---|---|
| Calibration | Prediction | ||||||
| PLS-DA | - | 5 | - | - | 75 | 74.03 | 0.71 |
| SVM | GA | - | 41.74 | 51.53 | 100 | 97.33 | 0.96 |
| PSO | - | 13.69 | 16.56 | 99.45 | 98.78 | 0.98 | |
| GOA | - | 70.61 | 5.49 | 99.45 | 98.82 | 0.98 | |
Confusion matrix of prediction for full-wavelength models.
| Predicted Label | ||||||
|---|---|---|---|---|---|---|
| Bacteria | E. coli | SA | SE | Unclassified | ||
| PLS-DA | True Label | E. coli | 80.39% | 7% | 0 | 4.30% |
| SA | 9.51% | 75.59% | 14.71% | |||
| SE | 0 | 38.24% | 61.13% | |||
| GOA-SVM | True Label | E. coli | 97.33% | 0 | 2.67% | 0 |
| SA | 0 | 99.71% | 0.29% | |||
| SE | 0.63% | 0.21% | 99.16% | |||
Note: E. coli, SA and SE are abbreviations of Escherichia coli, Staphylococcus aureus and Salmonella, respectively.
Performance of PLS-DA and GOA-SVM simplified models.
| Wavelength Selection Methods | Number of Wavelengths | PLS-DA | GOA-SVM | ||||
|---|---|---|---|---|---|---|---|
| OCCRC% | OCCRP% | Kappa | OCCRC% | OCCRP% | Kappa | ||
| CARS | 30 | 79.15 | 71.72 | 0.63 | 99.45 | 98.73 | 0.98 |
| SPA | 24 | 75.66 | 71.72 | 0.62 | 99.24 | 98.69 | 0.98 |
| GA | 69 | 78.50 | 73.53 | 0.64 | 99.45 | 98.60 | 0.98 |
Note: PLS-DA: partial least squares discrimination analysis; GOA-SVM: grasshopper optimization algorithm support vector machine.
Figure 3Wavelength selected by competitive adaptive reweighted sampling (CARS).
Confusion matrix of training set for CARS-GOA-SVM simplified model.
| Bacteria | Predicted Label | Unclassified | |||
|---|---|---|---|---|---|
|
| SA | SE | |||
| True Label | E. coli | 97.48% | 0 | 2.52% | 0 |
| SA | 0 | 99.8% | 0.20% | ||
| SE | 1.47% | 0.21% | 98.32% | ||
Note: E. coli, SA and SE are abbreviations of Escherichia coli, Staphylococcus aureus and Salmonella, respectively.
Comparison of performance of predicting pixel-level spectral data using existing GOA-SVM and CARS-GOA-SVM models.
| Predicted Label | ||||||
|---|---|---|---|---|---|---|
| Bacteria |
| SA | SE | OCCRP (%) | ||
| GOA-SVM | True Label |
| 94.99% | 0.12% | 4.89% | 91.86 |
| SA | 6.09% | 90.77% | 3.14% | |||
| SE | 21.06% | 0.88% | 78.06% | |||
| CARS-GOA-SVM | True Label |
| 95.35% | 0.12% | 4.53% | 93.32 |
| SA | 1.18% | 95.51% | 3.31% | |||
| SE | 19.27% | 0.64% | 80.09% | |||
Note: E. coli, SA and SE are abbreviations of Escherichia coli, Staphylococcus aureus and Salmonella, respectively.
Figure 4Hyperspectral imaging system.
Figure 5Flowchart of spectral data acquisition and analysis.
Number of samples in the calibration and prediction set.
| Calibration Set | Prediction Set | |||||||
|---|---|---|---|---|---|---|---|---|
| LA | PA | TSA | Total | LA | PA | TSA | Total | |
| E. coli | 103 | 83 | 94 | 280 | 325 | 167 | 222 | 714 |
| SA | 110 | 114 | 101 | 325 | 347 | 289 | 384 | 1020 |
| SE | 96 | 108 | 107 | 311 | 114 | 211 | 151 | 476 |
| Total | 309 | 305 | 302 | 916 | 786 | 667 | 757 | 2210 |
Note: E. coli, SA and SE are abbreviations of Escherichia coli, Staphylococcus aureus and Salmonella, respectively; LA, PA and TSA are abbreviations of Luria–Bertani agar, plate counting agar and tryptone soy agar, respectively.