| Literature DB >> 23577283 |
Khozima Hamasha1, Qassem I Mohaidat, Russell A Putnam, Ryan C Woodman, Sunil Palchaudhuri, Steven J Rehse.
Abstract
The determination of bacterial identity at the strain level is still a complex and time-consuming endeavor. In this study, visible wavelength spontaneous Raman spectroscopy has been used for the discrimination of four closely related Escherichia coli strains: pathogenic enterohemorrhagic E. coli O157:H7 and non-pathogenic E. coli C, E. coli Hfr K-12, and E. coli HF4714. Raman spectra from 600 to 2000 cm(-1) were analyzed with two multivariate chemometric techniques, principal component-discriminant function analysis and partial least squares-discriminant analysis, to determine optimal parameters for the discrimination of pathogenic E. coli from the non-pathogenic strains. Spectral preprocessing techniques such as smoothing with windows of various sizes and differentiation were investigated. The sensitivity and specificity of both techniques was in excess of 95%, determined by external testing of the chemometric models. This study suggests that spontaneous Raman spectroscopy with visible wavelength excitation is potentially useful for the rapid identification and classification of clinically-relevant bacteria at the strain level.Entities:
Keywords: (170.0170) Medical optics and biotechnology; (170.1420) Biology; (170.1530) Cell analysis; (170.1580) Chemometrics; (170.5660) Raman spectroscopy; (300.6450) Spectroscopy, Raman
Year: 2013 PMID: 23577283 PMCID: PMC3617710 DOI: 10.1364/BOE.4.000481
Source DB: PubMed Journal: Biomed Opt Express ISSN: 2156-7085 Impact factor: 3.732
Bacterial data sets tested in this study
| Data Set Number | Bacterial ID
( | # of Spectra in Set | Data Set Number | Bacterial ID
( | # of Spectra in Set | |
|---|---|---|---|---|---|---|
| 1 | C | 50 | 7 | HF4714 | 50 | |
| 2 | C | 50 | 8 | HF4714 | 25 | |
| 3 | C | 13 | 9 | HfrK-12 | 48 | |
| 4 | C | 30 | 10 | HfrK-12 | 52 | |
| 5 | O157:H7 | 35 | 11 | HfrK-12 | 25 | |
| 6 | O157:H7 | 65 | 12 | HfrK-12 | 35 |
Fig. 1(a) Normalized averaged Raman spectra of four strains of E. coli (top to bottom): C, O157:H7, Hfr K-12, and HF4714. Spectra have been offset vertically for clarity and the wavenumbers of important spectral features are indicated by a vertical line. The standard deviation of all the averaged spectra is indicated by dashed lines above and below the averaged spectrum. (b) A PC-DFA plot showing the first two discriminant function scores of the 478 Raman spectra. No smoothing was performed in this analysis.
Fig. 2The first principal component loading of the PCA (red) plotted with the difference of the average Raman spectrum of E. coli O157:H7 and E coli C (black). A strong correlation between these two shows that the difference between pathogenic E. coli O157:H7 and E coli C particularly in the important Raman bands at 1658, 1454, and 1338 cm−1 accounts for a significant amount of the overall variance in the data. No smoothing was performed in this analysis.
Classification accuracy of PLS-DA and PC-DFA E. coli O157:H7 tests with no spectral preprocessing
| Bacterial ID (# in group) | PLS-DA % ID’d as EHEC | PC-DFA % ID’d as EHEC | Bacterial ID (# in group) | PLS-DA % ID’d as EHEC | PC-DFA % ID’d as EHEC |
|---|---|---|---|---|---|
| C(50) | 0.00% | 0.00% | HF4714(50) | 0.00% | 0.00% |
| C(50) | 0.00% | 0.00% | HF4714(25) | 36.00% | 76.00% |
| C(13) | 15.38% | 30.77% | HfrK-12(48) | 0.00% | 0.00% |
| C(30) | 36.67% | 40.00% | HfrK-12(52) | 0.00% | 0.00% |
| O157:H7(35) | 100.00% | 100.00% | HfrK-12(25) | 0.00% | 0.00% |
| O157:H7(65) | 100.00% | 100.00% | HfrK-12(35) | 0.00% | 0.00% |
| PLS-DA | PC-DFA | ||||
| Sensitivity | 100.00% ± 0.00% | Sensitivity | 100.00% ± 0.00% | ||
| Specificity | 94.18% ± 13.61% | Specificity | 90.74% ± 22.50% | ||
Fig. 3PLS-DA results of an EHEC test on (a) non-pathogenic E. coli strain C and (b) pathogenic O157:H7. In (a) 100% of the non-pathogenic strain C spectra in the test group were correctly identified as being nonpathogenic E. coli, possessing Y predictor values above the determined discrimination line. In (b) 100% of the pathogenic strain O157:H7 spectra in the test group were correctly identified as being pathogenic, possessing Y predictor values below the determined discrimination line. No smoothing was performed in this analysis.
Fig. 4(a) The unprocessed (raw), smoothed (with a window size of 27), and smoothed 2nd derivative spectrum of E. coli O157:H7. (b) The same spectra zoomed in on the region from 800 – 1050 cm−1.
Classification accuracy of PLS-DA and PC-DFA E. coli O157:H7 tests using optimal preprocessing
| Bacterial ID (# in group) | PLS-DA % ID’d as EHEC | PC-DFA % ID’d as EHEC | Bacterial ID (# in group) | PLS-DA % ID’d as EHEC | PC-DFA % ID’d as EHEC |
|---|---|---|---|---|---|
| C(50) | 0.00% | 2.00% | HF4714(50) | 0.00% | 0.00% |
| C(50) | 0.00% | 0.00% | HF4714(25) | 4.17% | 4.00% |
| C(13) | 23.08% | 23.08% | HfrK-12(48) | 0.00% | 0.00% |
| C(30) | 0.00% | 6.67% | HfrK-12(52) | 3.85% | 0.00% |
| O157:H7(35) | 100.00% | 100.00% | HfrK-12(25) | 0.00% | 0.00% |
| O157:H7(65) | 100.00% | 100.00% | HfrK-12(35) | 0.00% | 0.00% |
| PLS-DA | PC-DFA | ||||
| Sensitivity | 100.00% ± 0.00% | Sensitivity | 100.00% ± 0.00% | ||
| Specificity | 98.41% ± 4.59% | Specificity | 98.15% ± 4.71% | ||