| Literature DB >> 35735526 |
Wenjing Yu1, Jia Shi2, Guorong Huang1, Jie Zhou1, Xinyu Zhan1, Zekang Guo2, Huiyan Tian1, Fengxin Xie1, Xiang Yang1, Weiling Fu1.
Abstract
The demand for rapid and accurate identification of microorganisms is growing due to considerable importance in all areas related to public health and safety. Here, we demonstrate a rapid and label-free strategy for the identification of microorganisms by integrating terahertz-attenuated total reflection (THz-ATR) spectroscopy with an automated recognition method based on multi-classifier voting. Our results show that 13 standard microbial strains can be classified into three different groups of microorganisms (Gram-positive bacteria, Gram-negative bacteria, and fungi) by THz-ATR spectroscopy. To detect clinical microbial strains with better differentiation that accounts for their greater sample heterogeneity, an automated recognition algorithm is proposed based on multi-classifier voting. It uses three types of machine learning classifiers to identify five different groups of clinical microbial strains. The results demonstrate that common microorganisms, once time-consuming to distinguish by traditional microbial identification methods, can be rapidly and accurately recognized using THz-ATR spectra in minutes. The proposed automatic recognition method is optimized by a spectroscopic feature selection algorithm designed to identify the optimal diagnostic indicator, and the combination of different machine learning classifiers with a voting scheme. The total diagnostic accuracy reaches 80.77% (as high as 99.6% for Enterococcus faecalis) for 1123 isolates from clinical samples of sputum, blood, urine, and feces. This strategy demonstrates that THz spectroscopy integrated with an automatic recognition method based on multi-classifier voting significantly improves the accuracy of spectral analysis, thereby presenting a new method for true label-free identification of clinical microorganisms with high efficiency.Entities:
Keywords: data analysis; microbial identification; terahertz spectroscopy
Mesh:
Year: 2022 PMID: 35735526 PMCID: PMC9221034 DOI: 10.3390/bios12060378
Source DB: PubMed Journal: Biosensors (Basel) ISSN: 2079-6374
Figure 1Identification of clinical microbes based on their THz-ATR spectra. (A) Schematic illustration of the THz-ATR spectrometer with a sample cell made of Si. (Inset) Diagram of the THz-ATR spectrometer with a “prism–sample” model. (B) Schematic showing sample loading.
Figure 2(A) A total of 387 THz absorption spectra curves of 13 species of standard microbial strains. (B) THz absorption spectra of eight Gram-positive bacterial strains (red), two Gram-negative bacterial strains (green), and three fungi (black). (C) PCA of THz spectra for Gram-positive bacterial strains (red), Gram-negative bacterial strains (green), and fungi (black). (D) Least-squares analysis representation of THz spectra for Gram-positive bacterial strains (red), Gram-negative bacterial strains (green), and fungi (black).
Figure 3(A) A total of 1123 THz absorption spectra curves of 5 species of clinical strains, specifically E. faecalis (black), E. coli (red), P. aeruginosa (blue), C. albicans (green), and C. tropicalis (purple). (B) THz absorption spectra of one Gram-positive bacterial strain (black), two Gram-negative bacterial strains (red), and two fungi (blue). (C) PCA representation of the THz spectra for the above five clinical strains. (D) Least-squares analysis representation of the THz spectra for the above five clinical strains.
Figure 4(A) The proposed automated recognition workflow for constructing the THz database of common microorganisms. (B) Discrimination function filter of the automatic recognition method is based on the multi-classifier voting scheme for clinical strain diagnosis.
Classification results obtained by the multi-classifier voting scheme.
|
|
|
|
|
| Total | |
|---|---|---|---|---|---|---|
| Standard strains for modeling | 25 | 29 | 27 | 26 | 30 | 137 |
| Clinical strains for identification | 253 | 222 | 227 | 208 | 213 | 1123 |
| Correctly identified strains | 252 | 177 | 166 | 165 | 147 | 907 |
E. faecalis: Enterococcus faecalis; E. coli: Escherichia coli; P. aeruginosa: Pseudomonas aeruginosa; C. albicans: Candida albicans; C. tropicalis: Candida tropicalis.
Figure 5Diagnostic evaluation of the automated recognition method based on two characteristic parameters: absorption (A) and refractive index (B). (C) ROC curves and AUC scores for identifying clinical strains with three types of machine learning classifiers.
Classification accuracy of the three types of classifiers for the five clinical strains.
| Classifier | Parameter | Accuracy (%) | Number of Extracted Characteristics |
|---|---|---|---|
| RI | 56.1% | 266 | |
| Absorption | 40.5% | 7 | |
| SVM | RI | 46.6% | 610 |
| SVM | Absorption | 39.4% | 654 |
| RF | RI | 61.5% | 130 |
| RF | Absorption | 58.8% | 265 |
kNN: k-nearest neighbor; SVM: support vector machine; RF: random forest; RI: refractive index.