| Literature DB >> 32010667 |
Denis Svechkarev1, Marat R Sadykov2, Lucas J Houser1, Kenneth W Bayles2, Aaron M Mohs1,3,4.
Abstract
Fast and reliable identification of infectious disease agents is among the most important challenges for the healthcare system. The discrimination of individual components of mixed infections represents a particularly difficult task. In the current study we further expand the functionality of a ratiometric sensor array technology based on small-molecule environmentally-sensitive organic dyes, which can be successfully applied for the analysis of mixed bacterial samples. Using pattern recognition methods and data from pure bacterial species, we demonstrate that this approach can be used to quantify the composition of mixtures, as well as to predict their components with the accuracy of ~80% without the need to acquire additional reference data. The described approach significantly expands the functionality of sensor arrays and provides important insights into data processing for the analysis of other complex samples.Entities:
Keywords: 3-hydroxyflavone; ESIPT; discriminant analysis; machine learning; multiparametric sensing; pathogenic bacteria; pattern analysis
Year: 2020 PMID: 32010667 PMCID: PMC6974461 DOI: 10.3389/fchem.2019.00916
Source DB: PubMed Journal: Front Chem ISSN: 2296-2646 Impact factor: 5.221
Figure 1(A) Fluorescence spectra of DOAF-loaded nanoparticles upon incubation with E. coli (EC), S. epidermidis (SE), and their mixtures in different proportions each line is the average of four spectra. The isoemissive point at ~470 nm serves as evidence of the mixed samples spectra being linear combinations of the emission spectra of the pure components. (B) Canonical score plot from LDA analysis of response patterns of pure bacteria and their mixtures. Signals from the mixtures are positioned along the line connecting the centroids of the 95% confidence ellipses for the pure bacteria.
Figure 2Canonical score plot of the results of LDA for pure bacterial samples and their binary mixtures. Signals from pure bacteria are represented by filled dots and solid ellipses; mixtures are represented by circles and dashed ellipses. Yellow dots are centroids of the 95% confidence ellipses. Lines connecting centroids of the mixture components serve as visual guide.
Figure 3(A) Quantification of the binary mixture of bacteria after the components are identified. The content of S. epidermidis (SE) is proportional to the distance between the 95% confidence ellipse centroids of the mixture and K. pneumoniae (KP). (B) Canonical score plot of the sensor's response to an equal mixture of S. aureus (SA), E. coli (EC) and K. pneumoniae (KP) related to the signals of pure components. The signal of the mixture represents an almost perfect linear combination, being located very close to the center of the triangle (marked with an orange cross).
Component quantification (see Supplementary Table 2 for detailed LDA quantification results).
| 50:50 | 64.0:36.0 | 14.0% | 79.6:20.4 | 29.6% | |
| 75:25 | 85.5:14.5 | 10.5% | 81.7:18.3 | 6.7% | |
| 50:50 | 51.6:48.4 | 1.6% | 70.8:29.2 | 20.8% | |
| 70:30 | 70.7:29.3 | 0.7% | 81.7:18.3 | 11.7% | |
| 50:50 | 44.5:55.5 | 5.5% | 87.5:12.5 | 37.5% | |
| 80:20 | 82.1:17.9 | 2.1% | 91.8:8.2 | 11.8% | |
| 50:50 | 58.1:41.9 | 8.1% | 27.6:72.4 | 22.4% | |
| 35:65 | 41.5:58.5 | 6.5% | 25.9:74.1 | 9.1% | |
| Average error | 6.1 ± 4.4% | 18.7 ± 10.1% | |||
Figure 4Example of a positive (A) and negative (B) decision regarding a potential component of an unknown mixture in the “one against the rest” analysis. The decision is made based on the overlap between the 95% confidence ellipses of the mixture and the complex of “other” bacteria, and the angle between the vectors connecting the ellipse centroid of the “other” aggregate group and the signal of the component under question, respectively.
Component prediction accuracy based on “one against the rest” analysis (see Supplementary Figures 2–9 for detailed LDA “one against the rest” classification).
| 50:50 | 6/8 (75%) | 7/8 (87.5%) | 84.4% | |
| 75:25 | 7/8 (87.5%) | 7/8 (87.5%) | 84.8% | |
| 50:50 | 6/8 (75%) | 6.5/8 (81.3%) | 81.4% | |
| 70:30 | 7/8 (87.5%) | 7/8 (87.5%) | 84.0% | |
| 50:50 | 5/8 (62.5%) | 7/8 (87.5%) | 85.2% | |
| 80:20 | 6/8 (75%) | 7/8 (87.5%) | 86.3% | |
| 50:50 | 7/8 (87.5%) | 5.5/8 (68.8%) | 70.6% | |
| 35:65 | 7/8 (87.5%) | 6.5/8 (81.3%) | 78.0% | |
| Average | 6.4/8 | 6.7/8 | 81.8 ± 4.9% | |
A correct hit (positive for a component present in the mixture, negative for a component absent in the mixture) is given 1 point, an ambiguous result is given 0.5 point, an incorrect hit (false positive or negative) is given 0 points.
The average of the probabilities calculated for a mixture's response to be associated with every single component against the aggregate of the rest of components.