| Literature DB >> 35359729 |
Ying Feng1,2,3, Moutong Chen2, Xianhu Wei2, Honghui Zhu2, Jumei Zhang2, Youxiong Zhang2, Liang Xue2, Lanyan Huang1,2,3, Guoyang Chen1,2,3, Minling Chen2, Yu Ding2,4, Qingping Wu2.
Abstract
Matrix-assisted laser desorption/ionization time-of-flight mass (MALDI-TOF) spectrometry fingerprinting has reduced turnaround times, costs, and labor as conventional procedures in various laboratories. However, some species strains with high genetic correlation have not been directly distinguished using conventional standard procedures. Metabolomes can identify these strains by amplifying the minor differences because they are directly related to the phenotype. The pseudotargeted metabolomics method has the advantages of both non-targeted and targeted metabolomics. It can provide a new semi-quantitative fingerprinting with high coverage. We combined this pseudotargeted metabolomic fingerprinting with deep learning technology for the identification and visualization of the pathogen. A variational autoencoder framework was performed to identify and classify pathogenic bacteria and achieve their visualization, with prediction accuracy exceeding 99%. Therefore, this technology will be a powerful tool for rapidly and accurately identifying pathogens.Entities:
Keywords: LC–QQQ–MS; convolutional neural network (CNN); deep learning; pseudotargeted metabolomic; variational autoencoder (VAE)
Year: 2022 PMID: 35359729 PMCID: PMC8960985 DOI: 10.3389/fmicb.2022.830832
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 2A variational autoencoder (VAE) for common foodborne pathogen classification. (A) The selected pseudo-targeted LC–MS profiles. We selected 253 ion pairs from the untargeted strategy and detected them in actual pathogen samples using the UHPLC/QQQ MRM MS-based pseudotargeted metabolomics method. (B) The schematic of the VAE model. The VAE is composed of an encoder network and a decoder network. The encoder network encodes spectra into a Gaussian probability distribution in the n-dimensional latent space, and the decoder network decodes sample points from the latent space back into the original spectra. The encoder and decoder networks used deep convolutional neural network. (C) Visualization of prediction results.
FIGURE 1Principal component analysis of common pathogens from Bacillus cereus, Escherichia coli, Enterobacter sakazakii, Listeria innocua, Listeria monocytohenes, and Staphylococcus aureus.
FIGURE 3Variational autoencoder space of common pathogens from Bacillus cereus, Escherichia coli, Enterobacter sakazakii, Listeria innocua, Listeria monocytohenes, and Staphylococcus aureus. (A) The trained dataset was used to train the model. (B) The validation dataset was used to test the model.
FIGURE 4The confusion matrix chart of the validated dataset predicted the results. Its predicted accuracy of identification exceeded 99%.
Prediction accuracy of each type of pathogens.
| Pathogens | Predicted accuracy |
|
| 100% |
|
| 100% |
|
| 100% |
|
| 100% |
|
| 100% |
|
| 95% |
| Total | 99.38% |