| Literature DB >> 33869258 |
Meritxell Deulofeu1,2, Esteban García-Cuesta3,4, Eladia María Peña-Méndez5, José Elías Conde5, Orlando Jiménez-Romero1,2, Enrique Verdú1, María Teresa Serrando1,2, Victoria Salvadó6, Pere Boadas-Vaello1,2.
Abstract
The high infectivity of SARS-CoV-2 makes it essential to develop a rapid and accurate diagnostic test so that carriers can be isolated at an early stage. Viral RNA in nasopharyngeal samples by RT-PCR is currently considered the reference method although it is not recognized as a strong gold standard due to certain drawbacks. Here we develop a methodology combining the analysis of from human nasopharyngeal (NP) samples by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) with the use of machine learning (ML). A total of 236 NP samples collected in two different viral transport media were analyzed with minimal sample preparation and the subsequent mass spectra data was used to build different ML models with two different techniques. The best model showed high performance in terms of accuracy, sensitivity and specificity, in all cases reaching values higher than 90%. Our results suggest that the analysis of NP samples by MALDI-TOF MS and ML is a simple, safe, fast and economic diagnostic test for COVID-19.Entities:
Keywords: MALDI-TOF MS analysis; NP samples; SARS-CoV-2; machine learning; viral transport media
Year: 2021 PMID: 33869258 PMCID: PMC8047105 DOI: 10.3389/fmed.2021.661358
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Results of the first experiment. (A) Representative mass spectra of NP samples for SARS-CoV-2: positive (red) and negative (green). (B) Precision-recall curve from the different models. (C) Average confusion matrix and (D) average performance metrics including the standard deviation (note that the test was performed 20 times selecting random different samples for each iteration) of the best model (SVM + 10PCs cross-validation K = 10).
Model learning (cross validation results K = 10) of all the models tested in the different experiments.
| Experiment 1 | 0.625 ± 0.041 | 0.595 ± 0.073 | 0.658 ± 0.093 | 0.780 ± 0.115 | 0.684 ± 0.062 | 0.580 ± 0.113 | 0.677 ± 0.043 | 0.626 ± 0.078 | |
| 0.620 ± 0.043 | 0.580 ± 0.084 | 0.580 ± 0.110 | 0.716 ± 0.066 | 0.739 ± 0.065 | 0.678 ± 0.043 | 0.695 ± 0.056 | |||
| 0.587 ± 0.045 | 0.558 ± 0.057 | 0.283 ± 0.098 | 0.238 ± 0.094 | 0.890 ± 0.055 | 0.891 ± 0.033 | 0.679 ± 0.065 | 0.703 ± 0.048 | ||
| Experiment 2 (VTM1 results) | 0.749 ± 0.066 | 0.759 ± 0.086 | 0.538 ± 0.168 | 0.525 ± 0.150 | 0.946 ± 0.033 | 0.950 ± 0.051 | 0.852 ± 0.039 | 0.832 ± 0.070 | |
| 0.793 ± 0.116 | 0.646 ± 0.242 | 0.652 ± 0.176 | 0.939 ± 0.070 | 0.969 ± 0.031 | 0.884 ± 0.069 | 0.879 ± 0.065 | |||
| 0.754 ± 0.132 | 0.700 ± 0.160 | 0.548 ± 0.275 | 0.487 ± 0.319 | 0.961 ± 0.058 | 0.937 ± 0.071 | 0.876 ± 0.075 | 0.845 ± 0.080 | ||
| Experiment 2 (VTM2 results) | 0.433 ± 0.080 | 0.418 ± 0.104 | 0.273 ± 0.081 | 0.285 ± 0.200 | 0.618 ± 0.119 | 0.588 ± 0.196 | 0.494 ± 0.082 | 0.491 ± 0.139 | |
| 0.482 ± 0.106 | 0.241 ± 0.162 | 0.381 ± 0.206 | 0.762 ± 0.138 | 0.688 ± 0.133 | 0.581 ± 0.111 | 0.594 ± 0.107 | |||
| 0.511 ± 0.120 | 0.476 ± 0.093 | 0.158 ± 0.172 | 0.135 ± 0.148 | 0.914 ± 0.063 | 0.925 ± 0.075 | 0.711 ± 0.083 | 0.654 ± 0.098 | ||
| Experiment 3 | 0.891 ± 0.109 | 0.922 ± 0.127 | 0.974 ± 0.076 | 0.872 ± 0.157 | 0.988 ± 0.054 | 0.897 ± 0.109 | 0.980 ± 0.044 | ||
| 0.950 ± 0.050 | 0.947 ± 0.066 | 0.930 ± 0.070 | 0.987 ± 0.040 | 0.970 ± 0.090 | 0.918 ± 0.110 | 0.981 ± 0.052 | 0.950 ± 0.063 | ||
| 0.868 ± 0.109 | 0.968 ± 0.055 | 0.911 ± 0.106 | 0.989 ± 0.484 | 0.832 ± 0.214 | 0.941 ± 0.118 | 0.882 ± 0.093 | 0.972 ± 0.045 | ||
| Robustness analysis | 0.647 | 0.956 | 1.000 | 0.377 | 0.923 | 0.687 | 0.964 | ||
| 0.609 | 0.945 | 0.922 | 0.964 | 0.346 | 0.923 | 0.654 | 0.945 | ||
| 0.690 | 0.902 | 0.956 | 1.000 | 0.446 | 0.795 | 0.719 | 0.902 | ||
Bold values represent the best model for each experiment using the selected metric F1-score.
Figure 2Results for DeltaSwab-ViCUM (VTM1). (A) Representative mass spectra of NP samples for SARS-CoV-2: positive (red) and negative (green). (B) Precision-recall curve from the different models. (C) Average confusion matrix and (D) average performance metrics including their standard deviation (the test was performed 20 times selecting randomly different samples for each iteration) of the best model (SVM + 10PCs cross-validation K = 10).
Figure 3Results for DeltaSwab-Virus (VTM2). (A) Representative mass spectra of NP samples for SARS-CoV-2: positive (red) and negative (green). (B) Precision-recall curve from the different models. (C) Average confusion matrix and (D) average performance metrics including their standard deviation (the test was performed 20 times selecting randomly different samples for each iteration) of the best model (SVM + 10 PCs cross-validation K = 10).
Figure 4Results for DeltaSwab-ViCUM (VTM1) within the high incidence pandemic period. (A) Representative mass spectra of NP samples for SARS-CoV-2: positive (red) and negative (green). (B) Precision-recall curve from the different models. (C) Average confusion matrix and (D) average performance metrics including their standard deviation (the test was performed 20 times selecting randomly different samples for each iteration) of the best model (SVM + 5PCs cross-validation K = 10).
Figure 5Developed methodology robustness. (A) Representative mass spectra of NP samples [positive (red) and negative (green)] for SARS-CoV-2 of S1 (top mass spectra) and S2 (down mass spectra). (B) Precision-recall curve from the different models. (C) Matrix confusion showing the summary of the results of all the samples used in the test phases and performance measures of the best model (SVM + 5PCs cross-validation K = 10). Note that no errors are included given that the results correspond to a single test set.
Figure 6Infographic of the simulation of the results in the society. (A) Simulation in a pandemic situation. (B) Simulation in a post-pandemic situation.