| Literature DB >> 35880262 |
Camila M de Almeida1, Larissa C Motta1, Gabriely S Folli1, Wena D Marcarini2, Camila A Costa3, Ana C S Vilela3, Valério G Barauna2, Francis L Martin4, Maneesh N Singh4, Luciene C G Campos5, Nádia L Costa3, Paula F Vassallo6, Andrea R Chaves7, Denise C Endringer8, José G Mill2, Paulo R Filgueiras1, Wanderson Romão1,9.
Abstract
Rapid identification of existing respiratory viruses in biological samples is of utmost importance in strategies to combat pandemics. Inputting MALDI FT-ICR MS (matrix-assisted laser desorption/ionization Fourier-transform ion cyclotron resonance mass spectrometry) data output into machine learning algorithms could hold promise in classifying positive samples for SARS-CoV-2. This study aimed to develop a fast and effective methodology to perform saliva-based screening of patients with suspected COVID-19, using the MALDI FT-ICR MS technique with a support vector machine (SVM). In the method optimization, the best sample preparation was obtained with the digestion of saliva in 10 μL of trypsin for 2 h and the MALDI analysis, which presented a satisfactory resolution for the analysis with 1 M. SVM models were created with data from the analysis of 97 samples that were designated as SARS-CoV-2 positives versus 52 negatives, confirmed by RT-PCR tests. SVM1 and SVM2 models showed the best results. The calibration group obtained 100% accuracy, and the test group 95.6% (SVM1) and 86.7% (SVM2). SVM1 selected 780 variables and has a false negative rate (FNR) of 0%, while SVM2 selected only two variables with a FNR of 3%. The proposed methodology suggests a promising tool to aid screening for COVID-19.Entities:
Keywords: COVID-19; MALDI FT-ICR MS; SARS-CoV-2; machine learning; saliva; screening
Mesh:
Year: 2022 PMID: 35880262 PMCID: PMC9344790 DOI: 10.1021/acs.jproteome.2c00148
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 5.370
Figure 1MALDI(+) mass spectra of saliva samples with and without sample preparation. (A,B) Mass range m/z 200–2000 (A) before and (B) after tryptic digestion and C18 spin column processing. (C,D) Mass range m/z 1000–2000 (C) before and (D) after tryptic digestion and C18 spin column processing.
Peptides Identified in Saliva Samples after Digestion of Proteins and SPE
| TIC (total ion current) | peptide sequence | parent protein | ref | |
|---|---|---|---|---|
| 1224.63859 | 4.5 × 107 | GPPQQGGHPRPP | Acidic proline-rich phosphoprotein or PRH1Protein (P02810) | ( |
| 1380.74922 | 1.0 × 107 | GPPQQGGHPRPPR | ( | |
| 1390.70365 | 3.6 × 106 | GGRPQGPPQGQSPQ | ( | |
| 1471.73636 | 1.1 × 107 | GRPQGPPQQGGHQQ | ( | |
| 1731.89775 | 3.7 × 107 | GPPQQGGHPPPPQGRPQ | ( |
Number of Signals Detected and Time of Acquisition of the MALDI FT-ICR Mass Spectra in the Optimization of the Parameters
| data acquisition size | number of signals with signal/noise > 4% | number of signals with relative intensity > % noise | time of acquisition (min) | total time acquisition (h) | total ion current |
|---|---|---|---|---|---|
| 8 M | 43 228 | 169 | 1.89 | 14.1 | 1.1 × 108 |
| 4 M | 22 699 | 234 | 1.06 | 7.9 | 2.1 × 108 |
| 2 M | 11 729 | 223 | 0.69 | 5.2 | 1.9 × 108 |
| 1 M | 6 493 | 221 | 0.51 | 3.8 | 1.2 × 108 |
| 512 k | 3 345 | 169 | 0.36 | 2.7 | 3.9 × 107 |
Figure 2PCA scores and loadings plot of MALDI(+) spectra of saliva samples positive or negative for SARS-CoV-2.
Figure 3Characteristic signal in the individual MALDI(+) spectra among SARS-CoV-2 positive samples (green) versus negative samples (gray).
Parameter Performance of SVM Models by MALDI Spectra with Fisher’s Discriminant (FD)a
| model | FD (%) | varsel | group | TP | TN | FP | FN | SENS (%) | SPEC (%) | ACC (%) | PPV (%) | PNV (%) | FPR (%) | FNR (%) | MCC (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 30 | 780 | cal | 204 | 107 | 1 | 0 | 100.0 | 99.1 | 99.7 | 99.5 | 100.0 | 0.1 | 0.0 | 99.3 |
| test | 87 | 42 | 6 | 0 | 100.0 | 87.5 | 95.6 | 93.6 | 100.0 | 12.5 | 0.0 | 90.5 | |||
| 2 | 85 | 2 | cal | 204 | 108 | 0 | 0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 0.0 | 0.0 | 100.0 |
| test | 84 | 33 | 15 | 3 | 96.6 | 68.8 | 86.7 | 84.9 | 91.7 | 31.3 | 3.5 | 70.7 |
FD (Fisher’s Discriminant); varsel (selected variables); TP (true positive); TN (true negative); FP (false positive); FN (false negative); SENS (sensitivity); ACC (accuracy); SPEC (specificity); PPV (prevalence positive value); PNV (prevalence negative value); FPR (false positive rate); FNR (false negative rate); MCC (Matthews correlation coefficient).