Literature DB >> 35880262

MALDI(+) FT-ICR Mass Spectrometry (MS) Combined with Machine Learning toward Saliva-Based Diagnostic Screening for COVID-19.

Camila M de Almeida1, Larissa C Motta1, Gabriely S Folli1, Wena D Marcarini2, Camila A Costa3, Ana C S Vilela3, Valério G Barauna2, Francis L Martin4, Maneesh N Singh4, Luciene C G Campos5, Nádia L Costa3, Paula F Vassallo6, Andrea R Chaves7, Denise C Endringer8, José G Mill2, Paulo R Filgueiras1, Wanderson Romão1,9.   

Abstract

Rapid identification of existing respiratory viruses in biological samples is of utmost importance in strategies to combat pandemics. Inputting MALDI FT-ICR MS (matrix-assisted laser desorption/ionization Fourier-transform ion cyclotron resonance mass spectrometry) data output into machine learning algorithms could hold promise in classifying positive samples for SARS-CoV-2. This study aimed to develop a fast and effective methodology to perform saliva-based screening of patients with suspected COVID-19, using the MALDI FT-ICR MS technique with a support vector machine (SVM). In the method optimization, the best sample preparation was obtained with the digestion of saliva in 10 μL of trypsin for 2 h and the MALDI analysis, which presented a satisfactory resolution for the analysis with 1 M. SVM models were created with data from the analysis of 97 samples that were designated as SARS-CoV-2 positives versus 52 negatives, confirmed by RT-PCR tests. SVM1 and SVM2 models showed the best results. The calibration group obtained 100% accuracy, and the test group 95.6% (SVM1) and 86.7% (SVM2). SVM1 selected 780 variables and has a false negative rate (FNR) of 0%, while SVM2 selected only two variables with a FNR of 3%. The proposed methodology suggests a promising tool to aid screening for COVID-19.

Entities:  

Keywords:  COVID-19; MALDI FT-ICR MS; SARS-CoV-2; machine learning; saliva; screening

Mesh:

Year:  2022        PMID: 35880262      PMCID: PMC9344790          DOI: 10.1021/acs.jproteome.2c00148

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   5.370


Introduction

Coronavirus disease is a viral infection caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Its spread may be due to airborne transport of aerosol particles from normal breathing or larger airborne suspended droplets from coughing and sneezing.[1] COVID-19 was declared a pandemic on March 11th, 2020 by the World Health Organization, with >420 000 000 people infected and 5 800 000 deaths worldwide as of March 2022.[2] Defining methods for early and rapid detection of COVID-19 is crucial to provide timely treatment, tracking individuals who have had contact with infected patients, and interrupting transmission routes.[3] Reverse transcription–polymerase chain reaction (RT-PCR) in swabs collected from the nasopharyngeal mucosa is routinely used and considered the gold standard in detecting SARS-CoV-2 infection.[4] Smear collection requires qualified health professionals, and there is great concern regarding handling, transporting, and storing samples. In addition to the nasopharyngeal swab causing discomfort to patients, it can also cause bleeding. Therefore, there are several contraindications to this procedure, such as clinical conditions, as a consequence of coagulopathy or anticoagulant therapy and significant deviation of the nasal septum.[5] Additionally, the RT-PCR test has limitations associated with expenses and time that need to be overcome.[6,7] To improve issues associated with transportation of highly contaminated materials, availability of equipment, and the need for faster testing, the use of rapid antigen tests (RATs) for COVID-19 was seen as a solution to quickly detect and track communities suspected of contracting COVID-19. A rapid antigen test is a rapid diagnostic test that detects the presence of the virus directly at the time of analysis.[8] Despite meeting the needs of a rapid response in a mass-screening program, we still need tests with greater sensitivity and accuracy as the frontline testing for COVID-19 diagnosis.[9] The diagnosis for COVID-19 still has limitations for general use, especially in developing countries, due to the gold technique used and the need for inputs to apply numerous tests in the population. With high sensitivity and specificity, an alternative to these methods would be applying diagnostic tools based on mass spectrometry (MS), using the MALDI (matrix-assisted laser desorption/ionization) MS technique to identify disease-specific proteins. Recently, MS techniques have been explored to detect COVID-19 on the basis of the identification of salivary biomarkers.[10−12] Saliva is a secretion produced by the salivary glands. It has several functions, such as protecting and cleaning the oral cavity and aiding digestion. In addition, due to the ease and safety of collecting saliva samples via noninvasive techniques, this biofluid may be an alternative source to screen disease biomarkers.[13,14] Thus, COVID-19 could be detected through salivary diagnostic tests, presenting advantages for healthcare professionals and patients.[15−17] The large amount of data obtained in MS spectra requires the combination of the output of this technique with robust statistical strategies to identify diagnostic patterns.[18] The current trend in associating algorithms labeled as artificial intelligence and “omics” techniques has yielded platforms involving machine learning (ML) for MS data analysis, aiming to identify disease biomarkers, including COVID-19 severity assessment.[19] Recently, Nachtigall et al. (2020) reported the development of a MALDI MS method to diagnose SARS-CoV-2 infection. In this study, nasal swab samples were analyzed directly by MALDI TOF MS, and a spectral pattern was used to classify infected and noninfected patients, an interesting application of mass spectrometry to detect viral infections.[20] Another study, carried out by Chivte et al. (2021), used saliva samples and established five viral protein signals as potential biomarkers for COVID-19. MALDI TOF analysis showed ≥90% agreement with RT-qPCR results, concluding that MALDI-TOF can be used to develop assays for COVID-19 in a relatively quick and inexpensive way.[11] Given the above, the development of a methodology using high-resolution MALDI FT-ICR MS combined with machine learning algorithms can provide a quick and highly accurate response to classify positive samples for diseases, favoring rapid and accurate diagnoses that complement genomic information. This may also allow us to increase our current knowledge of COVID-19.

Materials and Methods

Saliva Samples

The study included 149 saliva samples (97 RT-PCR positives for SARS-CoV-2 and 52 negatives for SARS-CoV-2). Among them, 86 samples (34 with positive RT-PCR for SARS-CoV-2 and 52 negatives) were obtained by the research group of the Center for Health Sciences at UFES (CAAE ethics committee: 30993920.1.0000.5071) and the other 63 positive samples were provided by the research group of the Faculty of Dentistry of the Federal University of Goiás (ethics committee CAAE: 38088920.9.0000.5083, 38088920.9.3002.5082, 38088920.9.3001.5078). All participants were >18 years old and were tested for SARS-CoV-2 infection using a qRT-PCR test. Saliva collection took place at the hospitals of the respective cities, Vila Velha and São Mateus, in patients who had at least one characteristic symptom of COVID-19 and spontaneously attended the hospitals to undergo the RT-PCR test for COVID-19. Immediately after oropharyngeal swab collection, each patient received a sterile tube to facilitate self-collection of saliva (around 1 mL) under direct supervision of a nurse. Table S1 describes participant demographic profiles with gender and symptoms.

Sample Preparation

In the optimization of sample preparation, protein in saliva samples from voluntary donors without symptoms were determined by the dye-binding Bradford assay.[21] An aliquot of saliva (500 μL) was centrifuged at 1903.67g for 15 min at 4 °C. The supernatant aliquot (10 μL) was mixed with 190 μL Bradford reagent in a well plate. Protein absorbance was measured at 595 nm against a blank reagent. Standard curves were prepared using bovine serum albumin (BSA). To the 50 μL-supernatant saliva sample 10 μL AmBic (50 mM; ammonium bicarbonate, Sigma-Aldrich Chemicals, USA) and 25 μL RapiGest SF (0.2%; Waters, Milford, MA, USA) were added, and the tubes were heated to at 37 °C for 1 h. Then, these tubes were submitted to disulfide reduction with 2.5 μL DTT at a concentration of 100 mM in AmBic (dithiothreitol in 50 mM AmBic) for 40 min at 37 °C and 2.5 μL IAA at a concentration of 300 mM in AmBic (Iodoacetamide, Sigma-Aldrich Chemicals, USA). Then, they were placed in the dark for 30 min at room temperature. Protein digestion was performed by adding 5 or 10 μL 0.05 μg/μL Trypsin (Promega, Madison, WI, USA) and incubating the samples at 37 °C for 2 h or overnight. Digestion was terminated with 10 μL of TFA 5% (trifluoroacetic acid, Sigma-Aldrich Chemicals, USA), followed by heating the sample at 37 °C for 90 min. The samples were centrifuged and the pellets were discarded. The samples were purified with a C18 spin column (Pierce C18 Spin Column, Thermo Scientific, San Jose, CA, USA). These columns were activated by 2 × 200 μL of 50% ACN and equilibrated with 2 × 200 μL of 0.5% TFA. The tryptic peptides were adsorbed to the media using two repeat cycles of 60 μL (consisting of 15 μL 2% TFA in 20% ACN plus 45 μL sample) sample buffer loading, and the column was washed using 2 × 200 μL of 0.5% TFA. Finally, the peptides were eluted in 2 × 20 μL of 70% ACN. After optimization, samples with suspected COVID-19 were digested and analyzed by MALDI MS.

MALDI MS Analysis

MALDI MS analyses were performed with an FT-ICR mass spectrometer (model 9.4 T Solarix, Bruker Daltonics, Bremen, Germany) equipped with a Smart beam II Nd: YAG laser (355 nm) and MALDI source. Two μL of each sample were spotted in triplicate on the MALDI target plate (Bruker Daltonics), followed by the addition of 1 μL of the matrix solution composed of 5 mM CHCA in methanol (α-ciano-hydroxy-cinnamic acid, Sigma-Aldrich Chemicals, USA) at 50% in acetonitrile:0.1% trifluoroacetic acid and air-drying.[22] The mass spectra were acquired in positive ionization mode in the mass range m/z 200–4000. The main experimental parameters used were as follows: laser frequency, 200 Hz; plate offset, −100.0 V; deflector plate, −210.0 V; the number of laser shots, 200; laser power, 35%; laser focus, “small”; 16 scans, and the random walk. The time spent on the analysis and the numbers of signals were evaluated in the MALDI FT-ICR analysis, using different resolving powers (512k, 1M, 2M, 4M, and 8M).

Chemometrics

The data set obtained with the DataAnalysis software (Bruker Daltonics, USA) was exported in ASC format and imported into the MATLAB 9.0 software (R2013a). Pretreatments such as mean centering, normalization, autoscaling, and combinations were performed.

Principal Component Analysis (PCA)

PCA pattern recognition model was developed to identify possible anomalous samples, visualize similarities and possible natural groupings between samples, and analyze the behavior and dispersion of spectral variables. This modeling is given by decomposing the study X matrix into two matrices: scores containing sample information and loadings containing variable information.

Support Vector Machine (SVM)

SVM classification models are designed to identify relevant ion disease-specific biomarkers and classify SARS-CoV-2 positive and negative samples. For each set of techniques and properties to be estimated, the sample set was previously divided into calibration (70%) and prediction (30%) by the Kennard Stone method.[23] The cost (C), margin (ε), and gamma (γ) parameters of the kernel function were optimized.

Results and Discussion

The MALDI FT-ICR MS technique is a widely used tool for studying biological compounds due to its high sensitivity and mass accuracy. Its combination with machine learning can favor molecular diagnostics by using inexpensive, safe, easy, and noninvasive collection samples, such as saliva.[24] Saliva is a bodily fluid produced by the salivary glands, composed of various substances and a rich reservoir of proteins and peptides; in fact, it brings together >3652 proteins and 12 562 peptides. In general, saliva contains molecules of high molecular weight, such as mucins, lysozyme, immunoglobulins, proline-rich protein (PRP), and statherins.[13,25] In this sense, protein digestion is an essential step for MALDI FT-ICR MS analysis since the worked m/z range (200–4000) does not identify high molecular weight proteins. Protein identification is based on m/z measurements for a set of peptides and digestion products of the original protein.[26] However, peptide analysis can present several problems that affect the detector response due to unwanted interference from the matrix or from reagents and other additives used to facilitate protein digestion.[27] Thus, an alternative to eliminate these interfering signals is using C18 spin columns after tryptic digestion, which removes interfering contaminants and releases peptides in MS-compatible solutions, resulting in greater sensitivity and a high-quality spectrum.[28] After the saliva digestion protocol, it was possible to observe the spectral differences between an unprepared saliva sample (Figure A) and another sample after the protein digestion step (Figure B). This allows to make a comparison of the ions detected by the MALDI FT-ICR MS analysis, giving indications of some peptides already identified in saliva studies in the literature (Table ).
Figure 1

MALDI(+) mass spectra of saliva samples with and without sample preparation. (A,B) Mass range m/z 200–2000 (A) before and (B) after tryptic digestion and C18 spin column processing. (C,D) Mass range m/z 1000–2000 (C) before and (D) after tryptic digestion and C18 spin column processing.

Table 1

Peptides Identified in Saliva Samples after Digestion of Proteins and SPE

m/z (M + H+)TIC (total ion current)peptide sequenceparent proteinref
1224.638594.5 × 107GPPQQGGHPRPPAcidic proline-rich phosphoprotein or PRH1Protein (P02810)(30)
1380.749221.0 × 107GPPQQGGHPRPPR(30)
1390.703653.6 × 106GGRPQGPPQGQSPQ(22,29)
1471.736361.1 × 107GRPQGPPQQGGHQQ(30)
1731.897753.7 × 107GPPQQGGHPPPPQGRPQ(22)
MALDI(+) mass spectra of saliva samples with and without sample preparation. (A,B) Mass range m/z 200–2000 (A) before and (B) after tryptic digestion and C18 spin column processing. (C,D) Mass range m/z 1000–2000 (C) before and (D) after tryptic digestion and C18 spin column processing. The digestion protocol was adapted. The trypsin volume (0.05 μg/μL) and the digestion time were evaluated to optimize and speed up the sample preparation process. The volumes used were 5 and 10 μL of trypsin and the digestion times of 2 and 16 h (“overnight”), totaling 4 variations in the digestion process of the saliva sample. The analysis results by MALDI(+)FT-ICR MS after digestion showed no significant spectral differences between the four variations in the digestion protocol (Figure S1) in the m/z range from 1000 to 2200. When analyzing the full scan spectra of the samples that used 10 μL of trypsin for the digestion, these presented a higher ratio of signals detected with a signal-to-noise ratio (S/N) > 4, compared to the samples that used 5 μL of trypsin for the digestion. The samples that had overnight digestion also exhibited more signals, but the 2 h time was satisfactory and sped up the diagnostic process by at least 14 h (Table S2). Additionally, the sample with 10 μL of trypsin and 2 h of digestion showed a higher TIC for all peptide sequences (Table S3). All four samples after digestion showed a different spectrum from the one of the saliva sample without digestion (Figure S1a), in which it was possible to identify the peptide sequence, represented by the ions of m/z 1224; 1380; 1390; 1471; and 1731. They are phosphoprotein derivatives, rich in salivary acid proline (1/2), P02810.[22,30] Human glandular salivary secretions contain several phosphoproteins rich in acidic proline (PRP). These proteins have essential biological functions that provide a protective environment for teeth and appear to have other activities associated with the modulation of bacterial adhesion to oral surfaces.[31] In optimizing the parameters of MALDI analysis, the resolving power and the acquisition time of the samples were evaluated. The size of the generated data set is related to the number of signals in the mass spectrum, which increases according to the resolving power but decreases the acquisition speed. Thus, it is necessary to use an optimal resolving power that does not affect the amount of information acquired faster in the FT-ICR MS spectrum.[32] In the analysis of the raw spectra, we observed in Table that the analysis time for an acquisition at the highest resolution (8 M) was 1.89 min. It generated a greater number of signals with S/N > 4 (43 228), but the total time for the 447 analyses was approximately 14 h. Noise filtering separates component signals from the background originating from the chemical matrix or instrumental interference. Thus, the spectra were corrected by eliminating signals with relative intensity below the noise.[33]
Table 2

Number of Signals Detected and Time of Acquisition of the MALDI FT-ICR Mass Spectra in the Optimization of the Parameters

data acquisition sizenumber of signals with signal/noise > 4%number of signals with relative intensity > % noisetime of acquisition (min)total time acquisition (h)total ion current
8 M43 2281691.8914.11.1 × 108
4 M22 6992341.067.92.1 × 108
2 M11 7292230.695.21.9 × 108
1 M6 4932210.513.81.2 × 108
512 k3 3451690.362.73.9 × 107
After this correction, the number of signals decreased significantly, making the difference between the data set size minimal and the time difference significant. Therefore, the resolution power chosen to perform the analysis was 1 M, which generated 221 signals with relative intensity above the noise and an analysis time of 3.8 h, only 1.1 h more than the fastest analysis of the resolution of 512 k, in which it presented only 169 signals above the noise. After parameter optimization, 149 samples (97 SARS-CoV-2 positives and 52 negatives) were analyzed by MALDI FT-ICR MS (Figure S2). An independent PCA was performed as an exploratory data analysis. The PCA model built with SARS-CoV-2 positive and negative saliva samples showed 96.88% of the total variance, and the PC1 × PC2 × PC3 scores and loadings plot are shown in Figure . From the scores, it is possible to notice a tendency of separation between the two main groups, with the influence of m/z 525.4 and 598.5 in positive samples and 1410.4 and 2242.3 in negative samples.
Figure 2

PCA scores and loadings plot of MALDI(+) spectra of saliva samples positive or negative for SARS-CoV-2.

PCA scores and loadings plot of MALDI(+) spectra of saliva samples positive or negative for SARS-CoV-2. Loadings that present vectors with an angle close to 90° to each other, that is, perpendicular vectors, indicate little or no intercorrelation. That is obeyed for the variables of m/z of 525.4 with 1410.8 and 2242.3, and m/z of 598.5 with 1410.8 and 2242.3. One could suggest that the variables of m/z 525.4 and 598.5 are independent of m/z 1410.8 and 2242.3. Also, they do not impact the variables of m/z 1410.8 and 2242.3. However, loadings that present vectors with angles close to zero among themselves, superimposed, present a strong intercorrelation. That explains the behavior of variables of m/z 525.4 with 598.5 in m/z, m/z 1410.8, and 2242.3, in which the variation of m/z 525.4 influences the m/z 598.5 and the variation of m/z 1410.8 influences the m/z 2242.3. In the graph of the most important variables for the SVR nonlinear classification model (Figure ), it is noted that the m/z of 525.4 with 598.5 showed higher intensity in the positive samples, while the m/z 1410.8 and 2242.3 showed higher intensity in negative samples. The dependence and independence of the respective variables are justified since the m/z vectors below 1000 influence positive samples and m/z above 1000 influence negative ones.
Figure 3

Characteristic signal in the individual MALDI(+) spectra among SARS-CoV-2 positive samples (green) versus negative samples (gray).

Characteristic signal in the individual MALDI(+) spectra among SARS-CoV-2 positive samples (green) versus negative samples (gray). Subsequently, ML analysis was employed to identify the best-performing model for distinguishing positive and negative samples for COVID-19. Despite the differences shown, no specific biomarkers for any positive or negative sample groups were found in the analyses performed by MALDI FT-ICR MS. The ML approach was then applied to analyze the data from the entire range of the selected m/z pattern, applying the FD (Fisher’s Discriminant) for the variable selection. Two models showed better results than the built models, one with a FD of 30% and the other with 85%. When training the different models and selecting variables with a FD above 30%, Model 1 gave better performance, despite having a greater number of selected variables (780), with a sensitivity of 100%, specificity of 87.5%, and accuracy of 95.6%. However, model 2, which used 85% of the FD, selected only 2 variables, m/z 525.4 (FD = 0.70) and m/z 1410.8 (FD = 0.58) and presented a sensitivity of 96.6%, specificity of 68.7% and accuracy of 86.7%. The most important variables were the same and presented the same profile as observed in the PCA model. Herein, variables with higher m/z show a higher correlation with negative samples, while variables with lower m/z show a strong correlation with positive samples. This profile can also be seen in the raw spectra and after processing (Figure S2) either the positive or negative samples for SARS-CoV-2. There are more signals and a greater relative intensity at higher m/z for negative sample spectra. In comparison, positive samples have a higher intensity and lower signal numbers for m/z. The models gave an excellent performance, and it is possible to discriminate the positive and negative classes since there is little spectral variance between samples of the same class. The results of model 1 (Table ) show a FNR (false negative rate) of 0.0%, making the methodology used efficient in distinguishing positive samples for SARS-CoV-2 from negative ones in saliva in a noninvasive matrix, avoiding false-negative diagnoses.
Table 3

Parameter Performance of SVM Models by MALDI Spectra with Fisher’s Discriminant (FD)a

modelFD (%)varselgroupTPTNFPFNSENS (%)SPEC (%)ACC (%)PPV (%)PNV (%)FPR (%)FNR (%)MCC (%)
130780cal20410710100.099.199.799.5100.00.10.099.3
   test874260100.087.595.693.6100.012.50.090.5
2852cal20410800100.0100.0100.0100.0100.00.00.0100.0
   test843315396.668.886.784.991.731.33.570.7

FD (Fisher’s Discriminant); varsel (selected variables); TP (true positive); TN (true negative); FP (false positive); FN (false negative); SENS (sensitivity); ACC (accuracy); SPEC (specificity); PPV (prevalence positive value); PNV (prevalence negative value); FPR (false positive rate); FNR (false negative rate); MCC (Matthews correlation coefficient).

FD (Fisher’s Discriminant); varsel (selected variables); TP (true positive); TN (true negative); FP (false positive); FN (false negative); SENS (sensitivity); ACC (accuracy); SPEC (specificity); PPV (prevalence positive value); PNV (prevalence negative value); FPR (false positive rate); FNR (false negative rate); MCC (Matthews correlation coefficient). After obtaining the supervised and unsupervised classification models, it was noted that signals of m/z below 1000 have a higher spectral intensity in positive samples and m/z above 1000 have a higher spectral intensity in negative samples. Figure represents the most important m/z for SVM models. A positive (green) and a negative (gray) sample were used. The higher intensities in m/z below 1000 are correlated with positive samples, and there is an intensity predominance for negative samples in m/z above 1000. The greater importance of variables was with lower m/z for positive samples (525.4 and 598.5) and higher m/z for negative samples (1410.8 and 2242.3). This can also be seen in the results of the PCA model, Figure . On the basis of the above results, we compared other studies within the literature that also applied multivariate data analysis or ML to data obtained by the MALDI MS technique to study biological samples from patients suspected of being contaminated by SARS-CoV-2. Rocca et al. (2020) obtained preliminary results for detecting protein profiles directly from SARS-CoV-2 positive and negative nasopharyngeal swab samples, where accuracy, sensitivity, and specificity were 67.66%, 61.76%, and 71.72%, respectively.[34] In the study by Nachtigall et al. (2020), 362 nasal swab samples without prior preparation were analyzed using the MALDI–MS with ML. The model showed higher accuracy (93.9%) with 7% false positives and 5% false negatives.[21] Deulofeu et al. (2021) developed a methodology that combines MALDI-TOF MS with ML. The best model showed high performance in terms of accuracy, sensitivity, and specificity, reaching values greater than 90%.[19] Unlike the studies mentioned above, the following investigations employed blood samples (plasma or serum). Lazari et al. (2021) obtained optimized results with an accuracy of 92%, sensitivity of 93%, and specificity of 92% in the analyzed data set.[35] Gomila et al. (2021) performed MALDI-TOF MS analysis of serum peptides in patients classified as mild, severe, critical, and healthy control. The ML approach was rated with 100% accuracy in clinically stable patients according to the severity and correctly predicted the outcome of unstable patients in all cases.[36] The study by Wan et al. (2021) developed a new approach to detect asymptomatic SARS-CoV-2 infection using metabolic patterns combined with ML. This approach provided 93.4% accuracy with only 5% false-negative and 7% false-positive rates.[37] Despite the excellent results found in the literature relating the MALDI MS technique with ML in biological samples from patients suspected of having COVID-19, the protocols for collecting these samples are still very invasive, uncomfortable, and have some risk of viral transmission for the operator who performs the procedure. Thus, the use of saliva as a biological sample has already aroused interest in diagnostic studies for COVID-19, such as in the study by Chivte et al. (2021) and Costa et al. (2021), which identified potential biomarkers of infection by SARS-CoV-2.[11,24] The high accuracy of mass measurement of the FT-ICR MS reduces the probability of an incorrect assignment of a protein in the peptide mass profile. Additionally, the high resolution of the equipment provides a greater amount of variables that make the separation more accurate.[38] The absence of studies using MALDI FT-ICR MS in rapid diagnosis of biological samples makes this work of great value for developing new techniques with results concerning sensitivity, specificity, and accuracy superior to other techniques previously studied. However, the uniqueness of MS profiles among individuals requires the development of sophisticated biostatistical analysis, such as machine learning approaches, making our study promising in advancing research using the MALDI FT-ICR MS technique combined with ML.

Conclusions

SVM is efficient for diagnosing COVID-19 with a FNR (false negative rate) of 0.0%. Thus, the methodology created, besides being noninvasive, avoids false-negative diagnoses. In addition, the SVM had a sensitivity of 100%, specificity of 87.5%, and accuracy of 95.6%. Signals with m/z below 1000 have higher intensity in positive samples, and signals with m/z above 1000 have higher intensity in negative samples for the PCA and SVM models. The rapid screening method for patients with suspected COVID-19 developed in this work allows for a simple, safe, and noninvasive collection. The MALDI FT-ICR MS technique possessed high sensitivity and specificity for COVID-19 detection combined with machine learning methods. In optimizing the methodology, using 10 μL of trypsin and 2 h of digestion showed a higher TIC for all peptide sequences, streamlining the sample preparation process. As for the MALDI FT-ICR MS analysis, 1 M resolving power showed an excellent ratio of detected signals (221 signals) with a relative intensity above the noise and an individual analysis time of half a minute.
  32 in total

Review 1.  Potential uses of human salivary protein and peptide analysis in the diagnosis of disease.

Authors:  Sausan Al Kawas; Zubaidah H A Rahim; David B Ferguson
Journal:  Arch Oral Biol       Date:  2011-07-19       Impact factor: 2.633

2.  A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding.

Authors:  M M Bradford
Journal:  Anal Biochem       Date:  1976-05-07       Impact factor: 3.365

3.  Profiling human saliva endogenous peptidome via a high throughput MALDI-TOF-TOF mass spectrometry.

Authors:  Chun-Ming Huang; Wenhong Zhu
Journal:  Comb Chem High Throughput Screen       Date:  2009-06       Impact factor: 1.339

4.  Potential preanalytical and analytical vulnerabilities in the laboratory diagnosis of coronavirus disease 2019 (COVID-19).

Authors:  Giuseppe Lippi; Ana-Maria Simundic; Mario Plebani
Journal:  Clin Chem Lab Med       Date:  2020-06-25       Impact factor: 3.694

5.  Mass Spectrometric Identification of SARS-CoV-2 Proteins from Gargle Solution Samples of COVID-19 Patients.

Authors:  Christian Ihling; Dirk Tänzler; Sven Hagemann; Astrid Kehlen; Stefan Hüttelmaier; Christian Arlt; Andrea Sinz
Journal:  J Proteome Res       Date:  2020-06-23       Impact factor: 4.466

Review 6.  Protein-Based Salivary Profiles as Novel Biomarkers for Oral Diseases.

Authors:  Alejandro I Lorenzo-Pouso; Mario Pérez-Sayáns; Susana B Bravo; Pía López-Jornet; María García-Vence; Manuela Alonso-Sampedro; Javier Carballo; Abel García-García
Journal:  Dis Markers       Date:  2018-11-07       Impact factor: 3.434

7.  Detection of SARS-CoV-2 Infection in Human Nasopharyngeal Samples by Combining MALDI-TOF MS and Artificial Intelligence.

Authors:  Meritxell Deulofeu; Esteban García-Cuesta; Eladia María Peña-Méndez; José Elías Conde; Orlando Jiménez-Romero; Enrique Verdú; María Teresa Serrando; Victoria Salvadó; Pere Boadas-Vaello
Journal:  Front Med (Lausanne)       Date:  2021-04-01

8.  Noninvasive Diagnostic for COVID-19 from Saliva Biofluid via FTIR Spectroscopy and Multivariate Analysis.

Authors:  Márcia H C Nascimento; Wena D Marcarini; Gabriely S Folli; Walter G da Silva Filho; Leonardo L Barbosa; Ellisson Henrique de Paulo; Paula F Vassallo; José G Mill; Valério G Barauna; Francis L Martin; Eustáquio V R de Castro; Wanderson Romão; Paulo R Filgueiras
Journal:  Anal Chem       Date:  2022-01-25       Impact factor: 6.986

9.  Low performance of rapid antigen detection test as frontline testing for COVID-19 diagnosis.

Authors:  Anaïs Scohy; Ahalieyah Anantharajah; Monique Bodéus; Benoît Kabamba-Mukadi; Alexia Verroken; Hector Rodriguez-Villalobos
Journal:  J Clin Virol       Date:  2020-05-21       Impact factor: 3.168

10.  Prognostic accuracy of MALDI-TOF mass spectrometric analysis of plasma in COVID-19.

Authors:  Lucas Cardoso Lazari; Fabio De Rose Ghilardi; Livia Rosa-Fernandes; Diego M Assis; José Carlos Nicolau; Veronica Feijoli Santiago; Talia Falcão Dalçóquio; Claudia B Angeli; Adriadne Justi Bertolin; Claudio Rf Marinho; Carsten Wrenger; Edison Luiz Durigon; Rinaldo Focaccia Siciliano; Giuseppe Palmisano
Journal:  Life Sci Alliance       Date:  2021-06-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.