| Literature DB >> 32792638 |
Marfran C D Santos1, Joelma D Monteiro2,3, Josélio M G Araújo2,3, Kássio M G Lima4.
Abstract
Significant attempts are being made worldwide in an attempt to develop a tool that, with a simple analysis, is capable of distinguishing between different arboviruses. Herein, we employ molecular fluorescence spectroscopy as a sensitive and specific rapid tool, with simple methodology response, capable of identifying spectral variations between serum s<span class="Chemical">amples with or without the dengue or <span class="Species">chikungunya viruses. Towards this, excitation emission matrices (EEM) of clinical samples from patients with dengue or chikungunya, in addition to uninfected controls, were separated into a training or test set and analysed using multi-way classification models such as n-PLSDA, PARAFAC-LDA and PARAFAC-QDA. Results were evaluated based on calculations of accuracy, sensitivity, specificity and F score; the most efficient model was identified to be PARAFAC-QDA, whereby 100% was obtained for all figures of merit. QDA was able to predict all samples in the test set based on the scores present in the factors selected by PARAFAC. The loadings obtained by PARAFAC can be used in future studies to prove the direct or indirect relationship of spectral changes caused by the presence of these viruses. This study demonstrates that molecular fluorescence spectroscopy has a greater capacity to detect spectral variations related to the presence of such viruses when compared to more conventional techniques.Entities:
Mesh:
Year: 2020 PMID: 32792638 PMCID: PMC7426909 DOI: 10.1038/s41598-020-70811-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Excitation–emission molecular fluorescence spectra obtained for clinical serum samples: (a) uninfected; (b) with DENV; and, (c) with CHIKV. Rayleigh and Raman were removed from the spectra. The excitation/emission wavelength range was 250–320 nm for excitation and 240–800 nm for emission, with steps of 10 and 1 nm, respectively.
Correct classification rates obtained for n-PLSDA, PARAFAC-LDA and PARAFAC-QDA classification models between Uninfected, DENV and CHIKV.
| Model | Class | CC training (%) | CC test (%) |
|---|---|---|---|
| n-PLSDA (6)a | Uninfected | 78.57 | 66.66 |
| DENV | 71.42 | 83.33 | |
| CHIKV | 92.85 | 100.0 | |
| PARAFAC-LDA (3)b | Uninfected | 85.71 | 66.66 |
| DENV | 92.85 | 100.0 | |
| CHIKV | 100.0 | 100.0 | |
| PARAFAC-QDA (3)b | Uninfected | 100.0 | 100.0 |
| DENV | 100.0 | 100.0 | |
| CHIKV | 100.0 | 100.0 |
The CC% represents the percentage of samples correctly classified, considering their true classes. The calculation is made based on Eq. (8) (see “Quality performance”), where ε1 represents class 1 errors (class of interest) and ε2 represents class 2 errors (all samples from another class).
aNumber of latent variables; bnumber of parallel factors.
Figure 2(a) Canonical scores of the n-PLSDA for the 2 main latent variables; and, (b) predicted class values. The points refer to clinical samples of uninfected serum (blue circle), with DENV (red square), and with CHIKV (black square).
Figure 3(a) Canonical scores of the PARAFAC; (b) predicted class values by PARAFAC-LDA; and, (c) predicted class values by PARAFAC-QDA. The points refer to clinical samples of uninfected serum (blue circle), with DENV (red square), and with CHIKV (black square).
Figures of merit for the models n-PLSDA, PARAFAC-LDA and PARAFAC-QDA applied to emission-excitation matrices of serum samples.
| Figures of merit | Models | ||
|---|---|---|---|
| n-PLSDA | PARAFAC-LDA | PARAFAC-QDA | |
| Accuracy | 78.57 | 88.88 | 100.0 |
| Sensitivity | 87.50 | 100.0 | 100.0 |
| Specificity | 66.66 | 66.66 | 100.0 |
| F score | 75.67 | 80.0 | 100.0 |
| Accuracy | 81.25 | 88.88 | 100.0 |
| Sensitivity | 80.0 | 83.33 | 100.0 |
| Specificity | 83.33 | 100.00 | 100.0 |
| F score | 81.63 | 90.90 | 100.0 |
| Accuracy | 83.33 | 100.0 | 100.0 |
| Sensitivity | 75.0 | 100.0 | 100.0 |
| Specificity | 100.0 | 100.0 | 100.0 |
| F score | 85.71 | 100.0 | 100.0 |
The calculations of the figures of merit are based on Eqs. (9), (10), (11) and (12) (see “Quality performance”), and considers only the test step.
Figure 4Scores and loadings on FAC 1, FAC 2 and FAC 3 selected in PARAFAC. (a) Scores; (b) loadings for excitation; and, (c) loadings for emission. In the PARAFAC model, a decomposition of the data is condensed into trilinear factors (or FAC). Each FAC consists of a scoring vector and two loading vectors.