| Literature DB >> 35181702 |
Emilio Gomez-Gonzalez1,2, Alejandro Barriga-Rivera3,4, Beatriz Fernandez-Muñoz5, Jose Manuel Navas-Garcia6, Isabel Fernandez-Lizaranzu3,7, Francisco Javier Munoz-Gonzalez3, Ruben Parrilla-Giraldez8, Desiree Requena-Lancharro3, Pedro Gil-Gamboa3, Cristina Rosell-Valle7,5, Carmen Gomez-Gonzalez9,10, Maria Jose Mayorga-Buiza7,11,12, Maria Martin-Lopez7,5, Olga Muñoz13, Juan Carlos Gomez-Martin13, Maria Isabel Relimpio-Lopez12,14,15, Jesus Aceituno-Castro13,16, Manuel A Perales-Esteve17, Antonio Puppo-Moreno9,10, Francisco Jose Garcia-Cozar18, Lucia Olvera-Collantes19, Raquel Gomez-Diaz7, Silvia de Los Santos-Trigo20, Monserrat Huguet-Carrasco21, Manuel Rey22, Emilia Gomez23, Rosario Sanchez-Pernaute5, Javier Padillo-Ruiz7,12,24, Javier Marquez-Rivas7,12,25,26.
Abstract
Effective testing is essential to control the coronavirus disease 2019 (COVID-19) transmission. Here we report a-proof-of-concept study on hyperspectral image analysis in the visible and near-infrared range for primary screening at the point-of-care of SARS-CoV-2. We apply spectral feature descriptors, partial least square-discriminant analysis, and artificial intelligence to extract information from optical diffuse reflectance measurements from 5 µL fluid samples at pixel, droplet, and patient levels. We discern preparations of engineered lentiviral particles pseudotyped with the spike protein of the SARS-CoV-2 from those with the G protein of the vesicular stomatitis virus in saline solution and artificial saliva. We report a quantitative analysis of 72 samples of nasopharyngeal exudate in a range of SARS-CoV-2 viral loads, and a descriptive study of another 32 fresh human saliva samples. Sensitivity for classification of exudates was 100% with peak specificity of 87.5% for discernment from PCR-negative but symptomatic cases. Proposed technology is reagent-free, fast, and scalable, and could substantially reduce the number of molecular tests currently required for COVID-19 mass screening strategies even in resource-limited settings.Entities:
Mesh:
Year: 2022 PMID: 35181702 PMCID: PMC8857323 DOI: 10.1038/s41598-022-06393-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Schematic of the hyperspectral imaging assay. (a) Three different types of samples were analyzed. Top, samples containing spike pseudotyped lentiviral particles (i.e., synthetic coronaviruses). Middle, human saliva of SARS-CoV-2 suspects. Bottom, inactivated nasopharyngeal swabs for SARS-CoV-2 PCR tests. (b) Several fluid droplets were placed on a supporting plate. (c) The samples were illuminated using two halogen lamps. Sub-surface scattering is illustrated by the red arrows. (d) A sliding sensor recorded hyperspectral images the VNIR range. (e) The reflectance spectrum of each pixel within the hyperspectral matrix was then digitally pre-processed to obtain the pseudo-absorbance spectra. (f) Spectral features descriptors were obtained from pixel spectra for analysis. (g) A feed-forward neural network (FFNN) was trained[37] to detect viral content from spectral features and output a pixel-based binary classification. (h) A partial least square-discriminant analysis (PLS-DA) was performed[37] using pixel reflectance spectra.
Experiment 1: Sample distribution of synthetic viral models (SARS-CoV-2 spike pseudotyped lentiviral particles (S-LP) and lentiviral particles pseudotyped with the G protein of the vesicular stomatitis virus (G-LP)) and their respective negative controls (S-LP Ctrl and G-LP Ctrl) in phosphate buffered solution (PBS) and in artificial saliva (AS) at droplet and pixel levels.
| PBS | AS | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| G-LP | G-LP Ctrl | S-LP | S-LP Ctrl | G-LP | G-LP Ctrl | S-LP | S-LP Ctrl | ||
| Droplet | C4 | 24 | 2 | 4 | 4 | 15 | 3 | 4 | 4 |
| C3 | 9 | 2 | 4 | 4 | 14 | 3 | 4 | 4 | |
| C2 | 9 | 2 | 4 | 4 | 14 | 3 | 4 | 4 | |
| C1 | 9 | 2 | 4 | 4 | 15 | 3 | 4 | 4 | |
| Total | 51 | 8 | 16 | 16 | 58 | 12 | 16 | 16 | |
| Pixel | C4 | 25,504 | 1605 | 4636 | 4725 | 13,381 | 2642 | 5065 | 5076 |
| C3 | 10,532 | 1600 | 4703 | 4243 | 12,755 | 2651 | 4535 | 4373 | |
| C2 | 8536 | 1061 | 3701 | 3091 | 11,315 | 1847 | 4084 | 4534 | |
| C1 | 8205 | 1134 | 3203 | 3519 | 16,594 | 2826 | 4742 | 4989 | |
| Total | 52,777 | 5400 | 16,243 | 15,578 | 54,045 | 9966 | 18,426 | 18,972 | |
Concentrations were C1 = 800 TU µL−1, C2 = 1500 TU µL−1, C3 = 3000 TU µL−1 and C4 = 4000 TU µL−1. Data from G-LP samples were obtained from a previous study[37].
Figure 2Experiment 1: Spectral discrimination of SARS-CoV-2 spike pseudotyped lentiviral particles. (a) Mean pseudo-absorbance (PA) pixel spectra of spike pseudotyped lentiviral particles in phosphate buffered solution (S-LPPBS) for high concentration (HC = 4.0 × 103 TU µL−1) and low concentration (LC = 0.8 × 103 TU µL−1) and their respective negative controls (CtrlPBS). (b) Mean pseudo-absorbance (PA) pixel spectra of spike pseudotyped lentiviral particles in artificial saliva (S-LPAS) for high concentration (HC = 4.0 × 103 TU µL−1) and low concentration (LC = 0.8 × 103 TU µL−1) and their respective negative controls (CtrlAS). (c) Overall score value of the tenfold cross-validation essay performed in partial least square-discriminant analysis (PLS-DA Model 1) in both preparations (per pixel). Dashed lines represent their linear correlations. (d) Scatter plot of the two latent variables (V1 and V2) used in the PLS-DA Model 1 for different viral concentrations. Brown and green dots represent positive and control samples respectively. (e,f) Spectral feature descriptor (SFD), computed in the band 483–610 nm for S-LP and G-LP samples in phosphate buffered solution (PBS), artificial saliva (AS), and the corresponding culture media as control (Ctrl). ***p-value = 0 using the Wilcoxon rank sum test. Outliers were removed for clarity.
Experiment 2: Sample distribution of nasopharyngeal exudates at different levels (total numbers of patients, droplets and pixels) among the experimental Training (Tra), Validation (Val), and Test groups (Trial 1).
| Pixel | Droplet | Patient | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| H | M | L | H | M | L | H | M | L | ||||||
| PLS-DA | 144,426 | 35,715 | 30,545 | 808,184 | 78 | 24 | 18 | 150 | 13 | 4 | 3 | 20 | 25 | |
| 50,264 | 32,902 | 26,579 | 146,440 | 30 | 18 | 18 | 96 | 5 | 3 | 3 | 11 | 16 | ||
| FFNN | 111,676 | 27,924 | 21,498 | 744,855 | 60 | 18 | 12 | 108 | 10 | 3 | 2 | 15 | 18 | |
| 32,750 | 7791 | 9047 | 63,329 | 18 | 6 | 6 | 42 | 3 | 1 | 1 | 5 | 7 | ||
| 50,264 | 32,902 | 26,579 | 146,440 | 30 | 18 | 18 | 96 | 5 | 3 | 3 | 11 | 16 | ||
Positive (Pos) and negative (Neg) cases were determined by qRT-PCR. Positive cases include three viral load levels (H = 106 copies mL−1, M = 104 copies mL−1 and L = 102 copies mL−1). Note that classification algorithms provide results as (per-pixel, per-droplet and per-patient) ‘positive’ (with any level of viral load) or ‘negative’ assignations.
PLS-DA partial least square-discriminant analysis, FFNN feed-forward neural network.
Figure 3Experiment 2: Analysis of SARS-CoV-2 nasopharyngeal exudates. (a) Mean pseudo-absorbance (PA) pixel spectra from positive (viral loads H = 106 copies mL−1 (high), M = 104 copies mL−1 (medium) and L = 102 copies mL−1 (low)) and negative cases of SARS-CoV-2. In addition, the panel shows the mean pixel spectra of the supporting plate (background). (b) Value of the spectral feature descriptor (SFD) in the band between 870 and 910 nm for all cases. Positive (Pos) box includes all H, M and L samples. (c–h) Classification results obtained using partial least square-discriminant analysis (PLS-DA Model 2, Trial 1). (i–n) Classification results obtained using a feed-forward neural network (FFNN, Trial 1). (c,i) Median value of the output used for pixel classification. Positive and negative samples were determined by qRT-PCR. ***p-value = 0 using Wilcoxon rank-sum test. The blue dashed line illustrates the classification threshold. (d,j) Receiver operating characteristic (ROC) curves from the pixel classification. (e,k) Example of the resulting pixel classification in a droplet. Red and green pixels were classified as positive and negative respectively. Note red and green backgrounds indicate a positive sample and its negative control. (f,l) ROC curves obtained from droplet classifications. (g,m) Example of the classification of several droplets from a positive (red background) and a negative (green background) patient. (h,n) ROC curves obtained from patient diagnosis. Outliers were removed for clarity.
Experiment 2: Values of sensitivity (SE), specificity (SP) and area under the receiving operating characteristic (AUROC) curve obtained at per-patient classification from both types of analysis, partial least square-discriminant analysis (PLS-DA Models 2 and 3) and the feed-forward neural network (FFNN), of inactivated nasopharyngeal exudates using the same patient sets (Trial 1).
| PLS-DA (Model 2) (per-patient, from per-pixel classification) | FFNN (per-patient, from per-pixel classification) | PLS-DA (Model 3) (per-patient, from patient-averaged spectra) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| SE (%) | SP (%) | AUROC | SE (%) | SP (%) | AUROC | SE (%) | SP (%) | AUROC | |
| Pixel | 83.3 | 78.8 | 0.88 | 83.5 | 79.2 | 0.88 | – | – | – |
| Droplet | 97.0 | 89.6 | 0.95 | 97.0 | 88.5 | 0.95 | – | – | – |
| Patient | 100.0 | 87.5 | 0.98 | 100.0 | 87.5 | 0.93 | 90.9 | 87.5 | 0.97 |
Figure 4Experiment 3: Spectral information from SARS-CoV-2 in fresh saliva specimens. (a) Mean (± std) of the pseudo-absorbance pixel spectra from fresh saliva samples. Positive (pos) and negative (neg) cases were determined by PCR test. (b) Scatterplot of the two latent variables obtained in partial least square-discriminant analysis (PLS-DA) Model 1 from the pixel spectra. Positive and negative PCR tests are brown and green dots respectively. (c) Value of the spectral feature descriptor (SFD) obtained between 483 and 610 nm (grey band in (a)) for positive and negative cases. (d) Value of the spectral descriptor SFD obtained between 407 and 470 nm for positive and negative cases. (c,d) Both panels include differentiation between male and female pixel sets. ***p-value = 0 using the Wilcoxon rank sum test. Outliers were removed for clarity.
Figure 5Summary of the three performed experiments. Experiment 1: Analysis of two synthetic models of SARS-CoV-2 (S-LP and G-LP) and their negative controls, in four levels of concentration (viral load), in two biofluids (phosphate buffered solution and artificial saliva). Experiment 2: Analysis of nasopharyngeal exudate samples of SARS-CoV-2-positive symptomatic patients (in three viral loads) and negative symptomatic controls. Experiment 3: Analysis of fresh saliva samples of SARS-CoV-2-positive asymptomatic patients and negative asymptomatic controls. ROI region of interest, PCA principal component analysis, S-LP lentiviral particles pseudotyped with the SARS-CoV-2 Spike protein, G-LP lentiviral particles pseudotyped with the vesicular stomatitis virus G protein, PLS-DA partial least square-discriminant analysis, FFNN feed-forward neural network, SFD spectral feature descriptor, qRT-PCR quantitative reverse transcription polymerase chain reaction (PCR).