| Literature DB >> 34408169 |
Driss Lahlou Kitane1, Salma Loukman2, Nabila Marchoudi2, Alvaro Fernandez-Galiana3, Fatima Zahra El Ansari2, Farah Jouali2, Jamal Badir4, Jean-Luc Gala4, Dimitris Bertsimas1, Nawfal Azami5, Omar Lakbita6, Omar Moudam6, Rachid Benhida7,8, Jamal Fekkak2.
Abstract
The coronavirus pandemic, which appeared in Wuhan, China, in December 2019, rapidly spread all over the world in only a few weeks. Faster testing techniques requiring less resources are key in managing the pandemic, either to enable larger scale testing or even just provide developing countries with limited resources, particularly in Africa, means to perform tests to manage the crisis. Here, we report an unprecedented, rapid, reagent-free and easy-to-use screening spectroscopic method for the detection of SARS-CoV-2 on RNA extracts. This method, validated on clinical samples collected from 280 patients with quantitative predictive scores on both positive and negative samples, is based on a multivariate analysis of FTIR spectra of RNA extracts. This technique, in agreement with RT-PCR, achieves 97.8% accuracy, 97% sensitivity and 98.3% specificity while reducing the testing time post RNA extraction from hours to minutes. Furthermore, this technique can be used in several laboratories with limited resources.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34408169 PMCID: PMC8373901 DOI: 10.1038/s41598-021-95568-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Sequential FT-IR-based assay for detection of SARS-CoV-2: sample collection, RNA extraction, FTIR analysis and then machine learning.
Figure 2Clinical data of Covid-19 tested patients. (a) Ct values distribution based on the number of samples (b) Distribution of symptomatic and asymptomatic positive patients, (c) Distribution of patients by age. (d) Distribution of patients by sex.
Comparison of predicting performance on the testing set using Logistic Regression.
| Spectral region (cm−1) | Spectra signal | Accuracy (%) | Sensitivity (%) | Specificity (%) | AUC (× 100) | Accuracy (%) | Sensitivity (%) | Specificity (%) | AUC (× 100) |
|---|---|---|---|---|---|---|---|---|---|
| Out-of-sample | In-sample | ||||||||
| 600 to 4500 | Raw | 95.7 | 94.1 | 96.7 | 98.4 | 100 | 100 | 100 | 100 |
| Baseline Corrected | 96.8 | 94.1 | 98.3 | 99.8 | 100 | 100 | 100 | 100 | |
| 1st Der | 96.8 | 97 | 98.3 | 99.9 | 100 | 100 | 100 | 100 | |
| 2nd Der | 97.8 | 97 | 98.3 | 99.1 | 100 | 100 | 100 | 100 | |
| 900 to 1800 | Raw | 83.1 | 70.5 | 86.5 | 89 | 92.1 | 82.3 | 100 | 94.3 |
| Baseline Corrected | 84.2 | 73.5 | 86.5 | 90.5 | 94.7 | 88.2 | 100 | 95.5 | |
| 1st Der | 93.6 | 94.1 | 92.4 | 97.7 | 100 | 100 | 100 | 100 | |
| 2nd Der | 94.7 | 94.1 | 95.4 | 97.2 | 100 | 100 | 100 | 100 | |
Figure 3Detection of SARS-CoV-2 with multivariable analysis. (a) Raw Spectra (b) Sample of 2nd derivative of Savitzky–Golay smoothened spectra of positive and negative samples. (c, d, e) First three latent variables of PLS-DA. (f) Coefficients of variables selected by sparse classification algorithm of second derivative of raw spectra (g, h) Zooms on regions indicated by sparse classification. (i)Projection of the 280 spectra used according to the first two latent variables obtained. (j) Projection of the 280 spectra used according to the first three latent variables obtained.
Tentative assignment of wavenumber markers used by sparse classification.
| Wave number (cm−1) | Tentative Assignments[ |
|---|---|
| 638, 665 | Guanine breathing mode |
| 878 | Out-of-plane vibrations of nucleobases |
| 901, 940 | Ribose phosphate backbone |
| 973 | C–O and C–C ribose |
| 1101 | Symmetric stretching P–O–C, RNA ν(C–O) ribose |
| 1182 | C–O and phosphate vibrations |
| 1251 | P = O, PO2−asym |
| 1416 | Stretching C–N, N–H and C–H deformation |
| 1521 | C nucleobase both in RNA ss or ds, amide II |
| 1689 | C6=O6 of G in ds, Amid I, C=O Guanine and N–H deformation, νasym(C2=O) vibration in RNA |
| 1913, 2086 | bands of second order |
| 2218, 2248, 2334, 2389, 2441, 2489, 2709, 2845, 2882, 2952 | Stretch C–H, N–H |
| 3611, 3436, 3659, 3748, 3927 | Stretch N–H asym and O–H asym |
Synthetic RNA viruses used to assess the specificity. (a) Anti-sense strand, (b) sense strand.
| Name | Accession | Virus Type | Length |
|---|---|---|---|
| Twist synthetic Influenza H1N1 (2009) | NC_026431, NC_026431, NC_026431, NC_026431, NC_026431, NC_026431, NC_026431 | ssRNA (−)a | 13,158 |
| Twist synthetic Influenza H3N2 | NC_007366, NC_007367, NC_007368, NC_007369, NC_007370, NC_007371, NC_007372, NC_007373, | ssRNA (−) | 13,627 |
| Twist synthetic Influenza B | NC_002204, NC_002205, NC_002206, NC_002207, NC_002208, NC_002209, NC_002211 | ssRNA (−) | 14,452 |
| Twist synthetic human Bocavirus | MG953830.1 | ssDNA | 5164 |
| Twist synthetic human Enterovirus 68 | NC_038308.1 | ssRNA (+)b | 7367 |
| Twist synthetic human Rhinovirus 89 | NC_001617.1 | ssRNA (+) | 7152 |
| Twist synthetic Mumps virus | NC_002200.1 | ssRNA (−) | 15,384 |
| Twist synthetic human Parainfluenza virus 1 | NC_003461.1 | ssRNA (−) | 15,600 |
| Twist synthetic Measles virus | NC_001498.1 | ssRNA (−) | 15,894 |
| Twist synthetic human Parainfluenza virus 4 | NC_21928.1 | ssRNA (−) | 17,052 |
| Twist synthetic human Coronavirus 4 | NC_0022645.1 | ssRNA (+) | 27,317 |
| Twist synthetic human Coronavirus NL63 | NC_005831.2 | ssRNA (+) | 27,553 |
| Twist synthetic human Coronavirus OC43 | NC_006213.1 | ssRNA (+) | 30,741 |
| Twist synthetic human Coronavirus Tor 2 | NC_004718.3 | ssRNA (+) | 29,751 |
| Twist synthetic human Coronavirus 2c EMC/2012 | JX869059.2 | ssRNA (+) | 30,119 |
Variables used to separate positives and negative samples with 100% accuracy.
| Wave number (cm−1) | Tentative assignments[ |
|---|---|
| 1038 | ν(C–O) ribose. symmetric stretching P–O–C |
| 1074 | νasym(PO2−) symmetric and ν (C-O) ribose |
| 1131 and 1172 | RNA ν (C=O). ribose |
| 1172 and 1174 | ν (C-O) and phosphate vibrations (non-H-bonded mode of C–OH) x |
| 1210 | νasym (PO2−) |
| 1640. 1670. 1673 | Amid I.νa (C2=O).νasym(C5=O).νasym (C6=O). can overlap with water (ν2) |
| 1760 | ν carbonyl stretching |
Detection limit: distribution of predictions on positive samples after training depending on samples’ concentration.
| Concentration (copies/ul) | Number of samples | Number of correct predictions |
|---|---|---|
| 25 | 3 | 3 |
| 10 | 3 | 3 |
| 5 | 3 | 1 |
| 3 | 3 | 0 |
| 0.5 | 5 | 0 |
Characteristics of different Sars-CoV-2 detection techniques. (a) Authors' estimates. (b) The matrix solution was prepared with α-cyano-hydroxy-cinnamic acid (CHCA) at 1% in acetonitrile/0.1% trifluoroacetic acid (1:1). c) RT-PCR Limit of Detection in copies/mL[44].
| ATR-FTIR | CRISPR–Cas12-based[ | MALDI-MS[ | RT-PCR | |
|---|---|---|---|---|
| Test Ct range | 11 to 39 | 20 to 37 | 16 to 37 | – |
| Accuracy | 98% | 98% | 94% | – |
| Sensitivity | 97% | 95% | 95% | – |
| Specificity | 98% | 100% | 93% | – |
| LoD (in copies/μl) | 10 | 10 | NA | 10 to 511c |
| Sample-to-result time (min) | 21 | 30–40 | 25-30a | 120 |
| Consumables | RNA extraction kits | RNA extraction & Crisper kits | CHCAb | RNA extraction & PCR kits |
| Equipment | FTIR Spectrometer, RNA extractor | RT-Lamp, Lateral flow Strip, RNA extractor | Mass Spectrometer | PCR machine, RNA extractor |