| Literature DB >> 32808759 |
Benjie Shan1,2, Yoav Y Broza3, Wenjuan Li1, Yong Wang2, Sihan Wu1, Zhengzheng Liu1, Jiong Wang4, Shuyu Gui4, Lin Wang5, Zhihong Zhang6, Wei Liu7, Shoubing Zhou1, Wei Jin1, Qianyu Zhang1, Dandan Hu1, Lin Lin1,2, Qiujun Zhang1, Wenyu Li1, Jinquan Wang8, Hu Liu1, Yueyin Pan1,2, Hossam Haick3.
Abstract
This article reports on a noninvasive approach in detecting and following-up individuals who are at-risk or have an existing COVID-19 infection, with a potential ability to serve as an epidemic control tool. The proposed method uses a developed breath device composed of a nanomaterial-based hybrid sensor array with multiplexed detection capabilities that can detect disease-specific biomarkers from exhaled breath, thus enabling rapid and accurate diagnosis. An exploratory clinical study with this approach was examined in Wuhan, China, during March 2020. The study cohort included 49 confirmed COVID-19 patients, 58 healthy controls, and 33 non-COVID lung infection controls. When applicable, positive COVID-19 patients were sampled twice: during the active disease and after recovery. Discriminant analysis of the obtained signals from the nanomaterial-based sensors achieved very good test discriminations between the different groups. The training and test set data exhibited respectively 94% and 76% accuracy in differentiating patients from controls as well as 90% and 95% accuracy in differentiating between patients with COVID-19 and patients with other lung infections. While further validation studies are needed, the results may serve as a base for technology that would lead to a reduction in the number of unneeded confirmatory tests and lower the burden on hospitals, while allowing individuals a screening solution that can be performed in PoC facilities. The proposed method can be considered as a platform that could be applied for any other disease infection with proper modifications to the artificial intelligence and would therefore be available to serve as a diagnostic tool in case of a new disease outbreak.Entities:
Keywords: COVID-19; SARS-CoV-2; biomarker; breath; diagnosis; sensor
Mesh:
Substances:
Year: 2020 PMID: 32808759 PMCID: PMC7457376 DOI: 10.1021/acsnano.0c05657
Source DB: PubMed Journal: ACS Nano ISSN: 1936-0851 Impact factor: 15.881
Figure 1(a) Example of breath collection with the developed hand-held breathalyzer system from a patient in Wuhan, China. (b) Representative response of a sensor to three different breath samples. The normalized response of sensor 7 of the breathalyzer system to three different samples: patient A, COVID-19, first sample while infected; patient A, second sample after determined as recovered; and a healthy control. The x-axis represents the cycle measurement; each unit is one cycle of the sensor. The infected sample had a positive change response, while the recovered and control showed negative charges.
Figure 2Patient enrollment and observational design.
Figure 3Diagnosis of COVID-19 patients based on cumulative breath sample response. Panels A, B, and C show data classification from cumulative sensor responses to breath samples as represented by the canonical variable of the discriminant analysis. Box plots of the first canonical score of the training set (70% of samples) and test set (30% of samples). The horizontal dashed line in the box plots represents the cutoff value of the model. True positive (TP), true negative (TN), false positive (FP), false negative (FN). Panel A: COVID-19 patients (n = 41) and healthy controls (n = 57). Panel B: COVID-19 patients (n = 41) and other lung infection/condition controls (n = 32). Panel C: COVID-19 patients at first (n = 41) and second sampling (n = 21). P-values are for the comparisons of the training set for each of two binary classifications. The horizontal line in the boxes represents the median, the cross represents the mean, and the bottom and top of the boxes represent the 25th and 75th percentiles, respectively. Bars represent the upper 90th and lower 10th percentile, and the square dots are outliers. All P-values were adjusted for multiple comparisons using the Tukey–Kramer method. For panel C, the P-value is also adjusted for paired analysis. Panel D shows ROC curves for the cumulative breath-sensor response in patients with COVID-19 (Co) infection compared with controls (C) (black); in COVID-19 infection compared with other lung infection/conditions (LI), (red); and in COVID-19 infection first sample (Co1) compared to COVID-19 infection second sample (Co2) (blue). †P < 0.0001.
Breath Test Outcomes for the Study Population
| training set | testing set | |||||
|---|---|---|---|---|---|---|
| statistics | COVID-19 | COVID-19 | COVID-19 1st | COVID-19 | COVID-19 | COVID-19 1st |
| accuracy (%) | 94 | 90 | 90 | 76 | 95 | 88 |
| sensitivity (%) | 100 | 90 | 100 | 100 | 100 | 83 |
| specificity (%) | 90 | 91 | 69 | 61 | 90 | 100 |
| PPV (%) | 88 | 93 | 86 | 61 | 92 | 100 |
| NPV (%) | 100 | 87 | 100 | 100 | 100 | 71 |
| TP (cases) | 30 | 26 | 32 | 11 | 12 | 10 |
| TN (cases) | 35 | 20 | 11 | 11 | 9 | 5 |
| FP (cases) | 4 | 2 | 5 | 7 | 1 | 0 |
| FN (cases) | 0 | 3 | 0 | 0 | 0 | 2 |
Classification based on QDA.
Classification based on LDA.
Classification based on the ROC cutoff.
Figure 4Evaluation of the confounding factors for COVID-19 patients and control group based on cumulative breath sample response. Panels A, B, C, and D show the data classification of cumulative sensor responses to breath samples as represented by the canonical variable of the discriminant analysis, based on the main DFA model of patients vs controls. Box plots of the first canonical score of the data set of the COVID-19 and control data. Panel A compares male (34 participants) and female (35 participants), P = 0.90. Panel B compares age above 60 years (21 participants) and below 60 years (48 participants), P = 0.11. Panel C compares smokers (21 participants) and nonsmokers (48 participants), P = 0.66. Panel D compares coexisting conditions (18 participants) and noncoexisting conditions (51 participants), P = 0.07. P values are for the comparisons of each set of the confounding factors. None of the P values differ significantly between the confounders, suggesting no effect on the models. The horizontal line in the boxes represents the median, the cross represents the mean, and the bottom and top of the boxes represent the 25th and 75th percentiles, respectively. The I bars represent the upper 90th and the lower 10th percentiles, and the square dots are outliers.