| Literature DB >> 34007956 |
Elad Maor1,2, Nir Tsur3,2, Galia Barkai3,4,2, Ido Meister5, Shmuel Makmel5, Eli Friedman5, Daniel Aronovich6, Dana Mevorach6, Amir Lerman7, Eyal Zimlichman1,2, Gideon Bachar2.
Abstract
OBJECTIVE: To investigate the association of voice analysis with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. PATIENTS AND METHODS: A vocal biomarker, a unitless scalar with a value between 0 and 1, was developed based on 434 voice samples. The biomarker training was followed by a prospective, multicenter, observational study. All subjects were tested for SARS-CoV-2, had their voice recorded to a smartphone application, and gave their informed consent to participate in the study. The association of SARS-CoV-2 infection with the vocal biomarker was evaluated.Entities:
Keywords: AUC, area under the receiver operating curve; COVID-19, coronavirus disease 2019; PCR, polymerase chain reaction; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2
Year: 2021 PMID: 34007956 PMCID: PMC8120447 DOI: 10.1016/j.mayocpiqo.2021.05.007
Source DB: PubMed Journal: Mayo Clin Proc Innov Qual Outcomes ISSN: 2542-4548
Figure 1Feature extraction process using transfer learning and adaptation methods. The process produces a 512-features vector for each x seconds of a given recording. The process is agnostic to the chosen recording length, due to a global average pooling layer at the end of the fine-tuned Vggish network.
Baseline characteristics of the study populationa,b
| Characteristic | All (N=80) | COVID-19 (-) | COVID-19 (+) | |
|---|---|---|---|---|
| Age, years (all data) | 29 (23,36)] | 29 (24, 35) | 28 (22, 37) | .415 |
| Male | 54 (68) | 25 (63) | 29 (73) | .340 |
| Smokers | 8 (10) | 2 (5) | 6 (15) | <0.001 |
| Obesity (BMI > 30 kg/m2) | 5 (6) | 1 (3) | 4 (10) | .358 |
| Asthma | 3 (4) | 1 (3) | 2 (5) | .500 |
| Neurological disease | 2 (3) | 1 (3) | 1 (3) | .747 |
| Hypertension | 3 (4) | 1 (3) | 2 (5) | .510 |
| Diabetes mellitus | 2 (3) | 0 | 2 (5) | .253 |
| GERD | 1 (1) | 0 | 1 (3) | .506 |
| CKD | 1 (1) | 0 | 1 (3) | .506 |
| Treated respiratory condition | 6 (8) | 1 (3%) | 5 (13) | .09 |
| Vocal biomarker (all date) | 0.14 [0.1-0.28] | 0.11 [0.06-0.17] | 0.19 [0.12-0.3] | .001 |
| Days between PCR and voice recording (all date) | 10 [4-20.3] | 19 [9.8-32] | 6 [2,10] | .02 |
BMI, body mass index; CKD, chronic kidney disease; COVID-19, coronavirus disease 2019; GERD, gastroesophageal reflux disease, PCR, polymerase chain reaction.
Values are n (%) unless otherwise stated.
Defined as treated with pulmonary medications.
Self-reported symptoms at the time of the recordinga,b
| All | COVID-19 (-) | COVID-19 (+) | ||
|---|---|---|---|---|
| Fever | 4 (5) | 0 | 4 (10) | <.001 |
| Cough | 20 (25) | 2 (5) | 18 (45) | <.001 |
| Shortness of breath | 8 (10) | 1 (3) | 7 (18) | .025 |
| Runny nose | 17 (21) | 5 (13) | 12 (30) | .05 |
| Loss of smell | 14 (18) | 0 | 14 (35) | <.001 |
| At least one symptom | 33 (41) | 6 (15) | 27 (68) | <.001 |
COVID-19, coronavirus disease 2019.
Values are n (%).
Figure 2Scatter plot of vocal biomarker values by the two study groups can be seen on the Y-axis.
Figure 3Receiver operating characteristic (ROC) curves for the vocal biomarker and symptom-based classifier are shown. Study population (n=80) comparison between the average area under the curve (AUC) for coronavirus disease 2019 detection (positive/negative) of the combined biomarker (red, AUC = 0.85), the vocal biomarker (green, AUC = 0.72), and the symptoms-based classifier (blue, AUC = 0.77). The blue represents the symptoms classification result.
Vocal biomarker classification compared to PCR resultsa,b
| Vocal biomarker | PCR test result | Total | |
|---|---|---|---|
| Positive | Negative | ||
| Positive | 34 | 19 | 53 |
| Negative | 6 | 21 | 27 |
| Total | 40 | 40 | 80 |
PCR, polymerase chain reaction.
The classification results of the Vocalis coronavirus disease 2019 biomarker in comparison to the PCR test results in the study population (n=80) with a threshold of 0.115. The sensitivity of the biomarker was 85% (95% CI, 70.2% to 94.3%), the specificity was 52.5% (95% CI, 36.2% to 68.5%), and the positive predictive value was 64.2% (95% CI, 55.8% to 71.8%) and negative predictive value was 77.8% (95% CI, 61.3% to 88.6%).
Figure 4Effect of varying the voice window length on the performance of the coronavirus disease 2019 classifier. Between 3 seconds and up to 20 seconds, the grey bars are the resulted average area under the curve (AUC) on the training set cross-validation process. The blue bars are the resulted AUC on the study population (n=80).