| Literature DB >> 34965907 |
Carlo Robotti1, Giovanni Costantini2, Giovanni Saggio3, Valerio Cesarini4, Anna Calastri5, Eugenia Maiorano5, Davide Piloni6, Tiziano Perrone7, Umberto Sabatini7, Virginia Valeria Ferretti8, Irene Cassaniti9, Fausto Baldanti10, Andrea Gravina11, Ahmed Sakib11, Elena Alessi12, Matteo Pascucci12, Daniele Casali4, Zakarya Zarezadeh4, Vincenzo Del Zoppo4, Antonio Pisani13, Marco Benazzo14.
Abstract
Many virological tests have been implemented during the Coronavirus Disease 2019 (COVID-19) pandemic for diagnostic purposes, but they appear unsuitable for screening purposes. Furthermore, current screening strategies are not accurate enough to effectively curb the spread of the disease. Therefore, the present study was conducted within a controlled clinical environment to determine eventual detectable variations in the voice of COVID-19 patients, recovered and healthy subjects, and also to determine whether machine learning-based voice assessment (MLVA) can accurately discriminate between them, thus potentially serving as a more effective mass-screening tool. Three different subpopulations were consecutively recruited: positive COVID-19 patients, recovered COVID-19 patients and healthy individuals as controls. Positive patients were recruited within 10 days from nasal swab positivity. Recovery from COVID-19 was established clinically, virologically and radiologically. Healthy individuals reported no COVID-19 symptoms and yielded negative results at serological testing. All study participants provided three trials for multiple vocal tasks (sustained vowel phonation, speech, cough). All recordings were initially divided into three different binary classifications with a feature selection, ranking and cross-validated RBF-SVM pipeline. This brough a mean accuracy of 90.24%, a mean sensitivity of 91.15%, a mean specificity of 89.13% and a mean AUC of 0.94 across all tasks and all comparisons, and outlined the sustained vowel as the most effective vocal task for COVID discrimination. Moreover, a three-way classification was carried out on an external test set comprised of 30 subjects, 10 per class, with a mean accuracy of 80% and an accuracy of 100% for the detection of positive subjects. Within this assessment, recovered individuals proved to be the most difficult class to identify, and all the misclassified subjects were declared positive; this might be related to mid and short-term vocal traces of COVID-19, even after the clinical resolution of the infection. In conclusion, MLVA may accurately discriminate between positive COVID-19 patients, recovered COVID-19 patients and healthy individuals. Further studies should test MLVA among larger populations and asymptomatic positive COVID-19 patients to validate this novel screening technology and test its potential application as a potentially more effective surveillance strategy for COVID-19.Entities:
Keywords: Accuracy; Cough; SARS-CoV-2; Screening test; Sensitivity; Surveillance
Year: 2021 PMID: 34965907 PMCID: PMC8616736 DOI: 10.1016/j.jvoice.2021.11.004
Source DB: PubMed Journal: J Voice ISSN: 0892-1997 Impact factor: 2.009
Inclusion and Exclusion Criteria for the Three Study Groups
| Inclusion Criteria | Group P | Group R | Group H | Exclusion Criteria | Group P | Group R | Group H |
|---|---|---|---|---|---|---|---|
| Age between 18 and 80 y | ■ | ■ | ■ | Drugs acting on CNS | ■ | ■ | ■ |
| European ethnicity | ■ | ■ | ■ | Head and neck cancer | ■ | ■ | ■ |
| Italian native speaker | ■ | ■ | ■ | Lung cancer | ■ | ■ | ■ |
| Positive NS (< 10 d) | ■ | ■ | NA | Chemoradiation therapy | ■ | ■ | ■ |
| Two consecutive negative NS | NA | ■ | NA | C-PAP therapy | ■ | ■ | ■ |
| LUS ≤ 3 | NA | ■ | NA | Tracheal intubation | ■ | ■ | ■ |
| Negative SS (> 20 d) | NA | NA | ■ | Tracheostomy | ■ | ■ | ■ |
Abbreviations: P, positive COVID-19 patients; R, recovered negative COVID-19 patients; H, healthy control subjects; NS, SARS-CoV-2 nasal swab for RNA detection; LUS, lung ultrasound score; SS, SARS-CoV-2 serum sample for IgM and IgG quantification; CNS, Central Nervous System; C-PAP, Continuous Positive Airway Pressure; NA, not Apre-cable.
Clinical and Demographic Characteristics of the Three Study Groups
| Variables | Group P( | Group R( | Group H( | ||||
|---|---|---|---|---|---|---|---|
| Global | P Versus H | P Versus R | R Versus H | ||||
| Age, median (IQR), years | 57 (39-67) | 59 (48-69) | 41 (29-54) | 0.215 | |||
| Gender | |||||||
| Males, | 40 (57%) | 45 (64%) | 37 (53%) | 0.402 | NC | NC | NC |
| Females, | 30 (43%) | 25 (36%) | 33 (47%) | ||||
| BMI, median (IQR), kg/m2 | 27.8 (26.1-31.2) | 26.5 (24.4-30.5) | 24.3 (22.4-28.6) | 0.458 | |||
| Smoking habits | |||||||
| Non-smokers, | 35 (50%) | 38 (54%) | 38 (54%) | 0.333 | 0.522 | ||
| Smokers, | 8 (11%) | 2 (3%) | 15 (21%) | ||||
| Ex-smokers, | 27 (39%) | 30 (43%) | 17 (24%) | ||||
| COVID-19 pneumoniadiagnosis, | 40 (57%) | 67 (96%) | - | - | - | - | |
| COVID-19 symptoms | |||||||
| Presence of symptoms, | 54 (77%) | 54 (77%) | - | > 0.90 | - | - | - |
| Number of symptoms,median (IQR) | 2 (1-4) | 2 (1-3) | - | 0.096 | - | - | - |
| Asthenia ( | 29 (41%) | 39 (56%) | - | 0.128 | - | - | - |
| Dyspnea on exertion ( | 29 (41%) | 31 (44%) | - | 0.864 | - | - | - |
| Cough ( | 34 (49%) | 8 (11%) | - | - | - | - | |
| Muscle pain ( | 10 (14%) | 25 (36%) | - | - | - | - | |
| Dysphonia ( | 23 (33%) | 5 (7%) | - | - | - | - | |
| Olfaction disorder ( | 13 (19%) | 6 (9%) | - | 0.137 | - | - | - |
| Taste disorder ( | 12 (17%) | 5 (7%) | - | 0.119 | - | - | - |
| Olfaction and tastedisorder ( | 13 (19%) | 6 (9%) | - | 0.137 | - | - | - |
| Dyspnea at rest ( | 15 (21%) | 2 (3%) | - | - | - | - | |
| Blocked nose ( | 11 (16%) | 2 (3%) | - | - | - | - | |
| Headache ( | 6 (9%) | 7 (10%) | - | > 0.90 | - | - | - |
| Fever ( | 7 (10%) | 0 (0%) | - | - | - | - | |
| Dysphagia ( | 1 (1%) | 5 (7%) | - | 0.209 | - | - | - |
| Chest pain ( | 2 (3%) | 3 (4%) | - | > 0.90 | - | - | - |
Data regarding COVID-19 pneumonia and COVID-19 symptoms were collected only for positive and recovered COVID-19 patients, therefore cells are left blank for healthy control subjects. Data about pneumonia for group P refer to ongoing COVID-19 pneumonia diagnosis at the time of enrollment, while for group R they refer to previously diagnosed and currently recovered COVID-19 pneumonia. Significant p values are reported in bold font.
Abbreviations: P, positive COVID-19 patients; R, recovered negative COVID-19 patients; H, healthy control subjects; IQR, interquartile range; NC, not calculated; BMI, body mass index; COVID-19, coronavirus disease 2019.
FIGURE 1ROC curves comparing MLVA performances for all tasks within the discrimination between positive COVID-19 patients (group P) and healthy individuals (group H).
Accuracy, Sensitivity, Specificity and Area Under the Curve (AUC) of Machine-Learning Based Voice Analysis for All Tasks and All Comparisons Between Groups
| Comparison | Vocal Task | Accuracy (%) | Sensitivity (%) | Specificity (%) | AUC (CI) | Cut-Off |
|---|---|---|---|---|---|---|
| Group P versus Group H | Vowel /a/ | 90.07 | 92.11 | 88.00 | 0.94 (0.90-0.98) | 0.93 |
| Sentence | 87.88 | 83.58 | 92.31 | 0.91 (0.86-0.96) | 0.85 | |
| Cough | 89.44 | 91.28 | 87.50 | 0.92 (0.90-0.94) | 0.91 | |
| Group P versus Group R | Vowel /a/ | 92.81 | 92.86 | 91.43 | 0.97 (0.95-1.00) | 0.94 |
| Sentence | 91.18 | 91.04 | 91.30 | 0.96 (0.92-1.00) | 0.94 | |
| Cough | 91.50 | 93.27 | 89.58 | 0.94 (0.92-0.96) | 0.92 | |
| Group R versus Group H | Vowel /a/ | 89.21 | 92.86 | 85.51 | 0.92 (0.87-0.97) | 0.93 |
| Sentence | 89.55 | 92.75 | 86.15 | 0.96 (0.92-0.99) | 0.93 | |
| Cough | 90.49 | 90.63 | 90.36 | 0.92 (0.90-0.94) | 0.91 |
Abbreviations: P, positive COVID-19 patients; R, recovered negative COVID-19 patients; H, healthy control subjects; CI, 95% confidence interval.
FIGURE 2Discrimination between positive COVID-19 patients and healthy individuals based on the first 20 top ranking features of the vowel task. The red line of this radar plot corresponds to positive COVID-19 patients (group P), while the blue line corresponds to healthy individuals (group H). Each radius represents a distinct audio feature. Each point on the red line represents the feature's mean value for group P, normalized to its mean value for group H. Out of the original 50 top-ranking features, only the first 20 were reported for convenient viewing reasons. The list of all 20 top-ranking features is depicted in Table S2 (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article).
Confusion Matrices for Each Sub-Classifier Over the External Test Set, Along With Binary Accuracy, Sensitivity and Specificity Calculated on the Two Respective Classes for Each Comparison
| Real Class | P Versus H | P Versus R | R Versus H | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Vowel /a/ | Sentence | Cough | Vowel /a/ | Sentence | Cough | Vowel /a/ | Sentence | Cough | ||
| 1 | H | H | H | H | P | P | P | H | H | R |
| 2 | H | H | H | H | R | P | P | H | H | R |
| 3 | H | P | H | H | P | P | P | H | H | R |
| 4 | H | H | H | H | P | P | P | H | H | H |
| 5 | H | H | H | H | P | P | R | H | H | R |
| 6 | H | H | H | H | P | P | P | H | H | R |
| 7 | H | P | H | P | R | P | P | H | H | R |
| 8 | H | P | H | P | P | P | P | H | H | H |
| 9 | H | H | H | H | P | P | P | H | H | R |
| 10 | H | H | H | H | P | P | P | H | H | H |
| 11 | P | P | P | P | P | P | P | R | R | R |
| 12 | P | P | P | P | P | P | P | R | R | R |
| 13 | P | P | P | P | P | P | P | R | R | R |
| 14 | P | P | H | P | P | P | P | H | R | R |
| 15 | P | P | P | P | P | R | P | R | R | R |
| 16 | P | H | P | P | P | P | P | R | R | R |
| 17 | P | P | P | P | P | P | P | R | R | R |
| 18 | P | H | P | P | P | R | P | R | R | R |
| 19 | P | P | P | P | P | P | P | R | R | R |
| 20 | P | P | P | P | P | P | P | H | R | R |
| 21 | R | H | H | H | R | R | R | R | R | R |
| 22 | R | P | P | H | R | R | R | R | R | R |
| 23 | R | H | H | H | R | R | P | R | R | R |
| 24 | R | P | P | H | R | P | P | R | R | R |
| 25 | R | P | P | H | R | P | P | R | R | R |
| 26 | R | P | H | H | R | R | P | R | R | R |
| 27 | R | P | H | H | R | P | P | R | R | P |
| 28 | R | H | P | H | R | R | P | R | P | R |
| 29 | R | P | P | H | R | P | P | R | R | R |
| 30 | R | H | H | H | R | R | P | R | R | R |
| Accuracy (%) | 75 | 95 | 90 | 100 | 80 | 60 | 100 | 95 | 60 | |
| Sensitivity (%) | 80 | 90 | 100 | 100 | 100 | 100 | 100 | 90 | 90 | |
| Specificity (%) | 70 | 100 | 80 | 100 | 60 | 20 | 100 | 100 | 30 | |
Abbreviations: #, number of test subject; P, positive COVID-19 patients; R, recovered negative COVID-19 patients; H, healthy control subjects.
Final Confusion Matrix for the Three Classifiers on the External Test Set Along With Mean Accuracy and Per-Class Accuracies
| # | Real Class | Binary Classifiers Output: | Final | Error | ||
|---|---|---|---|---|---|---|
| P Versus H | P Versus R | R Versus H | ||||
| 1 | H | H | H | P | H | |
| 2 | H | H | H | P | H | |
| 3 | H | H | H | P | H | |
| 4 | H | H | H | P | H | |
| 5 | H | H | H | P | H | |
| 6 | H | H | H | P | H | |
| 7 | H | P | H | P | P | yes |
| 8 | H | P | H | P | P | yes |
| 9 | H | H | H | P | H | |
| 10 | H | H | H | P | H | |
| 11 | P | P | R | P | P | |
| 12 | P | P | R | P | P | |
| 13 | P | P | R | P | P | |
| 14 | P | P | R | P | P | |
| 15 | P | P | R | P | P | |
| 16 | P | P | R | P | P | |
| 17 | P | P | R | P | P | |
| 18 | P | P | R | P | P | |
| 19 | P | P | R | P | P | |
| 20 | P | P | R | P | P | |
| 21 | R | H | R | R | R | |
| 22 | R | P | R | R | R | |
| 23 | R | H | R | R | R | |
| 24 | R | P | R | P | P | yes |
| 25 | R | P | R | P | P | yes |
| 26 | R | P | R | R | R | |
| 27 | R | H | R | P | uncertain | uncertain |
| 28 | R | H | R | R | R | |
| 29 | R | P | R | P | P | yes |
| 30 | R | H | R | R | R | |
| Accuracy (%) | 80 | |||||
| H accuracy (%) | 80 | |||||
| P accuracy (%) | 100 | |||||
| R accuracy (%) | 60 | |||||
Abbreviations: #, number of test subject; P, positive COVID-19 patients; R, recovered negative COVID-19 patients; H, healthy control subjects; Final, final prediction obtained through majority voting of the three classifiers; Error, whether a mis-classification has occurred.
Final 3 × 4 Confusion Matrix
| True Class | Classified as | |||
|---|---|---|---|---|
| H (%) | P (%) | R (%) | Uncertain (%) | |
| H (%) | 80 | 20 | 0 | 0 |
| P (%) | 0 | 100 | 0 | 0 |
| R (%) | 0 | 30 | 60 | 10 |
Abbreviations: P, positive COVID-19 patients; R, recovered negative COVID-19 patients; H, healthy control subjects.