| Literature DB >> 35835648 |
Zhao Ren1, Yi Chang2, Katrin D Bartl-Pokorny3, Florian B Pokorny4, Björn W Schuller5.
Abstract
OBJECTIVES: The coronavirus disease 2019 (COVID-19) has caused a crisis worldwide. Amounts of efforts have been made to prevent and control COVID-19's transmission, from early screenings to vaccinations and treatments. Recently, due to the spring up of many automatic disease recognition applications based on machine listening techniques, it would be fast and cheap to detect COVID-19 from recordings of cough, a key symptom of COVID-19. To date, knowledge of the acoustic characteristics of COVID-19 cough sounds is limited but would be essential for structuring effective and robust machine learning models. The present study aims to explore acoustic features for distinguishing COVID-19 positive individuals from COVID-19 negative ones based on their cough sounds.Entities:
Keywords: Acoustics; Automatic disease detection; COVID-19; Computational paralinguistics; Cough
Year: 2022 PMID: 35835648 PMCID: PMC9197794 DOI: 10.1016/j.jvoice.2022.06.011
Source DB: PubMed Journal: J Voice ISSN: 0892-1997 Impact factor: 2.300
Total and Gender-Specific Distribution of Number of Cough Samples Across COVID-19 Status and Symptom Conditions.
| Status | Symptoms | Total | Gender (f/m) | |
|---|---|---|---|---|
| neg | neg– | no | 996 | 293/703 |
| neg+ | respiratory only | 124 | 48/76 | |
| muscle pain/fever only | 57 | 20/37 | ||
| both symptoms | 24 | 8/16 | ||
| Σ | 1 201 | 369/832 | ||
| pos | pos– | no | 111 | 36/75 |
| pos+ | respiratory only | 40 | 13/27 | |
| muscle pain/fever only | 27 | 12/15 | ||
| both symptoms | 32 | 14/18 | ||
| Σ | 210 | 75/135 | ||
neg, COVID-19 negative; pos, COVID-19 positive; f, female; m, male; +, symptomatic; –, asymptomatic.
FIGURE 1The age distribution of the 1,411 cough samples of the dataset for COVID-19 neg(ative) or (pos)itive. Mus., muscle; fev., fever.
Categorization of the 65 Low-Level Descriptors (LLDs) of the ComParE Feature Set and Specification of Involvement (√) or Non-Involvement (×) in a Top Feature of the Respective Differentiation Task (Task 1: pos vs neg, Task 2: pos+ vs neg+, and Task 3: pos– vs neg–).
| Group | Energy-related LLDs (4) | Task 1: | Task 2: | Task 3: |
|---|---|---|---|---|
| Prosodic | Auditory spectrum sum (loudness) | √ | √ | √ |
| Prosodic | RASTA-filtered auditory spectrum sum | √ | √ | × |
| Prosodic | RMS energy | √ | √ | √ |
| Prosodic | zero-crossing rate | × | √ | × |
DDP, difference of differences of periods; HNR, harmonics-to-noise ratio; MFCC, Mel-frequency cepstral coefficient; neg, COVID-19 negative; pos, COVID-19 positive; RASTA, relative spectral transform; RMS, root mean square; +, symptomatic; –, asymptomatic.
FIGURE 2Comparison between part A: COVID-19 positive (pos) and COVID-19 negative (neg) participants (Task 1), part B: symptomatic COVID-19 positive (pos+) and symptomatic COVID-19 negative (neg+) participants (Task 2), and part C: asymptomatic COVID-19 positive (pos–) and asymptomatic COVID-19 negative (neg–) participants (Task 3) by means of the probability density estimate (PDE) of the top one feature of the respective differentiation task. MFCC, Mel-frequency cepstral coefficient; RMS, root mean square; *, real measurement unit does not exist as feature values refer to the amplitude of the digital audio signal.
Joint Top Features Between the pos+ vs neg+ (Symptomatic COVID-19 Positive vs Symptomatic COVID-19 Negative) and the pos– vs neg– (Asymptomatic COVID-19 Positive vs Asymptomatic COVID-19 Negative) Differentiation Tasks Listed According to Their Mean Ranks Rounded to Integers.
| Mean rank | Feature |
|---|---|
| 17 | Flatness (Δ spectral energy 250–650 Hz) |
| 19 | Flatness (spectral energy 250–650 Hz) |
| 27 | Flatness (RMS energy) |
| 33 | Flatness (spectral flux) |
| 149 | Mean inter-peak distance (RMS energy) |
| 194 | Quartile 3 (HNR) |
| 195 | IQR 1–3 (HNR) |
| 196 | IQR 2–3 (HNR) |
| 202 | Mean inter-peak distance (loudness) |
| 291 | Mean inter-peak distance (spectral flux) |
| 410 | Skewness (Δ RASTA-filtered auditory spectral band 12) |
| 635 | Mean value of peaks (RMS energy) |
| 725 | Mean inter-peak distance (spectral harmonicity) |
| 786 | Mean value of peaks (loudness) |
HNR, harmonics-to-noise ratio; IQR, interquartile range; RASTA, relative spectral transform; RMS, root mean square; Δ= first-order derivative.
Classification Performance in Terms of Unweighted Average Recall (UAR) and Area Under the Receiver Operating Characteristic Curve (AUC) for the Three Tasks.
| Task | (1) pos vs neg | (2) pos+ vs neg+ | (3) pos– vs neg– | ||||
|---|---|---|---|---|---|---|---|
| Samples (#) | 210/1,201 | 99/205 | 111/996 | ||||
| Models | UAR | AUC | UAR | AUC | UAR | AUC | |
| Linear | LASSO | 0.586 | 0.625 | 0.573 | 0.547 | 0.536 | 0.549 |
| Ridge | 0.558 | 0.594 | |||||
| ElasticNet | 0.615 | 0.650 | 0.598 | 0.596 | |||
| SVM | 0.610 | 0.642 | 0.601 | 0.609 | 0.563 | 0.617 | |
| Non-linear | Decision Tree | 0.521 | 0.521 | 0.538 | 0.538 | 0.501 | 0.501 |
| Random Forest | 0.500 | 0.651 | 0.544 | 0.606 | 0.500 | 0.600 | |
| MLP | 0.558 | 0.600 | 0.593 | 0.606 | 0.505 | 0.570 | |
neg, COVID-19 negative; pos, COVID-19 positive; +, symptomatic; –, asymptomatic.
FIGURE 3Confusion matrices for the three classification tasks complementary to the best results given in Table 4. Part A: (Task 1) pos vs neg, part B: (Task 2) pos+ vs neg+, part C: (Task 3) pos– vs neg–. neg, COVID-19 negative; pos, COVID-19 positive; UAR, unweighted average recall; +, symptomatic; –, asymptomatic.
Gender-Wise and Age Group-Wise Classification Performance in Terms of Unweighted Average Recall (UAR) and Area Under the Receiver Operating Characteristic Curve (AUC) for the Three Tasks. The first two tasks (pos vs neg and pos+ vs neg+) are achieved by the Ridge models, and the third one (pos– vs neg–) is achieved by the ElasticNet models. neg, COVID-19 negative; pos, COVID-19 positive; y, years; +, symptomatic; –, asymptomatic.
| Task | Female | Male | ||
|---|---|---|---|---|
| UAR | AUC | UAR | AUC | |
| (1) pos vs neg | 0.602 | 0.620 | 0.601 | 0.636 |
| (2) pos+ vs neg+ | 0.573 | 0.563 | 0.679 | 0.676 |
| (3) pos– vs neg– | 0.517 | 0.508 | 0.604 | 0.584 |
FIGURE 4Feature ranking according to the absolute value of Ridge feature weights for part A: (Task 1) pos vs neg and part B: (Task 2) pos+ vs neg+, as well as of ElasticNet feature weights for part C: (Task 3) pos– vs neg–. Green bars indicate top features according to the effect size in the non-parametric group difference test. A different x-axis scaling is used for (Task 3) as the ElasticNet model only builds upon 250 non-zero feature coefficients.