| Literature DB >> 34162883 |
Lara Orlandic1, Tomas Teijeiro2, David Atienza2.
Abstract
Cough audio signal classification has been successfully used to diagnose a variety of respiratory conditions, and there has been significant interest in leveraging Machine Learning (ML) to provide widespread COVID-19 screening. The COUGHVID dataset provides over 25,000 crowdsourced cough recordings representing a wide range of participant ages, genders, geographic locations, and COVID-19 statuses. First, we contribute our open-sourced cough detection algorithm to the research community to assist in data robustness assessment. Second, four experienced physicians labeled more than 2,800 recordings to diagnose medical abnormalities present in the coughs, thereby contributing one of the largest expert-labeled cough datasets in existence that can be used for a plethora of cough audio classification tasks. Finally, we ensured that coughs labeled as symptomatic and COVID-19 originate from countries with high infection rates. As a result, the COUGHVID dataset contributes a wealth of cough recordings for training ML models to address the world's most urgent health crises.Entities:
Mesh:
Year: 2021 PMID: 34162883 PMCID: PMC8222356 DOI: 10.1038/s41597-021-00937-4
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Metadata variables, as they appear in the JSON files.
| Name | Mandatory | Range of possible values | Description |
|---|---|---|---|
| datetime | Yes | UTC date and time in ISO 8601 format | Timestamp of the received recording. |
| cough_detected | Yes | Floating point in the interval [0, 1] | Probability that the recording contains cough sounds, according to the automatic detection algorithm described in the Methods section. |
| SNR | Yes | Floating point in the interval [0, ∞) | An estimation of the signal-to-noise ratio of the cough recording. |
| latitude | No | Floating point value | Self-reported latitude geolocation coordinate with reduced precision. |
| longitude | No | Floating point value | Self-reported longitude geolocation coordinate with reduced precision. |
| age | No | Integer value | Self-reported age value. |
| gender | No | {female, male, other} | Self-reported gender. |
| respiratory_condition | No | {True, False} | The patient has other respiratory conditions (self-reported). |
| fever_muscle_pain | No | {True, False} | The patient has fever or muscle pain (self-reported). |
| status | No | {COVID, symptomatic, healthy} | The patient self-reports that has been diagnosed with COVID-19 (COVID), that has symptoms but no diagnosis (symptomatic), or that is healthy (healthy). |
| expert_labels_{1,2,3} | No | JSON dictionary with the manual labels from expert 1, 2 or 3 | The expert annotation variables are described in Table |
Cough Classifier Features.
| Feature Class | Domain | Count | Computation Parameters |
|---|---|---|---|
| MFCC[ | Mel Frequency | 26 | Mean and St. Dev of 13 MFCCs over time |
| EEPD[ | Time | 19 | BPF intervals in 50-1000 Hz; See Chatrazzin |
| Power Spectral Density[ | Frequency | 8 | Frequency bands (Hz): 0-200, 300-425, 500-650, 950-1150, 1400-1800, 2300–2400, 2850–2950, 3800–3900 |
| RMS Power[ | Time | 1 | None |
| Zero Crossing Rate[ | Time | 1 | None |
| Crest Factor[ | Time | 1 | None |
| Recording Length | Time | 1 | None |
| Dominant Frequency[ | Frequency | 1 | None |
| Spectral Centroid[ | Frequency | 1 | None |
| Spectral Rolloff [ | Frequency | 1 | None |
| Spectral Spread[ | Frequency | 1 | None |
| Spectral Skewness[ | Frequency | 1 | None |
| Spectral Kurtosis[ | Frequency | 1 | None |
| Spectral Bandwidth[ | Frequency | 1 | None |
| Spectral Flatness[ | Frequency | 1 | None |
| Spectral St. Dev[ | Frequency | 1 | None |
| Spectral Slope[ | Frequency | 1 | None |
| Spectral Decrease[ | Frequency | 1 | None |
Cough Classifier Performance (cough_detected threshold = 0.8).
| Metric | CV Mean ± St. Dev | Final Model |
|---|---|---|
| Precision | 95.4 ± 7.1 | 95.5 |
| Sensitivity | 78.2 ± 10 | 80.8 |
| Specificity | 95.3 ± 8.4 | 95.5 |
| Balanced Accuracy | 86.7 ± 3.9 | 88.1 |
| AUC | 96.4 ± 3.3 | 96.5 |
Fig. 1Averaged receiver operating characteristic curve across the 10 cross-validation folds of the cough classifier.
Variables provided by the expert annotators.
| Name | Range of possible values | Description |
|---|---|---|
| quality | {good, ok, poor, no_cough} | Quality of the recorded cough sound. |
| cough_type | {wet, dry, unknown} | Type of cough. |
| dyspnea | {True, False} | Audible dyspnea. |
| wheezing | {True, False} | Audible wheezing. |
| stridor | {True, False} | Audible stridor. |
| choking | {True, False} | Audible choking. |
| congestion | {True, False} | Audible nasal congestion. |
| nothing | {True, False} | Nothing specific is audible. |
| diagnosis | {upper_infection, lower_infection, obstructive_disease, COVID-19, healthy_cough} | Impression of the expert about the condition of the patient. It can be an upper or lower respiratory tract infection, an obstructive disease (Asthma, COPD, etc), COVID-19, or a healthy cough. |
| severity | {pseudocough, mild, severe, unknown} | Impression of the expert about the severity of the cough. It can be a pseudocough from a healthy patient, a mild or severe cough from a sick patient, or unknown if the expert can’t tell. |
Fig. 2Cumulative COVID-19 cases in April and May 2020 per 1 million population, along with the GPS coordinates of the received recordings.
Inter-Expert Label Consistency.
| Item | Agreement[ | |
|---|---|---|
| quality | −0.06 | Poor |
| cough_type | 0.26 | Fair |
| dyspnea | −0.02 | Poor |
| wheezing | 0.06 | Slight |
| stridor | −0.01 | Poor |
| choking | −0.01 | Poor |
| congestion | 0.41 | Moderate |
| nothing | 0.13 | Slight |
| diagnosis | 0.07 | Slight |
| severity | 0.15 | Slight |
Fig. 3Histogram of estimated SNRs of every recording in the database with a cough_detected value greater than 0.8.
| Measurement(s) | Cough |
| Technology Type(s) | Microphone Device |
| Factor Type(s) | COVID-19 status • location • age • gender • respiratory condition |
| Sample Characteristic - Organism | Homo sapiens |