| Literature DB >> 17007636 |
Samantha J Barry1, Adrie D Dane, Alyn H Morice, Anthony D Walmsley.
Abstract
BACKGROUND: Cough recordings have been undertaken for many years but the analysis of cough frequency and the temporal relation to trigger factors have proven problematic. Because cough is episodic, data collection over many hours is required, along with real-time aural analysis which is equally time-consuming. A method has been developed for the automatic recognition and counting of coughs in sound recordings.Entities:
Year: 2006 PMID: 17007636 PMCID: PMC1601963 DOI: 10.1186/1745-9974-2-8
Source DB: PubMed Journal: Cough ISSN: 1745-9974
Figure 1Pattern Recognition Approach to cough/non-cough classification.
Symbols used and their settings.
| Symbol | Meaning | Value |
| Sampling Frequency | 11025 Hz | |
| Time in milliseconds | ||
| σsignal | Windowed standard deviation of signal | Calculated as a function of time |
| Δ | Background interval | 11026 points (1000 ms) |
| High (event detection) threshold | 10 (×σbackground) | |
| Low (event start and end) threshold | 2 (×σbackground) | |
| σbackground | Standard deviation of background | |
| Number of reference patterns | 150 (75 cough/75 non-cough) | |
| Number of mel bank-of-filters cepstral coefficients | 42 (14+14 1st derivatives +14 2nd derivatives) | |
| Number of LPC cepstral coefficients | 14 (no derivatives) | |
| Total number of cepstral coefficients ( | 56 | |
| Reduced number of features | 45 |
Settings are based on established values and preliminary experiments. Symbols only used locally are explained in the text.
Figure 2Sound detection. The top graph shows the original sound signal. In the bottom graph depicts σsignal and the two baseline threshold lines in which threshpeak = 10 and threshlimits = 1.5. Point 2(a) indicates the first standard deviation larger than threshpeak × σbackground. Points 2(b) and 2(c) are the points nearest to point 2(a) where σsignal is smaller than threshlimits × σbackground. The whole region between points 2(b) and 2(c) is a sound event. In the same way, the region between points 2(d) and 2(e) will be detected as a sound event.
Figure 3Graphical User Interface (GUI) for human listener.
Counted coughs
| A | B | C | |
| subject 1 | 8 | 6 | 8 |
| subject 2 | 21 | 22 | 25 |
| subject 3 | 5 | 6 | 9 |
| subject 4 | 26 | 25 | 31 |
| subject 5 | 14 | 30 | 28 |
| subject 6 | 9 | 13 | 9 |
| subject 7 | 8 | 8 | 15 |
| subject 8 | 20 | 29 | 27 |
| subject 9 | 28 | 53 | 50 |
| subject 10 | 98 | 150 | 140 |
| Mean | 23.7 | 34.2 | 34.2 |
Figure 4The Bland Altman plot showing the difference between the total number of coughs per subject as recorded by the experienced listener (A) compared to the HACC program (C).