| Literature DB >> 35746254 |
Junki Kawaguchi1, Mitsuharu Matsumoto1.
Abstract
In this study, we propose a method to reduce noise from speech obtained from a general microphone using the information of a throat microphone. A throat microphone records a sound by detecting the vibration of the skin surface near the throat directly. Therefore, throat microphones are less prone to noise than ordinary microphones. However, as the acoustic characteristics of the throat microphone differ from those of ordinary microphones, its sound quality degrades. To solve this problem, this study aims to improve the speech quality while suppressing the noise of a general microphone by using the information recorded by a throat microphone as reference information to extract the speech signal in general microphones. In this paper, the framework of the proposed method is formulated, and several experiments are conducted to evaluate the noise suppression and speech quality improvement effects of the proposed method.Entities:
Keywords: noise reduction; sensor fusion; throat microphone
Mesh:
Year: 2022 PMID: 35746254 PMCID: PMC9230528 DOI: 10.3390/s22124473
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Overview of the proposed method.
Experimental condition.
| Target Speech | Male Voice |
|---|---|
| Noise signal | Intersection noise, |
| Noise reduction threshold | −90 to −30 [dB] |
Intersection noise.
| Threshold (dB) |
|
|
|---|---|---|
| −90 | 0.704 | −3.347 |
| −80 | 2.632 | 0.008068 |
| −70 | 8.384 | 7.795 |
| −60 | 9.593 | 12.97 |
| −50 | 7.393 | 15.50 |
| −40 | 4.351 | 17.10 |
| −30 | 1.527 | 20.83 |
White noise (0 dB).
| Threshold (dB) |
|
|
|---|---|---|
| −90 | 1.665 | −9.767 |
| −80 | 3.199 | −7.361 |
| −70 | 5.007 | −4.756 |
| −60 | 6.838 | −1.369 |
| −50 | 6.646 | 2.083 |
| −40 | 4.236 | 5.033 |
| −30 | 1.522 | 8.821 |
White noise (−15 dB).
| Threshold (dB) |
|
|
|---|---|---|
| −90 | 7.510 | 2.233 |
| −80 | 9.579 | 4.639 |
| −70 | 10.66 | 7.244 |
| −60 | 9.764 | 10.63 |
| −50 | 7.534 | 14.08 |
| −40 | 4.415 | 17.03 |
| −30 | 1.540 | 20.82 |
SNR results.
| Noise Speech |
|
|---|---|
| Intersection noise | −4.158 |
| White noise (0 dB) | −22.96 |
| White noise (−15 dB) | −10.96 |
Figure 2Waveform (Intersection noise).
Figure 3Spectrogram (Intersection noise).
Figure 4Waveform (White noise (0 dB)).
Figure 5Spectrogram (White noise (0 dB)).
Figure 6Waveform (White noise (−15 dB)).
Figure 7Spectrogram (White noise (−15 dB)).