| Literature DB >> 24284764 |
Sheng Li1, Ying Tian, Guohua Lu, Yang Zhang, Hao Lv, Xiao Yu, Huijun Xue, Hua Zhang, Jianqi Wang, Xijing Jing.
Abstract
High frequency millimeter-wave (MMW) radar-like sensors enable the detection of speech signals. This novel non-acoustic speech detection method has some special advantages not offered by traditional microphones, such as preventing strong-acoustic interference, high directional sensitivity with penetration, and long detection distance. A 94-GHz MMW radar sensor was employed in this study to test its speech acquisition ability. A 34-GHz zero intermediate frequency radar, a 34-GHz superheterodyne radar, and a microphone were also used for comparison purposes. A short-time phase-spectrum-compensation algorithm was used to enhance the detected speech. The results reveal that the 94-GHz radar sensor showed the highest sensitivity and obtained the highest speech quality subjective measurement score. This result suggests that the MMW radar sensor has better performance than a traditional microphone in terms of speech detection for detection distances longer than 1 m. As a substitute for the traditional speech acquisition method, this novel speech acquisition method demonstrates a large potential for many speech related applications.Entities:
Mesh:
Year: 2013 PMID: 24284764 PMCID: PMC3871134 DOI: 10.3390/s131114248
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1.Schematic diagram of the three MMW radar speech acquisition systems. (a) The 94-GHz MMW radar system; (b) The 34-GHz superheterodyne radar system [ 1]; (c) The 34-GHz simple radar system [ 6].
Figure 2.Diagram of experimental system for speech signal detection. The experimental system includes a 3 mm radar system, a 8 mm simple radar system, a 8 mm superheterodyne radar system, and a microphone.
Figure 3.The time-domain waveform and the spectrogram of a typical speech material. The four figures of each column are corresponding to microphone speeches (first column), 34-GHz simple radar speeches (second column), 34-GHz superheterodyne radar speeches (third column), and the 94-GHz quadrature superheterodyne radar speeches (fourth column). The first two rows are corresponding to the original speeches, the other rows are corresponding to the enhanced speeches.
Speech perceptual evaluation (MOS score) for speeches before and after enhancement.
| 94-GHz | 2.56 | 2.72 |
| 34-GHz-S | 2.32 | 2.53 |
| 34-GHz | 2.11 | 2.36 |
| Microphone | 1.52 | 2.07 |