Tao Long (1), Israel Cohen (2), Baruch Berdugo (3), Yan Yang (4), Jingdong Chen (5)

(1) Center of Intelligent Acoustics and Immersive Communications, Northwestern Polytechnical University, 127 Youyi West Road, Xi'an 710072, Shaanxi, China. longtao@nwpu.edu.cn
(2) Department of Electrical Engineering, Technion-Israel Institute of Technology, Technion City, Haifa 32000, Israel. icohen@ee.technion.ac.il
(3) Department of Electrical Engineering, Technion-Israel Institute of Technology, Technion City, Haifa 32000, Israel. bbaruch@technion.ac.il
(4) Center of Intelligent Acoustics and Immersive Communications, Northwestern Polytechnical University, 127 Youyi West Road, Xi'an 710072, Shaanxi, China. y.yang@nwpu.edu.cn
(5) Center of Intelligent Acoustics and Immersive Communications, Northwestern Polytechnical University, 127 Youyi West Road, Xi'an 710072, Shaanxi, China. jingdongchen@ieee.org
Abstract
Beamformers have been widely used to enhance signals from a desired direction and suppress noise and interfering signals from other directions. Constant beamwidth beamformers enable a fixed beamwidth over a wide range of frequencies. Most of the existing approaches to design constant beamwidth beamformers are based on optimization algorithms with high computational complexity and are often sensitive to microphone mismatches. Other existing methods are based on adjusting the number of sensors according to the frequency, which simplify the design, but cannot control the sidelobe level. Here, we propose a window-based technique to attain the beamwidth constancy, in which different shapes of standard window functions are applied for different frequency bins as the real weighting coefficients of microphones. Thereby, not only do we keep the beamwidth constant, but we also control the sidelobe level. Simulation results show the advantages of our method compared with existing methods, including lower sidelobe level, higher directivity factor, and higher white noise gain.
1. Introduction

Beamformers, or spatial filters, enhance signals from a desired direction and suppress noise and interfering signals from other directions. Deterministic and adaptive beamforming techniques have been widely studied and used in radar, sonar, seismology, tomography, communication and many other areas [1,2,3,4,5]. A variety of beamforming techniques are available, including data-independent, statistically optimal and adaptive approaches. Traditional beamforming techniques suffer from a frequency varying beamwidth, which restricts their deployment in broadband applications (e.g., speech communication). The basic approach to solving this problem is to design a constant beamwidth beamformer, where the beampattern maintains a fixed beamwidth over a wide frequency band. Many existing methods have been investigated to obtain constant beamwidths [6,7,8,9,10,11,12]. These methods are mainly based on optimization algorithms with high computational design complexity, and they are often sensitive to microphone mismatches.

Recently, Rosen et al. [13] proposed finite impulse response (FIR) based beamformers [14] with constant beamwidths. The main idea behind this approach is to change the effective array aperture in each frequency bin to keep the beamwidth constant over the desired frequency band. This method is characterized by low computational complexity, but cannot control the sidelobe level. If we regard the coefficients of the FIR filter as a window function, then the beamformer is analogous to the discrete Fourier transform of the window. It can be shown that the FIR filter proposed in [13] is a kind of modified rectangular window, which inspires us to use other kinds of windows. As a result, some standard window functions whose shapes are controlled by a single parameter can be directly used to obtain different beamwidths and sidelobe levels, e.g., a discrete prolate spheroidal sequences (DPSS) window can maximize the relative beamforming power that is concentrated in an angular region, and a Chebyshev window minimizes the beamwidth for a given sidelobe level.

In this paper, we propose window-based beamformers with constant beamwidths. The main idea is to apply different shapes of windows for different frequency bins as real weighting coefficients of microphones, so that the beamwidth is kept constant by controlling the window parameters. The rest of this paper is organized as follows. In Section 2, we introduce the signal model and illustrate the frequency varying beamwidth problem for traditional uniform linear arrays. Section 3 proposes the window-based beamformer with a constant beamwidth, using modified rectangular, DPSS, Kaiser and Chebyshev windows. Section 4 evaluates the performances of different window-based methods using the white noise gain and directivity factor. Finally, some conclusions are drawn in Section 5.
2. Signal Model and Problem Formulation
2.1. Signal Model
Consider a uniform linear array (ULA) consisting of $M$ omnidirectional microphones, with an interelement spacing equal to $\delta$. Assume that there are an odd number ($M = 2N + 1$) of microphones, as shown in Figure 1, whose locations are:

$$x_m = m\delta, \quad m = -N, \ldots, N, \tag{1}$$

where $m$ denotes the microphone index, and $x_m$ denotes the location of the $m$th microphone.
Figure 1
Beamforming with a uniform linear array consisting of $M$ microphones with spacing $\delta$, where $m$ denotes the microphone index, $x$ denotes the location of the microphone, and $\theta$ is the direction of arrival angle measured with respect to the broadside of the linear array.
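We consider the farfield case where a source of interest radiates an acoustic wave that propagates in an anechoic environment at the speed of sound $c$ (in m/s). The direction of the source signal is parameterized by the angle $\theta_s$ measured with respect to the broadside of the linear array. In the frequency domain, the signal model at the frequency index $f$ can be written as:

$$Y_m(f) = X(f)\, e^{j 2\pi f x_m \sin\theta_s / c} + V_m(f), \quad m = -N, \ldots, N, \tag{2}$$

where $Y_m(f)$, $X(f)$, and $V_m(f)$ are the frequency-domain representations of the received noisy signal, the desired source signal and the additive noise signal at the $m$th microphone, respectively, $f$ is the temporal frequency, and $j$ is the imaginary unit with $j^2 = -1$. In a vector form, we can rearrange the signal model in Equation (2) as

$$\mathbf{y}(f) = \mathbf{d}(f, \theta_s)\, X(f) + \mathbf{v}(f), \tag{3}$$

where

$$\mathbf{y}(f) = \left[\, Y_{-N}(f) \;\; \cdots \;\; Y_0(f) \;\; \cdots \;\; Y_N(f) \,\right]^T, \tag{4}$$

the superscript $^T$ is the transpose operator, and

$$\mathbf{d}(f, \theta_s) = \left[\, e^{-j 2\pi f N\delta \sin\theta_s / c} \;\; \cdots \;\; 1 \;\; \cdots \;\; e^{j 2\pi f N\delta \sin\theta_s / c} \,\right]^T \tag{5}$$

is the signal propagation vector corresponding to $\theta_s$, which is in the same form as the steering vector.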
2.2. Beamformer
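As shown in Figure 1, the beamformer estimates the desired signal by applying a spatial filter to the sensors' outputs [2], i.e.,

$$Z(f) = \mathbf{h}^H(f)\, \mathbf{y}(f) = \sum_{m=-N}^{N} H_m^*(f)\, Y_m(f), \tag{6}$$

where $^*$ and $^H$ denote complex conjugation and conjugate-transpose operator, $Z(f)$ is an estimate of the desired signal $X(f)$, and

$$\mathbf{h}(f) = \left[\, H_{-N}(f) \;\; \cdots \;\; H_0(f) \;\; \cdots \;\; H_N(f) \,\right]^T \tag{7}$$

is the linear filter of length $M$.

One of the most important measures to quantify the performance of a beamformer is the so-called beampattern or directivity pattern, which describes the sensitivity of the beamformer to a plane wave impinging on the array from the direction $\theta$. Mathematically, the beampattern is defined as

$$\mathcal{B}(f, \theta) = \mathbf{h}^H(f)\, \mathbf{d}(f, \theta) = \sum_{m=-N}^{N} H_m^*(f)\, e^{j 2\pi f m\delta \sin\theta / c}. \tag{8}$$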
2.3. Beamwidth
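We now give an example of the uniform weighting beamformer (i.e., delay-and-sum beamformer),

$$\mathbf{h}(f) = \frac{1}{M}\, \mathbf{d}(f, \theta_s). \tag{9}$$

When the array is steered to broadside, $\theta_s = 0$, we easily get

$$\mathcal{B}(f, \theta) = \frac{1}{M} \sum_{m=-N}^{N} e^{j 2\pi f m\delta \sin\theta / c} = \frac{\sin\left(M \pi f \delta \sin\theta / c\right)}{M \sin\left(\pi f \delta \sin\theta / c\right)}. \tag{10}$$

If we define the beamwidth as the angle between the two lowest values at both sides of the main lobe (i.e., the beamwidth null to null), the beamwidth $\theta_{\mathrm{BW}}$ can be obtained in this case from the first null of Equation (10), $M \pi f \delta \sin\theta / c = \pi$:

$$\theta_{\mathrm{BW}} = 2 \arcsin\left(\frac{c}{M f \delta}\right). \tag{11}$$

This expression indicates the dependence of the beamwidth on the number of sensors $M$, the interelement spacing $\delta$ and the frequency $f$. One can observe that the beamwidth decreases as the frequency increases, which implies that this beamforming method suffers from a frequency varying beamwidth. The beampattern based on the delay-and-sum beamformer is shown in Figure 2, which illustrates this narrowing of the main lobe.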
Figure 2
Beampattern using delay-and-sum beamformer. , cm.
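The frequency dependence of the delay-and-sum beamwidth can be checked numerically. The sketch below assumes the standard closed forms for a broadside uniform linear array, namely the pattern sin(M psi) / (M sin(psi)) with psi = pi f delta sin(theta) / c and the null-to-null beamwidth 2 arcsin(c / (M f delta)). The array size `M = 9`, the spacing `delta = 0.04` m and the speed of sound `C = 340` m/s are illustrative assumptions, not the values used for Figure 2, and the function names are hypothetical.

```python
import numpy as np

C = 340.0  # assumed speed of sound in m/s (illustrative)

def ds_beampattern(theta, f, M, delta, c=C):
    """|B(f, theta)| of a broadside delay-and-sum beamformer with M sensors."""
    psi = np.pi * f * delta * np.sin(np.atleast_1d(theta).astype(float)) / c
    den = M * np.sin(psi)
    out = np.ones_like(psi)             # limit value where the denominator vanishes
    ok = np.abs(den) > 1e-12
    out[ok] = np.sin(M * psi[ok]) / den[ok]
    return np.abs(out)

def ds_beamwidth(f, M, delta, c=C):
    """Null-to-null beamwidth in radians; NaN when no null is visible."""
    x = c / (M * f * delta)
    return 2.0 * np.arcsin(x) if x <= 1.0 else float("nan")

# Hypothetical array: the beamwidth shrinks as the frequency grows.
M, delta = 9, 0.04
beamwidths_deg = {f: np.degrees(ds_beamwidth(f, M, delta))
                  for f in (2000.0, 4000.0, 8000.0)}
```

With these assumed values the computed beamwidth narrows monotonically from 2 kHz to 8 kHz, which is the behavior the constant beamwidth designs in Section 3 are meant to remove.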
3. Window-Based Acoustic Beamformer with a Constant Beamwidth
In this section, we propose a window-based symmetrical beamformer method with a constant beamwidth over a wide frequency range.

Define $\omega = 2\pi f \delta \sin\theta / c$, then the steering vector can be rewritten as:

$$\mathbf{d}(\omega) = \left[\, e^{-jN\omega} \;\; \cdots \;\; 1 \;\; \cdots \;\; e^{jN\omega} \,\right]^T. \tag{12}$$

Accordingly, the beampattern is obtained through the discrete Fourier transform of the spatial filter:

$$\mathcal{B}(\omega) = \sum_{m=-N}^{N} H_m^*\, e^{jm\omega}. \tag{13}$$

In this work, we restrict ourselves to real weights $w_m$; the beampattern is then given by

$$\mathcal{B}(\omega) = \sum_{m=-N}^{N} w_m\, e^{jm\omega}. \tag{14}$$

The real weights are regarded as a spatial window function. Hence, the beamwidth can be maintained by applying different shapes of windows for different frequency bins. Next, we present four different kinds of windows, i.e., modified rectangular, DPSS, Kaiser, and Chebyshev. For each window, we first introduce the mathematical representation, and then show how to control its shape as a function of frequency by setting the proper parameter of the window in order to maintain a constant beamwidth.
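The view of the beampattern as a discrete Fourier transform of real weights can be sketched in a few lines. The code assumes the normalized variable omega = 2 pi f delta sin(theta) / c and sensor indices m = -N, ..., N; the function name `beampattern`, the 9-element uniform window, the spacing of 0.04 m and the 340 m/s speed of sound are illustrative assumptions.

```python
import numpy as np

def beampattern(w, f, delta, theta, c=340.0):
    """B = sum_m w[m] * exp(j*m*omega) for m = -N..N, omega = 2*pi*f*delta*sin(theta)/c."""
    w = np.asarray(w, dtype=float)
    if w.size % 2 == 0:
        raise ValueError("expects an odd number of weights (M = 2N + 1)")
    N = (w.size - 1) // 2
    m = np.arange(-N, N + 1)
    omega = 2.0 * np.pi * f * delta * np.sin(np.atleast_1d(theta)) / c
    return np.exp(1j * np.outer(omega, m)) @ w

theta = np.radians(np.linspace(-90.0, 90.0, 361))
w_uniform = np.full(9, 1.0 / 9.0)            # hypothetical 9-element uniform window
B = beampattern(w_uniform, 3000.0, 0.04, theta)
```

Because the weights are real and symmetric about the array center, the imaginary part of `B` vanishes up to rounding, so only the window shape controls the main lobe width and the sidelobes.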
3.1. Modified Rectangular Window
From Equation (11), in order to keep the beamwidth constant for varying frequency, the product $Mf$ should remain constant, which means that the number of microphones should decrease as the frequency increases. To alleviate the beamwidth fluctuations, Rosen et al. [13] proposed a modified rectangular window based on smoothing coefficients.

The weights in [13] can be regarded as a kind of modified rectangular window:
where $K$ is the number of activated microphones, and $g$ is the smoothing coefficient.

Next, we show how to obtain the parameters $K$ and $g$ to keep the beamwidth constant.
3.1.1. Lowest and Highest Frequencies
Since the number of activated microphones should be less than $M$ and more than 3, we should first determine the lowest frequency and the highest frequency for which the desired beamwidth is feasible. For the given array configuration and fixed beamwidth $\theta_B$, based on Equation (11), the lowest and highest frequencies using the rectangular window are

$$f_{\mathrm{low}} = \frac{c}{M \delta \sin(\theta_B / 2)}, \qquad f_{\mathrm{high}} = \frac{c}{3 \delta \sin(\theta_B / 2)}. \tag{16}$$

Meanwhile, in order to avoid the maximum grating lobe, the highest frequency should also be smaller than $c/\delta$.
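The feasible band can be computed directly by inverting the null-to-null relation. This sketch assumes that relation has the form theta = 2 arcsin(c / (K f delta)) for K active sensors, takes K = M for the lowest frequency and K = 3 for the highest, and caps the result at c / delta, where the grating-lobe maximum of a broadside array enters the visible region. The function name and all numerical values are hypothetical.

```python
import math

def rect_frequency_range(M, delta, beamwidth_deg, c=340.0, k_min=3):
    """Frequency band over which a rectangular window with k_min..M active
    sensors can reach the requested null-to-null beamwidth."""
    s = math.sin(math.radians(beamwidth_deg) / 2.0)
    f_low = c / (M * delta * s)                        # all M sensors active
    f_high = min(c / (k_min * delta * s), c / delta)   # k_min sensors, grating-lobe cap
    return f_low, f_high

# Hypothetical example: 9 sensors, 4 cm spacing, 40 degree beamwidth.
f_low, f_high = rect_frequency_range(9, 0.04, 40.0)
```

Below `f_low` even the full aperture is too short for the requested beamwidth, which is why the low-frequency end of the band is the limiting case for every window that contains the rectangular window as a special case.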
3.1.2. The Parameter K
For the feasible frequency range , we need to reduce the number of activated microphones to keep constant as the frequency varies. So the value of K is obtained by the maximum integer which satisfied .
3.1.3. The Parameter g
The smoothing coefficient g can be derived as follows. The beampattern using the modified rectangular window is given by

The optimal value of smoothing coefficient g is obtained by setting , which yields

In order to normalize the beampattern, we use the normalized weighting coefficients as:

The beampattern based on a modified rectangular window is shown in Figure 3. The beamwidth is fixed to , and cm. We show the beampattern at three different frequencies Hz. One observes that the rectangular window-based method can effectively fix the beamwidth, but the sidelobe level is high in this case.
Figure 3
Constant beamwidth beampattern using a modified rectangular window. The beamwidth is fixed to , , cm.
3.2. DPSS Window
Rectangular window-based method can effectively fix the beamwidth but cannot control the sidelobes. So, we attempt to find some other windows with real weights in order to control the sidelobe level. Based on Equation (13), we can first define as the ratio of the total beamforming power that is concentrated in a given angular region:
where , , the th element of A is
and similarly,
where is the identity matrix. To maximize , the optimum solution is obtained from the eigenvalue problem
or equivalently,
where is an eigenvalue of . Thus in this case, is maximized by the maximum eigenvalue . The resulting weight sequences are called discrete prolate spheroidal sequences [15] (DPSS, or Slepian sequences).
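The eigenvalue formulation can be illustrated with the classical Slepian concentration problem. The sketch assumes the power ratio is measured over the region |omega| <= omega0 in the normalized variable omega, so that the numerator matrix has entries sin(omega0 (m - n)) / (pi (m - n)) and the denominator matrix is the identity. The name `omega0`, the function name and the example values are introduced here for illustration and may not match the symbols used in the equations above.

```python
import numpy as np

def dpss_weights(M, omega0):
    """Dominant eigenvector of the concentration matrix, normalised to unit sum.

    A[m, n] = sin(omega0 * (m - n)) / (pi * (m - n)), with omega0 / pi on the diagonal.
    Returns the weights and the maximum eigenvalue (the achieved power ratio).
    """
    idx = np.arange(M)
    diff = idx[:, None] - idx[None, :]
    # np.sinc(x) = sin(pi x) / (pi x), hence the rescaled argument.
    A = (omega0 / np.pi) * np.sinc(omega0 * diff / np.pi)
    eigvals, eigvecs = np.linalg.eigh(A)   # eigenvalues in ascending order
    w = eigvecs[:, -1]
    if w.sum() < 0:                        # the eigenvector sign is arbitrary
        w = -w
    return w / w.sum(), eigvals[-1]

w_dpss, ratio = dpss_weights(9, 0.3 * np.pi)   # hypothetical 9-sensor example
```

Widening `omega0` concentrates more power in the chosen region at the cost of a wider main lobe, so a single scalar is enough to trade beamwidth against sidelobe level.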
3.2.1. Lowest and Highest Frequencies
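In the limiting case where the DPSS window becomes a rectangular window, Equation (11) gives the lowest frequency which enables the desired beamwidth $\theta_B$:

$$f_{\mathrm{low}} = \frac{c}{M \delta \sin(\theta_B / 2)}. \tag{25}$$

On the other hand, in order to avoid the maximum grating lobe, the highest frequency is given by

$$f_{\mathrm{high}} = \frac{c}{\delta}. \tag{26}$$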
3.2.2. The Parameter
For the given array configuration, it is easy to verify that the beamwidth increases as the parameter increases. For the given constant beamwidth , we can set for the frequency f
or we can search for the optimal parameter to fix the beamwidth in practice; the weights can then be obtained following the DPSS window in Equation (24).

Finally, in order to normalize the beampattern, we use the normalized weighting coefficients as:

The beampattern based on a DPSS window is shown in Figure 4. The beamwidth is still fixed to , and cm. We show the beampattern at three different frequencies Hz. It can be seen that the DPSS window-based approach can not only keep the beamwidth constant, but also effectively suppress the sidelobe level.
Figure 4
Constant beamwidth beampattern using a DPSS window. The beamwidth is fixed to , , cm.
3.3. Kaiser Window
A Kaiser window is a simple approximation to the DPSS window using Bessel functions. The details of calculating a Kaiser window can be found in [16]. The weighting coefficient of the microphone with index $m$ is:

$$w_m = \frac{I_0\left(\beta \sqrt{1 - (m/N)^2}\right)}{I_0(\beta)}, \quad m = -N, \ldots, N \quad (M = 2N + 1),$$

where $I_0(\cdot)$ is the zeroth-order modified Bessel function of the first kind. The parameter $\beta$ specifies a beampattern tradeoff between the sidelobe amplitude and the main lobe width. When $\beta = 0$, the Kaiser window becomes a rectangular window.
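A minimal sketch of the Kaiser weights, assuming the standard definition with sensor indices m = -N, ..., N and the shape parameter called `beta` here. It uses `numpy.i0` for the zeroth-order modified Bessel function and normalizes the weights to unit sum so that the broadside response equals one; the function name and the example values are illustrative.

```python
import numpy as np

def kaiser_weights(M, beta):
    """Kaiser weights for M = 2N + 1 sensors, normalised to unit sum."""
    if M < 3 or M % 2 == 0:
        raise ValueError("expects an odd M >= 3")
    N = (M - 1) // 2
    m = np.arange(-N, N + 1)
    w = np.i0(beta * np.sqrt(1.0 - (m / N) ** 2)) / np.i0(beta)
    return w / w.sum()

w_rect = kaiser_weights(9, 0.0)     # beta = 0 reproduces the rectangular window
w_taper = kaiser_weights(9, 3.0)    # a larger beta tapers the edge sensors
```

The unnormalized values agree with `numpy.kaiser(M, beta)`, so the library routine can be used directly when the symmetric indexing is not needed.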
3.3.1. Lowest and Highest Frequencies
Since the Kaiser window approximates the DPSS window, in order to make the desired beamwidth $\theta_B$ feasible, the lowest and highest frequencies using the Kaiser window are

$$f_{\mathrm{low}} = \frac{c}{M \delta \sin(\theta_B / 2)}, \qquad f_{\mathrm{high}} = \frac{c}{\delta}.$$

3.3.2. The Parameter $\beta$
For the given array configuration, it is easy to verify that the beamwidth increases as the parameter $\beta$ increases. For a given constant beamwidth $\theta_B$, the parameter $\beta$ can be obtained following the approximate piecewise relation [16]:

In practice, we can also search for the optimal parameter $\beta$ for a constant beamwidth. Furthermore, in order to normalize the beampattern, we use the normalized weighting coefficients as:

The beampattern based on a Kaiser window is shown in Figure 5. We fix the beamwidth to , and cm, and plot the beampattern at three different frequencies Hz. It is shown that the Kaiser window-based method yields a beampattern similar to that of the DPSS window.
Figure 5
Constant beamwidth beampattern using a Kaiser window. The beamwidth is fixed to , , cm.
3.4. Chebyshev Window
Another window that can be used to control the main lobe beamwidth and sidelobe level is the Chebyshev Window [17,18], which minimizes the beamwidth for a given maximum sidelobe level. The coefficients of the Chebyshev window are given by
where , , , r is defined as the amplitude ratio between maximum sidelobe and mainlobe, and is the Chebyshev polynomial of the first kind, defined by
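One way to obtain Dolph-Chebyshev weights numerically is to sample the Chebyshev-shaped beampattern and invert the discrete Fourier transform. The sketch below assumes an odd number of sensors M = 2N + 1 and the pattern T_{2N}(x0 cos(omega / 2)) with T_{2N}(x0) = 1 / r, where r is the sidelobe-to-mainlobe amplitude ratio. It is a standard construction and not necessarily the closed-form coefficient expression given above; the function name and the example values are hypothetical.

```python
import numpy as np
from numpy.polynomial.chebyshev import chebval

def chebyshev_weights(M, r):
    """Dolph-Chebyshev weights (unit sum) for odd M and amplitude ratio 0 < r <= 1."""
    if M < 3 or M % 2 == 0:
        raise ValueError("expects an odd M >= 3")
    N = (M - 1) // 2
    x0 = np.cosh(np.arccosh(1.0 / r) / (2 * N))
    omega = 2.0 * np.pi * np.arange(M) / M
    coeffs = [0.0] * (2 * N) + [1.0]                  # selects T_{2N}
    B = chebval(x0 * np.cos(omega / 2.0), coeffs)     # pattern samples, peak value 1/r
    m = np.arange(-N, N + 1)
    w = (np.exp(-1j * np.outer(m, omega)) @ B).real / M
    return w / w.sum()

w_cheb = chebyshev_weights(9, 0.1)    # sidelobes 20 dB below the main lobe
w_edge = chebyshev_weights(9, 1.0)    # r = 1: only the two edge sensors remain
```

Since the pattern is a trigonometric polynomial of degree N, the M samples determine the weights exactly, and setting r = 1 leaves equal weights on the two edge sensors only.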
3.4.1. Lowest and Highest Frequencies
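When the sidelobe level is the same as the mainlobe level, i.e., $r = 1$, the coefficients of the Chebyshev window reduce, up to a scale factor, to

$$\left[\, 1 \;\; 0 \;\; \cdots \;\; 0 \;\; 1 \,\right]^T.$$

This is equivalent to a two-element array with an interelement spacing $(M - 1)\delta$. According to Equations (25) and (26), the lowest frequency using the Chebyshev window for a desired beamwidth $\theta_B$ becomes

$$f_{\mathrm{low}} = \frac{c}{2 (M - 1) \delta \sin(\theta_B / 2)},$$

and the highest frequency follows from Equation (26).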
3.4.2. The Parameter r
For the given array configuration, it is easy to verify that the beamwidth increases as the parameter $r$ decreases. For a given constant beamwidth $\theta_B$, the parameter $r$ can be obtained following the approximate piecewise relation:

In practice, we can also search for the optimal parameter $r$ for a constant beamwidth. Again, in order to normalize the beampattern, we use the normalized weighting coefficients as:

The beampatterns for different frequencies based on a Chebyshev window are shown in Figure 6. Again, the beamwidth is fixed, and one can see that the Chebyshev window-based method can also effectively keep the beamwidth constant and yield equi-level sidelobes.
Figure 6
Constant beamwidth beampattern using the Chebyshev window. The beamwidth is fixed to , , cm.
4. Experiments and Results
We have discussed different window-based beamformers with a constant beamwidth in Section 3. In this section, we compare the performances of different windows via several simulations. All of the simulated uniform linear arrays are configured with omnidirectional microphones, with an interelement spacing equal to . In these experiments, the modified rectangular, DPSS, Kaiser and Chebyshev window-based methods are used and the beamwidth is fixed to . In Section 4.1 we show how to set the optimal parameter to shape the window and design the beamformer in order to attain beamwidth constancy. Section 4.2 provides the performance measures of the beamformer. Section 4.3 shows the results.
4.1. Optimal Window Parameter
In Section 3, we have shown how to set the parameter to shape the window in order to attain beamwidth constancy, e.g., the parameter $g$ for the modified rectangular window, the parameter for the DPSS window, the parameter $\beta$ for the Kaiser window, and the parameter $r$ for the Chebyshev window. Compared with the optimization-based method, our method has very low computational complexity in the design process, because the problem has been simplified to obtaining a single parameter of a standard window.

In practice, we can also search for the optimal parameter of a given window. There are two reasons for using the search method: (1) The relationship between the fixed beamwidth and the parameter is only an approximate piecewise relation in some cases. (2) The search method can be easily extended to nonuniform arrays. Algorithm 1 shows the search algorithm of a window-based beamformer with a constant beamwidth, where we search the optimal parameter of a given window for each frequency bin between the lower and upper cutoff frequencies of the frequency band, so as to keep the beamwidth fixed.
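A per-frequency parameter search of this kind can be sketched as a bisection. The code below is not a transcription of Algorithm 1. It assumes a Kaiser window, a beamwidth that grows monotonically with the shape parameter, and a numerical null-to-null measurement taken as twice the angle of the first local minimum of the pattern magnitude. All names and values (`search_beta`, 9 sensors, 4 cm spacing, 40 degree target, 340 m/s) are hypothetical.

```python
import numpy as np

C = 340.0  # assumed speed of sound in m/s

def kaiser_weights(M, beta):
    """Kaiser weights for M = 2N + 1 sensors, normalised to unit sum."""
    N = (M - 1) // 2
    m = np.arange(-N, N + 1)
    w = np.i0(beta * np.sqrt(1.0 - (m / N) ** 2))
    return w / w.sum()

def null_to_null_beamwidth(w, f, delta, c=C, n_grid=4001):
    """Twice the angle of the first local minimum of |B|; pi if none is visible.

    Assumes real weights that are symmetric about the array centre.
    """
    N = (len(w) - 1) // 2
    m = np.arange(-N, N + 1)
    theta = np.linspace(0.0, np.pi / 2.0, n_grid)
    omega = 2.0 * np.pi * f * delta * np.sin(theta) / c
    B = np.abs(np.cos(np.outer(omega, m)) @ w)   # symmetric weights give a real pattern
    is_min = (B[1:-1] <= B[:-2]) & (B[1:-1] <= B[2:])
    idx = np.flatnonzero(is_min)
    return 2.0 * theta[idx[0] + 1] if idx.size else np.pi

def search_beta(M, f, delta, target_bw, beta_max=20.0, tol=1e-3, c=C):
    """Bisection on beta until the measured beamwidth matches target_bw (radians).

    Returns None when even the rectangular window (beta = 0) is too wide,
    i.e. when f lies below the feasible band.
    """
    def measure(beta):
        return null_to_null_beamwidth(kaiser_weights(M, beta), f, delta, c)
    if measure(0.0) > target_bw:
        return None
    lo, hi = 0.0, beta_max
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if measure(mid) < target_bw:
            lo = mid          # main lobe still too narrow: taper more
        else:
            hi = mid
    return 0.5 * (lo + hi)

# One parameter per frequency bin (hypothetical band and array).
target = np.radians(40.0)
freqs = np.arange(3000.0, 8001.0, 1000.0)
design = {float(f): search_beta(9, f, 0.04, target) for f in freqs}
```

The same loop applies to any single-parameter window once `kaiser_weights` is swapped for another generator with a monotonic beamwidth response; for a nonuniform array only the pattern evaluation inside `null_to_null_beamwidth` would need to change.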
4.2. Performance Measures
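We evaluate the beamformers using white noise gain (WNG) and directivity factor (DF) [1,4,19]. The WNG is a measure indicating the array gain in the presence of uncorrelated white noise, which is also a measure of the sensitivity of the microphone array to some of its imperfections, such as sensor noise and mismatch. The DF of the array is the gain in signal-to-noise ratio (SNR) for the case of spherical diffuse noise. Mathematically, they are respectively defined as (note that the main lobe is perpendicular to the line that connects all the array elements, i.e., $\theta_s = 0$):

$$\mathcal{W}[\mathbf{h}(f)] = \frac{\left| \mathbf{h}^H(f)\, \mathbf{d}(f, 0) \right|^2}{\mathbf{h}^H(f)\, \mathbf{h}(f)}$$

and

$$\mathcal{D}[\mathbf{h}(f)] = \frac{\left| \mathbf{h}^H(f)\, \mathbf{d}(f, 0) \right|^2}{\mathbf{h}^H(f)\, \boldsymbol{\Gamma}(f)\, \mathbf{h}(f)},$$

where $\boldsymbol{\Gamma}(f)$ is the pseudo-coherence matrix of the spherically isotropic noise field, whose $(i, j)$th element is $\sin\left[2\pi f (j - i)\delta / c\right] / \left[2\pi f (j - i)\delta / c\right]$.

We also evaluate the beamformer's beamwidth, which is defined as the angle between the two lowest values at both sides of the main lobe (i.e., the beamwidth null to null).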
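Both measures are short to compute once the weights are known. The sketch assumes the usual definitions for a broadside look direction: WNG as |h^H d|^2 / (h^H h), and DF as |h^H d|^2 / (h^H Gamma h) with the spherically isotropic coherence matrix Gamma[i, j] = sin(x) / x, x = 2 pi f (j - i) delta / c. The function names and the 9-sensor, 4 cm, 340 m/s example are hypothetical.

```python
import numpy as np

def steering(f, M, delta, theta=0.0, c=340.0):
    """Steering vector for sensor indices m = -N..N of a uniform linear array."""
    N = (M - 1) // 2
    m = np.arange(-N, N + 1)
    return np.exp(1j * 2.0 * np.pi * f * m * delta * np.sin(theta) / c)

def white_noise_gain(h, d):
    """|h^H d|^2 / (h^H h)."""
    return np.abs(np.vdot(h, d)) ** 2 / np.real(np.vdot(h, h))

def directivity_factor(h, d, f, delta, c=340.0):
    """|h^H d|^2 / (h^H Gamma h) for spherically isotropic (diffuse) noise."""
    idx = np.arange(len(h))
    dist = np.abs(idx[:, None] - idx[None, :]) * delta
    gamma = np.sinc(2.0 * f * dist / c)     # np.sinc(x) = sin(pi x) / (pi x)
    return np.abs(np.vdot(h, d)) ** 2 / np.real(np.conj(h) @ gamma @ h)

M, delta, f = 9, 0.04, 3000.0
h = np.full(M, 1.0 / M, dtype=complex)      # uniform (delay-and-sum) weights
d0 = steering(f, M, delta)                  # broadside look direction
wng = white_noise_gain(h, d0)
df = directivity_factor(h, d0, f, delta)
```

For uniform weights the WNG equals the number of sensors, which is the maximum attainable value; tapered windows trade some of it for lower sidelobes.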
4.3. Results
4.3.1. Wideband Beampatterns with a Constant Beamwidth Using Different Windows
We first compare the constant beamwidth beampatterns using different kinds of windows, where the frequency range is Hz. The results are plotted in Figure 7 for the modified rectangular window, Figure 8 for the DPSS window, Figure 9 for the Kaiser window, and Figure 10 for the Chebyshev window. One can see that all the window-based methods can effectively keep the beamwidth constant over a wide frequency band. Compared with modified rectangular window [13], the other proposed windows can obtain much lower sidelobe levels.
Figure 7
Constant beamwidth beamforming using the modified rectangular window-based method: (a) constant beamwidth beampattern, (b) the parameter , (c) the weights of microphones for different frequency bins. The beamwidth is fixed to , , cm.
Figure 8
Constant beamwidth beamforming using the DPSS window-based method: (a) constant beamwidth beampattern, (b) the parameter , (c) the weights of microphones for different frequency bins. The beamwidth is fixed to , , cm.
Figure 9
Constant beamwidth beamforming using the Kaiser window-based method: (a) constant beamwidth beampattern, (b) the parameter , (c) the weights of microphones for different frequency bins. The beamwidth is fixed to , , cm.
Figure 10
Constant beamwidth beamforming using the Chebyshev window-based method: (a) constant beamwidth beampattern, (b) the parameter r, (c) the weights of microphones for different frequency bins. The beamwidth is fixed to , , cm.
4.3.2. Chebyshev Window
In order to explain the performance based on the Chebyshev window in low frequency bins, we show the weights of the microphones using the Chebyshev window when f = 1000, 1500, 2000, 2500, 3000 Hz, and the results are plotted in Figure 11. One can find that the Chebyshev approach designs a kind of 'saddle' shape window, where high weights are set for the microphones at the edges of the array. Consider the extreme situation in which only the two microphones at the edges are used, which means the Chebyshev window will be proportional to

$$\left[\, 1 \;\; 0 \;\; \cdots \;\; 0 \;\; 1 \,\right]^T,$$

the interelement spacing will be $(M - 1)\delta$ and the number of microphones is 2. Based on Equation (11), the lowest frequency which can attain the fixed beamwidth $\theta_B$ is

$$f = \frac{c}{2 (M - 1) \delta \sin(\theta_B / 2)}.$$
Figure 11
The weights of microphones for low frequency bins using the Chebyshev window, , cm.
This is equivalent to increasing the interelement spacing and the virtual length of the array. As a result, the Chebyshev window-based beamformer can attain beamwidth constancy at lower frequencies than the other window-based beamformers.

In order to improve the DF using a Chebyshev window in low frequency bins, we can search the parameter r to reach a compromise between beamwidth and DF. We call this compromise method Chebyshev window-I. The search criterion in this case is to increase (or decrease) the parameter of the window so as to maximize the directivity index, provided that the beamwidth stays above the desired minimal beamwidth. Figure 12 shows the weights of microphones using Chebyshev window-I for low frequency bins. The beampatterns and the parameters as functions of frequency using this method are shown in Figure 13.
Figure 12
The weights of microphones for low frequency bins using the Chebyshev window-I, , cm.
Figure 13
Constant beamwidth beamforming using the Chebyshev window-I method: (a) constant beamwidth beampattern, (b) the parameter r, (c) the weights of microphones for different frequency bins. The beamwidth is fixed to , , cm.
4.3.3. Directivity Factor and White Noise Gain as Function of Frequency
The DF and WNG as functions of frequency for the different window-based beamformers are plotted in Figure 14 and Figure 15. The DPSS, Kaiser and Chebyshev window-based beamformers achieve higher WNG and DF than the rectangular window at high frequencies. For frequencies below 2500 Hz, one can also see that the Chebyshev window-I beamformer achieves the highest DF among the compared beamformers.
Figure 14
DF as a function of frequency for different window-based beamformers with a constant beamwidth. The beamwidth is fixed to , , cm.
Figure 15
WNG as a function of frequency for different window-based beamformers with a constant beamwidth. The beamwidth is fixed to , , cm.
4.3.4. Beamwidth as Function of Frequency
Finally, we compare the beamwidth as a function of frequency for different window-based beamformers, and the results are plotted in Figure 16. At high frequencies, all the window-based beamformers can effectively fix the beamwidth to the desired value. For frequencies below 2500 Hz, the Chebyshev window-based beamformer obtains a smaller beamwidth than the other beamformers. As a result, the Chebyshev window-based beamformer can reach the fixed beamwidth at nearly 1400 Hz, but the other window-based beamformers attain beamwidth constancy only above 2500 Hz.
Figure 16
Beamwidth as a function of frequency for different window-based beamformers. The beamwidth is fixed to , , cm.
5. Conclusions
Traditional beamforming techniques suffer from a frequency varying beamwidth, which restricts their deployment in broadband applications. We have proposed window-based beamformers with constant beamwidths. Our method can effectively fix the beamwidth and exhibits the following advantages: (1) Compared with the optimization-based method, the proposed window-based approach is characterized by lower computational design complexity and higher white noise gain (which means it is less sensitive to microphone mismatches). (2) Compared with the FIR-based method proposed in [13], our window-based approach can reduce the sidelobe level and obtain a higher directivity factor. Experiments corroborate the theoretical analysis and show that we can adjust the window parameter to trade off WNG against DF. Furthermore, a hybrid window-based beamformer design method, which uses different windows for different frequency bins, is a topic for future research.