Dual Microphone Sound Source Localization Using Reconfigurable Hardware

doi:10.1007/978-981-10-6427-2_32

Home
/
Papers
/
Dual Microphone Sound Source Localization Using Reconfigurable Hardware

Book Chapter•DOI•

Dual Microphone Sound Source Localization Using Reconfigurable Hardware

Tanmay Biswas¹, Sudhindu Bikash Mandal¹, Debasri Saha¹, Amlan Chakrabarti¹•Institutions (1)

University of Calcutta¹

24 Mar 2017-Vol. 775, pp 397-406

TL;DR: An efficient reconfigurable hardware design of sound source localization system using one microphone pair using phase transform (PHAT) based filter using time delay of arrival (TDOA) algorithm to achieve time difference between the microphone signals is proposed.

read less

Abstract: This paper proposes an efficient reconfigurable hardware design of sound source localization system using one microphone pair We have used time delay of arrival (TDOA) algorithm using phase transform (PHAT) to achieve time difference between the microphone signals Phased transform (PHAT) based filter can reach high SNR gains, which makes it very suitable for localizing the sound source in a microphone array system Our design has implemented on Spartan6 Lx45 FPGA and presented the implementation results in terms of accuracy and speech quality We compare the original angle between the microphones to the resultant angle between the microphones and observed that proposed design provides very sharp accuracy in every corresponding angle We also compare the speech quality of proposed design signal with normal delayed signal The evaluation infers that our proposed hardware induce feasible for hand-held devices

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

FPGA based dual microphone speech enhancement

[...]

Tanmay Biswas¹, Sudhindu Bikash Mandal¹, Debasri Saha¹, Amlan Chakrabarti¹•Institutions (1)

Information Technology University¹

16 Jan 2019-Microsystem Technologies-micro-and Nanosystems-information Storage and Processing Systems

TL;DR: The objective and subjective evaluation established that the proposed hardware provides better throughput compared to the existing state of the art research works and induces feasibility for hand-held devices in background noisy environment.

...read moreread less

Abstract: This paper proposes an efficient reconfigurable hardware design of dual microphone speech enhancement technique using sound source localization and multi band spectral subtraction methods with elimination of background noise. Firstly, we have used a time delay of arrival algorithm using phase transform (PHAT) to achieve the time difference between the microphone signals. PHAT based filter can reach high SNR gains, which makes it very suitable for localizing the sound source in a microphone array system. After adjustment of the delay between the signals, multi band spectral subtraction technique enhances the signal from the background noise environment in each of the frequency bands. Our design has been implemented in Spartan6 Lx45 FPGA and we have presented the implementation results in terms of subjective and objective evaluations. We have also compared the angular separation between the microphones to the resultant angle between the microphones and observed that proposed design provides very sharp accuracy in every corresponding angle. The objective and subjective evaluation established that our design provides better throughput compared to the existing state of the art research works. The evaluation infers that our proposed hardware induce feasibility for hand-held devices in background noisy environment.

...read moreread less

3 citations

Book Chapter•DOI•

FPGA-Based Novel Speech Enhancement System Using Microphone Activity Detector

[...]

Tanmay Biswas¹, Shuvadeep Bhattacharjee¹, Sudhindu Bikash Mandal¹, Debasri Saha¹, Amlan Chakrabarti¹ - Show less +1 more•Institutions (1)

University of Calcutta¹

01 Jan 2019

TL;DR: The evaluation of the quality of speech of enhanced signal and its correctness of MAD to detect the single or dual microphone system implies that the proposed hardware can work as a proper embedded component for hardware-based execution for speech enhancement.

...read moreread less

Abstract: In this paper, we have proposed field-programmable gate array (FPGA) based design and implementation of a novel speech enhancement system, which can work for a single microphone device as well as that of a dual microphone device providing background noise immunity. We proposed a microphone activity detector (MAD), which detects the presence of single or dual microphone scenario. After detecting the microphones, multiband spectral subtraction technique enhances the speech signal from different background noisy surrounds. We have implemented our proposed design in Spartan 6 LX45 FPGA using Xilinx system generator tools. The evaluation of the quality of speech of enhanced signal and its correctness of MAD to detect the single or dual microphone system implies that our proposed hardware can work as a proper embedded component for hardware-based execution for speech enhancement.

...read moreread less

1 citations

References

PDF

Open Access

More filters

Journal Article•DOI•

The generalized correlation method for estimation of time delay

[...]

C. Knapp¹, G. Carter•Institutions (1)

University of Connecticut¹

01 Aug 1976-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: In this paper, a maximum likelihood estimator is developed for determining time delay between signals received at two spatially separated sensors in the presence of uncorrelated noise, where the role of the prefilters is to accentuate the signal passed to the correlator at frequencies for which the signal-to-noise (S/N) ratio is highest and suppress the noise power.

...read moreread less

Abstract: A maximum likelihood (ML) estimator is developed for determining time delay between signals received at two spatially separated sensors in the presence of uncorrelated noise. This ML estimator can be realized as a pair of receiver prefilters followed by a cross correlator. The time argument at which the correlator achieves a maximum is the delay estimate. The ML estimator is compared with several other proposed processors of similar form. Under certain conditions the ML estimator is shown to be identical to one proposed by Hannan and Thomson [10] and MacDonald and Schultheiss [21]. Qualitatively, the role of the prefilters is to accentuate the signal passed to the correlator at frequencies for which the signal-to-noise (S/N) ratio is highest and, simultaneously, to suppress the noise power. The same type of prefiltering is provided by the generalized Eckart filter, which maximizes the S/N ratio of the correlator output. For low S/N ratio, the ML estimator is shown to be equivalent to Eckart prefiltering.

...read moreread less

4,317 citations

Journal Article•DOI•

Adaptive eigenvalue decomposition algorithm for passive acoustic source localization

[...]

Jacob Benesty¹•Institutions (1)

Alcatel-Lucent¹

01 Jan 2000-Journal of the Acoustical Society of America

TL;DR: A new approach is proposed that is based on eigenvalue decomposition that performs well and is very accurate for time delay estimation of acoustic source locations.

...read moreread less

Abstract: To find the position of an acoustic source in a room, the relative delay between two (or more) microphone signals for the direct sound must be determined. The generalized cross-correlation method is the most popular technique to do so and is well explained in a landmark paper by Knapp and Carter. In this paper, a new approach is proposed that is based on eigenvalue decomposition. Indeed, the eigenvector corresponding to the minimum eigenvalue of the covariance matrix of the microphone signals contains the impulse responses between the source and the microphone signals (and therefore all the information we need for time delay estimation). In experiments, the proposed algorithm performs well and is very accurate.

...read moreread less

395 citations

Journal Article•DOI•

A practical methodology for speech source localization with microphone arrays

[...]

M.S. Brandstein¹, Harvey F. Silverman²•Institutions (2)

Harvard University¹, Brown University²

01 Apr 1997-Computer Speech & Language

TL;DR: This paper addresses the specific application of source localization algorithms for estimating the position of speech sources in a real-room environment given limited computational resources and presents theoretical foundations of a speech source localization system.

...read moreread less

332 citations

Journal Article•DOI•

Energy-based collaborative source localization using acoustic microsensor array

[...]

Dan Li¹, Yu Hen Hu¹•Institutions (1)

University of Wisconsin-Madison¹

01 Jan 2003-EURASIP Journal on Advances in Signal Processing

TL;DR: A novel sensor network source localization method based on acoustic energy measurements that makes use of the characteristics that the acoustic energy decays inversely with respect to the square of distance from the source is presented.

...read moreread less

Abstract: A novel sensor network source localization method based on acoustic energy measurements is presented. This method makes use of the characteristics that the acoustic energy decays inversely with respect to the square of distance from the source. By comparing energy readings measured at surrounding acoustic sensors, the source location during that time interval can be accurately estimated as the intersection of multiple hyperspheres. Theoretical bounds on the number of sensors required to yield unique solution are derived. Extensive simulations have been conducted to characterize the performance of this method under various parameter perturbations and noise conditions. Potential advantages of this approach include low intersensor communication requirement, robustness with respect to parameter perturbations and measurement noise, and low-complexity implementation.

...read moreread less

272 citations

Journal Article•DOI•

Performance of time-delay estimation in the presence of room reverberation

[...]

Benoit Champagne¹, S. Bedard, Alex Stephenne¹•Institutions (1)

Institut national de la recherche scientifique¹

01 Mar 1996-IEEE Transactions on Speech and Audio Processing

TL;DR: Synthetic microphone signals generated with the image model technique are used to study the effects of room reverberation on the performance of the maximum likelihood estimator of the time delay, in which the estimate is obtained by maximizing the cross correlation between filtered versions of the microphone signals.

...read moreread less

Abstract: Synthetic microphone signals generated with the image model technique are used to study the effects of room reverberation on the performance of the maximum likelihood (ML) estimator of the time delay, in which the estimate is obtained by maximizing the cross correlation between filtered versions of the microphone signals. The results underscore the adverse effects of reverberation on the bias, variance and probability of anomaly of the ML estimator. Explanations of these effects are provided.

...read moreread less

189 citations