Journal ArticleDOI

A 0.5 V 55 $\mu \text{W}$ 64 $\times $ 2 Channel Binaural Silicon Cochlea for Event-Driven Stereo-Audio Sensing

22 Sep 2016-IEEE Journal of Solid-state Circuits (IEEE)-Vol. 51, Iss: 11, pp 2554-2569
TL;DR: A 0.5V 55μW 64×2-channel binaural silicon cochlea aiming for ultra-low-power IoE applications like event-driven VAD, sound source localization, speaker identification and primitive speech recognition is presented.
Abstract: This paper presents a $64 \times 2$ channel stereo-audio sensing front end with parallel asynchronous event output inspired by the biological cochlea. Each binaural channel performs feature extraction by analog bandpass filtering, and the filtered signal is encoded into events via asynchronous delta modulation (ADM). The channel central frequencies $f_{0}$ are geometrically scaled across the human hearing range. Two design techniques are highlighted to achieve the high system power efficiency: source-follower-based bandpass filters (BPFs) and asynchronous delta modulation (ADM) with adaptive self-oscillating comparison. The chip was fabricated in 0.18 $\mu \text{m}$ 1P6M CMOS, and occupies an area of $10.5 \times 4.8$ mm$^2$. The core cochlea system operating under a 0.5 V power supply consumes 55 $\mu \text{W}$ at an output rate of 100k event/s. The measured range of $f_{0}$ is from 8 Hz to 20 kHz, and the BPF quality factor ${Q}$ can be tuned from 1 to almost 40. The 1 $\sigma $ mismatch of $f_{0}$ and ${Q}$ between two ears is 3.3% and 15%, respectively, across all channels at ${Q}\approx $ 10. Reconstruction of speech input from the event output of the chip is performed to validate the information integrity in event-domain representation, and vowel discrimination is demonstrated as a simple application using histograms of the output events. This type of silicon cochlea front end targets integration with embedded event-driven processors for low-power smart audio sensing with classification capabilities, such as voice activity detection and speaker identification.
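
The two core mechanisms described above, geometric spacing of the channel center frequencies and event generation by asynchronous delta modulation, can be illustrated with a minimal Python sketch. The endpoint frequencies, the delta threshold, and the sample-by-sample loop are illustrative assumptions; the chip itself performs the comparison in continuous time with an adaptive self-oscillating comparator.

```python
import numpy as np

# Geometrically scaled channel center frequencies across the hearing range.
# The 64-channel count comes from the paper; the 20 Hz-20 kHz endpoints are
# illustrative (the measured f0 tuning range reported is 8 Hz to 20 kHz).
def center_frequencies(n_channels=64, f_low=20.0, f_high=20e3):
    return np.geomspace(f_low, f_high, n_channels)

# Minimal asynchronous delta modulation (ADM) sketch: emit an ON event when the
# band-passed signal has risen by more than `delta` since the last event, and
# an OFF event when it has fallen by more than `delta`.
def adm_encode(x, delta=0.05):
    events = []            # (sample_index, polarity) pairs
    ref = x[0]             # reference level tracked between events
    for i, v in enumerate(x):
        if v - ref >= delta:
            events.append((i, +1))
            ref = v
        elif ref - v >= delta:
            events.append((i, -1))
            ref = v
    return events

if __name__ == "__main__":
    fs = 48_000
    t = np.arange(0, 0.01, 1.0 / fs)
    tone = 0.3 * np.sin(2 * np.pi * 1e3 * t)       # 1 kHz test tone
    print(len(center_frequencies()), "channels per ear,",
          len(adm_encode(tone)), "events for the test tone")
```

Because the reference level only moves when an event fires, a quiescent input produces no events at all, which is the property exploited for event-driven processing downstream.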
Citations
Journal ArticleDOI
TL;DR: A brief history of neuromorphic engineering is presented, followed by a focus on some of the principal current large-scale projects: their main features, how their approaches are complementary and distinct, their advantages and drawbacks, and the sorts of capabilities that each can deliver to neural modellers.
Abstract: Neuromorphic computing covers a diverse range of approaches to information processing all of which demonstrate some degree of neurobiological inspiration that differentiates them from mainstream conventional computing systems. The philosophy behind neuromorphic computing has its origins in the seminal work carried out by Carver Mead at Caltech in the late 1980s. This early work influenced others to carry developments forward, and advances in VLSI technology supported steady growth in the scale and capability of neuromorphic devices. Recently, a number of large-scale neuromorphic projects have emerged, taking the approach to unprecedented scales and capabilities. These large-scale projects are associated with major new funding initiatives for brain-related research, creating a sense that the time and circumstances are right for progress in our understanding of information processing in the brain. In this review we present a brief history of neuromorphic engineering then focus on some of the principal current large-scale projects, their main features, how their approaches are complementary and distinct, their advantages and drawbacks, and highlight the sorts of capabilities that each can deliver to neural modellers.

275 citations


Cites background from "A 0.5 V 55 $\mu \text{W}$ 64 $\time..."

  • ...Recent work at INI includes the development of neuromorphic vision sensors [8], silicon cochlea [9], and medium-scale neuromorphic processors such as the Reconfigurable On-Line Learning Spiking (ROLLS) and cxQuad chips....


Journal ArticleDOI
TL;DR: In this article, the authors present a snapshot of the present state of neuromorphic technology and provide an opinion on the challenges and opportunities that the future holds in its major areas.
Abstract: Modern computation based on von Neumann architecture is now a mature cutting-edge science. In the von Neumann architecture, processing and memory units are implemented as separate blocks interchanging data intensively and continuously. This data transfer is responsible for a large part of the power consumption. The next generation computer technology is expected to solve problems at the exascale with $10^{18}$ calculations each second. Even though these future computers will be incredibly powerful, if they are based on von Neumann type architectures, they will consume between 20 and 30 megawatts of power and will not have intrinsic physically built-in capabilities to learn or deal with complex data as our brain does. These needs can be addressed by neuromorphic computing systems which are inspired by the biological concepts of the human brain. This new generation of computers has the potential to be used for the storage and processing of large amounts of digital information with much lower power consumption than conventional processors. Among their potential future applications, an important niche is moving the control from data centers to edge devices. The aim of this roadmap is to present a snapshot of the present state of neuromorphic technology and provide an opinion on the challenges and opportunities that the future holds in the major areas of neuromorphic technology, namely materials, devices, neuromorphic circuits, neuromorphic algorithms, applications, and ethics. The roadmap is a collection of perspectives where leading researchers in the neuromorphic community provide their own view about the current state and the future challenges for each research area. We hope that this roadmap will be a useful resource by providing a concise yet comprehensive introduction to readers outside this field, for those who are just entering the field, as well as providing future perspectives for those who are well established in the neuromorphic computing community.

99 citations

Journal ArticleDOI
TL;DR: A survey of recent works in developing neuromorphic or neuro-inspired hardware systems, focusing on those systems which can either learn from data in an unsupervised or online supervised manner, and presenting algorithms and architectures developed specially to support on-chip learning.
Abstract: In this paper, we present a survey of recent works in developing neuromorphic or neuro-inspired hardware systems. In particular, we focus on those systems which can either learn from data in an unsupervised or online supervised manner. We present algorithms and architectures developed specially to support on-chip learning. Emphasis is placed on hardware friendly modifications of standard algorithms, such as backpropagation, as well as novel algorithms, such as structural plasticity, developed specially for low-resolution synapses. We cover works related to both spike-based and more traditional non-spike-based algorithms. This is followed by developments in novel devices, such as floating-gate MOS, memristors, and spintronic devices. CMOS circuit innovations for on-chip learning and CMOS interface circuits for post-CMOS devices, such as memristors, are presented. Common architectures, such as crossbar or island style arrays, are discussed, along with their relative merits and demerits. Finally, we present some possible applications of neuromorphic hardware, such as brain–machine interfaces, robotics, etc., and identify future research trends in the field.

90 citations


Cites background from "A 0.5 V 55 $\mu \text{W}$ 64 $\time..."

  • ...To mimic biology more closely, neuromorphic visual [178], [179], auditory [180] and tactile [181] sensors have been developed which operate in an asynchronous, spike based manner similar to human retina, cochlea and mechanoreceptors, respectively....


Journal ArticleDOI
01 Dec 2020-PhotoniX
TL;DR: The need for and the possibility of conceiving a photonic memristor are discussed, and a positive outlook is offered on the challenges and opportunities for the ambitious goal of realising the next generation of full-optical neuromorphic hardware.
Abstract: Neuromorphic computing applies concepts extracted from neuroscience to develop devices shaped like neural systems and achieve brain-like capacity and efficiency. In this way, neuromorphic machines, able to learn from the surrounding environment to deduce abstract concepts and to make decisions, promise to start a technological revolution transforming our society and our life. Current electronic implementations of neuromorphic architectures are still far from competing with their biological counterparts in terms of real-time information-processing capabilities, packing density and energy efficiency. A solution to this impasse is represented by the application of photonic principles to the neuromorphic domain creating in this way the field of neuromorphic photonics. This new field combines the advantages of photonics and neuromorphic architectures to build systems with high efficiency, high interconnectivity and high information density, and paves the way to ultrafast, power efficient and low cost and complex signal processing. In this Perspective, we review the rapid development of the neuromorphic computing field both in the electronic and in the photonic domain focusing on the role and the applications of memristors. We discuss the need and the possibility to conceive a photonic memristor and we offer a positive outlook on the challenges and opportunities for the ambitious goal of realising the next generation of full-optical neuromorphic hardware.

87 citations

Journal ArticleDOI
TL;DR: This work investigates the effectiveness of synchronous and asynchronous frame-based features, generated using spike-count and constant-event binning, in combination with a recurrent neural network for solving a classification task on the N-TIDIGITS18 dataset, and proposes a new pre-processing method which applies an exponential kernel to the output cochlea spikes so that the interspike timing information is better preserved.
Abstract: Event-driven neuromorphic spiking sensors such as the silicon retina and the silicon cochlea encode the external sensory stimuli as asynchronous streams of spikes across different channels or pixels. Combining state-of-the-art deep neural networks with the asynchronous outputs of these sensors has produced encouraging results on some datasets but remains challenging. While the lack of effective spiking networks to process the spike streams is one reason, the other reason is that the pre-processing methods required to convert the spike streams to frame-based features needed for the deep networks still require further investigation. This work investigates the effectiveness of synchronous and asynchronous frame-based features generated using spike count and constant event binning in combination with the use of a recurrent neural network for solving a classification task using the N-TIDIGITS18 dataset. This spike-based dataset consists of recordings from the Dynamic Audio Sensor, a spiking silicon cochlea sensor, in response to the TIDIGITS audio dataset. We also propose a new pre-processing method which applies an exponential kernel on the output cochlea spikes so that the interspike timing information is better preserved. The results from the N-TIDIGITS18 dataset show that the exponential features perform better than the spike count features, with over 91% accuracy on the digit classification task. This accuracy corresponds to an improvement of at least 2.5% over the use of spike count features, establishing a new state of the art for this dataset.
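
As a rough illustration of the two frame-generation ideas compared here, spike-count binning versus an exponentially decaying trace that preserves inter-spike timing, the following sketch shows one possible formulation. The bin width, time constant, and the assumption of time-sorted spikes are illustrative choices, not the paper's actual settings.

```python
import numpy as np

def spike_count_frames(times_ms, channels, n_channels=64, bin_ms=5.0):
    """Synchronous frames: count spikes per channel in fixed time bins.
    `times_ms` must be sorted in ascending order."""
    n_bins = int(times_ms[-1] // bin_ms) + 1
    frames = np.zeros((n_bins, n_channels))
    for t, c in zip(times_ms, channels):
        frames[int(t // bin_ms), c] += 1.0
    return frames

def exponential_frames(times_ms, channels, n_channels=64, bin_ms=5.0, tau_ms=20.0):
    """Frames taken from an exponentially decaying per-channel trace, so that
    the relative timing of spikes inside a bin still influences the feature."""
    n_bins = int(times_ms[-1] // bin_ms) + 1
    frames = np.zeros((n_bins, n_channels))
    trace = np.zeros(n_channels)
    t_prev = 0.0
    for t, c in zip(times_ms, channels):
        trace *= np.exp(-(t - t_prev) / tau_ms)   # decay since previous spike
        trace[c] += 1.0                           # contribution of this spike
        frames[int(t // bin_ms)] = trace          # snapshot of trace in this bin
        t_prev = t
    return frames

# Example: a short synthetic spike stream (times in ms, channel indices).
times = np.array([0.5, 1.2, 3.1, 6.0, 6.4, 9.9])
chans = np.array([3, 3, 10, 3, 22, 10])
print(spike_count_frames(times, chans).shape,
      exponential_frames(times, chans).shape)
```

Either feature matrix can then be fed frame by frame to a recurrent network; the abstract reports that the exponential-style features give the higher digit-classification accuracy.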

87 citations


Cites background or methods from "A 0.5 V 55 $\mu \text{W}$ 64 $\time..."

  • ...The event processing methods for the asynchronous spikes of event-based sensors such as the Dynamic Vision Sensor (DVS) (Lichtsteiner et al., 2008; Berner et al., 2013; Posch et al., 2014; Yang et al., 2015) and the Dynamic Audio Sensor (DAS) (Liu et al., 2014; Yang et al., 2016) fall roughly into two categories: either by the use of neural network methods or machine learning algorithms....


  • ...Time-binned SC features have been used for the speaker identification task using spike recordings generated from the TIMIT dataset (Liu et al., 2010; Li et al., 2012), the YOHO dataset (Chakrabartty and Liu, 2010), and real-world DAS recordings (Anumula et al., 2017)....


  • ...The methods evaluated in this work were carried out on recordings from the CochleaAMS1b and CochleaAMS1c, while they will be evaluated on CochLP in the future....


References
Journal ArticleDOI
08 Aug 2014-Science
TL;DR: Inspired by the brain’s structure, an efficient, scalable, and flexible non–von Neumann architecture is developed that leverages contemporary silicon technology and is well suited to many applications that use complex neural networks in real time, for example, multiobject detection and classification.
Abstract: Inspired by the brain’s structure, we have developed an efficient, scalable, and flexible non–von Neumann architecture that leverages contemporary silicon technology. To demonstrate, we built a 5.4-billion-transistor chip with 4096 neurosynaptic cores interconnected via an intrachip network that integrates 1 million programmable spiking neurons and 256 million configurable synapses. Chips can be tiled in two dimensions via an interchip communication interface, seamlessly scaling the architecture to a cortexlike sheet of arbitrary size. The architecture is well suited to many applications that use complex neural networks in real time, for example, multiobject detection and classification. With 400-pixel-by-240-pixel video input at 30 frames per second, the chip consumes 63 milliwatts.
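
A quick division of the totals quoted above gives the per-core resources (simple arithmetic on the rounded power-of-two figures in the abstract, not additional data): with $2^{20}$ neurons and $2^{28}$ synapses spread over $2^{12}$ cores, each neurosynaptic core carries $2^{20}/2^{12} = 256$ neurons and $2^{28}/2^{12} = 65{,}536$ synapses, i.e. a $256 \times 256$ synaptic crossbar per core.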

3,253 citations


"A 0.5 V 55 $\mu \text{W}$ 64 $\time..." refers background in this paper

  • ...The benefits of this type of clockless system are threefold: 1) feature extraction in analog domain can save power, particularly when the required feature SNR is low to medium [16], [17]; 2) asynchronous event encoding helps reduce data redundancy, because the output event rate is proportionally dependent on the input activity and becomes zero if the input is quiescent [18], [19]; and 3) event-driven processors, such as continuous-time (CT) DSPs [19] and spiking neural networks [20], have adaptive power consumption that is correlated with the incoming event rate, and their processing latency can be reduced because of low data redundancy; low processing latency is particularly important for real-time applications involving human–machine interactions, where the machine’s immediate reaction to human expressions, such as speech, is often necessary....


Book
01 Jan 1989
TL;DR: This chapter discusses a simple circuit that can generate a sinusoidal response and calls this circuit the second-order section, which can be used to generate any response that can be represented by two poles in the complex plane, where the two poles have both real and imaginary parts.
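
For context, the "second-order section" named in this summary is the standard two-pole building block; in bandpass form its transfer function and poles can be written as (textbook form, not reproduced from the book itself)

$$H(s) = \frac{(\omega_0/Q)\,s}{s^{2} + (\omega_0/Q)\,s + \omega_0^{2}}, \qquad s_{p1,p2} = -\frac{\omega_0}{2Q} \pm j\,\omega_0\sqrt{1 - \frac{1}{4Q^{2}}},$$

where $\omega_0 = 2\pi f_0$ sets the center frequency and $Q$ the selectivity. For $Q > 1/2$ the poles form the complex-conjugate pair with both real and imaginary parts that the summary refers to, which is the same $f_0$/$Q$ parameterisation tuned per channel in the cochlea chip's BPFs.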

2,460 citations


"A 0.5 V 55 $\mu \text{W}$ 64 $\time..." refers background in this paper

  • ...6) with maximum 18 dB attenuation is used to extend the system upper bound input signal range while keeping the distortion of the following BPF low; inclusion of this attenuator is further motivated by the fact that active microphones can produce large amplitude output voltages with close-by loud sound even without standalone preamplifiers, which may cause large-signal oscillation of BPFs at high Q [43]....


Journal ArticleDOI
22 Jan 2010
TL;DR: In this paper, the authors define and explore near-threshold computing (NTC), a design space where the supply voltage is approximately equal to the threshold voltage of the transistors.
Abstract: Power has become the primary design constraint for chip designers today. While Moore's law continues to provide additional transistors, power budgets have begun to prohibit those devices from actually being used. To reduce energy consumption, voltage scaling techniques have proved a popular technique with subthreshold design representing the endpoint of voltage scaling. Although it is extremely energy efficient, subthreshold design has been relegated to niche markets due to its major performance penalties. This paper defines and explores near-threshold computing (NTC), a design space where the supply voltage is approximately equal to the threshold voltage of the transistors. This region retains much of the energy savings of subthreshold operation with more favorable performance and variability characteristics. This makes it applicable to a broad range of power-constrained computing segments from sensors to high performance servers. This paper explores the barriers to the widespread adoption of NTC and describes current work aimed at overcoming these obstacles.
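
The energy argument behind near-threshold operation can be summarised with the usual first-order CMOS energy model (a standard approximation, not an equation taken from this paper):

$$E_{\mathrm{op}} \approx \underbrace{\alpha\,C_{\mathrm{eff}}\,V_{DD}^{2}}_{\text{dynamic}} \;+\; \underbrace{I_{\mathrm{leak}}\,V_{DD}\,t_{\mathrm{delay}}(V_{DD})}_{\text{leakage}}$$

Dynamic energy falls quadratically as $V_{DD}$ is lowered, while the leakage term grows because gate delay increases sharply at low voltages; operating near the threshold voltage keeps most of the quadratic savings while avoiding the steep delay and variability penalties of deep subthreshold, which is the design space the paper defines as NTC.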

767 citations

01 Jan 2010
TL;DR: The barriers to the widespread adoption of near-threshold computing are explored and current work aimed at overcoming these obstacles is described.
Abstract: Power has become the primary design constraint for chip designers today. While Moore's law continues to provide additional transistors, power budgets have begun to prohibit those devices from actually being used. To reduce energy consumption, voltage scaling techniques have proved a popular technique with subthreshold design representing the endpoint of voltage scaling. Although it is extremely energy efficient, subthreshold design has been relegated to niche markets due to its major performance penalties. This paper defines and explores near-threshold computing (NTC), a design space where the supply voltage is approximately equal to the threshold voltage of the transistors. This region retains much of the energy savings of subthreshold operation with more favorable performance and variability characteristics. This makes it applicable to a broad range of power-constrained computing segments from sensors to high performance servers. This paper explores the barriers to the widespread adoption of NTC and describes current work aimed at overcoming these obstacles.

695 citations


"A 0.5 V 55 $\mu \text{W}$ 64 $\time..." refers background in this paper

  • ...in the near-threshold region [56], [57] or completely avoided in an embedded system where the event streams are directly sent in parallel to the back-end event-driven processor on the same chip, as illustrated in Fig....


Journal ArticleDOI
Rahul Sarpeshkar
TL;DR: The results suggest that it is likely that the brain computes in a hybrid fashion and that an underappreciated and important reason for the efficiency of the human brain, which consumes only 12 W, is the hybrid and distributed nature of its architecture.
Abstract: We review the pros and cons of analog and digital computation. We propose that computation that is most efficient in its use of resources is neither analog computation nor digital computation but, rather, a mixture of the two forms. For maximum efficiency, the information and information-processing resources of the hybrid form must be distributed over many wires, with an optimal signal-to-noise ratio per wire. Our results suggest that it is likely that the brain computes in a hybrid fashion and that an underappreciated and important reason for the efficiency of the human brain, which consumes only 12 W, is the hybrid and distributed nature of its architecture.

495 citations


"A 0.5 V 55 $\mu \text{W}$ 64 $\time..." refers background in this paper

  • ...The benefits of this type of clockless system are threefold: 1) feature extraction in analog domain can save power, particularly when the required feature SNR is low to medium [16], [17]; 2) asynchronous event encoding helps reduce data redundancy, because the output event rate is proportionally dependent on the input activity and becomes zero if the input is quiescent [18], [19]; and 3) event-driven processors, such as continuous-time (CT) DSPs [19] and spiking neural networks [20], have adaptive power consumption that is correlated with the incoming event rate, and their processing latency can be reduced because of low data redundancy; low processing latency is particularly important for real-time applications involving human–machine interactions, where the machine’s immediate reaction to human expressions, such as speech, is often necessary....
