Chip design of portable speech memopad suitable for persons with visual disabilities

doi:10.1109/TSA.2002.805645

Journal ArticleDOI

Chip design of portable speech memopad suitable for persons with visual disabilities

Jhing-Fa Wang, +5 more

- 01 Nov 2002 -

IEEE Transactions on Speech and Audio Pr...

- Vol. 10, Iss: 8, pp 644-658

Chats0

TLDR

The proposed speech recognition and compression chip for portable memopad devices, especially suitable for use by the visually impaired, is presented, based on several cores of which they can be regarded as intellectual property cores to be used for a variety of speech-related application systems.

Abstract:

This paper presents the design of a speech recognition and compression chip for portable memopad devices, especially suitable for use by the visually impaired. The proposed chip design is based on several cores of which they can be regarded as intellectual property (IP) cores to be used for a variety of speech-related application systems. A cepstrum extraction core and a dynamic warping core are designed for mapping the speech recognition algorithms. In the cepstrum extraction core, a novel architecture computes the autocorrelation between the overlapping frames using two pairs of shift registers and an intelligent accumulation procedure. The architecture of the dynamic time warping core uses only a single processing element, and is based on our extensive study of the relationship among the nodes in the dynamic time warping lattice. Bit rate is the key factor affecting the memory size for speech compression; therefore, a very low bit-rate speech coder is used. The speech coder exploits a line-spectrum-based interpolation method, which yields fine quality synthesized speech despite the low 1.6 kbps bit rate. The 1.6 kbps vocoder core is cost-effective, and it integrates both encoder and decoder algorithms. The proposed design has been tested via hardware simulations on Xilinx Virtex series FPGAs and a semi-custom chip fabricated by 0.35 /spl mu/m CMOS single-poly-four-metal technology on a die size approximately 4.46/spl times/4.46 mm/sup 2/.

Chip design of portable speech memopad suitable for persons with visual disabilities

Citations

Robust Environmental Sound Recognition for Home Automation

VLSI Design for SVM-Based Speaker Verification System

A Low Power Wake-Up Circuitry Based on Dynamic Time Warping for Body Sensor Networks

Chip Design of LPC-cepstrum for Speech Recognition

An architecture of HMM-based isolated-word speech recognition with tone detection function

References

Fundamentals of speech recognition

Linear prediction: A tutorial review

Discrete-Time Processing of Speech Signals

Vlsi Digital Signal Processing Systems: Design And Implementation

Cepstrum analysis technique for automatic speaker verification

Related Papers (5)

Application of a VLSI Vector Quantization Processor to Real-Time Speech Coding

Scalable architecture for word HMM-based speech recognition and VLSI implementation in complete system

Parallelized Viterbi Processor for 5,000-Word Large-Vocabulary Real-Time Continuous Speech Recognition FPGA System

A low memory bandwidth Gaussian mixture model (GMM) processor for 20,000-word real-time speech recognition FPGA system

Realizing Low-Cost High-Throughput General-Purpose Block Encoder for JPEG2000