scispace - formally typeset
Search or ask a question
Topic

Codebook

About: Codebook is a research topic. Over the lifetime, 8492 publications have been published within this topic receiving 115995 citations.


Papers
More filters
Patent
18 Jul 1994
TL;DR: In this article, a speech recognizer is provided which uses a computationally-feasible method for constructing a set of Hidden Markov Models (HMMs) for speech recognition that utilize a partial and optimal degree of mixture tying.
Abstract: In accordance with the invention, a speech recognizer is provided which uses a computationally-feasible method for constructing a set of Hidden Markov Models (HMMs) for speech recognition that utilize a partial and optimal degree of mixture tying. With partially-tied HMMs, improved recognition accuracy of a large vocabulary word corpus as compared to systems that use fully-tied HMMs is achieved with less computational overhead than with a fully untied system. The computationally-feasible technique comprises the steps of determining a cluster of HMM states that share Gaussian components which are close together, developing a subset codebook for those clusters, and recalculating the Gaussians in the codebook to best estimate the clustered states.

43 citations

Journal ArticleDOI
TL;DR: An easily implementable stochastic relaxation algorithm for vector quantisation design is given, which generalises the usual Lloyd iteration in codebook design by perturbing the computed entroids with independent multidimensional noise, whose variance diminishes as the algorithm progresses.
Abstract: An easily implementable stochastic relaxation algorithm for vector quantisation design is given. It generalises the usual Lloyd iteration in codebook design by perturbing the computed entroids with independent multidimensional noise, whose variance diminishes as the algorithm progresses. A significant improvement is often achieved.

43 citations

Journal ArticleDOI
TL;DR: A new image classification method by spatial pyramid robust sparse coding (SP-RSC), which tries to find the maximum likelihood estimation solution by alternatively optimizing over the codebook and local feature coding parameters, hence is more robust to outliers than traditional sparse coding based methods.

43 citations

Proceedings ArticleDOI
15 Jul 2001
TL;DR: This paper presents an application of a hierarchical SOM for image compression which reduces the search complexity from O(N) to O(log N), enabling a faster training and image coding.
Abstract: The increase of the need for image storage and transmission in computer systems has increased the importance of signal and image compression algorithms. The approach involving vector quantization (VQ) relies on the design of a finite set of codes which will substitute the original signal during transmission with a minimal of distortion, taking advantage of the spatial redundancy of image to compress them. Algorithms such as LBG and SOM work in an unsupervised way toward finding a good codebook for a given training data. However, the number of code vectors (N) needed for VQ increases with the vector dimension, and full-search algorithms such as LBG and SOM can lead to large training and coding times. An alternative for reducing the computational complexity is the use of a tree-structured vector quantization algorithm. This paper presents an application of a hierarchical SOM for image compression which reduces the search complexity from O(N) to O(log N), enabling a faster training and image coding. Results are given for conventional SOM, LBG and HSOM, showing the advantage of the proposed method.

43 citations

Patent
24 Jan 2005
TL;DR: In this paper, a method and apparatus for reducing the complexity of linear prediction analysis-by-synthesis (LPAS) speech coders is presented, which includes a multi-tap pitch predictor having various parameters and utilizing an adaptive codebook subdivided into at least a first vector codebook and a second vector code book.
Abstract: A method and apparatus for reducing the complexity of linear prediction analysis-by-synthesis (LPAS) speech coders. The speech coder includes a multi-tap pitch predictor having various parameters and utilizing an adaptive codebook subdivided into at least a first vector codebook and a second vector codebook. The pitch predictor removes certain redundancies in a subject speech signal and vector quantizes the pitch predictor parameters. Further included is a source excitation (fixed) codebook that indicates pulses in the subject speech signal by deriving corresponding vector values. Serial optimization of the adaptive codebook first and then the fixed codebook produces a low complexity LPAS speech coder of the present invention.

43 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
88% related
Wireless network
122.5K papers, 2.1M citations
88% related
Network packet
159.7K papers, 2.2M citations
87% related
Wireless
133.4K papers, 1.9M citations
87% related
Wireless sensor network
142K papers, 2.4M citations
86% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023217
2022495
2021237
2020383
2019432
2018364