A large-vocabulary continuous speech recognition system for Hindi

doi:10.1147/RD.485.0703

Journal Article

M. Kumar, +2 more

01 Sep 2004

IBM Journal of Research and Development

Vol. 48, Iss. 5, pp. 703-715

TLDR

This paper presents two new techniques that have been used to build a large-vocabulary continuous Hindi speech recognition system and proposes a hybrid approach that combines rule-based and statistical approaches in a two-step fashion.

Abstract:

In this paper we present two new techniques that have been used to build a large-vocabulary continuous Hindi speech recognition system. We present a technique for fast bootstrapping of initial phone models for a new language. The training data for the new language is aligned using an existing speech recognition engine for another language. This aligned data is used to obtain the initial acoustic models for the phones of the new language. This approach requires less training data. We also present a technique for generating baseforms (phonetic spellings) for phonetic languages such as Hindi. As is inherent in phonetic languages, rules generally capture the mapping of spelling to phonemes very well. However, deep linguistic knowledge is required to write all possible rules, and there are some ambiguities in the language that are difficult to capture with rules. On the other hand, purely statistical techniques for baseform generation require large amounts of training data that are not readily available. We propose a hybrid approach that combines rule-based and statistical approaches in a two-step fashion. We evaluate the performance of the proposed approaches through various phonetic classification and recognition experiments.
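
The two-step idea in the abstract can be illustrated with a small sketch. The code below is not the paper's method: the rule table, the phone symbols, the romanized toy words, and the count-based disambiguator are all hypothetical stand-ins chosen for illustration. Step 1 applies rules and keeps every candidate where the rules are ambiguous; step 2 resolves each ambiguity using counts from a small seed lexicon.

```python
from collections import Counter, defaultdict

# Hypothetical toy rule table (an assumption, not taken from the paper).
# A letter with more than one candidate is an ambiguity the rules cannot settle.
RULES = {
    "k": ["K"],
    "m": ["M"],
    "l": ["L"],
    "a": ["AX", ""],  # toy ambiguity: vowel is pronounced or deleted
}

# Hypothetical seed lexicon: one phone (possibly empty) per letter.
SEED = [
    ("kala", ["K", "AX", "L", ""]),
    ("mala", ["M", "AX", "L", ""]),
    ("kamal", ["K", "AX", "M", "AX", "L"]),
]


def right_context(word, i):
    """Return the next letter, or '#' at the end of the word."""
    return word[i + 1] if i + 1 < len(word) else "#"


def rule_candidates(word):
    """Step 1: apply the rules, keeping all candidates per letter."""
    return [RULES[ch] for ch in word]


def train_disambiguator(seed_lexicon):
    """Count which phone was used for each (letter, right context) pair."""
    counts = defaultdict(Counter)
    for word, phones in seed_lexicon:
        for i, (ch, ph) in enumerate(zip(word, phones)):
            counts[(ch, right_context(word, i))][ph] += 1
    return counts


def baseform(word, counts):
    """Step 2: pick the most frequent seed choice at each ambiguous position."""
    out = []
    for i, cands in enumerate(rule_candidates(word)):
        choice = cands[0]  # default: first rule candidate
        if len(cands) > 1:
            seen = counts.get((word[i], right_context(word, i)))
            if seen:
                # Only phones the rules allow are considered.
                choice = max(cands, key=lambda c: seen[c])
        out.append(choice)
    return [p for p in out if p]  # drop deleted (empty) phones


counts = train_disambiguator(SEED)
print(baseform("kama", counts))  # ['K', 'AX', 'M']
print(baseform("lak", counts))   # ['L', 'AX', 'K'] (unseen context, rule default)
```

In this sketch the statistical step only chooses among candidates the rules already allow, so the seed lexicon needs to cover the ambiguous contexts rather than the whole spelling-to-phone mapping.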
