Showing papers in "Computer Speech & Language in 1993"

Perceptual experiments for diagnostic testing of text-to-speech systems

TL;DR: This work proposes an alternative representation of hidden Markov models in which a state of an HMM is defined as a template, i.e. a "typical" sequence of observations, derived from an ensemble of segments corresponding to that state.

73 citations

Journal Article

Jan P. H. van Santen¹

Bell Labs¹

Parse scoring with prosodic information: an analysis/synthesis approach

TL;DR: Perceptual methods for diagnosing problems in text-to-speech systems are described, and a battery of experimental paradigms that address different facets of speech quality and intelligibility is discussed.

54 citations

Journal Article

Mari Ostendorf¹, Colin W. Wightman¹, Nanette Veilleux¹

Boston University¹

Discriminative feature selection for speech recognition

TL;DR: Two scoring algorithms to rank candidate parses are proposed, both based on an analysis/synthesis approach that compares the recognized prosodic phrase structure (analysis) with the predicted structure (synthesis) for each candidate parse.

51 citations

Journal Article

Enrico Bocchieri¹, Jay G. Wilpon¹

Bell Labs¹

A feature-based formalism for two-level phonology: a description and implementation

TL;DR: The dimension of the frame feature vectors, and hence the number of model parameters, were greatly reduced without a significant loss of recognition performance.

41 citations

Journal Article

Stephen Pulman¹, Mark Hepple¹

University of Cambridge¹

The mu+ system for corpus based speech research

TL;DR: The goal of this paper is to introduce and describe a formalism for segment based phonology and phonological processing.

33 citations

Journal Article

Jonathan Harrington¹, Steve Cassidy¹, Janet Fletcher¹, Andrew McVeigh¹

Macquarie University¹

Synthesizing natural-sounding intonation for Dutch: rules and perceptual evaluation

TL;DR: The mu+ system has been developed to provide a common environment for experimentation in numerous facets of corpus based speech and language research including: articulatory and acoustic phonetics, prosodic analysis, speech technology research, and linguistic corpus development.

31 citations

Journal Article

Jacques Terken

A comparative study of discrete, semicontinuous, and continuous hidden Markov models

TL;DR: It is suggested that the decomposition of the utterance into intonation phrases by means of pitch makes an essential contribution to the naturalness of synthetic speech.

28 citations

Journal Article

Xuedong Huang¹, Hsiao-Wuen Hon¹, Mei-Yuh Hwang¹, Kai-Fu Lee¹

Carnegie Mellon University¹

Hidden Markov model representation of quantized articulatory features for speech recognition

TL;DR: The semicontinuous hidden Markov model (SCHMM) was extended to incorporate multiple codebooks, and it was found that the SCHMM can have a large number of free parameters in comparison with the discrete HMM because of its smoothing ability.

28 citations

Journal Article

Kevin Erler¹, Li Deng¹

University of Waterloo¹

Intonation component of a text-to-speech system for Hindi

TL;DR: Speech recognition results show that the new system outperforms the traditional HMM approaches in small tasks. Examination of the source of error, using Viterbi analysis, suggests that this new scheme is able to achieve better modelling of the acoustic transitions and coarticulation in speech.

22 citations

Journal Article

A. S. Madhukumar¹, S. Rajendran¹, B. Yegnanarayana¹

Indian Institute of Technology Madras¹

An information theoretical investigation into the distribution of phonetic information across the auditory spectrogram

TL;DR: Some features of the fundamental frequency (F0) contours of speech in Hindi are described and an approach to represent and activate this intonation knowledge for an unrestricted text-to-speech system for Hindi is proposed.

20 citations

Journal Article

Andrew C. Morris¹, Jean-Luc Schwartz¹, Pierre Escudier¹

Stendhal University¹

Continuous hidden Markov models integrating transitional and instantaneous features for Mandarin syllable recognition

TL;DR: In this paper, the authors used the information theoretic measure of mutual information to investigate the distribution of phonetic information across the on/off aligned auditory spectrogram for a corpus of vowel-plosive-vowel utterances.

Journal Article

Yumin Lee¹, Lin-Shan Lee¹

National Taiwan University¹

Speech Maker: a flexible and general framework for text-to-speech synthesis, and its application to Dutch

TL;DR: The performance of continuous HMMs using one type of transitional features in speaker-dependent recognition of the highly confusing Mandarin syllables is first evaluated and discussed in detail under the constraint of very limited training data.

Journal Article

Hugo C. van Leeuwen, Enrico te Lindert

Connected digit recognition based on improved acoustic resolution

TL;DR: A text-to-speech system for Dutch, called Spraakmaker, is described, based on a flexible underlying framework which has been devised for building Text Maker, which is a multi-level, synchronized data structure based on the work of Hertz, Kadin and Karplus (1985), which is to contain all linguistic information relevant to the text-to-speech conversion process.

Journal Article

Jay G. Wilpon¹, Chin-Hui Lee¹, Lawrence R. Rabiner¹

Bell Labs¹

Phoneme detection as a tool for comparing perception of natural and synthetic speech

TL;DR: This paper shows how the improved acoustic modeling techniques (using a continuous density hidden Markov model framework), developed for large-vocabulary speech recognition applications, can be applied to the problem of connected digit recognition with no changes made to the basic modeling techniques and with no vocabulary-specific information used.

Journal Article

Andrew J. Nix, Gita Mehta, Julie Dye, Anne Cutler

A knowledge-based system for stop consonant identification based on speech spectrogram reading

TL;DR: It is concluded that the phoneme detection task is a useful tool for investigating phonetic processing of synthetic speech input, but subjects must be encouraged to adopt a response criterion which emphasizes rapid responding.

Journal Article

Lori Lamel¹

Centre national de la recherche scientifique¹

Speech Maker Formalism: a rule formalism operating on a multi-level, synchronized data structure

TL;DR: The evidence indicates that the process of formalizing spectrogram reading can be modeled with rules, and the knowledge acquisition and knowledge representation, in terms of descriptions and rules, are described.

Journal Article

Hugo C. van Leeuwen

A feature based formalism for two-level phonology

TL;DR: An extensive and detailed description is given of the possibilities of SMF, concerning both the specification of patterns in the data structure and the alteration of its contents, and the completeness and the expressive power of the formalism are discussed.

Journal Article

Stephen Pulman