Showing papers in &quot;Speech Communication in 1988&quot;

On large-vocabulary speaker-independent continuous speech recognition

TL;DR: The articulatory-acoustic relation is simplified, the quantal nature of speech is confirmed, phonetic universals and phonetic systems are considered from a new point of view, formant transitions are explained and normalized, and an easy to control 9-parameter model proposed.

...read moreread less

105 citations

Journal Article•DOI•

[...]

Kai-Fu Lee¹•Institutions (1)

Carnegie Mellon University¹

An efficient stochastically excited linear predictive algorithm for high quality low bit rate transmission of speech

TL;DR: In this article, the authors describe sphinx, the world's first accurate large-vocabulary speaker-independent continuous speech recognition system, and compare its performance against similar systems, and account for its high accuracy.

...read moreread less

55 citations

Journal Article•DOI•

[...]

Willem Bastiaan Kleijn¹, Daniel John Krasinski¹, Richard Harry Ketchum¹•Institutions (1)

Bell Labs¹

Institut national de la recherche scientifique¹

TL;DR: Improvements to the SELP algorithm are described which result in better speech quality and higher computational efficiency, and a new recursive algorithm which performs a very fast search through the adaptive codebook.

...read moreread less

53 citations

Journal Article•DOI•

Diphone speech synthesis

[...]

Douglas O'Shaughnessy¹, Louis Barbeau¹, David Bernardi¹, Daniele Archanbault¹•Institutions (1)

Isolated digit recognition experiments using the multi-layer perceptron

TL;DR: Different methods for the second task are reviewed, emphasizing the advantages and disadvantages of the linear predictive (LPC) diphone approach.

...read moreread less

28 citations

Journal Article•DOI•

[...]

S. M. Peeling, Roger K. Moore

On the verification of triangle inequality by dynamic time-warping dissimilarity measures

TL;DR: The choice of the parameter values used by the Multi-Layer Perceptron is discussed and experimental results are quoted to show how the choice of these parameter values influences the performance of the MLP.

...read moreread less

26 citations

Journal Article•DOI•

[...]

Enrique Vidal¹, Francisco Casacuberta¹•Institutions (1)

Polytechnic University of Valencia¹

Perceptual normalization of the vowels of a man and a child in various contexts

TL;DR: The conclusions suggest the use of time-compressing preprocessing techniques and the application of suboptimal DTW procedures as the most likely causes of the TI dissatisfaction rates reported elsewere.

...read moreread less

24 citations

Journal Article•DOI•

[...]

D. R. van Bergem¹, Louis C. W. Pols¹, F.J. Koopmans-van Beinum¹•Institutions (1)

University of Amsterdam¹

Coproduction: evidence from EPG data

TL;DR: It is suggested that the acoustic context provides information about the formant frequencies of the talker's vowels with which a vowel space can be constructed that serves as a reference frame for the identification of the vowels in the test words.

...read moreread less

21 citations

Journal Article•DOI•

[...]

A. Marchal

A robust and fast CELP coder at 16 kbit/s

TL;DR: Data on stop sequences in French which supports the corpoductionist's point of view is presented and the articulatory pattern observed by electropalatography cannot be interpreted simply as the concatenation of assimilated segments.

...read moreread less

21 citations

Journal Article•DOI•

[...]

A. Le Guyader¹, D. Massaloux¹, F. Zurcher¹•Institutions (1)

Centre national d'études des télécommunications¹

A regular-pulse excited linear predictive codec

TL;DR: Both objective and subjective results confirm the high level of performances obtained by the 16 kbit/s CELP coder in different realistic transmission conditions as transmission with errors and ambient noise.

...read moreread less

19 citations

Journal Article•DOI•

[...]

Peter Vary¹, Rudolf Dipl Ing Hofmann¹, Karl Hellwig¹, Robert Johannes Sluyter¹•Institutions (1)

Philips¹

Intra-speaker variability of the long term speech spectrum

TL;DR: A 16 kbit/s speech codec with low complexity and low signal delay is presented which is a special version of the Regular-Pulse Excitation LPC approach (RPE-LPC).

...read moreread less

16 citations

Journal Article•DOI•

[...]

Bernard Harmegnies¹, Albert Landercy¹•Institutions (1)

University of Mons¹

MPE/LTP speech coder for moblie radio application

TL;DR: The cross-correlation coefficient was used to investigate LTS residual intra-speaker variability both in inter- and intra-text conditions, and significant subject-dependent differences have been revealed in both conditions.

...read moreread less

Journal Article•DOI•

[...]

Claude Galand¹, Michele Rosso¹, Emmanuel Lancon¹•Institutions (1)

IBM¹

Fast speaker-independent DTW recognition of isolated words using a metric-space algorithm (AESA)

TL;DR: The Multipulse excitation with long term prediction (MPE/LTP) algorithm and details on the implementation (constants, quantizing tables) respectively in the analysis part, the error protection/correction, and the synthesis part of the codec are reported.

...read moreread less

Journal Article•DOI•

[...]

Enrique Vidal¹, M. J. Lloret¹•Institutions (1)

Polytechnic University of Valencia¹

Strategies for lexical access to very large vocabularies

TL;DR: It is shown that, by increasing the number of tokens included in dictionaries with multiply represented words, a simultaneous reduction can be achieved in both the error-rate and thenumber of distance computations required.

...read moreread less

Journal Article•DOI•

[...]

L. Fissore¹, G. Micca¹, R. Pieraccini¹, P. Laface²•Institutions (2)

CSELT¹, University of Salerno²

Comparison of two speech codecs for DMR systems

TL;DR: The results show that a complexity reduction of about 73% can be achieved by using the two pass approach with respect to the direct approach, while the recognition accuracy remains comparable.

...read moreread less

Journal Article•DOI•

[...]

Vincenzo Lazzari¹, Roberto Montagna¹, Daniele Sereno¹•Institutions (1)

CSELT¹

Pan-European speech coding standard for digital mobile radio

TL;DR: The main result reported in the paper is that the performances of the two schemes are almost equivalent although their structure is very different.

...read moreread less

Journal Article•DOI•

[...]

Jon E. Natvig

A subjective testing methodology for evaluating medium rate codecs for mobile radio applications

TL;DR: A Regular Pulse Excitation/Long-Term Prediction LPC (RPE-LTP) coding algorithm has been selected as the basis for the standard for the Pan-European cellular system.

...read moreread less

Journal Article•DOI•

[...]

Alan E. Coleman¹, Norman Gleiss, Paolino Usai²•Institutions (2)

Suffolk University¹, CSELT²

Phoneme-based continuous speech recognition results for different language models in the 1000-word SPICOS system

TL;DR: In this article, the authors describe subjective testing methodologies adopted to select suitable candidate codecs capable of being used in the proposed Pan-European cellular digital mobile radio (DMR) system.

...read moreread less

Journal Article•DOI•

[...]

Hermann Ney¹, Annedore Paeseler¹•Institutions (1)

Philips¹

Harmonic coding-state of the art and future trends

TL;DR: An overview of a system for phoneme-based large-vocabulary continuous-speech recognition that provides the speaker dependent recognition component in the speech understanding system spicos that is designed to recognize and understand database queries spoken in natural German language.

...read moreread less

Journal Article•DOI•

[...]

Isabel Trancoso¹, Luís B. Almeida¹, Joaquim S. Rodrigues¹, Jorge S. Marques¹, José Tribolet¹ - Show less +1 more•Institutions (1)

Instituto Superior Técnico¹

Spellmode recognition based on vector quantization

TL;DR: An 8 kbit/s simulation is presented, using hard switching between harmonic coding and ATC to discuss the state of the art in analysis-synthesis methods and their application to coding.

...read moreread less

Journal Article•DOI•

[...]

Shan-shan Huang, Robert M. Gray¹•Institutions (1)

Stanford University¹

Medium-rate speech coding-trial of a review

TL;DR: A Markov-modelling Spellmode recognizer is described which uses LPC-VQ as a front-end for analog to digital conversion and data compression and it suffers from high computational cost.

...read moreread less

Journal Article•DOI•

[...]

Ulrich Heute¹•Institutions (1)

University of Erlangen-Nuremberg¹

An acoustic-phonetic oriented system for synthesizing Chinese

TL;DR: A “coding gap” of roughly 32-2.4 kbit/s is shown to actually define “medium-rate” speech coding, and the fundamental approaches trying to close the gap are exposed.

...read moreread less

Journal Article•DOI•

[...]

S. Yang¹, Yi Xu¹•Institutions (1)

Chinese Academy of Social Sciences¹

Modified condensed nearest neighbor rule as applied to speaker independent word recognition

TL;DR: All the 1268 syllables in Standard Chinese have been synthesized by this system, which produces a sound quality close to that of natural speech with respect to both intelligibility and naturalness.

...read moreread less

Journal Article•DOI•

[...]

N. Yalabik¹, F. Yarman-Vural¹•Institutions (1)

Middle East Technical University¹

Several approaches to speaker adaptation in automatic speech recognizers: Original French title: Quelques approches pour l'adaptation aux locuteurs en reconnaissance automatique de la parole

TL;DR: Modified versions of Edited and Condensed Nearest Neighbor Rules are applied to speaker-independent isolated word recognition to select the word templates, as opposed to the clustering techniques.

...read moreread less

Journal Article•DOI•

[...]

Khalid Choukri

A network of actions for automatic speech recognition

Journal Article•DOI•

[...]

Renato De Mori¹, Régis Cardin¹, Ettore Merlo¹, Mathew Palakal¹, Jean Rouat¹ - Show less +1 more•Institutions (1)

McGill University¹

The UK candidate 16 kbit/s speech codec for the GSM Pan-European study on a digital cellular land mobile radio

TL;DR: A paradigm for automatic speech recognition using networks of actions performing variable depth analysis produces descriptions of speech properties that are related to speech units through Markov models representing system performance.

...read moreread less

Journal Article•DOI•

[...]

R. B. Hanes¹, P. M. Attkfins¹•Institutions (1)

BT Group¹

Higher pole correction in vocal tract models and terminal analogs

TL;DR: The 16 kbit/s speech codec developed by British Telecom Research Laboratories and selected as the UK candidate to the GSM Pan-European study on digital cellular land mobile radio offers several important features including low delay, low computational complexity and a good tolerance to transmission errors.

...read moreread less

Journal Article•DOI•

[...]

Unto K. Laine¹•Institutions (1)

Helsinki University of Technology¹

Real-time large vocabulary word recognition via diphone spotting and multiprocessor implementation

TL;DR: A polynomial analysis of the vocal tract transfer function was done to obtain new practical models for the Higher Pole Correction (HPC), which can be used in analog as well as digital all-pole realizations to form a new type of pole-zero model for speech production.

...read moreread less

Journal Article•DOI•

[...]

C. Scagliola, A. Carossino, A. M. Colla, C. Favareto, P. Pedrazzi, D. Sciarra, C. Vicenzi - Show less +3 more

01 Jan 1988-Speech Communication

TL;DR: Elsag’s Large Vocabulary Isolated Word Recognition system DSPELL makes use of a diphone-based speech model and an extremely efficient word decoding algorithm, and is implemented on Elsag's multiprocessor EMMA-2 1.

...read moreread less

Journal Article•DOI•

An experimental environment for the generation and verification of word hypotheses in continuous speech

[...]

S. Kunzmann¹, Thomas Kuhn¹, Heinrich Niemann¹•Institutions (1)

University of Erlangen-Nuremberg¹

Algorithms and architectures for continuous speech acoustic-phonetic decoding : Original French title: Algorithmes et architectures pour le décodage acoustico-phonétique de la parole continue.

TL;DR: A verification algorithm and a word spotting technique both based on HMM will be discussed and some preliminary results for the matching procedures are given.

...read moreread less

Journal Article•DOI•

[...]

Dominique Vicard¹•Institutions (1)

École Normale Supérieure¹