Showing papers in &quot;Speech Communication in 1989&quot;

A four-parameter model of the glottis and vocal fold contact area

TL;DR: A robust new algorithm for accurate endpointing of speech signals is described in this paper after an overview of the literature, which uses simple measures based on energy and zero-crossing rate for speech/silence detection.

...read moreread less

113 citations

Journal Article•DOI•

[...]

Ingo R. Titze¹, Ingo R. Titze²•Institutions (2)

University of Iowa¹, Denver Center for the Performing Arts²

Long range coarticulation effects for tongue dorsum contact in VCVCV sequences

TL;DR: A four parameter model of the glottis is described with similar kinematic parameters to complement this approach and provides an alternative to flow pulse modeling because it can include some source-system interactions with relatively little computational overhead.

...read moreread less

70 citations

Journal Article•DOI•

[...]

Daniel Recasens¹•Institutions (1)

Autonomous University of Barcelona¹

Temporal decomposition of speech

TL;DR: Anticipatory effects appear to be more tightly controlled than carryover effects presumably because of phonemic preplanning, and gestural antagonism in the contextual phonemes affects the two coarticulatory types differently.

...read moreread less

64 citations

Journal Article•DOI•

[...]

Astrid M.L. Van Dijk-Kappers, Stephen M. Marcus

A strong evidence for the existence of a large-scale integrated spectral representation in vowel perception

TL;DR: In articulatory phonetics speech is described as a sequence of distinct articulatory gestures, each of which produces an acoustic event that should approximate a phonetic target as discussed by the authors, but due to the overlap of the gestures these phonetic targets are often only partly realized.

...read moreread less

51 citations

Journal Article•DOI•

[...]

Jean-Luc Schwartz¹, Pierre Escudier¹•Institutions (1)

École nationale supérieure d'électronique et de radioélectricité de Grenoble¹

Male and female voice source characteristics: Inverse filtering results

TL;DR: This large-scale (3–3.5 Bark) spectral integration theory derived from the work of Chistovich and colleagues and supposed to provide a basis for the computation of the F2 parameter is not in fact supported by an actual proof, since all presumed evidence can be understood without this theory.

...read moreread less

50 citations

Journal Article•DOI•

[...]

Patti Price¹•Institutions (1)

SRI International¹

Integration of rhythmic and syntactic constraints in a model of generation of French prosody

TL;DR: The magnitudes of the male-female differences are similar to those observed for the creaky-normal voicing differences and breathy-normal differences, and may arise from a combination of biological, sociological and acoustical effects.

...read moreread less

47 citations

Journal Article•DOI•

[...]

Gérard Bailly

The uniqueness point effect in the shadowing of spoken words

TL;DR: The model presented here shows that syntax-driven and rhythm-driven strategies could be extreme cases of a more complex model which integrates both syntactic and rhythmic constraints.

...read moreread less

42 citations

Journal Article•DOI•

[...]

Monique Radeau¹, Jose Morais¹•Institutions (1)

Université libre de Bruxelles¹

Jitter in sustained and isolated sentences produced by dysphonic speakers

TL;DR: It is concluded that the UP strongly mediates the recognition of spoken words with early UP, and the shadowing of late-UP items is best predicted by word length in slower, and by word frequency in faster subjects; this suggests the intervention of different mechanisms.

...read moreread less

34 citations

Journal Article•DOI•

[...]

Jean Schoentgen¹•Institutions (1)

Université libre de Bruxelles¹

Perceptual compensation for transmission channel and speaker effects on vowel quality

TL;DR: No intrinsic superiority in the discrimination performance of connected speech as opposed to sustained vowels could be found and in the case of running speech absolute microperturbation values appeared to be higher during inter-segment transitions and during voice onset and offset.

...read moreread less

29 citations

Journal Article•DOI•

[...]

Christopher J. Darwin¹, J Denis McKeown¹, David Kirby¹•Institutions (1)

University of Sussex¹

Quality of speech produced by analysis-synthesis

TL;DR: A tentative conclusion from these experiments is that it is easier for the perceptual system to compensate for the effects of a transmission channel if it only changes the relative amplitudes of formants than if it changes estimated formant frequencies.

...read moreread less

24 citations

Journal Article•DOI•

[...]

Donald G. Childers¹, K. Wu¹•Institutions (1)

University of Florida¹

Comparison of parameter sets for temporal decomposition

TL;DR: A two-channel approach to speech analysis is recommended to aid the automatic processing of speech, where one channel is the conventional acoustic signal, while the other channel isThe electroglottogram (EGG).

...read moreread less

Journal Article•DOI•

[...]

Astrid M.L. Van Dijk-Kappers

Pitch detection based on zero-phase filtering

TL;DR: This paper compares the results obtained with nine different sets of speech parametes, including log- area parameters, formants, reflection coefficients and band-filter parameters and concludes that log-area parameters from the most suitable parameter set available for temporal decomposition are obtained.

...read moreread less

Journal Article•DOI•

[...]

Ioannis Dologlou¹, George Carayannis¹•Institutions (1)

National Technical University of Athens¹

On text independent speaker identification using a quadratic classifier with optimal features

TL;DR: The algorithm is based on the iterative use of a linear filter with zero phase and monotonically decreasing frequency response, providing an estimate for the locations of the closure and opening of the vocal chords.

...read moreread less

Journal Article•DOI•

[...]

A. Cohen¹, I. Froind¹•Institutions (1)

Ben-Gurion University of the Negev¹

Just noticeable differences of articulation rate at sentence level

TL;DR: The use of the quadratic classifier together with the individual feature space is shown to drastically improve recognition accuracy while the added memory requirements are shown to be negligible.

...read moreread less

Journal Article•DOI•

[...]

W. Eefting¹, A.C.M. Rietveld¹•Institutions (1)

Radboud University Nijmegen¹

A study of line spectrum pair frequencies for vowel recognition

TL;DR: In a paired comparison task, two factors appeared to affect the tempo judgements to a certain extent: the response category to be used by the listeners and the position of the stimulus with standard tempo.

...read moreread less

Journal Article•DOI•

[...]

Kuldip K. Paliwal¹•Institutions (1)

Tata Institute of Fundamental Research¹

Duration in contest clustering for speech recognition

TL;DR: The LSP representation is studied for speech recognition, and the weighted LSP distance measure is found to perform significantly better than these popular LP distance measures.

...read moreread less

Journal Article•DOI•

[...]

Joseph Picone¹•Institutions (1)

Texas Instruments¹

Perception of voicing in dutch two-obstruent sequences: covariation of voicing cues

TL;DR: A clustering algorithm based on the standard KMEANS procedure that generates reference models for continuous density Hidden Markov Model (HMM) based systems by simultaneously considering spectral and duration information is introduced.

...read moreread less

Journal Article•DOI•

[...]

R. J. Van Den Berg¹•Institutions (1)

Radboud University Nijmegen¹

Improving performance of code excited LPC-coders by joint optimization

TL;DR: The results indicated that the effects of the parameters are additive and that, although presence/absence of periodicity (VOT and VTT) is the most important determinant of perceived voicing, perception is also to a large extent affected by “C2”-duration and “preceding vowel” duration.

...read moreread less

Journal Article•DOI•

[...]

Jörg-Martin Müller

On instantaneous and transitional spectral information for text-dependent speaker verification

TL;DR: A CELP speech coding algorithm where the coder parameters are jointly optimized where the relation between pitch period, pitch predictor coefficient, codebook entry and scaling factor is derived.

...read moreread less

Journal Article•DOI•

[...]

C. Berasconi¹•Institutions (1)

École Polytechnique Fédérale de Lausanne¹

A PCMN neural network for isolated word recognition

TL;DR: Investigations on a population of 22 speakers showed that the elimination of the time-invariant spectral components from the speech features, taking place when performing cepstral normalization or computing first-order orthogonal coefficients, brings a substantial reliability improvement.

...read moreread less

Journal Article•DOI•

[...]

H. Ye, S. Wang¹, F. Robert¹•Institutions (1)

Centre national de la recherche scientifique¹

The use of the Dempster-Shafer rule in the lexical component of a man-machine oral dialogue system

TL;DR: A Partial Connection Multilayered Network (PCMN), based on a technique of partial connection between layers, is presented, which permits the efficient treatment of temporal information, which is very important in speech processing, unlike image processing.

...read moreread less

Journal Article•DOI•

[...]

Laurent Romary¹, Jean-Marie Pierrel•Institutions (1)

Supélec¹

ESCA tutorial day and workshop on speech input/output assessment and speech databases

TL;DR: The Dempster-Shafer formalism is applied in order to combine information in the lexicon, using a frequency distribution as the basis for evidence evaluation and has suitable properties in the case of an oral dialogue system, as it preserves module autonomy and allows backtracking at any time during the recognition process.

...read moreread less

Journal Article•DOI•

[...]

Louis C. W. Pols

01 Dec 1989-Speech Communication

Journal Article•DOI•

Intelligibility of synthetic speech in the presence of interfering speech

[...]

J. H. Eggen

Lexical stress detection in isolated English words

TL;DR: This work used both a more conventional articulation test and a monosyllabic adaptive speech interference test to evaluate the intelligibility of nine different speech-coding techniques, and found different patterns of responses.

...read moreread less

Journal Article•DOI•

[...]

Bernard Kiriakos¹, Douglas D. O'Shaughnessy²•Institutions (2)

McGill University¹, Université du Québec²

An experimental dutch keyboard-to-speech system for the speech impaired

TL;DR: It is proposed that the study of lexical stress in continuous speech be accompanied by theStudy of prosodics and their general use in sentences, to avoid the problem of syllable segmentation.

...read moreread less

Journal Article•DOI•

[...]

R. J. Deliege

Comparison of several speech signal feature parameters for automatic speech recognition

TL;DR: An experimental Dutch keyboard-to-speech system has been developed to explore the possibilities and limitations of Dutch speech synthesis in a communication aid for the speech impaired as mentioned in this paper, using diphones and a formant synthesizer chip for speech synthesis.

...read moreread less

Journal Article•DOI•

[...]

Momir Partalo, Zlatko Sijerčic¹•Institutions (1)

Telecom Australia¹

How phonetic is a phonological feature representation? The case of labiodental fricatives

TL;DR: Discrete power spectrum features, i.e. the sign and rank-order functions of a bandpass filter output are analyzed together with more standard features such as LPC coefficients and the short-time spectrum measured by means of aBandpass filter bank.

...read moreread less

Journal Article•DOI•

[...]

Thomas Berg¹•Institutions (1)

Braunschweig University of Technology¹

Fixed-shape adaptive-gain vector quantization for speech waveform coding

TL;DR: It is concluded that phonetic and psycholinguistic feature representations need not match.

...read moreread less

Journal Article•DOI•

[...]

Michael J. Sabin

The recognition of speech by machine - a bibliography: Academic Press, London/San Diego, 1988, 498 pp., ISBN 0-12-356785-8

TL;DR: The gain portion of a shape-gain quantizer is made adaptive, yielding a vector quantizer that can adjust itself to the time-varying amplitude of a speech signal.

...read moreread less

Journal Article•DOI•

[...]

Jean Schoentgen