scispace - formally typeset
PatentDOI

Speech recognition apparatus having a speech coder outputting acoustic prototype ranks

TLDR
In this paper, a speech coding and speech recognition apparatus is presented, where the value of at least one feature of an utterance is measured over each of a series of successive time intervals to produce the series of feature vector signals, and the closeness of the feature value of each feature vector signal to the parameter value of a set of prototype vector signals determined to obtain prototype match scores for each vector signal and each prototype vector signal.
Abstract
A speech coding and speech recognition apparatus. The value of at least one feature of an utterance is measured over each of a series of successive time intervals to produce a series of feature vector signals. The closeness of the feature value of each feature vector signal to the parameter value of each of a set of prototype vector signals is determined to obtain prototype match scores for each vector signal and each prototype vector signal. For each feature vector signal, first-rank and second-rank scores are associated with the prototype vector signals having the best and second best prototype match scores, respectively. For each feature vector signal, at least the identification value and the rank score of the first-ranked and second-ranked prototype vector signals are output as a coded utterance representation signal of the feature vector signal, to produce a series of coded utterance representation signals. For each of a plurality of speech units, a probabilistic model has a plurality of model outputs, and output probabilities for each model output. Each model output comprises the identification value of a prototype vector and a rank score. For each speech unit, a match score comprises an estimate of the probability that the probabilistic model of the speech unit would output a series of model outputs matching a reference series comprising the identification value and rank score of at least one prototype vector from each coded utterance representation signal in the series of coded utterance representation signals.

read more

Citations
More filters
Patent

Intelligent Automated Assistant

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Patent

Automated Response to and Sensing of User Activity in Portable Devices

TL;DR: In this paper, various methods and devices described herein relate to devices which, in at least certain embodiments, may include one or more sensors for providing data relating to user activity and at least one processor for causing the device to respond based on the user activity which was determined, at least in part, through the sensors.
Patent

Using context information to facilitate processing of commands in a virtual assistant

TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent and reduce the number of candidate interpretations of user's input, and reduces the need for the user to provide excessive clarification input.
Patent

Method and apparatus for building an intelligent automated assistant

TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.
Patent

Contextual voice commands

TL;DR: In this paper, techniques and systems for implementing contextual voice commands are described and a physical input that relates the selected data item to an operation in a second context is received, and the operation is performed on the input data item in the second context.
References
More filters
Journal ArticleDOI

Continuous speech recognition by statistical methods

TL;DR: Experimental results are presented that indicate the power of the methods and concern modeling of a speaker and of an acoustic processor, extraction of the models' statistical parameters and hypothesis search procedures and likelihood computations of linguistic decoding.
PatentDOI

Speech recognition system

TL;DR: In this paper, the likelihood of a word in a vocabulary of words is evaluated for each word, each total score being the re-sult of combining at least two word scores generated by differing algorithms.
PatentDOI

Hidden Markov model speech recognition arrangement

TL;DR: In this paper, a speech recognizer includes a plurality of stored constrained hidden Markov model reference templates and a set of stored signals representative of prescribed acoustic features of the said plurality of reference patterns.
PatentDOI

Constructing Markov model word baseforms from multiple utterances by concatenating model sequences for word segments

TL;DR: In this paper, a method for segmenting multiple utterances of a vocabulary word in a consistent and coherent manner and determining a Markov model sequence for each segment was presented, where fenemic Markov models correspond to each label.
Related Papers (5)