scispace - formally typeset
Patent

System and method for improving the accuracy of a speech recognition program

Reads0
Chats0
TLDR
In this article, a speech recognition system that automatically converts a pre-recorded audio file into a written text is described. But the system is based on a speech-to-text conversation.
Abstract
A system and method for quickly improving the accuracy of a speech recognition program. The system is based on a speech recognition program that automatically converts a pre-recorded audio file into a written text. The system parses the written text into segments, each of which is corrected by the system and saved in a retrievable manner in association with the computer. The standard speech files are saved towards improving accuracy in speech-to-text conversation by the speech recognition program. The system further includes facilities to repetitively establish an independent instance of the written text from the pre-recorded audio file using the speech recognition program. This independent instance can then be broken into segments and each segment in said independent instance replaced with a corrected segment associated with the segment. In this manner, repetitive instruction of a speech recognition program can be facilitated. A system and method for directing pre-recorded audio files to a speech recognition program that does not accept such files is also disclosed. Such system and method are necessary to sue the system and method for quickly improving the accuracy of a speech recognition program with some pre-existing speech recognition programs.

read more

Citations
More filters
Patent

Intelligent Automated Assistant

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Patent

Method and apparatus for building an intelligent automated assistant

TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.
Patent

Intelligent automated assistant for tv user interactions

TL;DR: In this article, a virtual assistant can interact with a television set-top box to control content shown on a television and execute tasks according to the user's intent, including causing playback of media on the television.
Patent

Intelligent text-to-speech conversion

TL;DR: In this article, improved text-to-speech processing can convert text from an electronic document into an audio output that includes speech associated with the text as well as audio contextual cues.
Patent

Method of and apparatus for improving productivity of human reviewers of automatically transcribed documents generated by media conversion systems

TL;DR: In this paper, an apparatus for improving productivity of human reviewers of transcribed documents generated by media conversion systems includes a server/client network of computers, memories, and file systems.
References
More filters
Journal ArticleDOI

Random texts exhibit Zipf's-law-like word frequency distribution

TL;DR: It is shown that the distribution of word frequencies for randomly generated texts is very similar to Zipf's law observed in natural languages such as English.
Patent

Multimedia document retrieval by application of multimedia queries to a unified index of multimedia data for a plurality of multimedia data types

TL;DR: In this paper, the authors propose a unified index for multimedia document retrieval by evaluation of a query structure which can contain any of the multimedia data types, and operators which can be evaluated on any of these data types.
Patent

Error correction in speech recognition

TL;DR: In this paper, new techniques and systems may be implemented to improve error correction in speech recognition systems, which may be used in a standard desktop environment, in a mobile environment, or in any other type of environment that can receive and/or present recognized speech.
Patent

Automatic indexing and aligning of audio and text using speech recognition

TL;DR: In this paper, a method of automatically aligning a written transcript with speech in video and audio clips is presented. But it does not address the problem of automatic alignment of the transcript with the original transcript.
Patent

Adaptive natural language computer interface system

TL;DR: In this article, a system for computer translation between a natural language such as English, and a second language, such as the command language of a computer operating system, a job control language, a robot control language or a numerical control machine program language or subset of another natural language.