Open AccessProceedings Article
An experimental study of an audio indexing system for the web.
Beth Logan,Pedro J. Moreno,Jean-Manuel Van Thong,Edward W. D. Whittaker +3 more
- pp 676-679
Reads0
Chats0
TLDR
A speech recognition based audio search engine for indexing spoken documents found on the World Wide Web, focusing on the speech recognition and retrieval aspects, and the results of retrieval experiments demonstrate that the system can index effectively.Abstract:
We have developed a speech recognition based audio search engine for indexing spoken documents found on the World Wide Web Our site (http://wwwcompaqcom/speechbot) indexes around 20 news and talk radio shows covering a wide range of topics, speaking styles and acoustic conditions from a selection of public Web sites with multimedia archives In this paper, we describe our system and its performance, focusing on the speech recognition and retrieval aspects We describe our training procedure in some detail and report our historical error rate since the site launch We also investigate the impact of Out Of Vocabulary (OOV) words Finally we report the results of retrieval experiments which demonstrate that our system can index effectivelyread more
Citations
More filters
Proceedings ArticleDOI
Vocabulary independent spoken term detection
TL;DR: This work presents a vocabulary independent system that can handle arbitrary queries, exploiting the information provided by having both word transcripts and phonetic transcripts, in order to retrieve information from speech data.
Journal ArticleDOI
Spoken content retrieval: beyond cascading speech recognition with text retrieval
TL;DR: This overview article is intended to provide a thorough overview of the concepts, principles, approaches, and achievements of major technical contributions along this line of investigation.
Proceedings ArticleDOI
Vocabulary-independent search in spontaneous speech
TL;DR: This work presents a vocabulary-independent system to index and to search rapidly spontaneous speech, and introduces a new method of phonetic word-fragment lattice generation, which uses longer-span language knowledge than a phoneme recognizer.
Journal ArticleDOI
Speechbot: an experimental speech-based search engine for multimedia content on the web
TL;DR: This paper uses speech recognition technology to index spoken audio and video files from the World Wide Web when no transcriptions are available, and shows that, even if the transcription is inaccurate, it can still achieve good retrieval performance for typical user queries.
Proceedings ArticleDOI
Fast Vocabulary-Independent Audio Search Using Path-Based Graph Indexing
Olivier Siohan,Michiel Bacchiani +1 more
TL;DR: A fast vocabulary independent audio search approach that operates on phonetic lattices and is suitable for any query, inspired by a general graph indexing method that defines an automatic procedure to select a small number of paths as indexing features, keeping the index size small while allowing fast retrieval of the lattices matching a given query.
References
More filters
Book
Introduction to Modern Information Retrieval
Gerard Salton,Michael J. McGill +1 more
TL;DR: Reading is a need and a hobby at once and this condition is the on that will make you feel that you must read.
Analysis of a Very Large AltaVista Query Log
TL;DR: In this paper, an analysis of a 280 GB AltaVista search engine query log consisting of approximately 1 billion entries for search requests over a period of six weeks is presented, which represents approximately 285 million user sessions, each an attempt to fill a single information need.
1998 TREC-7 Spoken Document Retrieval Track Overview and Results
TL;DR: The 1998 TREC-7 Spoken Document Retrieval (SDR) Track which implemented an evaluation of retrieval of broadcast news excerpts using a combination of automatic speech recognition and information retrieval technologies is described.
Patent
Method for indexing information of a database
TL;DR: In this paper, an indexing method is provided for a database storing information as records at unique addresses, where pairs are generated for each record, each pair includes a word representing a portion of the information of the record and an associated location.
Proceedings ArticleDOI
The Cambridge University spoken document retrieval system
TL;DR: The retrieval performance over a wide range of speech transcription error rates is presented and a number of recognition error metrics that more accurately reflect the impact of transcription errors on retrieval accuracy are defined and computed.