scispace - formally typeset
Proceedings ArticleDOI

Comparison Of Grapheme-to-Phoneme Conversions For Spoken Document Retrieval

Dmitriy Prozorov, +1 more
- pp 1-4
TLDR
Analysis of spoken document retrieval techniques which apply word similarity based on phonemic transcriptions building or approximate string matching on the collection of spoken documents with speech on Russian language is obtained.
Abstract
The article contains analysis of spoken document retrieval techniques which apply word similarity based on phonemic transcriptions building or approximate string matching. Results are obtained on the collection of spoken documents with speech on Russian language. Grapheme-to-phoneme conversion methods based on a hidden Markov model and 1,2-order finite Markov chain is discussed on the article.

read more

Citations
More filters
Posted Content

Automatic Speech Recognition using limited vocabulary: A survey.

TL;DR: Automatic Speech Recognition (ASR) is an active field of research due to its huge number of applications and the proliferation of interfaces or computing devices that can support speech processing.
Journal ArticleDOI

Automatic Speech Recognition Using Limited Vocabulary: A Survey

TL;DR: A comprehensive view of mechanisms behind ASR systems as well as techniques, tools, projects, recent contributions, and possibly future directions in ASR using a limited vocabulary is provided.
Proceedings ArticleDOI

Properties of Two-Dimensional Discrete Exponential Functions with Variable Parameter in Spatial-Frequency Domain

TL;DR: In this paper, the authors studied the properties of two-dimensional discrete exponential functions with a variable parameter in the spatial-frequency domain and proved linearity, shift, and correlation properties for Fourier transforms with variable parameters.
Proceedings ArticleDOI

Two-Dimensional Discrete Fourier Transform with Variable Parameter in the Spatial-Frequency Domain

TL;DR: In this article, a new method for splitting a rectangular discrete Fourier transform matrix into square matrices was proposed based on the application of the modulus comparability relation to order the rows (columns) of the Fourier matrix.
Journal ArticleDOI

Theoretical foundations of digital vector Fourier analysis of two-dimensional signals padded with zero samples

TL;DR: In this paper, the modulus comparability relation is used to order the rows (columns) of the discrete Fourier transform matrix, which can be used to reduce the number of calculations.
References
More filters
Book

Stochastic processes

J. L. Doob, +1 more
Journal ArticleDOI

Joint-sequence models for grapheme-to-phoneme conversion

TL;DR: A novel estimation algorithm is presented that demonstrates high accuracy on a variety of databases and studies the impact of the maximum approximation in training and transcription, the interaction of model size parameters, n-best list generation, confidence measures, and phoneme-to-grapheme conversion.
Proceedings Article

WFST-Based Grapheme-to-Phoneme Conversion: Open Source tools for Alignment, Model-Building and Decoding

TL;DR: This paper introduces a new open source, WFST-based toolkit for Grapheme-toPhoneme conversion that is efficient, accurate and currently supports a range of features including EM sequence alignment and several decoding techniques novel in the context of G2P.
Proceedings ArticleDOI

Grapheme-to-phoneme conversion based on high-order Markov chain for spoken term detection by text query

TL;DR: The paper presents a new grapheme-to-phoneme conversion method based on high-order Markov chain that is applied to retrieve of spoken documents in Russian language.
Proceedings ArticleDOI

Building Test Speech Dataset on Russian Language for Spoken Document Retrieval Task

TL;DR: A technique of creation of speech dataset is presented which is applied for test of spoken document retrieval methods and contains expert's indication of documents which are relevant to queries.
Related Papers (5)
Trending Questions (1)
What are the current state-of-the-art techniques used in spoken document retrieval?

Current techniques in spoken document retrieval involve grapheme-to-phoneme conversion using hidden Markov models and finite Markov chains for word similarity based on phonemic transcriptions or string matching.