scispace - formally typeset
A

Andrey Shulipa

Researcher at Saint Petersburg State University of Information Technologies, Mechanics and Optics

Publications -  19
Citations -  308

Andrey Shulipa is an academic researcher from Saint Petersburg State University of Information Technologies, Mechanics and Optics. The author has contributed to research in topics: Speaker recognition & Cosine similarity. The author has an hindex of 8, co-authored 18 publications receiving 242 citations.

Papers
More filters
Proceedings ArticleDOI

Triplet Loss Based Cosine Similarity Metric Learning for Text-independent Speaker Recognition.

TL;DR: This work demonstrates that performance of deep speaker embeddings based systems can be improved by using Cosine Similarity Metric Learning (CSML) with the triplet loss training scheme.
Posted Content

On deep speaker embeddings for text-independent speaker recognition

TL;DR: It is demonstrated that using angular softmax activation at the last classification layer of a classification neural network instead of a simple softmaxactivation allows to train a more generalized discriminative speaker embedding extractor.
Proceedings ArticleDOI

On deep speaker embeddings for text-independent speaker recognition

TL;DR: In this article, the authors used angular softmax activation at the last classification layer of a classification neural network to train a more generalized discriminative speaker embedding extractor for text-independent speaker recognition.
Proceedings ArticleDOI

Text-dependent GMM-JFA system for password based speaker verification

TL;DR: The proposed State-GMM-supervector extractor makes it possible to create more accurate statistical models of speech signals and to achieve a 44% relative reduction of EER compared to the best state-of-the-art systems of text-dependent verification for a text-prompted passphrase.
Proceedings ArticleDOI

Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances.

TL;DR: In this paper, the authors proposed approaches aimed to improve the quality of far-field speaker verification systems in the presence of environmental noise, reverberation and reduce the system quality degradation for short utterances.