J
Jesper Jensen
Researcher at Aalborg University
Publications - 378
Citations - 10880
Jesper Jensen is an academic researcher from Aalborg University. The author has contributed to research in topics: Speech enhancement & Intelligibility (communication). The author has an hindex of 37, co-authored 350 publications receiving 8287 citations. Previous affiliations of Jesper Jensen include Delft University of Technology & Philips.
Papers
More filters
Journal ArticleDOI
An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech
TL;DR: A short-time objective intelligibility measure (STOI) is presented, which shows high correlation with the intelligibility of noisy and time-frequency weighted noisy speech (e.g., resulting from noise reduction) of three different listening experiments and showed better correlation with speech intelligibility compared to five other reference objective intelligible models.
Proceedings ArticleDOI
A short-time objective intelligibility measure for time-frequency weighted noisy speech
TL;DR: An objective intelligibility measure is presented, which shows high correlation (rho=0.95) with the intelligibility of both noisy, and TF-weighted noisy speech, and shows significantly better performance than three other, more sophisticated, objective measures.
Proceedings ArticleDOI
Permutation invariant training of deep models for speaker-independent multi-talker speech separation
TL;DR: In this paper, a permutation invariant training (PIT) was proposed for speaker independent multi-talker speech separation, commonly known as the cocktail-party problem, which minimizes the separation error directly.
Journal ArticleDOI
Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks
TL;DR: In this article, the utterance-level permutation invariant training (uPIT) technique was proposed for speaker independent multitalker speech separation, where RNNs, trained with uPIT, can separate multitalker mixed speech without any prior knowledge of signal duration, number of speakers, speaker identity, or gender.
Journal ArticleDOI
An Algorithm for Predicting the Intelligibility of Speech Masked by Modulated Noise Maskers
Jesper Jensen,Cees H. Taal +1 more
TL;DR: It is shown that ESTOI can be interpreted in terms of an orthogonal decomposition of short-time spectrograms into intelligibility subspaces, i.e., a ranking of spectrogram features according to their importance to intelligibility.