scispace - formally typeset
Open AccessBook ChapterDOI

The 2018 Signal Separation Evaluation Campaign

TLDR
SiSEC 2018 as mentioned in this paper was focused on audio and pursued the effort towards scaling up and making it easier to prototype audio separation software in an era of machine-learning-based systems.
Abstract
This paper reports the organization and results for the 2018 community-based Signal Separation Evaluation Campaign (SiSEC 2018). This year’s edition was focused on audio and pursued the effort towards scaling up and making it easier to prototype audio separation software in an era of machine-learning based systems. For this purpose, we prepared a new music separation database: MUSDB18, featuring close to 10 h of audio. Additionally, open-source software was released to automatically load, process and report performance on MUSDB18. Furthermore, a new official Python version for the BSS Eval toolbox was released, along with reference implementations for three oracle separation methods: ideal binary mask, ideal ratio mask, and multichannel Wiener filter. We finally report the results obtained by the participants.

read more

Citations
More filters
Proceedings ArticleDOI

SDR – Half-baked or Well Done?

TL;DR: The scale-invariant signal-to-distortion ratio (SI-SDR) as mentioned in this paper is a more robust measure for single-channel separation, which has been proposed in the BSS_eval toolkit.
Journal ArticleDOI

A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation

TL;DR: This paper proposes to analyze a large number of established and recent techniques according to four transverse axes: 1) the acoustic impulse response model, 2) the spatial filter design criterion, 3) the parameter estimation algorithm, and 4) optional postfiltering.
Journal ArticleDOI

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge

TL;DR: The emergence of deep learning as the most popular classification method is observed, replacing the traditional approaches based on Gaussian mixture models and support vector machines.
Posted Content

Music Source Separation in the Waveform Domain

TL;DR: Demucs is proposed, a new waveform-to-waveform model, which has an architecture closer to models for audio generation with more capacity on the decoder, and human evaluations show that Demucs has significantly higher quality than Conv-Tasnet, but slightly more contamination from other sources, which explains the difference in SDR.
Journal ArticleDOI

Open-Unmix - A Reference Implementation for Music Source Separation

TL;DR: Open-Unmix provides implementations for the most popular deep learning frameworks, giving researchers a flexible way to reproduce results and provides a pre-trained model for end users and even artists to try and use source separation.
References
More filters
Journal ArticleDOI

Performance measurement in blind audio source separation

TL;DR: This paper considers four different sets of allowed distortions in blind audio source separation algorithms, from time-invariant gains to time-varying filters, and derives a global performance measure using an energy ratio, plus a separate performance measure for each error term.
Proceedings ArticleDOI

The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines

TL;DR: The design and outcomes of the 3rd CHiME Challenge, which targets the performance of automatic speech recognition in a real-world, commercially-motivated scenario: a person talking to a tablet device that has been fitted with a six-channel microphone array, are presented.
Proceedings ArticleDOI

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks

TL;DR: A phase-sensitive objective function based on the signal-to-noise ratio (SNR) of the reconstructed signal is developed, and it is shown that in experiments it yields uniformly better results in terms of signal- to-distortion ratio (SDR).
Book ChapterDOI

On Ideal Binary Mask As the Computational Goal of Auditory Scene Analysis

TL;DR: This chapter is an attempt at a computational-theory analysis of auditory scene analysis, where the main task is to understand the character of the CASA problem.
Journal ArticleDOI

Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics

TL;DR: A comparative evaluation of the proposed approach shows that it outperforms current state-of-the-art melody extraction systems in terms of overall accuracy.
Related Papers (5)