The 2018 Signal Separation Evaluation Campaign

doi:10.1007/978-3-319-93764-9_28

Open AccessBook ChapterDOI

The 2018 Signal Separation Evaluation Campaign

- Iss: 10891, pp 293-305

TLDR

SiSEC 2018 as mentioned in this paper was focused on audio and pursued the effort towards scaling up and making it easier to prototype audio separation software in an era of machine-learning-based systems.

Abstract:

This paper reports the organization and results for the 2018 community-based Signal Separation Evaluation Campaign (SiSEC 2018). This year’s edition was focused on audio and pursued the effort towards scaling up and making it easier to prototype audio separation software in an era of machine-learning based systems. For this purpose, we prepared a new music separation database: MUSDB18, featuring close to 10 h of audio. Additionally, open-source software was released to automatically load, process and report performance on MUSDB18. Furthermore, a new official Python version for the BSS Eval toolbox was released, along with reference implementations for three oracle separation methods: ideal binary mask, ideal ratio mask, and multichannel Wiener filter. We finally report the results obtained by the participants.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

SDR – Half-baked or Well Done?

Jonathan Le Roux, +3 more

TL;DR: The scale-invariant signal-to-distortion ratio (SI-SDR) as mentioned in this paper is a more robust measure for single-channel separation, which has been proposed in the BSS_eval toolkit.

...read moreread less

Journal ArticleDOI

A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation

Sharon Gannot, +3 more

- 01 Apr 2017 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This paper proposes to analyze a large number of established and recent techniques according to four transverse axes: 1) the acoustic impulse response model, 2) the spatial filter design criterion, 3) the parameter estimation algorithm, and 4) optional postfiltering.

...read moreread less

Journal ArticleDOI

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge

Annamaria Mesaros, +6 more

- 01 Feb 2018 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: The emergence of deep learning as the most popular classification method is observed, replacing the traditional approaches based on Gaussian mixture models and support vector machines.

...read moreread less

Posted Content

Music Source Separation in the Waveform Domain

Alexandre Défossez, +5 more

- 25 Sep 2019 -

arXiv: Sound

TL;DR: Demucs is proposed, a new waveform-to-waveform model, which has an architecture closer to models for audio generation with more capacity on the decoder, and human evaluations show that Demucs has significantly higher quality than Conv-Tasnet, but slightly more contamination from other sources, which explains the difference in SDR.

...read moreread less

Journal ArticleDOI

Open-Unmix - A Reference Implementation for Music Source Separation

Fabian-Robert Stöter, +3 more

TL;DR: Open-Unmix provides implementations for the most popular deep learning frameworks, giving researchers a flexible way to reproduce results and provides a pre-trained model for end users and even artists to try and use source separation.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Performance measurement in blind audio source separation

Emmanuel Vincent, +2 more

- 01 Jul 2006 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This paper considers four different sets of allowed distortions in blind audio source separation algorithms, from time-invariant gains to time-varying filters, and derives a global performance measure using an energy ratio, plus a separate performance measure for each error term.

...read moreread less

Proceedings ArticleDOI

The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines

Jon Barker, +3 more

TL;DR: The design and outcomes of the 3rd CHiME Challenge, which targets the performance of automatic speech recognition in a real-world, commercially-motivated scenario: a person talking to a tablet device that has been fitted with a six-channel microphone array, are presented.

...read moreread less

Proceedings ArticleDOI

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks

Hakan Erdogan, +3 more

TL;DR: A phase-sensitive objective function based on the signal-to-noise ratio (SNR) of the reconstructed signal is developed, and it is shown that in experiments it yields uniformly better results in terms of signal- to-distortion ratio (SDR).

...read moreread less

Book ChapterDOI

On Ideal Binary Mask As the Computational Goal of Auditory Scene Analysis

DeLiang Wang

TL;DR: This chapter is an attempt at a computational-theory analysis of auditory scene analysis, where the main task is to understand the character of the CASA problem.

...read moreread less

Journal ArticleDOI

Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics

Justin Salamon, +1 more

- 01 Aug 2012 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A comparative evaluation of the proposed approach shows that it outperforms current state-of-the-art melody extraction systems in terms of overall accuracy.

...read moreread less

Collapse

Related Papers (5)

Performance measurement in blind audio source separation

Emmanuel Vincent, +2 more

- 01 Jul 2006 -

IEEE Transactions on Audio, Speech, and ...

The 2018 Signal Separation Evaluation Campaign

Citations

SDR – Half-baked or Well Done?

A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge

Music Source Separation in the Waveform Domain

Open-Unmix - A Reference Implementation for Music Source Separation

References

Performance measurement in blind audio source separation

The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks

On Ideal Binary Mask As the Computational Goal of Auditory Scene Analysis

Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics

Related Papers (5)

Performance measurement in blind audio source separation

Singing voice separation with deep u-net convolutional networks

Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation

Improving music source separation based on deep neural networks through data augmentation and network blending

Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation