Audio Enhancement and Synthesis using Generative Adversarial Networks: A Survey
Citations
344 citations
Cites background from "Audio Enhancement and Synthesis usi..."
...1) GANs for specific applications: There are surveys of using GANs for specific applications such as image synthesis and editing [5], audio enhancement and synthesis [6]....
[...]
77 citations
20 citations
Additional excerpts
...The field of synthesizing and enhancing audio using GANs architectures has also been reviewed [37]....
[...]
10 citations
Additional excerpts
...enhancement and synthesis [15], image synthesis [16], and text synthesis [17]....
[...]
10 citations
References
38,211 citations
7,987 citations
"Audio Enhancement and Synthesis usi..." refers background or methods in this paper
...Further work may require combining the best properties of various GAN architectures [5][8][10][15][9] to improve existing structures....
[...]
...Paper [3] has attempted to address the issue by combining cGAN (conditional GAN) [15] and SPSS in a multi-task learning framework....
[...]
4,133 citations
"Audio Enhancement and Synthesis usi..." refers background in this paper
...Further work may require combining the best properties of various GAN architectures [5][8][10][15][9] to improve existing structures....
[...]
...[8] proposed that instead of a weight clipping, a gradient penalty can be used....
[...]
1,413 citations
1,001 citations
"Audio Enhancement and Synthesis usi..." refers background or methods in this paper
...The results show that SEGAN works well as an end-to-end method for speech enhancement....
[...]
...While many speech enhancement methods use spectrograms or SPSS methods, Speech Enhancement GAN (SEGAN) [17] operates on the waveform level....
[...]
...SEGAN can operate on raw audio and learn from different speaker and noise conditions....
[...]
..."SEGAN: Speech enhancement generative adversarial network." arXiv preprint arXiv:1703.09452 (2017)....
[...]
...One key feature is the use of skip connections in which low level details of the signal pass straight through to the decoder [17]....
[...]