Q1. What are the contributions in "Boffin tts: few-shot speaker adaptation by bayesian optimization" ?
The authors present BOFFIN TTS ( Bayesian Optimization For FInetuning Neural Text To Speech ), a novel approach for few-shot speaker adaptation. The authors demonstrate that there does not exist a one-size-fits-all adaptation strategy, with convincing synthesis requiring a corpus-specific configuration of the hyperparameters that control fine-tuning. By using Bayesian optimization to efficiently optimize these hyper-parameter values for a target speaker, the authors are able to perform adaptation with an average 30 % improvement in speaker similarity over standard techniques.