Home
/
Authors
/
Cristina Garbacea

Author

Cristina Garbacea

Other affiliations: University of Amsterdam

Bio: Cristina Garbacea is an academic researcher from University of Michigan. The author has contributed to research in topics: Natural language generation & Text simplification. The author has an hindex of 5, co-authored 12 publications receiving 117 citations. Previous affiliations of Cristina Garbacea include University of Amsterdam.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Low Bit-rate Speech Coding with VQ-VAE and a WaveNet Decoder

[...]

Cristina Garbacea¹, Aaron van den Oord, Yazhe Li, Felicia S. C. Lim², Alejandro Luebs², Oriol Vinyals, Thomas C. Walters - Show less +3 more•Institutions (2)

University of Michigan¹, Google²

12 May 2019

TL;DR: This work demonstrates that a neural network architecture based on VQ-VAE with a WaveNet decoder can be used to perform very low bit-rate speech coding with high reconstruction quality.

...read moreread less

Abstract: In order to efficiently transmit and store speech signals, speech codecs create a minimally redundant representation of the input signal which is then decoded at the receiver with the best possible perceptual quality. In this work we demonstrate that a neural network architecture based on VQ-VAE with a WaveNet decoder can be used to perform very low bit-rate speech coding with high reconstruction quality. A prosody-transparent and speaker-independent model trained on the LibriSpeech corpus coding audio at 1.6 kbps exhibits perceptual quality which is around halfway between the MELP codec at 2.4 kbps and AMR-WB codec at 23.05 kbps. In addition, when training on high-quality recorded speech with the test speaker included in the training set, a model coding speech at 1.6 kbps produces output of similar perceptual quality to that generated by AMR-WB at 23.05 kbps.

...read moreread less

96 citations

Posted Content•

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics.

[...]

Sebastian Gehrmann¹, Tosin P. Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Chinenye Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak, Aman Madaan, Mounica Maddela, Khyati Mahajan, Saad Mahamood, Bodhisattwa Prasad Majumder, Pedro Henrique Martins, Angelina McMillan-Major, Simon Mille, Emiel van Miltenburg, Moin Nadeem, Shashi Narayan, Vitaly Nikolaev, Rubungo Andre Niyongabo, Salomey Osei, Ankur P. Parikh, Laura Perez-Beltrachini, Niranjan Ramesh Rao, Vikas Raunak, Juan Diego Rodriguez, Sashank Santhanam, João Sedoc, Thibault Sellam, Samira Shaikh, Anastasia Shimorina, Marco Antonio Sobrevilla Cabezudo, Hendrik Strobelt, Nishant Subramani, Wei Xu, Diyi Yang, Akhila Yerukola, Jiawei Zhou - Show less +52 more•Institutions (1)

Google¹

02 Feb 2021-arXiv: Computation and Language

TL;DR: GEM as discussed by the authors is a living benchmark for natural language generation (NLG), its Evaluation and Metrics, which provides an environment in which models can easily be applied to a wide set of tasks and in which evaluation strategies can be tested.

...read moreread less

Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it challenging to identify the limitations of current models and opportunities for progress. Addressing this limitation, GEM provides an environment in which models can easily be applied to a wide set of tasks and in which evaluation strategies can be tested. Regular updates to the benchmark will help NLG research become more multilingual and evolve the challenge alongside models. This paper serves as the description of the data for which we are organizing a shared task at our ACL 2021 Workshop and to which we invite the entire NLG community to participate.

...read moreread less

44 citations

Proceedings Article•DOI•

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

[...]

Sebastian Gehrmann¹, Tosin P. Adewumi², Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Anuoluwapo Aremu, Antoine Bosselut³, Khyathi Raghavi Chandu⁴, Miruna-Adriana Clinciu⁵, Dipanjan Das¹, Kaustubh Dhole⁶, Wanyu Du⁷, Esin Durmus⁸, Ondřej Dušek⁹, Chris Chinenye Emezue¹⁰, Varun Gangal⁴, Cristina Garbacea¹¹, Tatsunori Hashimoto¹², Yufang Hou¹³, Yacine Jernite¹⁴, Harsh Jhamtani⁴, Yangfeng Ji⁷, Shailza Jolly¹⁵, Mihir Kale¹, Dhruv Kumar¹⁶, Faisal Ladhak⁸, Aman Madaan⁴, Mounica Maddela, Khyati Mahajan¹⁷, Saad Mahamood¹⁸, Bodhisattwa Prasad Majumder¹⁹, Pedro Henrique Martins²⁰, Angelina McMillan-Major, Simon Mille²¹, Emiel van Miltenburg, Moin Nadeem, Shashi Narayan¹, Vitaly Nikolaev¹, Andre Niyongabo Rubungo, Salomey Osei²², Ankur P. Parikh¹, Laura Perez-Beltrachini²³, Niranjan Ramesh Rao, Vikas Raunak⁴, Juan Diego Rodriguez²⁴, Sashank Santhanam¹⁷, João Sedoc¹⁴, Thibault Sellam¹, Samira Shaikh¹⁷, Anastasia Shimorina²⁵, Marco Antonio Sobrevilla Cabezudo²⁶, Hendrik Strobelt¹³, Nishant Subramani²⁷, Wei Xu²⁸, Diyi Yang²⁹, Akhila Yerukola¹², Jiawei Zhou - Show less +52 more•Institutions (29)

Google¹, Luleå University of Technology², Allen Institute for Artificial Intelligence³, Carnegie Mellon University⁴, Heriot-Watt University⁵, Tata Institute of Fundamental Research⁶, University of Virginia⁷, Columbia University⁸, Charles University in Prague⁹, Technische Universität München¹⁰, University of Michigan¹¹, Stanford University¹², IBM¹³, New York University¹⁴, Kaiserslautern University of Technology¹⁵, University of Waterloo¹⁶, University of North Carolina at Charlotte¹⁷, University of Aberdeen¹⁸, University of California, San Diego¹⁹, University of Lisbon²⁰, Pompeu Fabra University²¹, African Institute for Mathematical Sciences²², University of Edinburgh²³, University of Texas at Austin²⁴, Centre national de la recherche scientifique²⁵, University of São Paulo²⁶, Intel²⁷, Georgia Institute of Technology²⁸, Association for Computing Machinery²⁹

02 Feb 2021

...read moreread less

Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it challenging to identify the limitations of current models and opportunities for progress. Addressing this limitation, GEM provides an environment in which models can easily be applied to a wide set of tasks and in which evaluation strategies can be tested. Regular updates to the benchmark will help NLG research become more multilingual and evolve the challenge alongside models. This paper serves as the description of the data for the 2021 shared task at the associated GEM Workshop.

...read moreread less

26 citations

Posted Content•

Neural Language Generation: Formulation, Methods, and Evaluation.

[...]

Cristina Garbacea, Qiaozhu Mei¹•Institutions (1)

University of Michigan¹

31 Jul 2020-arXiv: Computation and Language

TL;DR: There is no standard way to assess the quality of text produced by these generative models, which constitutes a serious bottleneck towards the progress of the field, so this survey will provide an informative overview of formulations, methods, and assessments of neural natural language generation.

...read moreread less

Abstract: Recent advances in neural network-based generative modeling have reignited the hopes in having computer systems capable of seamlessly conversing with humans and able to understand natural language. Neural architectures have been employed to generate text excerpts to various degrees of success, in a multitude of contexts and tasks that fulfil various user needs. Notably, high capacity deep learning models trained on large scale datasets demonstrate unparalleled abilities to learn patterns in the data even in the lack of explicit supervision signals, opening up a plethora of new possibilities regarding producing realistic and coherent texts. While the field of natural language generation is evolving rapidly, there are still many open challenges to address. In this survey we formally define and categorize the problem of natural language generation. We review particular application tasks that are instantiations of these general formulations, in which generating natural language is of practical importance. Next we include a comprehensive outline of methods and neural architectures employed for generating diverse texts. Nevertheless, there is no standard way to assess the quality of text produced by these generative models, which constitutes a serious bottleneck towards the progress of the field. To this end, we also review current approaches to evaluating natural language generation systems. We hope this survey will provide an informative overview of formulations, methods, and assessments of neural natural language generation.

...read moreread less

22 citations

Proceedings Article•DOI•

Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation

[...]

Cristina Garbacea¹, Samuel Carton¹, Shiyan Yan¹, Qiaozhu Mei¹•Institutions (1)

University of Michigan¹

01 Jan 2019

TL;DR: A large-scale, systematic study to evaluate the existing evaluation methods for natural language generation in the context of generating online product reviews finds lexical diversity an intriguing metric that is indicative of the assessments of different evaluators.

...read moreread less

Abstract: We conduct a large-scale, systematic study to evaluate the existing evaluation methods for natural language generation in the context of generating online product reviews. We compare human-based evaluators with a variety of automated evaluation procedures, including discriminative evaluators that measure how well machine-generated text can be distinguished from human-written text, as well as word overlap metrics that assess how similar the generated text compares to human-written references. We determine to what extent these different evaluators agree on the ranking of a dozen of state-of-the-art generators for online product reviews. We find that human evaluators do not correlate well with discriminative evaluators, leaving a bigger question of whether adversarial accuracy is the correct objective for natural language generation. In general, distinguishing machine-generated text is challenging even for human evaluators, and human decisions correlate better with lexical overlaps. We find lexical diversity an intriguing metric that is indicative of the assessments of different evaluators. A post-experiment survey of participants provides insights into how to evaluate and improve the quality of natural language generation systems.

...read moreread less

19 citations

Cited by

PDF

Open Access

More filters

Journal Article•

“Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告

[...]

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

Posted Content•

A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications

[...]

Jie Gui¹, Zhenan Sun, Yonggang Wen², Dacheng Tao³, Jieping Ye⁴ - Show less +1 more•Institutions (4)

Southeast University¹, Nanyang Technological University², University of Sydney³, University of Michigan⁴

20 Jan 2020-arXiv: Learning

TL;DR: This paper attempts to provide a review on various GANs methods from the perspectives of algorithms, theory, and applications, and compares the commonalities and differences of these GAns methods.

...read moreread less

Abstract: Generative adversarial networks (GANs) are a hot research topic recently. GANs have been widely studied since 2014, and a large number of algorithms have been proposed. However, there is few comprehensive study explaining the connections among different GANs variants, and how they have evolved. In this paper, we attempt to provide a review on various GANs methods from the perspectives of algorithms, theory, and applications. Firstly, the motivations, mathematical representations, and structure of most GANs algorithms are introduced in details. Furthermore, GANs have been combined with other machine learning algorithms for specific applications, such as semi-supervised learning, transfer learning, and reinforcement learning. This paper compares the commonalities and differences of these GANs methods. Secondly, theoretical issues related to GANs are investigated. Thirdly, typical applications of GANs in image processing and computer vision, natural language processing, music, speech and audio, medical field, and data science are illustrated. Finally, the future open research problems for GANs are pointed out.

...read moreread less

344 citations

Posted Content•

Evaluation of Text Generation: A Survey

[...]

Asli Celikyilmaz¹, Elizabeth Clark², Jianfeng Gao³•Institutions (3)

Facebook¹, University of Washington², Microsoft³

26 Jun 2020-arXiv: Computation and Language

TL;DR: This paper surveys evaluation methods of natural language generation (NLG) systems that have been developed in the last few years, with a focus on the evaluation of recently proposed NLG tasks and neural NLG models.

...read moreread less

Abstract: The paper surveys evaluation methods of natural language generation (NLG) systems that have been developed in the last few years We group NLG evaluation methods into three categories: (1) human-centric evaluation metrics, (2) automatic metrics that require no training, and (3) machine-learned metrics For each category, we discuss the progress that has been made and the challenges still being faced, with a focus on the evaluation of recently proposed NLG tasks and neural NLG models We then present two examples for task-specific NLG evaluations for automatic text summarization and long text generation, and conclude the paper by proposing future research directions

...read moreread less

186 citations

Posted Content•

A Survey of Knowledge-Enhanced Text Generation.

[...]

Wenhao Yu¹, Chenguang Zhu², Zaitang Li³, Zhiting Hu, Qingyun Wang⁴, Heng Ji⁴, Meng Jiang¹ - Show less +3 more•Institutions (4)

University of Notre Dame¹, Microsoft², The Chinese University of Hong Kong³, University of Illinois at Urbana–Champaign⁴

09 Oct 2020-arXiv: Computation and Language

TL;DR: A comprehensive review of the research on knowledge-enhanced text generation over the past five years is presented, which includes two parts: (i) general methods and architectures for integrating knowledge into text generation; (ii) specific techniques and applications according to different forms of knowledge data.

...read moreread less

Abstract: The goal of text generation is to make machines express in human language. It is one of the most important yet challenging tasks in natural language processing (NLP). Since 2014, various neural encoder-decoder models pioneered by Seq2Seq have been proposed to achieve the goal by learning to map input text to output text. However, the input text alone often provides limited knowledge to generate the desired output, so the performance of text generation is still far from satisfaction in many real-world scenarios. To address this issue, researchers have considered incorporating various forms of knowledge beyond the input text into the generation models. This research direction is known as knowledge-enhanced text generation. In this survey, we present a comprehensive review of the research on knowledge enhanced text generation over the past five years. The main content includes two parts: (i) general methods and architectures for integrating knowledge into text generation; (ii) specific techniques and applications according to different forms of knowledge data. This survey can have broad audiences, researchers and practitioners, in academia and industry.

...read moreread less

115 citations

Posted Content•

A Survey on Generative Adversarial Networks: Variants, Applications, and Training

[...]

Abdul Jabbar¹, Xi Li¹, Bourahla Omar¹•Institutions (1)

Zhejiang University¹

09 Jun 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work surveys several training solutions proposed by different researchers to stabilize GAN training, and surveys the original GAN model and its modified classical versions, and detail analysis of various GAN applications in different domains.

...read moreread less

Abstract: The Generative Models have gained considerable attention in the field of unsupervised learning via a new and practical framework called Generative Adversarial Networks (GAN) due to its outstanding data generation capability. Many models of GAN have proposed, and several practical applications emerged in various domains of computer vision and machine learning. Despite GAN's excellent success, there are still obstacles to stable training. The problems are due to Nash-equilibrium, internal covariate shift, mode collapse, vanishing gradient, and lack of proper evaluation metrics. Therefore, stable training is a crucial issue in different applications for the success of GAN. Herein, we survey several training solutions proposed by different researchers to stabilize GAN training. We survey, (I) the original GAN model and its modified classical versions, (II) detail analysis of various GAN applications in different domains, (III) detail study about the various GAN training obstacles as well as training solutions. Finally, we discuss several new issues as well as research outlines to the topic.

...read moreread less

104 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43

Collapse