Sebastian Goodman
Researcher at Google
Publications - 13
Citations - 6320
Sebastian Goodman is an academic researcher at Google. He has contributed to research in topics including automatic summarization and computer science, has an h-index of 5, and has co-authored 9 publications receiving 3,259 citations.
Papers
Proceedings Article
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
TL;DR: This work presents two parameter-reduction techniques to lower memory consumption and increase the training speed of BERT, and uses a self-supervised loss that focuses on modeling inter-sentence coherence.
Posted Content
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
TL;DR: The authors propose a self-supervised loss that focuses on modeling inter-sentence coherence, and show that it consistently helps downstream tasks with multi-sentence inputs, achieving state-of-the-art results on the GLUE, RACE, and SQuAD benchmarks.
Proceedings Article
Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning
TL;DR: The Conceptual Captions dataset contains an order of magnitude more images than the MS-COCO dataset and represents a wider variety of both images and image caption styles.
Proceedings Article
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Xi Chen, Xiao Jing Wang, Soravit Changpinyo, AJ Piergiovanni, Piotr Padlewski, Daniel M Salz, Sebastian Goodman, Adam Grycner, Basil Mustafa, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Nan Ding, Keran Rong, Hassan Akbari, Gaurav Mishra, Linting Xue, Ashish V. Thapliyal, James Bradbury, Weicheng Kuo, Mojtaba Seyedhosseini, Chao Jia, Burcu Karagol Ayan, Carlos Riquelme, Andreas Steiner, Anelia Angelova, Xiaohua Zhai, Neil Houlsby, Radu Soricut +28 more
TL;DR: PaLI achieves state-of-the-art in multiple vision and language tasks, while retaining a simple, modular, and scalable design.
Journal Article
Scaling Up Models and Data with t5x and seqio
Adam Roberts, Hyung Won Chung, Anselm Levskaya, Gaurav Mishra, James Bradbury, Daniel Andor, Sharan Narang, Brian Lester, Colin Gaffney, Afroz Mohiuddin, Curtis Hawthorne, Aitor Lewkowycz, Alexandru D. Sălcianu, M. van Zee, Jacob Austin, Sebastian Goodman, Livio Soares, Haitang Hu, Sasha Tsvyashchenko, Aakanksha Chowdhery, Jasmijn Bastings, Jannis Bulian, Xavier Garcia, Jianmo Ni, A. Chen, Kathleen Kenealy, Jonathan H. Clark, Stephan G. Lee, Daniel H Garrette, James P. Lee-Thorp, Colin Raffel, Noam Shazeer, Marvin Ritter, Maarten Bosma, Alexandre Passos, Jeremy Maitin-Shepard, Noah Fiedel, Mark Omernick, Brennan Saeta, Ryan Sepassi, Alexander Spiridonov, Joshua Newlan, Andrea Gesmundo +42 more
TL;DR: Two software libraries are presented: t5x simplifies the process of building and training large language models at scale while maintaining ease of use, and seqio provides a task-based API for simple creation of fast and reproducible training data and evaluation pipelines.