Sebastian Gehrmann

Researcher at Google

Publications -  87
Citations -  6000

Sebastian Gehrmann is an academic researcher at Google. The author has contributed to research in the topics Computer science and Language model. The author has an h-index of 26 and has co-authored 63 publications receiving 2233 citations. Previous affiliations of Sebastian Gehrmann include IBM and Bielefeld University.

Papers
Proceedings Article

Bottom-Up Abstractive Summarization

TL;DR: This work explores the use of data-efficient content selectors to over-determine phrases in a source document that should be part of the summary, and shows that this approach improves the ability to compress text, while still generating fluent summaries.
Journal Article

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao, +386 more
- 09 Nov 2022
TL;DR: BLOOM is a decoder-only Transformer language model trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total).
Journal Article

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava, +439 more
- 09 Jun 2022
TL;DR: An evaluation of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters, finds that model performance and calibration both improve with scale but remain poor in absolute terms.
Journal Article

LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks

TL;DR: This work presents LSTMVis, a visual analysis tool for recurrent neural networks that focuses on understanding their hidden state dynamics, and describes the domain, the different stakeholders, and their goals and tasks.