Clemens Winter
Publications - 7
Citations - 12,096
Clemens Winter is an academic researcher whose work centers on language models and computer science. The author has an h-index of 3 and has co-authored 4 publications receiving 3,078 citations.
Papers
Proceedings Article
Language Models are Few-Shot Learners
Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Thomas Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Samuel McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei
TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.
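To make the few-shot setting concrete: the model is conditioned on a handful of solved examples in its context window and asked to complete the next one, with no gradient updates. A minimal sketch of such a prompt for the 3-digit arithmetic task follows; the prompt format and the query_model call are illustrative assumptions, not the paper's exact evaluation harness.

    # Sketch of a few-shot prompt for 3-digit addition; the format is illustrative.
    def build_few_shot_prompt(examples, query):
        """Concatenate solved examples, then the unsolved query, into one prompt."""
        lines = [f"Q: What is {a} plus {b}? A: {a + b}" for (a, b) in examples]
        a, b = query
        lines.append(f"Q: What is {a} plus {b}? A:")
        return "\n".join(lines)

    prompt = build_few_shot_prompt([(123, 456), (701, 88)], (314, 159))
    # completion = query_model(prompt)  # hypothetical API; a capable model completes "473"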
Posted Content
Language Models are Few-Shot Learners
Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Thomas Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Samuel McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei
TL;DR: This article showed that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches.
Posted Content
Evaluating Large Language Models Trained on Code
Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harrison Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth A. Barnes, Ariel Herbert-Voss, William H. Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Joshua Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew M. Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob McGrew, Dario Amodei, Samuel McCandlish, Ilya Sutskever, Wojciech Zaremba
TL;DR: Introduces Codex, a GPT language model fine-tuned on publicly available code from GitHub, and studies its Python code-writing capabilities, showing that repeated sampling from the model is a surprisingly effective strategy for producing working solutions to difficult prompts.
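The repeated-sampling claim is quantified with the pass@k metric: generate n samples per problem, count the c that pass the unit tests, and estimate the probability that at least one of k samples would pass. Below is a sketch of the numerically stable, unbiased estimator the paper describes; the example numbers are invented for illustration.

    import numpy as np

    def pass_at_k(n: int, c: int, k: int) -> float:
        """Unbiased estimate of pass@k given n samples, of which c are correct."""
        if n - c < k:
            return 1.0  # every size-k subset must contain a correct sample
        return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

    print(pass_at_k(200, 30, 1))    # 0.15: for k=1 this matches the naive c/n rate
    print(pass_at_k(200, 30, 100))  # near 1.0: sampling widely pays off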
Journal Article
A smile is all you need: predicting limiting activity coefficients from SMILES with natural language processing
TL;DR: Presents a natural language processing approach that predicts limiting activity coefficients of mixtures directly from their SMILES representations.
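The title signals the core idea: molecules are written as SMILES strings, which a sequence model can consume like any other text. A hypothetical sketch of the kind of input encoding involved follows; the tokenization scheme and special tokens are assumptions for illustration, not the paper's.

    def tokenize_mixture(solute_smiles: str, solvent_smiles: str) -> list[str]:
        """Character-level tokenization of a solute/solvent pair into one sequence."""
        return ["<bos>", *solute_smiles, "<sep>", *solvent_smiles, "<eos>"]

    tokens = tokenize_mixture("CCO", "O")  # ethanol in water
    # ['<bos>', 'C', 'C', 'O', '<sep>', 'O', '<eos>']
    # A sequence model would map such a token sequence to a single scalar:
    # the predicted limiting activity coefficient (typically as ln gamma).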
Posted Content
A Generalizable Approach to Learning Optimizers
TL;DR: The authors proposed a generalization-first approach to learn to update optimizer hyperparameters instead of model parameters directly using novel features, actions, and a reward function, which achieved 2x speedups on ImageNet and a 2.5x speedup on a language modeling task using over 5 orders of magnitude more compute than the training tasks.
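In other words, the learned component acts as a controller sitting above a conventional optimizer, periodically rescaling its hyperparameters instead of emitting raw parameter updates. A rough sketch of that control loop follows; the feature and action names and the random stand-in policy are all illustrative assumptions, not the paper's definitions.

    import math, random

    def controller_step(hparams: dict, features: dict, policy) -> dict:
        """Apply a learned policy's log-scale multipliers to optimizer hyperparameters."""
        actions = policy(features)  # e.g. per-hyperparameter log multipliers
        for name, delta in actions.items():
            hparams[name] *= math.exp(delta)
        return hparams

    # Toy usage: a random policy stands in for the trained controller.
    policy = lambda feats: {"lr": random.uniform(-0.1, 0.1),
                            "weight_decay": random.uniform(-0.1, 0.1)}
    hparams = {"lr": 3e-4, "weight_decay": 0.01}
    hparams = controller_step(hparams, features={}, policy=policy)
    print(hparams)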