scispace - formally typeset
E

Eric Wallace

Researcher at University of California, Berkeley

Publications -  55
Citations -  5088

Eric Wallace is an academic researcher from University of California, Berkeley. The author has contributed to research in topics: Computer science & Language model. The author has an hindex of 23, co-authored 47 publications receiving 2164 citations. Previous affiliations of Eric Wallace include University of Edinburgh & Allen Institute for Artificial Intelligence.

Papers
More filters
Proceedings ArticleDOI

Universal Adversarial Triggers for Attacking and Analyzing NLP

TL;DR: This article propose a gradient-guided search over tokens which finds short trigger sequences (e.g., one word for classification and four words for language modeling) that successfully trigger the target prediction.
Posted Content

Extracting Training Data from Large Language Models

TL;DR: This paper demonstrates that in such settings, an adversary can perform a training data extraction attack to recover individual training examples by querying the language model, and finds that larger models are more vulnerable than smaller models.
Posted Content

Calibrate Before Use: Improving Few-Shot Performance of Language Models

TL;DR: This work first estimates the model's bias towards each answer by asking for its prediction when given the training prompt and a content-free test input such as "N/A", and then fits calibration parameters that cause the prediction for this input to be uniform across answers.