scispace - formally typeset
D

Daniel M. Ziegler

Researcher at Massachusetts Institute of Technology

Publications -  15
Citations -  13290

Daniel M. Ziegler is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Language model & File system. The author has an hindex of 9, co-authored 13 publications receiving 3582 citations.

Papers
More filters
Proceedings Article

Learning to summarize from human feedback

TL;DR: The authors use reinforcement learning to fine-tune a summarization policy according to human feedback, which results in better summaries than optimizing ROUGE according to humans, and transfer to CNN/DM news articles, producing summaries nearly as good as the human reference.
Posted Content

Fine-Tuning Language Models from Human Preferences.

TL;DR: This paper builds on advances in generative pretraining of language models to apply reward learning to four natural language tasks: continuing text with positive sentiment or physically descriptive language, and summarization tasks on the TL;DR and CNN/Daily Mail datasets.
Proceedings ArticleDOI

Using Crash Hoare logic for certifying the FSCQ file system

TL;DR: The Crash Hoare logic (CHL), which extends traditionalHoare logic with a crash condition, a recovery procedure, and logical address spaces for specifying disk states at different abstraction levels, is introduced, which reduces the proof effort for developers through proof automation.