Esin Durmus

Researcher at Cornell University

Publications - 40

Citations - 1093

Esin Durmus is an academic researcher from Cornell University. The author has contributed to research in the topics Computer science and Argument. The author has an h-index of 11 and has co-authored 24 publications receiving 416 citations. Previous affiliations of Esin Durmus include Stanford University and Columbia University.

Papers

Journal ArticleDOI

Holistic Evaluation of Language Models

Percy Liang, +48 more

- 16 Nov 2022 -

Annals of the New York Academy of Scienc...

TL;DR: Holistic Evaluation of Language Models (HELM) is a popular benchmark for language models, with 30 models evaluated on 16 core scenarios and 7 metrics, exposing important trade-offs.

Proceedings ArticleDOI

FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization

Esin Durmus, +2 more

- 07 May 2020 -

arXiv: Computation and Language

TL;DR: An automatic question answering (QA) based metric for faithfulness, FEQA, is proposed, which leverages recent advances in reading comprehension and has significantly higher correlation with human faithfulness scores, especially on highly abstractive summaries.

Proceedings ArticleDOI

FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization

Esin Durmus, +2 more

TL;DR: The authors propose FEQA, an automatic question answering (QA) based metric for faithfulness that leverages recent advances in reading comprehension. Given question-answer pairs generated from the summary, a QA model extracts answers from the source document; non-matched answers indicate unfaithful information in the summary.

Posted Content

WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization

Faisal Ladhak, +3 more

- 07 Oct 2020 -

arXiv: Computation and Language

TL;DR: A method for direct crosslingual summarization without requiring translation at inference time is proposed by leveraging synthetic data and Neural Machine Translation as a pre-training step, which significantly outperforms the baseline approaches, while being more cost efficient during inference.

Posted Content

On the Opportunities and Risks of Foundation Models.

Rishi Bommasani, +113 more

- 16 Aug 2021 -

arXiv: Learning

TL;DR: The authors provide a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles (e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications.
