scispace - formally typeset
A

Alon Jacovi

Researcher at Bar-Ilan University

Publications -  28
Citations -  1159

Alon Jacovi is an academic researcher from Bar-Ilan University. The author has contributed to research in topics: Computer science & Interpretability. The author has an hindex of 11, co-authored 22 publications receiving 470 citations.

Papers
More filters
Proceedings ArticleDOI

Towards Faithfully Interpretable NLP Systems: How Should We Define and Evaluate Faithfulness?

TL;DR: The current binary definition for faithfulness sets a potentially unrealistic bar for being considered faithful, and is called for discarding the binary notion of faithfulness in favor of a more graded one, which is of greater practical utility.
Proceedings ArticleDOI

Understanding Convolutional Neural Networks for Text Classification

TL;DR: An analysis into the inner workings of Convolutional Neural Networks for processing text shows that filters may capture several different semantic classes of ngrams by using different activation patterns, and that global max-pooling induces behavior which separates important n grams from the rest.
Posted Content

Formalizing Trust in Artificial Intelligence: Prerequisites, Causes and Goals of Human Trust in AI

TL;DR: This work discusses a model of trust inspired by, but not identical to, interpersonal trust as defined by sociologists, and incorporates a formalization of 'contractual trust', such that trust between a user and an AI model is trust that some implicit or explicit contract will hold.
Proceedings ArticleDOI

Formalizing Trust in Artificial Intelligence: Prerequisites, Causes and Goals of Human Trust in AI

TL;DR: In this paper, the authors discuss a model of trust inspired by sociologists' notion of interpersonal trust (i.e., trust between people) and discuss how to design trustworthy AI, how to evaluate whether trust has manifested, and whether it is warranted.
Posted Content

Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals

TL;DR: The inability to infer behavioral conclusions from probing results is pointed out, and an alternative method that focuses on how the information is being used is offered, rather than on what information is encoded is offered.