Open Access · Posted Content

Analyzing and Interpreting Neural Networks for NLP: A Report on the First BlackboxNLP Workshop

TLDR
A number of representative studies in each category are reviewed, including systematic manipulation of the input to neural networks and investigation of its impact on their performance, and tests of whether interpretable knowledge can be decoded from the intermediate representations they acquire.
Abstract
The EMNLP 2018 workshop BlackboxNLP was dedicated to resources and techniques specifically developed for analyzing and understanding the inner workings and representations acquired by neural models of language. Approaches included: systematically manipulating the input to neural networks and investigating the impact on their performance, testing whether interpretable knowledge can be decoded from intermediate representations acquired by neural networks, proposing modifications to neural network architectures to make their knowledge state or generated output more explainable, and examining the performance of networks on simplified or formal languages. Here we review a number of representative studies in each category.
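As a concrete illustration of the second approach (testing whether interpretable knowledge can be decoded from intermediate representations), here is a minimal probing-classifier sketch; the hidden states and labels below are random stand-ins for the activations of any pretrained encoder, not data from the reviewed studies:

```python
# Minimal sketch of the "probing / diagnostic classifier" idea: test
# whether a hypothetical linguistic property (here, a label per token)
# can be decoded from intermediate representations. The hidden states
# are random stand-ins for a real encoder's activations.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
hidden_states = rng.normal(size=(1000, 128))   # stand-in encoder activations
labels = rng.integers(0, 5, size=1000)         # stand-in linguistic labels

X_train, X_test, y_train, y_test = train_test_split(
    hidden_states, labels, test_size=0.2, random_state=0)

probe = LogisticRegression(max_iter=1000).fit(X_train, y_train)
# Accuracy clearly above chance would suggest the property is linearly
# decodable from the representations.
print("probe accuracy:", probe.score(X_test, y_test))
```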

Citations

Density-based clustering based on hierarchical density estimates

TL;DR: In this article, the authors propose a density-based clustering method based on hierarchical density estimates, which provides a clustering hierarchy from which a simplified tree of significant clusters can be constructed, and demonstrate that their approach outperforms current state-of-the-art density-based clustering methods.
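A minimal usage sketch, assuming scikit-learn 1.3 or later, which ships an HDBSCAN implementation derived from this method:

```python
# Hierarchical density-based clustering on toy data. Points the
# algorithm leaves unassigned are labeled -1 (noise).
import numpy as np
from sklearn.cluster import HDBSCAN
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.6, random_state=0)
clusterer = HDBSCAN(min_cluster_size=10)
labels = clusterer.fit_predict(X)
print("clusters found:", len(set(labels) - {-1}))
```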
Posted Content

Neural Machine Translation: A Review

TL;DR: This work traces back the origins of modern NMT architectures to word and sentence embeddings and earlier examples of the encoder-decoder network family and concludes with a survey of recent trends in the field.
Journal Article

Distributional Semantics and Linguistic Theory

TL;DR: This review provides a critical discussion of the literature on distributional semantics, with an emphasis on methods and results that are of relevance for theoretical linguistics, in three areas: semantic change, polysemy and composition, and the grammar-semantics interface.
Posted Content

Evaluating Recurrent Neural Network Explanations

TL;DR: Several methods have been proposed to explain the predictions of recurrent neural networks (RNNs), in particular LSTMs, by assigning to each input variable, e.g., a word, a relevance score indicating the extent to which it contributed to a particular prediction; this work evaluates such explanation methods.
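As an illustration of the kind of word-level relevance scores such papers evaluate, here is a sketch of a simple occlusion baseline; model_score is a hypothetical stand-in for a real RNN classifier's class probability:

```python
# Occlusion-based relevance: each word's relevance is the drop in the
# model's score when that word is removed from the input.
def model_score(tokens):
    # Toy stand-in: fraction of sentiment-bearing words. A real model
    # would be an RNN/LSTM returning a class probability.
    return sum(t in {"great", "good"} for t in tokens) / max(len(tokens), 1)

def occlusion_relevance(tokens):
    base = model_score(tokens)
    return {
        t: base - model_score(tokens[:i] + tokens[i + 1:])
        for i, t in enumerate(tokens)
    }

print(occlusion_relevance("the movie was great".split()))
```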
Proceedings ArticleDOI

Pareto Probing: Trading Off Accuracy for Complexity

TL;DR: This work argues for the Pareto hypervolume, a probe metric that reflects the fundamental trade-off between probe complexity and performance, and presents a number of parametric and non-parametric metrics for measuring complexity.
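A sketch of how a 2D Pareto hypervolume over (complexity, accuracy) points might be computed, under the assumption that lower complexity and higher accuracy are preferred; the probe points and reference point are illustrative, not taken from the paper:

```python
# Each probe is a point (complexity, accuracy); the hypervolume is the
# area dominated by the Pareto front, relative to a reference point.
def pareto_front(points):
    # Keep points not dominated by any other point (one that is at
    # least as simple and at least as accurate, and different).
    return [p for p in points
            if not any(q[0] <= p[0] and q[1] >= p[1] and q != p
                       for q in points)]

def hypervolume_2d(points, ref=(1.0, 0.0)):
    # ref = (worst complexity, worst accuracy). Sweep vertical strips
    # left to right along the complexity axis and sum their areas.
    front = sorted(pareto_front(points))            # ascending complexity
    next_cs = [p[0] for p in front[1:]] + [ref[0]]
    return sum((nc - c) * (a - ref[1])
               for (c, a), nc in zip(front, next_cs))

probes = [(0.2, 0.70), (0.5, 0.85), (0.9, 0.88), (0.6, 0.60)]
print(hypervolume_2d(probes))   # larger is better
```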
References
More filters
Journal Article

Long short-term memory

TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete time steps by enforcing constant error flow through constant error carousels within special units.
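A minimal NumPy sketch of a single LSTM step, showing the gated, additive cell-state update behind the "constant error carousel" (the weights are random stand-ins, not a trained model):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W, U, b):
    # W maps the input, U maps the previous hidden state; the result is
    # split into input, forget, output, and candidate gates.
    z = W @ x + U @ h + b
    i, f, o, g = np.split(z, 4)
    i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
    c_new = f * c + i * g          # additive "carousel" update
    h_new = o * np.tanh(c_new)
    return h_new, c_new

n, d = 8, 4                        # hidden size, input size
rng = np.random.default_rng(0)
W = rng.normal(size=(4 * n, d))
U = rng.normal(size=(4 * n, n))
b = np.zeros(4 * n)
h = c = np.zeros(n)
for x in rng.normal(size=(5, d)):  # run five time steps
    h, c = lstm_step(x, h, c, W, U, b)
print(h.shape, c.shape)
```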
Proceedings Article

Attention is All you Need

TL;DR: This paper proposes a simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely; it achieves state-of-the-art performance on English-to-French translation.
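The core of the architecture is scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V; a minimal single-head NumPy sketch with random stand-in inputs:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # query-key similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # row-wise softmax
    return weights @ V                                # weighted sum of values

rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(6, 16)) for _ in range(3))
print(scaled_dot_product_attention(Q, K, V).shape)    # (6, 16)
```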
Journal Article

Finding Structure in Time

TL;DR: This work develops a proposal, first described by Jordan (1986), that uses recurrent links to provide networks with a dynamic memory, and suggests a method for representing lexical categories and the type/token distinction.
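A minimal sketch of the simple recurrent ("Elman") step this line of work describes, with the previous hidden state fed back as context at each time step (the weights are random stand-ins):

```python
import numpy as np

def elman_step(x, h_prev, W_xh, W_hh, b):
    # The new hidden state mixes the current input with the previous
    # state, giving the network a dynamic memory of the sequence.
    return np.tanh(W_xh @ x + W_hh @ h_prev + b)

rng = np.random.default_rng(0)
d, n = 4, 8                          # input size, hidden size
W_xh = rng.normal(size=(n, d))
W_hh = rng.normal(size=(n, n))
b = np.zeros(n)
h = np.zeros(n)
for x in rng.normal(size=(5, d)):    # a five-step input sequence
    h = elman_step(x, h, W_xh, W_hh, b)
print(h.shape)
```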
Proceedings Article

Categorical Reparameterization with Gumbel-Softmax

TL;DR: This paper introduces Gumbel-Softmax, which replaces non-differentiable samples from a categorical distribution with differentiable samples from a novel Gumbel-Softmax distribution; this distribution has the essential property that it can be smoothly annealed into the categorical distribution.
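A short sketch of Gumbel-Softmax sampling: add Gumbel(0, 1) noise to the logits and apply a temperature-controlled softmax; as the temperature tau approaches 0, samples approach one-hot categorical draws:

```python
import numpy as np

rng = np.random.default_rng(0)

def gumbel_softmax_sample(logits, tau=0.5):
    u = rng.uniform(1e-9, 1.0, size=logits.shape)
    gumbel = -np.log(-np.log(u))        # Gumbel(0, 1) noise
    y = (logits + gumbel) / tau
    y = np.exp(y - y.max())
    return y / y.sum()                  # softmax over the relaxed sample

logits = np.log(np.array([0.1, 0.3, 0.6]))
print(gumbel_softmax_sample(logits, tau=0.1))   # near one-hot
print(gumbel_softmax_sample(logits, tau=5.0))   # near uniform
```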
Journal Article

On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation

TL;DR: This work proposes a general solution to the problem of understanding classification decisions by pixel-wise decomposition of nonlinear classifiers, introducing a methodology for visualizing the contributions of single pixels to predictions, both for kernel-based classifiers over bag-of-words features and for multilayered neural networks.
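A sketch of the epsilon variant of layer-wise relevance propagation through a single dense layer, redistributing the output's relevance to the inputs in proportion to their contributions (weights and input are random stand-ins for a trained classifier):

```python
import numpy as np

def lrp_dense(x, W, b, relevance_out, eps=1e-6):
    # Contribution of input i to output j is z_ij = x_i * w_ij; each
    # output's relevance is split among inputs in proportion to z_ij.
    z = x[:, None] * W                        # contributions z_ij
    denom = z.sum(axis=0) + b                 # pre-activations z_j
    denom = denom + eps * np.sign(denom)      # epsilon stabilizer
    return (z * (relevance_out / denom)).sum(axis=1)

rng = np.random.default_rng(0)
x = rng.normal(size=5)
W, b = rng.normal(size=(5, 3)), np.zeros(3)
out = np.maximum(x @ W + b, 0)                # forward pass (ReLU layer)
relevance_in = lrp_dense(x, W, b, out)        # propagate relevance back
print(relevance_in)                           # per-input relevance scores
```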