Open Access · Posted Content

Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective.

TLDR
In this article, the authors design a training setup with several shortcut cues, named WCST-ML, where each cue is equally conducive to the visual recognition problem at hand, and observe that certain cues are preferred to others and that solutions biased toward the easy-to-learn cues tend to converge to relatively flat minima on the loss surface.
Abstract
Deep neural networks (DNNs) often rely on easy-to-learn discriminatory features, or cues, that are not necessarily essential to the problem at hand. For example, ducks in an image may be recognized based on their typical background scenery, such as lakes or streams. This phenomenon, also known as shortcut learning, is emerging as a key limitation of the current generation of machine learning models. In this work, we introduce a set of experiments to deepen our understanding of shortcut learning and its implications. We design a training setup with several shortcut cues, named WCST-ML, where each cue is equally conducive to the visual recognition problem at hand. Even under equal opportunities, we observe that (1) certain cues are preferred to others, (2) solutions biased to the easy-to-learn cues tend to converge to relatively flat minima on the loss surface, and (3) the solutions focusing on those preferred cues are far more abundant in the parameter space. We explain the abundance of certain cues via their Kolmogorov (descriptional) complexity: solutions corresponding to Kolmogorov-simple cues are abundant in the parameter space and are thus preferred by DNNs. Our studies are based on the synthetic dataset DSprites and the face dataset UTKFace. In our WCST-ML, we observe that the inborn bias of models leans toward simple cues, such as color and ethnicity. Our findings emphasize the importance of active human intervention to remove the inborn model biases that may cause negative societal impacts.
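To make the setup concrete, the following minimal sketch illustrates the idea of a WCST-ML-style "diagonal" training split in which every cue is equally predictive of the label, plus an off-diagonal probe split that reveals which cue a trained model actually relies on. This is not the authors' released code; the function names, the two-cue (color/shape) choice, and the integer encoding are illustrative assumptions.

```python
# Sketch of an equal-opportunity cue setup in the spirit of WCST-ML.
# All names and parameters here are illustrative, not from the paper's code.
import numpy as np

rng = np.random.default_rng(0)
K = 3  # number of classes; also the number of values each cue can take

def make_diagonal_split(n):
    """Training-style split: color == shape == label for every sample."""
    y = rng.integers(0, K, size=n)
    color, shape = y.copy(), y.copy()   # every cue agrees with the label
    return np.stack([color, shape], axis=1), y

def make_offdiagonal_split(n):
    """Probe split: cues disagree, exposing which cue a model follows."""
    y = rng.integers(0, K, size=n)              # label follows the color cue
    color = y.copy()
    shape = (y + rng.integers(1, K, size=n)) % K  # shape deliberately mismatched
    return np.stack([color, shape], axis=1), y

X_train, y_train = make_diagonal_split(10_000)
X_probe, y_probe = make_offdiagonal_split(1_000)

# On the diagonal split, either cue alone predicts the label perfectly
# ("equal opportunity"); on the probe split, agreement with the label
# shows whether a model tracked color, shape, or neither.
print((X_train[:, 0] == y_train).mean(), (X_train[:, 1] == y_train).mean())
print((X_probe[:, 0] == y_probe).mean(), (X_probe[:, 1] == y_probe).mean())
```

In the paper's experiments the cues are visual factors such as color, shape, or ethnicity rendered into dSprites or UTKFace images; they are kept as integer codes here purely to show the labeling logic.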


Citations
Posted Content

Learning Fair Classifiers with Partially Annotated Group Labels

TL;DR: The authors propose a Confidence-based Group Label Assignment (CGL) strategy that is readily applicable to any fairness-aware learning method; it uses an auxiliary group classifier to assign pseudo group labels and assigns random labels to low-confidence samples.
References
Journal Article

Input-output maps are strongly biased towards simple outputs.

TL;DR: A practical bound is provided on the probability that a randomly generated computer program produces a given output of a given complexity, and this upper bound is applied to RNA folding and to financial trading algorithms.
Proceedings Article

Random deep neural networks are biased towards simple functions

TL;DR: In this paper, it was shown that binary classifiers of bit strings generated by random wide deep neural networks with ReLU activations are biased towards simple functions, and that the average Hamming distance to the closest input bit string with a different classification is at least sqrt(n / (2π log n)), where n is the length of the string.
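For readability, the bound quoted in this summary can be written out as follows (notation as in the summary above; the expectation symbol is added here only for clarity):

```latex
% Bound from the summary above: for input bit strings of length n, the average
% Hamming distance to the nearest differently-classified bit string satisfies
\[
  \mathbb{E}\!\left[ d_{\mathrm{H}} \right] \;\ge\; \sqrt{\frac{n}{2\pi \log n}} .
\]
```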
Proceedings Article

What shapes feature representations? Exploring datasets, architectures, and training

TL;DR: The authors found that when two features redundantly predict the labels, the model preferentially represents one, and its preference reflects what was most linearly decodable from the untrained model.
Proceedings Article

The Pitfalls of Simplicity Bias in Neural Networks

TL;DR: In this paper, the authors attempt to reconcile simplicity bias and the superior standard generalization of neural networks with the non-robustness observed in practice by designing datasets that incorporate a precise notion of simplicity and comprise multiple predictive features with varying levels of simplicity.
Journal Article

Wisconsin Card Sorting Test scores and clinical and sociodemographic correlates in Schizophrenia: multiple logistic regression analysis

TL;DR: Age, education years, PANSS negative scale score, and duration of illness affected WCST factor scores in patients with schizophrenia, and using WCST factor scores may reduce the possibility of type I errors due to multiple comparisons.