Showing papers by "Thomas G. Dietterich published in 2019"


Posted Content
TL;DR: In this paper, the authors established rigorous benchmarks for image classifier robustness and proposed ImageNet-C, a robustness benchmark that evaluates performance on common corruptions and perturbations rather than worst-case adversarial perturbations.
Abstract: In this paper we establish rigorous benchmarks for image classifier robustness. Our first benchmark, ImageNet-C, standardizes and expands the corruption robustness topic, while showing which classifiers are preferable in safety-critical applications. Then we propose a new dataset called ImageNet-P which enables researchers to benchmark a classifier's robustness to common perturbations. Unlike recent robustness research, this benchmark evaluates performance on common corruptions and perturbations, not worst-case adversarial perturbations. We find that there are negligible changes in relative corruption robustness from AlexNet classifiers to ResNet classifiers. Afterward we discover ways to enhance corruption and perturbation robustness. We even find that a bypassed adversarial defense provides substantial common perturbation robustness. Together our benchmarks may aid future work toward networks that robustly generalize.
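For context, the paper's headline ImageNet-C metric is the mean Corruption Error (mCE): a classifier's top-1 error is aggregated over five severity levels for each corruption type, normalized by AlexNet's error on the same corruption, and averaged across corruption types. Below is a minimal sketch of that computation; all error values are made-up placeholders, not numbers from the paper.

```python
import numpy as np

def corruption_error(model_errors, alexnet_errors):
    """Corruption Error (CE) for one corruption type: top-1 error summed
    over the 5 severity levels, normalized by AlexNet's summed error."""
    return sum(model_errors) / sum(alexnet_errors)

def mean_corruption_error(model, baseline):
    """mCE: average CE over all corruption types (keys of the dicts)."""
    return np.mean([corruption_error(model[c], baseline[c]) for c in model])

# Illustrative numbers only: per-severity top-1 errors for two corruptions.
alexnet = {"gaussian_noise": [0.70, 0.80, 0.88, 0.92, 0.95],
           "motion_blur":    [0.60, 0.70, 0.80, 0.88, 0.92]}
resnet  = {"gaussian_noise": [0.40, 0.55, 0.70, 0.82, 0.90],
           "motion_blur":    [0.35, 0.48, 0.62, 0.75, 0.85]}

print(f"mCE = {mean_corruption_error(resnet, alexnet):.3f}")  # < 1 beats AlexNet
```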

1,134 citations


Proceedings Article
28 Mar 2019
TL;DR: This paper standardizes and expands the corruption robustness topic, while showing which classifiers are preferable in safety-critical applications, and proposes a new dataset called ImageNet-P which enables researchers to benchmark a classifier's robustness to common perturbations.
Abstract: In this paper we establish rigorous benchmarks for image classifier robustness. Our first benchmark, ImageNet-C, standardizes and expands the corruption robustness topic, while showing which classifiers are preferable in safety-critical applications. Then we propose a new dataset called ImageNet-P which enables researchers to benchmark a classifier's robustness to common perturbations. Unlike recent robustness research, this benchmark evaluates performance on common corruptions and perturbations, not worst-case adversarial perturbations. We find that there are negligible changes in relative corruption robustness from AlexNet classifiers to ResNet classifiers. Afterward we discover ways to enhance corruption and perturbation robustness. We even find that a bypassed adversarial defense provides substantial common perturbation robustness. Together our benchmarks may aid future work toward networks that robustly generalize.
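ImageNet-P, by contrast, scores prediction stability rather than accuracy: for each perturbation sequence, one measures how often the top-1 prediction flips between consecutive frames (the paper then normalizes these flip probabilities by AlexNet's to obtain flip rates). A hedged sketch of the per-sequence flip probability, with stand-in predictions:

```python
import numpy as np

def flip_probability(predictions):
    """Fraction of consecutive frames in one perturbation sequence
    on which the top-1 prediction changes."""
    preds = np.asarray(predictions)
    return np.mean(preds[1:] != preds[:-1])

# Stand-in data: top-1 class indices over a 6-frame perturbation sequence.
sequence_preds = [3, 3, 3, 7, 7, 3]
print(flip_probability(sequence_preds))  # 2 flips / 5 transitions = 0.4
```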

736 citations


Journal ArticleDOI
TL;DR: Computer and information scientists join forces with other fields to help solve societal and environmental challenges facing humanity, in pursuit of a sustainable future.
Abstract: Computer and information scientists join forces with other fields to help solve societal and environmental challenges facing humanity, in pursuit of a sustainable future.

58 citations


Journal ArticleDOI
TL;DR: In this paper, the authors study the problem of computing and evaluating sequential feature explanations (SFEs) for anomaly detectors and present both greedy algorithms and an optimal algorithm, based on branch-and-bound search, for optimizing SFEs.
Abstract: In many applications, an anomaly detection system presents the most anomalous data instance to a human analyst, who then must determine whether the instance is truly of interest (e.g., a threat in a security setting). Unfortunately, most anomaly detectors provide no explanation about why an instance was considered anomalous, leaving the analyst with no guidance about where to begin the investigation. To address this issue, we study the problems of computing and evaluating sequential feature explanations (SFEs) for anomaly detectors. An SFE of an anomaly is a sequence of features, which are presented to the analyst one at a time (in order) until the information contained in the highlighted features is enough for the analyst to make a confident judgement about the anomaly. Since analyst effort is related to the amount of information that they consider in an investigation, an explanation’s quality is related to the number of features that must be revealed to attain confidence. In this article, we first formulate the problem of optimizing SFEs for a particular density-based anomaly detector. We then present both greedy algorithms and an optimal algorithm, based on branch-and-bound search, for optimizing SFEs. Finally, we provide a large scale quantitative evaluation of these algorithms using a novel framework for evaluating explanations. The results show that our algorithms are quite effective and that our best greedy algorithm is competitive with optimal solutions.
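As a concrete illustration of the greedy idea, consider a deliberately simplified setting in which the detector models each feature as an independent Gaussian; this is a sketch of the greedy principle only, not the paper's algorithm, which is formulated for a particular density-based detector. Because log-densities add across independent features, greedily revealing the feature that most lowers the instance's log-density reduces to sorting features by per-feature negative log-likelihood:

```python
import numpy as np

def greedy_sfe(x, mu, sigma):
    """Greedy sequential feature explanation under an illustrative
    independent-Gaussian density model.

    At each step, reveal the not-yet-shown feature that most lowers the
    log-density of x restricted to the revealed features, i.e. the
    feature with the largest per-feature negative log-likelihood."""
    nll = 0.5 * ((x - mu) / sigma) ** 2 + np.log(sigma) + 0.5 * np.log(2 * np.pi)
    # With independent features, greedy selection is exactly a sort by NLL.
    return list(np.argsort(-nll))

# Stand-in numbers: feature 2 is the most anomalous, so it is shown first.
x = np.array([0.1, -0.2, 5.0, 1.1])
mu = np.zeros(4)
sigma = np.ones(4)
print(greedy_sfe(x, mu, sigma))  # [2, 3, 1, 0]
```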

41 citations


Journal ArticleDOI
TL;DR: This short note reviews the properties of high-reliability organizations and draws implications for the development of AI technology and for its safe deployment in high-risk applications.
Abstract: Every AI system is deployed by a human organization. In high-risk applications, the combined human-plus-AI system must function as a high-reliability organization in order to avoid catastrophic errors. This short note reviews the properties of high-reliability organizations and draws implications for the development of AI technology and the safe application of that technology.

25 citations


Proceedings ArticleDOI
TL;DR: This paper evaluates five strategies for handling missing values in anomaly detection and, based on experiments with the Isolation Forest (IF), LODA, and EGMM detectors, recommends proportional distribution for IF, MAP imputation for LODA, and marginalization for EGMM.
Abstract: Accurate weather data is important for improving agricultural productivity in developing countries. Unfortunately, weather sensors can fail for a wide variety of reasons. One approach to detecting failed sensors is to identify statistical anomalies in the joint distribution of sensor readings. This powerful method can break down if some of the sensor readings are missing. This paper evaluates five strategies for handling missing values in anomaly detection: (a) mean imputation, (b) MAP imputation, (c) reduction (reduced-dimension anomaly detectors via feature bagging), (d) marginalization (for density estimators only), and (e) proportional distribution (for tree-based methods only). Our analysis suggests that MAP imputation and proportional distribution should give better results than mean imputation, reduction, and marginalization. These hypotheses are largely confirmed by experimental studies on synthetic data and on anomaly detection benchmark data sets using the Isolation Forest (IF), LODA, and EGMM anomaly detection algorithms. However, marginalization worked surprisingly well for EGMM, and there are exceptions where reduction works well on some benchmark problems. We recommend proportional distribution for IF, MAP imputation for LODA, and marginalization for EGMM.
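To make the tree-based strategy concrete, the sketch below shows proportional distribution in a single isolation tree: when the split feature is missing at a node, the instance is routed down both children, weighted by the fraction of training points that went each way. The tree structure and counts are illustrative, not taken from the paper's experiments.

```python
import math

class Node:
    """Illustrative isolation-tree node. Leaves store the number of
    training points they received (size); internal nodes store the
    split and how many training points went left/right."""
    def __init__(self, feature=None, threshold=None, left=None, right=None,
                 n_left=0, n_right=0, size=0):
        self.feature, self.threshold = feature, threshold
        self.left, self.right = left, right
        self.n_left, self.n_right, self.size = n_left, n_right, size

def c(n):
    """Average path length of unsuccessful BST search; the standard
    isolation-forest depth adjustment for unsplit points in a leaf."""
    return 2.0 * (math.log(n - 1) + 0.5772156649) - 2.0 * (n - 1) / n if n > 1 else 0.0

def expected_depth(node, x, depth=0):
    """Expected isolation depth of x, distributing the instance
    proportionally when the split feature is missing (None)."""
    if node.left is None:  # leaf
        return depth + c(node.size)
    v = x.get(node.feature)
    if v is None:  # missing: weight children by training mass
        total = node.n_left + node.n_right
        return (node.n_left / total) * expected_depth(node.left, x, depth + 1) \
             + (node.n_right / total) * expected_depth(node.right, x, depth + 1)
    child = node.left if v < node.threshold else node.right
    return expected_depth(child, x, depth + 1)

# Tiny hand-built tree: split on feature "a" at 0.5; 8 points went left, 2 right.
tree = Node(feature="a", threshold=0.5,
            left=Node(size=8), right=Node(size=2), n_left=8, n_right=2)
print(expected_depth(tree, {"a": None}))  # weighted average over both branches
```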

22 citations


Proceedings ArticleDOI
01 Aug 2019
TL;DR: A technique called 'three-quarter sibling regression' is presented that can filter the effect of systematic noise when the latent variables have observed common causes; it reduces systematic detection variability due to moon brightness in moth surveys.
Abstract: Many ecological studies and conservation policies are based on field observations of species, which can be affected by systematic variability introduced by the observation process. A recently introduced causal modeling technique called 'half-sibling regression' can detect and correct for systematic errors in measurements of multiple independent random variables. However, it will remove intrinsic variability if the variables are dependent, and therefore does not apply to many situations, including modeling of species counts that are controlled by common causes. We present a technique called 'three-quarter sibling regression' to partially overcome this limitation. It can filter the effect of systematic noise when the latent variables have observed common causes. We provide theoretical justification of this approach, demonstrate its effectiveness on synthetic data, and show that it reduces systematic detection variability due to moon brightness in moth surveys.
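A rough sketch of the idea on synthetic data follows; this is one plausible reading of the abstract, not the paper's exact estimator. First regress the observed common cause (e.g., moon brightness) out of both the target and the sibling variable, then apply half-sibling-style regression to the residuals, so that only the shared systematic observation noise, and not the shared cause, is subtracted.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
n = 2000

# Synthetic setup: Z is an observed common cause (e.g., moon brightness),
# N is unobserved systematic observation noise shared by both counts.
Z = rng.normal(size=(n, 1))
N = rng.normal(size=n)
true_y = 2.0 * Z[:, 0] + rng.normal(size=n)  # signal of interest
X = 1.5 * Z[:, 0] + N + rng.normal(size=n)   # sibling variable
Y = true_y + N                               # observed, noise-corrupted target

# Step 1: regress the observed common cause Z out of both variables.
rY = Y - LinearRegression().fit(Z, Y).predict(Z)
rX = X - LinearRegression().fit(Z, X).predict(Z)

# Step 2: half-sibling-style correction on the residuals. Whatever rX
# still predicts about rY is attributed to the shared systematic noise N.
noise_hat = LinearRegression().fit(rX.reshape(-1, 1), rY).predict(rX.reshape(-1, 1))
Y_denoised = Y - noise_hat

print("corr(raw Y, truth):     ", np.corrcoef(Y, true_y)[0, 1].round(3))
print("corr(denoised Y, truth):", np.corrcoef(Y_denoised, true_y)[0, 1].round(3))
```

On this toy setup the denoised series should correlate more strongly with the true signal than the raw observations do, since part of the shared noise N is removed while the Z-driven signal is left intact.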

5 citations