Open Access · Posted Content
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses.
Micah Goldblum, Dimitris Tsipras, Chulin Xie, Xinyun Chen, Avi Schwarzschild, Dawn Song, Aleksander Madry, Bo Li, Tom Goldstein
TLDR
In this article, the authors systematically categorize and discuss a wide range of dataset vulnerabilities and exploits, approaches for defending against these threats, and an array of open problems in this space.
Abstract
As machine learning systems grow in scale, so do their training data requirements, forcing practitioners to automate and outsource the curation of training data in order to achieve state-of-the-art performance. The absence of trustworthy human supervision over the data collection process exposes organizations to security vulnerabilities; training data can be manipulated to control and degrade the downstream behaviors of learned models. The goal of this work is to systematically categorize and discuss a wide range of dataset vulnerabilities and exploits, approaches for defending against these threats, and an array of open problems in this space. In addition to describing various poisoning and backdoor threat models and the relationships among them, we develop their unified taxonomy.
Citations
Posted Content
Evaluating Large Language Models Trained on Code
Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harrison Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth A. Barnes, Ariel Herbert-Voss, William H. Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Joshua Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew M. Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob McGrew, Dario Amodei, Samuel McCandlish, Ilya Sutskever, Wojciech Zaremba
TL;DR: Codex is a GPT language model fine-tuned on publicly available code from GitHub; the authors study its Python code-writing capabilities and show that repeated sampling from the model is a surprisingly effective strategy for producing working solutions to difficult prompts.
Proceedings ArticleDOI
Strong Data Augmentation Sanitizes Poisoning and Backdoor Attacks Without an Accuracy Tradeoff
Eitan Borgnia, Valeriia Cherepanova, Liam Fowl, Amin Ghiasi, Jonas Geiping, Micah Goldblum, Tom Goldstein, Arjun Gupta
TL;DR: The authors show that strong data augmentations, such as mixup and CutMix, can significantly diminish the threat of poisoning and backdoor attacks without trading off performance. They further verify the effectiveness of this simple defense against adaptive poisoning methods and compare it to baselines, including the popular differentially private SGD (DP-SGD) defense.
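The mixup augmentation mentioned in this TL;DR blends pairs of training examples and their labels, which dilutes any poisoned pattern tied to a single example. A minimal NumPy sketch (the Beta-distribution parameter `alpha` and the use of one-hot labels are illustrative assumptions, not details from this page):

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=1.0, rng=None):
    """Blend two examples and their one-hot labels with a Beta-sampled weight.

    Interpolating inputs and labels weakens backdoor triggers and
    label-flip poisons that rely on individual examples surviving intact.
    """
    if rng is None:
        rng = np.random.default_rng()
    lam = rng.beta(alpha, alpha)          # mixing coefficient in (0, 1)
    x = lam * x1 + (1.0 - lam) * x2       # convex combination of inputs
    y = lam * y1 + (1.0 - lam) * y2       # matching soft label
    return x, y
```

CutMix follows the same idea but pastes a rectangular patch of one image into another instead of interpolating pixel-wise.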
Posted Content
Property Inference From Poisoning
TL;DR: The authors propose a poisoning attack that allows the adversary to learn the prevalence of any chosen property in the training data, showing that poisoning can significantly boost information leakage and should be considered a stronger threat model in sensitive applications.
Journal ArticleDOI
Adversarial XAI Methods in Cybersecurity
Aditya Kuppa, Nhien-An Le-Khac
TL;DR: A black-box attack is proposed that leverages explainable artificial intelligence (XAI) methods to compromise the confidentiality and privacy properties of underlying classifiers; the same methods can also facilitate powerful attacks such as evasion, poisoning, and backdoor attacks.
Posted Content
What Doesn't Kill You Makes You Robust(er): Adversarial Training against Poisons and Backdoors.
TL;DR: In this paper, the authors extend the adversarial training framework to instead defend against (training-time) poisoning and backdoor attacks, and they show that this defense withstands adaptive attacks, generalizes to diverse threat models, and incurs a better performance trade-off than previous defenses.
References
Posted Content
TABOR: A Highly Accurate Approach to Inspecting and Restoring Trojan Backdoors in AI Systems.
TL;DR: TABOR formalizes trojan detection as a non-convex optimization problem, casting the detection of a trojan backdoor as the task of resolving that optimization through an objective function, and designs a new objective function that guides the optimization to identify trojan backdoors more effectively.
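The optimization TABOR builds on is trigger reverse-engineering: search for a small perturbation that pushes the model toward a target class, penalizing trigger size so that a backdoored class stands out by admitting an abnormally small trigger. A toy sketch on a linear softmax classifier (the analytic gradient, learning rate, and L1 weight `lam` are illustrative assumptions; TABOR's actual objective adds several further regularizers for trigger shape and location):

```python
import numpy as np

def reverse_engineer_trigger(W, b, x, target, steps=200, lr=0.1, lam=0.01):
    """Gradient descent for an additive perturbation (candidate trigger)
    that drives a linear softmax classifier W @ x + b toward `target`,
    with an L1 penalty encouraging a small trigger."""
    delta = np.zeros_like(x, dtype=float)
    for _ in range(steps):
        logits = W @ (x + delta) + b
        p = np.exp(logits - logits.max())
        p /= p.sum()                              # softmax probabilities
        grad_logits = p.copy()
        grad_logits[target] -= 1.0                # d(cross-entropy)/d(logits)
        grad = W.T @ grad_logits + lam * np.sign(delta)  # add L1 subgradient
        delta -= lr * grad
    return delta
```

In a full detector, this search is run once per candidate target class, and a class whose recovered trigger has an outlier-small norm is flagged as backdoored.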
Posted Content
Recent Advances in Algorithmic High-Dimensional Robust Statistics.
TL;DR: The core ideas and algorithmic techniques of the emerging area of algorithmic high-dimensional robust statistics are introduced, with a focus on robust mean estimation, and an overview is provided of the approaches that have led to computationally efficient robust estimators for a range of broader statistical tasks.
Proceedings ArticleDOI
Universal Litmus Patterns: Revealing Backdoor Attacks in CNNs
TL;DR: The concept of Universal Litmus Patterns (ULPs) is introduced, which enable one to reveal backdoor attacks by feeding these universal patterns to the network and analyzing the output (i.e., classifying the network as 'clean' or 'corrupted').
Journal ArticleDOI
Robust Covariance and Scatter Matrix Estimation under Huber's Contamination Model
Mengjie Chen, Chao Gao, Zhao Ren
TL;DR: A new concept called matrix depth is defined, and a robust covariance matrix estimator is proposed that is shown to achieve the minimax optimal rate under Huber's $\epsilon$-contamination model for estimating covariance/scatter matrices with various structures, including bandedness and sparsity.
Proceedings ArticleDOI
Model-Reuse Attacks on Deep Learning Systems
TL;DR: It is demonstrated that malicious primitive models pose immense threats to the security of ML systems, and analytical justification for the effectiveness of model-reuse attacks is provided, pointing to the unprecedented complexity of today's primitive models as a root cause.