Proceedings ArticleDOI

Trustworthy AI

TLDR
The tutorial on “Trustworthy AI” is proposed to address six critical issues in enhancing user and public trust in AI systems, namely: bias and fairness, explainability, robust mitigation of adversarial attacks, improved privacy and security in model building, being decent, and model attribution with transparency in lineage.
Abstract
Modern AI systems are reaping the advantage of novel learning methods, but with their increasing usage we are realizing their limitations and shortfalls. Among the most prominent are brittleness to minor adversarial changes in the input data, inability to explain their decisions, bias in their training data, and high opacity about the lineage of the system: how it was trained and tested, and under which parameters and conditions it can reliably guarantee a certain level of performance. Ensuring the privacy and security of the data, assigning appropriate credit to data sources, and delivering decent outputs are also required features of an AI system. We propose the tutorial on “Trustworthy AI” to address six critical issues in enhancing user and public trust in AI systems, namely: (i) bias and fairness, (ii) explainability, (iii) robust mitigation of adversarial attacks, (iv) improved privacy and security in model building, (v) being decent, and (vi) model attribution, including the right level of credit assignment to the data sources, model architectures, and transparency in lineage.


Citations
Proceedings ArticleDOI

Assessing the Alignment of Social Robots with Trustworthy AI Design Guidelines: A Preliminary Research Study

TL;DR: In this article, the authors explore flaws within a social robot's system and analyze them to assess how well the robot's design aligns with the IEEE global standards on ethically aligned, trustworthy autonomous and intelligent systems (IEEE A/IS Standards).
Posted Content

Socially Responsible AI Algorithms: Issues, Purposes, and Challenges

TL;DR: In this article, the authors provide a systematic framework of socially responsible AI algorithms and discuss how to leverage this framework to improve societal well-being through protection, information, and prevention/mitigation.
Proceedings ArticleDOI

Ethics of Trust/worthiness in Autonomous Systems: a scoping review.

TL;DR: This scoping review surveys the literature to identify the problematic nature of adaptive autonomous systems with evolving functionality (AASEFs), the ethical worries they generate, and the ethical principles affected.
References
Journal ArticleDOI

Adversarial Examples: Attacks and Defenses for Deep Learning

TL;DR: In this paper, the authors review recent findings on adversarial examples for DNNs, summarize the methods for generating adversarial samples, and propose a taxonomy of these methods.
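As a concrete illustration of one widely studied generation method covered by such surveys, here is a minimal sketch of the Fast Gradient Sign Method (FGSM) in PyTorch; the model, loss function, epsilon value, and the [0, 1] pixel range are illustrative assumptions, not details taken from the paper.

```python
import torch

def fgsm_attack(model, loss_fn, x, y, epsilon=0.03):
    """Fast Gradient Sign Method: nudge each input pixel by epsilon in the
    direction that increases the model's loss, yielding an adversarial example."""
    x = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x), y)
    loss.backward()
    x_adv = x + epsilon * x.grad.sign()
    return x_adv.clamp(0.0, 1.0).detach()  # assumes pixels live in [0, 1]
```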
Proceedings ArticleDOI

Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks.

Abstract: Although deep neural networks (DNNs) have achieved great success in many tasks, they can often be fooled by adversarial examples that are generated by adding small but purposeful distortions to natural examples. Previous studies to defend against adversarial examples mostly focused on refining the DNN models, but have either shown limited success or required expensive computation. We propose a new strategy, feature squeezing, that can be used to harden DNN models by detecting adversarial examples. Feature squeezing reduces the search space available to an adversary by coalescing samples that correspond to many different feature vectors in the original space into a single sample. By comparing a DNN model's prediction on the original input with that on squeezed inputs, feature squeezing detects adversarial examples with high accuracy and few false positives. This paper explores two feature squeezing methods: reducing the color bit depth of each pixel and spatial smoothing. These simple strategies are inexpensive and complementary to other defenses, and can be combined in a joint detection framework to achieve high detection rates against state-of-the-art attacks.
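The detection logic described in the abstract translates almost directly into code. Below is a minimal sketch of the two squeezers and the prediction-comparison test, assuming images in [0, 1] and any callable model_probs that returns a probability vector; the bit depth, filter size, and threshold are illustrative placeholders, not the paper's tuned values.

```python
import numpy as np
from scipy.ndimage import median_filter

def reduce_bit_depth(x, bits=4):
    """Squeezer 1: quantize pixel values (in [0, 1]) to 2**bits levels."""
    levels = 2 ** bits - 1
    return np.round(x * levels) / levels

def median_smooth(x, size=2):
    """Squeezer 2: spatial smoothing with a small median filter."""
    return median_filter(x, size=size)

def is_adversarial(model_probs, x, threshold=1.0):
    """Flag x if the prediction on any squeezed input drifts too far
    (in L1 distance) from the prediction on the original input."""
    p_orig = model_probs(x)
    drift = max(
        np.abs(p_orig - model_probs(reduce_bit_depth(x))).sum(),
        np.abs(p_orig - model_probs(median_smooth(x))).sum(),
    )
    return drift > threshold
```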
Journal ArticleDOI

Adversarial Attacks and Defenses in Deep Learning

TL;DR: The theoretical foundations, algorithms, and applications of adversarial attack techniques are introduced and a few research efforts on the defense techniques are described, which cover the broad frontier in the field.
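Among the defenses such surveys describe, adversarial training is the most common; the sketch below shows one hypothetical training step that crafts FGSM perturbations on the fly and updates the model on them. The model, optimizer, epsilon, and pixel range are assumptions for illustration, not the survey's prescription.

```python
import torch
import torch.nn as nn

def adversarial_training_step(model, optimizer, x, y, epsilon=0.03):
    """One adversarial-training step: craft FGSM examples against the
    current weights, then train the model on the perturbed batch."""
    loss_fn = nn.CrossEntropyLoss()
    # Generate adversarial inputs with a single gradient-sign step.
    x_pert = x.clone().detach().requires_grad_(True)
    loss_fn(model(x_pert), y).backward()
    x_adv = (x_pert + epsilon * x_pert.grad.sign()).clamp(0.0, 1.0).detach()
    # Standard supervised update, but on the adversarial batch.
    optimizer.zero_grad()
    loss = loss_fn(model(x_adv), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```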
Proceedings ArticleDOI

SafetyNet: Detecting and Rejecting Adversarial Examples Robustly

TL;DR: In this paper, the authors describe a method to produce a network on which current attacks such as DeepFool have great difficulty producing adversarial samples, provide a reasoned analysis of why their construction is difficult to defeat, and show experimentally that it resists both Type I and Type II attacks on several standard networks and datasets.
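The TL;DR does not spell out the mechanism; to my understanding, SafetyNet appends a detector, an RBF-SVM over quantized codes of late-layer ReLU activations, and rejects inputs the detector flags. The sketch below illustrates that idea with synthetic activation vectors; the data, layer choice, and all parameters are placeholders, not the authors' setup.

```python
import numpy as np
from sklearn.svm import SVC

def relu_code(activations, threshold=0.0):
    """Quantize late-layer ReLU activations into a binary code."""
    return (activations > threshold).astype(np.float32)

# Synthetic stand-ins for activation vectors collected from a late layer
# of the protected network on clean vs. adversarial inputs (hypothetical).
rng = np.random.default_rng(0)
clean_acts = rng.normal(1.0, 1.0, size=(200, 64))
adv_acts = rng.normal(0.3, 1.5, size=(200, 64))

X = np.vstack([relu_code(clean_acts), relu_code(adv_acts)])
y = np.hstack([np.zeros(200), np.ones(200)])

# RBF-SVM detector over the codes; inputs it flags as adversarial are rejected.
detector = SVC(kernel="rbf").fit(X, y)
print(detector.predict(relu_code(adv_acts[:5])))  # 1.0 => reject
```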
Trending Questions (3)
How does privacy affect the perception of AI as trustworthy, or the acceptance of AI use?

Privacy in AI impacts trust by ensuring data security, fair credit assignment, and transparent model lineage, enhancing user acceptance and perception of AI as trustworthy.

What are some common criticisms or concerns regarding the credibility and disadvantages of using AI?

Common criticisms include AI's brittleness to adversarial changes, lack of explainability, bias in training data, opacity in lineage, privacy concerns, and inadequate credit assignment.

How trustworthy is AI perceived to be?

The paper does not directly answer how trustworthy AI is perceived to be; instead, it discusses the limitations and challenges of AI systems and proposes a tutorial on “Trustworthy AI” to address them.