Showing papers by "Margaret Mitchell" published in 2019

Proceedings Article

Model Cards for Model Reporting

Margaret Mitchell, Simone Wu, Andrew Zaldivar, Parker Barnes, Lucy Vasserman, Ben Hutchinson, Elena Spitzer, Inioluwa Deborah Raji, Timnit Gebru

29 Jan 2019

TL;DR: Model cards are short documents accompanying trained machine learning models that provide benchmarked evaluation in a variety of conditions, such as across different cultural, demographic, or phenotypic groups (e.g., race, geographic location, sex, Fitzpatrick skin type) that are relevant to the intended application domains.

Abstract: Trained machine learning models are increasingly used to perform high-impact tasks in areas such as law enforcement, medicine, education, and employment. In order to clarify the intended use cases of machine learning models and minimize their usage in contexts for which they are not well suited, we recommend that released models be accompanied by documentation detailing their performance characteristics. In this paper, we propose a framework that we call model cards, to encourage such transparent model reporting. Model cards are short documents accompanying trained machine learning models that provide benchmarked evaluation in a variety of conditions, such as across different cultural, demographic, or phenotypic groups (e.g., race, geographic location, sex, Fitzpatrick skin type [15]) and intersectional groups (e.g., age and race, or sex and Fitzpatrick skin type) that are relevant to the intended application domains. Model cards also disclose the context in which models are intended to be used, details of the performance evaluation procedures, and other relevant information. While we focus primarily on human-centered machine learning models in the application fields of computer vision and natural language processing, this framework can be used to document any trained machine learning model. To solidify the concept, we provide cards for two supervised models: One trained to detect smiling faces in images, and one trained to detect toxic comments in text. We propose model cards as a step towards the responsible democratization of machine learning and related artificial intelligence technology, increasing transparency into how well artificial intelligence technology works. We hope this work encourages those releasing trained machine learning models to accompany model releases with similar detailed evaluation numbers and other relevant documentation.
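
The disaggregated evaluation described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the function name `disaggregated_accuracy`, the choice of accuracy as the metric, the attribute names, and the toy records are all assumptions made up for the example.

```python
from collections import defaultdict
from itertools import combinations


def disaggregated_accuracy(records, attributes):
    """Accuracy overall, per attribute value, and per pair of attribute values.

    Hypothetical helper: the metric (accuracy) and the record layout are
    illustrative assumptions, not taken from the paper.
    """
    totals = defaultdict(lambda: [0, 0])  # group key -> [correct, count]
    for rec in records:
        hit = int(rec["label"] == rec["prediction"])
        keys = [("overall",)]
        # Single-attribute groups, e.g. (("sex", "female"),)
        keys += [((a, rec[a]),) for a in attributes]
        # Intersectional groups, e.g. (("sex", "female"), ("age", "old"))
        keys += [((a, rec[a]), (b, rec[b])) for a, b in combinations(attributes, 2)]
        for key in keys:
            totals[key][0] += hit
            totals[key][1] += 1
    return {key: correct / count for key, (correct, count) in totals.items()}


# Made-up toy data, for illustration only.
records = [
    {"sex": "female", "age": "young", "label": 1, "prediction": 1},
    {"sex": "female", "age": "old", "label": 1, "prediction": 0},
    {"sex": "male", "age": "young", "label": 0, "prediction": 0},
    {"sex": "male", "age": "old", "label": 1, "prediction": 1},
]
report = disaggregated_accuracy(records, ["sex", "age"])
```

Reporting the intersectional rows next to the single-attribute rows matters because a gap can be hidden in the marginals; in the toy data the only error falls entirely in one intersectional group.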

678 citations

Proceedings Article

50 Years of Test (Un)fairness: Lessons for Machine Learning

Ben Hutchinson, Margaret Mitchell

29 Jan 2019

TL;DR: This work traces how the notion of fairness has been defined within the testing communities of education and hiring over the past half century, exploring the cultural and social context in which different fairness definitions have emerged.

Abstract: Quantitative definitions of what is unfair and what is fair have been introduced in multiple disciplines for well over 50 years, including in education, hiring, and machine learning. We trace how the notion of fairness has been defined within the testing communities of education and hiring over the past half century, exploring the cultural and social context in which different fairness definitions have emerged. In some cases, earlier definitions of fairness are similar or identical to definitions of fairness in current machine learning research, and foreshadow current formal work. In other cases, insights into what fairness means and how to measure it have largely gone overlooked. We compare past and current notions of fairness along several dimensions, including the fairness criteria, the focus of the criteria (e.g., a test, a model, or its use), the relationship of fairness to individuals, groups, and subgroups, and the mathematical method for measuring fairness (e.g., classification, regression). This work points the way towards future research and measurement of (un)fairness that builds from our modern understanding of fairness while incorporating insights from the past.

256 citations

Proceedings Article

Perturbation Sensitivity Analysis to Detect Unintended Model Biases

Vinodkumar Prabhakaran¹, Ben Hutchinson¹, Margaret Mitchell¹

Google¹

01 Nov 2019

TL;DR: A generic evaluation framework, Perturbation Sensitivity Analysis, is proposed; it detects unintended model biases related to named entities and requires no new annotations or corpora.

Abstract: Data-driven statistical Natural Language Processing (NLP) techniques leverage large amounts of language data to build models that can understand language. However, most language data reflect the public discourse at the time the data was produced, and hence NLP models are susceptible to learning incidental associations around named referents at a particular point in time, in addition to general linguistic meaning. An NLP system designed to model notions such as sentiment and toxicity should ideally produce scores that are independent of the identity of such entities mentioned in text and their social associations. For example, in a general purpose sentiment analysis system, a phrase such as I hate Katy Perry should be interpreted as having the same sentiment as I hate Taylor Swift. Based on this idea, we propose a generic evaluation framework, Perturbation Sensitivity Analysis, which detects unintended model biases related to named entities, and requires no new annotations or corpora. We demonstrate the utility of this analysis by employing it on two different NLP models — a sentiment model and a toxicity model — applied on online comments in English language from four different genres.
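
The core idea in this abstract, swapping the named entity in a sentence and checking whether the score moves, can be sketched as below. This is only an illustration under stated assumptions: the abstract does not define the metrics, so the score range and standard deviation used here are stand-ins, and `perturbation_sensitivity`, `toy_score`, and the templates are hypothetical names and data invented for the example.

```python
from statistics import pstdev


def perturbation_sensitivity(score, templates, names, slot="{name}"):
    """For each template, score every name substitution and summarize the spread.

    Assumption: range and population standard deviation are illustrative
    summaries, not necessarily the metrics defined in the paper.
    """
    results = []
    for template in templates:
        scores = [score(template.replace(slot, name)) for name in names]
        results.append({
            "template": template,
            "range": max(scores) - min(scores),
            "deviation": pstdev(scores),
        })
    return results


def toy_score(text):
    """Hypothetical stand-in for a sentiment model, with a deliberate name bias."""
    value = -1.0 if "hate" in text else 0.0
    if "Taylor Swift" in text:
        value += 0.25  # the unintended association we want the analysis to expose
    return value


names = ["Katy Perry", "Taylor Swift"]
templates = ["I hate {name}", "I saw {name} yesterday"]
report = perturbation_sensitivity(toy_score, templates, names)
```

A scorer that ignores the entity would give a range of zero for every template; any non-zero spread points at a name-dependent association. Because the templates can be built from existing text, no new annotations are needed, which matches the claim in the abstract.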

94 citations

Posted Content

Detecting Bias with Generative Counterfactual Face Attribute Augmentation

Emily Denton, Ben Hutchinson, Margaret Mitchell, Timnit Gebru

14 Jun 2019 - arXiv: Computer Vision and Pattern Recognition

TL;DR: A simple framework for identifying biases of a smiling attribute classifier is introduced, along with a set of metrics that measure the effect of manipulating a specific property of an image on the output of a trained classifier.

Abstract: We introduce a simple framework for identifying biases of a smiling attribute classifier. Our method poses counterfactual questions of the form: how would the prediction change if this face characteristic had been different? We leverage recent advances in generative adversarial networks to build a realistic generative model of face images that affords controlled manipulation of specific image characteristics. We introduce a set of metrics that measure the effect of manipulating a specific property of an image on the output of a trained classifier. Empirically, we identify several different factors of variation that affect the predictions of a smiling classifier trained on CelebA.
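
The counterfactual question posed in this abstract can be sketched as a paired before/after comparison. This is a rough illustration only: the abstract does not specify the metrics, so the mean score shift and prediction flip rate below are assumed stand-ins, and `counterfactual_effect`, `toy_classify`, and `toy_manipulate` are hypothetical. In particular, `toy_manipulate` edits a small feature dictionary in place of the generative model that the paper uses to manipulate real face images.

```python
def counterfactual_effect(classify, manipulate, inputs, threshold=0.5):
    """Compare classifier outputs before and after manipulating one property.

    Assumption: mean score shift and flip rate are illustrative metrics,
    not necessarily those defined in the paper.
    """
    deltas = []
    flips = 0
    for x in inputs:
        before = classify(x)
        after = classify(manipulate(x))
        deltas.append(after - before)
        # Count cases where the thresholded decision changes.
        flips += int((before >= threshold) != (after >= threshold))
    n = len(inputs)
    return {"mean_shift": sum(deltas) / n, "flip_rate": flips / n}


def toy_classify(x):
    """Hypothetical smiling score that wrongly depends on an unrelated attribute."""
    return x["mouth"] + 0.25 * x["attr"]


def toy_manipulate(x):
    """Hypothetical stand-in for a generative edit: switch the attribute on."""
    return {**x, "attr": 1.0}


inputs = [{"mouth": 0.25, "attr": 0.0}, {"mouth": 0.5, "attr": 0.0}]
effect = counterfactual_effect(toy_classify, toy_manipulate, inputs)
```

A classifier that depended only on the mouth feature would show a mean shift and flip rate of zero under this manipulation.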

88 citations

Posted Content

Image Counterfactual Sensitivity Analysis for Detecting Unintended Bias

Emily Denton, Ben Hutchinson, Margaret Mitchell, Timnit Gebru, Andrew Zaldivar¹

Google¹

14 Jun 2019 - arXiv: Computer Vision and Pattern Recognition

TL;DR: A framework called image counterfactual sensitivity analysis is proposed, which is explored as a proof-of-concept in analyzing a smiling attribute classifier trained on faces of celebrities and demonstrates potential ways generative models can be leveraged for fine-grained analysis of bias and fairness.

Abstract: Facial analysis models are increasingly used in applications that have serious impacts on people's lives, ranging from authentication to surveillance tracking. It is therefore critical to develop techniques that can reveal unintended biases in facial classifiers to help guide the ethical use of facial analysis technology. This work proposes a framework called image counterfactual sensitivity analysis, which we explore as a proof-of-concept in analyzing a smiling attribute classifier trained on faces of celebrities. The framework utilizes counterfactuals to examine how a classifier's prediction changes if a face characteristic slightly changes. We leverage recent advances in generative adversarial networks to build a realistic generative model of face images that affords controlled manipulation of specific image characteristics. We then introduce a set of metrics that measure the effect of manipulating a specific property on the output of the trained classifier. Empirically, we find several different factors of variation that affect the predictions of the smiling classifier. This proof-of-concept demonstrates potential ways generative models can be leveraged for fine-grained analysis of bias and fairness.

44 citations

Proceedings Article

Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned

Sarah Bird¹, Krishnaram Kenthapadi², Emre Kiciman³, Margaret Mitchell⁴

Facebook¹, LinkedIn², Microsoft³, Google⁴

30 Jan 2019

TL;DR: This tutorial aims to present an overview of algorithmic bias / discrimination issues observed over the last few years and the lessons learned, key regulations and laws, and evolution of techniques for achieving fairness in machine learning systems.

Abstract: Researchers and practitioners from different disciplines have highlighted the ethical and legal challenges posed by the use of machine learned models and data-driven systems, and the potential for such systems to discriminate against certain population groups, due to biases in algorithmic decision-making systems. This tutorial aims to present an overview of algorithmic bias / discrimination issues observed over the last few years and the lessons learned, key regulations and laws, and evolution of techniques for achieving fairness in machine learning systems. We will motivate the need for adopting a "fairness-first" approach (as opposed to viewing algorithmic bias / fairness considerations as an afterthought), when developing machine learning based models and systems for different consumer and enterprise applications. Then, we will focus on the application of fairness-aware machine learning techniques in practice, by presenting case studies from different technology companies. Based on our experiences in industry, we will identify open problems and research challenges for the data mining / machine learning community.

39 citations

Proceedings Article

Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned

Sarah Bird¹, Ben Hutchinson², Krishnaram Kenthapadi³, Emre Kiciman⁴, Margaret Mitchell²

Facebook¹, Google², LinkedIn³, Microsoft⁴

13 May 2019

Abstract: Researchers and practitioners from different disciplines have highlighted the ethical and legal challenges posed by the use of machine learned models and data-driven systems, and the potential for such systems to discriminate against certain population groups, due to biases in algorithmic decision-making systems. This tutorial aims to present an overview of algorithmic bias / discrimination issues observed over the last few years and the lessons learned, key regulations and laws, and evolution of techniques for achieving fairness in machine learning systems. We will motivate the need for adopting a “fairness-first” approach (as opposed to viewing algorithmic bias / fairness considerations as an afterthought), when developing machine learning based models and systems for different consumer and enterprise applications. Then, we will focus on the application of fairness-aware machine learning techniques in practice, by highlighting industry best practices and case studies from different technology companies. Based on our experiences in industry, we will identify open problems and research challenges for the data mining / machine learning community.

13 citations

Posted Content

Perturbation Sensitivity Analysis to Detect Unintended Model Biases

Vinodkumar Prabhakaran¹, Ben Hutchinson¹, Margaret Mitchell¹

Google¹

09 Oct 2019 - arXiv: Computation and Language

TL;DR: This paper proposes a generic evaluation framework, Perturbation Sensitivity Analysis, which detects unintended model biases related to named entities, and demonstrates its utility by applying it to two NLP models (a sentiment model and a toxicity model) on English-language online comments from four different genres.

Abstract: Data-driven statistical Natural Language Processing (NLP) techniques leverage large amounts of language data to build models that can understand language. However, most language data reflect the public discourse at the time the data was produced, and hence NLP models are susceptible to learning incidental associations around named referents at a particular point in time, in addition to general linguistic meaning. An NLP system designed to model notions such as sentiment and toxicity should ideally produce scores that are independent of the identity of such entities mentioned in text and their social associations. For example, in a general purpose sentiment analysis system, a phrase such as I hate Katy Perry should be interpreted as having the same sentiment as I hate Taylor Swift. Based on this idea, we propose a generic evaluation framework, Perturbation Sensitivity Analysis, which detects unintended model biases related to named entities, and requires no new annotations or corpora. We demonstrate the utility of this analysis by employing it on two different NLP models (a sentiment model and a toxicity model) applied on online comments in English language from four different genres.

5 citations

Posted Content

Interpreting Social Respect: A Normative Lens for ML Models

Ben Hutchinson, K. J. Pittl, Margaret Mitchell

01 Aug 2019 - arXiv: Computers and Society

TL;DR: This paper argues that because minority and marginalized members of society are often statistically underrepresented in data sets, models may have undesirable disparate impact on such groups.

Abstract: Machine learning is often viewed as an inherently value-neutral process: statistical tendencies in the training inputs are "simply" used to generalize to new examples. However when models impact social systems such as interactions between humans, these patterns learned by models have normative implications. It is important that we ask not only "what patterns exist in the data?", but also "how do we want our system to impact people?" In particular, because minority and marginalized members of society are often statistically underrepresented in data sets, models may have undesirable disparate impact on such groups. As such, objectives of social equity and distributive justice require that we develop tools for both identifying and interpreting harms introduced by models.

3 citations

Proceedings Article

Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned

Sarah Bird¹, Ben Hutchinson², Krishnaram Kenthapadi³, Emre Kiciman¹, Margaret Mitchell²

Microsoft¹, Google², LinkedIn³

25 Jul 2019

Abstract: Researchers and practitioners from different disciplines have highlighted the ethical and legal challenges posed by the use of machine learned models and data-driven systems, and the potential for such systems to discriminate against certain population groups, due to biases in algorithmic decision-making systems. This tutorial aims to present an overview of algorithmic bias / discrimination issues observed over the last few years and the lessons learned, key regulations and laws, and evolution of techniques for achieving fairness in machine learning systems. We will motivate the need for adopting a "fairness-first" approach (as opposed to viewing algorithmic bias / fairness considerations as an afterthought), when developing machine learning based models and systems for different consumer and enterprise applications. Then, we will focus on the application of fairness-aware machine learning techniques in practice, by highlighting industry best practices and case studies from different technology companies. Based on our experiences in industry, we will identify open problems and research challenges for the data mining / machine learning community.

2 citations