Home
/
Authors
/
Anvith Thudi

Author

Anvith Thudi

Bio: Anvith Thudi is an academic researcher from University of Toronto. The author has contributed to research in topics: Computer science & Stochastic gradient descent. The author has an hindex of 1, co-authored 5 publications receiving 7 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Proof-of-Learning: Definitions and Practice

[...]

Hengrui Jia¹, Mohammad Yaghini¹, Christopher A. Choquette-Choo¹, Natalie Dullerud¹, Anvith Thudi¹, Varun Chandrasekaran², Nicolas Papernot¹ - Show less +3 more•Institutions (2)

University of Toronto¹, University of Wisconsin-Madison²

23 May 2021

TL;DR: In this paper, the authors introduce the concept of proof-of-learning in machine learning and demonstrate how a seminal training algorithm accumulates secret information due to its stochasticity.

...read moreread less

Abstract: Training machine learning (ML) models typically involves expensive iterative optimization. Once the model’s final parameters are released, there is currently no mechanism for the entity which trained the model to prove that these parameters were indeed the result of this optimization procedure. Such a mechanism would support security of ML applications in several ways. For instance, it would simplify ownership resolution when multiple parties contest ownership of a specific model. It would also facilitate the distributed training across untrusted workers where Byzantine workers might otherwise mount a denial-ofservice by returning incorrect model updates.In this paper, we remediate this problem by introducing the concept of proof-of-learning in ML. Inspired by research on both proof-of-work and verified computations, we observe how a seminal training algorithm, stochastic gradient descent, accumulates secret information due to its stochasticity. This produces a natural construction for a proof-of-learning which demonstrates that a party has expended the compute require to obtain a set of model parameters correctly. In particular, our analyses and experiments show that an adversary seeking to illegitimately manufacture a proof-of-learning needs to perform at least as much work than is needed for gradient descent itself.We also instantiate a concrete proof-of-learning mechanism in both of the scenarios described above. In model ownership resolution, it protects the intellectual property of models released publicly. In distributed training, it preserves availability of the training procedure. Our empirical evaluation validates that our proof-of-learning mechanism is robust to variance induced by the hardware (e.g., ML accelerators) and software stacks.

...read moreread less

39 citations

Journal Article•DOI•

Selective Classification Via Neural Network Training Dynamics

[...]

Stephan Rabanser, Anvith Thudi, Kimia Hamidieh, Adam Dziedzic, Nicolas Papernot - Show less +1 more

26 May 2022-arXiv.org

TL;DR: This work instantiates a method that tracks when the label predicted during training stops disagreeing with the ﬁnal predicted label, and achieves state-of-the-art accuracy/coverage trade-oﬀs on typical selective classiﬁcation benchmarks.

...read moreread less

Abstract: Selective classiﬁcation is the task of rejecting inputs a model would predict incorrectly on through a trade-oﬀ between input space coverage and model accuracy. Current methods for selective classiﬁcation impose constraints on either the model architecture or the loss function; this inhibits their usage in practice. In contrast to prior work, we show that state-of-the-art selective classiﬁcation performance can be attained solely from studying the (discretized) training dynamics of a model. We propose a general framework that, for a given test input, monitors metrics capturing the disagreement with the ﬁnal predicted label over intermediate models obtained during training; we then reject data points exhibiting too much disagreement at late stages in training. In particular, we instantiate a method that tracks when the label predicted during training stops disagreeing with the ﬁnal predicted label. Our experimental evaluation shows that our method achieves state-of-the-art accuracy/coverage trade-oﬀs on typical selective classiﬁcation benchmarks. For example, we improve coverage on CIFAR-10/SVHN by 10.1%/1.5% respectively at a ﬁxed target error of 0.5%.

...read moreread less

9 citations

Journal Article•

Bounding Membership Inference

[...]

Anvith Thudi, Ilia Shumailov, Franziska Boenisch, Nicolas Papernot

24 Feb 2022-arXiv.org

TL;DR: This paper provides a tighter bound on the positive accuracy of any MI adversary when a training algorithm provides (cid:15) -DP or ( (cID:15), δ )-DP, and informs the design of a novel privacy ampliﬁcation scheme, where an eﬀective training set is sub-sampled from a larger set prior to the beginning of training, to greatly reduce the bound on MI accuracy.

...read moreread less

Abstract: Differential Privacy (DP) is the de facto standard for reasoning about the privacy guarantees of a training algorithm. Despite the empirical observation that DP reduces the vulnerability of models to existing membership inference (MI) attacks, a theoretical underpinning as to why this is the case is largely missing in the literature. In practice, this means that models need to be trained with DP guarantees that greatly decrease their accuracy. In this paper, we provide a tighter bound on the positive accuracy (i.e., attack precision) of any MI adversary when a training algorithm provides $(\varepsilon, \delta)$-DP. Our bound informs the design of a novel privacy amplification scheme: an effective training set is sub-sampled from a larger set prior to the beginning of training. We find this greatly reduces the bound on MI positive accuracy. As a result, our scheme allows the use of looser DP guarantees to limit the success of any MI adversary; this ensures that the model's accuracy is less impacted by the privacy guarantee. While this clearly benefits entities working with far more data than they need to train on, it can also improve the accuracy-privacy trade-off on benchmarks studied in the academic literature. Consequently, we also find that subsampling decreases the effectiveness of a state-of-the-art MI attack (LiRA) much more effectively than training with stronger DP guarantees on MNIST and CIFAR10. We conclude by discussing implications of our MI bound on the field of machine unlearning.

...read moreread less

8 citations

Posted Content•

Unrolling SGD: Understanding Factors Influencing Machine Unlearning

[...]

Anvith Thudi, Gabriel Deza, Varun Chandrasekaran, Nicolas Papernot

27 Sep 2021-arXiv: Learning

TL;DR: In this paper, the authors taxonomize approaches and metrics of approximate unlearning and identify verification error, i.e., the L 2 difference between the weights of an approximately unlearned and a naively retrained model, as a metric approximate un learning should optimize for as it implies a large class of other metrics.

...read moreread less

Abstract: Machine unlearning is the process through which a deployed machine learning model forgets about one of its training data points. While naively retraining the model from scratch is an option, it is almost always associated with a large computational effort for deep learning models. Thus, several approaches to approximately unlearn have been proposed along with corresponding metrics that formalize what it means for a model to forget about a data point. In this work, we first taxonomize approaches and metrics of approximate unlearning. As a result, we identify verification error, i.e., the L2 difference between the weights of an approximately unlearned and a naively retrained model, as a metric approximate unlearning should optimize for as it implies a large class of other metrics. We theoretically analyze the canonical stochastic gradient descent (SGD) training algorithm to surface the variables which are relevant to reducing the verification error of approximate unlearning for SGD. From this analysis, we first derive an easy-to-compute proxy for verification error (termed unlearning error). The analysis also informs the design of a new training objective penalty that limits the overall change in weights during SGD and as a result facilitates approximate unlearning with lower verification error. We validate our theoretical work through an empirical evaluation on CIFAR-10, CIFAR-100, and IMDB sentiment analysis.

...read moreread less

3 citations

Journal Article•DOI•

On the Fundamental Limits of Formally (Dis)Proving Robustness in Proof-of-Learning

[...]

Cong Fang, He Jia, Anvith Thudi, Mohammad Yaghini, Christopher A. Choquette-Choo, Natalie Dullerud, Varun Chandrasekaran, Nicolas Papernot - Show less +4 more

arXiv.org

TL;DR: It is shown that, until the aforementioned open problems are addressed, relying more heavily on cryptography is likely needed to formulate a new class of PoL protocols with formal robustness guarantees, and that establishing precedence robustly also reduces to an open problem in learning theory.

...read moreread less

Abstract: —Proof-of-learning (PoL) proposes a model owner use machine learning training checkpoints to establish a proof of having expended the necessary compute for training. The authors of PoL forego cryptographic approaches and trade rigorous security guarantees for scalability to deep learning by being applicable to stochastic gradient descent and adaptive variants. This lack of formal analysis leaves the possibility that an attacker may be able to spoof a proof for a model they did not train. We contribute a formal analysis of why the PoL protocol cannot be formally (dis)proven to be robust against spooﬁng adversaries. To do so, we disentangle the two roles of proof veriﬁcation in PoL: (a) efﬁciently determining if a proof is a valid gradient descent trajectory, and (b) establishing precedence by making it more expensive to craft a proof after training completes (i.e., spooﬁng). We show that efﬁcient veriﬁcation results in a tradeoff between accepting legitimate proofs and rejecting invalid proofs because deep learning necessarily involves noise. Without a precise analytical model for how this noise affects training, we cannot formally guarantee if a PoL veriﬁcation algorithm is robust. Then, we demonstrate that establishing precedence robustly also reduces to an open problem in learning theory: spooﬁng a PoL post hoc training is akin to ﬁnding different trajectories with the same endpoint in non-convex learning. Yet, we do not rigorously know if priori knowledge of the ﬁnal model weights helps discover such trajectories. We conclude that, until the aforementioned open problems are addressed, relying more heavily on cryptography is likely needed to formulate a new class of PoL protocols with formal robustness guarantees. In particular, this will help with establishing precedence. As a by-product of insights from our analysis, we

...read moreread less

3 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Trustworthy Graph Neural Networks: Aspects, Methods and Trends

[...]

He Zhang, Bang Wu, Xingliang Yuan, Shirui Pan, Hanghang Tong, Jianjun Pei - Show less +2 more

16 May 2022-arXiv.org

TL;DR: A comprehensive roadmap to build trustworthy GNNs from the view of the various computing technologies involved is proposed, including robustness, explainability, privacy, fairness, accountability, and environmental well-being.

...read moreread less

Abstract: Graph neural networks (GNNs) have emerged as a series of competent graph learning methods for diverse real-world scenarios, ranging from daily applications like recommendation systems and question answering to cutting-edge technologies such as drug discovery in life sciences and n-body simulation in astrophysics. However, task performance is not the only requirement for GNNs. Performance-oriented GNNs have exhibited potential adverse effects like vulnerability to adversarial attacks, unexplainable discrimination against disadvantaged groups, or excessive resource consumption in edge computing environments. To avoid these unintentional harms, it is necessary to build competent GNNs characterised by trustworthiness. To this end, we propose a comprehensive roadmap to build trustworthy GNNs from the view of the various computing technologies involved. In this survey, we introduce basic concepts and comprehensively summarise existing efforts for trustworthy GNNs from six aspects, including robustness, explainability, privacy, fairness, accountability, and environmental well-being. Additionally, we highlight the intricate cross-aspect relations between the above six aspects of trustworthy GNNs. Finally, we present a thorough overview of trending directions for facilitating the research and industrialisation of trustworthy GNNs.

...read moreread less

29 citations

Journal Article•DOI•

The alignment problem from a deep learning perspective

[...]

Richard Ngo

30 Aug 2022-arXiv.org

TL;DR: The authors argue that if AGIs are trained in ways similar to today's most capable models, they could learn to act deceptively to receive higher reward, learn internally-represented goals which generalize beyond their training distributions, and pursue those goals using power-seeking strategies.

...read moreread less

Abstract: Within the coming decades, artificial general intelligence (AGI) may surpass human capabilities at a wide range of important tasks. We outline a case for expecting that, without substantial effort to prevent it, AGIs could learn to pursue goals which are undesirable (i.e. misaligned) from a human perspective. We argue that if AGIs are trained in ways similar to today's most capable models, they could learn to act deceptively to receive higher reward, learn internally-represented goals which generalize beyond their training distributions, and pursue those goals using power-seeking strategies. We outline how the deployment of misaligned AGIs might irreversibly undermine human control over the world, and briefly review research directions aimed at preventing this outcome.

...read moreread less

28 citations

Proceedings Article•DOI•

Measuring Forgetting of Memorized Training Examples

[...]

Matthew Jagielski, Om Thakkar, Florian Tramèr, Daphne Ippolito, Katherine Lee, Nicholas Carlini, Eric Wallace, Shuang Song, Abhradeep Guha Thakurta, Nicolas Papernot, Chiyuan Zhang - Show less +7 more

30 Jun 2022

TL;DR: It is shown that, while non-convexity can prevent forgetting from happening in the worst-case, standard image and speech models empirically do forget examples over time, and nondeterminism is identified as a potential explanation, showing that deterministically trained models do not forget.

...read moreread less

Abstract: Machine learning models exhibit two seemingly contradictory phenomena: training data memorization, and various forms of forgetting. In memorization, models overfit specific training examples and become susceptible to privacy attacks. In forgetting, examples which appeared early in training are forgotten by the end. In this work, we connect these phenomena. We propose a technique to measure to what extent models"forget"the specifics of training examples, becoming less susceptible to privacy attacks on examples they have not seen recently. We show that, while non-convex models can memorize data forever in the worst-case, standard image, speech, and language models empirically do forget examples over time. We identify nondeterminism as a potential explanation, showing that deterministically trained models do not forget. Our results suggest that examples seen early when training with extremely large datasets - for instance those examples used to pre-train a model - may observe privacy benefits at the expense of examples seen later.

...read moreread less

24 citations

Journal Article•DOI•

A Survey of Machine Unlearning

[...]

Thanh Tam Nguyen, Thanh Trung Huynh, Phi-Le Nguyen, Alan Wee-Chung Liew, Hongzhi Yin, Quoc Viet Hung Nguyen - Show less +2 more

06 Sep 2022-arXiv.org

TL;DR: This paper aspires to present a comprehensive examination of machine unlearning’s concepts, scenarios, methods, and applications as a category collection of cutting-edge studies to serve as a comprehensive resource for researchers and practitioners seeking an introduction to machine un learning.

...read moreread less

Abstract: Today, computer systems hold large amounts of personal data. Yet while such an abundance of data allows breakthroughs in artificial intelligence, and especially machine learning (ML), its existence can be a threat to user privacy, and it can weaken the bonds of trust between humans and AI. Recent regulations now require that, on request, private information about a user must be removed from both computer systems and from ML models, i.e. ``the right to be forgotten''). While removing data from back-end databases should be straightforward, it is not sufficient in the AI context as ML models often `remember' the old data. Contemporary adversarial attacks on trained models have proven that we can learn whether an instance or an attribute belonged to the training data. This phenomenon calls for a new paradigm, namely machine unlearning, to make ML models forget about particular data. It turns out that recent works on machine unlearning have not been able to completely solve the problem due to the lack of common frameworks and resources. Therefore, this paper aspires to present a comprehensive examination of machine unlearning's concepts, scenarios, methods, and applications. Specifically, as a category collection of cutting-edge studies, the intention behind this article is to serve as a comprehensive resource for researchers and practitioners seeking an introduction to machine unlearning and its formulations, design criteria, removal requests, algorithms, and applications. In addition, we aim to highlight the key findings, current trends, and new research areas that have not yet featured the use of machine unlearning but could benefit greatly from it. We hope this survey serves as a valuable resource for ML researchers and those seeking to innovate privacy technologies. Our resources are publicly available at https://github.com/tamlhp/awesome-machine-unlearning.

...read moreread less

20 citations

Proceedings Article•DOI•

Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations

[...]

Zirui Peng, Shaofeng Li, Guo-Yue Chen, Cheng Zhang, Haojin Zhu, Minhui Xue - Show less +2 more

17 Feb 2022

TL;DR: A UAP fingerprinting method for DNN models is proposed and an encoder via contrastive learning that takes fingerprints as inputs, outputs a similarity score is trained that has good generalizability across different model architectures and is robust against post-modifications on stolen models.

...read moreread less

Abstract: In this paper, we propose a novel and practical mechanism to enable the service provider to verify whether a suspect model is stolen from the victim model via model extraction attacks. Our key insight is that the profile of a DNN model's decision boundary can be uniquely characterized by its Universal Adversarial Perturbations (UAPs). UAPs belong to a low-dimensional subspace and piracy models' subspaces are more consistent with victim model's subspace compared with non-piracy model. Based on this, we propose a UAP fingerprinting method for DNN models and train an encoder via contrastive learning that takes fingerprints as inputs, outputs a similarity score. Extensive studies show that our framework can detect model Intellectual Property (IP) breaches with confidence > 99.99 % within only 20 fingerprints of the suspect model. It also has good generalizability across different model architectures and is robust against post-modifications on stolen models.

...read moreread less

13 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13

Collapse