Author

Chulin Xie

Other affiliations: Zhejiang University
Bio: Chulin Xie is an academic researcher from the University of Illinois at Urbana–Champaign. The author has contributed to research on topics including computer science and backdoor attacks, has an h-index of 4, and has co-authored 10 publications receiving 186 citations. Previous affiliations of Chulin Xie include Zhejiang University.

Papers
Proceedings Article
30 Apr 2020
TL;DR: The distributed backdoor attack (DBA) is proposed: a novel threat assessment framework that fully exploits the distributed nature of FL and can evade two state-of-the-art robust FL algorithms designed against centralized backdoors.
Abstract: Backdoor attacks aim to manipulate a subset of training data by injecting adversarial triggers such that machine learning models trained on the tampered dataset will make arbitrary (targeted) incorrect predictions on test examples embedding the same trigger. While federated learning (FL) is capable of aggregating information provided by different parties to train a better model, its distributed learning methodology and inherently heterogeneous data distribution across parties may bring new vulnerabilities. In addition to recent centralized backdoor attacks on FL, where each party embeds the same global trigger during training, we propose the distributed backdoor attack (DBA), a novel threat assessment framework developed by fully exploiting the distributed nature of FL. DBA decomposes a global trigger pattern into separate local patterns and embeds them into the training sets of different adversarial parties. Compared to standard centralized backdoors, we show that DBA is substantially more persistent and stealthy against FL on diverse datasets such as finance and image data. We conduct extensive experiments to show that the attack success rate of DBA is significantly higher than that of centralized backdoors under different settings. Moreover, we find that distributed attacks are indeed more insidious, as DBA can evade two state-of-the-art robust FL algorithms designed against centralized backdoors. We also provide explanations for the effectiveness of DBA via feature visual interpretation and feature importance ranking. To further explore the properties of DBA, we test the attack performance under varying trigger factors, including local trigger variations (size, gap, and location), the scaling factor in FL, data distribution, and poison ratio and interval. Our proposed DBA and thorough evaluation results shed light on characterizing the robustness of FL.
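DBA's key mechanism is this decomposition: each adversarial party poisons with only its own local piece, yet at test time the full global trigger activates the backdoor. As a rough illustration of the idea (a sketch, not the authors' code; the square pixel trigger and the 2x2 quadrant split are assumptions, and the paper also varies trigger size, gap, and location):

```python
import numpy as np

def split_global_trigger(trigger_mask: np.ndarray, n_parties: int = 4):
    """Split a global trigger mask (h, w) into local patterns, one per party.

    Assumption: a 2x2 quadrant split of a square trigger; the union of the
    returned local patterns reconstructs the global trigger.
    """
    h, w = trigger_mask.shape
    hh, hw = h // 2, w // 2
    local_patterns = []
    for i in range(min(n_parties, 4)):
        r, c = divmod(i, 2)
        rows, cols = slice(r * hh, (r + 1) * hh), slice(c * hw, (c + 1) * hw)
        local = np.zeros_like(trigger_mask)
        local[rows, cols] = trigger_mask[rows, cols]
        local_patterns.append(local)
    return local_patterns

def poison_example(image: np.ndarray, local_pattern: np.ndarray,
                   trigger_value: float = 1.0) -> np.ndarray:
    """Stamp one party's local trigger onto a training image
    (the corresponding label is switched to the attacker's target class)."""
    return np.where(local_pattern > 0, trigger_value, image)
```

At inference, the attacker stamps the full trigger mask onto a test example, even though no single party ever trained with the global trigger in one piece.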

310 citations

Proceedings ArticleDOI
03 Mar 2021
TL;DR: Proposes a Style-based Point Generator with Adversarial Rendering (SpareNet) for point cloud completion, which applies adversarial training to promote perceptual realism under different viewpoints.
Abstract: In this paper, we propose a novel Style-based Point Generator with Adversarial Rendering (SpareNet) for point cloud completion. First, we present the channel-attentive EdgeConv to fully exploit both the local structures and the global shape in point features. Second, we observe that the concatenation used by vanilla folding limits its potential to generate complex and faithful shapes. Inspired by the success of StyleGAN, we instead regard the shape feature as a style code that modulates the normalization layers during folding, which considerably enhances its capability. Third, we recognize that existing point supervisions, e.g., Chamfer Distance or Earth Mover's Distance, cannot faithfully reflect the perceptual quality of the reconstructed points. To address this, we project the completed points to depth maps with a differentiable renderer and apply adversarial training to promote perceptual realism under different viewpoints. Comprehensive experiments on ShapeNet and KITTI demonstrate the effectiveness of our method, which achieves state-of-the-art quantitative performance while offering superior visual quality.
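The style-modulation step can be pictured as an AdaIN-style layer: the shape code is projected to per-channel scale and shift that modulate normalized point features inside the folding decoder. A minimal PyTorch sketch under those assumptions (layer names and dimensions are illustrative, not SpareNet's actual modules):

```python
import torch
import torch.nn as nn

class StyleModulatedNorm(nn.Module):
    """Instance-norm layer whose affine parameters are predicted from a shape
    code, in the spirit of StyleGAN's AdaIN; a simplified stand-in for the
    modulated normalization inside SpareNet's folding decoder."""

    def __init__(self, channels: int, style_dim: int):
        super().__init__()
        self.norm = nn.InstanceNorm1d(channels, affine=False)
        self.to_scale = nn.Linear(style_dim, channels)
        self.to_shift = nn.Linear(style_dim, channels)

    def forward(self, x: torch.Tensor, style: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, num_points); style: (batch, style_dim)
        scale = self.to_scale(style).unsqueeze(-1)  # (batch, channels, 1)
        shift = self.to_shift(style).unsqueeze(-1)
        return (1 + scale) * self.norm(x) + shift
```

Because the shape code touches every folding layer through these scales and shifts, it steers the whole decoding process rather than being diluted by a one-time concatenation.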

58 citations

Posted Content
TL;DR: This work presents a novel aggregation algorithm with residual-based reweighting to defend federated learning and demonstrates that it outperforms alternative algorithms in the presence of label-flipping, backdoor, and Gaussian noise attacks.
Abstract: Federated learning has a variety of applications in multiple domains by utilizing private training data stored on different devices. However, the aggregation process in federated learning is highly vulnerable to adversarial attacks, so the global model may behave abnormally under attack. To tackle this challenge, we present a novel aggregation algorithm with residual-based reweighting to defend federated learning. Our aggregation algorithm combines repeated median regression with the reweighting scheme of iteratively reweighted least squares. Our experiments show that our aggregation algorithm outperforms alternative algorithms in the presence of label-flipping and backdoor attacks. We also provide theoretical analysis for our aggregation algorithm.
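In simplified form, residual-based reweighting scores each client's parameters by their residual from a robust center and downweights outliers before averaging. The sketch below substitutes a coordinate-wise median for the paper's repeated median regression and uses a hard confidence threshold; it illustrates the reweighting idea, not the published algorithm:

```python
import numpy as np

def residual_reweighted_aggregate(updates: np.ndarray, c: float = 2.0) -> np.ndarray:
    """Aggregate client updates of shape (n_clients, n_params).

    Simplified sketch: robust center = coordinate-wise median, scale = median
    absolute deviation, and weights fall to zero once the standardized
    residual exceeds the confidence threshold c.
    """
    center = np.median(updates, axis=0)
    residuals = updates - center
    mad = np.median(np.abs(residuals), axis=0) + 1e-12  # robust scale, floored
    std_resid = np.abs(residuals) / mad
    weights = np.clip(c - std_resid, 0.0, None)         # downweight outliers
    weights_sum = weights.sum(axis=0) + 1e-12
    return (weights * updates).sum(axis=0) / weights_sum
```

A poisoned client whose parameters sit far from the robust center thus contributes little or nothing to the aggregated model, while honest clients near the center keep roughly uniform weight.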

54 citations

Posted Content
TL;DR: In this article, the authors systematically categorize and discuss a wide range of dataset vulnerabilities and exploits, approaches for defending against these threats, and an array of open problems in this space.
Abstract: As machine learning systems grow in scale, so do their training data requirements, forcing practitioners to automate and outsource the curation of training data in order to achieve state-of-the-art performance. The absence of trustworthy human supervision over the data collection process exposes organizations to security vulnerabilities: training data can be manipulated to control and degrade the downstream behaviors of learned models. The goal of this work is to systematically categorize and discuss a wide range of dataset vulnerabilities and exploits, approaches for defending against these threats, and an array of open problems in this space. In addition to describing various poisoning and backdoor threat models and the relationships among them, we develop a unified taxonomy.
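As a concrete instance of the simplest threat model such surveys cover, a dirty-label (label-flipping) poison just relabels a fraction of one class before training; a minimal sketch (ours, not from the paper):

```python
import numpy as np

def label_flip_poison(labels: np.ndarray, source: int, target: int,
                      poison_fraction: float,
                      rng: np.random.Generator) -> np.ndarray:
    """Flip a fraction of `source`-class labels to `target` before training."""
    poisoned = labels.copy()
    source_idx = np.flatnonzero(labels == source)
    n_poison = int(poison_fraction * len(source_idx))
    chosen = rng.choice(source_idx, size=n_poison, replace=False)
    poisoned[chosen] = target
    return poisoned
```

Backdoor attacks differ in that they also modify the inputs (with a trigger) rather than only the labels, which is what makes them survive clean-data evaluation.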

41 citations

Xin Yan, Lin Li, Chulin Xie, Jun Xiao, Lin Gu 
01 Jan 2019
TL;DR: A novel convolutional neural network based on the VGG16 network and a Global Average Pooling strategy extracts visual features that effectively capture medical image characteristics on the small training set of ImageCLEF 2019.
Abstract: This paper describes the submission of Zhejiang University for the Visual Question Answering task in the medical domain (VQA-Med) of ImageCLEF 2019 [2]. We propose a novel convolutional neural network (CNN) based on the VGG16 network and a Global Average Pooling strategy to extract visual features. Our proposed CNN is able to effectively capture medical image features on a small training set. The semantic features of the question are encoded by a BERT model. We then leverage a co-attention mechanism to fuse the two feature sets, enhanced with jointly learned attention. The resulting vectors are fed to a decoder that predicts the answer as a classification task. Our model achieves 0.624 in accuracy and 0.644 in BLEU, ranking first among all participating groups in the ImageCLEF 2019 VQA-Med task [2].
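A condensed sketch of the described pipeline, with the question encoder abstracted to a fixed-size vector (BERT in the paper) and a toy gate standing in for the co-attention fusion; layer dimensions and fusion details are assumptions:

```python
import torch
import torch.nn as nn
from torchvision.models import vgg16

class VQAMedSketch(nn.Module):
    """Simplified VQA-Med pipeline: VGG16 conv features with global average
    pooling for the image, a precomputed question vector (from BERT in the
    paper), a toy attention gate for fusion, and answer classification."""

    def __init__(self, question_dim: int = 768, num_answers: int = 100):
        super().__init__()
        self.cnn = vgg16().features        # conv blocks; load pretrained weights in practice
        self.pool = nn.AdaptiveAvgPool2d(1)  # global average pooling
        self.img_proj = nn.Linear(512, question_dim)
        self.att = nn.Linear(question_dim, 1)
        self.classifier = nn.Linear(question_dim, num_answers)

    def forward(self, image: torch.Tensor, question_vec: torch.Tensor) -> torch.Tensor:
        feat = self.pool(self.cnn(image)).flatten(1)  # (batch, 512)
        img = self.img_proj(feat)                     # (batch, question_dim)
        fused = img * question_vec                    # toy stand-in for co-attention
        gate = torch.sigmoid(self.att(fused))
        return self.classifier(gate * fused)          # answer logits
```

Global average pooling is the key choice for the small-data regime: it collapses each feature map to one value, removing the large fully connected layers of VGG16 that would otherwise overfit.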

25 citations


Cited by
01 Jan 2016
Introduction to Robust Estimation and Hypothesis Testing

968 citations

Journal ArticleDOI
TL;DR: This paper aims to provide a comprehensive study concerning FL’s security and privacy aspects that can help bridge the gap between the current state of federated AI and a future in which mass adoption is possible.

565 citations

Posted Content
TL;DR: This paper provides a comprehensive review of federated learning systems, with a thorough categorization of existing systems along six aspects: data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation, and motivation of federation.
Abstract: Federated learning has been a hot research topic, enabling the collaborative training of machine learning models among different organizations under privacy restrictions. As researchers try to support more machine learning models with different privacy-preserving approaches, there is a need for systems and infrastructure that ease the development of various federated learning algorithms. Just as deep learning systems such as PyTorch and TensorFlow boost the development of deep learning, federated learning systems (FLSs) are equally important, and face challenges in effectiveness, efficiency, and privacy. In this survey, we conduct a comprehensive review of federated learning systems. To provide a smooth flow and guide future research, we introduce the definition of federated learning systems and analyze the system components. Moreover, we provide a thorough categorization of federated learning systems according to six different aspects: data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation, and motivation of federation. The categorization can help the design of federated learning systems, as shown in our case studies. By systematically summarizing the existing federated learning systems, we present the design factors, case studies, and future research opportunities.
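The six aspects read naturally as fields of a system profile; a hypothetical record type for illustration (field names and example values are ours, not the survey's):

```python
from dataclasses import dataclass

@dataclass
class FLSystemProfile:
    """Hypothetical record mirroring the survey's six categorization aspects."""
    data_distribution: str           # e.g. "horizontal" or "vertical"
    ml_model: str                    # e.g. "linear model", "tree", "neural network"
    privacy_mechanism: str           # e.g. "differential privacy", "secure aggregation"
    communication_architecture: str  # e.g. "centralized" or "decentralized"
    federation_scale: str            # e.g. "cross-device" or "cross-silo"
    federation_motivation: str       # e.g. "regulation", "incentive"

# Example: profiling a cross-device system with a central server.
profile = FLSystemProfile("horizontal", "neural network", "secure aggregation",
                          "centralized", "cross-device", "incentive")
```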

305 citations

Book ChapterDOI
23 Aug 2020
TL;DR: Proposes Refool, a new type of backdoor attack inspired by a natural phenomenon, reflection, that plants reflections as a backdoor in a victim model; Refool can attack state-of-the-art DNNs with a high success rate and is resistant to state-of-the-art backdoor defenses.
Abstract: Recent studies have shown that DNNs can be compromised by backdoor attacks crafted at training time. A backdoor attack installs a backdoor into the victim model by injecting a backdoor pattern into a small proportion of the training data. At test time, the victim model behaves normally on clean test data, yet consistently predicts a specific (likely incorrect) target class whenever the backdoor pattern is present in a test example. While existing backdoor attacks are effective, they are not stealthy: the modifications made to training data or labels are often suspicious and can be easily detected by simple data filtering or human inspection. In this paper, we present a new type of backdoor attack inspired by an important natural phenomenon: reflection. Using mathematical modeling of physical reflection models, we propose the reflection backdoor (Refool) to plant reflections as a backdoor in a victim model. We demonstrate on 3 computer vision tasks and 5 datasets that Refool can attack state-of-the-art DNNs with a high success rate and is resistant to state-of-the-art backdoor defenses.
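At its simplest, a reflection backdoor can be pictured as alpha-blending a reflection image into a clean image; Refool's actual physical models additionally account for effects such as blur and ghosting, so the sketch below is a crude stand-in:

```python
import numpy as np

def reflection_poison(image: np.ndarray, reflection: np.ndarray,
                      alpha: float = 0.7) -> np.ndarray:
    """Blend a reflection image into a clean image as a stealthy trigger.

    Simplified stand-in for Refool's physical reflection models; inputs are
    same-shaped arrays with pixel values in [0, 1].
    """
    poisoned = alpha * image + (1.0 - alpha) * reflection
    return np.clip(poisoned, 0.0, 1.0)
```

Because reflections occur naturally in photographs, the blended images look plausible to human inspectors, which is what makes this trigger stealthier than an artificial pixel pattern.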

279 citations

Posted Content
TL;DR: Introduces FedML, an open research library and benchmark that facilitates the development of new federated learning algorithms and fair performance comparisons, providing an efficient and reproducible means of developing and evaluating algorithms for the federated learning research community.
Abstract: Federated learning (FL) is a rapidly growing research field in machine learning. However, existing FL libraries cannot adequately support diverse algorithmic development; inconsistent dataset and model usage make fair algorithm comparison challenging. In this work, we introduce FedML, an open research library and benchmark to facilitate FL algorithm development and fair performance comparison. FedML supports three computing paradigms: on-device training for edge devices, distributed computing, and single-machine simulation. FedML also promotes diverse algorithmic research with flexible and generic API design and comprehensive reference baseline implementations (optimizer, models, and datasets). We hope FedML could provide an efficient and reproducible means for developing and evaluating FL algorithms that would benefit the FL research community. We maintain the source code, documents, and user community at this https URL.

275 citations