SecureBoost: A Lossless Federated Learning Framework

doi:10.1109/MIS.2021.3082561

Open AccessJournal ArticleDOI

SecureBoost: A Lossless Federated Learning Framework

Kewei Cheng, +6 more

- 25 May 2021 -

IEEE Intelligent Systems

- pp 1-1

TLDR

The SecureBoost framework is shown to be as accurate as other nonfederated gradient tree-boosting algorithms that require centralized data, and thus, it is highly scalable and practical for industrial applications such as credit risk analysis.

Abstract:

The protection of user privacy is an important concern in machine learning, as evidenced by the rolling out of the General Data Protection Regulation (GDPR) in the European Union (EU) in May 2018 The GDPR is designed to give users more control over their personal data, which motivates us to explore machine learning frameworks for data sharing that do not violate user privacy To meet this goal, in this paper, we propose a novel lossless privacy-preserving tree-boosting system known as SecureBoost in the setting of federated learning This federated-learning system allows the learning process to be jointly conducted over multiple parties with partially common user samples but different feature sets, which corresponds to a vertically partitioned data set An advantage of SecureBoost is that it provides the same level of accuracy as the non privacy-preserving approach while at the same time, reveals no information of each private data provider We formally prove that the SecureBoost framework is as accurate as other non-federated gradient tree-boosting algorithms that concentrate data in one place In addition, we describe information leakage during the protocol execution and propose ways to provably reduce it

Citations

PDF

Open Access

More filters

Posted Content

Advances and Open Problems in Federated Learning

Peter Kairouz, +58 more

- 10 Dec 2019 -

arXiv: Learning

TL;DR: Motivated by the explosive growth in FL research, this paper discusses recent advances and presents an extensive collection of open problems and challenges.

...read moreread less

Journal ArticleDOI

A survey on security and privacy of federated learning

Viraaji Mothukuri, +6 more

- 01 Feb 2021 -

Future Generation Computer Systems

TL;DR: This paper aims to provide a comprehensive study concerning FL’s security and privacy aspects that can help bridge the gap between the current state of federated AI and a future in which mass adoption is possible.

...read moreread less

Journal ArticleDOI

Federated Learning for Healthcare Informatics

Jie Xu, +5 more

TL;DR: In this article, the authors provide a review of federated learning in the biomedical space, and summarize the general solutions to the statistical challenges, system challenges, and privacy issues in federated Learning, and point out the implications and potentials in healthcare.

...read moreread less

Posted Content

A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection

Qinbin Li, +7 more

- 23 Jul 2019 -

arXiv: Learning

TL;DR: A comprehensive review of federated learning systems can be found in this paper, where the authors provide a thorough categorization of the existing systems according to six different aspects, including data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation and motivation of federation.

...read moreread less

Posted Content

FedML: A Research Library and Benchmark for Federated Machine Learning

Chaoyang He, +16 more

- 27 Jul 2020 -

arXiv: Learning

TL;DR: FedML is introduced, an open research library and benchmark that facilitates the development of new federated learning algorithms and fair performance comparisons and can provide an efficient and reproducible means of developing and evaluating algorithms for the Federated learning research community.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, +1 more

TL;DR: XGBoost as discussed by the authors proposes a sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning to achieve state-of-the-art results on many machine learning challenges.

...read moreread less

Book ChapterDOI

Public-key cryptosystems based on composite degree residuosity classes

Pascal Paillier

TL;DR: A new trapdoor mechanism is proposed and three encryption schemes are derived : a trapdoor permutation and two homomorphic probabilistic encryption schemes computationally comparable to RSA, which are provably secure under appropriate assumptions in the standard model.

...read moreread less

Journal ArticleDOI

Additive Logistic Regression : A Statistical View of Boosting

Jerome H. Friedman, +2 more

- 01 Apr 2000 -

Annals of Statistics

TL;DR: This work shows that this seemingly mysterious phenomenon of boosting can be understood in terms of well-known statistical principles, namely additive modeling and maximum likelihood, and develops more direct approximations and shows that they exhibit nearly identical results to boosting.

...read moreread less

Book

The Algorithmic Foundations of Differential Privacy

Cynthia Dwork, +1 more

TL;DR: The preponderance of this monograph is devoted to fundamental techniques for achieving differential privacy, and application of these techniques in creative combinations, using the query-release problem as an ongoing example.

...read moreread less

Book ChapterDOI

Differential privacy: a survey of results

Cynthia Dwork

TL;DR: This survey recalls the definition of differential privacy and two basic techniques for achieving it, and shows some interesting applications of these techniques, presenting algorithms for three specific tasks and three general results on differentially private learning.

...read moreread less