Showing papers by "Raluca Ada Popa published in 2020"

PDF

Open Access

Proceedings Article•

Delphi: A Cryptographic Inference Service for Neural Networks

[...]

Pratyush Mishra¹, Ryan Lehmkuhl¹, Akshayaram Srinivasan¹, Wenting Zheng¹, Raluca Ada Popa¹ - Show less +1 more•Institutions (1)

University of California, Berkeley¹

01 Jan 2020

TL;DR: This work designs, implements, and evaluates DELPHI, a secure prediction system that allows two parties to execute neural network inference without revealing either party’s data, and develops a hybrid cryptographic protocol that improves upon the communication and computation costs over prior work.

...read moreread less

Abstract: Many companies provide neural network prediction services to users for a wide range of applications. However, current prediction systems compromise one party’s privacy: either the user has to send sensitive inputs to the service provider for classification, or the service provider must store its proprietary neural networks on the user’s device. The former harms the personal privacy of the user, while the latter reveals the service provider’s proprietary model. We design, implement, and evaluate DELPHI, a secure prediction system that allows two parties to execute neural network inference without revealing either party’s data. DELPHI approaches the problem by simultaneously co-designing cryptography and machine learning. We first design a hybrid cryptographic protocol that improves upon the communication and computation costs over prior work. Second, we develop a planner that automatically generates neural network architecture configurations that navigate the performance-accuracy trade-offs of our hybrid protocol. Together, these techniques allow us to achieve a 22× improvement in online prediction latency compared to the state-of-the-art prior work.

...read moreread less

234 citations

Journal Article•DOI•

The Seattle Report on Database Research

[...]

Daniel J. Abadi¹, Anastasia Ailamaki², David G. Andersen³, Peter Bailis⁴, Magdalena Balazinska⁵, Philip A. Bernstein⁶, Peter Boncz⁷, Surajit Chaudhuri⁶, Alvin Cheung⁵, AnHai Doan⁸, Luna Dong, Michael J. Franklin⁹, Juliana Freire, Alon Halevy¹⁰, Joseph M. Hellerstein¹¹, Stratos Idreos¹², Donald Kossmann⁶, Tim Kraska¹³, Sailesh Krishnamurthy¹⁴, Volker Markl¹⁵, Sergey Melnik⁶, Tova Milo¹⁶, Chandrasekaran Mohan¹⁷, Thomas Neumann¹⁸, Beng Chin Ooi¹⁹, Fatma Ozcan¹⁷, Jignesh M. Patel⁸, Andrew Pavlo³, Raluca Ada Popa¹¹, Raghu Ramakrishnan⁶, Christopher Ré⁴, Michael Stonebraker¹³, Dan Suciu⁵ - Show less +29 more•Institutions (19)

University of Maryland, College Park¹, École Polytechnique Fédérale de Lausanne², Carnegie Mellon University³, Stanford University⁴, University of Washington⁵, Microsoft⁶, Centrum Wiskunde & Informatica⁷, University of Wisconsin-Madison⁸, University of Chicago⁹, Facebook¹⁰, University of California, Berkeley¹¹, Harvard University¹², Massachusetts Institute of Technology¹³, Google¹⁴, Technical University of Berlin¹⁵, Tel Aviv University¹⁶, IBM¹⁷, Technische Universität München¹⁸, National University of Singapore¹⁹

25 Feb 2020

TL;DR: This report summarizes the discussion and conclusions of the 9th self-assessment meeting of database researchers, held during October 9-10, 2018 in Seattle.

...read moreread less

Abstract: Approximately every five years, a group of database researchers meet to do a self-assessment of our community, including reflections on our impact on the industry as well as challenges facing our research community. This report summarizes the discussion and conclusions of the 9th such meeting, held during October 9-10, 2018 in Seattle.

...read moreread less

61 citations

Proceedings Article•DOI•

Delphi: A Cryptographic Inference System for Neural Networks

[...]

Pratyush Mishra¹, Ryan Lehmkuhl¹, Akshayaram Srinivasan¹, Wenting Zheng¹, Raluca Ada Popa¹ - Show less +1 more•Institutions (1)

University of California, Berkeley¹

09 Nov 2020

TL;DR: This work designs and implements Delphi, a secure prediction system that allows two parties to execute neural network inference without revealing either party's data, and develops a planner that automatically generates neural network architecture configurations that navigate the performance-accuracy trade-offs of the hybrid protocol.

...read moreread less

Abstract: Many companies provide neural network prediction services to users for a wide range of applications. However, current prediction systems compromise one party's privacy: either the user has to send sensitive inputs to the service provider for classification, or the service provider must store its proprietary neural networks on the user's device. The former harms the personal privacy of the user, while the latter reveals the service provider's proprietary model.We design, implement, and evaluate Delphi, a secure prediction system that allows two parties to execute neural network inference without revealing either party's data. Delphi approaches the problem by simultaneously co-designing cryptography and machine learning. We first design a hybrid cryptographic protocol that improves upon the communication and computation costs over prior work. Second, we develop a planner that automatically generates neural network architecture configurations that navigate the performance-accuracy trade-offs of our hybrid protocol. Together, these techniques allow us to achieve a 22x improvement in online prediction latency compared to the state-of-the-art prior work.

...read moreread less

46 citations

Proceedings Article•

Visor: Privacy-Preserving Video Analytics as a Cloud Service.

[...]

Rishabh Poddar¹, Ganesh Ananthanarayanan¹, Srinath Setty¹, Stavros Volos¹, Raluca Ada Popa² - Show less +1 more•Institutions (2)

Microsoft¹, University of California, Berkeley²

12 Aug 2020

TL;DR: Visor as discussed by the authors is a system that provides confidentiality for the user's video stream as well as the ML models in the presence of a compromised cloud platform and untrusted co-tenants.

...read moreread less

Abstract: Video-analytics-as-a-service is becoming an important offering for cloud providers. A key concern in such services is privacy of the videos being analyzed. While trusted execution environments (TEEs) are promising options for preventing the direct leakage of private video content, they remain vulnerable to side-channel attacks. We present Visor, a system that provides confidentiality for the user’s video stream as well as the ML models in the presence of a compromised cloud platform and untrusted co-tenants. Visor executes video pipelines in a hybrid TEE that spans both the CPU and GPU. It protects the pipeline against side-channel attacks induced by data-dependent access patterns of video modules, and also addresses leakage in the CPU-GPU communication channel. Visor is up to 1000× faster than naive oblivious solutions, and its overheads relative to a non-oblivious baseline are limited to 2×–6×.

...read moreread less

30 citations

Posted Content•

DORY: An Encrypted Search System with Distributed Trust.

[...]

Emma Dauterman¹, Eric Feng¹, Ellen Luo¹, Raluca Ada Popa¹, Ion Stoica¹ - Show less +1 more•Institutions (1)

University of California, Berkeley¹

01 Jan 2020-IACR Cryptology ePrint Archive

TL;DR: DORY is designed and built, an encrypted search system that addresses real-world requirements and protects search access patterns and performs orders of magnitude better than a baseline built on ORAM.

...read moreread less

Abstract: Efficient, leakage-free search on encrypted data has remained an unsolved problem for the last two decades; efficient schemes are vulnerable to leakage-abuse attacks, and schemes that eliminate leakage are impractical to deploy. To overcome this tradeoff, we reexamine the system model. We surveyed five companies providing end-to-end encrypted filesharing to better understand what they require from an encrypted search system. Based on our findings, we design and build DORY, an encrypted search system that addresses real-world requirements and protects search access patterns; namely,when a user searches for a keyword over the fileswithin a folder, the server learns only that a search happens in that folder, but does not learn which documents match the search, the number of documents that match, or other information about the keyword. DORY splits trust betweenmultiple servers to protect against a malicious attacker who controls all but one of the servers. We develop new cryptographic and systems techniques to meet the efficiency and trust model requirements outlined by the companies we surveyed. We implement DORY and show that it performs orders of magnitude better than a baseline built on ORAM. Parallelized across 8 servers, each with 16 CPUs, DORY takes 116ms to search roughly 50K documents and 862ms to search over 1M documents.

...read moreread less

28 citations

Proceedings Article•DOI•

Metal: A Metadata-Hiding File-Sharing System.

[...]

Weikeng Chen¹, Raluca Ada Popa²•Institutions (2)

University of Science and Technology of China¹, University of California, Berkeley²

01 Jan 2020

TL;DR: Metal is the first file-sharing system that hides metadata from malicious users and that has a latency of only a few seconds, which is 500× faster (in terms of amortized latency) or 10× faster than PIR-MCORAM, which does not hide user identities.

...read moreread less

26 citations

Proceedings Article•DOI•

Oblivious coopetitive analytics using hardware enclaves

[...]

Ankur Dave¹, Chester Leung¹, Raluca Ada Popa¹, Joseph E. Gonzalez¹, Ion Stoica¹ - Show less +1 more•Institutions (1)

University of California, Berkeley¹

15 Apr 2020

TL;DR: Oblivious Coopetitive Queries (OCQ), an efficient, general framework for oblivious coopetitive analytics using hardware enclaves, is proposed and implemented as an extension to Apache Spark SQL, finding that OCQ is up to 9.9x faster than Opaque, a state-of-the-art secure analytics framework which outsources all data and computation to an enclave-enabled cloud.

...read moreread less

Abstract: Coopetitive analytics refers to cooperation among competing parties to run queries over their joint data. Regulatory, business, and liability concerns prevent these organizations from sharing their sensitive data in plaintext. We propose Oblivious Coopetitive Queries (OCQ), an efficient, general framework for oblivious coopetitive analytics using hardware enclaves. OCQ builds on Opaque, a Spark-based framework for secure distributed analytics, to execute coopetitive queries using hardware enclaves in a decentralized manner. Its query planner chooses how and where to execute each relational operator to prevent data leakage through side channels such as memory access patterns, network traffic statistics, and cardinality, while minimizing overhead. We implemented OCQ as an extension to Apache Spark SQL. We find that OCQ is up to 9.9x faster than Opaque, a state-of-the-art secure analytics framework which outsources all data and computation to an enclave-enabled cloud; and is up to 219x faster than implementing analytics using AgMPC, a state-of-the-art secure multi-party computation framework.

...read moreread less

25 citations

Proceedings Article•

Civet: An Efficient Java Partitioning Framework for Hardware Enclaves

[...]

Chia-Che Tsai¹, Jeongseok Son², Bhushan P. Jain³, John McAvey⁴, Raluca Ada Popa², Donald E. Porter³ - Show less +2 more•Institutions (4)

Texas A&M University¹, University of California, Berkeley², University of North Carolina at Chapel Hill³, Hendrix College⁴

01 Jan 2020

TL;DR: Civet is a framework for partitioning Java applications into enclaves that reduces the number of lines of code in the enclave and uses language-level defenses, including deep type checks and dynamic taint-tracking, to harden the enclave interface.

...read moreread less

Abstract: Hardware enclaves are designed to execute small pieces of sensitive code or to operate on sensitive data, in isolation from larger, less trusted systems. Partitioning a large, legacy application requires significant effort. Partitioning an application written in a managed language, such as Java, is more challenging because of mutable language characteristics, extensive code reachability in class libraries, and the inevitability of using a heavyweight runtime. Civet is a framework for partitioning Java applications into enclaves. Civet reduces the number of lines of code in the enclave and uses language-level defenses, including deep type checks and dynamic taint-tracking, to harden the enclave interface. Civet also contributes a partitioned Java runtime design, including a garbage collection design optimized for the peculiarities of enclaves. Civet is efficient for data-intensive workloads; partitioning a Hadoop mapper reduces the enclave overhead from 10× to 16–22% without taint-tracking or 70–80% with taint-tracking.

...read moreread less

25 citations

Proceedings Article•DOI•

Secure Collaborative Training and Inference for XGBoost

[...]

Andrew Law¹, Chester Leung¹, Rishabh Poddar¹, Raluca Ada Popa¹, Chenyu Shi¹, Octavian Sima¹, Chaofan Yu, Xingmeng Zhang, Wenting Zheng¹ - Show less +5 more•Institutions (1)

University of California, Berkeley¹

09 Nov 2020

TL;DR: This work proposes Secure XGBoost, a privacy-preserving system that enables multiparty training and inference of X GBoost models and augments the security of the enclaves using novel data-oblivious algorithms that prevent access side-channel attacks on enclaves induced via access pattern leakage.

...read moreread less

Abstract: In recent years, gradient boosted decision tree learning has proven to be an effective method of training robust models. Moreover, collaborative learning among multiple parties has the potential to greatly benefit all parties involved, but organizations have also encountered obstacles in sharing sensitive data due to business, regulatory, and liability concerns.We propose Secure XGBoost, a privacy-preserving system that enables multiparty training and inference of XGBoost models. Secure XGBoost protects the privacy of each party's data as well as the integrity of the computation with the help of hardware enclaves. Crucially, Secure XGBoost augments the security of the enclaves using novel data-oblivious algorithms that prevent access side-channel attacks on enclaves induced via access pattern leakage.

...read moreread less

24 citations

Proceedings Article•

Ghostor: Toward a Secure Data-Sharing System from Decentralized Trust

[...]

Yuncong Hu¹, Sam Kumar¹, Raluca Ada Popa¹•Institutions (1)

University of California, Berkeley¹

01 Jan 2020

TL;DR: This work proposes Ghostor, a data-sharing system that, using only decentralized trust, hides user identities from the server, and allows users to detect server-side integrity violations, and develops a technique called verifiable anonymous history.

...read moreread less

Abstract: Data-sharing systems are often used to store sensitive data. Both academia and industry have proposed numerous solutions to protect the user privacy and data integrity from a compromised server. Practical state-of-the-art solutions, however, use weak threat models based on centralized trust—they assume that part of the server will remain uncompromised, or that the adversary will not perform active attacks. We propose Ghostor, a data-sharing system that, using only decentralized trust, (1) hides user identities from the server, and (2) allows users to detect server-side integrity violations. To achieve (1), Ghostor avoids keeping any per-user state at the server, requiring us to redesign the system to avoid common paradigms like per-user authentication and user-specific mailboxes. To achieve (2), Ghostor develops a technique called verifiable anonymous history. Ghostor leverages a blockchain rarely, publishing only a single hash to the blockchain for the entire system once every epoch. We measured that Ghostor incurs a 4–5x throughput overhead compared to an insecure baseline. Although significant, Ghostor’s overhead may be worth it for securityand privacy-sensitive applications.

...read moreread less

21 citations

Posted Content•

Practical Volume-Based Attacks on Encrypted Databases

[...]

Rishabh Poddar¹, Stephanie Wang¹, Jianan Lu², Raluca Ada Popa¹•Institutions (2)

University of California, Berkeley¹, Princeton University²

15 Aug 2020-arXiv: Cryptography and Security

TL;DR: In this article, the authors present new attacks for recovering the content of individual user queries, assuming no leakage from the system except the number of results and avoiding the limiting assumptions that are unrealistic in practice, such as requiring a large number of queries to be issued by the user, or assuming certain distributions on the queries or underlying data.

...read moreread less

Abstract: Recent years have seen an increased interest towards strong security primitives for encrypted databases (such as oblivious protocols), that hide the access patterns of query execution, and reveal only the volume of results. However, recent work has shown that even volume leakage can enable the reconstruction of entire columns in the database. Yet, existing attacks rely on a set of assumptions that are unrealistic in practice: for example, they (i) require a large number of queries to be issued by the user, or (ii) assume certain distributions on the queries or underlying data (e.g., that the queries are distributed uniformly at random, or that the database does not contain missing values). In this work, we present new attacks for recovering the content of individual user queries, assuming no leakage from the system except the number of results and avoiding the limiting assumptions above. Unlike prior attacks, our attacks require only a single query to be issued by the user for recovering the keyword. Furthermore, our attacks make no assumptions about the distribution of issued queries or the underlying data. Instead, our key insight is to exploit the behavior of real-world applications. We start by surveying 11 applications to identify two key characteristics that can be exploited by attackers: (i) file injection, and (ii) automatic query replay. We present attacks that leverage these two properties in concert with volume leakage, independent of the details of any encrypted database system. Subsequently, we perform an attack on the real Gmail web client by simulating a server-side adversary. Our attack on Gmail completes within a matter of minutes, demonstrating the feasibility of our techniques. We also present three ancillary attacks for situations when certain mitigation strategies are employed.

...read moreread less

Proceedings Article•DOI•

Practical Volume-Based Attacks on Encrypted Databases

[...]

Rishabh Poddar¹, Stephanie Wang¹, Jianan Lu², Raluca Ada Popa¹•Institutions (2)

University of California, Berkeley¹, Princeton University²

01 Sep 2020

TL;DR: In this paper, the authors present new attacks for recovering the content of individual user queries assuming no leakage from the system except the number of results and avoiding the limiting assumptions that are unrealistic in practice for example they (i) require a large number of queries to be issued by the user or (ii) assume certain distributions on the queries or underlying data.

...read moreread less

Abstract: Recent years have seen an increased interest towards strong security primitives for encrypted databases (such as oblivious protocols) that hide the access patterns of query execution and reveal only the volume of results. However recent work has shown that even volume leakage can enable the reconstruction of entire columns in the database. Yet existing attacks rely on a set of assumptions that are unrealistic in practice for example they (i) require a large number of queries to be issued by the user or (ii) assume certain distributions on the queries or underlying data (e.g. that the queries are distributed uniformly at random or that the database does not contain missing values). In this work we present new attacks for recovering the content of individual user queries assuming no leakage from the system except the number of results and avoiding the limiting assumptions above. Unlike prior attacks our attacks require only a single query to be issued by the user for recovering the keyword. Furthermore our attacks make no assumptions about the distribution of issued queries or the underlying data. Instead our key insight is to exploit the behavior of real-world applications. We start by surveying 11 applications to identify two key characteristics that can be exploited by attackers-(l) file injection and (ii) automatic query replay. We present attacks that leverage these two properties in concert with volume leakage independent of the details of any encrypted database system. Subsequently we perform an attack on the real Gmail web client by simulating a server-side adversary. Our attack on Gmail completes within a matter of minutes demonstrating the feasibility of our techniques. We also present three ancillary attacks for situations when certain mitigation strategies are employed.

...read moreread less

Posted Content•

Visor: Privacy-Preserving Video Analytics as a Cloud Service

[...]

Rishabh Poddar¹, Ganesh Ananthanarayanan¹, Srinath Setty², Stavros Volos², Raluca Ada Popa² - Show less +1 more•Institutions (2)

University of California, Berkeley¹, Microsoft²

17 Jun 2020-arXiv: Cryptography and Security

TL;DR: Visor is a system that provides confidentiality for the user's video stream as well as the ML models in the presence of a compromised cloud platform and untrusted co-tenants and protects the pipeline against side-channel attacks induced by data-dependent access patterns of video modules, and also addresses leakage in the CPU-GPU communication channel.

...read moreread less

Abstract: Video-analytics-as-a-service is becoming an important offering for cloud providers A key concern in such services is privacy of the videos being analyzed While trusted execution environments (TEEs) are promising options for preventing the direct leakage of private video content, they remain vulnerable to side-channel attacks We present Visor, a system that provides confidentiality for the user's video stream as well as the ML models in the presence of a compromised cloud platform and untrusted co-tenants Visor executes video pipelines in a hybrid TEE that spans both the CPU and GPU It protects the pipeline against side-channel attacks induced by data-dependent access patterns of video modules, and also addresses leakage in the CPU-GPU communication channel Visor is up to $1000\times$ faster than naive oblivious solutions, and its overheads relative to a non-oblivious baseline are limited to $2\times$--$6\times$

...read moreread less

Journal Article•

Metal: A Metadata-Hiding File-Sharing System.

[...]

Weikeng Chen¹, Raluca Ada Popa²•Institutions (2)

University of Science and Technology of China¹, University of California, Berkeley²

01 Jan 2020-IACR Cryptology ePrint Archive

TL;DR: The first file-sharing system that hides metadata from malicious users and has a latency of only a few seconds is Metal as discussed by the authors, which consists of a new two-server multi-user oblivious RAM (ORAM) scheme, a metadata-hiding access control protocol, and a capability sharing protocol.

...read moreread less

Abstract: File-sharing systems like Dropbox offer insufficient privacy because a compromised server can see the file contents in the clear. Although encryption can hide such contents from the servers, metadata leakage remains significant. The goal of our work is to develop a file-sharing system that hides metadata— including user identities and file access patterns. Metal is the first file-sharing system that hides such metadata from malicious users and that has a latency of only a few seconds. The core of Metal consists of a new two-server multi-user oblivious RAM (ORAM) scheme, which is secure against malicious users, a metadata-hiding access control protocol, and a capability sharing protocol. Compared with the state-of-the-art malicious-user filesharing scheme PIR-MCORAM (Maffei et al.’17), which does not hide user identities, Metal hides the user identities and is 500× faster (in terms of amortized latency) or 10× faster (in terms of worst-case latency).

...read moreread less

Posted Content•

Ghostor: Toward a Secure Data-Sharing System from Decentralized Trust.

[...]

Yuncong Hu, Sam Kumar, Raluca Ada Popa

01 Jan 2020-IACR Cryptology ePrint Archive

Posted Content•

Secure Collaborative Training and Inference for XGBoost

[...]

Andrew Law¹, Chester Leung¹, Rishabh Poddar¹, Raluca Ada Popa¹, Chenyu Shi¹, Octavian Sima¹, Chaofan Yu, Xingmeng Zhang, Wenting Zheng¹ - Show less +5 more•Institutions (1)

University of California, Berkeley¹

06 Oct 2020-arXiv: Cryptography and Security

TL;DR: Secure XGBoost as mentioned in this paper protects the privacy of each party's data as well as the integrity of the computation with the help of hardware enclaves and augments the security of the enclaves using novel data-oblivious algorithms that prevent access side-channel attacks on enclaves induced via access pattern leakage.

...read moreread less

Abstract: In recent years, gradient boosted decision tree learning has proven to be an effective method of training robust models. Moreover, collaborative learning among multiple parties has the potential to greatly benefit all parties involved, but organizations have also encountered obstacles in sharing sensitive data due to business, regulatory, and liability concerns. We propose Secure XGBoost, a privacy-preserving system that enables multiparty training and inference of XGBoost models. Secure XGBoost protects the privacy of each party's data as well as the integrity of the computation with the help of hardware enclaves. Crucially, Secure XGBoost augments the security of the enclaves using novel data-oblivious algorithms that prevent access side-channel attacks on enclaves induced via access pattern leakage.

...read moreread less

Posted Content•

Delphi: A Cryptographic Inference Service for Neural Networks.

[...]

Pratyush Mishra¹, Ryan Lehmkuhl¹, Akshayaram Srinivasan¹, Wenting Zheng¹, Raluca Ada Popa¹ - Show less +1 more•Institutions (1)

University of California, Berkeley¹

01 Jan 2020-IACR Cryptology ePrint Archive

Proceedings Article•

DORY: An Encrypted Search System with Distributed Trust.

[...]

Emma Dauterman¹, Eric Feng¹, Ellen Luo¹, Raluca Ada Popa¹, Ion Stoica¹ - Show less +1 more•Institutions (1)

University of California, Berkeley¹

01 Jan 2020

Posted Content•

Senate: A Maliciously-Secure MPC Platform for Collaborative Analytics

[...]

Rishabh Poddar¹, Sukrit Kalra¹, Avishay Yanai², Ryan Deng¹, Raluca Ada Popa¹, Joseph M. Hellerstein¹ - Show less +2 more•Institutions (2)

University of California, Berkeley¹, VMware²

26 Oct 2020-arXiv: Cryptography and Security

TL;DR: In this article, the authors propose a secure multi-party computation (MPC) protocol that allows multiple parties to collaboratively run analytical SQL queries without revealing their individual data to each other.

...read moreread less

Abstract: Many organizations stand to benefit from pooling their data together in order to draw mutually beneficial insights -- e.g., for fraud detection across banks, better medical studies across hospitals, etc. However, such organizations are often prevented from sharing their data with each other by privacy concerns, regulatory hurdles, or business competition. We present Senate, a system that allows multiple parties to collaboratively run analytical SQL queries without revealing their individual data to each other. Unlike prior works on secure multi-party computation (MPC) that assume that all parties are semi-honest, Senate protects the data even in the presence of malicious adversaries. At the heart of Senate lies a new MPC decomposition protocol that decomposes the cryptographic MPC computation into smaller units, some of which can be executed by subsets of parties and in parallel, while preserving its security guarantees. Senate then provides a new query planning algorithm that decomposes and plans the cryptographic computation effectively, achieving a performance of up to 145$\times$ faster than the state-of-the-art.

...read moreread less