Search or ask a question

Showing papers by "Jatin Chhugani published in 2016"

PDF

Open Access

Proceedings Article•DOI•

Matrix factorizations at scale: A comparison of scientific data analytics in spark and C+MPI using three case studies

[...]

Alex Gittens¹, Aditya Devarakonda¹, Evan Racah², Michael F. Ringenburg³, L. Gerhardt², Jey Kottalam¹, Jialin Liu², Kristyn Maschhoff³, Shane Canon², Jatin Chhugani, Pramod Sharma³, Jiyan Yang⁴, James Demmel¹, Jim Harrell³, Venkat Krishnamurthy³, Michael W. Mahoney¹, Prabhat² - Show less +13 more•Institutions (4)

University of California, Berkeley¹, Lawrence Berkeley National Laboratory², Cray³, Stanford University⁴

12 May 2016

TL;DR: In this article, the authors explore the trade-offs of performing linear algebra using Apache Spark compared to traditional C and MPI implementations on HPC platforms, and apply these methods to 1.6TB particle physics, 2.2TB and 16TB climate modeling and 1.1TB bioimaging data.

...read moreread less

Abstract: We explore the trade-offs of performing linear algebra using Apache Spark, compared to traditional C and MPI implementations on HPC platforms. Spark is designed for data analytics on cluster computing platforms with access to local disks and is optimized for data-parallel tasks. We examine three widely-used and important matrix factorizations: NMF (for physical plausability), PCA (for its ubiquity) and CX (for data interpretability). We apply these methods to 1.6TB particle physics, 2.2TB and 16TB climate modeling and 1.1TB bioimaging data. The data matrices are tall-and-skinny which enable the algorithms to map conveniently into Spark's data-parallel model. We perform scaling experiments on up to 1600 Cray XC40 nodes, describe the sources of slowdowns, and provide tuning guidance to obtain high performance.

...read moreread less

57 citations

Posted Content•

Matrix Factorization at Scale: a Comparison of Scientific Data Analytics in Spark and C+MPI Using Three Case Studies

[...]

Alex Gittens, Aditya Devarakonda, Evan Racah, Michael F. Ringenburg, L. Gerhardt, Jey Kottalam, Jialin Liu, Kristyn Maschhoff, Shane Canon, Jatin Chhugani, Pramod Sharma, Jiyan Yang, James Demmel, Jim Harrell, Venkat Krishnamurthy, Michael W. Mahoney, Prabhat - Show less +13 more

05 Jul 2016-arXiv: Distributed, Parallel, and Cluster Computing

TL;DR: This work explores the trade-offs of performing linear algebra using Apache Spark, compared to traditional C and MPI implementations on HPC platforms, and examines three widely-used and important matrix factorizations: NMF, PCA and CX.

...read moreread less

Abstract: We explore the trade-offs of performing linear algebra using Apache Spark, compared to traditional C and MPI implementations on HPC platforms. Spark is designed for data analytics on cluster computing platforms with access to local disks and is optimized for data-parallel tasks. We examine three widely-used and important matrix factorizations: NMF (for physical plausability), PCA (for its ubiquity) and CX (for data interpretability). We apply these methods to TB-sized problems in particle physics, climate modeling and bioimaging. The data matrices are tall-and-skinny which enable the algorithms to map conveniently into Spark's data-parallel model. We perform scaling experiments on up to 1600 Cray XC40 nodes, describe the sources of slowdowns, and provide tuning guidance to obtain high performance.

...read moreread less

33 citations

Proceedings Article•DOI•

A Multi-Platform Evaluation of the Randomized CX Low-Rank Matrix Factorization in Spark

[...]

Alex Gittens¹, Jey Kottalam¹, Jiyan Yang², Michael F. Ringenburg³, Jatin Chhugani⁴, Evan Racah⁵, Mohitdeep Singh, Yushu Yao⁵, Curt R. Fischer⁵, Oliver Ruebel⁵, Benjamin P. Bowen⁵, Norman G. Lewis⁶, Michael W. Mahoney¹, Venkat Krishnamurthy³, Prabhat⁵ - Show less +11 more•Institutions (6)

University of California, Berkeley¹, Stanford University², Cray³, eBay⁴, Lawrence Berkeley National Laboratory⁵, Washington State University⁶

23 May 2016

TL;DR: The performance and scalability of the randomized CX low-rank matrix factorization is investigated and its applicability through the analysis of a 1TB mass spectrometry imaging (MSI) dataset is demonstrated using Apache Spark on an Amazon EC2 cluster, a Cray XC40 system, and an experimental Cray cluster.

...read moreread less

Abstract: We investigate the performance and scalability of the randomized CX low-rank matrix factorization and demonstrate its applicability through the analysis of a 1TB mass spectrometry imaging (MSI) dataset, using Apache Spark on an Amazon EC2 cluster, a Cray XC40 system, and an experimental Cray cluster. We implemented this factorization both as a parallelized C implementation with hand-tuned optimizations and in Scala using the Apache Spark high-level cluster computing framework. We obtained consistent performance across the three platforms: using Spark we were able to process the 1TB size dataset in under 30 minutes with 960 cores on all systems, with the fastest times obtained on the experimental Cray cluster. In comparison, the C implementation processed the 1TB size dataset 21X faster on the Amazon EC2 system, due to careful cache optimizations, bandwidth-friendly access of matrices and vector computation using SIMD units. We report these results and their implications on the hardware and software issues arising in supporting data-centric workloads in parallel and distributed environments.

...read moreread less

9 citations