Xiangrui Meng
Researcher at LinkedIn
Publications - 33
Citations - 5952
Xiangrui Meng is an academic researcher from LinkedIn. The author has contributed to research in topics: Spark (mathematics) & Matrix (mathematics). The author has an h-index of 18 and has co-authored 33 publications receiving 5134 citations. Previous affiliations of Xiangrui Meng include Peking University & Stanford University.
Papers
Journal ArticleDOI
Apache Spark: a unified engine for big data processing
Matei Zaharia, Reynold Xin, Patrick Wendell, Tathagata Das, Michael Armbrust, Ankur Dave, Xiangrui Meng, Josh Rosen, Shivaram Venkataraman, Michael J. Franklin, Ali Ghodsi, Joseph E. Gonzalez, Scott Shenker, Ion Stoica
TL;DR: This open source computing framework unifies streaming, batch, and interactive big data workloads to unlock new applications.
Journal Article
MLlib: machine learning in Apache Spark
Xiangrui Meng, Joseph K. Bradley, Burak Yavuz, Evan R. Sparks, Shivaram Venkataraman, Davies Liu, Jeremy Freeman, DB Tsai, Manish Amde, Sean Owen, Doris Xin, Reynold Xin, Michael J. Franklin, Reza Bosagh Zadeh, Matei Zaharia, Ameet Talwalkar
TL;DR: MLlib is an open-source distributed machine learning library for Apache Spark. It provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives.
Proceedings ArticleDOI
Spark SQL: Relational Data Processing in Spark
Michael Armbrust, Reynold Xin, Cheng Lian, Yin Huai, Davies Liu, Joseph K. Bradley, Xiangrui Meng, Tomer Kaftan, Michael J. Franklin, Ali Ghodsi, Matei Zaharia
TL;DR: Spark SQL is a new module in Apache Spark that integrates relational processing with Spark's functional programming API, and includes a highly extensible optimizer, Catalyst, built using features of the Scala programming language.
Proceedings ArticleDOI
Low-distortion subspace embeddings in input-sparsity time and applications to robust linear regression
Xiangrui Meng, Michael W. Mahoney
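TL;DR: In this article, a low-distortion embedding matrix $\Pi \in \mathbb{R}^{O(\mathrm{poly}(d)) \times n}$ is constructed that embeds $\mathcal{A}_p$, the $\ell_p$ subspace spanned by $A$'s columns, into $(\mathbb{R}^{O(\mathrm{poly}(d))}, \|\cdot\|_p)$, and $\Pi A$ can be computed in $O(\mathrm{nnz}(A))$ time.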
Journal ArticleDOI
LSRN: a parallel iterative solver for strongly over- or underdetermined systems
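TL;DR: In this article, a parallel iterative least squares solver based on random normal projection is proposed. The preconditioning phase consists of a random normal projection followed by a singular value decomposition of size $\lceil \gamma \min(m,n) \rceil \times \min(m,n)$, where $\gamma$ is moderately larger than 1, e.g., $\gamma = 2$.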