D
Davies Liu
Publications - 4
Citations - 2930
Davies Liu is an academic researcher. The author has contributed to research in topics: Spark (mathematics) & Programming with Big Data in R. The author has an hindex of 4, co-authored 4 publications receiving 2608 citations.
Papers
More filters
Journal Article
MLlib: machine learning in apache spark
Xiangrui Meng,Joseph K. Bradley,Burak Yavuz,Evan R. Sparks,Shivaram Venkataraman,Davies Liu,Jeremy Freeman,DB Tsai,Manish Amde,Sean Owen,Doris Xin,Reynold Xin,Michael J. Franklin,Reza Bosagh Zadeh,Matei Zaharia,Ameet Talwalkar +15 more
TL;DR: MLlib as mentioned in this paper is an open-source distributed machine learning library for Apache Spark that provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives.
Proceedings ArticleDOI
Spark SQL: Relational Data Processing in Spark
Michael Armbrust,Reynold Xin,Cheng Lian,Yin Huai,Davies Liu,Joseph K. Bradley,Xiangrui Meng,Tomer Kaftan,Michael J. Franklin,Ali Ghodsi,Matei Zaharia +10 more
TL;DR: Spark SQL is a new module in Apache Spark that integrates relational processing with Spark's functional programming API, and includes a highly extensible optimizer, Catalyst, built using features of the Scala programming language.
Posted Content
MLlib: Machine Learning in Apache Spark
Xiangrui Meng,Joseph K. Bradley,Burak Yavuz,Evan R. Sparks,Shivaram Venkataraman,Davies Liu,Jeremy Freeman,DB Tsai,Manish Amde,Sean Owen,Doris Xin,Reynold Xin,Michael J. Franklin,Reza Bosagh Zadeh,Matei Zaharia,Ameet Talwalkar +15 more
TL;DR: MLlib as discussed by the authors is an open-source distributed machine learning library for Apache Spark that provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives.
Proceedings ArticleDOI
SparkR: Scaling R Programs with Spark
Shivaram Venkataraman,Zongheng Yang,Davies Liu,Eric Liang,Hossein Falaki,Xiangrui Meng,Reynold Xin,Ali Ghodsi,Michael J. Franklin,Ion Stoica,Matei Zaharia +10 more
TL;DR: SparkR is presented, an R package that provides a frontend to Apache Spark and uses Spark's distributed computation engine to enable large scale data analysis from the R shell.