scispace - formally typeset
D

Davies Liu

Publications -  4
Citations -  2930

Davies Liu is an academic researcher. The author has contributed to research in topics: Spark (mathematics) & Programming with Big Data in R. The author has an hindex of 4, co-authored 4 publications receiving 2608 citations.

Papers
More filters
Journal Article

MLlib: machine learning in apache spark

TL;DR: MLlib as mentioned in this paper is an open-source distributed machine learning library for Apache Spark that provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives.
Proceedings ArticleDOI

Spark SQL: Relational Data Processing in Spark

TL;DR: Spark SQL is a new module in Apache Spark that integrates relational processing with Spark's functional programming API, and includes a highly extensible optimizer, Catalyst, built using features of the Scala programming language.
Posted Content

MLlib: Machine Learning in Apache Spark

TL;DR: MLlib as discussed by the authors is an open-source distributed machine learning library for Apache Spark that provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives.
Proceedings ArticleDOI

SparkR: Scaling R Programs with Spark

TL;DR: SparkR is presented, an R package that provides a frontend to Apache Spark and uses Spark's distributed computation engine to enable large scale data analysis from the R shell.