C
Cheng Lian
Researcher at Intel
Publications - 3
Citations - 1247
Cheng Lian is an academic researcher from Intel. The author has contributed to research in topics: Speedup & Cache. The author has an hindex of 3, co-authored 3 publications receiving 1081 citations.
Papers
More filters
Proceedings ArticleDOI
Spark SQL: Relational Data Processing in Spark
Michael Armbrust,Reynold Xin,Cheng Lian,Yin Huai,Davies Liu,Joseph K. Bradley,Xiangrui Meng,Tomer Kaftan,Michael J. Franklin,Ali Ghodsi,Matei Zaharia +10 more
TL;DR: Spark SQL is a new module in Apache Spark that integrates relational processing with Spark's functional programming API, and includes a highly extensible optimizer, Catalyst, built using features of the Scala programming language.
Proceedings ArticleDOI
NativeTask: A Hadoop compatible framework for high performance
Dong Yang,Xiang Zhong,Dong Yan,Fangqin Dai,Xusen Yin,Cheng Lian,Zhongliang Zhu,Weihua Jiang,Gan-Sha Wu +8 more
TL;DR: The results show that the speedup NativeTask achieves ranges from 10% to 160%, and it paves the way for a better MapReduce that excels on both single node performance and scalability.
Journal ArticleDOI
Using Memory in the Right Way to Accelerate Big Data Processing
TL;DR: Two optimizations to tackle the dramatic inefficiency of the big data processing are introduced, the slice-and-merge strategy, which reduces the cache miss rate of the sort procedure and the direct-memory-access, which reforms the data structure used in key/value storage.