P
Patrick Wendell
Researcher at University of California, Berkeley
Publications - 9
Citations - 3085
Patrick Wendell is an academic researcher from University of California, Berkeley. The author has contributed to research in topics: Spark (mathematics) & Big data. The author has an hindex of 7, co-authored 9 publications receiving 2618 citations. Previous affiliations of Patrick Wendell include Princeton University.
Papers
More filters
Journal ArticleDOI
Apache Spark: a unified engine for big data processing
Matei Zaharia,Reynold Xin,Patrick Wendell,Tathagata Das,Michael Armbrust,Ankur Dave,Xiangrui Meng,Josh Rosen,Shivaram Venkataraman,Michael J. Franklin,Ali Ghodsi,Joseph E. Gonzalez,Scott Shenker,Ion Stoica +13 more
TL;DR: This open source computing framework unifies streaming, batch, and interactive big data workloads to unlock new applications.
Proceedings ArticleDOI
Sparrow: distributed, low latency scheduling
TL;DR: It is demonstrated that a decentralized, randomized sampling approach provides near-optimal performance while avoiding the throughput and availability limitations of a centralized design.
Book
Learning Spark: Lightning-Fast Big Data Analytics
TL;DR: This book introduces Spark, an open source cluster computing system that makes data analytics fast to run and fast to write, and learns how to run programs faster, using primitives for in-memory cluster computing.
Proceedings ArticleDOI
DONAR: decentralized server selection for cloud services
TL;DR: This paper presents DONAR, a distributed system that can offload the burden of replica selection, while providing these services with a sufficiently expressive interface for specifying mapping policies, and demonstrates that the distributed algorithm is stable and effective.
Journal ArticleDOI
Scaling spark in the real world: performance and usability
Michael Armbrust,Tathagata Das,Aaron Davidson,Ali Ghodsi,Andrew Or,Josh Rosen,Ion Stoica,Patrick Wendell,Reynold Xin,Matei Zaharia +9 more
TL;DR: The main challenges and requirements that appeared in taking Spark to a wide set of users, and usability and performance improvements made to the engine in response are described.