Top 2 papers published by Eugene Wu from Columbia University in 2010

Proceedings Article•DOI•

TrajStore: An adaptive storage system for very large trajectory data sets

[...]

Philippe Cudré-Mauroux¹, Eugene Wu¹, Samuel Madden¹•Institutions (1)

01 Mar 2010

TL;DR: TrajStore is a dynamic storage system optimized for efficiently retrieving all data in a particular spatiotemporal region that maintains an optimal index on the data and dynamically co-locates and compresses spatially and temporally adjacent segments on disk.

...read moreread less

Abstract: The rise of GPS and broadband-speed wireless devices has led to tremendous excitement about a range of applications broadly characterized as “location based services”. Current database storage systems, however, are inadequate for manipulating the very large and dynamic spatio-temporal data sets required to support such services. Proposals in the literature either present new indices without discussing how to cluster data, potentially resulting in many disk seeks for lookups of densely packed objects, or use static quadtrees or other partitioning structures, which become rapidly suboptimal as the data or queries evolve. As a result of these performance limitations, we built TrajStore, a dynamic storage system optimized for efficiently retrieving all data in a particular spatiotemporal region. TrajStore maintains an optimal index on the data and dynamically co-locates and compresses spatially and temporally adjacent segments on disk. By letting the storage layer evolve with the index, the system adapts to incoming queries and data and is able to answer most queries via a very limited number of I/Os, even when the queries target regions containing hundreds or thousands of different trajectories.

...read moreread less

205 citations

Relational Cloud: The Case for a Database Service

[...]

Eugene Wu¹, Samuel Madden¹, Yang Zhang¹, Evan P. C. Jones¹, Carlo Curino¹ - Show less +1 more•Institutions (1)

Massachusetts Institute of Technology¹

14 Mar 2010

TL;DR: The case for “databases as a service” (DaaS) is made, with two target scenarios in mind: (i) consolidation of data management functionality for large organizations and (ii) outsourcing data management to a cloud-based service provider for small/medium organizations.

...read moreread less

Abstract: In this paper, we make the case for “databases as a service” (DaaS), with two target scenarios in mind: (i) consolidation of data management functionality for large organizations and (ii) outsourcing data management to a cloud-based service provider for small/medium organizations. We analyze the many challenges to be faced, and discuss the design of a database service we are building, called Relational Cloud. The system has been designed from scratch and combines many recent advances and novel solutions. The prototype we present exploits multiple dedicated storage engines, provides high-availability via transparent replication, supports automatic workload partitioning and live data migration, and provides serializable distributed transactions. While the system is still under active development, we are able to present promising initial results that showcase the key features of our system. The tests are based on TPC benchmarks and real-world data from epinions.com, and show our partitioning, scalability and balancing capabilities.

...read moreread less

50 citations

Showing papers by "Eugene Wu published in 2010"