scispace - formally typeset
M

Michael J. Carey

Researcher at University of California, Irvine

Publications -  284
Citations -  19116

Michael J. Carey is an academic researcher from University of California, Irvine. The author has contributed to research in topics: Big data & Database design. The author has an hindex of 77, co-authored 284 publications receiving 18573 citations. Previous affiliations of Michael J. Carey include National Academy of Engineering & University of California, Berkeley.

Papers
More filters
Book ChapterDOI

XMark: a benchmark for XML data management

TL;DR: This work provides a framework to assess the abilities of an XML database to cope with a broad range of different query types typically encountered in real-world scenarios and offers a set of queries where each query is intended to challenge a particular aspect of the query processor.
Proceedings ArticleDOI

Efficient parallel set-similarity joins using MapReduce

TL;DR: This paper proposes a 3-stage approach for end-to-end set-similarity joins in parallel using the popular MapReduce framework, and reports results from extensive experiments on real datasets to evaluate the speedup and scaleup properties of the proposed algorithms using Hadoop.
Journal ArticleDOI

The HiPAC project: combining active databases and timing constraints

TL;DR: The HiPAC (High Performance ACtive database system) project addresses two critical problems in time-constrained data management: the handling of timing constraints in databases, and the avoidance of wasteful polling through the use of situation-action rules that are an integral part of the database and are monitored by DBMS's condition monitor.
Proceedings ArticleDOI

Shoring up persistent applications

TL;DR: The goals and motivation for SHORE are given, and some novel aspects of the SHORE architecture are described, including a symmetric peer-to-peer server architecture, server customization through an extensible value-added server facility, and support for scalability on multiprocessor systems.
Journal ArticleDOI

Concurrency control performance modeling: alternatives and implications

TL;DR: It is shown that differences in the underlying assumptions explain the seemingly contradictory performance results, and the question of how realistic the various assumptions are for actual database systems is addressed.