Robert E. Gruber
Researcher at Google
Publications - 6
Citations - 4982
Robert E. Gruber is an academic researcher at Google who has contributed to research on XML and parsing. The author has an h-index of 6 and has co-authored 6 publications receiving 4838 citations.
Papers
Journal Article
Bigtable: A Distributed Storage System for Structured Data
Fay W. Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Deepak Chandra, Andrew Fikes, Robert E. Gruber +8 more
TL;DR: This paper describes the simple data model provided by Bigtable, which gives clients dynamic control over data layout and format, as well as the design and implementation of Bigtable.
Proceedings Article
Bigtable: a distributed storage system for structured data
Fay W. Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Deepak Chandra, Andrew Fikes, Robert E. Gruber +8 more
TL;DR: Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers. Many projects at Google store data in Bigtable, including web indexing, Google Earth, and Google Finance.
Journal Article
PADS: a domain-specific language for processing ad hoc data
Kathleen Fisher, Robert E. Gruber +1 more
TL;DR: From PADS descriptions of ad hoc data, the PADS compiler generates libraries and tools for manipulating the data, including parsing routines, statistical profiling tools, translation programs to produce well-behaved formats such as XML or those required for loading relational databases, and tools for running XQueries over raw PADS data sources.
PADS: Processing Arbitrary Data Streams
Kathleen Fisher, Robert E. Gruber +1 more
TL;DR: Ad hoc data streams are often high-volume: AT&T’s call-detail stream contains roughly 300 million calls per day, requiring approximately 7 GB of storage space.
PADX: Querying Large-scale Ad Hoc Data with XQuery.
TL;DR: This paper describes the experience of designing and implementing PADX, a system for querying large-scale ad hoc data sources with XQuery. PADX is the synthesis and extension of two existing systems: PADS and Galax.