M
Matthew B. Tolton
Researcher at Google
Publications - 7
Citations - 805
Matthew B. Tolton is an academic researcher from Google. The author has contributed to research in topics: Data element & The Internet. The author has an hindex of 4, co-authored 7 publications receiving 729 citations.
Papers
More filters
Journal ArticleDOI
Dremel: interactive analysis of web-scale datasets
Sergey Melnik,Andrey Gubarev,Jing Jing Long,Geoffrey M. Romer,Shiva Shivakumar,Matthew B. Tolton,Theodore Vassilakis +6 more
TL;DR: The architecture and implementation of Dremel are described, and how it complements MapReduce-based computing is explained, and a novel columnar storage representation for nested records is presented.
Journal ArticleDOI
Dremel: interactive analysis of web-scale datasets
Sergey Melnik,Andrey Gubarev,Jing Jing Long,Geoffrey M. Romer,Shiva Shivakumar,Matthew B. Tolton,Theodore Vassilakis +6 more
TL;DR: Dremel as discussed by the authors is a scalable, interactive ad hoc query system for analysis of read-only nested data, which combines multilevel execution trees and columnar data layout, it is capable of running aggregation queries over trillion-row tables in seconds.
Journal ArticleDOI
Dremel: a decade of interactive SQL analysis at web scale
Sergey Melnik,Andrey Gubarev,Jing Jing Long,Geoffrey M. Romer,Shiva Shivakumar,Matthew B. Tolton,Theodore Vassilakis,Hossein Ahmadi,Daniel P. Delorey,Slava Min,Mosha Pasumansky,Jeff Shute +11 more
TL;DR: How Dremel evolved in the past decade and became the foundation for Google BigQuery is discussed, including disaggregated storage and compute, in situ analysis, and columnar storage for semistructured data.
Patent
Columnar storage representations of records
Andrey Gubarev,Sergey Melnik,Jing Jing Long,Geoffrey M. Romer,Narayanan Shivakumar,Matthew B. Tolton,Theodore Vassilakis +6 more
TL;DR: In this article, a computer system accesses a collection of data records and generates a set of columnar stripes that correspond to a specific data element from each record in the collection of records.
Patent
Repartitioning data in a distributed computing system
TL;DR: In this article, the authors present a system for allocating, by a source of one or more sources, a segment of a data file of a transient memory for exclusive access by the source, the transient memory being a distributed in-memory file system that supports remote direct memory access.