scispace - formally typeset
K

Kevin Scott Beyer

Researcher at IBM

Publications -  54
Citations -  4669

Kevin Scott Beyer is an academic researcher from IBM. The author has contributed to research in topics: XML database & Efficient XML Interchange. The author has an hindex of 24, co-authored 54 publications receiving 4606 citations.

Papers
More filters
Proceedings ArticleDOI

Storing and querying ordered XML using a relational database system

TL;DR: This paper shows that XML's ordered data model can indeed be efficiently supported by a relational database system, and proposes three order encoding methods that can be used to represent XML order in the relational data model, and also proposes algorithms for translating ordered XPath expressions into SQL using these encoding methods.
Journal ArticleDOI

Jaql: a scripting language for large scale semistructured data analysis

TL;DR: Jaql, a declarative scripting language for analyzing large semistructured datasets in parallel using Hadoop’s MapReduce framework, is described and the results of scale-up experiments onHadoop running Jaql scripts for intranet data analysis and log processing are presented.
Proceedings ArticleDOI

Ricardo: integrating R and Hadoop

TL;DR: R Ricardo is part of the eXtreme Analytics Platform (XAP) project at the IBM Almaden Research Center, and rests on a decomposition of data-analysis algorithms into parts executed by the R statistical analysis system and parts handled by the Hadoop data management system.
Proceedings ArticleDOI

On synopses for distinct-value estimation under multiset operations

TL;DR: This work provides DV estimation techniques that are designed for use within a flexible and scalable "synopsis warehouse" architecture and uses the theory of order statistics to show that the DV estimators are unbiased, and to establish moment formulas and sharp error bounds.
Book ChapterDOI

A framework for using materialized XPath views in XML query processing

TL;DR: This paper proposes a framework for exploiting materialized XPath views to expedite processing of XML queries, and develops an XPath matching algorithm to determine when such views can be used to answer a user query containing XPath expressions.