K
Kevin Scott Beyer
Researcher at IBM
Publications - 54
Citations - 4669
Kevin Scott Beyer is an academic researcher from IBM. The author has contributed to research in topics: XML database & Efficient XML Interchange. The author has an hindex of 24, co-authored 54 publications receiving 4606 citations.
Papers
More filters
Proceedings ArticleDOI
Storing and querying ordered XML using a relational database system
Igor Tatarinov,Stratis D. Viglas,Kevin Scott Beyer,Jayavel Shanmugasundaram,Eugene J. Shekita,Chun Zhang +5 more
TL;DR: This paper shows that XML's ordered data model can indeed be efficiently supported by a relational database system, and proposes three order encoding methods that can be used to represent XML order in the relational data model, and also proposes algorithms for translating ordered XPath expressions into SQL using these encoding methods.
Journal ArticleDOI
Jaql: a scripting language for large scale semistructured data analysis
Kevin Scott Beyer,Vuk Ercegovac,Rainer Gemulla,Andrey Balmin,Mohamed Y. Eltabakh,Carl-Christian Kanne,Fatma Ozcan,Eugene J. Shekita +7 more
TL;DR: Jaql, a declarative scripting language for analyzing large semistructured datasets in parallel using Hadoop’s MapReduce framework, is described and the results of scale-up experiments onHadoop running Jaql scripts for intranet data analysis and log processing are presented.
Proceedings ArticleDOI
Ricardo: integrating R and Hadoop
TL;DR: R Ricardo is part of the eXtreme Analytics Platform (XAP) project at the IBM Almaden Research Center, and rests on a decomposition of data-analysis algorithms into parts executed by the R statistical analysis system and parts handled by the Hadoop data management system.
Proceedings ArticleDOI
On synopses for distinct-value estimation under multiset operations
TL;DR: This work provides DV estimation techniques that are designed for use within a flexible and scalable "synopsis warehouse" architecture and uses the theory of order statistics to show that the DV estimators are unbiased, and to establish moment formulas and sharp error bounds.
Book ChapterDOI
A framework for using materialized XPath views in XML query processing
TL;DR: This paper proposes a framework for exploiting materialized XPath views to expedite processing of XML queries, and develops an XPath matching algorithm to determine when such views can be used to answer a user query containing XPath expressions.