Open Access, Proceedings Article

Transforming statistical linked data for use in OLAP systems

TLDR
An extract-transform-load (ETL) pipeline converts statistical Linked Data into a format suitable for loading into an open-source OLAP system, demonstrating how standard OLAP infrastructure can be used for elaborate querying and visualisation of integrated statistical Linked Data.
Abstract
The amount of available Linked Data on the Web is increasing, and data providers have started to publish statistical datasets that comprise numerical data. Such statistical datasets differ significantly from the currently predominant network-style data published on the Web. We explore the possibility of integrating statistical data from multiple Linked Data sources. We provide a mapping from statistical Linked Data into the Multidimensional Model used in data warehouses. We use an extract-transform-load (ETL) pipeline to convert statistical Linked Data into a format suitable for loading into an open-source OLAP system, and thus demonstrate how standard OLAP infrastructure can be used for elaborate querying and visualisation of integrated statistical Linked Data. We discuss lessons learned from three experiments and identify areas that require future work to ultimately arrive at a well-interlinked set of statistical data from multiple sources that is processable with standard OLAP systems.
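The transform and load steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the observations, the dimension names (`geo`, `year`), the table names, and the helper functions `transform`, `load`, and `roll_up` are all hypothetical, and an in-memory SQLite database stands in for the relational store behind an OLAP system. The sketch assumes the extract step has already produced one record per statistical observation from two sources.

```python
import sqlite3

# Hypothetical observations, standing in for statistical Linked Data that
# has already been extracted from two sources. The keys are illustrative
# dimension and measure names, not terms from any real vocabulary.
OBSERVATIONS = [
    {"source": "A", "geo": "DE", "year": 2009, "value": 10.0},
    {"source": "A", "geo": "FR", "year": 2009, "value": 7.5},
    {"source": "B", "geo": "DE", "year": 2010, "value": 12.0},
    {"source": "B", "geo": "FR", "year": 2010, "value": 8.5},
]
DIMENSIONS = ["geo", "year"]
MEASURE = "value"


def transform(observations, dimensions, measure):
    """Map observations to a star schema: each dimension member gets a
    surrogate key, and each observation becomes one fact row."""
    members = {d: {} for d in dimensions}  # dimension -> {member: key}
    facts = []
    for obs in observations:
        keys = []
        for d in dimensions:
            table = members[d]
            # setdefault assigns the next surrogate key on first sight.
            keys.append(table.setdefault(obs[d], len(table) + 1))
        facts.append((*keys, obs[measure]))
    return members, facts


def load(conn, members, facts, dimensions):
    """Load the dimension tables and the fact table into the store."""
    for d in dimensions:
        conn.execute(f"CREATE TABLE dim_{d} (id INTEGER PRIMARY KEY, member TEXT)")
        conn.executemany(
            f"INSERT INTO dim_{d} VALUES (?, ?)",
            [(key, str(member)) for member, key in members[d].items()],
        )
    cols = ", ".join(f"{d}_id INTEGER" for d in dimensions)
    conn.execute(f"CREATE TABLE fact ({cols}, measure REAL)")
    placeholders = ", ".join("?" for _ in range(len(dimensions) + 1))
    conn.executemany(f"INSERT INTO fact VALUES ({placeholders})", facts)


def roll_up(conn, dimension):
    """Sum the measure per member of one dimension, aggregating the rest."""
    return conn.execute(
        f"SELECT d.member, SUM(f.measure) FROM fact f "
        f"JOIN dim_{dimension} d ON f.{dimension}_id = d.id "
        f"GROUP BY d.member ORDER BY d.member"
    ).fetchall()


conn = sqlite3.connect(":memory:")
members, facts = transform(OBSERVATIONS, DIMENSIONS, MEASURE)
load(conn, members, facts, DIMENSIONS)
totals_by_geo = roll_up(conn, "geo")
print(totals_by_geo)  # prints [('DE', 22.0), ('FR', 16.0)]
```

Once the observations from both sources share one fact table, a roll-up is an ordinary `GROUP BY` query. That is the sense in which standard OLAP infrastructure can be reused after the transformation. A real pipeline would also have to reconcile dimension members that different sources identify differently, which this sketch sidesteps by using identical member labels.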

