scispace - formally typeset
Search or ask a question
Topic

Data mart

About: Data mart is a research topic. Over the lifetime, 559 publications have been published within this topic receiving 8550 citations.


Papers
More filters
01 Jan 2002
TL;DR: MF-Retarget is presented, a query retargeting mechanism that deals with both conventional star schemas and multiple facttable (MFT) schemas that is often used to implement a DW using distinct, but interrelated Data Marts.
Abstract: . Performance is a critical issue in Data Warehouse systems (DWs),due to the large amounts of data manipulated, and the type of analysisperformed. A common technique used to improve performance is the use ofpre-computed aggregate data, but the use of aggregates must be transparent forDW users. In this work, we present MF-Retarget, a query retargetingmechanism that deals with both conventional star schemas and multiple facttable (MFT) schemas. This type of multidimensional schema is often used toimplement a DW using distinct, but interrelated Data Marts. The paper presentsthe retargeting algorithm and initial performance tests. 1 Introduction Data warehouses (DW) are analytical databases aimed at providing intuitive access toinformation useful for decision-making processes. A Data Mart (DM), often referredto as a subject-oriented DW, represents a subset of the DW, comprised of relevantdata for a particular business function (e.g. marketing, sales). DW/DM handle largevolumes of data, and they are often designed using a star schema, which containsrelatively few tables and well-defined join paths. On-line Analytical Processing(OLAP) systems are the predominant front-end tools used in DW environments,which typically explore this multidimensional data structure [3, 13]. OLAP operations(e.g. drill down, roll up, slice and dice) typically result in SQL queries in whichaggregation functions (e.g. SUM, COUNT) are applied to fact table attributes, usingdimension table attributes as grouping columns (group by clause).A
Patent
04 Jul 2019
TL;DR: In this article, the authors present a method for the automated generation of software code for a corporate data warehouse, which includes obtaining metadata describing a setting for data transformation mechanisms for loading data to a detailed layer level and calculating the data marts of a warehouse; generating at least one template for updating the data of the detailed layer and a data mart of the data warehouse.
Abstract: This technical solution relates generally to the field of computer technology, and more particularly to systems and methods for the automated generation of software code for a corporate data warehouse. A method for the automated generation of software code for a corporate data warehouse includes: obtaining metadata describing a setting for data transformation mechanisms for loading data to a detailed layer level and calculating the data marts of a warehouse; generating at least one template for updating the data of the detailed layer and a data mart of the data warehouse; generating software code for loading data to the detailed layer of the data warehouse and calculating data marts on the basis of the metadata obtained and the data update template generated; installing the software code generated in the previous step in the data warehouse environment for performing loading; reusing the detailed layer and data mart update metadata. The technical result is an increase in the stability of detailed layer and data mart algorithms and a decrease in the number of incidents in the data warehouse.
01 Dec 2009
TL;DR: In this paper, attempt is made to design data mart for utilizing micro-data through data analysis and guidelines for development of statistics information service of research and development are provided.
Abstract: The survey of research and development is conducted for estimating of national status in science and technology, and the micro-data obtained from survey had been used for generating indicators which are supplied in the form of printed materials. But survey micro-data had not been managed in a systematic way then end user didn't acquire and manipulate statistical information for their purposes. In this paper, attempt is made to design data mart for utilizing micro-data through data analysis. And this article provides guidelines for development of statistics information service of research and development.
Proceedings Article
01 Jan 2006
TL;DR: This paper is interested in the graphical manipulation of data mart schemes described in XML and issued from a generation module of multidimensional models through a set of operations that consist in adding, deleting and renaming the multiddimensional models.
Abstract: This paper is interested in the graphical manipulation of data mart schemes described in XML and issued from a generation module of multidimensional models. This manipulation is performed through a set of operations we have defined. These operations consist in adding, deleting and renaming the multidimensional
Proceedings Article
01 Jan 2000
TL;DR: This paper describes the modeling of a data mart designed for use as a data repository in the “knowledge discovery in data bases” (KDD) process applied to a multimedia metadata environment whose data structure is based on the MHEG-5 standard.
Abstract: This paper describes the modeling of a data mart designed for use as a data repository in the “knowledge discovery in data bases” (KDD) process applied to a multimedia metadata environment whose data structure is based on the MHEG-5 standard. The data mart design uses the features of the original repository which are meaningful for the KDD process. The typical data warehouse structure was adapted to the object-oriented multimedia environment, and a structure that is both homogeneous – capable of integrating information from different sources – and objective – providing specific content and functionality for analysis – was generated to support the data preparation process before the data mining operation.

Network Information
Related Topics (5)
Information system
107.5K papers, 1.8M citations
77% related
The Internet
213.2K papers, 3.8M citations
72% related
Scheduling (computing)
78.6K papers, 1.3M citations
72% related
Cloud computing
156.4K papers, 1.9M citations
71% related
Software
130.5K papers, 2M citations
70% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202113
202020
201926
201823
201726
201627