Showing papers on "Data mart" published in 2004


Patent
05 Apr 2004
TL;DR: A data analysis workbench enables a user to define a data analysis process that includes an extract sub-process to obtain transactional data from a source system, a load sub-process to provide the extracted data to a data warehouse or data mart, a data mining analysis sub-process to use the obtained transactional data, and a deployment sub-process to make the data mining results accessible by another computer program.
Abstract: A data analysis workbench enables a user to define a data analysis process that includes an extract sub-process to obtain transactional data from a source system, a load sub-process for providing the extracted data to a data warehouse or data mart, a data mining analysis sub-process to use the obtained transactional data, and a deployment sub-process to make the data mining results accessible by another computer program. Common settings used by each of the sub-processes are defined, as are specialized settings relevant to each of the sub-processes. The invention also enables a user to define an order in which the defined sub-processes are to be executed.
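One way to picture the process described in this abstract is as an ordered list of sub-processes that share common settings while each carries its own specialized settings. The Python sketch below is illustrative only: the names `SubProcess` and `run_process`, the toy step functions, and every settings key are assumptions made for the example and are not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Settings = Dict[str, object]


@dataclass
class SubProcess:
    """One step of the analysis process (hypothetical structure)."""
    name: str
    action: Callable[[Settings, object], object]
    settings: Settings = field(default_factory=dict)  # specialized settings


def run_process(common: Settings, steps: List[SubProcess]) -> object:
    """Run the sub-processes in the user-defined order, passing data along."""
    data = None
    for step in steps:
        # Specialized settings override common ones for this step only.
        merged = {**common, **step.settings}
        data = step.action(merged, data)
    return data


# Toy actions standing in for extract, load, mining and deployment.
def extract(cfg, _):
    return [{"customer": "a", "amount": 10},
            {"customer": "a", "amount": 5},
            {"customer": "b", "amount": 7}]


def load(cfg, rows):
    return {"table": cfg["target"], "rows": rows}


def mine(cfg, mart):
    totals = {}
    for row in mart["rows"]:
        totals[row["customer"]] = totals.get(row["customer"], 0) + row["amount"]
    return totals


def deploy(cfg, result):
    return {"published_to": cfg["endpoint"], "result": result}


# The list order is the execution order chosen by the user.
process = [
    SubProcess("extract", extract, {"source": "oltp"}),
    SubProcess("load", load, {"target": "sales_mart"}),
    SubProcess("mine", mine),
    SubProcess("deploy", deploy, {"endpoint": "scoring_service"}),
]
output = run_process({"log_level": "info"}, process)
```

Keeping the order in a plain list makes the "user-defined order" explicit; a real workbench would presumably persist this definition rather than hold it in memory.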

38 citations


Book ChapterDOI
24 Jun 2004
TL;DR: A comparison with results obtained manually by a physician indicates that the proposed system can be used for non-critical case-finding applications such as finding appropriate patients for clinical trials.
Abstract: This paper presents a novel information retrieval system designed specifically for medical case-finding applications. The proposed system begins by extracting medical information from free-text narrative reports and storing it in a predefined relational clinical data mart. The extraction is performed using a medical thesaurus and regular expression pattern matching. Following the extraction phase, inclusion/exclusion criteria are provided to the system through a physician-friendly user interface. The system converts the entered criteria into a single SQL command, which can then be executed on the relational data mart. To achieve the response time required for on-line analysis, the system implements several caching mechanisms. The proposed system has been evaluated on a real-world database, and its performance has been compared to the results obtained manually by a physician. The comparison indicates that the proposed system can be used for non-critical case-finding applications such as finding appropriate patients for clinical trials.
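The extraction and query steps described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the three-entry `THESAURUS`, the `finding` table layout, and the function names are all hypothetical, negation handling is ignored, and the caching mechanisms mentioned in the abstract are not modeled.

```python
import re
import sqlite3

# Hypothetical mini-thesaurus: concept -> regular expression for its synonyms.
THESAURUS = {
    "diabetes": re.compile(r"\b(diabetes|diabetic)\b", re.IGNORECASE),
    "hypertension": re.compile(r"\b(hypertension|high blood pressure)\b", re.IGNORECASE),
    "pregnancy": re.compile(r"\b(pregnant|pregnancy)\b", re.IGNORECASE),
}


def extract_findings(report_text):
    """Return the set of thesaurus concepts mentioned in a free-text report."""
    return {c for c, pattern in THESAURUS.items() if pattern.search(report_text)}


def build_mart(reports):
    """Store (patient_id, concept) rows in an in-memory relational table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE finding (patient_id INTEGER, concept TEXT)")
    for patient_id, text in reports.items():
        for concept in sorted(extract_findings(text)):
            conn.execute("INSERT INTO finding VALUES (?, ?)", (patient_id, concept))
    return conn


def criteria_to_sql(include, exclude):
    """Translate inclusion/exclusion concept lists into one SQL statement."""
    clauses, params = [], []
    for concept in include:
        clauses.append("patient_id IN (SELECT patient_id FROM finding WHERE concept = ?)")
        params.append(concept)
    for concept in exclude:
        clauses.append("patient_id NOT IN (SELECT patient_id FROM finding WHERE concept = ?)")
        params.append(concept)
    where = " AND ".join(clauses) if clauses else "1 = 1"
    sql = f"SELECT DISTINCT patient_id FROM finding WHERE {where} ORDER BY patient_id"
    return sql, params


reports = {
    1: "Known diabetic patient with high blood pressure.",
    2: "Diabetes mellitus; patient is pregnant.",
    3: "No chronic conditions reported.",
}
conn = build_mart(reports)
sql, params = criteria_to_sql(include=["diabetes"], exclude=["pregnancy"])
matches = [row[0] for row in conn.execute(sql, params)]
```

Concept values are passed as bound parameters rather than interpolated into the SQL string, so the generated statement stays a single command regardless of how many criteria the user enters.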

38 citations


Patent
03 Jun 2004
TL;DR: A system and method for managing purchase data of a credit card account associated with a vehicle. The management system downloads data regarding a vehicle from a vehicle master system, and data regarding the U.S. Postal Service organizational hierarchy structure from a financial data mart system.
Abstract: System and method for managing purchase data of a credit card account associated with a vehicle through the use of a management system. The management system downloads data regarding a vehicle from a vehicle master system, and data regarding the U.S. Postal Service organizational hierarchy structure from a financial data mart system. The management system receives transaction data from a credit card system, summarizes the data by vehicle, driver, product group and supplier at eight (8) summary levels of organizational structure, analyzes the transaction data for instances of possible fraudulent use of the credit card, and creates exception records to indicate the possibility of fraud. On behalf of the credit card provider, the management system creates an invoice and a control report, and sends the invoice to the financial department for payment. The management system produces spreadsheets and reports used for quarterly or annual filing with States to recoup exempt tax not taken off at the pump. The management system organizes the transaction, summary, invoice and exception data into sorted listings for display over one or more intranet web pages. The web pages on which transactions are shown allow the user to reconcile the transactions. The sorted listings may indicate a probability of fraud.

28 citations


Patent
05 Apr 2004
TL;DR: A data analysis workbench enables a user to define a data analysis process that includes an extract sub-process to obtain transactional data from a source system, a load sub-process to provide the extracted data to a data warehouse or data mart, a data mining analysis sub-process to use the obtained transactional data, and a deployment sub-process to make the data mining results accessible by another computer program.
Abstract: A data analysis workbench enables a user to define a data analysis process that includes an extract sub-process to obtain transactional data from a source system, a load sub-process for providing the extracted data to a data warehouse or data mart, a data mining analysis sub-process to use the obtained transactional data, and a deployment sub-process to make the data mining results accessible by another computer program. Common settings used by each of the sub-processes are defined, as are specialized settings relevant to each of the sub-processes. The invention also enables a user to define an order in which the defined sub-processes are to be executed. The defined data analysis process then is able to be performed by one or more computer systems.

19 citations


Journal ArticleDOI
TL;DR: This work presents a solution that integrates and consolidates all research-relevant data in a data mart without imposing any considerable operational or maintenance contract liability risk on the existing HIS.
Abstract: For many new medical research questions in heart surgery, comprehensive and large databases are essential. We discuss typical challenges for the integration of real-time and legacy data stored in multiple unconnected hospital information systems (HIS). Furthermore, the HIS are often operated by autonomous departments whose database structures are subject to occasional modifications. We present a solution that integrates and consolidates all research-relevant data in a data mart without imposing any considerable operational or maintenance contract liability risk on the existing HIS. The problems of partial consistency and partial redundancy in the data are discussed. The data mart system serves multiple purposes: besides clinical reporting and quality assessment, the preparation steps for comprehensive studies are greatly simplified.

18 citations


01 Jan 2004
TL;DR: This work proposes a novel notion of dimension compatibility, characterizes its general properties, and shows the significance of dimension compatibility in performing drill-across queries over autonomous data marts.
Abstract: The problem of integrating autonomous data marts arises when, e.g., a large organization (or a federation thereof) needs to combine independently developed data warehouses. It turns out that this problem can be tackled in a systematic way for two main reasons. First, data marts are usually structured in a rather uniform way, along dimensions and facts. Second, data quality in data marts is usually higher than in generic databases, since they are obtained by reconciling several data sources. Our scenario of reference is a federation of various data marts that we need to query in a unified way by means of drill-across operations. We propose a novel notion of dimension compatibility and characterize its general properties. We then show the significance of dimension compatibility in performing drill-across queries over autonomous data marts.
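To make the idea of a drill-across operation concrete, the Python sketch below aggregates two independently built fact tables along a dimension they share and then joins the results. It is only an illustration: the marts, column names, and helper functions are hypothetical, and compatibility is approximated here by identical member values in the shared dimension, which is a much cruder condition than the notion the paper develops.

```python
from collections import defaultdict

# Two hypothetical, independently built data marts sharing a "month" dimension.
sales_mart = [
    {"month": "2004-01", "store": "S1", "revenue": 100},
    {"month": "2004-01", "store": "S2", "revenue": 50},
    {"month": "2004-02", "store": "S1", "revenue": 70},
]
shipping_mart = [
    {"month": "2004-01", "carrier": "C1", "cost": 30},
    {"month": "2004-02", "carrier": "C1", "cost": 20},
    {"month": "2004-02", "carrier": "C2", "cost": 5},
    {"month": "2004-03", "carrier": "C2", "cost": 8},
]


def roll_up(facts, dimension, measure):
    """Aggregate one measure of a fact table along a single dimension."""
    totals = defaultdict(int)
    for row in facts:
        totals[row[dimension]] += row[measure]
    return dict(totals)


def drill_across(left, right):
    """Join two aggregates on the members of the shared dimension.

    Only members present in both marts are kept (a simplifying assumption).
    """
    return {k: (left[k], right[k]) for k in sorted(left.keys() & right.keys())}


revenue = roll_up(sales_mart, "month", "revenue")
cost = roll_up(shipping_mart, "month", "cost")
combined = drill_across(revenue, cost)
# combined == {"2004-01": (150, 30), "2004-02": (70, 25)}
```

Each mart is rolled up to the shared dimension before the join, which avoids double counting when the two fact tables have different grains (stores versus carriers in this toy data).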

13 citations


Proceedings ArticleDOI
02 May 2004
TL;DR: A novel architecture based on multi-agents technology to support information and knowledge extraction over distributed data sources in order to use them in the decision making process is proposed.
Abstract: Mission-critical decision making in enterprises depends heavily on intelligent systems for extracting, analyzing and interpreting information from multiple heterogeneous, distributed data and knowledge sources. It is assumed that data warehouses (DW) and data marts (DM) are required for optimized data accessibility and use. This paper discusses issues with current DW/DM systems and proposes a novel architecture based on multi-agent technology to support information and knowledge extraction over distributed data sources in order to use them in the decision making process. The proposed framework is applied to a real-world project lifecycle case, the EPC (Engineering Procurement and Construction) project.

8 citations


Proceedings ArticleDOI
11 Sep 2004
TL;DR: This work presents an adaptation of function point analysis (FPA), which is largely used in traditional software development projects, for data mart (DM) size measurement, and discusses results on 10 data mart projects developed in industry.
Abstract: To better control the time, cost and resources assigned to software projects, organizations need a proper estimate of their size even before the projects actually start. Accordingly, several approaches have been proposed to estimate the size of a software project, such as the well-known function point analysis (FPA), which is largely used in traditional software development projects. However, we observed in our company that it is not fit for data mart software measurement. Data mart (DM) systems have particularities in their development that differ from traditional software systems (e.g. a DM uses other software systems as data sources and does not create new information). It is important, therefore, to have a measurement approach that considers those particularities while measuring the DM size. We present an adaptation of the FPA approach for DM size measurement and discuss results on 10 data mart projects developed in industry.

4 citations


Journal ArticleDOI
TL;DR: The data mart is a hardware-software complex that provides collection, storage and display of industrial data, and is considered as a set of appropriately organized and combined technical, software, linguistic and information resources.
Abstract: A Monitoring Data Mart (MDM) for the production system of an oil refinery is considered. The data mart is a hardware-software complex that provides collection, storage and display of industrial data. The industrial data form an information model of the enterprise production system. Thus, the data mart is considered as a set of appropriately organized and combined technical, software, linguistic and information resources.

3 citations


Patent
31 Mar 2004
TL;DR: An application interface retrieves information from a data storage system, in particular a financial/business system or general ledger system, using a data mart that supports a generalized or system-independent format.
Abstract: An application interface is used to retrieve information from a data storage system and in particular, for a financial/business system or general ledger system. Data from the data storage system is retrieved using a data mart that supports a generalized or system-independent format. The interface instantiates a detail interface identified in a field of the data mart to communicate with, or drill back to, the data storage system to retrieve detailed data such as detail transaction data through an interface.

2 citations


Journal Article
TL;DR: A feasible architecture for a real-time data warehouse is introduced after analyzing the requirements, technology and performance of the data warehouse and comparing it with the traditional data warehouse.
Abstract: A traditional data warehouse involves a widely accepted topology of ODS, data warehouse, data mart and BI tools. A real-time data warehouse extends the application of the traditional data warehouse and can support tactical queries for the enterprise. The article discusses several architectures of the real-time data warehouse and compares them with the traditional data warehouse. It then introduces a feasible architecture for a real-time data warehouse after analyzing the requirements, technology and performance.

Journal ArticleDOI
TL;DR: The study built a patient affairs data mart and decision-support system that allows changing management, accounting, patient affairs and medical record data to be analyzed in various ways.
Abstract: Objective: The purpose of this study was to develop a decision-support patient affairs management program and to build a data mart for individually specified departments. Because the hospital under study ran an OLTP system, its data could not be analyzed in various ways: the data for management, accounts, patient affairs and medical records changed monthly or annually, and the system could not reflect the hospital's specific management information. Methods: The data set used for the analysis was drawn from one year of the patient database of a hospital located in Sun Cheon. Microsoft Visual Basic 6.0 was used to develop the patient affairs management program, and a ROLAP analysis method based on an Oracle relational database was used for general analysis of the required tasks. Results: The main results of the study are the patient affairs data mart and the decision-support system. They were used to analyze each department's revenue and each doctor's performance, and they can also compare how frequently patients use the hospital. With the analyzed results, each department can carry out its own targeted marketing, for example by offering residents lectures, consultations and free health services. Conclusion: Through the study, changes in management, accounts, patient affairs and medical records could be analyzed in various ways. The host system held an enormous volume of data, so queries took a long time and were usually run at night, when few transactions occurred. The new system built on the data mart greatly reduced this problem.

Patent
25 Aug 2004
TL;DR: A method, system and computer program allow users to extract predetermined, specialized content from an online transaction processing (OLTP) system into a portfolio management system database, using a data warehouse as a filtering device in an integrated database system.
Abstract: A method, a system and a computer program are provided for allowing users to extract predetermined, specialized content from an online transaction processing (OLTP) system into a portfolio management system database, using a data warehouse as a filtering device in an integrated database system. The use of the data warehouse as a filter involves determining a data structure representation for data sets stored in the data warehouse that conforms to the portfolio management system database schema, mapping data structures defined in the data warehouse schema to data structures supported by the portfolio management database, and extracting predetermined data sets for use in future processes.

Proceedings ArticleDOI
12 Apr 2004
TL;DR: This paper develops a methodology for comparing a number of search engines and shows how to increase overall coverage and thus build a more comprehensive data mart.
Abstract: To uncover new relationships or patterns, one must first build a corpus of data, or what some call a data mart. How can we make sure we have collected all the pertinent data and have maximized coverage? There are hundreds of search engines available for use on the Internet today. Which one is best? Is one better for one problem and a second better for another? Are meta-search engines better than individual search engines? In this paper we look at one possible approach to developing a methodology for comparing a number of search engines. Before we present this methodology, we first provide our motivation for the need for increased coverage. We next investigate how we can obtain ground truth and what insight the ground truth can give us into the Internet and search engine capabilities. We then conclude our discussion by developing a methodology in which we compare a number of search engines and show how we can increase overall coverage and thus obtain a more comprehensive data mart.