scispace - formally typeset
Search or ask a question
Proceedings ArticleDOI

A Framework to Convert XML Schema to ROLAP

TL;DR: An approach for integrating XML data, modeled by XML schema to its compatible data warehouse based on relational online analytical processing (ROLAP) is illustrated.
Abstract: Data warehouse provides architecture and tools for business executive to systematically organize, understand and use their data to make strategic decision. On the other hand, XML is used for e-commerce and Internet based application. Since many organizations use web for their business purpose, researches has been done for integrating XML data into data warehouse. This paper illustrates an approach for integrating XML data, modeled by XML schema to its compatible data warehouse based on relational online analytical processing (ROLAP). In this paper, this modeling is performed in three steps. Firstly, a schema tree is constructed from the given XML schema. In the next step, an entity-relationship diagram (ER Diagram) is derived from this schema tree. At the end, dimension tables as well as the corresponding fact table have been identified from the ER Diagram to provide a suitable multidimensional data model for online analytical processing (OLAP). The relationship between the fact tables and dimension tables have been organized in the form of star schema or snowflake schema as required.
Citations
More filters
Journal Article
TL;DR: In this paper, a semi-automated methodology for designing web warehouses from XML sources modeled by XML Schemas is proposed, which is carried out by first creating a schema graph, then navigating its arcs in order to derive a correct multidimensional representation.
Abstract: Web warehousing plays a key role in providing the managers with up-to-date and comprehensive information about their business domain. On the other hand, since XML is now a standard de facto for the exchange of semi-structured data, integrating XML data into web warehouses is a hot topic. In this paper we propose a semi-automated methodology for designing web warehouses from XML sources modeled by XML Schemas. In the proposed methodology, design is carried out by first creating a schema graph, then navigating its arcs in order to derive a correct multidimensional representation. Differently from previous approaches in the literature, particular relevance is given to the problem of detecting shared hierarchies and convergence of dependencies, and of modeling many-to-many relationships. The approach is implemented in a prototype that reads an XML Schema and produces in output the logical schema of the warehouse.

85 citations

Journal ArticleDOI
TL;DR: This paper focuses on integrating XML data based on multiple related XML schemas, to an equivalent data warehouse schemas based on relational online analytical processing (ROLAP) and a new data structure, Schema Graph has been proposed in the process.
Abstract: Data Warehouse is one of the most common ways for analyzing large data for decision based system. These data are often sourced from online transactional system. The transactional data are represented in different formats. XML is one of the worldwide standards to represent data in web based system. Numbers of organizations use XML for e-commerce and internet based applications. Integration of XML and data warehouse for the innovation of business logic and to enhance decision making has therefore emerged as a demanding area of research interest. This paper focuses on integrating XML data based on multiple related XML schemas, to an equivalent data warehouse schemas based on relational online analytical processing (ROLAP). This work bears a high relevance towards standardizing of the ETL phase (Extraction, Transformation, and Loading) of the OLAP projects. The novelty of the work is that more than one data warehouse schemas could be identified from a single related XML schema and each of them could be categorized as star schema or snowflake schema. Moreover if the individual schemas are found to be related according to the analysis, fact constellation could be identified. A new data structure, Schema Graph has been proposed in the process.

15 citations


Cites methods from "A Framework to Convert XML Schema t..."

  • ...The paper [2] converts XML schema either to star schema or snowflake schema....

    [...]

  • ...Works in [1, 2, 12, 13] show different ways of transformation from XML to relational model schema....

    [...]

Journal ArticleDOI
TL;DR: An overview of current state of the art concerning how XML technologies can be combined with data warehousing systems is given by the presentation of the most relevant XML warehousing approaches proposed in the literature.
Abstract: XML – the Extensible Markup Language has recently emerged as a standard for information representation and exchange on the internet, because of its portability, versatility and extensibility. As the large increase in the availability of such data, there is a need of an appropriate storage and an integration solution for future analysis and decision-making. Data warehouses systems provide a suitable means to organise and manage XML data for decision-making. However, XML data warehouse has some special requirements such as mixed content and irregular structure of XML data sources, which make existing traditional data warehouse approaches and techniques unsuitable. In this paper, we give an overview of current state of the art concerning how XML technologies can be combined with data warehousing systems, by the presentation of the most relevant XML warehousing approaches proposed in the literature.

10 citations

Proceedings ArticleDOI
16 Mar 2015
TL;DR: An integrated architecture to manage and design business intelligence environment by co-ordinating several associated entities to achieve business agility is presented.
Abstract: Business Intelligence (BI) provides historical, current and predictive views of business operations with the help of some technologies, that include reporting, online analytical processing, analytics, data mining, process mining, complex event processing, business performance management, benchmarking, text mining, predictive analytics and prescriptive analytics. As analytics plays a major role in BI, OLAP is an integrated part of BI in modern day business application. Data warehouse is the most popular way to design and build OLAP. Data warehouse along with ETL and reporting tools provides an integrated environment for business processing. Business processing also demands decision making system and knowledge representation. Moreover the data sources are physically distributed in different locations. Hence modern day business environment is a complex architecture with a numbers of entities. In this paper authors present an integrated architecture to manage and design business intelligence environment by co-ordinating several associated entities to achieve business agility.

10 citations


Cites background or methods from "A Framework to Convert XML Schema t..."

  • ...The previous research work [10] has been extended further in [11] to identify star schema, snowflake schema as well as fact constellation from the related XML schemas....

    [...]

  • ...A framework [10] is proposed to convert XML schema to ER diagram and then to relational model....

    [...]

Proceedings ArticleDOI
14 Sep 2012
TL;DR: This paper emphasizes on integrating heterogeneous data sources to create virtual data warehouses that could be deployed in a cloud environment to handle multiple OLAP data sources.
Abstract: Cloud Computing is an emerging technology that empowers the present day business scenario by providing services on demand instead of an integrated product. Many of applications on cloud deals with huge amount of data and these are often used for analytical processing to exploit the business intelligence. However working with very large scale of data is often time consuming and requires higher processing time. Materialized views are built and maintained to pre-fetch an effective subset of the entire database for current and immediate future usage. The materialized views are constructed on data warehouse, data marts and virtual data warehouse. In a cloud computing scenario, quite often the materialized views for the distributed data centers resides in different data servers. One of the major challenges is to handle multiple OLAP data sources. The data needs to be, integrated and analyzed continually in an efficient manner before the views are built. This paper emphasizes on integrating heterogeneous data sources to create virtual data warehouses that could be deployed in a cloud environment.

6 citations

References
More filters
Journal Article
TL;DR: XML is an extremely simple dialect of SGML which is completely described in this document, to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML.
Abstract: Extensible Markup Language (XML) is an extremely simple dialect of SGML which is completely described in this document. The goal is to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML. For this reason, XML has been designed for ease of implementation, and for interoperability with both SGML and HTML. Note on status of this document: This is even more of a moving target than the typical W3C working draft. Several important decisions on the details of XML are still outstanding members of the W3C SGML Working Group will recognize these areas of particular volatility in the spec, but those who are not intimately familiar with the deliberative process should be careful to avoid actions based on the content of this document, until the notice you are now reading has been removed.

5,749 citations


"A Framework to Convert XML Schema t..." refers background in this paper

  • ...XML 1.0 [ 8 ] provides document type definition (DTD), which explains precisely what elements can appear as document and the content of the element and attribute....

    [...]

Proceedings ArticleDOI
09 Nov 2001
TL;DR: This paper shows how the design of a data mart can be carried out starting directly from an XML source, and proposes a semi-automatic approach for building the conceptual schema for a dataMart starting from the XML sources.
Abstract: A large amount of data needed in decision-making processes is stored in the XML data format, which is widely used for e-commerce and Internet-based information exchange. Thus, as more organizations view the web as an integral part of their communication and business, the importance of integrating XML data in data warehousing environments is becoming increasingly high. In this paper we show how the design of a data mart can be carried out starting directly from an XML source. Two main issues arise: on the one hand, since XML models semi-structured data, not all the information needed for design can be safely derived; on the other, different approaches for representing relationships in XML DTDs and Schemas are possible, each with different expressive power. After discussing these issues, we propose a semi-automatic approach for building the conceptual schema for a data mart starting from the XML sources.

137 citations


"A Framework to Convert XML Schema t..." refers methods in this paper

  • ...Some of the approaches [1], [2], [5] use DTD for integrating XML data in data warehouse....

    [...]

Journal ArticleDOI
18 Jul 2001
TL;DR: This paper presents an approach to specification of OLAP DBs based on web data, using Unified Modeling Language (UML) as a basis for so-called UML snowflake diagrams that precisely capture the multidimensional structure of the data.
Abstract: On-Line Analytical Processing (OLAP) enables analysts to gain insight into data through fast and interactive access to a variety of possible views on information, organized in a dimensional model. The demand for data integration is rapidly becoming larger as more and more information sources appear in modern enterprises. In the data warehousing approach, selected information is extracted in advance and stored in a repository. This approach is used because of its high performance. However, in many situations a logical (rather than physical) integration of data is preferable. Previous Web-based data integration efforts have focused almost exclusively on the logical level of data models, creating a need for techniques focused on the conceptual level. Also, previous integration techniques for Web-based data have not addressed the special needs of OLAP tools such as handling dimensions with hierarchies. Extensible Markup Language (XML) is fast becoming the new standard for data representation and exchange on the World Wide Web. The rapid emergence of XML data on the Web, e.g., business-to-business (B2B) e-commerce, is making it necessary for OLAP and other data analysis tools to handle XML data as well as traditional data formats. Based on a real-world case study, the paper presents an approach to the conceptual specification of OLAP DBs based on Web data. Unlike previous work, this approach takes special OLAP issues such as dimension hierarchies and correct aggregation of data into account. Additionally, an integration architecture that allows the logical integration of XML and relational data sources for use by OLAP tools is presented.

105 citations

Proceedings ArticleDOI
07 Nov 2003
TL;DR: XCube is a family of XML based document templates to exchange data warehouse data, i.
Abstract: Data warehouse systems are nowadays a well known and widely spread approach for supporting management decisions. In several companies or even across companies the idea of integrating several data warehouses into a virtual or federated data warehouse is of growing interest. But the technical and semantic problems are very demanding. An essential part for solving this problem is a standardized, vendor independent format for describing multidimensional data. This paper introduces XCube, a family of XML based document templates to exchange data warehouse data, i. e. data cubes, over any kind of network. XCube is organized in a modular fashion, so the multidimensional schema, the descriptions of the single dimensions and the fact data itself can be transmitted in separate steps. In addition to the describing formats XCube also offers two kinds of dynamic document types that can be used to explore the (multidimensional) content of another warehouse in a vendor independent way. They are primarily meant to reduce the amount of data transferred over the network.

102 citations

Journal Article
TL;DR: In this paper, a semi-automated methodology for designing web warehouses from XML sources modeled by XML Schemas is proposed, which is carried out by first creating a schema graph, then navigating its arcs in order to derive a correct multidimensional representation.
Abstract: Web warehousing plays a key role in providing the managers with up-to-date and comprehensive information about their business domain. On the other hand, since XML is now a standard de facto for the exchange of semi-structured data, integrating XML data into web warehouses is a hot topic. In this paper we propose a semi-automated methodology for designing web warehouses from XML sources modeled by XML Schemas. In the proposed methodology, design is carried out by first creating a schema graph, then navigating its arcs in order to derive a correct multidimensional representation. Differently from previous approaches in the literature, particular relevance is given to the problem of detecting shared hierarchies and convergence of dependencies, and of modeling many-to-many relationships. The approach is implemented in a prototype that reads an XML Schema and produces in output the logical schema of the warehouse.

85 citations