Topic

Data management

About: Data management is a research topic. Over the lifetime, 31574 publications have been published within this topic receiving 424326 citations.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

A probabilistic XML approach to data integration

[...]

M. van Keulen, A. de Keijzer, W. Alink

05 Apr 2005

TL;DR: This paper takes a first step in the development of a probabilistic XML DBMS by dropping the assumption that data in the database should be certain: subtrees in XML documents may denote possible views on the real world.

...read moreread less

Abstract: In mobile and ambient environments, devices need to become autonomous, managing and resolving problems without interference from a user. The database of a (mobile) device can be seen as its knowledge about objects in the 'real world'. Data exchange between small and/or large computing devices can be used to supplement and update this knowledge whenever a connection gets established. In many situations, however, data from different data sources referring to the same real world objects, may conflict. It is the task of the data management system of the device to resolve such conflicts without interference from a user. In this paper, we take a first step in the development of a probabilistic XML DBMS. The main idea is to drop the assumption that data in the database should be certain: subtrees in XML documents may denote possible views on the real world. We formally define the notion of probabilistic XML tree and several operations thereon. We also present an approach for determining a logical semantics for queries on probabilistic XML data. Finally, we introduce an approach for XML data integration where conflicts are resolved by the introduction of possibilities in the database.

...read moreread less

130 citations

Journal Article•DOI•

Big Data management in smart grid: concepts, requirements and implementation

[...]

Houda Daki, Asmaa El Hannani, Abdelhak Aqqal, Abdelfattah Haidine, Aziz Dahbi - Show less +1 more

28 Apr 2017-Journal of Big Data

TL;DR: An overview of data management for smart grids is provided, the added value of Big Data technologies for this kind of data is summarized, and the technical requirements, the tools and the main steps to implement Big Data solutions in the smart grid context are discussed.

...read moreread less

Abstract: A smart grid is an intelligent electricity grid that optimizes the generation, distribution and consumption of electricity through the introduction of Information and Communication Technologies on the electricity grid. In essence, smart grids bring profound changes in the information systems that drive them: new information flows coming from the electricity grid, new players such as decentralized producers of renewable energies, new uses such as electric vehicles and connected houses and new communicating equipments such as smart meters, sensors and remote control points. All this will cause a deluge of data that the energy companies will have to face. Big Data technologies offers suitable solutions for utilities, but the decision about which Big Data technology to use is critical. In this paper, we provide an overview of data management for smart grids, summarise the added value of Big Data technologies for this kind of data, and discuss the technical requirements, the tools and the main steps to implement Big Data solutions in the smart grid context.

...read moreread less

130 citations

Book Chapter•DOI•

XMach-1: A Benchmark for XML Data Management

[...]

Timo Böhme¹, Erhard Rahm¹•Institutions (1)

Leipzig University¹

07 Mar 2001

TL;DR: A scaleable multi-user benchmark called XMach-1 (AML Data Management benchmark) is proposed, based on a web application, that considers different types of XML data, in particular text documents, schema-less data and structured data, and measures the query throughput of a system under response time constraints.

...read moreread less

Abstract: We propose a scaleable multi-user benchmark called XMach-1 (AML Data Management benchmark) for evaluating the performance of XML data management systems. It is based on a web application and considers different types of XML data, in particular text documents, schema-less data and structured data. We specify the structure of the benchmark database and the generation of its contents. Furthermore, we define a mix of XML queries and update operations for which system performance is determined. The primary performance metric, Xqps, measures the query throughput of a system under response time constraints. We will use XMach-1 to evaluate both native XML data management systems and XML-enabled relational DBMS.

...read moreread less

130 citations

Proceedings Article•DOI•

Crowdsourced Data Management: A Survey

[...]

Guoliang Li¹, Jiannan Wang², Yudian Zheng³, Michael J. Franklin⁴•Institutions (4)

Tsinghua University¹, Simon Fraser University², University of Hong Kong³, University of Chicago⁴

01 Apr 2017

TL;DR: This paper surveys and synthesizes a wide spectrum of existing studies on crowdsourced data management and outlines key factors that need to be considered to improve crowdsourcing data management.

...read moreread less

Abstract: Many important data management and analytics tasks cannot be completely addressed by automated processes. These tasks, such as entity resolution, sentiment analysis, and image recognition can be enhanced through the use of human cognitive ability. Crowdsouring is an effective way to harness the capabilities of people (i.e., the crowd) to apply human computation for such tasks. Thus, crowdsourced data management has become an area of increasing interest in research and industry. We identify three important problems in crowdsourced data management. (1) Quality Control: Workers may return noisy or incorrect results so effective techniques are required to achieve high quality, (2) Cost Control: The crowd is not free, and cost control aims to reduce the monetary cost, (3) Latency Control: The human workers can be slow, particularly compared to automated computing time scales, so latency-control techniques are required. There has been significant work addressing these three factors for designing crowdsourced tasks, developing crowdsourced data manipulation operators, and optimizing plans consisting of multiple operators. We survey and synthesize a wide spectrum of existing studies on crowdsourced data management.

...read moreread less

130 citations

Proceedings Article•DOI•

Secure and Scalable Cloud-Based Architecture for e-Health Wireless Sensor Networks

[...]

Ahmed Lounis, Abdelkrim Hadjidj, Abdelmadjid Bouabdallah, Yacine Challal

31 Aug 2012

TL;DR: This paper proposes an innovative architecture for collecting and accessing large amount of data generated by medical sensor networks and proposes an effective and flexible security mechanism that guarantees confidentiality, integrity as well as fine grained access control to outsourced medical data.

...read moreread less

Abstract: There has been a host of research works on wireless sensor networks for medical applications. However, the major shortcoming of these efforts is a lack of consideration of data management. Indeed, the huge amount of high sensitive data generated and collected by medical sensor networks introduces several challenges that existing architectures cannot solve. These challenges include scalability, availability and security. In this paper, we propose an innovative architecture for collecting and accessing large amount of data generated by medical sensor networks. Our architecture resolves all the aforementioned challenges and makes easy information sharing between healthcare professionals. Furthermore, we propose an effective and flexible security mechanism that guarantees confidentiality, integrity as well as fine grained access control to outsourced medical data. This mechanism combines several cryptographic schemes to achieve high flexibility and performance

...read moreread less

130 citations

Collapse

Network Information

Performance

Metrics

32,259

Papers

465,338

Citations

No. of papers in the topic in previous years
Year	Papers
2023	218
2022	485
2021	959
2020	1,435
2019	1,745
2018	1,719

Data management

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics