scispace - formally typeset
Search or ask a question
Topic

Data management

About: Data management is a research topic. Over the lifetime, 31574 publications have been published within this topic receiving 424326 citations.


Papers
More filters
Proceedings Article
01 Jan 2011
TL;DR: To fully exploit DBMS features, the user must dene a schema, load the data, tune the system for the expected workload, and answer several questions, creating a formidable and time-consuming hurdle.
Abstract: Database management systems (DBMS) provide incredible exibility and performance when it comes to query processing, scalability and accuracy. To fully exploit DBMS features, however, the user must dene a schema, load the data, tune the system for the expected workload, and answer several questions. Should the database use a column-store, a row-store or some hybrid format? What indices should be created? All these questions make for a formidable and time-consuming hurdle, often deterring new applications or imposing high cost to existing ones. A characteristic example is that of scientic databases with huge data sets. The prohibitive initialization cost and complexity still forces scientists to rely on \ancient" tools for their data management tasks, delaying scientic understanding and progress.

101 citations

Journal ArticleDOI
TL;DR: The nature of data literacy is described and enumerating the related skills and the application of phenomenographic approaches to data literacy and its relationship to the digital humanities have been identified as subjects for further investigation.
Abstract: This paper describes data literacy and emphasizes its importance. Data literacy is vital for researchers who need to become data literate science workers and also for (potential) data management pr...

101 citations

Journal ArticleDOI
01 May 2010
TL;DR: The IIAS is used to close the loop between the insulin pump and the continuous glucose monitoring system, by providing the pump with the appropriate insulin infusion rate in order to keep the patient's glucose levels within predefined limits.
Abstract: SMARTDIAB is a platform designed to support the monitoring, management, and treatment of patients with type 1 diabetes mellitus (T1DM), by combining state-of-the-art approaches in the fields of database (DB) technologies, communications, simulation algorithms, and data mining. SMARTDIAB consists mainly of two units: 1) the patient unit (PU); and 2) the patient management unit (PMU), which communicate with each other for data exchange. The PMU can be accessed by the PU through the internet using devices, such as PCs/laptops with direct internet access or mobile phones via a Wi-Fi/General Packet Radio Service access network. The PU consists of an insulin pump for subcutaneous insulin infusion to the patient and a continuous glucose measurement system. The aforementioned devices running a user-friendly application gather patient's related information and transmit it to the PMU. The PMU consists of a diabetes data management system (DDMS), a decision support system (DSS) that provides risk assessment for long-term diabetes complications, and an insulin infusion advisory system (IIAS), which reside on a Web server. The DDMS can be accessed from both medical personnel and patients, with appropriate security access rights and front-end interfaces. The DDMS, apart from being used for data storage/retrieval, provides also advanced tools for the intelligent processing of the patient's data, supporting the physician in decision making, regarding the patient's treatment. The IIAS is used to close the loop between the insulin pump and the continuous glucose monitoring system, by providing the pump with the appropriate insulin infusion rate in order to keep the patient's glucose levels within predefined limits. The pilot version of the SMARTDIAB has already been implemented, while the platform's evaluation in clinical environment is being in progress.

101 citations

Journal ArticleDOI
TL;DR: This paper proposes an adaptive data collection approach on the biosensor node level that uses an early warning score system to optimize data transmission and estimates in real time the sensing frequency and presents a data fusion model on the coordinator level using a decision matrix and fuzzy set theory.
Abstract: In the past few years, wireless body sensor networks (WBSNs) emerged as a low-cost solution for healthcare applications. In WBSNs, biosensors collect periodically physiological measurement and send them to the coordinator where the data fusion process takes place. However, processing the huge amount of data captured by the limited lifetime biosensors and taking the right decisions when there is an emergency are major challenges in WBSNs. In this paper, we introduce a biosensor data management framework, starting from data collection to decision making. First, we propose an adaptive data collection approach on the biosensor node level. This approach uses an early warning score system to optimize data transmission and estimates in real time the sensing frequency. Second, we present a data fusion model on the coordinator level using a decision matrix and fuzzy set theory. To evaluate our approach, we conducted multiple series of simulations on real sensor data. The results show that our approach reduces the amount of collected data, while maintaining data integrity. In addition, we show the impact of sampling and filtering data on the accuracy of the taken decisions and compare our data fusion approach with a basic decision tree algorithm.

101 citations

Proceedings ArticleDOI
18 Jun 2014
TL;DR: This interactive demonstration will guide visitors through an exploration of several key Myria features by interfacing with the live system to analyze big datasets over the web.
Abstract: In this demonstration, we will showcase Myria, our novel cloud service for big data management and analytics designed to improve productivity. Myria's goal is for users to simply upload their data and for the system to help them be self-sufficient data science experts on their data -- self-serve analytics. Using a web browser, Myria users can upload data, author efficient queries to process and explore the data, and debug correctness and performance issues. Myria queries are executed on a scalable, parallel cluster that uses both state-of-the-art and novel methods for distributed query processing. Our interactive demonstration will guide visitors through an exploration of several key Myria features by interfacing with the live system to analyze big datasets over the web.

101 citations


Network Information
Related Topics (5)
Information system
107.5K papers, 1.8M citations
90% related
Software
130.5K papers, 2M citations
88% related
Cluster analysis
146.5K papers, 2.9M citations
83% related
The Internet
213.2K papers, 3.8M citations
82% related
Cloud computing
156.4K papers, 1.9M citations
81% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023218
2022485
2021959
20201,435
20191,745
20181,719