Topic

Data management

About: Data management is a research topic. Over its lifetime, 31,574 publications have been published within this topic, receiving 424,326 citations.


Papers
Patent
01 Feb 2011
TL;DR: A method for retrieving data from one of multiple copies of the data, in which a data management system receives a request identifying at least one data object to be accessed and determines which of the located copies to use based on each copy's indicated availability.
Abstract: A method in a computer system for retrieving data from one of multiple copies of the data is provided, referred to as the data management system. The data management system receives a request identifying at least one data object to be accessed. Then, the data management system queries a metabase to locate data copies that contain the identified at least one data object, wherein the data copies are created from similar source data, and wherein for each data copy the metabase contains an indication of the availability of the copy relative to other copies. Next, the data management system determines one of the located data copies to use to access the identified at least one data object, wherein the determination is made based on the indicated availability contained in the metabase for each of the located data copies. Then, the data management system accesses the identified at least one data object using the determined one of the located data copies.

279 citations
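
The retrieval steps in the patent abstract above (receive a request, query the metabase for copies holding the object, pick a copy by indicated availability, then access the object) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `DataCopy` record, the list-based `metabase`, the copy names, and the convention that a lower `availability` number means a more readily available copy are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataCopy:
    """One copy of the source data as recorded in the metabase (hypothetical schema)."""
    name: str
    objects: frozenset   # identifiers of the data objects held in this copy
    availability: int    # assumed ranking: lower value = more readily available


def locate_copies(metabase, object_id):
    """Query the metabase for every copy that contains the requested object."""
    return [copy for copy in metabase if object_id in copy.objects]


def choose_copy(metabase, object_id):
    """Pick the located copy with the best (lowest) availability rank."""
    candidates = locate_copies(metabase, object_id)
    if not candidates:
        raise LookupError(f"no copy contains {object_id!r}")
    return min(candidates, key=lambda copy: copy.availability)


# Hypothetical metabase contents, for illustration only.
metabase = [
    DataCopy("tape-archive", frozenset({"report.doc", "ledger.db"}), availability=3),
    DataCopy("disk-snapshot", frozenset({"report.doc"}), availability=1),
    DataCopy("offsite-backup", frozenset({"ledger.db"}), availability=2),
]

print(choose_copy(metabase, "report.doc").name)
print(choose_copy(metabase, "ledger.db").name)
```

Here the availability indication is reduced to a single integer rank per copy; the abstract only says that the metabase records each copy's availability relative to the others, so any real ranking scheme may differ.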

Journal Article
TL;DR: The generally high quality of the Manitoba registry file and the hospital claims is supported by comparisons with other data sources, and some of the research possibilities associated with population registries and administrative data are outlined.
Abstract: In this article the organization and accuracy of the population registry and administrative data base in Manitoba, Canada are discussed. The overall data management strategy and a framework for analyzing the accuracy of such data are presented. The generally high quality of the Manitoba registry file (necessary to track individuals over time) and the hospital claims is supported by comparisons with other data sources. Hospital claims' main quality problems concern the reliability of certain secondary diagnoses and the level of aggregation necessary for reasonable agreement with other data collection methods (such as chart reviews). Finally, some of the research possibilities associated with population registries and administrative data are outlined.

279 citations

Journal Article
TL;DR: This paper reviews big data challenges from a data management perspective, and discusses big data diversity, big data reduction, big data integration and cleaning, big data indexing and query, and finally big data analysis and mining.
Abstract: There is a trend that virtually everyone, ranging from big Web companies to traditional enterprises to physical science researchers to social scientists, is either already experiencing or anticipating unprecedented growth in the amount of data available in their world, as well as new opportunities and great untapped value. This paper reviews big data challenges from a data management perspective. In particular, we discuss big data diversity, big data reduction, big data integration and cleaning, big data indexing and query, and finally big data analysis and mining. Our survey gives a brief overview of big-data-oriented research and problems.

278 citations

Patent
29 Jan 1999
TL;DR: A common access method is disclosed that enables disparate pervasive computing devices to interact with centralized data management systems; the data management system has a plurality of data managers in one or more layers of a layered architecture.
Abstract: A common access method is disclosed to enable disparate pervasive computing devices to interact with centralized data management systems. A modular, scalable data management system is envisioned to further expand the role of the pervasive devices as direct participants in the data management system. This data management system is provided with a plurality of data managers in one or more layers of a layered architecture. Using a data manager and input from a user or pervasive computing device via an API, the system performs a plurality of processes on data residing in heterogeneous data repositories of a computer system, including promotion, check-in, check-out, locking, library searching, setting and viewing process results, tracking aggregations, and managing parts, releases and problem fix data, under management control of a virtual control repository having one or more physical heterogeneous repositories. The system provides for storing, accessing, and tracking data residing in said one or more data repositories managed by the virtual control repository. DMS applications executing directly within, or on behalf of, the pervasive computing device organize data using the PFVL paradigm. Configurable managers include a query control repository for existence of peer managers and provide logic switches to dynamically interact with peers. A control repository layer provides a common process interface across all managers. A command translator performs the appropriate mapping of generic control repository layer calls to the required function for the underlying storage engine.

277 citations

Journal Article
TL;DR: A mobiscope is a federation of distributed mobile sensors into a taskable sensing system that achieves high-density sampling coverage over a wide area through mobility; mobiscopes introduce challenges in data management and integrity, privacy, and network system design.
Abstract: Mobiscopes extend the traditional sensor network model, introducing challenges in data management and integrity, privacy, and network system design. Researchers need an architecture and general methodology for designing future mobiscopes. A mobiscope is a federation of distributed mobile sensors into a taskable sensing system that achieves high-density sampling coverage over a wide area through mobility.

277 citations


Network Information
Related Topics (5)
Information system
107.5K papers, 1.8M citations
90% related
Software
130.5K papers, 2M citations
88% related
Cluster analysis
146.5K papers, 2.9M citations
83% related
The Internet
213.2K papers, 3.8M citations
82% related
Cloud computing
156.4K papers, 1.9M citations
81% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    218
2022    485
2021    959
2020    1,435
2019    1,745
2018    1,719