scispace - formally typeset
Search or ask a question
Topic

Data access

About: Data access is a research topic. Over the lifetime, 13141 publications have been published within this topic receiving 172859 citations. The topic is also known as: Data access.


Papers
More filters
07 Oct 2008
TL;DR: A new broker portal based on the experience of the project Soda aims to unify and ease the access to distributed data sources and applications providing solar resource information.
Abstract: Knowledge of the solar energy resource is essential for the planning and operation of solar energy systems. In past years there has been substantial European and national funding to develop information systems on solar radiation data, leading to the situations that several data bases exist in parallel, developed by different approaches, various spatial and temporal coverages and resolutions including those exploiting satellite data. The user of these products may end up with different results for the same requested sites. To better guide the users, a benchmarking exercise is under preparation. A set of reference data has been collected and benchmarking measures and rules have been defined. The results of the benchmarking and the feedback from stakeholders will be integrated into a guide of best practices in the application of solar resource knowledge. Access to data has been quite fragmented. Each service has its own way of access to the data and delivery format. A new broker portal based on the experience of the project Soda aims to unify and ease the access to distributed data sources and applications providing solar resource information.

49 citations

Journal ArticleDOI
08 May 2001
TL;DR: This work shows how to apply a Structured Parallel Programming methodology based on skeletons to Data Mining problems, and examines the addition of an object/component interface to the skeleton structured model, to simplify the development of environment-integrated, parallel Data Mining applications.
Abstract: We show how to apply a Structured Parallel Programming methodology based on skeletons to Data Mining problems, reporting several results about three commonly used mining techniques, namely association rules, decision tree induction and spatial clustering. We analyze the structural patterns common to these applications, looking at application performance and software engineering efficiency. Our aim is to clearly state what features a Structured Parallel Programming Environment should have to be useful for parallel Data Mining. Within the skeleton-based PPE SkIE that we have developed, we study the different patterns of data access of parallel implementations of Apriori, C4.5 and DBSCAN. We need to address large partitions reads, frequent and sparse access to small blocks, as well as an irregular mix of small and large transfers, to allow efficient development of applications on huge databases. We examine the addition of an object/component interface to the skeleton structured model, to simplify the development of environment-integrated, parallel Data Mining applications.

49 citations

Journal ArticleDOI
TL;DR: It is shown that the proposed approach provides high data availability, low bandwidth consumption, increased fault-tolerance and improved scalability of the overall system as compared to standard replica control protocols.

49 citations

Proceedings ArticleDOI
08 Jun 2011
TL;DR: A cost-intelligent data access strategy to improve the performance of parallel file systems and a hybrid data replication strategy for those applications so that a file can have replications with different layout policies for the best performance.
Abstract: I/O data access is a recognized performance bottleneck of high-end computing. Several commercial and research parallel file systems have been developed in recent years to ease the performance bottleneck. These advanced file systems perform well on some applications but may not perform well on others. They have not reached their full potential in mitigating the I/O-wall problem. Data access is application dependent. Based on the application-specific optimization principle, in this study we propose a cost-intelligent data access strategy to improve the performance of parallel file systems. We first present a novel model to estimate data access cost of different data layout policies. Next, we extend the cost model to calculate the overall I/O cost of any given application and choose an appropriate layout policy for the application. A complex application may consist of different data access patterns. Averaging the data access patterns may not be the best solution for those complex applications that do not have a dominant pattern. We then further propose a hybrid data replication strategy for those applications, so that a file can have replications with different layout policies for the best performance. Theoretical analysis and experimental testing have been conducted to verify the newly proposed cost-intelligent layout approach. Analytical and experimental results show that the proposed cost model is effective and the application-specific data layout approach achieved up to 74% performance improvement for data-intensive applications.

49 citations

Journal ArticleDOI
TL;DR: In this paper, the authors present a pro-competitive framework for issues of both ownership and access in the Free Flow of Data (FFoD) initiative, which aims to enhance the growth potential of the emerging data economy, which is characterized by the digitisation of production (smart factories) and the advent of digitised products such as smart cars or smart wearables that will be able to communicate with each other and the environment through the Internet of Things.
Abstract: As part of the project to establish a Digital Single Market the European Commission has launched a ‘Free Flow of Data’ initiative. This initiative is meant to enhance the growth potential of the emerging data economy, which is characterised by the digitisation of production (smart factories) and the advent of digitised products such as smart — driverless — cars or smart wearables that will be able to communicate with each other and the environment through the Internet of Things. Furthermore, the enormous amount of data generated and controlled by the industry could serve as a most valuable input for other new data-driven services and for applications in the public interest such as the operation of smart cities, smart and resource-efficient farming or measures to prevent the spread of infectious diseases. Obviously, this new data economy has to rely on the commercialisation of data. But what kind of regulation is needed in order to make the data economy work? Do we need new ownership rights in data? Or should regulation focus on access in order to make data as widely available as possible? The European Commission is currently working on a Communication to provide answers to these questions by January 2017. This article tries to assist the Commission by working on a pro-competitive framework for issues of both ownership and access. In so doing, this article undertakes two things: first, it analyses to what extent intellectual property laws already provide control over data and then discusses the need and justification for introducing new rules on data ownership. Second, it analyses whether EU competition law already provides remedies to promote access to data and furthermore explores whether and under which conditions introduction of new access regimes would be advisable.This article is to be considered as on-going research. It is only made available online. A later publication will take into account the Commission Communication expected for January 2017.

49 citations


Network Information
Related Topics (5)
Software
130.5K papers, 2M citations
86% related
Cloud computing
156.4K papers, 1.9M citations
86% related
Cluster analysis
146.5K papers, 2.9M citations
85% related
The Internet
213.2K papers, 3.8M citations
85% related
Information system
107.5K papers, 1.8M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202351
2022125
2021403
2020721
2019906
2018816