Topic

Data access

About: Data access is a research topic. Over the lifetime, 13141 publications have been published within this topic receiving 172859 citations. The topic is also known as: Data access.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Journal Article•DOI•

GraphH: A Processing-in-Memory Architecture for Large-Scale Graph Processing

[...]

Guohao Dai¹, Tianhao Huang¹, Yuze Chi², Jishen Zhao³, Guangyu Sun⁴, Yongpan Liu¹, Yu Wang¹, Yuan Xie⁵, Huazhong Yang¹ - Show less +5 more•Institutions (5)

Tsinghua University¹, University of California, Los Angeles², University of California, San Diego³, Peking University⁴, University of California, Santa Barbara⁵

01 Apr 2019-IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

TL;DR: GraphH, a PIM architecture for graph processing on the hybrid memory cube array, is proposed to tackle all four problems mentioned above, including random access pattern causing local bandwidth degradation, poor locality leading to unpredictable global data access, heavy conflicts on updating the same vertex, and unbalanced workloads across processing units.

...read moreread less

Abstract: Large-scale graph processing requires the high bandwidth of data access. However, as graph computing continues to scale, it becomes increasingly challenging to achieve a high bandwidth on generic computing architectures. The primary reasons include: the random access pattern causing local bandwidth degradation, the poor locality leading to unpredictable global data access, heavy conflicts on updating the same vertex, and unbalanced workloads across processing units. Processing-in-memory (PIM) has been explored as a promising solution to providing high bandwidth, yet open questions of graph processing on PIM devices remain in: 1) how to design hardware specializations and the interconnection scheme to fully utilize bandwidth of PIM devices and ensure locality and 2) how to allocate data and schedule processing flow to avoid conflicts and balance workloads. In this paper, we propose GraphH, a PIM architecture for graph processing on the hybrid memory cube array, to tackle all four problems mentioned above. From the architecture perspective, we integrate SRAM-based on-chip vertex buffers to eliminate local bandwidth degradation. We also introduce reconfigurable double-mesh connection to provide high global bandwidth. From the algorithm perspective, partitioning and scheduling methods like index mapping interval-block and round interval pair are introduced to GraphH, thus workloads are balanced and conflicts are avoided. Two optimization methods are further introduced to reduce synchronization overhead and reuse on-chip data. The experimental results on graphs with billions of edges demonstrate that GraphH outperforms DDR-based graph processing systems by up to two orders of magnitude and $5.12 {\times }$ speedup against the previous PIM design.

...read moreread less

135 citations

Proceedings Article•

MineSet: an integrated system for data mining

[...]

Cliff Brunk, James Kelly, Ron Kohavi

14 Aug 1997

TL;DR: MineSet supports the knowledge discovery process from data access and preparation through iterative analysis and visualization to deployment, and third party vendors can interface to the MineSet tools for model deployment and for integration with other packages.

...read moreread less

Abstract: MineSet™, Silicon Graphics' interactive system for data mining, integrates three powerful technologies: database access, analytical data mining, and data visualization. It supports the knowledge discovery process from data access and preparation through iterative analysis and visualization to deployment. Mine-Set is based on a client-server architecture that scales to large databases. The database access component provides a rich set of operators that can be used to preprocess and transform the stored data into forms appropriate for visualization and analytical mining. The 3D visualization capabilities allow direct data visualization for exploratory analysis, including tools for displaying high-dimensional data containing geographical and hierarchical information. The analytical mining algorithms help identify potentially interesting models of the data, which can be viewed using visualization tools specialized for the learned models. Third party vendors can interface to the MineSet tools for model deployment and for integration with other packages.

...read moreread less

135 citations

Journal Article•DOI•

Blockchain-based data management for digital twin of product

[...]

Huang Sihan¹, Guoxin Wang¹, Yan Yan¹, Xiongbing Fang•Institutions (1)

Beijing Institute of Technology¹

01 Jan 2020-Journal of Manufacturing Systems

TL;DR: A data management method for digital twin of product based on blockchain technology is proposed and the results show that the proposed method can solve the abovementioned data management problems simultaneously.

...read moreread less

135 citations

Journal Article•DOI•

Dynamic replication algorithms for the multi-tier Data Grid

[...]

Ming Tang¹, Bu-Sung Lee¹, Chai Kiat Yeo¹, Xueyan Tang¹•Institutions (1)

Nanyang Technological University¹

01 May 2005

TL;DR: Two dynamic replication algorithms, Simple Bottom-Up (SBU) and Aggregate Bottom- up (ABU) are proposed for the multi-tier Data Grid and comparing the two algorithms to Fast Spread dynamic replication strategy, ABU proves to be superior.

...read moreread less

Abstract: Data replication is a common method used to improve the performance of data access in distributed systems. In this paper, two dynamic replication algorithms, Simple Bottom-Up (SBU) and Aggregate Bottom-Up (ABU), are proposed for the multi-tier Data Grid. A multi-tier Data Grid simulator called DRepSim is developed for studying the performances of the dynamic replication algorithms. The simulation results show that both algorithms can reduce the average response time of data access greatly compared to the static replication method. ABU can achieve great performance improvements for all access patterns even if the available storage size of the replication server is very small. Comparing the two algorithms to Fast Spread dynamic replication strategy, ABU proves to be superior. As for SBU, although the average response time of Fast Spread is better in most cases, Fast Spread's replication frequency is too high to be applicable in the real world.

...read moreread less

134 citations

Journal Article•DOI•

Performance modeling of distributed and replicated databases

[...]

Matthias Nicola, Matthias Jarke¹•Institutions (1)

RWTH Aachen University¹

01 Jul 2000-IEEE Transactions on Knowledge and Data Engineering

TL;DR: This paper surveys performance models for distributed and replicated database systems and selects a combination of these proven modeling concepts and gives an example of how to compose a balanced analytical model of a replicated database.

...read moreread less

Abstract: The paper surveys performance models for distributed and replicated database systems. Over the last 20 years (1980-2000), a variety of such performance models have been developed and they differ in: (1) which aspects of a real system are or are not captured in the model (e.g., replication, communication, nonuniform data access, etc.); and (2) how these aspects are modeled. We classify the different alternatives and modeling assumptions and discuss their interdependencies and expressiveness for the representation of distributed databases. This leads to a set of building blocks for analytical performance models. To illustrate the work that is surveyed, we select a combination of these proven modeling concepts and give an example of how to compose a balanced analytical model of a replicated database. We use this example to show how to derive meaningful performance values and to discuss the applicability and expressiveness of performance models for distributed and replicated databases. Finally, we compare the analytical results to measurements in a distributed database system.

...read moreread less

133 citations

Collapse

Network Information

Performance

Metrics

13,314

Papers

188,075

Citations

No. of papers in the topic in previous years
Year	Papers
2023	51
2022	125
2021	403
2020	721
2019	906
2018	816

Data access

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics