Memory management

About: Memory management is a research topic. Over its lifetime, 16,743 publications have been published within this topic, receiving 312,028 citations. The topic is also known as: memory allocation.


Papers
Proceedings ArticleDOI
17 Dec 1998
TL;DR: Modulo Unrolling as discussed by the authors is a code transformation technique for enabling array references to be accessed through the fast static network on a Raw machine, which allows the static communication of a large class of array accesses.
Abstract: We present modulo unrolling, a code transformation technique for enabling array references to be accessed through the fast static network on a Raw machine. A Raw machine comprises a mesh of simple, replicated tiles connected by an interconnect which supports fast, static near-neighbor communication. Like all other resources, memory is distributed across the tiles. Management of the memory can be performed by well-known techniques which generate the requisite communication code on distributed address-space architectures. On the other hand, the fast, static network provides the compiler with a simple interface to optimize such communication. This paper addresses the problem of taking advantage of such static communication for memory accesses. The requirement for static memory communication is compile-time knowledge of the exact communication required for each memory reference. This knowledge, in turn, can be obtained if a memory reference refers exclusively to memory residing on a single processing tile. We introduce modulo unrolling as a technique which allows the static communication of a large class of array accesses. We show how this technique achieves the goal of static communication by using a relatively small unroll factor. For a set of dense matrix scientific applications, we are able to access all the array references on the static network, enabling scalable speedups on the Raw machine.
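To make the idea concrete, here is a minimal C++ sketch of why unrolling by the number of tiles makes each array access resolvable at compile time. The low-order interleaved layout, names, and four-tile configuration are illustrative assumptions, not the Raw compiler's actual code generation.

```cpp
#include <cstdio>

// Illustrative sketch (not the Raw compiler's output): one array is
// low-order interleaved across NTILES memory banks, so element i lives
// on bank i % NTILES. Unrolling the loop by NTILES makes each unrolled
// reference touch one fixed bank, so its communication is known statically.
constexpr int NTILES = 4;
constexpr int N = 16;  // assume N is a multiple of NTILES for brevity

double bank[NTILES][N / NTILES];  // bank[t] models the memory on tile t

double sum_unrolled() {
    double s = 0.0;
    for (int i = 0; i < N; i += NTILES) {
        // After unrolling by NTILES, each reference below always hits the
        // same bank (0, 1, 2, 3), so a compiler could route every access
        // over the static network without any runtime address check.
        s += bank[0][(i + 0) / NTILES];
        s += bank[1][(i + 1) / NTILES];
        s += bank[2][(i + 2) / NTILES];
        s += bank[3][(i + 3) / NTILES];
    }
    return s;
}

int main() {
    // a[i] is stored as bank[i % NTILES][i / NTILES]
    for (int i = 0; i < N; ++i) bank[i % NTILES][i / NTILES] = i;
    std::printf("sum = %f\n", sum_unrolled());  // 0 + 1 + ... + 15 = 120
}
```

Without the unrolling, the bank index of a reference a[i] depends on the runtime value of i; with it, each residue class gets its own statically known bank, which is the property the paper exploits.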

68 citations

Journal ArticleDOI
TL;DR: This paper introduces a novel approach to the design of memory systems, which is based on a variety of array grouping techniques and dimensional transformations, and the binding of array groups to memory components with different dimensions, access times, and number of ports.
Abstract: This paper discusses the mapping of arrays in a behavior to memories in an implementation. We introduce a novel approach to the design of memory systems, based on a variety of array grouping techniques and dimensional transformations, and on the binding of array groups to memory components with different dimensions, access times, and numbers of ports. The results of design actions are computed in terms of memory cost, the number of wires necessary to connect the memory to the data path, and the limit of performance imposed by the memory design on the implementation. Three different procedures can be used to find a suitable memory design. All three procedures are directed by a weighted and constrained system cost function, which enables the expression of the user's design priorities. Compared to related research efforts, our approach improves performance by as much as 19%, reduces memory cost by as much as 40%, and decreases the number of wires required to connect the memory to the data path by up to 57%.
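As an illustration of the kind of weighted, constrained system cost function the three procedures optimize, the sketch below scores a candidate memory design. The field names, weights, and penalty scheme are assumptions made for exposition, not the authors' exact formulation.

```cpp
// Hypothetical illustration of a weighted, constrained system cost
// function of the kind the paper describes; names and the penalty
// scheme are assumptions, not the authors' formulation.
struct MemoryDesign {
    double memory_cost;  // e.g. estimated cost of the chosen memory parts
    int    wire_count;   // wires between the memory and the data path
    double cycle_time;   // performance limit imposed by the memory design
};

struct Weights {  // express the user's design priorities
    double w_cost, w_wires, w_perf;
};

// Returns a scalar score; constraint violations are penalized heavily
// so a search procedure prefers feasible designs.
double system_cost(const MemoryDesign& d, const Weights& w,
                   double max_cycle_time) {
    double cost = w.w_cost  * d.memory_cost
                + w.w_wires * d.wire_count
                + w.w_perf  * d.cycle_time;
    if (d.cycle_time > max_cycle_time)  // hard performance constraint
        cost += 1e9 * (d.cycle_time - max_cycle_time);
    return cost;
}
```

Raising one weight relative to the others steers all three search procedures toward designs that favor that criterion, which is how the paper lets users trade memory cost against wiring and speed.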

68 citations

Patent
03 Mar 1982
TL;DR: In a hierarchical memory system, replacement of segments in a cache memory is governed by a least recently used algorithm, while trickling of segments from the cache memory to the bulk memory is governed by the age since first write.
Abstract: In a hierarchical memory system, replacement of segments in a cache memory is governed by a least recently used algorithm, while trickling of segments from the cache memory to the bulk memory is governed by the age since first write. The host processor passes an AGEOLD parameter to the memory subsystem, and this parameter regulates the trickling of segments. Unless the memory system is idle (no I/O activity), no trickling takes place until the age of the oldest written-to segment is at least as great as AGEOLD. A command is generated for each segment to be trickled, and the priority of execution assigned to such commands is variable and determined by the relationship of AGEOLD to the oldest age since first write of any of the segments. If the subsystem receives no command from the host processor for a predetermined interval, AGEOLD is ignored and any written-to segment becomes a candidate for trickling.
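The eligibility rule the patent describes fits in a few lines. The sketch below is one illustrative reading of it, with hypothetical names and structure, not the patented implementation.

```cpp
#include <cstdint>

// Sketch of the trickle-eligibility rule described in the patent; the
// types and names here are illustrative assumptions.
struct Segment {
    bool          written;          // segment has been written to in cache
    std::uint64_t first_write_age;  // time elapsed since its first write
};

// A written-to segment becomes a candidate for trickling to bulk memory
// once its age since first write reaches AGEOLD. If the subsystem is idle
// (no I/O activity) or the host has been silent past its timeout, AGEOLD
// is ignored and any written-to segment qualifies.
bool trickle_candidate(const Segment& s, std::uint64_t ageold,
                       bool subsystem_idle, bool host_timed_out) {
    if (!s.written) return false;
    if (subsystem_idle || host_timed_out) return true;
    return s.first_write_age >= ageold;
}
```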

68 citations

Journal ArticleDOI
TL;DR: In this paper, the authors present a broad range of computational techniques to improve applicability, run time, and memory management of automatic differentiation packages, including operator overloading, region-based memory, and expression templates.
Abstract: Derivatives play a critical role in computational statistics, examples being Bayesian inference using Hamiltonian Monte Carlo sampling and the training of neural networks. Automatic differentiation is a powerful tool to automate the calculation of derivatives and is preferable to more traditional methods, especially when differentiating complex algorithms and mathematical functions. The implementation of automatic differentiation, however, requires some care to ensure efficiency. Modern differentiation packages deploy a broad range of computational techniques to improve applicability, run time, and memory management. Among these techniques are operator overloading, region-based memory, and expression templates. There also exist several mathematical techniques which can yield high performance gains when applied to complex algorithms. For example, semi-analytical derivatives can reduce by orders of magnitude the runtime required to numerically solve and differentiate an algebraic equation. Open problems include the extension of current packages to provide more specialized routines, and efficient methods to perform higher-order differentiation.
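Operator overloading, the first technique listed, can be demonstrated with a minimal forward-mode example using dual numbers. This is a generic sketch of the technique, not the implementation of any particular package the paper surveys.

```cpp
#include <cmath>
#include <cstdio>

// Minimal forward-mode automatic differentiation via operator
// overloading: a Dual carries a value and its derivative, and the
// overloaded operators propagate both through any expression.
struct Dual {
    double val;  // function value
    double dot;  // derivative with respect to the chosen input
};

Dual operator+(Dual a, Dual b) { return {a.val + b.val, a.dot + b.dot}; }
Dual operator*(Dual a, Dual b) {
    return {a.val * b.val, a.dot * b.val + a.val * b.dot};  // product rule
}
Dual sin(Dual a) { return {std::sin(a.val), std::cos(a.val) * a.dot}; }

int main() {
    Dual x{2.0, 1.0};         // seed dx/dx = 1
    Dual y = x * x + sin(x);  // y = x^2 + sin(x)
    // dy/dx = 2x + cos(x) = 4 + cos(2), approximately 3.5839
    std::printf("y = %f, dy/dx = %f\n", y.val, y.dot);
}
```

Production packages refine this basic pattern with the other techniques the paper lists, such as expression templates to fuse operations and region-based memory to allocate the recorded computation cheaply.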

68 citations

Proceedings ArticleDOI
08 Mar 2018
TL;DR: A key consideration for deep neural network (DNN) inference accelerators is the need for large, high-bandwidth external memories; although stacking a DNN accelerator with DRAMs has been proposed to meet this need, long DRAM latency remains a performance limiter.
Abstract: A key consideration for deep neural network (DNN) inference accelerators is the need for large and high-bandwidth external memories. Although an architectural concept for stacking a DNN accelerator with DRAMs has been proposed previously, long DRAM latency remains problematic and limits the performance [1]. Recent algorithm-level optimizations, such as network pruning and compression, have shown success in reducing the DNN memory size [2]; however, since networks become irregular and sparse, they induce an additional need for agile random accesses to the memory systems.
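To see why pruning induces irregular accesses, consider a pruned weight matrix stored in compressed sparse row (CSR) form: the gather on the input vector is data-dependent, which is the kind of agile random access the paper refers to. This sketch is a generic illustration, not the paper's accelerator design.

```cpp
#include <cstddef>
#include <vector>

// Sparse matrix-vector product over a pruned weight matrix in CSR form.
// The index x[col_idx[k]] depends on runtime data, so the access pattern
// into x is irregular; on a DRAM-backed memory system this is exactly the
// random-access traffic that long DRAM latency penalizes.
void spmv_csr(const std::vector<int>& row_ptr,    // size = rows + 1
              const std::vector<int>& col_idx,    // column of each nonzero
              const std::vector<float>& val,      // surviving weights
              const std::vector<float>& x,        // input activations
              std::vector<float>& y) {            // output, size = rows
    for (std::size_t r = 0; r + 1 < row_ptr.size(); ++r) {
        float acc = 0.0f;
        for (int k = row_ptr[r]; k < row_ptr[r + 1]; ++k)
            acc += val[k] * x[col_idx[k]];  // data-dependent gather
        y[r] = acc;
    }
}
```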

68 citations


Network Information
Related Topics (5)

Topic                    Papers   Citations   Related
Cache                    59.1K    976.6K      94%
Scalability              50.9K    931.6K      92%
Server                   79.5K    1.4M        89%
Virtual machine          43.9K    718.3K      87%
Scheduling (computing)   78.6K    1.3M        86%
Performance Metrics
No. of papers in the topic in previous years

Year   Papers
2023   33
2022   88
2021   629
2020   467
2019   461
2018   591