Open Access · Journal Article · DOI

The tensor algebra compiler

TLDR
taco, introduced in this paper, is a C++ library that automatically generates kernels for compound tensor algebra operations on dense and sparse tensors; such operations arise in machine learning, data analytics, engineering, and the physical sciences.
Abstract
Tensor algebra is a powerful tool with applications in machine learning, data analytics, engineering and the physical sciences. Tensors are often sparse and compound operations must frequently be computed in a single kernel for performance and to save memory. Programmers are left to write kernels for every operation of interest, with different mixes of dense and sparse tensors in different formats. The combinations are infinite, which makes it impossible to manually implement and optimize them all. This paper introduces the first compiler technique to automatically generate kernels for any compound tensor algebra operation on dense and sparse tensors. The technique is implemented in a C++ library called taco. Its performance is competitive with best-in-class hand-optimized kernels in popular libraries, while supporting far more tensor operations.
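The abstract's key point is that compound operations should be computed in a single kernel rather than as separate passes with temporaries. As a minimal sketch (plain Python, not taco itself, with hypothetical names), here is the kind of fused kernel taco would generate for y = A*x + z with A stored in the CSR sparse format:

```python
# Sketch of a fused sparse kernel: y = A*x + z in one pass over A,
# instead of computing t = A*x into a temporary and then y = t + z.

def fused_spmv_add(n, pos, crd, vals, x, z):
    """Compute y = A*x + z for an n-row CSR matrix A.

    pos  -- row pointer array (length n+1)
    crd  -- column indices of the stored nonzeros
    vals -- values of the stored nonzeros
    """
    y = [0.0] * n
    for i in range(n):
        acc = z[i]                      # seeding with z[i] fuses the add
        for p in range(pos[i], pos[i + 1]):
            acc += vals[p] * x[crd[p]]  # accumulate row i of A*x
        y[i] = acc
    return y

# 2x3 matrix A = [[1, 0, 2], [0, 3, 0]] in CSR form
pos, crd, vals = [0, 2, 3], [0, 2, 1], [1.0, 2.0, 3.0]
x = [1.0, 1.0, 1.0]
z = [10.0, 20.0]
print(fused_spmv_add(2, pos, crd, vals, x, z))  # [13.0, 23.0]
```

The single pass avoids materializing the intermediate A*x in memory, which is exactly the performance and memory argument the abstract makes; taco generates such fused loops automatically for arbitrary expressions and format mixes.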


Citations
Proceedings ArticleDOI

Timeloop: A Systematic Approach to DNN Accelerator Evaluation

TL;DR: Timeloop's underlying models and algorithms are described in detail, and results from case studies enabled by Timeloop reveal that dataflow and memory-hierarchy co-design plays a critical role in optimizing energy efficiency.
Posted Content

Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions

TL;DR: Contributes Tensor Comprehensions, a language close to the mathematics of deep learning that offers both imperative and declarative styles; a polyhedral just-in-time compiler that converts a mathematical description of a deep learning DAG into a CUDA kernel with delegated memory management and synchronization; and a compilation cache populated by an autotuner.
Proceedings ArticleDOI

ExTensor: An Accelerator for Sparse Tensor Algebra

TL;DR: Proposes the ExTensor accelerator, which builds novel ideas for handling sparsity into hardware to enable better bandwidth utilization and compute throughput, and evaluates it on several kernels against industry libraries and state-of-the-art tensor algebra compilers.
Posted Content

Learning to Optimize Tensor Programs

TL;DR: In this article, a learning-based framework is introduced to optimize tensor programs for deep learning workloads, such as matrix multiplication and high dimensional convolution, which are key enablers of effective deep learning systems.
Journal ArticleDOI

Efficient Processing of Deep Neural Networks

TL;DR: This book provides a structured treatment of the key principles and techniques for enabling efficient processing of deep neural networks (DNNs).
References
Journal ArticleDOI

The NumPy Array: A Structure for Efficient Numerical Computation

TL;DR: In this article, the authors show how to improve the performance of numerical code built on NumPy arrays by vectorizing calculations, avoiding copies of data in memory, and minimizing operation counts.
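The vectorization idea this summary names can be illustrated with a small sketch (my own example, not taken from the paper): replacing an interpreted per-element Python loop with a single call into NumPy's optimized C routines.

```python
import numpy as np

def sum_of_squares_loop(a):
    # One interpreted Python iteration per element: slow for large arrays.
    total = 0.0
    for v in a:
        total += v * v
    return total

def sum_of_squares_vectorized(a):
    # A single call into optimized compiled code, no Python-level loop.
    return float(np.dot(a, a))

a = np.arange(1000, dtype=np.float64)
# Integer-valued inputs this small are exact in float64, so both agree exactly.
assert sum_of_squares_loop(a) == sum_of_squares_vectorized(a)
```

The vectorized form also avoids creating an intermediate `a * a` array that `np.sum(a * a)` would allocate, touching on the paper's other themes of avoiding copies and minimizing operation counts.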
Journal ArticleDOI

The university of Florida sparse matrix collection

TL;DR: The University of Florida Sparse Matrix Collection, a large and actively growing set of sparse matrices that arise in real applications, is described, and a new multilevel coarsening scheme is proposed to facilitate visualization of the matrices.