scispace - formally typeset
D

Dirk Weissenborn

Researcher at Google

Publications -  44
Citations -  15572

Dirk Weissenborn is an academic researcher from Google. The author has contributed to research in topics: Question answering & Recurrent neural network. The author has an hindex of 19, co-authored 40 publications receiving 2987 citations. Previous affiliations of Dirk Weissenborn include Dresden University of Technology & German Research Centre for Artificial Intelligence.

Papers
More filters
Posted Content

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

TL;DR: Vision Transformer (ViT) attains excellent results compared to state-of-the-art convolutional networks while requiring substantially fewer computational resources to train.
Proceedings Article

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

TL;DR: The Vision Transformer (ViT) as discussed by the authors uses a pure transformer applied directly to sequences of image patches to perform very well on image classification tasks, achieving state-of-the-art results on ImageNet, CIFAR-100, VTAB, etc.
Proceedings Article

Object-Centric Learning with Slot Attention

TL;DR: An architectural component that interfaces with perceptual representations such as the output of a convolutional neural network and produces a set of task-dependent abstract representations which are exchangeable and can bind to any object in the input by specializing through a competitive procedure over multiple rounds of attention is presented.
Posted Content

Axial Attention in Multidimensional Transformers

TL;DR: Axial Transformers is proposed, a self-attention-based autoregressive model for images and other data organized as high dimensional tensors that maintains both full expressiveness over joint distributions over data and ease of implementation with standard deep learning frameworks, while requiring reasonable memory and computation.