A
Antonio Torralba
Researcher at Massachusetts Institute of Technology
Publications - 437
Citations - 105763
Antonio Torralba is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Computer science & Object detection. The author has an hindex of 119, co-authored 388 publications receiving 84607 citations. Previous affiliations of Antonio Torralba include Vassar College & Nvidia.
Papers
More filters
Proceedings Article
Deep Audio Priors Emerge From Harmonic Convolutional Networks
Zhoutong Zhang,Yunyun Wang,Chuang Gan,Jiajun Wu,Joshua B. Tenenbaum,Antonio Torralba,William T. Freeman +6 more
TL;DR: Harmonic Convolution is proposed, an operation that helps deep networks distill priors in audio signals by explicitly utilizing the harmonic structure within by engineering the kernel to be supported by sets of harmonic series, instead of local neighborhoods for convolutional kernels.
Proceedings Article
Learning visual biases from human imagination
TL;DR: A novel method is introduced that, inspired by well-known tools in human psychophysics, estimates the biases that the human visual system might use for recognition, but in computer vision feature spaces, and presents an SVM formulation that constrains the orientation of the SVM hyperplane to agree with the bias from human visualSystem.
Proceedings ArticleDOI
Synthesizing Environment-Aware Activities via Activity Sketches
TL;DR: This work builds upon VirtualHome, to create a new dataset VirtualHome-Env, where it collects program sketches to represent activities and match programs with environments that can afford them, and proposes RNN-ResActGraph, a network that generates a program from a given sketch and an environment graph and tracks the changes in the environment induced by the program.
Journal ArticleDOI
ConceptFusion: Open-set Multimodal 3D Mapping
Krishna Murthy Jatavallabhula,Ali Kuwajerwala,Qiao Gu,Mohd. Omama,Tao Chen,Shuang Li,Ganesh M. Iyer,Soroush Saryazdi,Nikhil Varma Keetha,Ayush Tewari,Joshua B. Tenenbaum,Celso M. de Melo,M. Krishna,Liam Paull,Florian Shkurti,Antonio Torralba +15 more
TL;DR: ConceptFusion as mentioned in this paper is a scene representation that is fundamentally open-set, enabling reasoning beyond a closed set of concepts and inherently multimodal, enabling a diverse range of possible queries to the 3D map, from language, to images, to audio, to 3D geometry, all working in concert.
Proceedings ArticleDOI
How to Make a Pizza: Learning a Compositional Layer-Based GAN Model
TL;DR: In this article, a generative adversarial network (GAN) is proposed to learn composable module operations that can either add or remove a particular ingredient in a food recipe, which can be seen as a way to change the visual appearance of a dish by adding extra objects or changing the appearance of the existing ones.