A
Abhishek Das
Researcher at Facebook
Publications - 61
Citations - 15366
Abhishek Das is an academic researcher from Facebook. The author has contributed to research in topics: Dialog box & Computer science. The author has an hindex of 27, co-authored 52 publications receiving 9447 citations. Previous affiliations of Abhishek Das include Georgia Institute of Technology.
Papers
More filters
Proceedings ArticleDOI
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
Ramprasaath R. Selvaraju,Michael Cogswell,Abhishek Das,Ramakrishna Vedantam,Devi Parikh,Dhruv Batra +5 more
TL;DR: This work combines existing fine-grained visualizations to create a high-resolution class-discriminative visualization, Guided Grad-CAM, and applies it to image classification, image captioning, and visual question answering (VQA) models, including ResNet-based architectures.
Journal ArticleDOI
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
Ramprasaath R. Selvaraju,Michael Cogswell,Abhishek Das,Ramakrishna Vedantam,Devi Parikh,Devi Parikh,Dhruv Batra,Dhruv Batra +7 more
TL;DR: Grad-CAM as mentioned in this paper uses the gradients of any target concept (e.g., a dog in a classification network or a sequence of words in captioning network) flowing into the final convolutional layer to produce a coarse localization map highlighting the important regions in the image for predicting the concept.
Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization
Ramprasaath R. Selvaraju,Abhishek Das,Ramakrishna Vedantam,Michael Cogswell,Devi Parikh,Dhruv Batra +5 more
TL;DR: It is shown that Guided Grad-CAM helps untrained users successfully discern a "stronger" deep network from a "weaker" one even when both networks make identical predictions, and also exposes the somewhat surprising insight that common CNN + LSTM models can be good at localizing discriminative input image regions despite not being trained on grounded image-text pairs.
Proceedings Article
Visual Dialog
Abhishek Das,Satwik Kottur,Khushi Gupta,Avi Singh,Deshraj Yadav,Jose M. F. Moura,Devi Parikh,Dhruv Batra +7 more
TL;DR: In this article, the authors introduce the task of Visual Dialog, which requires an AI agent to hold a meaningful dialog with humans in natural, conversational language about visual content, given an image, a dialog history and a question about the image, the agent has to ground the question in image, infer context from history, and answer the question accurately.
Journal Article
Visual Dialog
Abhishek Das,Satwik Kottur,Khushi Gupta,Avi Singh,Deshraj Yadav,Stefan Lee,Jose M. F. Moura,Devi Parikh,Dhruv Batra +8 more
TL;DR: The authors introduced the task of Visual Dialog, which requires an AI agent to hold a meaningful dialog with humans in natural, conversational language about visual content, given an image, a dialog history and a question about the image, the agent has to ground the question in image, infer context from history, and answer the question accurately.