scispace - formally typeset
D

Dhruv Batra

Researcher at Georgia Institute of Technology

Publications -  272
Citations -  43803

Dhruv Batra is an academic researcher from Georgia Institute of Technology. The author has contributed to research in topics: Question answering & Dialog box. The author has an hindex of 69, co-authored 272 publications receiving 29938 citations. Previous affiliations of Dhruv Batra include Facebook & Toyota Technological Institute at Chicago.

Papers
More filters
Posted Content

Evaluating Visual Conversational Agents via Cooperative Human-AI Games

TL;DR: In this paper, a cooperative game called GuessWhich is designed to measure human-AI team performance in the specific context of the AI being a visual conversational agent, where the AI is provided an image which is unseen by the human and the human questions the AI about this secret image to identify it from a fixed pool of images.
Patent

Producing object cutouts in topically related images

TL;DR: In this paper, a method for extracting an object out of each image in a group of digital images that contain the object, which includes selecting a seed image from the group of images, and displaying the seed image to a user, is presented.
Book ChapterDOI

Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation

TL;DR: In this paper, the authors introduce a learning-based approach for room navigation using semantic maps, which learns to predict top-down belief maps of regions that lie beyond the agent's field of view while modeling architectural and stylistic regularities in houses.
Proceedings ArticleDOI

Semi-Supervised Clustering via Learnt Codeword Distances.

TL;DR: Inspired by the success of recent bag-of-words models, the proposed technique learns non-parametric distance metrics over codewords from these equivalence (and optionally, in-equivalence) constraints, which are then able to propagate back to compute a dissimilarity measure between any two points in the feature space.
Proceedings ArticleDOI

CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog.

TL;DR: The CLEVR-Dialog dataset as discussed by the authors is a large diagnostic dataset for studying multi-round reasoning in visual dialog, which contains 5 instances of 10-round dialogs for about 85k CLEVR images, totaling to 4.25M question-answer pairs.