scispace - formally typeset
S

Santosh K. Divvala

Researcher at Allen Institute for Artificial Intelligence

Publications -  25
Citations -  29702

Santosh K. Divvala is an academic researcher from Allen Institute for Artificial Intelligence. The author has contributed to research in topics: Object detection & Semantic similarity. The author has an hindex of 17, co-authored 25 publications receiving 17317 citations. Previous affiliations of Santosh K. Divvala include Carnegie Mellon University & University of Washington.

Papers
More filters
Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
Proceedings ArticleDOI

An empirical study of context in object detection

TL;DR: This paper presents an empirical evaluation of the role of context in a contemporary, challenging object detection task - the PASCAL VOC 2008, using top-performing local appearance detectors as baseline and evaluates several different sources of context and ways to utilize it.
Posted Content

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: YOLO as discussed by the authors predicts bounding boxes and class probabilities directly from full images in one evaluation, which can be optimized end-to-end directly on detection performance, and achieves state-of-the-art performance.
Proceedings ArticleDOI

Learning Everything about Anything: Webly-Supervised Visual Concept Learning

TL;DR: A fully-automated approach for learning extensive models for a wide range of variations within any concept, which leverages vast resources of online books to discover the vocabulary of variance, and intertwines the data collection and modeling steps to alleviate the need for explicit human supervision in training the models.
Proceedings ArticleDOI

Asynchronous Temporal Fields for Action Recognition

TL;DR: This work proposes a fully-connected temporal CRF model for reasoning over various aspects of activities that includes objects, actions, and intentions, where the potentials are predicted by a deep network.