scispace - formally typeset
D

Diane Larlus

Researcher at Xerox

Publications -  82
Citations -  6174

Diane Larlus is an academic researcher from Xerox. The author has contributed to research in topics: Computer science & Object (computer science). The author has an hindex of 27, co-authored 69 publications receiving 4722 citations. Previous affiliations of Diane Larlus include Technische Universität Darmstadt & Naver Corporation.

Papers
More filters
Journal ArticleDOI

Data-Driven Detection of Prominent Objects

TL;DR: This article studies a fundamentally different solution by formulating the supervised prediction of a bounding box as an image retrieval task, and uses a representation of images as object probability maps computed from low-level patch classifiers.
Proceedings ArticleDOI

StacMR: Scene-Text Aware Cross-Modal Retrieval

TL;DR: In this paper, the authors propose a new dataset that allows exploration of cross-modal retrieval where images contain scene-text instances, and describe several approaches which leverage scene text, including a better scenetext aware cross-mode retrieval method which uses specialized representations for text from captions and text from the visual scene, and reconcile them in a common embedding space.

Fake it till you make it: Learning transferable representations from synthetic ImageNet clones

TL;DR: In this paper , the authors explore the ability of Stable Diffusion to generate synthetic clones of ImageNet and measure how useful these are for training classification models from scratch, and they show that with minimal and class-agnostic prompt engineering, ImageNet clones are able to close a large part of the gap between models produced by synthetic images and models trained with real images.
Journal ArticleDOI

Capturing the Geometry of Object Categories from Video Supervision

TL;DR: An unsupervised method to learn the 3D geometry of object categories by looking around them using only video sequences showing object instances from a moving viewpoint, which achieves state-of-the-art results on viewpoint prediction, depth estimation, and 3D point cloud estimation on public benchmarks.
Proceedings Article

Category Level Object Segmentation