scispace - formally typeset
T

Thomas M. Breuel

Researcher at Nvidia

Publications -  240
Citations -  10811

Thomas M. Breuel is an academic researcher from Nvidia. The author has contributed to research in topics: Optical character recognition & Image segmentation. The author has an hindex of 43, co-authored 237 publications receiving 9547 citations. Previous affiliations of Thomas M. Breuel include Google & Xerox.

Papers
More filters
Posted Content

On the Convergence of SGD Training of Neural Networks

TL;DR: Empirical results presented here suggest that local minima are not significant factors in SGD optimization of MLP-related objective functions, and that the behavior of stochastic gradient descent in these problems is better described as the simultaneous convergence at different rates of many, largely non-interacting subproblems.
Proceedings ArticleDOI

Recognition Driven Page Orientation Detection

TL;DR: The proposed Recognition Driven Page Orientation Detection (RD-POD) uses the OCR engine for estimating the right orientation with a few lines of the document image and is highly robust and accurate, and is able to detect multiple orientations.
Journal ArticleDOI

A branch and bound algorithm for finding the modes in kernel density estimates

TL;DR: This work proposes a novel, globally optimal branch and bound algorithm for finding the modes in kernel densities, which gives statistically significantly more accurate solutions than the mean shift and the mode localization accuracy is about five times more precise.
Journal ArticleDOI

Displacement-Invariant Cost Computation for Stereo Matching

TL;DR: In this article , the authors propose a displacement-invariant cost computation module to compute the matching costs without needing a 4D feature volume, which can be applied on each disparity-shifted feature map pair independently.

Participation at TRECVID 2011 Semantic Indexing & Content-based Copy Detection Tasks.

TL;DR: This paper describes the TRECVID 2011 participation of the IUPR-DFKI team in the semantic indexing task (SIN) and content based copy detection task (CCD) task, which was dominated by an significant increase of vocabulary concept size.