scispace - formally typeset
Open AccessJournal ArticleDOI

Characterness: An Indicator of Text in the Wild

Reads0
Chats0
TLDR
This work proposes a bottom-up approach to the problem of detecting general text in images, which reflects the characterness of an image region, and develops three novel cues that are tailored for character detection and a Bayesian method for their integration.
Abstract
Text in an image provides vital information for interpreting its contents, and text in a scene can aid a variety of tasks from navigation to obstacle avoidance and odometry. Despite its value, however, detecting general text in images remains a challenging research problem. Motivated by the need to consider the widely varying forms of natural text, we propose a bottom-up approach to the problem, which reflects the characterness of an image region. In this sense, our approach mirrors the move from saliency detection methods to measures of objectness. In order to measure the characterness, we develop three novel cues that are tailored for character detection and a Bayesian method for their integration. Because text is made up of sets of characters, we then design a Markov random field model so as to exploit the inherent dependencies between characters. We experimentally demonstrate the effectiveness of our characterness cues as well as the advantage of Bayesian multicue integration. The proposed text detector outperforms state-of-the-art methods on a few benchmark scene text detection data sets. We also show that our measurement of characterness is superior than state-of-the-art saliency detection models when applied to the same task.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Text-Attentional Convolutional Neural Network for Scene Text Detection

TL;DR: A new system for scene text detection by proposing a novel text-attentional convolutional neural network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components and a powerful low-level detector called contrast-enhancement maximally stable extremal regions (MSERs) is developed.
Journal ArticleDOI

TextField: Learning a Deep Direction Field for Irregular Scene Text Detection

TL;DR: TextField as discussed by the authors learns a direction field pointing away from the nearest text boundary to each text point, which is represented by an image of 2D vectors and learned via a fully convolutional neural network.
Proceedings ArticleDOI

WordSup: Exploiting Word Annotations for Character Based Text Detection

TL;DR: A weakly supervised framework that can utilize word annotations, either in tight quadrangles or the more loose bounding boxes, for character detector training is proposed, able to train a robust character detector by exploiting word annotations in the rich large-scale real scene text datasets, e.g. ICDAR15 and COCO-text.
Journal ArticleDOI

TextBoxes++: A Single-Shot Oriented Scene Text Detector

TL;DR: TextBoxes++ as discussed by the authors is an end-to-end trainable fast scene text detector, which detects arbitrary-oriented scene text with both high accuracy and efficiency in a single network forward pass.
Posted Content

WordSup: Exploiting Word Annotations for Character based Text Detection

TL;DR: Zhang et al. as discussed by the authors proposed a weakly supervised framework that can utilize word annotations, either in tight quadrangles or the more loose bounding boxes, for character detector training.
References
More filters
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI

A model of saliency-based visual attention for rapid scene analysis

TL;DR: In this article, a visual attention system inspired by the behavior and the neuronal architecture of the early primate visual system is presented, where multiscale image features are combined into a single topographical saliency map.
Journal ArticleDOI

Fast approximate energy minimization via graph cuts

TL;DR: This work presents two algorithms based on graph cuts that efficiently find a local minimum with respect to two types of large moves, namely expansion moves and swap moves that allow important cases of discontinuity preserving energies.
Proceedings ArticleDOI

Frequency-tuned salient region detection

TL;DR: This paper introduces a method for salient region detection that outputs full resolution saliency maps with well-defined boundaries of salient objects that outperforms the five algorithms both on the ground-truth evaluation and on the segmentation task by achieving both higher precision and better recall.
Proceedings ArticleDOI

Global contrast based salient region detection

TL;DR: This work proposes a regional contrast based saliency extraction algorithm, which simultaneously evaluates global contrast differences and spatial coherence, and consistently outperformed existing saliency detection methods.
Related Papers (5)