scispace - formally typeset
Open AccessProceedings Article

An Improved Method for Image Retrieval using Speech Annotation

J. Chen, +3 more
- Vol. 49, Iss: 3, pp 12-30
Reads0
Chats0
TLDR
This paper presents a system for the image indexing and retrieval using speech annotations based on a pre-defined structured syntax, and a query expansion technique is explored to enhance the query terms and to improve retrieval effectiveness.
Abstract
In this paper, we present a system for the image indexing and retrieval using speech annotations based on a pre-defined structured syntax. In addition to the introduction of N-best lists for index generation, a query expansion technique is explored to enhance the query terms and to improve retrieval effectiveness. By adding the most probable substitutions for the query terms, more relevant images are distinguished from the data collection. This approach is particularly helpful to deal with those less frequently used words, including out-of-vocabulary (OOV) words, which are very common for names of people and places. Experiments on a collection of 1,200 photos show that the retrieval effectiveness is increased considerably for segment of individual domain on People, Location and Event. With this method, the average value of precision versus recall over a combination of segments has improved significantly, from 50% to 72.4%.

read more

Citations
More filters
Patent

Intelligent Automated Assistant

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Patent

Method and apparatus for building an intelligent automated assistant

TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.
Patent

Electronic Devices with Voice Command and Contextual Data Processing Capabilities

TL;DR: In this paper, an electronic device may capture a voice command from a user and store contextual information about the state of the electronic device when the voice command is received, such as a desktop computer or a remote server.
Patent

Crowd sourcing information to fulfill user requests

TL;DR: In this article, a failure to provide a satisfactory response to a user request is detected and information relevant to the user request was crowd-sourced by querying one or more crowd sourcing information sources.
Patent

Device access using voice authentication

TL;DR: In this paper, a speech input can be compared to a voiceprint (e.g., text-independent voiceprint) of the user's voice to authenticate the user to the device.
References
More filters
Book

Fundamentals of speech recognition

TL;DR: This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.
Journal ArticleDOI

Query by image and video content: the QBIC system

TL;DR: The Query by Image Content (QBIC) system as discussed by the authors allows queries on large image and video databases based on example images, user-constructed sketches and drawings, selected color and texture patterns, camera and object motion, and other graphical information.
Proceedings ArticleDOI

VisualSEEk: a fully automated content-based image query system

TL;DR: The VisualSEEk system is novel in that the user forms the queries by diagramming spatial arrangements of color regions by utilizing color information, region sizes and absolute and relative spatial locations.
Journal ArticleDOI

Photobook: content-based manipulation of image databases

TL;DR: The Photobook system is described, which is a set of interactive tools for browsing and searching images and image sequences that make direct use of the image content rather than relying on text annotations to provide a sophisticated browsing and search capability.
Proceedings Article

Query by image and video content: the QBIC system

TL;DR: The Query by Image Content (QBIC) system as mentioned in this paper allows queries on large image and video databases based on example images, user-constructed sketches and drawings, selected color and texture patterns, camera and object motion, and other graphical information.