Open AccessProceedings Article
An Improved Method for Image Retrieval using Speech Annotation
J. Chen,Tele Tan,Philippe Mulhem,M. Kakanhalli +3 more
- Vol. 49, Iss: 3, pp 12-30
Reads0
Chats0
TLDR
This paper presents a system for the image indexing and retrieval using speech annotations based on a pre-defined structured syntax, and a query expansion technique is explored to enhance the query terms and to improve retrieval effectiveness.Abstract:
In this paper, we present a system for the image indexing and retrieval using speech annotations based on a pre-defined structured syntax. In addition to the introduction of N-best lists for index generation, a query expansion technique is explored to enhance the query terms and to improve retrieval effectiveness. By adding the most probable substitutions for the query terms, more relevant images are distinguished from the data collection. This approach is particularly helpful to deal with those less frequently used words, including out-of-vocabulary (OOV) words, which are very common for names of people and places. Experiments on a collection of 1,200 photos show that the retrieval effectiveness is increased considerably for segment of individual domain on People, Location and Event. With this method, the average value of precision versus recall over a combination of segments has improved significantly, from 50% to 72.4%.read more
Citations
More filters
Patent
Intelligent Automated Assistant
Thomas R. Gruber,Adam Cheyer,Dag Kittlaus,Didier Rene Guzzoni,Christopher Dean Brigham,Richard Donald Giuli,Marcello Bastea-Forte,Harry J. Saddler +7 more
TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Patent
Method and apparatus for building an intelligent automated assistant
Adam Cheyer,Didier Rene Guzzoni +1 more
TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.
Patent
Electronic Devices with Voice Command and Contextual Data Processing Capabilities
TL;DR: In this paper, an electronic device may capture a voice command from a user and store contextual information about the state of the electronic device when the voice command is received, such as a desktop computer or a remote server.
Patent
Crowd sourcing information to fulfill user requests
TL;DR: In this article, a failure to provide a satisfactory response to a user request is detected and information relevant to the user request was crowd-sourced by querying one or more crowd sourcing information sources.
Patent
Device access using voice authentication
TL;DR: In this paper, a speech input can be compared to a voiceprint (e.g., text-independent voiceprint) of the user's voice to authenticate the user to the device.
References
More filters
Book
Fundamentals of speech recognition
TL;DR: This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.
Journal ArticleDOI
Query by image and video content: the QBIC system
Myron D. Flickner,Harpreet Sawhney,W. Niblack,Jonathan Ashley,Qian Huang,Byron Dom,Monika Gorkani,James Lee Hafner,D. Lee,Dragutin Petkovic,David Steele,Peter Cornelius Yanker +11 more
TL;DR: The Query by Image Content (QBIC) system as discussed by the authors allows queries on large image and video databases based on example images, user-constructed sketches and drawings, selected color and texture patterns, camera and object motion, and other graphical information.
Proceedings ArticleDOI
VisualSEEk: a fully automated content-based image query system
John R. Smith,Shih-Fu Chang +1 more
TL;DR: The VisualSEEk system is novel in that the user forms the queries by diagramming spatial arrangements of color regions by utilizing color information, region sizes and absolute and relative spatial locations.
Journal ArticleDOI
Photobook: content-based manipulation of image databases
TL;DR: The Photobook system is described, which is a set of interactive tools for browsing and searching images and image sequences that make direct use of the image content rather than relying on text annotations to provide a sophisticated browsing and search capability.
Proceedings Article
Query by image and video content: the QBIC system
Myron D. Flickner,Harpreet Sawhney,W. Niblack,Jonathan Ashley,Qian Huang,Byron Dom,Monika Gorkani,James Lee Hafner,D. Lee,Dragutin Petkovic,David Steele,Peter Cornelius Yanker +11 more
TL;DR: The Query by Image Content (QBIC) system as mentioned in this paper allows queries on large image and video databases based on example images, user-constructed sketches and drawings, selected color and texture patterns, camera and object motion, and other graphical information.