scispace - formally typeset
Book ChapterDOI

Text-Based Semantic Video Annotation for Interactive Cooking Videos

Reads0
Chats0
TLDR
A text-based semantic video annotation system for interactive cooking videos to facilitate user interactions and is superior to existing alignment algorithms and effective in semantic cooking video annotation.
Abstract
Videos represent one of the most frequently used forms of multimedia applications. In addition to watching videos, people control slider bars of video players to find specific scenes and want detailed information on certain objects in scenes. However, it is difficult to support user interactions in current video formats because of a lack of metadata for facilitating such interactions. This paper proposes a text-based semantic video annotation system for interactive cooking videos to facilitate user interactions. The proposed annotation process includes three parts: the synchronization of recipes and corresponding cooking videos based on a caption-recipe alignment algorithm; the information extraction of food recipes based on lexico-syntactic patterns; and the semantic interconnection between recognized entities and web resources. The experimental results show that the proposed system is superior to existing alignment algorithms and effective in semantic cooking video annotation.

read more

Citations
More filters
Journal ArticleDOI

The Proof is in the Pudding: Using Automated Theorem Proving to Generate Cooking Recipes

Louis Mahon, +1 more
- 05 Mar 2022 - 
TL;DR: FASTFOOD is presented, a rule-based Natural Language Generation Program for cooking recipes which generates recipes by using an Automated Theorem Proving procedure to select the ingredients and instructions, with ingredients corresponding to axioms and instructions to implications.
References
More filters
Journal ArticleDOI

Event detection and recognition for semantic annotation of video

TL;DR: This paper surveys the field of event recognition, from interest point detectors and descriptors, to event modelling techniques and knowledge management technologies, and provides an overview of the methods, categorising them according to video production methods and video domains.
Book ChapterDOI

Movie/Script: Alignment and Parsing of Video and Text Transcription

TL;DR: A weakly supervised algorithm is presented that uses the screenplay and closed captions to parse a movie into a hierarchy of shots and scenes, and the recovered structure is used to improve character naming and retrieval of common actions in several episodes of popular TV series.
Proceedings ArticleDOI

Cooking navi: assistant for daily cooking in kitchen

TL;DR: According to the result of a preliminary experiment, all users from novice to experienced cooks could finish two dishes in parallel while enjoyeing the cooking very much, shows the effectiveness of the multimedia navigation that is proposed.

Using lexico-syntactic ontology design patterns for ontology creation and population

TL;DR: This paper refine the patterns using a term extraction tool and some semantic restrictions derived from WordNet and VerbNet, in order to prevent the overgeneration that occurs with the use of the Ontology Design Patterns for this purpose.
Proceedings ArticleDOI

Lexico-semantic pattern matching as a companion to parsing in text understanding

TL;DR: The most useful forms of pre-processing for text interpretation use fairly superficial analysis that complements the style of ordinary parsing but uses much of the same knowledge base, and Lexico-semantic pattern matching is a good method for this form of analysis.
Related Papers (5)