Semantic Multimedia
read more
Citations
Designing core ontologies
A survey of semantic image and video annotation tools
Connecting the dots: a multi-pivot approach to data exploration
Editorial: Using ontologies with UML class-based modeling: The TwoUse approach
Guidelines for Linked Data generation and publication: An example in building energy consumption
References
The Nature of Statistical Learning Theory
Support-Vector Networks
An Introduction to Support Vector Machines and Other Kernel-based Learning Methods
Face recognition using eigenfaces
Towards a Better Understanding of Context and Context-Awareness
Related Papers (5)
Frequently Asked Questions (15)
Q2. What are the future works in "Semantic multimedia" ?
The authors briefly motivate and summarize them in order to give an outlook to future work. In a future work, it will be very interesting to elaborate how the work on the five types of semantics defined in the ecosystem, the layers of intelligence considered in the WeKnowIt project, and the work in the field of semiotics can be integrated. Looking at the current state of the art, a future research issue is providing efficient support for a recursive querying in structured multimedia content over a large dataset. A multimedia ontology like COMM presented in Section 5 is an annotation model that can be used to organize and structure multimedia semantics.
Q3. What are the three patterns used to annotate a multimedia document?
The decomposition pattern handles the structure of a multimedia document, while the media annotation pattern, the content annotation pattern, and the semantic annotation pattern are useful for annotating the media, the features, and the semantic content of the multimedia document respectively.
Q4. What are the properties of the Contour Shape descriptor?
The Contour Shape descriptor has a number of important properties, namely: (i) it captures very well characteristic features of the shape, enabling similarity-based retrieval; (ii) it reflects properties of the perception of human visual system and offers good generalization; (iii) it is robust to non-rigid motion; (iv) it is robust to partial occlusion of the shape; (v) it is robust to perspective transformations which result from the changes of the camera parameters and are common in images and video; (vi) it is compact.
Q5. How is the canonical representation of the separating hyperplane obtained?
A canonical representation of the separating hyperplane is obtained by rescaling the pair (v, b) into the pair (v′, b′) in such a way that the distance of the closest feature vector equals |v′|−1.
Q6. What are the current formats used for serializing data type concepts?
simple string representation formats are used for serializing data type concepts (e.g., rectangle) that are currently not covered by W3C standards.
Q7. What is the way to describe a multimedia ontology?
Given that the (semantic) web is an important repository of both media assets and annotations, a semantic description of the multimedia ontology should be expressible in a web language such as OWL, RDF/XML, or RDFa11.
Q8. What is the definition of a texture descriptor?
The Homogeneous Texture descriptor is designed to characterize the properties of texture in an image (or region), based on the assumption that the texture is homogeneous, i.e., the visual properties of the texture are relatively constant over the region.
Q9. What is the mechanism used to drop a concept on an ontology?
Using a simple drag&drop mechanism, a region is dropped on a concept or an instance of the ontology, which creates an according annotation.
Q10. What is the disadvantage of the segmentation-based approach?
the segmentation-based approach often suffers from errors due to loss of image details or other inaccuracies resulting from the segmentation process.
Q11. What is the need for a large amount of memory and computation power?
According to current state of the art, for analysis with a large number of variables a large amount of memory and computation power is needed.
Q12. How can the authors avoid reasoning at runtime for many queries?
Using a pre-classified COMM and some comparable simple query rewriting, the authors are able to completely avoid reasoning at runtime for many queries.
Q13. What is the definition of a decomposition of a multimedia data entity?
Following the D&S pattern, the authors consider that a decomposition of a multimedia-data entity is a situation (a segment-decomposition) that satisfies a description such as a segmentation-algorithm or a method (e.g., a user drawing a bounding box around a depicted face), which has been applied to perform the decomposition, see Fig. 4-B.
Q14. What is the specialization of the pattern for describing image decompositions?
The specialization of the pattern for describing image decompositions is shown in Fig. 5-F. According to MPEG-7, an image or an image segment (image-data) can be composed into still regions.
Q15. What is the future challenge of using provenance?
With respect to provenance, a future challenge is to leverage this information to make decisions about the trustworthiness of specific statements made about the multimedia content.