scispace - formally typeset
Search or ask a question

Showing papers on "Multimedia database published in 2005"


Patent
13 Jun 2005
TL;DR: In this paper, a database search system that retrieves multimedia information in a flexible, user friendly system is presented, which uses a multimedia database consisting of text, picture, audio, and animated data.
Abstract: A database search system that retrieves multimedia information in a flexible, user friendly system. The search system uses a multimedia database consisting of text, picture, audio and animated data. That database is searched through multiple graphical and textual entry paths. Those entry paths include an idea search, a title finder search, a topic tree search, a picture explorer search, a history timeline search, a world atlas search, a researcher's assistant search, and a feature articles search.

232 citations


Journal ArticleDOI
TL;DR: The CODAC Project, led by Harald Kosch, implements different multimedia processes and ties them together in the life cycle through metadata, which stores content and MPEG-7-based metadata.
Abstract: During its lifetime, multimedia content undergoes different stages or cycles from production to consumption. Content is created, processed or modified in a postproduction stage, delivered to users, and finally, consumed. Metadata, or descriptive data about the multimedia content, pass through similar stages but with different time lines. Metadata may be produced, modified, and consumed by all actors involved in the content production-consumption chain. At each step of the chain, different kinds of metadata may be produced by highly different methods and of substantially different semantic value. Different metadata let us tie the different multimedia processes in a life cycle together. However, to employ these metadata, they must be appropriately generated. The CODAC Project, led by Harald Kosch, implements different multimedia processes and ties them together in the life cycle. CODAC uses distributed systems to implement multimedia processes. The project's core component is a multimedia database management system (MMDBMS) which stores content and MPEG-7-based metadata. It communicates with a streaming server for data delivery. The database is realized in the multimedia data cartridge (MDC) - which is an extension of the Oracle database management system - to handle multimedia content and MPEG-7 metadata.

52 citations


Patent
26 Feb 2005
TL;DR: In this paper, a client and server are directed at providing a simplified scheme for delivering multimedia messages from mobile to mobile via wireless networks and internet, where the multimedia can be either resident on mobile devices or stored in a multimedia database on a server.
Abstract: A method, client and server are directed at providing a simplified scheme for delivering multimedia messages from mobile to mobile via wireless networks and internet. The multimedia can be either resident on mobile devices or stored in a multimedia database on a server. The present invention provides a server that receives and stores uploaded multimedia; generates a unique id and a link for accessing the multimedia message; collects mobile device info from request; converts the multimedia message to formats and dimensions supported by the mobile device; delivers the multimedia message to mobile devices. The present invention also provides a mobile client application that can capture or locate multimedia content on mobile device; upload multimedia messages; formulate and send notification message with an embedded link to other mobile devices via mobile network.

50 citations


Book ChapterDOI
01 Jan 2005
TL;DR: In this chapter, an innovative shot boundary detection method using an unsupervised segmentation algorithm and the technique of object tracking based on the segmentation mask maps is presented and results show that the method can obtain object-level information of the video frames as well as accurateShot boundary detection, which are very useful for video content indexing.
Abstract: Recently, multimedia information, especially video data, has been made overwhelmingly accessible with the rapid advances in communication and multimedia computing technologies. Video is popular in many applications, which makes the efficient management and retrieval of the growing amount of video information very important. Toward such a demand, an effective video shot boundary detection method is necessary, which is a fundamental operation required in many multimedia applications. In this chapter, an innovative shot boundary detection method using an unsupervised segmentation algorithm and the technique of object tracking based on the segmentation mask maps is presented. A series of experiments on various types of video types are performed, and the experimental results show that our method can obtain object-level information of the video frames as well as accurate shot boundary detection, which are very useful for video content indexing. 701 E. Chocolate Avenue, Suite 200, Hershey PA 17033-1240, USA Tel: 717/533-8845; Fax 717/533-8661; URL-http://www.irm-press.com IRM PRE S This chapter appears in the book, Video Data Management and Information Retrieval by Sagarmay Deb. Copyright © 2005, IRM Press, an imprint of Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. 218 Chen, Shyu, & Zhang Copyright © 2005, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. INTRODUCTION Unlike traditional database systems that have text or numerical data, a multimedia database or information system may contain different media such as text, image, audio, and video. Video, in particular, has become more and more popular in many applications such as education and training, video conferencing, video-on-demand (VOD), and news services. The traditional way for the users to search for certain content in a video is to fast-forward or rewind, which are sequential processes, making it difficult for the users to browse a video sequence directly based on their interests. Hence, it becomes important to be able to organize video data and provide the visual content in compact forms in multimedia applications (Zabih, Miller, & Mai, 1995). In many multimedia applications such as digital libraries and VOD, video shot boundary detection is fundamental and must be performed prior to all other processes (Shahraray, 1995; Zhang & Smoliar, 1994). A video shot is a video sequence that consists of continuous video frames for one action, and shot boundary detection is an operation to divide the video data into physical video shots. Many video shot boundary detection methods have been proposed in the literature. Most of them use low-level global features in the matching process between two consecutive frames for shot boundary detection, for example, using the luminance pixel-wise difference (Zhang, Kankanhalli, & Smoliar, 1993), luminance or color histogram difference (Swanberg, Shu, & Jain, 1993), edge difference (Zabih et al., 1995), and the orientation histogram (Ngo, Pong, & Chin, 2000). However, these low-level features cannot provide satisfactory results for shot boundary detection since luminance or color is sensitive to small changes. For example, Yeo and Liu (1995) proposed a method that uses the luminance histogram difference of DC images, which is very sensitive to luminance changes. There are also approaches focusing on the compressed video data domain. For example, Lee, Kim, and Choi (2000) proposed a fast scene/shot change detection method, and Hwang and Jeong (1998) proposed the directional information retrieving method by using the discrete cosine transform (DCT) coefficients in MPEG video data. In addition, dynamic and adaptive threshold determination is also applied to enhance the accuracy and robustness of the existing techniques in shot cuts detection (Alattar, 1997; Gunsel, Ferman, & Tekalp, 1998; Truong, Dorai, & Venkatesh, 2000). In Gunsel et al. (1998), the unsupervised clustering algorithm proposed a generic technique that does not need threshold setting and allows multiple features to be used simultaneously; while an adaptive threshold determination method that reduces the artifacts created by noise and motion in shot change detection was proposed by Truong et al. (2000). In this chapter, we present an innovative shot boundary detection method using an unsupervised image-segmentation algorithm and the object-tracking technique on the uncompressed video data. In our method, the image-segmentation algorithm extracts the segmentation mask map of each video frame automatically, which can be deemed as the clustering feature map of each frame and where the pixels in each frame have been grouped into different classes (e.g., two classes). Then the difference between the segmentation mask maps of two frames is checked. Moreover, due to camera panning and tilting, we propose an object-tracking method based on the segmentation results to enhance the matching. The cost for object tracking is almost trivial since the segmentation results are already available. In addition, the bounding boxes and the positions of 18 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/chapter/innovative-shot-boundarydetection-video/30767?camid=4v1 This title is available in InfoSci-Books, InfoSci-Database Technologies, Library Science, Information Studies, and Education, InfoSci-Library Information Science and Technology. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=1

47 citations


Journal ArticleDOI
TL;DR: This work deals with the problem of expressing MTGM data by XML and of managing MTGM clinical data through an XML-based system, and provides both a technique for translating (a part of) an MTGM database into an XML document and some techniques allowing us to obtain presentations defined by means of the Synchronized Multimedia Integration Language (SMIL) from MTGM presentations.

17 citations



Book ChapterDOI
13 Nov 2005
TL;DR: The algorithm, called Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure (GBFCM(DM), is a neural network-based algorithm which utilizes the DivergenceMeasure to exploit the statistical nature of the audio data to improve the classification accuracy.
Abstract: Multimedia databases usually store thousands of audio files such as music, speech and other sounds. One of the challenges in modern multimedia system is to classify and retrieve certain kinds of audio from the database. This paper proposes a novel classification algorithm for a content-based audio retrieval. The algorithm, called Gradient-Based Fuzzy c-Means Algorithm with Divergence Measure (GBFCM(DM)), is a neural network-based algorithm which utilizes the Divergence Measure to exploit the statistical nature of the audio data to improve the classification accuracy. Experiment results confirm that the proposed algorithm outperforms 3.025%-5.05% in accuracy in comparison with conventional algorithms such as the k-Means or the Self-Organizing Map.

13 citations


Patent
08 Sep 2005
TL;DR: In this paper, a method to generate a multimedia database that represents a collection of files having different file extensions is presented, which includes extracting and storing metadata in multimedia objects stored in the multimedia database.
Abstract: A computer readable medium and method to generate a multimedia database that represents a collection of files having different file extensions is provided. The method includes receiving the collection of files from a file system, extracting and storing metadata in multimedia objects stored in the multimedia database. The metadata includes fast and slow metadata associated with the collection of files, which are extracted from the collection of files. Policies associated with the file extensions are applied to the collection of files to determine the fast metadata that should be extracted before the slow metadata. The multimedia objects are created to store the fast metadata and references to the collection of files. After extracting the fast metadata from the collection of files, the slow metadata is extracted and stored in the multimedia objects.

11 citations


Book ChapterDOI
14 Sep 2005
TL;DR: A fast video retrieval algorithm that bases its search core on the statistics of object motion, which starts with extracting object motions from a shot and then transforms/quantizes them into the form of probability distributions.
Abstract: It is a very important issue to quickly retrieve semantic information from a vast multimedia database. In this paper, we propose a statistic-based algorithm to retrieve the videos that contain the requested object motion from video database. In order to speed up our algorithm, we only utilize the local motion embedded in the region-of-interest as the query to retrieve data from MPEG bitstreams. Experimental results demonstrate that our fast video retrieval algorithm is powerful in terms of accuracy and efficiency.

10 citations


Proceedings Article
01 Oct 2005
TL;DR: A scenario in which a mobile agent searches a multimedia database on behalf of its user, is used to provide a common ground for discussion of the legal and technical issues involved.
Abstract: Mobile agents traverse the Internet, often on behalf of their users. Intelligent search agents access information in dynamic heterogeneous environments. The legal and technical implications of the use of agents in such situations are not fully understood. In this paper a scenario in which a mobile agent searches a multimedia database on behalf of its user, is used to provide a common ground for discussion of the legal and technical issues involved. Requirements related to identity management, integrity, traceability and availability are identified and discussed in the context of existing technology.

9 citations


Book ChapterDOI
13 Oct 2005
TL;DR: The color distribution, the mean value and the standard deviation, of an image as global information for image retrieval is used and the common bitmap is used to represent the local characteristics of the image.
Abstract: Image retrieval has emerged as an important problem in multimedia database management. This paper uses the color distribution, the mean value and the standard deviation, of an image as global information for image retrieval. Furthermore, this paper uses the common bitmap to represent the local characteristics of the image. The performance of the method is tested on three different image databases consisting of 410, 235, and 10,235 images. The third database has been partitioned into 10 categories for exploring the category retrieval ability. According to the experimental results, we find that the proposed method can effectively retrieve more similar images than other methods and the category ability is also higher than others. In addition, the total memory space for saving the image features of the proposed method is less than other methods.

Proceedings ArticleDOI
TL;DR: This paper provides a comparison and evaluation of a range of 3-D shape descriptors and distance metrics introduced into the SCULPTEUR project to demonstrate their use for content-based retrieval applications and shows that while particular descriptors provide good overall performance, it can be more appropriate to choose different descriptors for different search tasks.
Abstract: The growing number of large multimedia collections has led to an increased interest in content-based retrieval research. Applications of content-based techniques to image retrieval is an active research area but much less work has been reported on content-based retrieval of 3-D objects in a multimedia database context. Increasingly such objects are being captured and added to multimedia collections and the European project, SCULPTEUR, is developing a museum information system which includes the introduction of facilities for content-based retrieval of the 3-D representations. This paper provides a comparison and evaluation of a range of 3-D shape descriptors and distance metrics which have been introduced into the SCULPTEUR project to demonstrate their use for content-based retrieval applications. Results show that while particular descriptors and distance metrics provide good overall performance, it can be more appropriate to choose different descriptors for different search tasks.

Proceedings ArticleDOI
08 Apr 2005
TL;DR: This research proposes a new content-based retrieval method based on a multi-objective genetic algorithm (MOGA), which is capable of finding multiple trade-off solutions in one run and providing a natural way for integrating multiple image representation schemes.
Abstract: Content-based retrieval from multimedia databases is an important multimedia research area where traditional keyword-based approaches are not adequate. Multimedia data is significantly different from alphanumeric data because multimedia data is generally meaningless to a human and multimedia objects are typically large. Moreover, the traditional keyword-based approaches require an enormous amount of human effort during manual annotation and maintaining the consistency of annotations throughout database evolution. Research on content-based retrieval focus on using low-level features like color and texture for image representation, and a geometric framework of distances in the feature space for similarity. However, systematic retrieval of the best matches in a large multimedia database requires exhaustive and exponential search and does not guarantee worst-case performance. In addition, it has been observed that certain image representation schemes perform better than others under certain query situations, and these schemes should be somehow integrated and adjusted on the fly to facilitate effective and efficient image retrieval. Some work has been done applying simple genetic algorithms for content-based retrieval to provide good, but not necessary optimal solutions. However, these simple genetic algorithms can find only one optimum solution in a single run. This research proposes a new content-based retrieval method based on a multi-objective genetic algorithm (MOGA), which is capable of finding multiple trade-off solutions in one run and providing a natural way for integrating multiple image representation schemes. This research focuses on structural similarity framework that addresses topological, directional and distance relations of image objects.

Proceedings ArticleDOI
01 Oct 2005
TL;DR: The following topics are dealt with: content-aware video coding; video communication; video processing; audio processing; multimedia processing; peer-peer media communication; multimedia networking; human perception; multimedia database; human-machine interface; networks and coding; multimedia systems.
Abstract: The following topics are dealt with: content-aware video coding; video communication; video processing; audio processing; multimedia processing; peer-peer media communication; multimedia networking; human perception; multimedia database; human-machine interface; networks and coding; multimedia systems; biomedical application; multimedia assurance; image processing

01 Jan 2005
TL;DR: This paper proposes an object-oriented multimedia representation model and overviews content-based searching in text, image, and video database systems.
Abstract: Multimedia Database Systems (MMDS) support rich data types, such as text, images, video, and sound. Queries in MMDSs may refer to the content of the stored multimedia objects. This is called content-based querying. However, manual entry of content descriptions is very difficult and subjective. A better approach is to provide automatic content-based retrieval through image, text, and sound interpretation. To support queries by content in a MMDS, multimedia data should be analyzed so that description of their content can be extracted and stored in the database together with the original data. These descriptions are then used to search the MMDS and determine which multimedia objects satisfy the query selection criteria. Because content-based queries tend to be imprecise, database search should be approximate and multimedia objects up to a prespecified degree of similarity with the query specification should be retrieved. This implies the definition of some distance measure between the query and the stored multimedia objects which captures what humans perceive as similarity between the objects. The contents of the multimedia objects may be queried from different aspects, depending on the type of the multimedia objects. For example, subject queries apply to all multimedia types whereas spatial queries apply only to images and video, and temporal queries apply only to video. This paper proposes an object-oriented multimedia representation model and overviews content-based searching in text, image, and video database systems.

Proceedings ArticleDOI
06 Jul 2005
TL;DR: A new approach of approximating the selectivity of multimedia range queries using the DBSCAN clustering technique for finding high density areas in the data set with the help of a density function in combination with the volume of the query's hyper sphere.
Abstract: This paper introduces a new approach of approximating the selectivity of multimedia range queries. Estimating the selectivity of a range query is a pre-requisite to optimize a multimedia database query. We use the DBSCAN clustering technique for finding high density areas in the data set. Then, the selectivity is approximated with the help of a density function in combination with the volume of the query's hyper sphere. Our approach is fast and accurate which was evaluated on an image data set using the MPEG-7 scalable color descriptor. The technique is integrated with the help of the extensible optimizer architecture in the Oracle multimedia database system.

Book ChapterDOI
27 Aug 2005
TL;DR: This paper proposes a three-layer replay scene based video abstraction in MPEG sports video that can achieve real time performance in the MPEG compressed domain, which is validated by experimental results.
Abstract: Video abstraction can be useful in multimedia database indexing and querying and can illustrate the important content of a longer video to quick browsing. Further, in sports video, replay scene often demonstrates the highlight of the video. The detection of replay scene in the sports video is a key clue to sports video summarizing. In this paper, we present a framework of replay scene based video abstraction in MPEG sports video. Moreover, we detect identical events using color and camera information after detecting replay scene using MPEG feature. At last, we propose a three-layer replay scene based sports video abstraction. It can achieve real time performance in the MPEG compressed domain, which is validated by experimental results.

Book ChapterDOI
01 Jan 2005
TL;DR: A novel concept to deal with the problem of quick query response in multimedia data management under wireless mobile environments by making use of the concept of quality of service (QoS) to achieve adaptive query processing is presented.
Abstract: Among the challenges of multimedia and mobile computing, providing a mechanism for data retrieval in multimedia databases under wireless mobile environments is one of the most difficult issues (Shih, 2001). Up to now, the fundamental technologies that are specialized for wireless mobile, multimedia environments are not mature in object-oriented, object-relational, as well as relational databases (Hillborg, 2002; Ramakrishnan & Gehrke, 2003; Watson, 2004). An important issue is how to ensure quick query response for the users. If a user found out that the retrieved multimedia object is neither interesting nor useful after it is displayed, then the time and bandwidth used for transmitting the multimedia objects have already been wasted. In order to save precious time and expensive bandwidth, it could be a good idea to let users browse objects at an acceptable resolution without paying much attention to the details or at the limited device display capability. This article presents a novel concept to deal with this problem by making use the concept of quality of service (QoS) to achieve adaptive query processing. In general, traditional QoS management is defined as the necessary supervision and control to ensure that the desired quality of service properties are attained and sustained, which applies both to continuous media interactions and to discrete interactions (Chalmers & Sloman, 1999). QoS thus consists of a set of specific requirements for a particular service provided by a network to users. However, little work has been done in extending QoS principles to multimedia data management in wireless network environments.

Proceedings ArticleDOI
05 Dec 2005
TL;DR: A new image retrieval technique based on concept lattices is presented, named Concept Lattices- Based Image Retrieval, lattice browsing allows one to reach a group of images via one path and a method of attribute reduction of concept lattice based on discernibility matrix and boolean calculation is proposed.
Abstract: With the development of the World-Wide Web, there has been an explosion in the volume of multimedia database. In order to search desired information on such huge and quick changing internet, more exact and faster retrieval tools should be developed. In this paper, a new image retrieval technique based on concept lattices is presented, named Concept Lattices- Based Image Retrieval, lattice browsing allows one to reach a group of images via one path. Because it can produce a lot of redundancies attributes when constructing the concept lattices by using a general method, this paper also proposes a method of attribute reduction of concept lattices based on discernibility matrix and boolean calculation to reduce the context of concept lattices. The scale of the problem is reduced by using this method. At the same time, the efficiency of image retrieval is improved, which is reflected in our experiment.

Proceedings ArticleDOI
12 Jan 2005
TL;DR: The modeling of output constraints is shown, which shows that the multimedia database system must guarantee that the stored multimedia data and the data output of multimedia data are according to the defined output constraints.
Abstract: Constraints are used in traditional database systems to define consistent database states. For multimedia data it is also important to define constraints for a correct data output. The producer of multimedia data should specify constraints for a correct data output. We show the modeling of output constraints. The multimedia database system must guarantee that the stored multimedia data and the data output of multimedia data are according to the defined output constraints. For that an efficient check of output constraints must be possible.

Journal ArticleDOI
TL;DR: A novel video mining algorithm that uses genetic programming for evolved rule-based scene boundary detection and whose key advantage is that is video content independent is presented, which can be applied without modification to different video sequences just by feeding it different training data.
Abstract: Segmentation of video data stream aims to divide the stream into temporally shorter, meaningful and manageable segments. This is the first step towards content-based multimedia database management, contented-based retrieval and browsing, and is very important to many other applications that aim to work with content. This paper presents a novel video mining algorithm that uses genetic programming for evolved rule-based scene boundary detection and whose key advantage is that is video content independent. Hence, the algorithm can be applied without modification to different video sequences just by feeding it different training data.

01 Jan 2005
TL;DR: Five types of smelling objects were implemented that were used for storing and retrieving olfactory information together with audio-visual information in/from a multimedia database system.
Abstract: The paper presents our view on the ways that olfactory information could be integrated into multimedia database applications by means of smelling objects. Until recently, multimedia applications have been mainly limited to the use of audio- visual information, stimulating only two of the five senses of human beings. The development of olfactory devices, along with significant progress in understanding the human olfactory system in the last years offers tremendous opportunities for building multimedia applications stimulating also the sense of smell. This could bring important advance in fields like education, medicine, military, and entertainment in the near future. We implemented five types of smelling objects that we used for storing and retrieving olfactory information together with audio-visual information in/from a multimedia database system.

Book ChapterDOI
24 Oct 2005
TL;DR: The utilization of the UML framework is described for the instantiation of a model for an image database of scanned handwritten music scores and the advantages of using UML for representing multimedia data as well as shortcomings are discussed.
Abstract: To support the design of data models for multimedia applications, we employ the concept of a framework introduced in object-oriented design. We define a UML framework, which can be used for deriving application-specific multimedia database models. With the UML framework, we define the core elements of a multimedia database model, such as mediatype- and application-independent structure, content, relationships and operations. Thereby, the advantages of using UML for representing multimedia data as well as shortcomings of this approach are discussed. Furthermore, we describe the utilization of the UML framework for the instantiation of a model for an image database of scanned handwritten music scores.

Proceedings ArticleDOI
23 Oct 2005
TL;DR: LLV1 has been developed on the basis of XviD, a state-of-the-art implementation of the MPEG-4 Part 2 standard and is designed for use in multimedia servers to facilitate real-time format conversions, a requirement to reach data-independent access to media objects.
Abstract: For multimedia servers, data independence is as beneficial as it is for databases. It means that users access the data without referring to the storage format, and the server returns them in many different formats and qualities. The storage format can then be chosen at will, and it should be selected to support a large variety of accesses. When looking at video, some of the accesses even require real-time processing. LLV1 is a layered video format for storing videos without loss of information. Its layers can be read separately, so that scalability is achieved in terms of bandwidth and computational resources. LLV1 has been developed on the basis of XviD, a state-of-the-art implementation of the MPEG-4 Part 2 standard, and is designed for use in multimedia servers to facilitate real-time format conversions, a requirement to reach data-independent access to media objects. Thus, XviD's advantages in efficient video compression are inherited by LLV1. Orthogonality of the layering is provided by the different enhancement layers in respect to temporal resolution and spatial properties. The compression efficiency is comparable to other lossless formats, however only LLV1 provides scalability features, which can be exploited in real-time processing. Moreover, the scalable design of the decompression algorithm allows for adaptable execution and thus makes QoS control possible. Additionally, the coding algorithm is asymmetric, which further reduces the computational requirements for delivering the multimedia content from storage to the end user.

Proceedings Article
11 Jul 2005
TL;DR: The Department of Informatics of the Aristotle University of Thessaloniki in collaboration with AUTh's Department of History and Archaeology in the framework of the SOCRATES-MINERVA SEEArch Web project has developed a system for recording multiple excavations driven by GIS services, based on a flexible multimedia database.
Abstract: Considering the problems in archaeological data recording and analysis, the Department of Informatics of the Aristotle University of Thessaloniki (AUTh) in collaboration with AUTh's Department of History and Archaeology in the framework of the SOCRATES-MINERVA SEEArch Web project has developed a system for recording multiple excavations driven by GIS services, based on a flexible multimedia database. The system provides all its services over the Internet. Its innovation lies in the fact that it is promoting a generic template supporting data from different excavations while older implementations were custom made for the specific method implemented in each excavation. Emphasis is also given to the development of instructional models organizing the existing educational resources of the system.

Patent
17 Jun 2005
TL;DR: In this article, a system of dynamically displaying an associated message in a message employed in a handheld data processing device, including a multimedia database storing a plurality of sets of multimedia data, a keyword identification module capable of identifying the content of a draft message and a sent message based on a predetermined keyword list to generate a set of keyword information.
Abstract: A System of dynamically displaying an associated message in a message employed in a handheld data processing device, including a multimedia database storing a plurality of sets of multimedia data, a keyword identification module capable of identifying the content of a draft message and a sent message based on a predetermined keyword list to generate a set of keyword information, a keyword search module searching a set of multimedia data corresponding to the set of keyword information from the multimedia database and an associated message prompt module generating the associated message based on the searched set of a set of multimedia data and displaying the associated message on the handheld data processing device.

Book ChapterDOI
13 Dec 2005
TL;DR: A semantic-based multimedia database system that enables users to search the meaning of video data in a diverse manner that uses both semantic and dependency weights to perform video retrieval for the environment education of infants is proposed.
Abstract: To effectively deal with video data, a semantic-based retrieval scheme that allows for processing diverse user queries and saving them on the database is required. This paper proposes a semantic-based multimedia database system that enables users to search the meaning of video data in a diverse manner. It uses both semantic and dependency weights to perform video retrieval for the environment education of infants. In the proposed system, the user searches multimedia data for environment education through entering keywords. The mobile agent then computes both semantic and dependency weights, ensuring the accuracy of data retrieved using calculated weights as annotative information of key frames. As a result of implementing and testing the prototype of the proposed system, a higher precision of approximately 96.5% was obtained.

Book ChapterDOI
25 Jul 2005
TL;DR: There will be presented the new way of application of structural methods of pattern recognition, in the form of picture languages, for creation of knowledge representation for medical pattern analysis and understanding.
Abstract: In the paper, there will be presented the new way of application of structural methods of pattern recognition, in the form of picture languages, for creation of knowledge representation for medical pattern analysis and understanding. Specialist medical databases storing data in visual form constitute a large group among multimedia databases; those patterns originate from numerous diagnostic examinations of practically all organs of the human body. One of the main problems in the fast accessing and interpretation of such visual information is the way of transformation of these visual data into a form enabling intelligent analysis and understanding of medical meaning of these patterns. The basic problems leading to achieve this goal is the method of representation and indexing the important objects constituting the data contents. In the paper we describe some examples presenting ways of applying picture languages techniques in the creation of intelligent cognitive knowledge representation for selected class of medical images.

Journal Article
TL;DR: The disadvantage of present Network Examination Systems is described, and the view of using B/S Architecture and Multimedia Database in the system is pointed out.
Abstract: This paper describes the disadvantage of present Network Examination Systems, and points out the view of using B/S Architecture and Multimedia Database in the system. Then it makes a detailed research on the selection of Multimedia data model, and provides the implementation methods of the storage and access in an actual Network Examination System which is based on B/S Architecture.

Proceedings Article
01 Sep 2005
TL;DR: This paper presents a novel browsing technique based on a new indexing scheme, the Hierarchical Cellular Tree, which is designed to bring an effective solution especially for indexing large-scale multimedia databases and furthermore to provide an enhanced browsing capability, which enables user to make a guided tour within the database.
Abstract: One of the challenges in the development of content-based multimedia retrieval application is to achieve an efficient browsing and navigation scheme. Since browsing requires the capability of handling the entire database, a particular visualization system and tool(s) for navigation should be provided. Otherwise, browsing may turn out to be a disorienting process. Database items should be organized and especially for large databases the underlying organization scheme such as the indexing structure should provide a hierarchical representation of the database. This paper presents a novel browsing technique based on a new indexing scheme, the Hierarchical Cellular Tree, which is designed to bring an effective solution especially for indexing large-scale multimedia databases and furthermore to provide an enhanced browsing capability, which enables user to make a guided tour within the database. A pre-emptive cell search mechanism is introduced in order to prevent the corruption of large multimedia item collections due to the limited discrimination obtained from visual and aural descriptors. In addition, similar items are focused within appropriate cellular structures, which will be subject to mitosis operations when the dissimilarity emerges as a result of irrelevant item insertions. Mitosis operations ensure to keep the cells in a focused and compact form and yet the cells can grow into any dimension as long as compactness prevails. Experimental results show that the HCT indexing body can conveniently be used for efficient browsing and navigation operations among the multimedia database items.