scispace - formally typeset
Search or ask a question

Showing papers on "Multimedia database published in 2003"


Book
24 Nov 2003
TL;DR: Semantics in Multimedia Databases Mobility Multimedia Middleware Appendix A Abbreviations and Acronyms Index.
Abstract: 15 Introduction Multimedia Content: Context Multimedia Systems and Databases (Multi)Media Data and Multimedia Metadata Purpose and Organization of the Book MPEG-7: The Multimedia Content Description Standard Introduction MPEG-7 and Multimedia Database Systems Principles for Creating MPEG-7 Documents MPEG-7 Description Definition Language Step-by-Step Approach for Creating an MPEG-7 Document Extending the Description Schema of MPEG-7 Encoding and Decoding of MPEG-7 Documents for Delivery-Binary Format for MPEG-7 Audio Part of MPEG-7 MPEG-7 Supporting Tools and Reference Software MPEG-21: The Multimedia Framework Standard MPEG-21 Concepts and Structure Digital Item Declaration Intellectual Property Management and Protection, Rights Expression Language, and Rights and Data Dictionary Digital Item Adaptation Interoperable MPEG: From MPEG-1 to MPEG-21 Ongoing MPEG-21 Research Projects Multimedia Database Management Systems Multimedia Indexing Multimedia Index Structures Multimedia Data Models: Implementation and Communication Content-Based Retrieval Multimedia Query Languages, Optimization, and Processing Multimedia Database Management System Products Summary Distributed Multimedia Database Systems Architectural Considerations Video Servers and Streaming Multimedia Communication Client-Design Issues Content Adaptation Summary Concluding Remarks Summary of the Book Semantics in Multimedia Databases Mobility Multimedia Middleware Appendix A Abbreviations and Acronyms Index

101 citations


Journal ArticleDOI
TL;DR: A learning-based automatic framework is proposed to support the multimedia data indexing and querying of spatio-temporal relationships of vehicle objects in a traffic video sequence and the results show that the proposed framework is effective in automating data collection and access for complex traffic situations.
Abstract: One key technology of intelligent transportation systems is the use of advanced sensor systems for on-line surveillance to gather detailed information on traffic conditions. Traffic video analysis can provide a wide range of useful information to traffic planners. In this context, the object-level indexing of video data can enable vehicle classification, traffic flow analysis, incident detection and analysis at intersections, vehicle tracking for traffic operations, and update of design warrants. In this paper, a learning-based automatic framework is proposed to support the multimedia data indexing and querying of spatio-temporal relationships of vehicle objects in a traffic video sequence. The spatio-temporal relationships of vehicle objects are captured via the proposed unsupervised image/video segmentation method and object tracking algorithm, and modeled using a multimedia augmented transition network model and multimedia input strings. An efficient and effective background learning and subtraction technique is employed to eliminate the complex background details in the traffic video frames. It substantially enhances the efficiency of the segmentation process and the accuracy of the segmentation results to enable more accurate video indexing and annotation. The paper uses four real-life traffic video sequences from several road intersections under different weather conditions in the study experiments. The results show that the proposed framework is effective in automating data collection and access for complex traffic situations.

77 citations


Journal ArticleDOI
TL;DR: This work investigates new image representations/kernels derived from probabilistic models of the class of images considered and presents a new feature selection method which can be used to reduce the dimensionality of the image representation without significant losses in terms of the performance of the detection-search-system.
Abstract: The success of a multimedia information system depends heavily on the way the data is represented. Although there are "natural" ways to represent numerical data, it is not clear what is a good way to represent multimedia data, such as images, video, or sound. We investigate various image representations where the quality of the representation is judged based on how well a system for searching through an image database can perform-although the same techniques and representations can be used for other types of object detection tasks or multimedia data analysis problems. The system is based on a machine learning method used to develop object detection models from example images that can subsequently be used for examples to detect-search-images of a particular object in an image database. As a base classifier for the detection task, we use support vector machines (SVM), a kernel based learning method. Within the framework of kernel classifiers, we investigate new image representations/kernels derived from probabilistic models of the class of images considered and present a new feature selection method which can be used to reduce the dimensionality of the image representation without significant losses in terms of the performance of the detection-search-system.

52 citations


Patent
31 Mar 2003
TL;DR: In this paper, a technique for controlling one or more sensors used to capture data associated with an event comprises the following steps/operations: first, sensor data captured in accordance with the event is processed.
Abstract: Techniques for intelligent and automatic control of sensors for capturing data associated with real time events. Preferably, the sensors are associated with a multimedia database system. For example, a technique for controlling one or more sensors used to capture data associated with an event comprises the following steps/operations. First, sensor data captured in accordance with the event is processed. Then, the one or more sensors are automatically controlled based on information pertaining to the continual activity of at least one of one or more objects and one or more persons associated with the event in real time obtained using at least a portion of the processed data captured in accordance with the event.

50 citations


Journal ArticleDOI
TL;DR: The “adaptation-chain” of (MPEG-conformant metadata based adaptation is described: from the creation stage at the server side, through its usage in the network, up to the consumption at the client, and how the metadata are used to steer the adaptation processes.
Abstract: The ADMITS project (Adaptation in Distributed Multimedia IT Systems) is building an experimental distributed multimedia system for investigations into adaptation, which we consider an increasingly important tool for multimedia systems. A number of possible adaptation entities (server, proxy, clients, routers) are being explored, different algorithms for media, component and application-level adaptations are being implemented and evaluated, and experimental data are being derived to gain insight into when, where and how to adapt, and how individual, distributed adaptation steps interoperate and interact with each other. In this paper the “adaptation-chain” of (MPEG-conforming) metadata based adaptation is described: from the creation stage at the server side, through its usage in the network (actually in a proxy), up to the consumption at the client. The metadata are used to steer the adaptation processes. MPEG-conformant metadata, the so-called variation descriptions, are introduced; an example of a complete MPEG-7 document describing temporal scaling of an MPEG-4 video is given. The meta-database designed to store the metadata is briefly discussed. We describe how the metadata can be extracted from MPEG-4 visual elementary streams and initial results from a temporal video scaling experiment are given. We further present how the metadata can be utilized by enhanced cache replacement algorithms in a proxy server in order to realize quality-based caching; experimental results using these algorithms are also given. Finally, an adaptive query and presentation interface to the meta- and media database is outlined.

48 citations


Proceedings ArticleDOI
19 May 2003
TL;DR: The current work in content-based image retrieval (CBIR) to exploit the information content of these images directly by using image processing is presented.
Abstract: We summarize research and development for the extraction and distribution of biomedical information from a collection of 17,000 spine x-ray images collected by the second National Health and Nutrition Examination Survey (NHANES II). We present a history of the technical milestones of this work, including the data collection as film, digitization, quality control, archiving technology, database organization, medical expert content evaluation, and Web data distribution. We conclude by presenting our current work in content-based image retrieval (CBIR) to exploit the information content of these images directly by using image processing. We provide an overview and current research results from this CBIR work, which includes: extensive segmentation research, focusing on Active Shape Modeling and Active Contour methods; alternative techniques for shape representation, including invariant moments, simple polygon approximation, and Fourier descriptors; neural network classification of shapes into biomedical categories, such as “anterior osteophytes present/not present”; and the implementation of a prototype CBIR system for the vertebrae that supports hybrid text/image queries using MATLAB and the MySQL relational database system.

44 citations


BookDOI
01 Oct 2003
TL;DR: This book discusses Database Systems, Data Modeling, and Workflow Technology: the Support for Collaboration.
Abstract: 1. Management of Data: State-of-the-Art and Emerging Trends.- 1 Introduction.- 2 Survey of the Volume.- 2. Database Systems: from File Systems to Modern Database Systems.- 1 Introduction - Database Concepts.- 2 Database System Generations.- 3 Network Database Systems.- 4 Hierarchical Database Systems.- 5 Relational Database Systems.- 6 Object-Oriented Database Systems.- 7 Federated, Mediated Database Systems and Data Warehouses.- 8 Conclusions.- 3. Data Modeling.- 1 Introduction.- 2 Early Concerns in Data Management.- 3 Abstraction in Data Modeling.- 4 Semantic Data Models.- 5 Models of Reality and Perception.- 6 Toward Cognition-Based Data Management.- 7 A Cognitive Approach to Data Modeling.- 8 Research Directions.- 4. Object-Oriented Database Systems.- 1 Introduction and Motivation.- 2 Object-Oriented Data Modeling.- 3 The Query Language OQL.- 4 Physical Object Management.- 5 Architecture of Client-Server-Systems.- 6 Indexing.- 7 Dealing with Set-Valued Attributes.- 8 Query Optimization.- 9 Conclusion.- 5. High Performance Parallel Database Management Systems.- 1 Introduction.- 2 Partitioning Strategies.- 3 Join Using Inter-Operator Parallelism.- 4 ORE: a Framework for Data Migration.- 5 Conclusions and Future Research Directions.- 6. Advanced Database Systems.- 1 Introduction.- 2 Preliminaries.- 3 Data Models and Modeling for Complex Objects.- 4 Advanced Query Languages.- 5 Advanced Database Server Capabilities.- 6 Conclusions and Outlook.- 7. Parallel and Distributed Multimedia Database Systems.- 1 Introduction.- 2 Media Fundamentals.- 3 MPEG as an Example of Media Compression.- 4 Organisation and Retrieval of Multimedia Data.- 5 Data Models for Multimedia Data.- 6 Multimedia Retrieval Sequence Using Images as an Example.- 7 Requirements for Multimedia Applications.- 8 Parallel and Distributed Processing of Multimedia Data.- 9 Parallel and Distributed Techniques for Multimedia Databases.- 10 Case Study: Cairo - Cluster Architecture for Image Retrieval and Organisation.- 11 Conclusions.- 8. Workflow Technology: the Support for Collaboration.- 1 Introduction.- 2 Application Scenario and Collaboration Requirements.- 3 Commercial Technologies Addressing Collaboration Requirements.- 4 Evaluation of Current Workflow Management Technology.- 5 Research Problems, Related Work, and Directions.- 6 Summary.- 9. Data Warehouses.- 1 Introduction.- 2 Basics.- 3 The Database of a Data Warehouse.- 4 The Data Warehouse Concept.- 5 Data Analysis of a Data Warehouse.- 6 Building a Data Warehouse.- 7 Future Research Directions.- 8 Conclusions.- 10. Mobile Computing.- 1 Introduction.- 2 Mobile Computing Infrastructure.- 3 Mobile Computing Software Architectures and Models.- 4 Disconnected Operation.- 5 Weak Connectivity.- 6 Data Delivery by Broadcast.- 7 Mobile Computing Resources and Pointers.- 8 Conclusions.- 11. Data Mining.- 1 Introduction.- 2 Mining Associations.- 3 Classification and Prediction.- 4 Clustering.- 5 Conclusions.- List of Contributors.

38 citations


Book
01 Jan 2003
TL;DR: This book discusses the real-time nature of Multimedia data, and the Semantic Nature ofMultimedia Data, and some of the techniques used to design and implement these systems.
Abstract: Contents 1 Introduction to Multimedia Databases 1.1 Introduction 1.2 What is Essential about Database Systems? 1.3 What is Different about Multimedia Data? 1.4 Multimedia Applications 1.5 What is in the Rest of the Book? 2 Multimedia Data 2.1 Multimedia Data Size 2.2 Real-time Nature of Multimedia 2.3 Why is the Semantic Nature of Multimedia Data a Problem? 2.4 Summary of Chapter 3 The Human Sensory System and Multimedia 3.1 Introduction - Human Information Processing 3.2 Human Brain and Multimedia Information 3.3 The Senses 3.4 Converting Data into Sensory Perception 3.5 Summary of Chapter 4 An Introduction to SQL and Multimedia 4.1 Introduction to SQL 4.2 Methods Using PL/SQL Stored Procedures 4.3 Manipulating Large Objects 4.4 Summary of Chapter 5 Querying Multimedia Data 5.1 Introduction 5.2 Manipulating Multimedia Data 5.3 What is the Classification Problem? 5.4 Summary of Chapter 6 Modeling Multimedia Databases 6.1 Issues of Designing Multimedia Database Management Systems 6.2 Semantic Data Modeling 6.3 Object-oriented Design 6.4 Object Methods 6.5 Object-relational Approach 6.6 Notes 7 Using Multimedia Metadata 7.1 Introduction 7.2 Classifying Metadata 7.3 Generating and Extracting Metadata 7.4 The Role of Metadata Standards 7.5 Digital Rights Management 7.6 Domain-dependent Metadata 7.7 Developing Ontologies 7.8 Summary of Chapter 8 Multimedia Database Architecture and Performance 8.1 Introduction to Multimedia Architecture Requirements 8.2 Performance Issues in Specific Implementations 8.3 Content Management 8.4 Summary of Chapter 9 Multimedia and the Internet 9.1 Introduction 9.2 Delivery of Multimedia Data 9.3 Media Streams 9.4 Network Protocols 9.5 User Datagram Protocol 9.6 Quality-of-service Issues - Internet Service Models 9.7 Packets and Datagrams - Sequence and Loss 9.8 Network Architecture 9.9 Requirements of Applications Involving Multicasting and Interactivity 9.10 Sumamry of Chapter 10 Dealing with Text Databases 10.1 Introduction 10.2 Querying Character Data Using SQL 10.3 Statistical Methods for Text Analysis 10.4 Querying Multimedia Text 10.5 Content-dependent Metadata 10.6 Indexing Technologies for Text 10.7 Summary of Chapter 11 Dealing with Image Databases 11.1 Introduction 11.2 Technologies for Image Processing 11.3 The Role of Feature Extraction 11.4 Retrieval Methods 11.5 Image Analysis and Object Recognition 11.6 Image Classification 11.7 Image Database Software 11.8 Developing Image Media Databases 11.9 Summary of Chapter 12 Dealing with Video Databases 12.1 Introduction 12.2 Video Analysis and Segmentation 12.3 Storage of Video Objects 12.4 Disk Scheduling 12.5 Dealing with Moving Images 12.6 Metadata for Speech 12.7 Metadata for Video 12.8 Manipulating Video Data 12.9 Video Query Process 12.10 Video Applications 12.11 Summary of Chapter Appendices Appendix A Normalization and Relational Databases Appendix B Metadata Standards Appendix C SQL Notes Appendix D Acronyms Appendix E Glossary Appendix F References Index Dunckley: Multimedia Databases Running heads Introduction to Multimedia Databases Introduction What is Essential about Database Systems? What is Different about Multimedia Data? Multimedia Applications What is in the Rest of the Book? Multimedia Data Multimedia Data Size Real-time Nature of Multimedia The Semantic Nature of Multimedia Data Summary of Chapter The Human Sensory System and Multimedia Introduction - Human Information Processing Human Brain and Multimedia Information The Senses Converting Data into Sensory Perception Summary of Chapter An Introduction to SQL and Multimedia Introduction to SQL Methods Using PL/SQL Stored Procedures Manipulating Large Objects Summary of Chapter Querying Multimedia Data Introduction Manipulating Multimedia Data What is the Classification Problem? Summary of Chapter Modeling Multimedia Databases Designing Multimedia Database Management Systems Semantic Data Modeling Object-oriented Design Object Methods Object-relational Approach Notes Using Multimedia Metadata Introduction Classifying Metadata Generating and Extracting Metadata The Role of Metadata Standards Digital Rights Management Domain-dependent Metadata Developing Ontologies Summary of Chapter Multimedia Database Architecture and Performance Introduction to Multimedia Architecture Requirements Performance Issues in Specific Implementations Content Management Summary of Chapter Multimedia and the Internet Introduction Delivery of Multimedia Data Media Streams Network Protocols User Datagram Protocol Quality-of-service Issues - Internet Service Models Packets and Datagrams - Sequence and Loss Network Architecture Applications Involving Multicasting and Interactivity Sumamry of Chapter Dealing with Text Databases Introduction Querying Character Data Using SQL Statistical Methods for Text Analysis Querying Multimedia Text Content-dependent Metadata Indexing Technologies for Text Summary of Chapter Dealing with Image Databases Introduction Technologies for Image Processing The Role of Feature Extraction Retrieval Methods Image Analysis and Object Recognition Image Classification Image Database Software Developing Image Media Databases Summary of Chapter Dealing with Video Databases Introduction Video Analysis and Segmentation Storage of Video Objects Disk Scheduling Dealing with Moving Images Metadata for Speech Metadata for Video Manipulating Video Data Video Query Process Video Applications Summary of Chapter Appendices Normalization and Relational Databases Metadata Standards SQL Notes Acronyms Glossary References Index Index

32 citations


Journal ArticleDOI
01 Mar 2003
TL;DR: A multi-paradigm querying approach of MediaLand is presented, in which multimedia queries are processed based on a seamless integration of various existing search approaches, and MediaLand offers the feature of "media independence" which is analogous to the notion of "data independence" from the classic ANSI SPARC standard.
Abstract: To truly meet the requirements of multimedia database (MMDB) management, an integrated framework for modeling, managing and retrieving various kinds of media data in a uniform way is necessary. MediaLand is an experimental MMDB platform being developed at Microsoft Research Asia for users with different levels of experiences and expertise to manage and search multimedia repositories easily, efficiently, and cooperatively. Key features of MediaLand include a uniform data model for describing all kinds of media objects and their relationships, and a 4-tier architecture based on this data model. In this paper, a multi-paradigm querying approach of MediaLand is presented, in which multimedia queries are processed based on a seamless integration of various existing search approaches. In doing so, MediaLand also offers the feature of "media independence" which is analogous to the notion of "data independence" from the classic ANSI SPARC standard. By incorporating a rich set of facilities and techniques, MediaLand lays down a good foundation for addressing further research issues, such as multimedia query rewriting, optimization, and presentation.

21 citations


Proceedings ArticleDOI
02 Nov 2003
TL;DR: This paper focuses on the low-level scheduling in APEX, a disk scheduling framework with QoS support, designed for environments with highly varying disk bandwidth usage, and demonstrates its robustness with respect to misaligned bandwidth allocation.
Abstract: In this paper, we present APEX, a disk scheduling framework with QoS support, designed for environments with highly varying disk bandwidth usage. In particular, we focus on a Learning-on-Demand scenario supported by a multimedia database management system, where students can search for, and play back multimedia-based learning material. APEX is based on a two-level scheduling architecture, where the upper level realizes different service classes using a set of queues, while the lower level distributes available disk bandwidth among these queues.In this paper, we focus on the low-level scheduling in APEX, which is based on an extended token bucket algorithm. The disk requests scheduled for service are assembled into batches, which render possible good efficiency for the disk. Combined with a very efficient work-conservation scheme, this enables APEX to apply bandwidth where it is needed, without efficiency loss. We demonstrate, through simulations, that APEX provides both higher throughput and lower response times than other mixed-media disk schedulers, while still avoiding deadline violations for real-time requests. We also show its robustness with respect to misaligned bandwidth allocation.

21 citations


Journal ArticleDOI
TL;DR: An efficient k-distance join algorithm that uses spatial indexes such as R-trees and is further optimized by novel strategies for selecting a sweeping axis and direction and the proposed adaptive multistage algorithms outperform previous work by up to an order of magnitude.
Abstract: A spatial distance join is a relatively new type of operation introduced for spatial and multimedia database applications. Additional requirements for ranking and stopping cardinality are often combined with the spatial distance join in online query processing or Internet search environments. These requirements pose new challenges as well as opportunities for more efficient processing of spatial distance join queries. In this paper, we first present an efficient k-distance join algorithm that uses spatial indexes such as R-trees. Bidirectional node expansion and plane-sweeping techniques are used for fast pruning of distant pairs, and the plane-sweeping is further optimized by novel strategies for selecting a sweeping axis and direction. Furthermore, we propose adaptive multistage algorithms for k-distance join and incremental distance join operations. Our performance study shows that the proposed adaptive multistage algorithms outperform previous work by up to an order of magnitude for both k-distance, join and incremental distance join queries, under various operational conditions.

Patent
06 Aug 2003
TL;DR: In this article, a multimedia correspondence memory correspondingly stores multimedia information collected by the multimedia information collection unit, and an information recognition unit recognizes the metadata stored in the correspondence memory and analyzes the metadata as personal data.
Abstract: In a multimedia information collection control apparatus, a multimedia information collection unit collects information from more than one kind of medium (multimedia). A multimedia correspondence memory correspondingly stores multimedia information collected by the multimedia information collection unit. An information recognition unit recognizes the multimedia information stored in the multimedia correspondence memory and analyzes the multimedia information as personal data according to the recognition result. A multimedia database relationally stores the multimedia information as the personal data analyzed by the information recognition unit.

Patent
06 Jun 2003
TL;DR: In this article, a technique for generating at least one alert message may include the following steps/operations: sensor data captured by one or more sensors is processed, then, at least 1 alert message is automatically generated based on information obtained using at least a portion of the processed data and pertaining to a continual activity of one or multiple objects and/or one or different persons and to an associated previous activity of the same or different objects and or persons.
Abstract: Techniques for intelligent and automatic alert management. Such techniques may be realized in conjunction with a multimedia database system, such that interesting and important multimedia data associated with real time events may be captured, and alerts generated based on the captured data. In one aspect of the invention, a technique for generating at least one alert message may include the following steps/operations. Sensor data captured by one or more sensors is processed. Then, at least one alert message is automatically generated based on information obtained using at least a portion of the processed data and pertaining to a continual activity of one or more objects and/or one or more persons and to an associated previous activity of the same or different objects and/or persons. The at least one alert message may also be generated based on a varying degree of complexity of activity. The one or more sensors may be associated with a multimedia database system.

Journal ArticleDOI
TL;DR: In this paper, it is shown how extended possibilistic truth values can be employed to cope with the handling of missing information in database systems.
Abstract: In database systems one often has to deal with constraints in order to compel the semantics of the stored data or to express some querying criteria. This is especially the case for multimedia database systems where information of different media types, as e.g. sound, video, images and texts, needs to be managed. A problem with the modelling of (the uncertainty about) the degree of satisfaction of a constraint, is the handling of missing information. In this paper, it is shown how extended possibilistic truth values can be employed to cope with this problem. The notion of an extended possibilistic truth value has been obtained from the assumption that the truth value, which expresses the degree of satisfaction of a constraint, can be undefined. This is for example the case if the constraint cannot be evaluated due to the non-applicability of (some of) its elements. An illustrative database definition and database querying example is presented and discussed.

Patent
06 Mar 2003
TL;DR: In this paper, the authors present a method and a system for constructing a multimedia database by indexing multimedia data, and a method for providing a multimedia data search service using the database.
Abstract: Provided are a method and a system for constructing a multimedia database by indexing multimedia data, and a method and a system for providing a multimedia data search service using a multimedia database. (Claim) Accordingly, it is possible to quickly and precisely search for desired data.

01 Jan 2003
TL;DR: ChildPhon is a multimedia database for child phonological data storage and analysis, which automatically performs parses and calculations, based on the phonetic transcriptions of the target words and the child’s renditions of these words.
Abstract: Over the past decade, researchers have enjoyed greater access to computerized tools for conducting research on linguistic production data. In the field of language acquisition, one of the greatest contributions in this regard comes from the CHILDES project (e.g. MacWhinney 1996), which provides the research community with a number of freeware tools for multimedia data storage and analysis. However, these tools focus mainly on morphological and syntactic aspects of acquisition. As regards phonological acquisition, very few tools are currently available, a problem which is due, at least in part, to the IPA symbols used in phonetic transcriptions. As a result, phonological investigations of child language based on large corpora of data remain a challenge. In this paper, I introduce ChildPhon, a new software aimed at filling part of this gap. ChildPhon is a multimedia database for child phonological data storage and analysis, which automatically performs parses and calculations, based on the phonetic transcriptions of the target words and the child’s renditions of these words. Although the features of the database introduced in this paper are fully functional, the reader is invited to approach the current version of ChildPhon as an ongoing, and still expanding, software development project. The paper is organized as follows. In §2, I provide background information concerning the precursor and early steps of this software development project. In §3, I discuss in some detail the database’s general information and transcription fields, and provide indications concerning the conventions required for data entry. In §4, I focus on the automatic calculations performed by the tool. In §5, I briefly discuss some of the database’s additional advantages. In §6, I offer a glance at future development plans. A brief conclusion follows in §7.

Journal ArticleDOI
01 Mar 2003
TL;DR: This paper proposes a language, in the style of constraint database languages, for formal specification of QoS constraints, and believes this paper represents a first step towards the development of a database framework for quality of service management in video databases.
Abstract: Quality of Service (QoS) is defined as a set of perceivable attributes expressed in a user-friendly language with parameters that may be subjective or objective. Objective parameters are those related to a particular service and are measurable and verifiable. Subjective parameters are those based on the opinions of the end-users. We believe that quality of service should become an integral part of multimedia database systems and users should be able to query by requiring a quality of service from the system. The specification and enforcement of QoS presents an interesting challenge in multimedia systems development. A deal of effort has been done on QoS specification and control at the system and the network levels, but less work has been done at the application/user level. In this paper, we propose a language, in the style of constraint database languages, for formal specification of QoS constraints. The satisfaction by the system of the user quality requirements can be viewed as a constraint satisfaction problem. We believe this paper represents a first step towards the development of a database framework for quality of service management in video databases. The contribution of this paper lies in providing a logical framework for specifying and enforcing quality of service in video databases. To our knowledge, this work is the first from a database perspective on quality of service management.

Journal ArticleDOI
TL;DR: A constraint-based rule language can serve as the foundation for providing this level of quality of service in video databases.
Abstract: Multimedia database systems require not only that correct results are delivered with acceptable delay, but also that they're delivered in real time with acceptable quality. A constraint-based rule language can serve as the foundation for providing this level of quality of service in video databases.

Proceedings ArticleDOI
19 May 2003
TL;DR: The experimental results show that the new proposed structure outperforms existing multi-feature string index approaches for music database retrieval and also improves the query accuracy.
Abstract: The management of large collections of music data in a multimedia database has received much attention in the past few years. In the most of current works, the researchers extract the features, such as melodies, rhythms and chords, from the music data and develop indices that will help to retrieve the relevant music quickly and improve the query accuracy. Several reports have pointed out that these features of music can be transformed and represented in the forms of music feature strings. However, these approaches lack of scalability while increasing the music data. Recently, we proposed an approach to transform the music data into a numeric forms and developed an index structure base on R-tree for effective retrieval. This numeric index approach performed more efficiently than existing string index approaches for music database retrieval. In this paper, we will extend our study to develop a multi-feature numeric index structure for music data. Our experimental results show that the new proposed structure outperforms existing multi-feature string index approaches.

Proceedings ArticleDOI
01 Mar 2003
TL;DR: CIMWOS is a multimedia, multimodal and multilingual system supporting content-based indexing, archiving, retrieval, and on-demand delivery of audiovisual content employing state-of-the-art algorithms for text, speech and image processing.
Abstract: CIMWOS is a multimedia, multimodal and multilingual system supporting content-based indexing, archiving, retrieval, and on-demand delivery of audiovisual content. The system uses a multifaceted approach to locate important segments within multimedia material employing state-of-the-art algorithms for text, speech and image processing. The audio processing operations employ robust continuous speech recognition, speech/non-speech classification, speaker clustering and speaker identification. Text processing tools operate on the text stream produced by the speech recogniser and perform named entity detection, term recognition, topic detection, and story segmentation. Image processing includes video segmentation and key frame extraction, face detection and face identification, object and scene recognition, video text detection and character recognition. All outputs converge to a textual XML metadata annotation scheme following the MPEG-7 standard. These XML annotations are further merged and loaded into the CIMWOS multimedia database. Additionally, they can be dynamically transformed for interchanging semantic-based information. The retrieval engine is based on a weighted boolean model with intelligent indexing components. An ergonomic and user-friendly web-based interface allows the user to efficiently retrieve video segments by a combination of media description, content metadata and natural language text. The system is currently under evaluation.

Journal Article
TL;DR: A general architecture of the multimedia data mining, including multimedia database, multimedia mining engine, multimediamining engine and multimedia mining interface is presented, which emphasizes on several mining techniques, such as classification, association and clustering.
Abstract: This paper presents a general architecture of the multimedia data mining(M3), including multimedia database(MD), multimedia mining engine(MME) and multimedia mining interface(MMI). It emphasizes on several mining techniques, such as classification, association and clustering. Special mining methods and content of image, video and audio are discussed too.

Book ChapterDOI
01 Jan 2003
TL;DR: The main aim is to explain the process of multimedia retrieval by using images as an example, and gives an overview over an existing cluster-based prototype for image retrieval named Cairo.
Abstract: This chapter presents an introduction to the area of parallel and distributed multimedia database systems. The first part describes the characteristics of multimedia data and depicts the storage and annotation of such data in conventional and in multimedia databases. The main aim is to explain the process of multimedia retrieval by using images as an example. The related computational, storage, and network requirements create an urgent need for the integration of parallel and distributed computer architectures in modern multimedia information systems. Different hardware and software aspects have to be examined, for example the partitioning of multimedia data and the distribution over multiple nodes have a decisive impact on the performance, efficiency, and the usability of such multimedia databases. Other distributed aspects such as streaming techniques, proxy and client issues, security, etc. are only briefly mentioned and are not in the focus of this chapter. The last section gives an overview over an existing cluster-based prototype for image retrieval named Cairo.

Proceedings ArticleDOI
19 May 2003
TL;DR: The main function of this system is to query the English sentence pattern through keywords from the English multimedia corpus, which not only leaches English grammar but also allows teachers to understand the most frequent mistakes.
Abstract: This work presents a novel English distance learning system that was developed through multimedia database and Internet technologies called English multimedia corpus. The system includes English articles, dialogs, and videos. A student can study English writing and reading as well as view Web browser listings to connect the Corpus server In the system, semantic query and "Link grammar" are applied to construct the English multimedia corpus system. Furthermore, it promotes the query level from keyword-base and content-based query to a semantic level. The main function of this system is to query the English sentence pattern through keywords from the English multimedia corpus. The other function is to detect grammatical errors in written English. Thus, the system not only leaches English grammar but also, due to its database, allows teachers to understand the most frequent mistakes.

Proceedings ArticleDOI
14 Oct 2003
TL;DR: A direct interface of retrieval space for a multimedia database retrieved by the combination of value representing a level of impression is proposed and its effectiveness is clarified by a pilot system and its evaluation.
Abstract: This paper proposes a direct interface of retrieval space for a multimedia database retrieved by the combination of value representing a level of impression. Conventional interfaces are insufficient for a user to obtain a multimedia data which fits his/her intention, efficiently. An approach in this paper adopts a multimedia data which he/she knows well as a basic point. He/she can select one of the data around the basic point with watching some relative relations to the basic point by his/her eye. Its effectiveness is clarified by a pilot system and its evaluation.

Patent
12 Feb 2003
TL;DR: In this article, a distributed server system for transmitting multimedia data and a method thereof are provided to distribute DB servers for providing multimedia data, switching servers and feeding servers for streaming the multimedia data using a memory pool and a pool buffer.
Abstract: PURPOSE: A distributed server system for transmitting multimedia data and a method thereof are provided to distribute DB servers for providing multimedia data and switching servers and feeding servers for streaming the multimedia data, so as to correct a transfer rate based on the distributed servers and manage stream multimedia data using a memory pool and a pool buffer. CONSTITUTION: A mobile communication client(500) delivers client packets including distributed server identification information, multimedia data streaming request information and mobile communication terminal location information, and receives multimedia data in real time to provide the mobile communication terminal(600) with the received multimedia data. At least one distributed parallel switching server(300) connected to the mobile communication client(500) through a wired network(400) receives the client packets and delivers the client packets to other distributed servers according to the distributed server identification information, receives multimedia data and manage the multimedia data through a pool buffer. Also, the switching server(300) allocates lines and controls connection to transmit the multimedia data to the mobile communication client(500). At least one distributed parallel feeding server(200) receives and processes the client packets from the switching server(300), and requests multimedia data search to receive and store the searched multimedia data. Also, the feeding server(200) manages the stored multimedia data through the pool buffer, and generates control packets including control codes and server packets including multimedia data to transmit the generated packets to the switching server(300) while maintaining a regular rate. And at least one distributed parallel DB server(100) including a multimedia database searches and provides the multimedia data upon the search request of the feeding server(200), and has a management panel for storing, deleting and modifying the multimedia data.

Book ChapterDOI
01 Jan 2003
TL;DR: This chapter introduces the reader to many critical issues involved in multimedia retrieval over the Internet and proposes a general architectural framework for a broad array of retrievals of multimedia data required by various applications.
Abstract: The objective of this chapter is to introduce the reader to a general architectural framework for a broad array of retrievals of multimedia data required by various applications. This framework contains more than the traditional client/server architecture and even more than the existing three-tier architectures. This chapter introduces the reader to many critical issues involved in multimedia retrieval over the Internet. A new architectural framework is proposed to cover a variety of multimedia applications over the Internet and the World Wide Web. This framework has the three main objectives of (1) proposing a layered architecture to facilitated design and separate different issues, (2) covering a large number of multimedia applications, and finally, (3) making use of existing and well-established technology, such as Mobile Agents, SQL databases, and cache managements schemes. The proposed architectural framework separates issues involved in multimedia retrieval into five layers, namely: keyword searching and data servers, proxy servers, domain and department archives, mobile user agents, and the users. Through these five layers, various customized solutions to a large array of problems will be proposed and applied. The chapter offers, but is not limited to, solutions for different problems that arise in retrieval of multimedia data. A list of important open problems is identified at the end of the chapter. INTRODUCTION Multimedia data plays an essential role in today’s e-business applications. Indeed, a large array of e-business applications relies directly or indirectly on multimedia data, ranging from simple news lines to sophisticated multimedia libraries. We can say without hesitation that most of data being transferre nowadays through the Internet involves some multimedia component. This chapter appears in the book, Architectural Issues of Web-Enabled Electronic Business, edited by Shi Nan Si and V.K Murthy. Copyright © 2003, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. 701 E. Chocolate Avenue, Hershey PA 17033-1240, USA Tel: 717/533-8845; Fax 717/533-8661; URL-http://www.idea-group.com ITB8700 IDEA GROUP PUBLISHING Retrieval of Multimedia Data 171 Copyright Idea Grou p Inc. Copy right Idea Grou p Inc. Copy right Idea Grou p Inc. Copy right Idea Grou p Inc. Overview of the Architecture The objective of this chapter is to propose a general architectural framework for a broad array of retrievals of multimedia data required by various applications. This framework contains more than the traditional client/server architecture and even more than the existing three-tier architectures. To the best of our knowledge, our model subsumes all the currently existing models. The most severe problem with multimedia applications is performance. This problem may incorporate many issues that are beyond the control of the application vendor, or even the network administrator. In our proposal, we suggest different architectural levels to act as performance degradation protectors. The main goal of the architectural level hierarchy is to hide the technical details of the search process from the user. The second major goal of the hierarchy is the intelligent filtering of data in order to avoid undue overload on the system. The proposed framework supports different types of multimedia data retrieval that will serve different application areas. These types include meta-data retrieval and both feature and semantic Content-Based Retrieval (CBR) that will support Content-Based Video on Demand Applications. As one of the goals of the e-business system is to hide all complicated details from the user, making the most complicated transactions look straightforward, we employ mobile agents to represent the user at different levels and to perform all the necessary work for him. Our proposal for employing mobile agents includes two entities for each user, namely, a Home base and Mobile Agent Instances. The Home base resides at the server closest to the user, represents the user, and acts on his behalf. In addition, the Home base applies suitable data reduction techniques to data before sending it back to the user. Moreover, the Home base is responsible for collecting data returned by mobile agent instances and for organizing it into media presentations. Every mobile agent instance propagates and replicates itself in the system, moving as close to the data servers as possible, in order to extract the data requested by the user. In order to reduce network traffic, different mobile units may also apply some data filtering techniques before sending data back. The proposed architecture contains the following five layers: 1. keyword searching and data servers; 2. proxy servers; 3. domain and department archives; 4. mobile agents for the users; and 5. the users. Review of Multimedia Content-Based Retrieval Models In the early days of multimedia databases, multimedia data was being handled externally. As multimedia data is inserted into the database, less effort is devoted to identify, analyze and determine its contents, considering it as black boxes knowing everything about their locations, labels, creators, etc., but knowing nothing about their contents. However, most of the listed applications incorporate queries not only about the external data, but also about the contents of the black boxes. Content-based modeling, indexing, and retrieval makeup the research area that deals with such contents of multimedia data. There are two types of multimedia CBR described below. Feature content-based querying, indexing, and retrieval (Moharrum, 2000) of multimedia data incorporate retrieving multimedia data based on visual or audible features extracted 13 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the publisher's webpage: www.igi-global.com/chapter/retrieval-multimedia-dataweb/5198

Proceedings ArticleDOI
02 Nov 2003
TL;DR: A novel interactive video information system called the Drama Characters' Popularity Voting System (DCPVS) is constructed by applying the results of off-line object recognition to provide description annotation, retrieval, and statistics in the video associated with an object over the Internet.
Abstract: Although numerous attempts have been made to determine algorithms and approaches for building up a video information system, not many practical applications have been proposed. In this paper, a novel interactive video information system called the Drama Characters' Popularity Voting System (DCPVS) is constructed by applying the results of off-line object recognition. The system's purpose is to provide description annotation, retrieval, and statistics in the video associated with an object, such as a character as a basic unit, over the Internet. By using the proposed system, multiple users in a network can enjoy the same video and can vote for the characters they like in it. The voting information is collected and stored in the server, which then provides the statistics regarding the popularity of different characters or the voting rates within different periods of the video.

Patent
15 Aug 2003
TL;DR: In this paper, the distance from the center of gravity of a cluster of database data to the closest neighboring vector is calculated, and the sum of the preceding two distances is calculated.
Abstract: Method has the following steps: calculation of the distance from the center of gravity of a cluster of database data to the closest neighboring vector; calculation of the distance of the request vector to the center of gravity; calculation of the sum of the preceding two distances; calculation of the smallest sum value for the collection of database clusters; and elimination of the clusters whose smallest distance is greater that the smallest sum value.

Patent
28 Nov 2003
TL;DR: In this article, a server for supplying a multimedia streaming and a method for interlocking a streaming system and a multimedia database system in the device are provided to improve a reliability of a system by managing information of multimedia data and multimedia reproduction data in a database.
Abstract: PURPOSE: A server for supplying a multimedia streaming and a method for interlocking a streaming system and a multimedia database system in the device are provided to improve a reliability of a system by managing information of multimedia data and multimedia reproduction data in a database, thereby overcoming inconsistency between information data and multimedia reproduction data. CONSTITUTION: A streaming system(100) transmits multimedia data to a streaming client. A multimedia database system(200) transmits stored multimedia data to the streaming system(100). A shared memory(350) temporarily stores data for transmitting multimedia data in the multimedia database system(200) to the streaming system(100). A message queue(300) transmits and receives a message between the streaming system(100) and the multimedia database system(200). The message queue(300) exists in a stack which is a temporary storing memory in a streaming server. A transmission managing unit(130) packetizes media data in accordance with a protocol for transmitting multimedia data to the streaming client. A media managing unit(120) reads multimedia data from the shared memory(350), decodes the read multimedia data, and transmits the multimedia data to the transmission managing unit(130). If a streaming is requested from the streaming client, a database connector(110) creates a corresponding multimedia streaming session and transmits a task message to the multimedia database system(200). After terminating a streaming task, the database connector(110) deletes a streaming session. A database server(200) includes multimedia data and stores multimedia data requested from the streaming system(100) in the shared memory(350). A media client(210) interprets a message transmitted from the database connector(110) and requests that the database server(200) executes a corresponding task.

Journal ArticleDOI
TL;DR: In database systems one often has to deal with constraints in order to compel the semantics of the stored data or to express some querying criteria, especially for multimedia databanks.
Abstract: In database systems one often has to deal with constraints in order to compel the semantics of the stored data or to express some querying criteria. This is especially the case for multimedia datab...